Researcher profile

Jing Hu

Jing Hu contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
10 works
0 followers
13 topics
4 close collaborators

Published work

10 published items

preprint | 2026 | arXiv

MoE Adapter for Large Audio Language Models: Sparsity, Disentanglement, and Gradient-Conflict-Free

Extending the input modality of Large Language Models (LLMs) to the audio domain is essential for achieving comprehensive multimodal perception. However, it is well-known that acoustic information is intrinsically heterogeneous, entangling attributes such as speech, music, and environmental context. Existing research is limited to a dense, parameter-shared adapter to model these diverse patterns, which induces gradient conflict during optimization, as parameter updates required for distinct attributes contradict each other. To address this limitation, we introduce the MoE-Adapter, a sparse Mixture-of-Experts (MoE) architecture designed to decouple acoustic information. Specifically, it employs a dynamic gating mechanism that routes audio tokens to specialized experts capturing complementary feature subspaces while retaining shared experts for global context, thereby mitigating gradient conflicts and enabling fine-grained feature learning. Comprehensive experiments show that the MoE-Adapter achieves superior performance on both audio semantic and paralinguistic tasks, consistently outperforming dense linear baselines with comparable computational costs. Furthermore, we will release the related code and models to facilitate future research.
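
The routing idea in this abstract can be illustrated with a minimal sketch. It assumes a softmax gate with top-k selection and a single always-active shared expert; the function names, the toy weights, and the top-k rule are hypothetical choices for illustration and are not taken from the paper.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of gate logits."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def linear(weights, x):
    """Apply a dense layer given as a list of weight rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]

def moe_adapter(token, gate_w, routed_experts, shared_expert, top_k=1):
    """Route one token to its top_k experts and add a shared expert (assumed design)."""
    probs = softmax(linear(gate_w, token))
    # Sparse activation: only the top_k highest-scoring experts run for this token.
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    out = linear(shared_expert, token)  # the shared expert sees every token
    for i in chosen:
        expert_out = linear(routed_experts[i], token)
        out = [o + (probs[i] / norm) * e for o, e in zip(out, expert_out)]
    return out, chosen

# Toy weights (hypothetical): two routed experts and one identity shared expert.
gate_w = [[1.0, 0.0], [0.0, 1.0]]
routed_experts = [
    [[2.0, 0.0], [0.0, 2.0]],    # expert 0 doubles its input
    [[-1.0, 0.0], [0.0, -1.0]],  # expert 1 negates its input
]
shared_expert = [[1.0, 0.0], [0.0, 1.0]]

out_a, chosen_a = moe_adapter([3.0, 0.0], gate_w, routed_experts, shared_expert)
out_b, chosen_b = moe_adapter([0.0, 3.0], gate_w, routed_experts, shared_expert)
```

Because each token only updates the experts it was routed to, tokens with different attributes touch different parameters, which is the intuition behind reducing gradient conflict.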

preprint | 2023 | arXiv

Dynamic Combination of Heterogeneous Models for Hierarchical Time Series

We introduce a framework to dynamically combine heterogeneous models called DYCHEM, which forecasts a set of time series that are related through an aggregation hierarchy. Different types of forecasting models can be employed as individual "experts" so that each model is tailored to the nature of the corresponding time series. DYCHEM learns hierarchical structures during the training stage to help generalize better across all the time series being modeled and also mitigates coherency issues that arise due to constraints imposed by the hierarchy. To improve the reliability of forecasts, we construct quantile estimations based on the point forecasts obtained from combined heterogeneous models. The resulting quantile forecasts are coherent and independent of the choice of forecasting models. We conduct a comprehensive evaluation of both point and quantile forecasts for hierarchical time series (HTS), including public data and user records from a large financial software company. In general, our method is robust, adaptive to datasets with different properties, and highly configurable and efficient for large-scale forecasting pipelines.
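
To make the coherency constraint concrete, the sketch below combines point forecasts from two hypothetical experts per series and measures how far the parent forecast is from the sum of its children. The numbers, the fixed weights, and the bottom-up repair are illustrative assumptions only; the abstract describes learning the hierarchy during training, which this sketch does not attempt.

```python
def combine(expert_forecasts, weights):
    """Weighted combination of point forecasts from several experts."""
    return sum(w * f for w, f in zip(weights, expert_forecasts))

def coherence_gap(parent, children):
    """Zero when the parent forecast equals the sum of its children."""
    return parent - sum(children)

# Hypothetical hierarchy: one parent series with two children, two experts each.
child_forecasts = [
    combine([10.0, 14.0], [0.5, 0.5]),
    combine([20.0, 22.0], [0.25, 0.75]),
]
parent_forecast = combine([30.0, 36.0], [0.5, 0.5])

gap = coherence_gap(parent_forecast, child_forecasts)

# One simple repair (bottom-up aggregation): define the parent as the sum of children.
coherent_parent = sum(child_forecasts)
```

A non-zero `gap` is the kind of incoherence that independently fitted models produce; bottom-up aggregation is only one of several ways to remove it.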

preprint | 2022 | arXiv

Efficient Forecasting of Large Scale Hierarchical Time Series via Multilevel Clustering

We propose a novel approach to the problem of clustering hierarchically aggregated time-series data, which has remained an understudied problem though it has several commercial applications. We first group time series at each aggregated level, while simultaneously leveraging local and global information. The proposed method can cluster hierarchical time series (HTS) with different lengths and structures. For common two-level hierarchies, we employ a combined objective for local and global clustering over spaces of discrete probability measures, using Wasserstein distance coupled with Soft-DTW divergence. For multi-level hierarchies, we present a bottom-up procedure that progressively leverages lower-level information for higher-level clustering. Our final goal is to improve both the accuracy and speed of forecasts for a larger number of HTS needed for a real-world application. To attain this goal, each time series is first assigned the forecast for its cluster representative, which can be considered as a "shrinkage prior" for the set of time series it represents. Then this base forecast can be quickly fine-tuned to adjust to the specifics of that time series. We empirically show that our method substantially improves performance in terms of both speed and accuracy for large-scale forecasting tasks involving many HTS.
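
For readers unfamiliar with the Soft-DTW divergence mentioned above, here is a minimal sketch for scalar series. It assumes a squared-difference cost and the usual debiased form, soft-DTW(x, y) minus the average of soft-DTW(x, x) and soft-DTW(y, y); the smoothing value `gamma` and the example series are arbitrary, and the Wasserstein part of the clustering objective is not shown.

```python
import math

def softmin(values, gamma):
    """Smoothed minimum: -gamma * log(sum(exp(-v / gamma))), computed stably."""
    low = min(values)
    return low - gamma * math.log(sum(math.exp(-(v - low) / gamma) for v in values))

def soft_dtw(x, y, gamma=1.0):
    """Soft-DTW value between two scalar series of possibly different lengths."""
    n, m = len(x), len(y)
    inf = float("inf")
    r = [[inf] * (m + 1) for _ in range(n + 1)]
    r[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = (x[i - 1] - y[j - 1]) ** 2
            # Replace the hard min of classic DTW with a differentiable soft min.
            r[i][j] = cost + softmin([r[i - 1][j], r[i][j - 1], r[i - 1][j - 1]], gamma)
    return r[n][m]

def soft_dtw_divergence(x, y, gamma=1.0):
    """Debiased soft-DTW, which is zero when both series are identical."""
    return soft_dtw(x, y, gamma) - 0.5 * (soft_dtw(x, x, gamma) + soft_dtw(y, y, gamma))

series_a = [0.0, 1.0, 2.0]
series_b = [0.0, 0.5, 1.5, 2.0]  # different length on purpose
d_ab = soft_dtw_divergence(series_a, series_b)
d_ba = soft_dtw_divergence(series_b, series_a)
d_aa = soft_dtw_divergence(series_a, series_a)
```

The dynamic program handles series of different lengths directly, which matches the abstract's claim that HTS with different lengths can be clustered.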

preprint | 2022 | arXiv

Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

Large deformations of organs, caused by diverse shapes and nonlinear shape changes, pose a significant challenge for medical image registration. Traditional registration methods need to iteratively optimize an objective function via a specific deformation model along with meticulous parameter tuning, but they have limited capabilities in registering images with large deformations. While deep learning-based methods can learn the complex mapping from input images to their respective deformation field, they are regression-based and prone to getting stuck at local minima, particularly when large deformations are involved. To this end, we present Stochastic Planner-Actor-Critic (SPAC), a novel reinforcement learning-based framework that performs step-wise registration. The key notion is warping a moving image successively at each time step to finally align it to a fixed image. Considering that it is challenging to handle high dimensional continuous action and state spaces in the conventional reinforcement learning (RL) framework, we introduce a new concept, 'Plan', to the standard Actor-Critic model, which is of low dimension and can help the actor generate a tractable high dimensional action. The entire framework is based on unsupervised training and operates in an end-to-end manner. We evaluate our method on several 2D and 3D medical image datasets, some of which contain large deformations. Our empirical results highlight that our work achieves consistent, significant gains and outperforms state-of-the-art methods.

preprint | 2022 | arXiv

TCR: A Transformer Based Deep Network for Predicting Cancer Drugs Response

Predicting clinical outcomes to anti-cancer drugs on a personalized basis is challenging in cancer treatment due to the heterogeneity of tumors. Traditional computational efforts have been made to model the effect of drug response on individual samples depicted by their molecular profile, yet overfitting occurs because of the high dimension of omics data, hindering models from clinical application. Recent research shows that deep learning is a promising approach to build drug response models by learning alignment patterns between drugs and samples. However, existing studies employed a simple feature fusion strategy and only considered the drug features as a whole representation while ignoring the substructure information that may play a vital role when aligning drugs and genes. In this paper, we propose TCR (Transformer based network for Cancer drug Response) to predict anti-cancer drug response. By utilizing an attention mechanism, TCR is able to learn the interactions between drug atoms/sub-structures and molecular signatures efficiently. Furthermore, a dual loss function and cross sampling strategy were designed to improve the prediction power of TCR. We show that TCR outperformed all other methods under various data splitting strategies on all evaluation metrics (some with significant improvement). Extensive experiments demonstrate that TCR shows significantly improved generalization ability on independent in-vitro experiments and in-vivo real patient data. Our study highlights the prediction power of TCR and its potential value for cancer drug repurposing and precision oncology treatment.

preprint | 2022 | arXiv

Topological classification for intersection singularities of exceptional surfaces in pseudo-Hermitian systems

Exceptional points play a pivotal role in the topology of non-Hermitian systems, and significant advances have been made in classifying exceptional points and exploring the associated phenomena. Exceptional surfaces, which are hypersurfaces of exceptional degeneracies in parameter space, can support hypersurface singularities, such as cusps, intersections and swallowtail catastrophes. Here we topologically classify the intersection singularity of exceptional surfaces for a generic pseudo-Hermitian system with parity-time symmetry. By constructing the quotient space under equivalence relations of eigenstates, we reveal that the topology of such gapless structures can be described by a non-Abelian free group on three generators. Importantly, the classification predicts a new kind of non-Hermitian gapless topological phase and can systematically explain how the exceptional surfaces and their intersections evolve under perturbations with symmetries preserved. Our work opens a new pathway for designing systems with robust topological phases, and provides inspiration for applications such as sensing and lasing which can utilize the special properties inherent in exceptional surfaces and intersections.

preprint | 2020 | arXiv

Koopman analysis in oscillator synchronization

Synchronization is an important dynamical phenomenon in coupled nonlinear systems, which has been studied extensively in recent years. However, analysis focused on individual orbits seems hard to extend to complex systems, while a global statistical approach is overly cursory. The Koopman operator technique seems to balance the two approaches well. In this paper, we extend Koopman analysis to the study of synchronization of coupled oscillators by extracting important eigenvalues and eigenfunctions from the observed time series. A renormalization group analysis is designed to derive an analytic approximation of the eigenfunction that dominates the oscillation in the case of weak coupling. For moderate or strong couplings, numerical computation further confirms the importance of the average frequencies and the associated eigenfunctions. The synchronization transition points could be located with quite high accuracy by checking the correlation of neighbouring eigenfunctions at different coupling strengths, an approach that is readily applied to other nonlinear systems.
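
As a rough illustration of extracting Koopman eigenvalues from an observed time series, the sketch below fits a two-step linear recurrence to a single scalar observable by least squares and reads the eigenvalues off its companion matrix. This delay-embedding (Prony-style) fit is an assumed stand-in chosen for brevity; the abstract does not say which numerical procedure the authors use, and the test signal is synthetic.

```python
import cmath
import math

def delay_fit(series):
    """Least-squares fit of x[t+2] = a * x[t+1] + b * x[t] via the normal equations."""
    s11 = s10 = s00 = r1 = r0 = 0.0
    for x0, x1, x2 in zip(series, series[1:], series[2:]):
        s11 += x1 * x1
        s10 += x1 * x0
        s00 += x0 * x0
        r1 += x2 * x1
        r0 += x2 * x0
    det = s11 * s00 - s10 * s10
    a = (r1 * s00 - r0 * s10) / det
    b = (s11 * r0 - s10 * r1) / det
    return a, b

def koopman_eigenvalues(series):
    """Roots of z**2 - a*z - b, the eigenvalues of the fitted one-step linear map."""
    a, b = delay_fit(series)
    root = cmath.sqrt(a * a + 4.0 * b)
    return (a + root) / 2.0, (a - root) / 2.0

# Synthetic observable: one oscillator with angular frequency 0.3 per sample.
omega = 0.3
signal = [math.cos(omega * t) for t in range(50)]
lam_plus, lam_minus = koopman_eigenvalues(signal)
estimated_frequency = abs(cmath.phase(lam_plus))
```

For a pure oscillation the eigenvalues sit on the unit circle and their phase gives the frequency, which is why average frequencies show up naturally in this kind of analysis.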

preprint | 2020 | arXiv

Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning

The crucial components of a conventional image registration method are the choice of the right feature representations and similarity measures. These two components, although elaborately designed, are somewhat handcrafted using human knowledge. In this work, these two components are instead tackled in an end-to-end manner via reinforcement learning. Specifically, an artificial agent, which is composed of a combined policy and value network, is trained to adjust the moving image toward the right direction. We train this network using an asynchronous reinforcement learning algorithm, where a customized reward function is also leveraged to encourage robust image registration. This trained network is further combined with a lookahead inference to improve the registration capability. The advantage of this algorithm is demonstrated by its superior performance over other state-of-the-art medical image registration methods on clinical MR and CT image pairs.

preprint | 2019 | arXiv

Spatio-temporal variation of temperature for the recent 40 years in Lhasa

It is well known that Lhasa experienced a record-high temperature of 30.8 °C in late June 2019. To better understand the reasons, based on observations recorded at automatic weather stations in Lhasa, we studied the characteristics of temperature variation at multiple time scales using the linear trend method, the Mann-Kendall mutation test, Morlet wavelet analysis, R/S analysis and so on. The results showed that: (a) The annual mean temperature (AMT) is rising at a rate of 0.5 °C per 10 yr, and the average temperature for different seasons also increased significantly, especially in winter. (b) Although there was an intersection in 1995, AMT did not pass the significance test at level α = 0.05, which means there are no abrupt changes in AMT; the values are 7.97 °C and 9.15 °C before and after the intersection point, respectively. (c) AMT has periodic oscillations of 18-25 yr and 25-32 yr based on a large amount of data and the wavelet variance diagrams in Lhasa. AMT has a main cycle of 28 yr; the cyclic patterns of temperature changes in spring, summer and autumn are similar to those of AMT, but the pattern is relatively complex in winter. (d) The Hurst index of AMT and of the different seasons indicates that the temperature is likely to continue to rise in the future in Lhasa.
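
Two of the methods named above are simple enough to sketch. The code below computes a least-squares linear trend and the basic Mann-Kendall trend statistic (without tie correction) on a short, made-up temperature series. The values are hypothetical and are not the Lhasa observations, and the sequential Mann-Kendall variant used for detecting abrupt changes is not shown.

```python
import math

def linear_trend(values):
    """Least-squares slope per time step for equally spaced observations."""
    n = len(values)
    t_mean = (n - 1) / 2
    v_mean = sum(values) / n
    num = sum((t - t_mean) * (v - v_mean) for t, v in enumerate(values))
    den = sum((t - t_mean) ** 2 for t in range(n))
    return num / den

def mann_kendall(values):
    """Mann-Kendall S statistic and normal-approximation Z score (no tie correction)."""
    n = len(values)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)  # sign of the pairwise difference
    var_s = n * (n - 1) * (2 * n + 5) / 18
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    return s, z

# Hypothetical annual mean temperatures in °C, one value per year.
temps = [7.8, 8.0, 7.9, 8.3, 8.4, 8.6, 8.5, 8.9, 9.0, 9.2]
slope_per_year = linear_trend(temps)
s_stat, z_score = mann_kendall(temps)
significant_at_5_percent = abs(z_score) > 1.96  # two-sided test at alpha = 0.05
```

Multiplying `slope_per_year` by 10 gives a rate in the same "°C per 10 yr" units the abstract reports.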

preprint | 2019 | arXiv

Stochastic thermodynamics of an electron-spin-resonance quantum dot system

We present a stochastic thermodynamics analysis of an electron-spin-resonance pumped quantum dot device in the Coulomb-blocked regime, where a pure spin current is generated without an accompanying net charge current. Based on a generalized quantum master equation beyond the secular approximation, quantum coherences are accounted for in terms of an effective average spin in the Floquet basis. Elegantly, this effective spin undergoes a precession about an effective magnetic field, which originates from the non-secular treatment and energy renormalization. It is shown that the interaction between the effective spin and the effective magnetic field may play a dominant role in both energy transport and irreversible entropy production. In the stationary limit, the energy and entropy balance relations are also established based on the theory of counting statistics.