Researcher profile

Yudong Zhang

Yudong Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

AirQualityBench: A Realistic Evaluation Benchmark for Global Air Quality Forecasting

Air-quality forecasting models are commonly evaluated on regional, preprocessed, and normalized datasets, where missing observations are removed or artificially completed. Such protocols simplify comparison but hide the conditions that dominate real monitoring networks: uneven global coverage, structured missingness, heterogeneous pollutant scales, and deployment cost. We introduce \textbf{AirQualityBench}, a global multi-pollutant benchmark designed to evaluate forecasting models under these realistic conditions. The benchmark contains hourly observations from 3,720 monitoring stations over 2021--2025, covers six major pollutants, and preserves provider-native observation masks. Rather than imputing a dense data tensor, AirQualityBench exposes missingness as part of the forecasting problem and reports errors on valid future observations after inverse transformation to physical concentration scales. Evaluating representative spatio-temporal models under this unified protocol shows that strong performance on sanitized datasets does not reliably transfer to global, fragmented monitoring streams. AirQualityBench therefore serves as a realistic testbed for scalable, mask-aware, and physically interpretable air-quality forecasting. All benchmark data, code, evaluation scripts, and baseline implementations are available at \href{https://github.com/Star-Learning/AirQualityBench}{GitHub}.

preprint2026arXiv

CoRE: Concept-Reasoning Expansion for Continual Brain Lesion Segmentation

Accurate brain lesion segmentation in MRI is vital for effective clinical diagnosis and treatment planning. Due to high annotation costs and strict data privacy regulations, universal models require employing Continual Learning (CL) to adapt to evolving clinical tasks without losing previously acquired knowledge. However, existing CL paradigms often suffer from capacity limits or redundant parameter growth, and even advanced dynamic methods rely mostly on image-perception strategies that struggle to handle the substantial pathological and multimodal heterogeneity inherent in brain imaging. To address this issue, we propose Concept-Reasoning Expansion (CoRE) framework, which establishes a joint decision-making mechanism by integrating visual features with structured concepts. Through the alignment of image tokens with a hierarchical concept library, CoRE simulates clinical reasoning to guide both interpretable expert routing and demand-based model growth. This collaborative process ensures model evolution is grounded in clinical priors, preventing redundant parameter expansion while maximizing knowledge reuse. Extensive evaluations across 12 sequential brain lesion MRI tasks demonstrate that CoRE achieves state-of-the-art performance and provides a high knowledge starting point for efficient future adaptation. Its superior few-shot transferability and clinical interpretability further validate its effectiveness in managing non-stationary clinical data streams. Our code will be released soon.

preprint2026arXiv

MHSA: A Lightweight Framework for Mitigating Hallucinations via Steered Attention in LVLMs

Large vision-language models (LVLMs) have achieved remarkable performance across diverse multimodal tasks, yet they continue to suffer from hallucinations, generating content that is inconsistent with the visual input. Prior work DHCP (Detecting Hallucinations by Cross-modal Attention Pattern) has explored hallucination detection from the perspective of cross-modal attention, but does not address hallucination mitigation. In this paper, we propose MHSA (Mitigating Hallucinations via Steered Attention), a lightweight framework that mitigates hallucinations by learning to correct cross-modal attention patterns in LVLMs. MHSA trains a simple three-layer MLP generator to produce corrected attention, guided by supervisory signals from the DHCP discriminator and the LVLM itself. During inference, MHSA mitigates both discriminative and generative hallucinations across various datasets and LVLMs by simply replacing the original cross-modal attention with the corrected one, without modifying any LVLM parameters. By extending cross-modal attention mechanisms from hallucination detection to hallucination mitigation, MHSA offers a novel perspective on hallucination research in LVLMs and helps enhance their reliability.

preprint2025arXiv

MGML: A Plug-and-Play Meta-Guided Multi-Modal Learning Framework for Incomplete Multimodal Brain Tumor Segmentation

Leveraging multimodal information from Magnetic Resonance Imaging (MRI) plays a vital role in lesion segmentation, especially for brain tumors. However, in clinical practice, multimodal MRI data are often incomplete, making it challenging to fully utilize the available information. Therefore, maximizing the utilization of this incomplete multimodal information presents a crucial research challenge. We present a novel meta-guided multi-modal learning (MGML) framework that comprises two components: meta-parameterized adaptive modality fusion and consistency regularization module. The meta-parameterized adaptive modality fusion (Meta-AMF) enables the model to effectively integrate information from multiple modalities under varying input conditions. By generating adaptive soft-label supervision signals based on the available modalities, Meta-AMF explicitly promotes more coherent multimodal fusion. In addition, the consistency regularization module enhances segmentation performance and implicitly reinforces the robustness and generalization of the overall framework. Notably, our approach does not alter the original model architecture and can be conveniently integrated into the training pipeline for end-to-end model optimization. We conducted extensive experiments on the public BraTS2020 and BraTS2023 datasets. Compared to multiple state-of-the-art methods from previous years, our method achieved superior performance. On BraTS2020, for the average Dice scores across fifteen missing modality combinations, building upon the baseline, our method obtained scores of 87.55, 79.36, and 62.67 for the whole tumor (WT), the tumor core (TC), and the enhancing tumor (ET), respectively. We have made our source code publicly available at https://github.com/worldlikerr/MGML.

preprint2022arXiv

A Review on Serious Games for Exercise Rehabilitation

Disability is an important factor affecting todays society. At the same time, more and more sub-healthy people are sick due to reduced body functions and cognitive functions. Exercise rehabilitation is a kind of physical therapy, which can recover the motor ability, cognitive ability, and mental state of them through exercise. But the traditional exercise rehabilitation has some drawbacks so that people who need exercise rehabilitation cannot stick to it. Therefore, many researchers improved the drawbacks of traditional exercise rehabilitation by serious games for exercise rehabilitation. Although there were abundant achievements in the games, its relevant technologies and representative games are not be summarized systematically. To fill this gap, we introduced the significance of the convergence of exercise rehabilitation and serious games. Then, our paper sorted out the development of the games based on interaction mode between games and players. Besides, we analyzed the characteristics of different user groups and the specific functions of the games corresponding to them, and gave our classification based on this. Based on the classification, we reviewed related studies of the games in the past decade years and gave some suggestions on game design and development. Finally, we proposed serval research directions worth studying about the games technology development, functional design and social popularization.

preprint2022arXiv

Discrete Boltzmann modeling of high-speed compressible flows with various depths of non-equilibrium

The non-equilibrium high-speed compressible flows present wealthy applications in engineering and science. With the deepening of Thermodynamic Non-Equilibrium (TNE), higher-order non-conserved kinetic moments of the distribution function are needed to capture the main feature of the flow state and evolution process. Based on the ellipsoidal statistical Bhatnagar-Gross-Krook model, Discrete Boltzmann Models (DBMs) that consider various orders (from the first up to the sixth order) of TNE effects are developed to study flows in various depths of TNE. Specifically, at first, two types of one-dimensional Riemann problems and a Couette flow are used to show the model's capability to capture large flow structures with zero-order and first-order TNE effects, respectively. Then, a shock wave structure given by Direct simulation Monte Carlo is used to verify the model's capability to capture fine structures at the level of mean free path of molecules. Further, we focus on the TNE degree of two colliding fluids. A five-component vector $\mathbf{S}_{TNE} = (τ, Δ\mathbf{u}, ΔT, \bm{Δ_{2}^{*}},\bm{Δ_{3,1}^{*}})$ is introduced to roughly characterize the TNE degree. It is found that the TNE strengths obtained from various perspectives are different. These findings demonstrate that the inadequacy of focusing only on the few kinetic moments appearing in Navier-Stokes increases with the degree of discreteness and deviation from thermodynamic equilibrium. Finally, a two-dimensional free jet is simulated to indicate that, to obtain satisfying hydrodynamic quantities, the DBM should include at least up to the third-order TNE effects.

preprint2022arXiv

Discrete Boltzmann modeling of Rayleigh-Taylor instability: effects of interfacial tension, viscosity and heat conductivity

The Rayleigh-Taylor Instability (RTI) in compressible flow with inter-molecular interactions is probed via the Discrete Boltzmann Method (DBM). The effects of interfacial tension, viscosity and heat conduction are investigated. It is found that the influences of interfacial tension on the perturbation amplitude, bubble velocity, and two kinds of entropy production rates all show differences at different stages of RTI evolution. It inhibits the RTI evolution at the bubble acceleration stage, while at the asymptotic velocity stage, it first promotes and then inhibits the RTI evolution. Viscosity and heat conduction inhibit the RTI evolution. Viscosity shows a suppressive effect on entropy generation rate related to heat flow at the early stage but a first promotive and then suppressive effect on entropy generation rate related to heat flow at a later stage. Heat conduction shows a promotive effect on entropy generation rate related to heat flow at an early stage. Still, it offers a first promotive and then suppressive effect on entropy generation rate related to heat flow at a later stage. By introducing the morphological boundary length, we found that the stage of exponential growth of interface length with time corresponds to the bubble acceleration stage. The first maximum point of interface length change rate and the first maximum point of the change rate of entropy generation rate related to viscous stress can be used as a new criterion for RTI to enter the asymptotic velocity stage.

preprint2021arXiv

Effects of the initial perturbations on the Rayleigh-Taylor-Kelvin-Helmholtz instability system

In the paper, the effects of initial perturbations on the Rayleigh-Taylor instability (RTI), Kelvin-Helmholtz instability (KHI), and the coupled Rayleigh-Taylor-Kelvin-Helmholtz instability (RTKHI) systems are investigated using a multiple-relaxation-time discrete Boltzmann model. Six different perturbation interfaces are designed to study the effects of the initial perturbations on the instability systems. Based on the mean heat flux strength $D_{3,1}$, the effects of initial interfaces on the coupled RTKHI are examined in detail. The research is focused on two aspects: (i) the main mechanism in the early stage of the RTKHI, (ii) the transition point from KHI-like to RTI-like for the case where the KHI dominates at earlier time and the RTI dominates at later time. It is found that the early main mechanism is related to the shape of the initial interface, which is represented by both the bilateral contact angle $θ_{1}$ and the middle contact angle $θ_{2}$. The influence of inverted parabolic and inverted ellipse perturbations ($θ_{1}<90$) on the transition point of the RTKHI system is greater than that of other interfaces.

preprint2020arXiv

Morphological and non-equilibrium analysis of coupled Rayleigh-Taylor-Kelvin-Helmholtz instability

In this paper, the coupled Rayleigh-Taylor-Kelvin-Helmholtz instability(RTI, KHI and RTKHI, respectively) system is investigated using a multiple-relaxation-time discrete Boltzmann model. Both the morphological boundary length and thermodynamic nonequilibrium (TNE) strength are introduced to probe the complex configurations and kinetic processes. In the simulations, RTI always plays a major role in the later stage, while the main mechanism in the early stage depends on the comparison of buoyancy and shear strength. It is found that, both the total boundary length $L$ of the condensed temperature field and the mean heat flux strength $D_{3,1}$ can be used to measure the ratio of buoyancy to shear strength, and to quantitatively judge the main mechanism in the early stage of the RTKHI system. Specifically, when KHI (RTI) dominates, $L^{KHI} > L^{RTI}$ ($L^{KHI} < L^{RTI}$), $D_{3,1}^{KHI} > D_{3,1}^{RTI}$ ($D_{3,1}^{KHI} < D_{3,1}^{RTI}$); when KHI and RTI are balanced, $L^{KHI} = L^{RTI}$, $D_{3,1}^{KHI} = D_{3,1}^{RTI}$. A second sets of findings are as below: For the case where the KHI dominates at earlier time and the RTI dominates at later time, the evolution process can be roughly divided into two stages. Before the transition point of the two stages, $L^{RTKHI}$ initially increases exponentially, and then increases linearly. Hence, the ending point of linear increasing $L^{RTKHI}$ can work as a geometric criterion for discriminating the two stages. The TNE quantity, heat flux strength $D_{3,1}^{RTKHI}$, shows similar behavior. Therefore, the ending point of linear increasing $D_{3,1}^{RTKHI}$ can work as a physical criterion for discriminating the two stages.

preprint2019arXiv

Knowledge-aided Two-dimensional Autofocus for Spotlight SAR Filtered Backprojection Imagery

Filtered backprojection (FBP) algorithm is a popular choice for complicated trajectory SAR image formation processing due to its inherent nonlinear motion compensation capability. However, how to efficiently autofocus the defocused FBP imagery when the motion measurement is not accurate enough is still a challenging problem. In this paper, a new interpretation of the FBP derivation is presented from the Fourier transform point of view. Based on this new viewpoint, the property of the residual 2-D phase error in FBP imagery is analyzed in detail. Then, by incorporating the derived a priori knowledge on the 2-D phase error, an accurate and efficient 2-D autofocus approach is proposed. The new approach performs the parameter estimation in a dimension-reduced parameter subspace by exploiting the a priori analytical structure of the 2-D phase error, therefore possesses much higher accuracy and efficiency than conventional blind methods. Finally, experimental results clearly demonstrate the effectiveness and robustness of the proposed method.

preprint2018arXiv

Comparative study of discrete Boltzmann model and Navier-Stokes

Discrete Boltzmann model (DBM) is a type of coarse-grained mesoscale kinetic model derived from the Boltzmann equation. Physically, it is roughly equivalent to a hydrodynamic model supplemented by a coarse-grained model for the relevant thermodynamic non-equilibrium (TNE) behaviours. The Navier-Stokes (NS) model is a traditional macroscopic hydrodynamic model based on continuity hypothesis and conservation laws. In this study, the two models are compared from two aspects, physical capability and computational cost, by simulating two kinds of flow problems including the thermal Couette flow and a Mach 3 step problem. In the cases where the TNE effects are weak, both the two models give accurate results for the hydrodynamic behaviour. Besides, DBM can provide more detailed non-equilibrium information, while the NS is more efficient if concern only the density, momentum, energy and their derived quantities. It is concluded that, if the TNE effects are strong or are to be investigated, the NS is insufficient while DBM is a good choice. While in the cases where the TNE effects are weak and only the macro flow fields are to be studied, the NS is more preferable.