Researcher profile

Yue Zhu

Yue Zhu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Recent advances in Reinforcement Learning with Verifiable Rewards (RLVR) for Large Language Model (LLM) reasoning have been hindered by a persistent challenge: exploration collapse. The semantic homogeneity of random rollouts often traps models in narrow, over-optimized behaviors. While existing methods leverage policy entropy to encourage exploration, they face inherent limitations. Global entropy regularization is susceptible to reward hacking, which can induce meaningless verbosity, whereas local token-selective updates struggle with the strong inductive bias of pre-trained models. To address this, we propose Latent Policy Optimization via Iterative Information Bottleneck (IIB-LPO), a novel approach that shifts exploration from statistical perturbation of token distributions to topological branching of reasoning trajectories. IIB-LPO triggers latent branching at high-entropy states to diversify reasoning paths and employs the Information Bottleneck principle both as a trajectory filter and a self-reward mechanism, ensuring concise and informative exploration. Empirical results across four mathematical reasoning benchmarks demonstrate that IIB-LPO achieves state-of-the-art performance, surpassing prior methods by margins of up to 5.3% in accuracy and 7.4% in diversity metrics.

preprint2026arXiv

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Recent large vision-language models (VLMs) remain fundamentally constrained by a persistent dichotomy: understanding and generation are treated as distinct problems, leading to fragmented architectures, cascaded pipelines, and misaligned representation spaces. We argue that this divide is not merely an engineering artifact, but a structural limitation that hinders the emergence of native multimodal intelligence. Hence, we introduce SenseNova-U1, a native unified multimodal paradigm built upon NEO-unify, in which understanding and generation evolve as synergistic views of a single underlying process. We launch two native unified variants, SenseNova-U1-8B-MoT and SenseNova-U1-A3B-MoT, built on dense (8B) and mixture-of-experts (30B-A3B) understanding baselines, respectively. Designed from first principles, they rival top-tier understanding-only VLMs across text understanding, vision-language perception, knowledge reasoning, agentic decision-making, and spatial intelligence. Meanwhile, they deliver strong semantic consistency and visual fidelity, excelling in conventional or knowledge-intensive any-to-image (X2I) synthesis, complex text-rich infographic generation, and interleaved vision-language generation, with or without think patterns. Beyond performance, we show detailed model design, data preprocessing, pre-/post-training, and inference strategies to support community research. Last but not least, preliminary evidence demonstrates that our models extend beyond perception and generation, performing strongly in vision-language-action (VLA) and world model (WM) scenarios. This points toward a broader roadmap where models do not translate between modalities, but think and act across them in a native manner. Multimodal AI is no longer about connecting separate systems, but about building a unified one and trusting the necessary capabilities to emerge from within.

preprint2026arXiv

Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography

High-numerical-aperture optical coherence tomography (OCT) enables sub-cellular imaging but faces a trade-off between lateral resolution and depth of focus. Computational refocusing can correct defocus in Fourier-domain OCT, yet its limitations remain unaddressed theoretically. We formulate the lateral imaging process of OCT by using pupil-based imaging theory and the constraints of the computational refocusing in point-scanning OCT and spatially-coherent full-field OCT (FFOCT) are analyzed. The constrains in lateral sampling density and the confocality are considered, and it is shown that the maximum correctable defocus (MCD) is primarily limited by confocality in point-scanning OCT, while spatially-coherent FFOCT has no such constraint and can achieve virtually infinite MCD with a proper and reasonable sampling density. This makes spatially-coherent FFOCT particularly suitable for optical coherence microscopy.

preprint2025arXiv

Revisiting Disaggregated Large Language Model Serving for Performance and Energy Implications

Different from traditional Large Language Model (LLM) serving that colocates the prefill and decode stages on the same GPU, disaggregated serving dedicates distinct GPUs to prefill and decode workload. Once the prefill GPU completes its task, the KV cache must be transferred to the decode GPU. While existing works have proposed various KV cache transfer paths across different memory and storage tiers, there remains a lack of systematic benchmarking that compares their performance and energy efficiency. Meanwhile, although optimization techniques such as KV cache reuse and frequency scaling have been utilized for disaggregated serving, their performance and energy implications have not been rigorously benchmarked. In this paper, we fill this research gap by re-evaluating prefill-decode disaggregation under different KV transfer mediums and optimization strategies. Specifically, we include a new colocated serving baseline and evaluate disaggregated setups under different KV cache transfer paths. Through GPU profiling using dynamic voltage and frequency scaling (DVFS), we identify and compare the performance-energy Pareto frontiers across all setups to evaluate the potential energy savings enabled by disaggregation. Our results show that performance benefits from prefill-decode disaggregation are not guaranteed and depend on the request load and KV transfer mediums. In addition, stage-wise independent frequency scaling enabled by disaggregation does not lead to energy saving due to inherently higher energy consumption of disaggregated serving.

preprint2022arXiv

Impedance-based Root-cause Analysis: Comparative Study of Impedance Models and Calculation of Eigenvalue Sensitivity

Impedance models of power systems are useful when state-space models of apparatus such as inverter-based resources (IBRs) have not been made available and instead only black-box impedance models are available. For tracing the root causes of poor damping and tuning modes of the system, the sensitivity of the modes to components and parameters are needed. The so-called critical admittance-eigenvalue sensitivity based on nodal admittance model has provided a partial solution but omits meaningful directional information. The alternative whole-system impedance model yields participation factors of shunt-connected apparatus with directional information that allows separate tuning for damping and frequency, yet do not cover series-connected components. This paper formalises the relationships between the two forms of impedance models and between the two forms of root-cause analysis. The calculation of system eigenvalue sensitivity in impedance models is further developed, which fills the gaps of previous research and establishes a complete theory of impedance-based root-cause analysis. The theoretical relationships and the tuning of parameters have been illustrated with a three-node passive network, a modified IEEE 14-bus network and a modified NETS-NYPS 68-bus network, showing that tools can be developed for tuning of IBR-rich power systems where only black-box impedance models are available.

preprint2020arXiv

Calibration of the Instrumental Response of Insight-HXMT/HE CsI Detectors for Gamma-Ray Monitoring

The CsI detectors of the High Energy X-ray Telescope of the Hard X-ray Modulation Telescope (HXMT/CsI) can be used for gamma-ray all sky monitoring and searching for the electromagnetic counterpart of gravitational wave source. The instrumental responses are mainly obtained by Monte Carlo simulation with the Geant4 tool and the mass model of both the satellite and all the payloads, which is updated and tested with the Crab pulse emission in various incident directions. Both the Energy-Channel relationship and the energy resolution are calibrated in two working modes (Normal-Gain mode & Low-Gain Mode) with the different detection energy ranges. The simulative spectral analyses show that HXMT/CsI can constrain the spectral parameters much better in the high energy band than that in the low energy band. The joint spectral analyses are performed to ten bright GRBs observed simultaneously with HXMT/CsI and other instruments (Fermi/GBM, Swift/BAT, Konus-Wind), and the results show that the GRB flux given by HXMT/CsI is systematically higher by $7.0\pm8.8\%$ than those given by the other instruments. The HXMT/CsI-Fermi/GBM joint fittings also show that the high energy spectral parameter can be constrained much better as the HXMT/CsI data are used in the joint fittings.

preprint2020arXiv

Discovery of oscillations above 200 keV in a black hole X-ray binary with Insight-HXMT

Low-frequency quasi-periodic oscillations (LFQPOs) are commonly found in black hole X-ray binaries, and their origin is still under debate. The properties of LFQPOs at high energies (above 30 keV) are closely related to the nature of the accretion flow in the innermost regions, and thus play a crucial role in critically testing various theoretical models. The Hard X-ray Modulation Telescope (Insight-HXMT) is capable of detecting emissions above 30 keV, and is therefore an ideal instrument to do so. Here we report the discovery of LFQPOs above 200 keV in the new black hole MAXI J1820+070 in the X-ray hard state, which allows us to understand the behaviours of LFQPOs at hundreds of kiloelectronvolts. The phase lag of the LFQPO is constant around zero below 30 keV, and becomes a soft lag (that is, the high-energy photons arrive first) above 30 keV. The soft lag gradually increases with energy and reaches ~0.9s in the 150-200 keV band. The detection at energies above 200 keV, the large soft lag and the energy-related behaviors of the LFQPO pose a great challenge for most currently existing models, but suggest that the LFQPO probably originates from the precession of a small-scale jet.

preprint2020arXiv

Impedance-Based Whole-System Modeling for a Composite Grid via Frame-Dynamics Embedding

The paper establishes a methodology to overcome the difficulty of dynamic frame alignment and system separation in impedance modeling of ac grids, and thereby enables impedance-based whole-system modeling of generator-converter composite power systems. The methodology is based on a frame-dynamics-embedding transformation via an intermediary steady frame between local and global frames, which yields a locally defined impedance model for each generator or converter that does not rely on a global frame but retains all frame dynamics. The individual impedance model can then be readily combined into a whole-system model even for meshed networks via the proposed closed-loop formulation without network separation. Compared to start-of-the-art impedance-based models, the proposed method retains both frame dynamics and scalability, and is generally applicable to various network topologies (meshed, radial, etc) and combinations of machines (generators, motors, converters, etc). The methodology is used to analyze the dynamic interaction between generators and converters in a composite grid, which yields important findings and potential solutions for unstable oscillation caused by PLL-swing coupling in low-inertia grids.

preprint2020arXiv

Transferring Inter-Class Correlation

The Teacher-Student (T-S) framework is widely utilized in the classification tasks, through which the performance of one neural network (the student) can be improved by transferring knowledge from another trained neural network (the teacher). Since the transferring knowledge is related to the network capacities and structures between the teacher and the student, how to define efficient knowledge remains an open question. To address this issue, we design a novel transferring knowledge, the Self-Attention based Inter-Class Correlation (ICC) map in the output layer, and propose our T-S framework, Inter-Class Correlation Transfer (ICCT).

preprint2019arXiv

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

As China's first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed as Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progresses, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results.