Source author record

Yue Zhu

Yue Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.HE Artificial Intelligence astro-ph.IM Computer Vision eess.SY Machine Learning nucl-ex physics.ins-det Systems and Control Distributed, Parallel, and Cluster Computing Hardware Architecture Performance physics.optics

Catalog footprint

What is connected

14works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Recent advances in Reinforcement Learning with Verifiable Rewards (RLVR) for Large Language Model (LLM) reasoning have been hindered by a persistent challenge: exploration collapse. The semantic homogeneity of random rollouts often traps models in narrow, over-optimized behaviors. While existing methods leverage policy entropy to encourage exploration, they face inherent limitations. Global entropy regularization is susceptible to reward hacking, which can induce meaningless verbosity, whereas local token-selective updates struggle with the strong inductive bias of pre-trained models. To address this, we propose Latent Policy Optimization via Iterative Information Bottleneck (IIB-LPO), a novel approach that shifts exploration from statistical perturbation of token distributions to topological branching of reasoning trajectories. IIB-LPO triggers latent branching at high-entropy states to diversify reasoning paths and employs the Information Bottleneck principle both as a trajectory filter and a self-reward mechanism, ensuring concise and informative exploration. Empirical results across four mathematical reasoning benchmarks demonstrate that IIB-LPO achieves state-of-the-art performance, surpassing prior methods by margins of up to 5.3% in accuracy and 7.4% in diversity metrics.

preprint2026arXiv

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Recent large vision-language models (VLMs) remain fundamentally constrained by a persistent dichotomy: understanding and generation are treated as distinct problems, leading to fragmented architectures, cascaded pipelines, and misaligned representation spaces. We argue that this divide is not merely an engineering artifact, but a structural limitation that hinders the emergence of native multimodal intelligence. Hence, we introduce SenseNova-U1, a native unified multimodal paradigm built upon NEO-unify, in which understanding and generation evolve as synergistic views of a single underlying process. We launch two native unified variants, SenseNova-U1-8B-MoT and SenseNova-U1-A3B-MoT, built on dense (8B) and mixture-of-experts (30B-A3B) understanding baselines, respectively. Designed from first principles, they rival top-tier understanding-only VLMs across text understanding, vision-language perception, knowledge reasoning, agentic decision-making, and spatial intelligence. Meanwhile, they deliver strong semantic consistency and visual fidelity, excelling in conventional or knowledge-intensive any-to-image (X2I) synthesis, complex text-rich infographic generation, and interleaved vision-language generation, with or without think patterns. Beyond performance, we show detailed model design, data preprocessing, pre-/post-training, and inference strategies to support community research. Last but not least, preliminary evidence demonstrates that our models extend beyond perception and generation, performing strongly in vision-language-action (VLA) and world model (WM) scenarios. This points toward a broader roadmap where models do not translate between modalities, but think and act across them in a native manner. Multimodal AI is no longer about connecting separate systems, but about building a unified one and trusting the necessary capabilities to emerge from within.

preprint2026arXiv

Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography

High-numerical-aperture optical coherence tomography (OCT) enables sub-cellular imaging but faces a trade-off between lateral resolution and depth of focus. Computational refocusing can correct defocus in Fourier-domain OCT, yet its limitations remain unaddressed theoretically. We formulate the lateral imaging process of OCT by using pupil-based imaging theory and the constraints of the computational refocusing in point-scanning OCT and spatially-coherent full-field OCT (FFOCT) are analyzed. The constrains in lateral sampling density and the confocality are considered, and it is shown that the maximum correctable defocus (MCD) is primarily limited by confocality in point-scanning OCT, while spatially-coherent FFOCT has no such constraint and can achieve virtually infinite MCD with a proper and reasonable sampling density. This makes spatially-coherent FFOCT particularly suitable for optical coherence microscopy.

preprint2025arXiv

Revisiting Disaggregated Large Language Model Serving for Performance and Energy Implications

Different from traditional Large Language Model (LLM) serving that colocates the prefill and decode stages on the same GPU, disaggregated serving dedicates distinct GPUs to prefill and decode workload. Once the prefill GPU completes its task, the KV cache must be transferred to the decode GPU. While existing works have proposed various KV cache transfer paths across different memory and storage tiers, there remains a lack of systematic benchmarking that compares their performance and energy efficiency. Meanwhile, although optimization techniques such as KV cache reuse and frequency scaling have been utilized for disaggregated serving, their performance and energy implications have not been rigorously benchmarked. In this paper, we fill this research gap by re-evaluating prefill-decode disaggregation under different KV transfer mediums and optimization strategies. Specifically, we include a new colocated serving baseline and evaluate disaggregated setups under different KV cache transfer paths. Through GPU profiling using dynamic voltage and frequency scaling (DVFS), we identify and compare the performance-energy Pareto frontiers across all setups to evaluate the potential energy savings enabled by disaggregation. Our results show that performance benefits from prefill-decode disaggregation are not guaranteed and depend on the request load and KV transfer mediums. In addition, stage-wise independent frequency scaling enabled by disaggregation does not lead to energy saving due to inherently higher energy consumption of disaggregated serving.

preprint2022arXiv

Impedance-based Root-cause Analysis: Comparative Study of Impedance Models and Calculation of Eigenvalue Sensitivity

Impedance models of power systems are useful when state-space models of apparatus such as inverter-based resources (IBRs) have not been made available and instead only black-box impedance models are available. For tracing the root causes of poor damping and tuning modes of the system, the sensitivity of the modes to components and parameters are needed. The so-called critical admittance-eigenvalue sensitivity based on nodal admittance model has provided a partial solution but omits meaningful directional information. The alternative whole-system impedance model yields participation factors of shunt-connected apparatus with directional information that allows separate tuning for damping and frequency, yet do not cover series-connected components. This paper formalises the relationships between the two forms of impedance models and between the two forms of root-cause analysis. The calculation of system eigenvalue sensitivity in impedance models is further developed, which fills the gaps of previous research and establishes a complete theory of impedance-based root-cause analysis. The theoretical relationships and the tuning of parameters have been illustrated with a three-node passive network, a modified IEEE 14-bus network and a modified NETS-NYPS 68-bus network, showing that tools can be developed for tuning of IBR-rich power systems where only black-box impedance models are available.

preprint2020arXiv

Calibration of the Instrumental Response of Insight-HXMT/HE CsI Detectors for Gamma-Ray Monitoring

The CsI detectors of the High Energy X-ray Telescope of the Hard X-ray Modulation Telescope (HXMT/CsI) can be used for gamma-ray all sky monitoring and searching for the electromagnetic counterpart of gravitational wave source. The instrumental responses are mainly obtained by Monte Carlo simulation with the Geant4 tool and the mass model of both the satellite and all the payloads, which is updated and tested with the Crab pulse emission in various incident directions. Both the Energy-Channel relationship and the energy resolution are calibrated in two working modes (Normal-Gain mode & Low-Gain Mode) with the different detection energy ranges. The simulative spectral analyses show that HXMT/CsI can constrain the spectral parameters much better in the high energy band than that in the low energy band. The joint spectral analyses are performed to ten bright GRBs observed simultaneously with HXMT/CsI and other instruments (Fermi/GBM, Swift/BAT, Konus-Wind), and the results show that the GRB flux given by HXMT/CsI is systematically higher by $7.0\pm8.8\%$ than those given by the other instruments. The HXMT/CsI-Fermi/GBM joint fittings also show that the high energy spectral parameter can be constrained much better as the HXMT/CsI data are used in the joint fittings.

preprint2020arXiv

Discovery of oscillations above 200 keV in a black hole X-ray binary with Insight-HXMT

Low-frequency quasi-periodic oscillations (LFQPOs) are commonly found in black hole X-ray binaries, and their origin is still under debate. The properties of LFQPOs at high energies (above 30 keV) are closely related to the nature of the accretion flow in the innermost regions, and thus play a crucial role in critically testing various theoretical models. The Hard X-ray Modulation Telescope (Insight-HXMT) is capable of detecting emissions above 30 keV, and is therefore an ideal instrument to do so. Here we report the discovery of LFQPOs above 200 keV in the new black hole MAXI J1820+070 in the X-ray hard state, which allows us to understand the behaviours of LFQPOs at hundreds of kiloelectronvolts. The phase lag of the LFQPO is constant around zero below 30 keV, and becomes a soft lag (that is, the high-energy photons arrive first) above 30 keV. The soft lag gradually increases with energy and reaches ~0.9s in the 150-200 keV band. The detection at energies above 200 keV, the large soft lag and the energy-related behaviors of the LFQPO pose a great challenge for most currently existing models, but suggest that the LFQPO probably originates from the precession of a small-scale jet.

preprint2020arXiv

Impedance-Based Whole-System Modeling for a Composite Grid via Frame-Dynamics Embedding

The paper establishes a methodology to overcome the difficulty of dynamic frame alignment and system separation in impedance modeling of ac grids, and thereby enables impedance-based whole-system modeling of generator-converter composite power systems. The methodology is based on a frame-dynamics-embedding transformation via an intermediary steady frame between local and global frames, which yields a locally defined impedance model for each generator or converter that does not rely on a global frame but retains all frame dynamics. The individual impedance model can then be readily combined into a whole-system model even for meshed networks via the proposed closed-loop formulation without network separation. Compared to start-of-the-art impedance-based models, the proposed method retains both frame dynamics and scalability, and is generally applicable to various network topologies (meshed, radial, etc) and combinations of machines (generators, motors, converters, etc). The methodology is used to analyze the dynamic interaction between generators and converters in a composite grid, which yields important findings and potential solutions for unstable oscillation caused by PLL-swing coupling in low-inertia grids.

preprint2020arXiv

Transferring Inter-Class Correlation

The Teacher-Student (T-S) framework is widely utilized in the classification tasks, through which the performance of one neural network (the student) can be improved by transferring knowledge from another trained neural network (the teacher). Since the transferring knowledge is related to the network capacities and structures between the teacher and the student, how to define efficient knowledge remains an open question. To address this issue, we design a novel transferring knowledge, the Self-Attention based Inter-Class Correlation (ICC) map in the output layer, and propose our T-S framework, Inter-Class Correlation Transfer (ICCT).

preprint2019arXiv

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

As China's first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed as Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progresses, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results.

preprint2015arXiv

An improved limit to the diffuse flux of ultra-high energy neutrinos from the Pierre Auger Observatory

Neutrinos in the cosmic ray flux with energies near 1 EeV and above are detectable with the Surface Detector array of the Pierre Auger Observatory. We report here on searches through Auger data from 1 January 2004 until 20 June 2013. No neutrino candidates were found, yielding a limit to the diffuse flux of ultra-high energy neutrinos that challenges the Waxman-Bahcall bound predictions. Neutrino identification is attempted using the broad time-structure of the signals expected in the SD stations, and is efficiently done for neutrinos of all flavors interacting in the atmosphere at large zenith angles, as well as for "Earth-skimming" neutrino interactions in the case of tau neutrinos. In this paper the searches for downward-going neutrinos in the zenith angle bins $60^\circ-75^\circ$ and $75^\circ-90^\circ$ as well as for upward-going neutrinos, are combined to give a single limit. The $90\%$ C.L. single-flavor limit to the diffuse flux of ultra-high energy neutrinos with an $E^{-2}$ spectrum in the energy range $1.0 \times 10^{17}$ eV - $2.5 \times 10^{19}$ eV is $E_ν^2 dN_ν/dE_ν< 6.4 \times 10^{-9}~ {\rm GeV~ cm^{-2}~ s^{-1}~ sr^{-1}}$.

preprint2015arXiv

Measurement of the cosmic ray spectrum above $4{\times}10^{18}$ eV using inclined events detected with the Pierre Auger Observatory

A measurement of the cosmic-ray spectrum for energies exceeding $4{\times}10^{18}$ eV is presented, which is based on the analysis of showers with zenith angles greater than $60^{\circ}$ detected with the Pierre Auger Observatory between 1 January 2004 and 31 December 2013. The measured spectrum confirms a flux suppression at the highest energies. Above $5.3{\times}10^{18}$ eV, the "ankle", the flux can be described by a power law $E^{-γ}$ with index $γ=2.70 \pm 0.02 \,\text{(stat)} \pm 0.1\,\text{(sys)}$ followed by a smooth suppression region. For the energy ($E_\text{s}$) at which the spectral flux has fallen to one-half of its extrapolated value in the absence of suppression, we find $E_\text{s}=(5.12\pm0.25\,\text{(stat)}^{+1.0}_{-1.2}\,\text{(sys)}){\times}10^{19}$ eV.

preprint2014arXiv

A digital CDS technique and the performance testing

Readout noise is a critical parameter for characterizing the performance of charge-coupled devices (CCDs), which can be greatly reduced by the correlated double sampling (CDS) circuit. However, conventional CDS circuit inevitably introduces new noises since it consists of several active analog components such as operational amplifiers. This paper proposes a digital CDS circuit technique, which transforms the pre-amplified CCD signal into a train of digital presentations by a high-speed data acquisition card directly without the noisy CDS circuit first, then implement the digital CDS algorithm through numerical method. The readout noise of 3.3 e$^{-}$ and the energy resolution of 121 eV@5.9keV can be achieved via the digital CDS technique.

preprint2014arXiv

Proton irradiation effect on SCDs

The Low Energy X-ray Telescope is a main payload on the Hard X-ray Modulation Telescope satellite. The swept charge device is selected for the Low Energy X-ray Telescope. As swept charge devices are sensitive to proton irradiation, irradiation test was carried out on the HI-13 accelerator at the China Institute of Atomic Energy. The beam energy was measured to be 10 MeV at the SCD. The proton fluence delivered to the SCD was $3\times10^{8}\mathrm{protons}/\mathrm{cm}^{2}$ over two hours. It is concluded that the proton irradiation affects both the dark current and the charge transfer inefficiency of the SCD through comparing the performance both before and after the irradiation. The energy resolution of the proton-irradiated SCD is 212 eV@5.9 keV at $-60\,^{\circ}\mathrm{C}$, while it before irradiated is 134 eV. Moreover, better performance can be reached by lowering the operating temperature of the SCD on orbit.

Yue Zhu

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Theoretical analysis of performance limitation of computational refocusing in optical coherence tomography

Revisiting Disaggregated Large Language Model Serving for Performance and Energy Implications

Impedance-based Root-cause Analysis: Comparative Study of Impedance Models and Calculation of Eigenvalue Sensitivity

Calibration of the Instrumental Response of Insight-HXMT/HE CsI Detectors for Gamma-Ray Monitoring

Discovery of oscillations above 200 keV in a black hole X-ray binary with Insight-HXMT

Impedance-Based Whole-System Modeling for a Composite Grid via Frame-Dynamics Embedding

Transferring Inter-Class Correlation

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

An improved limit to the diffuse flux of ultra-high energy neutrinos from the Pierre Auger Observatory

Measurement of the cosmic ray spectrum above $4{\times}10^{18}$ eV using inclined events detected with the Pierre Auger Observatory

A digital CDS technique and the performance testing

Proton irradiation effect on SCDs