Researcher profile

Lei Yang

Lei Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving

End-to-end autonomous driving systems are increasingly integrating Vision-Language Model (VLM) architectures, incorporating text reasoning or visual reasoning to enhance the robustness and accuracy of driving decisions. However, the reasoning mechanisms employed in most methods are direct adaptations from general domains, lacking in-depth exploration tailored to autonomous driving scenarios, particularly within visual reasoning modules. In this paper, we propose a driving world model that performs parallel prediction of latent semantic features for consecutive future frames in the bird's-eye-view (BEV) space, thereby enabling long-horizon modeling of future world states. We also introduce an efficient and adaptive text reasoning mechanism that utilizes additional social knowledge and reasoning capabilities to further improve driving performance in challenging long-tail scenarios. We present a novel, efficient, and effective approach that achieves state-of-the-art (SOTA) results on the closed-loop Bench2drive benchmark. Codes are available at: https://github.com/hotdogcheesewhite/DeepSight.

preprint2026arXiv

Dust-obscured radio-emitting tidal disruption event coincident with a high-energy neutrino event

Despite the growing number of high-energy neutrinos (TeV-PeV) detected by IceCube, their astrophysical origins remain largely unidentified. Recent observations have linked a few tidal disruption events (TDEs) to the production of high-energy neutrino emission, all of which display dust-reprocessed infrared flares, indicating a dust- and gas-rich environment. By cross-matching the neutrino events and a sample of mid-infrared outbursts in nearby galaxies with transient radio flares, we uncover an optically obscured TDE candidate, SDSS J151345.75 $+$ 311125.2, which shows both spatial and temporal coincidence with the sub-PeV neutrino event IC170514B. Using a standard equipartition analysis of the synchrotron spectral evolution spanning 605 days post mid-infrared discovery, we find a little evolution in the radio-emitting region, with a kinetic energy up to $10^{51}$ erg, depending on the outflow geometry and shock acceleration efficiency assumed. High-resolution European VLBI Network imaging reveals a compact radio emission that is unresolved at a scale of $<$ 2.1 pc, with a brightness temperature of $T_b>5\times10^6$ K, suggesting that the observed late-time radio emission might originate from the interaction between a decelerating outflow and a dense circumnuclear medium. If the association is genuine, the neutrino production is possibly related to the acceleration of protons through pp collisions during the outflow expanding process, implying that the outflow-cloud interaction could provide a physical site with a high-density environment for producing the sub-PeV neutrinos. Such a scenario can be tested with future identifications of radio transients coincident with high-energy neutrinos.

preprint2026arXiv

Impact of Nuclear Reaction Rates on Calcium Production in Population III Stars: A Global Analysis

We investigate the sensitivity of calcium production to nuclear reaction rates of a 40 solar-mass Population III star using 1D multi-zone stellar models. A comprehensive nuclear reaction network was constructed, and all $(p,γ)$ and $(p,α)$ reaction rates were individually varied by a factor of 10 up and down, identifying 13 preliminary key reactions for calcium production. To propagate the reaction rate uncertainties on calcium production, two sets of Monte Carlo simulations were performed for these key reactions: one adopting STARLIB reaction rates and the other incorporating updated rates from recent experimental data and evaluations. Our results show that Monte Carlo simulations using the updated rates show good agreement with the observed calcium abundance of the extremely iron-poor star SMSS J031300.36-670839.3 within the 68% confidence interval predicted by the models. In contrast, the observed calcium abundance lies marginally outside the 68% C.I. when using the STARLIB rates. Spearman rank-order correlation analysis and SHAP values show that the $(p,γ)$ and $(p,α)$ reactions of F18 and F19 exhibit strong coupled effects on calcium production. These reaction-rate uncertainties need to be reduced to constrain the stellar model predictions. Our study provides insights for future nuclear physics experiments aimed at reducing reaction rate uncertainties in the nucleosynthesis of Population III Stars. Additionally, comparisons between 20 solar-mass and 40 solar-mass Population III stellar models confirm that the latter, with updated reaction rates, is more capable of reproducing the observed Ca abundance and [Ca/Mg] ratio.

preprint2026arXiv

Polynomially effective equidistribution for unipotent orbits in products of $\mathrm{SL}_2$ factors

We sketch the proof of an effective equidistribution theorem for one-parameter unipotent subgroups in $S$-arithmetic quotients arising from $\mathbf K$-forms of $\mathrm{SL}_2^{\mathsf n}$ where $\mathbf K$ is a number field. This gives an effective version of equidistribution results of Ratner and Shah with a polynomial rate. The key new phenomenon is the existence of many intermediate groups between the $\mathrm{SL}_2$ containing our unipotent and the ambient group, which introduces potential local and global obstruction to equidistribution. Our approach relies on a Bourgain-type projection theorem in the presence of obstructions, together with a careful analysis of these obstructions.

preprint2026arXiv

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Recent large vision-language models (VLMs) remain fundamentally constrained by a persistent dichotomy: understanding and generation are treated as distinct problems, leading to fragmented architectures, cascaded pipelines, and misaligned representation spaces. We argue that this divide is not merely an engineering artifact, but a structural limitation that hinders the emergence of native multimodal intelligence. Hence, we introduce SenseNova-U1, a native unified multimodal paradigm built upon NEO-unify, in which understanding and generation evolve as synergistic views of a single underlying process. We launch two native unified variants, SenseNova-U1-8B-MoT and SenseNova-U1-A3B-MoT, built on dense (8B) and mixture-of-experts (30B-A3B) understanding baselines, respectively. Designed from first principles, they rival top-tier understanding-only VLMs across text understanding, vision-language perception, knowledge reasoning, agentic decision-making, and spatial intelligence. Meanwhile, they deliver strong semantic consistency and visual fidelity, excelling in conventional or knowledge-intensive any-to-image (X2I) synthesis, complex text-rich infographic generation, and interleaved vision-language generation, with or without think patterns. Beyond performance, we show detailed model design, data preprocessing, pre-/post-training, and inference strategies to support community research. Last but not least, preliminary evidence demonstrates that our models extend beyond perception and generation, performing strongly in vision-language-action (VLA) and world model (WM) scenarios. This points toward a broader roadmap where models do not translate between modalities, but think and act across them in a native manner. Multimodal AI is no longer about connecting separate systems, but about building a unified one and trusting the necessary capabilities to emerge from within.

preprint2026arXiv

Theoretical investigation of non-Förster exciton transfer mechanisms in perylene diimide donor, phenylene bridge, and terrylene diimide acceptor systems

The rates of exciton transfer within dyads of perylene diimide and terrylene diimide connected by oligophenylene bridge units have been shown to deviate significantly from those of Förster&#39;s resonance energy transfer theory, according to single molecule spectroscopy experiments. The present work provides a detailed computational and theoretical study investigating the source of such discrepancy. Electronic spectroscopy data are calculated by time-dependent density function theory and then compared with experimental results. Electronic couplings between exciton donor and acceptor are estimated based on both transition density cube method and transition dipole approximation. These results confirm that the delocalization of exciton to the bridge parts contribute to significant enhancement of donor-acceptor electronic coupling. Mechanistic details of exciton transfer are examined by estimating the contributions of the bridge electronic states, vibrational modes of the dyads commonly coupled to both donor and acceptor, inelastic resonance energy transfer mechanism, and dark exciton states. These analyses suggest that the contribution of common vibrational modes serves as the main source of deviation from Förster&#39;s spectral overlap expression.

preprint2026arXiv

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

Modern autonomous vehicle perception systems often struggle with occlusions and limited perception range. Previous studies have demonstrated the effectiveness of cooperative perception in extending the perception range and overcoming occlusions, thereby enhancing the safety of autonomous driving. In recent years, a series of cooperative perception datasets have emerged; however, these datasets primarily focus on cameras and LiDAR, neglecting 4D Radar, a sensor used in single-vehicle autonomous driving to provide robust perception in adverse weather conditions. In this paper, to bridge the gap created by the absence of 4D Radar datasets in cooperative perception, we present V2X-Radar, the first large-scale, real-world multi-modal dataset featuring 4D Radar. V2X-Radar dataset is collected using a connected vehicle platform and an intelligent roadside unit equipped with 4D Radar, LiDAR, and multi-view cameras. The collected data encompasses sunny and rainy weather conditions, spanning daytime, dusk, and nighttime, as well as various typical challenging scenarios. The dataset consists of 20K LiDAR frames, 40K camera images, and 20K 4D Radar data, including 350K annotated boxes across five categories. To support various research domains, we have established V2X-Radar-C for cooperative perception, V2X-Radar-I for roadside perception, and V2X-Radar-V for single-vehicle perception. Furthermore, we provide comprehensive benchmarks across these three sub-datasets. We will release all datasets and benchmark codebase at https://huggingface.co/datasets/yanglei18/V2X-Radar and https://github.com/yanglei18/V2X-Radar.