Researcher profile

Yiran Zhang

Yiran Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Global Context Compression with Interleaved Vision-Text Transformation

Recent achievements of vision-language models in end-to-end OCR point to a new avenue for low-loss compression of textual information. This motivates earlier works that render the Transformer's input into images for prefilling, which effectively reduces the number of tokens through visual encoding, thereby alleviating the quadratically increased Attention computations. However, this partial compression fails to save computational or memory costs at token-by-token inference. In this paper, we investigate global context compression, which saves tokens at both prefilling and inference stages. Consequently, we propose VIST2, a novel Transformer that interleaves input text chunks alongside their visual encoding, while depending exclusively on visual tokens in the pre-context to predict the next text token distribution. Around this idea, we render text chunks into sketch images and train VIST2 in multiple stages, starting from curriculum-scheduled pretraining for optical language modeling, followed by modal-interleaved instruction tuning. We conduct extensive experiments using VIST2 families scaled from 0.6B to 8B to explore the training recipe and hyperparameters. With a 4$\times$ compression ratio, the resulting models demonstrate significant superiority over baselines on long writing tasks, achieving, on average, a 3$\times$ speedup in first-token generation, 77% reduction in memory usage, and 74% reduction in FLOPS. Our codes and datasets will be public to support further studies.

preprint2026arXiv

Reflection Anchors for Propagation-Aware Visual Retention in Long-Chain Multimodal Reasoning

Long chain-of-thought (CoT) reasoning improves large vision--language models, but visual information often fades during generation, limiting long-horizon multimodal reasoning. Existing methods either re-inject vision at inference or train policies for stronger grounding, but where to intervene relies on perception heuristics rather than principled gain analysis, and how local visual influence propagates remains implicit. We study this problem from an information-theoretic standpoint and derive a lower bound on the downstream visual gain of a one-step intervention, which suggests two factors: local branching room (token entropy) and downstream visual propagation potential (suffix divergence from a vision-marginalized reference). Guided by this analysis, we propose reflection-anchor policy optimization (RAPO), a GRPO-based policy optimization method that selects high-entropy reflection anchors and optimizes a chain-masked finite-window KL surrogate for downstream visual dependence. Experiments on reasoning-intensive and general-domain benchmarks show that RAPO delivers substantial gains over strong baselines across multiple LVLM backbones. Mechanism analyses further indicate that reflection anchors are enriched for visually sensitive decision points and that RAPO increases contrastive visual-dependence signals along generated trajectories.

preprint2022arXiv

A Three-component Model for Cosmic-ray Spectrum and Dipole Anisotropy

Using a three-component, multi-scale diffusion model, we show that the cosmic-ray (CR) proton and helium spectra and the dipole anisotropy can be explained with reasonable parameters. The model includes a nearby source associated with the supernova remnant (SNR) that gave rise to the Geminga pulsar, a source at the Galactic center, and a component associated with the Galactic disk. The CR flux below TeV is dominated by the disk component. The center source with a continuous injection of CRs starting about 18 Myr ago is needed to explain the anisotropy above 100 TeV. With the assumption of universal CR spectra injected by all SNRs, the nearby source can produce a TeV spectral bump observed at Earth via slow diffusion across the interstellar magnetic field, which needs to have an angle $ θ\approx 5^{\circ} $ between the field line and the line of sight toward the source, and have weak magnetic turbulence with the Alfvén Mach number $ M_{\text{A}}\approx 0.1 $. Considering the modulation of the Galactic-scale anisotropy by this magnetic field, in a quasi-local approach the field may be directed at a right ascension about $ -90^{\circ} $ and a declination about $ -7.4^{\circ} $ in the equatorial coordinate system.

preprint2022arXiv

Brain-Inspired Modelling and Decision-making for Human-Like Autonomous Driving in Mixed Traffic Environment

In this paper, a human-like driving framework is designed for autonomous vehicles (AVs), which aims to make AVs better integrate into the transportation ecology of human driving and eliminate the misunderstanding and incompatibility of human drivers to autonomous driving. Based on the analysis of the real world INTERACTION dataset, a driving aggressiveness estimation model is established with the fuzzy inference approach. Then, a human-like driving model, which integrates the brain emotional learning circuit model (BELCM) with the two-point preview model, is designed. In the human-like lane-change decision-making algorithm, the cost function is designed comprehensively considering driving safety and travel efficiency. Based on the cost function and multi-constraint, the dynamic game algorithm is applied to modelling the interaction and decision making between AV and human driver. Additionally, to guarantee the lane-change safety of AVs, an artificial potential field model is built for collision risk assessment. Finally, the proposed algorithm is evaluated through human-in-the-loop experiments on a driving simulator, and the results demonstrated the feasibility and effectiveness of the proposed method.

preprint2022arXiv

Hierarchy of Symmetry Breaking Correlated Phases in Twisted Bilayer Graphene

Twisted bilayer graphene (TBG) near the magic twist angle of $\sim1.1^{o}$ exhibits a rich phase diagram. However, the interplay between different phases and their dependence on twist angle is still elusive. Here, we explore the stability of various TBG phases and demonstrate that superconductivity near filling of two electrons per moiré unit cell alongside Fermi surface reconstructions, as well as entropy-driven high-temperature phase transitions and linear-in-T resistance occur over a range of twist angles which extends far beyond those exhibiting correlated insulating phases. In the vicinity of the magic angle, we also find a metallic phase that displays a hysteretic anomalous Hall effect and incipient Chern insulating behaviour. Such a metallic phase can be rationalized in terms of the interplay between interaction-driven deformations of TBG bands leading to Berry curvature redistribution and Fermi surface reconstruction. Our results provide an extensive perspective on the hierarchy of correlated phases in TBG as classified by their robustness against deviations from the magic angle or, equivalently, their electronic interaction requirements.

preprint2022arXiv

Power-law Spectrum of Energetic Particles in Classical Thermal Equilibrium by Pitch-angle Scattering Process

The Boltzmann-Gibbs thermodynamic equilibrium state of charged particles pitch-angle scattered by weak plasma waves is discussed. Degrees of freedom of these waves play a fundamental role in constructing the grand canonical ensemble. Via the gyro-resonance condition, fast particles have an inverse break power-law spectrum for $ \varepsilon -μ\ll T $, where $ \varepsilon $ is the particle energy, $ μ$ is the chemical potential, $ T $ is the temperature. The break energies are the rest energy and $ -μ$. For $ \varepsilon \ll -μ\ll T $, the energy spectral index $ α$ is $ δ/2+1 $ and $ δ+1 $ for non- and ultra-relativistic particles, respectively, with $ δ$ an effective fractal dimension of background magnetic field lines. The spectral index for $ -μ\ll \varepsilon \ll T $ is $ α+1 $. This thermal equilibrium scenario, combined with the leaky-box model and cosmic-ray observations, seems to suggest that the Galactic magnetic field is super-diffusive with $ δ\approx 1.4 $.

preprint2021arXiv

Interaction-driven Band Flattening and Correlated Phases in Twisted Bilayer Graphene

Flat electronic bands, characteristic of magic-angle twisted bilayer graphene (TBG), host a wealth of correlated phenomena. Early theoretical considerations suggested that, at the magic angle, the Dirac velocity vanishes and the entire width of the moiré bands becomes extremely narrow. Yet, this scenario contradicts experimental studies that reveal a finite Dirac velocity as well as bandwidths significantly larger than predicted. Here we use spatially resolved spectroscopy in finite and zero magnetic fields to examine the electronic structure of moiré bands and their intricate connection to correlated phases. By following the relative shifts of Landau levels in finite fields, we detect filling-dependent band flattening, that unexpectedly starts already at ~1.3 degrees, well above the magic angle and hence nominally in the weakly correlated regime. We further show that, as the twist angle is reduced, the moiré bands become maximally flat at progressively lower doping levels. Surprisingly, when the twist angles reach values for which the maximal flattening occurs at approximate filling of $-2$, $+1$,$+2$,$+3$ electrons per moiré unit cell, the corresponding zero-field correlated phases start to emerge. Our observations are corroborated by calculations that incorporate an interplay between the Coulomb charging energy and exchange interactions; together these effects produce band flattening and hence a significant density-of-states enhancement that facilitates the observed symmetry-breaking cascade transitions. Besides emerging phases pinned to integer fillings, we also experimentally identify a series of pronounced correlation-driven band deformations and soft gaps in a wider doping range around $\pm 2$ filling where superconductivity is expected. Our results highlight the role of interaction-driven band-flattening in forming robust correlated phases in TBG.

preprint2020arXiv

Self-similar Blast Wave for A Two-component Fluid with Variable Adiabatic Index

We propose a self-similar (SS) solution to hydrodynamic non-relativistic flow behind a spherical strong blast wave (BW) passing through a homogeneous plasma with efficient relativistic particle acceleration at the shock front. The flow is described by an ideal two-fluid model with a relativistic component so that the post-shock gas has an effective SS adiabatic index $ γ$ varying from $ 5/3 $ to $ 4/3 $. This solution is calculated numerically and compared with the standard Sedov solution. We find that the BW center in our solution is dominated by the relativistic component with $ γ=4/3 $ for the divergence of expansion there, and the relativistic component dominates the interior for a moderate acceleration efficiency at the shock front. The overall efficiency of relativistic particle acceleration can be enhanced by a factor of $ 2 $ due to the slower adiabatic energy loss rate of the relativistic component during expansion. Tendency of the dominance by the relativistic component may be common in expanding astrophysical two-fluid systems such as supernova remnants, lobes of radio galaxies.

preprint2020arXiv

Superconductivity without insulating states in twisted bilayer graphene stabilized by monolayer WSe$_2$

Magic-angle twisted bilayer graphene (TBG), with rotational misalignment close to 1.1$^\circ$, features isolated flat electronic bands that host a rich phase diagram of correlated insulating, superconducting, ferromagnetic, and topological phases. The origins of the correlated insulators and superconductivity, and the interplay between them, are particularly elusive. Both states have been previously observed only for angles within $\pm0.1^\circ$ from the magic-angle value and occur in adjacent or overlapping electron density ranges; nevertheless, it is still unclear how the two states are related. Beyond the twist angle and strain, the dependence of the TBG phase diagram on the alignment and thickness of insulating hexagonal boron nitride (hBN) used to encapsulate the graphene sheets indicates the importance of the microscopic dielectric environment. Here we show that adding an insulating tungsten-diselenide (WSe$_2$) monolayer between hBN and TBG stabilizes superconductivity at twist angles much smaller than the established magic-angle value. For the smallest angle of $θ$ = 0.79$^\circ$, we still observe clear superconducting signatures, despite the complete absence of the correlated insulating states and vanishing gaps between the dispersive and flat bands. These observations demonstrate that, even though electron correlations may be important, superconductivity in TBG can exist even when TBG exhibits metallic behaviour across the whole range of electron density. Finite-magnetic-field measurements further reveal breaking of the four-fold spin-valley symmetry in the system, consistent with large spin-orbit coupling induced in TBG via proximity to WSe$_2$. Our results highlight the importance of symmetry breaking effects in stabilizing electronic states in TBG and open new avenues for engineering quantum phases in moiré systems.

preprint2020arXiv

Tracing out Correlated Chern Insulators in Magic Angle Twisted Bilayer Graphene

Magic-angle twisted bilayer graphene (MATBG) exhibits a range of correlated phenomena that originate from strong electron-electron interactions. These interactions make the Fermi surface highly susceptible to reconstruction when $ \pm 1, \pm 2, \pm 3$ electrons occupy each moir\' e unit cell and lead to the formation of correlated insulating, superconducting and ferromagnetic phases. While some phases have been shown to carry a non-zero Chern number, the local microscopic properties and topological character of many other phases remain elusive. Here we introduce a set of novel techniques hinging on scanning tunneling microscopy (STM) to map out topological phases in MATBG that emerge in finite magnetic field. By following the evolution of the local density of states (LDOS) at the Fermi level with electrostatic doping and magnetic field, we visualize a local Landau fan diagram that enables us to directly assign Chern numbers to all observed phases. We uncover the existence of six topological phases emanating from integer fillings in finite fields and whose origin relates to a cascade of symmetry-breaking transitions driven by correlations. The spatially resolved and electron-density-tuned LDOS maps further reveal that these topological phases can form only in a small range of twist angles around the magic-angle value. Both the microscopic origin and extreme sensitivity to twist angle differentiate these topological phases from the Landau levels observed near charge neutrality. Moreover, we observe that even the charge-neutrality Landau spectrum taken at low fields is considerably modified by interactions and exhibits an unexpected splitting between zero Landau levels that can be as large as ${\sim }\,3-5$ meV. Our results show how strong electronic interactions affect the band structure of MATBG and lead to the formation of correlation-enabled topological phases.