Researcher profile

Zihao Yang

Zihao Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Compared to What? Baselines and Metrics for Counterfactual Prompting

Counterfactual prompting (i.e., perturbing a single factor and measuring output change) is widely used to evaluate things like LLM bias and CoT faithfulness. But in this work we argue that observed effects cannot be attributed to the targeted factor without accounting for baseline ``meaning-preserving'' modifications to text that establish general model sensitivity. This is because every counterfactual edit is a compound treatment that bundles the variable of interest with incidental surface-form variation; this violates treatment variation irrelevance. We observe prediction flip rates on MedQA of 14.9% when we surgically change patient gender. However, this is statistically indistinguishable from the flip rates induced by simply paraphrasing inputs (14.1%). In this case, it would therefore be unwarranted to conclude that the LLM is especially sensitive to patient gender. To account for this and robustly measure the effects of targeted interventions, we propose a framework in which we compare (via statistical testing) differences observed under target interventions to those induced by paraphrasing inputs. We then use this framework to revisit a analysis done on the MedPerturb dataset, which reported evidence of model sensitivity to patient demographics and stylistic cues. We find that these effects largely dissipate when we account for general model sensitivity, with only 5 of 120 tests reaching statistical significance. Applying the same framework to occupational biography classification, we detect clearly significant directional gender bias, showing that the framework identifies real directional effects even when they are small. We evaluate a range of metrics -- aggregate, per-sample distributional, and regression -- and find that per-sample metrics are dramatically more powerful than aggregate metrics and regression powerfully and uniquely characterizes effect direction and magnitude.

preprint2026arXiv

HorizonDrive: Self-Corrective Autoregressive World Model for Long-horizon Driving Simulation

Closed-loop driving simulation requires real-time interaction beyond short offline clips, pushing current driving world models toward autoregressive (AR) rollout. Existing AR distillation approaches typically rely on frame sinks or student-side degradation training. The former transfers poorly to driving due to fast ego-motion and rapid scene changes, while the latter remains bounded by the teacher's single-pass output length and thus provides only a limited supervision horizon. A natural question is: can the teacher itself be extended via AR rollout to provide unbounded-horizon supervision at bounded memory cost? The key difficulty is that a standard teacher drifts under its own predictions, contaminating the supervision it provides. Our key insight is to make the teacher rollout-capable, ensuring reliable supervision from its own AR rollouts. This is instantiated as HorizonDrive, an anti-drifting training-and-distillation framework for AR driving simulation. First, scheduled rollout recovery (SRR) trains the base model to reconstruct ground-truth future clips from prediction-corrupted histories, yielding a teacher that remains stable across long AR rollouts. Second, the rollout-capable teacher is extended via AR rollout, providing long-horizon distribution-matching supervision under bounded memory, while a short-window student aligns to it with teacher rollout DMD (TRD) for efficient real-time deployment. HorizonDrive natively supports minute-scale AR rollout under bounded memory; on nuScenes, HorizonDrive reduces FID by 52% and FVD by 37%, and lowers ARE and DTW by 21% and 9% relative to the strongest long-horizon streaming baselines, while remaining competitive with single-pass driving video generators.

preprint2026arXiv

Measuring the Coronal Magnetic Field with 2D Coronal Seismology: A Forward-Modeling Validation

In recent years, a two-dimensional (2D) coronal seismology technique applied to spectral-imaging data from the Coronal Multi-channel Polarimeter (CoMP) and UCoMP has enabled routine measurement of the global coronal magnetic field. The technique combines coronal transverse wave phase speed from Doppler measurements with electron densities from the Fe \sc{xiii}\rm{} 10798/10747 Å intensity ratio to infer the magnetic field strength, while the wave propagation directions from Doppler measurements trace the magnetic field direction. To validate the accuracy and robustness of this method, we use forward modeling of a MURaM simulation that produces open and closed magnetic structures with excited waves. From the synthetic Doppler velocity, Fe \sc{xiii}\rm{} infrared line intensities, and linear polarization signals, we apply the 2D coronal seismology technique to estimate the magnetic field strength and direction. A comparison with the simulation ground truth shows close agreement, indicating that the technique can recover the line-of-sight emissivity-weighted magnetic field direction and strength with high accuracy. We also perform a parameter-space analysis to quantify sensitivities of the method to parameter choice. These findings provide practical guidance for CoMP/UCoMP-like analysis and demonstrate that 2D coronal seismology can deliver reliable, LOS emissivity-weighted measurements of the coronal magnetic field from coronal wave observations.

preprint2022arXiv

An efficient multi-modes Monte Carlo homogenization method for random materials

In this paper, we propose and analyze a new stochastic homogenization method for diffusion equations with random and fast oscillatory coefficients. In the proposed method, the homogenized solutions are sought through a two-stage procedure. In the first stage, the original oscillatory diffusion equation is approximated, for each fixed random sample w, by a spatially homogenized diffusion equation with piecewise constant coefficients, resulting a random diffusion equation. In the second stage, the resulted random diffusion equation is approximated and computed by using an efficient multi-modes Monte Carlo method which only requires to solve a diffusion equation with a constant diffusion coefficient and a random right-hand side. The main advantage of the proposed method is that it separates the computational difficulty caused by the spatial fast oscillation of the solution and that caused by the randomness of the solution, so they can be overcome separately using different strategies. The convergence of the solution of the spatially homogenized equation (from the first stage) to the solution of the original random diffusion equation is established and the optimal rate of convergence is also obtained for the proposed multi-modes Monte Carlo method. Numerical experiments on some benchmark test problems for random composite materials are also presented to gauge the efficiency and accuracy of the proposed two-stage stochastic homogenization method.

preprint2022arXiv

Can we detect coronal mass ejections through asymmetries of Sun-as-a-star extreme-ultraviolet spectral line profiles?

Coronal mass ejections (CMEs) are the largest-scale eruptive phenomena in the solar system. Associated with enormous plasma ejections and energy release, CMEs have an important impact on the solar-terrestrial environment. Accurate predictions of the arrival times of CMEs at the Earth depend on the precise measurements on their three-dimensional velocities, which can be achieved using simultaneous line-of-sight (LOS) and plane-of-sky (POS) observations. Besides the POS information from routine coronagraph and extreme ultraviolet (EUV) imaging observations, spectroscopic observations could unveil the physical properties of CMEs including their LOS velocities. We propose that spectral line asymmetries measured by Sun-as-a-star spectrographs can be used for routine detections of CMEs and estimations of their LOS velocities during their early propagation phases. Such observations can also provide important clues for the detection of CMEs on other solar-like stars. However, few studies have concentrated on whether we can detect CME signals and accurately diagnose CME properties through Sun-as-a-star spectral observations. In this work, we constructed a geometric CME model and derived the analytical expressions for full-disk integrated EUV line profiles during CMEs. For different CME properties and instrumental configurations, full disk-integrated line profiles were synthesized. We further evaluated the detectability and diagnostic potential of CMEs from the synthetic line profiles. Our investigations provide important constraints on the future design of Sun-as-a-star spectrographs for CME detections through EUV line asymmetries.

preprint2022arXiv

On the Nature of the Three-part Structure of Solar Coronal Mass Ejections

Coronal mass ejections (CMEs) result from eruptions of magnetic flux ropes (MFRs) and can possess a three-part structure in white-light coronagraphs, including a bright front, dark cavity and bright core. In the traditional opinion, the bright front forms due to the plasma pileup along the MFR border, the cavity represents the cross section of the MFR, and the bright core corresponds to the erupted prominence. However, this explanation on the nature of the three-part structure is being challenged. In this paper, we report an intriguing event occurred on 2014 June 14 that was recorded by multiple space- and ground-based instruments seamlessly, clearly showing that the CME front originates from the plasma pileup along the magnetic arcades overlying the MFR, and the core corresponds to a hot-channel MFR. Thus the dark cavity is not an MFR, instead it is a low-density zone between the CME front and a trailing MFR. These observations are consistent with a new explanation on the CME structure. If the new explanation is correct, most (if not all) CMEs should exhibit the three-part appearance in their early eruption stage. To examine this prediction, we make a survey study of all CMEs in 2011 and find that all limb events have the three-part feature in the low corona, regardless of their appearances in the high corona. Our studies suggest that the three-part structure is the intrinsic structure of CMEs, which has fundamental importance for understanding CMEs.

preprint2022arXiv

Sun-as-a-star spectroscopic observations of the line-of-sight velocity of a solar eruption on October 28, 2021

The propagation direction and true velocity of a solar coronal mass ejection, which are among the most decisive factors for its geo-effectiveness, are difficult to determine through single-perspective imaging observations. Here we show that Sun-as-a-star spectroscopic observations, together with imaging observations, could allow us to solve this problem. Using observations of the Extreme-ultraviolet Variability Experiment onboard the Solar Dynamics Observatory, we found clear blue-shifted secondary emission components in extreme ultraviolet spectral lines during a solar eruption on October 28, 2021. From simultaneous imaging observations, we found that the secondary components are caused by a mass ejection from the flare site. We estimated the line-of-sight (LOS) velocity of the ejecta from both the double Gaussian fitting method and the red-blue asymmetry analysis. The results of both methods agree well with each other, giving an average LOS velocity of the plasma of $\sim 423~\rm{km~s^{-1}}$. From the $304$ Å~image series taken by the Extreme Ultraviolet Imager onboard the Solar Terrestrial Relation Observatory-A (STEREO-A) spacecraft, we estimated the plane-of-sky (POS) velocity from the STEREO-A viewpoint {to be around $587~\rm{km~s^{-1}}$}. The full velocity of the bulk motion of the ejecta was then computed by combining the imaging and spectroscopic observations, which turns out to be around $596~\rm{km~s^{-1}}$ with an angle of $42.4^\circ$ to the west of the Sun-Earth line and $16.0^\circ$ south to the ecliptic plane.

preprint2020arXiv

Global maps of the magnetic field in the solar corona

Understanding many physical processes in the solar atmosphere requires determination of the magnetic field in each atmospheric layer. However, direct measurements of the magnetic field in the Sun's corona are difficult to obtain. Using observations with the Coronal Multi-channel Polarimeter, we have determined the spatial distribution of the plasma density in the corona, and the phase speed of the prevailing transverse magnetohydrodynamic waves within the plasma. We combine these measurements to map the plane-of-sky component of the global coronal magnetic field. The derived field strengths in the corona from 1.05 to 1.35 solar radii are mostly 1-4 Gauss. These results demonstrate the capability of imaging spectroscopy in coronal magnetic field diagnostics.