Researcher profile

Yuxuan Jiang

Yuxuan Jiang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation

While recent work in Reinforcement Learning with Verifiable Rewards (RLVR) has shown that a small subset of critical tokens disproportionately drives reasoning gains, an analogous token-level understanding of On-Policy Distillation (OPD) remains largely unexplored. In this work, we investigate high-loss tokens, a token type that--as the most direct signal of student-teacher mismatch under OPD's per-token KL objective--should progressively diminish as training converges according to existing studies; however, our empirical analysis shows otherwise. Even after OPD training reaches apparent saturation, a substantial subset of tokens continues to exhibit persistently high loss; these tokens, which we term Rock Tokens, can account for up to 18\% of the tokens in generated outputs. Our investigation reveals two startling paradoxes. First, despite their high occurrence frequency providing a disproportionately large share of total gradient norms, Rock Tokens themselves remain stagnant throughout training, resisting teacher-driven corrections. Second, through causal intervention, we find that these tokens provide negligible functional contribution to the model's actual reasoning performance. These findings suggest that a vast amount of optimization bandwidth is spent on structural and discourse residuals that the student model cannot or need not internalize. By deconstructing these dynamics, we demonstrate that strategically bypassing these ``stumbling blocks'' can significantly streamline the alignment process, challenging the necessity of uniform token weighting and offering a more efficient paradigm for large-scale model distillation.

preprint2026arXiv

In Vivo Quantification of Arterial Active Mechanics Using Deep Learning-Assisted Pressure-Area Analysis

Active arterial mechanics, governed by vascular smooth muscle contraction, are critical to physiological regulation, cardiovascular disease progression, and clinical diagnosis. Although various in vivo methods have been developed to assess arterial stiffness, most cannot distinguish the contribution of smooth muscle tone; therefore, quantitative characterization of arterial activity remains challenging. In this study, we developed a pressure-area analysis framework integrating ultrasound imaging, blood pressure measurement, neural network-based segmentation of arterial cross-sectional area, and biomechanical model-driven inversion to infer active mechanical properties. A total of 233 volunteers (aged 18-65 year) were recruited to acquire cross-sectional ultrasound videos of the right common carotid artery for training the neural network. The segmentation results demonstrate good spatial and temporal performance of the neural network. We further recruited 10 additional volunteers (aged 25 +/- 3 year) to perform a 1-minute step test, followed by pressure-area measurements over a 30-minute recovery period. Using the proposed approach, we quantified post-exercise changes in carotid arterial active mechanics relative to baseline (i.e., the resting state). Results showed that active mechanics remained elevated for approximately 15 minutes compared to baseline (p < 0.05), whereas systolic pressure differed significantly only within the first approximately 5 minutes post-exercise (p < 0.001). These results indicate a dissociation between blood pressure and smooth muscle recovery, which may offer new insight into vascular smooth muscle regulation during physiological stress.

preprint2026arXiv

Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning

Tool-integrated reasoning (TIR) has become a key approach for improving large reasoning models (LRMs) on complex problems. Prior work has mainly studied when to invoke tools, while overlooking how tools are applied. We identify two common patterns: a calculator pattern that uses code for direct computation, and an algorithmic pattern that encodes problems as programs. Misaligned choices often cause failures even when reasoning is sound. We propose a two-stage framework that first builds code competence from both patterns and then aligns pattern selection with teacher preferences. Across challenging math datasets, our pattern-aware method substantially improves both code usage and accuracy, for instance raising Code@1 on MATH500 from 64.0% to 70.5% and on AIME24 from 26.7% to 50.0%. These gains highlight the effectiveness of a pattern-aware approach for tool-integrated reasoning.

preprint2026arXiv

PRB-RUPFormer: A Recursive Unified Probabilistic Transformer for Residual PRB Forecasting

Accurate forecasting of residual Physical Resource Blocks (PRBs) is critical for proactive network slice provisioning, energy-efficient operation, and spectrum-aware decision making in cellular systems, where residual PRBs serve as a practical proxy for short- and medium-term spectrum availability. Existing PRB prediction methods typically rely only on historical PRB values and are trained independently per carrier or sector, limiting their ability to capture cross-carrier dependencies and providing no measure of forecast uncertainty. Moreover, point forecasts alone are insufficient for robust spectrum-aware control under highly variable traffic conditions. This paper proposes PRB-RUPFormer, a recursive unified probabilistic Transformer for residual PRB forecasting. The proposed model jointly processes multivariate KPI time series using temporal, seasonal, and carrier-aware embeddings, preserving inter-metric temporal coupling during recursive rollout and stabilizing long-horizon forecasting. A single shared model is trained across all carriers and sectors of an eNB, enabling efficient learning of joint traffic dynamics with low computational overhead. Forecast uncertainty is captured through quantile-based prediction intervals, providing confidence-aware estimates of future PRB availability. Evaluations on six months of commercial LTE network data from multiple U.S. locations demonstrate median MAE below 0.05 and hit probabilities above 0.80 for both one-day and seven-day recursive forecasts. These probabilistic predictions directly support spectrum-aware RAN functions such as dynamic carrier activation, congestion avoidance, and proactive spectrum sharing, making the proposed framework well-suited for dynamic spectrum access scenarios.

preprint2021arXiv

A new parsimonious method for classifying Cancer Tissue-of-Origin Based on DNA Methylation 450K data

DNA methylation is a well-studied genetic modification that regulates gene transcription of Eukaryotes. Its alternations have been recognized as a significant component of cancer development. In this study, we use the DNA methylation 450k data from The Cancer Genome Atlas to evaluate the efficacy of DNA methylation data on cancer classification for 30 cancer types. We propose a new method for gene selection in high dimensional data(over 450 thousand). Variance filtering is first introduced for dimension reduction and Recursive feature elimination (RFE) is then used for feature selection. We address the problem of selecting a small subsets of genes from large number of methylated sites, and our parsimonious model is demonstrated to be efficient, achieving an accuracy over 91%, outperforming other studies which use DNA micro-arrays and RNA-seq Data . The performance of 20 models, which are based on 4 estimators (Random Forest, Decision Tree, Extra Tree and Support Vector Machine) and 5 classifiers (k-Nearest Neighbours, Support Vector Machine, XGboost, Light GBM and Multi-Layer Perceptron), is compared and robustness of the RFE algorithm is examined. Results suggest that the combined model of extra tree plus catboost classifier offers the best performance in cancer identification, with an overall validation accuracy of 91% , 92.3%, 93.3% and 93.5% for 20, 30, 40 and 50 features respectively. The biological functions in cancer development of 50 selected genes is also explored through enrichment analysis and the results show that 12 out of 16 of our top features have already been identified to be specific with cancer and we also propose some more genes to be tested for future studies. Therefore, our method may be utilzed as an auxiliary diagnostic method to determine the actual clinicopathological status of a specific cancer.

preprint2021arXiv

Magneto-transport evidence for strong topological insulator phase in ZrTe5

The identification of a non-trivial band topology usually relies on directly probing the protected surface/edge states. But, it is difficult to achieve electronically in narrow-gap topological materials due to the small (meV) energy scales. Here, we demonstrate that band inversion, a crucial ingredient of the non-trivial band topology, can serve as an alternative, experimentally accessible indicator. We show that an inverted band can lead to a four-fold splitting of the non-zero Landau levels, contrasting the two-fold splitting (spin splitting only) in the normal band. We confirm our predictions in magneto-transport experiments on a narrow-gap strong topological insulator, zirconium pentatelluride (ZrTe$_5$), with the observation of additional splittings in the quantum oscillations and also an anomalous peak in the extreme quantum limit. Our work establishes an effective strategy for identifying the band inversion as well as the associated topological phases for future topological materials research.

preprint2021arXiv

Mechanically Modulated Sideband and Squeezing Effects of Membrane Resonators

We investigate the sideband spectra of a driven nonlinear mode with its eigenfrequency being modulated at a low frequency (< 1 kHz). This additional parametric modulation leads to prominent antiresonance lineshapes in the sideband spectra, which can be controlled through the vibration state of the driven mode. We also establish a direct connection between the antiresonance frequency and the squeezing of thermal fluctuation in the system. Our work not only provides a simple and robust method for squeezing characterization but also opens a new possibility toward sideband applications.

preprint2020arXiv

Electron-Hole Asymmetry of Surface States in Topological Insulator Sb2Te3 Thin Films Revealed by Magneto-Infrared Spectroscopy

When surface states (SSs) form in topological insulators (TIs), they inherit the properties of bulk bands, including the electron-hole (e-h) asymmetry but with much more profound impacts. Here, via combining magneto-infrared spectroscopy with theoretical analysis, we show that e-h asymmetry significantly modifies the SS electronic structures when interplaying with the quantum confinement effect. Compared to the case without e-h asymmetry, the SSs now bear not only a band asymmetry as that in the bulk but also a shift of the Dirac point relative to the bulk bands and a reduction of the hybridization gap up to 70%. Our results signify the importance of e-h asymmetry in band engineering of TIs in the thin film limit.

preprint2019arXiv

Dirac energy spectrum and inverted band gap in metamorphic InAsSb/InSb superlattices

A Dirac-type energy spectrum was demonstrated in gapless ultra-short-period metamorphic InAsSb/InSb superlattices by angle-resolved photoemission spectroscopy (ARPES_ measurements. The Fermi velocity value 7.4x10^5 m/s in a gapless superlattice with a period of 6.2nm is in a good agreement with the results of magneto-absorption experiments. An &#34;inverted&#34; bandgap opens in the center of the Brillouin zone at higher temperatures and in the SL with a larger period. The ARPES data indicate the presence of a surface electron accumulation layer