Researcher profile

Jiawen Zhang

Jiawen Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Confidence-Aware Alignment Makes Reasoning LLMs More Reliable

Large reasoning models often reach correct answers through flawed intermediate steps, creating a gap between final accuracy and reasoning reliability. Existing alignment strategies address this with external verifiers or massive sampling, limiting scalability. In this work, we introduce CASPO (Confidence-Aware Step-wise Preference Optimization), a framework that aligns token-level confidence with step-wise logical correctness through iterative Direct Preference Optimization, without training a separate reward model. During inference, we propose Confidence-aware Thought (CaT), which leverages this calibrated confidence to dynamically prune uncertain reasoning branches with negligible O(V) latency. Experiments across ten benchmarks and multiple model families show that CASPO consistently improves reasoning reliability and inference efficiency. CASPO scales to Qwen3-8B-Base and surpasses tree-search baselines on AIME'24 and AIME'25 without using reward-model data. We also release a step-wise dataset with confidence annotations to support fine-grained analysis of reasoning reliability. Code is available at https://github.com/Thecommonirin/CASPO.

preprint2026arXiv

Evaluating Cognitive Age Alignment in Interactive AI Agents

While agentic AI and its core multimodal large language models (MLLMs) have demonstrated remarkable promise in language and visual reasoning across domains ranging from daily life to advanced scientific research, a profound gap remains between artificial and human intelligence. Despite the integration of powerful tools and advanced MLLMs, state-of-the-art AI agents frequently fail at foundational, seemingly simple tasks that a child can resolve with ease. Inspired by the Wechsler Intelligence Scale for Children (WISC), we introduce ChildAgentEval, the first psychometrically grounded interactive benchmark for evaluating cognitive age alignment in MLLM-based agents. ChildAgentEval systematically compares the reasoning performance of various MLLM-based interactive agents against age-specific human developmental stages, exposing where current agentic AI systems can and cannot simulate age-specific cognitive behavior.

preprint2026arXiv

Mitigating Many-shot Jailbreak Attacks with One Single Demonstration

Many-shot jailbreaking (MSJ) causes safety-aligned language models to answer harmful queries by preceding them with many harmful question-answer demonstrations. We study why this attack becomes stronger as the number of demonstrations increases. Empirically, we find that MSJ induces a progressive activation drift: the representation of a fixed harmful query moves step by step away from the safety-aligned region as more harmful demonstrations are added. Theoretically, we show that this drift can be interpreted as implicit malicious fine-tuning: conditioning on N harmful demonstrations induces SGD-style updates equivalent to optimizing on the corresponding N harmful samples. This view turns the attack mechanism into a defense principle. We append a fixed one-shot safety demonstration at inference time, which induces a counteracting safety-oriented update and restores refusal behavior. The resulting method improves the model's robustness to MSJ without modifying its parameters or requiring white-box access at deployment. Code is available at https://github.com/Thecommonirin/SafeEnd.

preprint2026arXiv

Multiple nodal superconducting phases and order-parameter evolution in pressurized UTe$_2$

Spin-triplet superconductivity (SC) offers a unique avenue for realizing non-Abelian Majorana zero modes and thus the fault-tolerant topological quantum computation, and has attracted a broad audience for both fundamental research and potential applications. The recently discovered heavy-fermion spin-triplet superconductor candidate UTe$_2$ has sparked great interest for its ultrahigh upper critical field and reentrant SC phases in the proximity to a field-polarized magnetic state. Despite extensive studies on the phase diagrams and competing orders induced by pressure and magnetic field, limited has been known about its SC order parameters and their evolution with these control parameters, largely due to the lack of appropriate symmetry-sensitive detections. Here, we report comprehensive point-contact spectroscopy measurements of pressurized UTe$_2$ on the (0~0~1) surface. The observation of Andreev bound state strongly suggests the presence of a $p_z$ component in the SC order parameters. Quantitative analysis based on an extended Blonder-Tinkham-Klapwijk model unveils $B_{2u}$ or $B_{3u}$ as the most likely representation for both ambient and pressurized UTe$_2$, and remarkably, the multiple SC phases can be distinguished by a single parameter $\langle Δ_{z}\rangle/\langleΔ_{x(y)}\rangle$, the relative weight between the $p_z$-wave and $p_{x(y)}$-wave pairings. These findings not only impose stringent constraints on the superconducting order parameter in UTe$_2$, but also provide key spectroscopic evidence for the existence of multiple SC phases tuned through pressure.

preprint2026arXiv

Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance

Fine-tuning safety-aligned large language models (LLMs) can substantially compromise their safety. Previous approaches require many safety samples or calibration sets, which not only incur significant computational overhead during realignment but also lead to noticeable degradation in model utility. Contrary to this belief, we show that safety alignment can be fully recovered with only a single safety example, without sacrificing utility and at minimal cost. Remarkably, this recovery is effective regardless of the number of harmful examples used in fine-tuning or the size of the underlying model, and convergence is achieved within just a few epochs. Furthermore, we uncover the low-rank structure of the safety gradient, which explains why such efficient correction is possible. We validate our findings across five safety-aligned LLMs and multiple datasets, demonstrating the generality of our approach.

preprint2026arXiv

Understanding and Preserving Safety in Fine-Tuned LLMs

Fine-tuning is an essential and pervasive functionality for applying large language models (LLMs) to downstream tasks. However, it has the potential to substantially degrade safety alignment, e.g., by greatly increasing susceptibility to jailbreak attacks, even when the fine-tuning data is entirely harmless. Despite garnering growing attention in defense efforts during the fine-tuning stage, existing methods struggle with a persistent safety-utility dilemma: emphasizing safety compromises task performance, whereas prioritizing utility typically requires deep fine-tuning that inevitably leads to steep safety declination. In this work, we address this dilemma by shedding new light on the geometric interaction between safety- and utility-oriented gradients in safety-aligned LLMs. Through systematic empirical analysis, we uncover three key insights: (I) safety gradients lie in a low-rank subspace, while utility gradients span a broader high-dimensional space; (II) these subspaces are often negatively correlated, causing directional conflicts during fine-tuning; and (III) the dominant safety direction can be efficiently estimated from a single sample. Building upon these novel insights, we propose safety-preserving fine-tuning (SPF), a lightweight approach that explicitly removes gradient components conflicting with the low-rank safety subspace. Theoretically, we show that SPF guarantees utility convergence while bounding safety drift. Empirically, SPF consistently maintains downstream task performance and recovers nearly all pre-trained safety alignment, even under adversarial fine-tuning scenarios. Furthermore, SPF exhibits robust resistance to both deep fine-tuning and dynamic jailbreak attacks. Together, our findings provide new mechanistic understanding and practical guidance toward always-aligned LLM fine-tuning.

preprint2025arXiv

Critical fluctuations and conserved dynamics in a strange ferromagnetic metal

The origin of the strange metallic behavior observed in a wide range of quantum materials is an open challenge to condensed matter physics. Historically, strange metals were uniquely associated with antiferromagnetic quantum critical points (QCPs), but a new generation of materials reveals their association with uniform order parameters, such as ferromagnetism, valley or nematic order, suggesting a deeper common denominator. At a QCP, order parameter fluctuations are characterized by the dynamical critical exponent $z$, which quantifies the space-time scaling asymmetry. Here, we report the observation of a divergence in the Grüneisen ratio at the QCP of the strange-metal ferromagnet CeRh$_6$Ge$_4$ with a dynamical critical exponent $z=3$, signaling that the underlying quantum singularity involves a conserved degree of freedom. Yet the magnetization of this easy-plane ferromagnet is not conserved. We argue that the $z=3$ strange criticality requires a description beyond the Landau paradigm, proposing a link with the gauge modes of the small-to-large Fermi surface transition and the associated gauge charge of the delocalizing heavy electrons.

preprint2024arXiv

Rigidity for geometric ideals in uniform Roe algebras

In this paper, we investigate the rigidity problems for geometric ideals in uniform Roe algebras associated to discrete metric spaces of bounded geometry. These ideals were introduced by Chen and Wang, and can be fully characterised in terms of ideals in the associated coarse structures. Our main result is that if two geometric ideals in uniform Roe algebras are stably isomorphic, then the coarse spaces associated to these ideals are coarsely equivalent. We also discuss the case of ghostly ideals and pose some open questions.

preprint2023arXiv

Well-Posedness for 2D Combustion Model in Bounded Domains and Serrin-Type Blowup Criterion

We investigate the 2D combustion model with Dirichlet boundary conditions and slip boundary conditions in bounded domains. The global existence of weak and strong solutions and the uniqueness of strong solutions are obtained provided the initial density is small in some precise sense. Using the energy method and the estimates of boundary integrals, we obtain the a priori bounds if the density and velocity field. In addition, we prove the local existence of the strong solutions via iterative method and the contraction mapping theorem. Finally, we extend the well known Serrion's blowup criterion to the 2D combustion model. Under the suitable boundary conditions, the Serrin's condition on the velocity can be removed in this criteria.

preprint2022arXiv

Can Open Domain Question Answering Systems Answer Visual Knowledge Questions?

The task of Outside Knowledge Visual Question Answering (OKVQA) requires an automatic system to answer natural language questions about pictures and images using external knowledge. We observe that many visual questions, which contain deictic referential phrases referring to entities in the image, can be rewritten as "non-grounded" questions and can be answered by existing text-based question answering systems. This allows for the reuse of existing text-based Open Domain Question Answering (QA) Systems for visual question answering. In this work, we propose a potentially data-efficient approach that reuses existing systems for (a) image analysis, (b) question rewriting, and (c) text-based question answering to answer such visual questions. Given an image and a question pertaining to that image (a visual question), we first extract the entities present in the image using pre-trained object and scene classifiers. Using these detected entities, the visual questions can be rewritten so as to be answerable by open domain QA systems. We explore two rewriting strategies: (1) an unsupervised method using BERT for masking and rewriting, and (2) a weakly supervised approach that combines adaptive rewriting and reinforcement learning techniques to use the implicit feedback from the QA system. We test our strategies on the publicly available OKVQA dataset and obtain a competitive performance with state-of-the-art models while using only 10% of the training data.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Nodeless superconductivity in the topological nodal-line semimetal CaSb2

CaSb2 is a topological nodal-line semimetal that becomes superconducting below 1.6 K, providing an ideal platform to investigate the interplay between topologically nontrivial electronic bands and superconductivity. In this work, we investigated the superconducting order parameter of CaSb2 by measuring its magnetic penetration depth change Δλ(T) down to 0.07 K, using a tunneling diode oscillator (TDO) based technique. Well inside the superconducting state, Δλ(T) shows an exponential activated behavior, and provides direct evidence for a nodeless superconducting gap. By analyzing the temperature dependence of the superfluid density and the electronic specific heat, we find both can be consistently described by a two-gap s-wave model, in line with the presence of multiple Fermi surfaces associated with distinct Sb sites in this compound. These results demonstrate fully-gapped superconductivity in CaSb2 and constrain the allowed pairing symmetry.

preprint2022arXiv

The equivariant coarse Baum-Connes conjecture for actions by a-T-menable groups

The equivariant coarse Baum-Connes conjecture was firstly introduced by Roe [29] as a unified way to approach both the Baum-Connes conjecture and its coarse counterpart. In this paper, we prove that if an a-T-menable group $Γ$ acts properly and isometrically on a bounded geometry metric space $X$ with controlled distortion such that the quotient space $X/Γ$ is coarsely embeddable, then the equivariant coarse Baum-Connes conjecture holds for this action. This answers a question posed in [8] affirmatively.

preprint2021arXiv

Asymptotic expansion in measure and strong ergodicity

In this paper, we introduce and study a notion of asymptotic expansion in measure for measurable actions. This generalises expansion in measure and provides a new perspective on the classical notion of strong ergodicity. Moreover, we obtain structure theorems for asymptotically expanding actions, showing that they admit exhaustions by domains of expansion. As an application, we recover a recent result of Marrakchi, characterising strong ergodicity in terms of local spectral gaps. We also show that homogeneous strongly ergodic actions are always expanding in measure and establish a connection between asymptotic expansion in measure and asymptotic expanders by means of approximating spaces.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2020arXiv

Coarse median algebras: The intrinsic geometry of coarse median spaces and their intervals

This paper establishes a new combinatorial framework for the study of coarse median spaces, bridging the worlds of asymptotic geometry, algebra and combinatorics. We introduce a simple and entirely algebraic notion of coarse median algebra which simultaneously generalises the concepts of bounded geometry coarse median spaces and classical discrete median algebras. We study the coarse median universe from the perspective of intervals, with a particular focus on cardinality as a proxy for distance. In particular we prove that the metric on a quasi-geodesic coarse median space of bounded geometry can be constructed up to quasi-isometry using only the coarse median operator. Finally we develop a concept of rank for coarse median algebras in terms of the geometry of intervals and show that the notion of finite rank coarse median algebra provides a natural higher dimensional analogue of Gromov&#39;s concept of $δ$-hyperbolicity.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Quasi-local Algebras and Asymptotic Expanders

In this paper, we study the relation between the uniform Roe algebra and the uniform quasi-local algebra associated to a metric space of bounded geometry. In the process, we introduce a weakening of the notion of expanders, called asymptotic expanders. We show that being a sequence of asymptotic expanders is a coarse property under certain connectedness condition, and it implies non-uniformly local amenability. Moreover, we also analyse some $C^*$-algebraic properties of uniform quasi-local algebras. In particular, we show that a uniform quasi-local algebra is nuclear if and only if the underlying metric space has Property A.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.