Researcher profile

Nan Jia

Nan Jia contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level

On-policy distillation (OPD) trains a student on its own trajectories with token-level teacher feedback and often outperforms off-policy distillation and standard reinforcement learning. However, we find that its standard advantage weighted policy gradient suffers from three structural weaknesses, including high variance updates, vanishing gradients in zero-advantage regions, and exploration bottlenecks when corrective signals are insufficient. We therefore propose Asymmetric On-Policy Distillation (AOPD), which replaces ineffective negative reinforcement with localized divergence minimization in non-positive advantage regions while preserving positive reinforcement learning. Experiments on mathematical reasoning benchmarks show that AOPD consistently outperforms standard OPD, with average gains of 4.09 / 8.34 under strong / weak initialization, respectively. AOPD also maintains higher policy entropy during training and better capability retention during sequential tool-use adaptation.

preprint2025arXiv

CoHalLo: code hallucination localization via probing hidden layer vector

The localization of code hallucinations aims to identify specific lines of code containing hallucinations, helping developers to improve the reliability of AI-generated code more efficiently. Although recent studies have adopted several methods to detect code hallucination, most of these approaches remain limited to coarse-grained detection and lack specialized techniques for fine-grained hallucination localization. This study introduces a novel method, called CoHalLo, which achieves line-level code hallucination localization by probing the hidden-layer vectors from hallucination detection models. CoHalLo uncovers the key syntactic information driving the model's hallucination judgments and locates the hallucinating code lines accordingly. Specifically, we first fine-tune the hallucination detection model on manually annotated datasets to ensure that it learns features pertinent to code syntactic information. Subsequently, we designed a probe network that projects high-dimensional latent vectors onto a low-dimensional syntactic subspace, generating vector tuples and reconstructing the predicted abstract syntax tree (P-AST). By comparing P-AST with the original abstract syntax tree (O-AST) extracted from the input AI-generated code, we identify the key syntactic structures associated with hallucinations. This information is then used to pinpoint hallucinated code lines. To evaluate CoHalLo's performance, we manually collected a dataset of code hallucinations. The experimental results show that CoHalLo achieves a Top-1 accuracy of 0.4253, Top-3 accuracy of 0.6149, Top-5 accuracy of 0.7356, Top-10 accuracy of 0.8333, IFA of 5.73, Recall@1% Effort of 0.052721, and Effort@20% Recall of 0.155269, which outperforms the baseline methods.

preprint2022arXiv

Detailed analysis on the reflection component for the black hole candidate MAXI J1348-630

The black hole candidate MAXI J1348-630 was discovered on January 26th, 2019, with the Gas Slit Camera (GSC) on-board \textit{MAXI}. We report a detailed spectral analysis of this source by using the archived data of \textit{NuSTAR}. A total of 9 observations covered the complete outburst evolution of MAXI J1348-630 from the hard state to the soft state and finally back to the hard state. Additionally, the intermediate state is found in the transition from the hard state to the soft state. We use the state-of-art reflection model \verb'relxill' family to fit all the 9 spectra, and the spectra from two focal plane module detectors of \textit{NuSTAR} are jointly fitted for each observation. In particular, we concentrate on the results of the black hole spin parameter and the inclination of the accretion disk. Based on the analysis of the inner radius of the accretion disk, we obtain the spin parameter $a_* =0.78_{-0.04}^{+0.04}$, and the inclination angle of the inner disk $i = 29.2_{-0.5}^{+0.3}$ degrees. Furthermore, we also find that when the black hole is in the hard state, the accretion disk would show a significant truncation. The high iron abundance and ionization of the accretion disk obtained in the fitting results can be possibly explained by the high density of the accretion disk.

preprint2022arXiv

The Spin of New Black Hole Candidate: MAXI J1803-298 Observed by NuSTAR and NICER

MAXI J1803-298, a newly-discovered Galactic transient and black hole candidate, was first detected by \emph{MAXI}/GSC on May 1st, 2021. In this paper, we present a detailed spectral analysis of MAXI J1803-298. Utilizing the X-ray reflection fitting method, we perform a joint fit to the spectra of MAXI J1803-298, respectively, observed by \emph{NuSTAR} and \emph{NICER}/XTI on the same day over the energy range between 0.7-79.0 keV, and found its spin (and the inclination angle i) can be constrained to be close to an extreme value, 0.991 ($i\sim$ $70 ^{\circ}$), at 68\% confidence interval. The results suggest that MAXI J1803-298 may be a fast-rotating black hole with a large inclination angle.

preprint2021arXiv

Estimating the spin of the black hole candidate MAXI J1659-152 with the X-ray continuum-fitting method

As a transient X-ray binary, MAXI J1659-152 contains a black hole candidate as its compact star. MAXI J1659-152 was discovered on 2010 September 25 during its only known outburst. Previously-published studies of this outburst indicate that MAXI J1659-152 may have an extreme retrograde spin, which, if confirmed, would provide an important clue as to the origin of black hole spin. In this paper, utilizing updated dynamical binary-system parameters (i.e. the black hole mass, the orbital inclination and the source distance) provided by \cite{Torres2021}, we analyze 65 spectra of MAXI J1659-152 from \emph{RXTE}/PCA, in order to assess the spin parameter. With a final selection of 9 spectra matching our $f_{\mathrm{sc}} \lesssim 25 \%$, soft-state criteria, we apply a relativistic thin disk spectroscopic model \texttt{kerrbb2} over 3.0-45.0 keV. We find that inclination angle correlates inversely with spin, and, considering the possible values for inclination angle, we constrain spin to be $-1 < a_{*} \lesssim 0.44$ at 90\% confidence interval via X-ray continuum-fitting. We can only rule out an extreme prograde (positive) spin. We confirm that an extreme retrograde solution is possible and is not ruled out by considering accretion torques given the young age of the system.

preprint2021arXiv

Spectral Analysis of New Black Hole Candidate AT2019wey Observed by NuSTAR

AT2019wey is a new galactic X-ray binary that was first discovered as an optical transient by the Australia Telescope Large Area Survey (ATLAS) on December 7, 2019. AT2019wey consists of a black hole candidate as well as a low-mass companion star ($M_{\text {star }} \lesssim 0.8 M_{\odot}$) and is likely to have a short orbital period ($P_{\text {orb }} \lesssim 8$ h). Although AT2019wey began activation in the X-ray band during almost the entire outburst on March 8, 2020, it did not enter the soft state during the entire outburst. In this study, we present a detailed spectral analysis of AT2019wey in the low/hard state during its X-ray outburst on the basis of Nuclear Spectroscopic Telescope Array \emph observations. We obtain tight constraints on several of its important physical parameters by applying the State-of-art \texttt{relxill} relativistic reflection model family. In particular, we determine that the measured inner radius of the accretion disk is most likely to have extended to the innermost stable circular orbit (ISCO) radius, i.e., $R_{\text{in}}=1.38^{+0.23}_{-0.16}~R_{\text{ISCO}}$. Hence, assuming $R_{\text{in}}$=$R_{\text{ISCO}}$, we find the spin of AT2019wey to be $a_{*}\sim$ $0.97$, which is close to the extreme and an inner disk inclination angle of ~$i\sim$ $22 ^{\circ}$. Additionally, according to our adopted models, AT2019wey tends to have a relatively high iron abundance of $A_{\mathrm{Fe}}\sim$ 5 $A_{\mathrm{Fe}, \odot}$ and a high disk ionization state of $\log ξ\sim$ 3.4.