Researcher profile

Xiao Kong

Xiao Kong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness hallucinations, statements that contradict source records, posing direct risks to patient safety. To address this, we present CuraView, a multi-agent framework for sentence-level detection and evidence-grounded explanation of faithfulness hallucinations in discharge summaries. CuraView constructs a GraphRAG-based knowledge graph from patient-level EHRs and implements a closed-loop generation-detection pipeline with sentence-level evidence retrieval and classification spanning four evidence grades from strong support to direct contradiction (E1-E4), yielding structured and interpretable evidence chains. We evaluate CuraView on a subset of 250 patients from the Discharge-Me benchmark, with 50 patients held out for testing. Our fine-tuned Qwen3-14B detection model achieves an F1 of 0.831 on the safety-critical E4 metric (90.9% recall, 76.5% precision) and an F1 of 0.823 on E3+E4, representing a 50.0% relative improvement over the base model and outperforming RAGTruth-style and QAGS-style baselines. These results demonstrate that evidence-chain-based graph retrieval verification substantially improves the factual reliability of clinical documentation, while simultaneously producing reusable annotated datasets for downstream model training and distillation.

preprint2026arXiv

StellarF: A Physics-Informed LoRA Framework for Stellar Flare Forecasting with Historical & Statistical Data

Stellar flare forecasting represents a critical frontier in astrophysics, offering profound insights into stellar activity mechanisms and exoplanetary habitability assessments. Yet the inherent unpredictability of flare activity, rooted in stellar diversity and evolutionary stages, underpins the field's core challenges: (1) sparse, incomplete, noisy lightcurve data from traditional observations; (2) ineffective multi-scale flare evolution capture via single representations; (3) poor physical interpretability in data-driven models lacking physics-informed priors. To address these challenges, we propose StellarF, a physics-informed framework synergizing general Al with astrophysical domain knowledge via three core components: a unified preprocessing pipeline for lightcurve refinement (missing-value imputation, temporal patch partitioning, adaptive sample filtering); a Low-Rank Adaptation (LoRA)-finetuned large language model (LLM) backbone enhanced by first-order difference augmentation, flare statistical information, and flare historical record modules for multimodal fusion instead of only simple representations; and a novel physics-informed loss embedding a minimum rising rate prior, appended to the cross-entropy loss, to align with flare physics. Extensive experiments on Kepler and TESS datasets show StellarF achieves state-of-the-art performance across key metrics, setting new benchmarks for flare forecasting. This work bridges general AI with astrophysics, offering a practical, physically interpretable paradigm for transient event forecasting in time-domain astronomy.

preprint2025arXiv

Non-Euclidean interfaces decode the continuous landscape of graphene-induced surface reconstructions

Interfacial reconstruction between two-dimensional (2D) materials and metal substrates fundamentally governs heterostructure properties, yet conventional flat substrates fail to capture the continuous crystallographic landscape. Here, we overcome this topological limitation using non-Euclidean interfaces-curved 2D graphene-copper surfaces as a model system-to traverse the infinite spectrum of lattice orientations. By integrating multimodal microscopy with a deep-learning-enhanced dimensional upscaling framework, we translate 2D scanning electron microscopy (SEM) contrast into quantitative three-dimensional (3D) morphologies with accurate facet identification. Coupling these observations with machine-learning-assisted density functional theory, we demonstrate that reconstruction is governed by a unified thermodynamic mechanism where high-index facets correspond to specific local minima in the surface energy landscape. This work resolves the long-standing complexity of graphene-copper faceting and establishes non-Euclidean surface topologies as a generalizable paradigm for decoding and controlling interfacial reconstruction in diverse metal-2D material systems.

preprint2025arXiv

Scalable Stellar Parameter Inference Using Python-based LASP: From CPU Optimization to GPU Acceleration

To enhance the efficiency, scalability, and cross-survey applicability of stellar parameter inference in large spectroscopic datasets, we present a modular, parallelized Python framework with automated error estimation, built on the LAMOST Atmospheric Parameter Pipeline (LASP) originally implemented in IDL. Rather than a direct code translation, this framework refactors LASP with two complementary modules: LASP-CurveFit, a new implementation of the LASP fitting procedure that runs on a CPU, preserving legacy logic while improving data I/O and multithreaded execution efficiency; and LASP-Adam-GPU, a GPU-accelerated method that introduces grouped optimization by constructing a joint residual function over multiple observed and model spectra, enabling high-throughput parameter inference across tens of millions of spectra. Applied to 10 million LAMOST spectra, the framework reduces runtime from 84 to 48 hr on the same CPU platform and to 7 hr on an NVIDIA A100 GPU, while producing results consistent with those from the original pipeline. The inferred errors agree well with the parameter variations from repeat observations of the same target (excluding radial velocities), while the official empirical errors used in LASP are more conservative. When applied to DESI DR1, our effective temperatures and surface gravities agree better with APOGEE than those from the DESI pipeline, particularly for cool giants, while the latter performs slightly better in radial velocity and metallicity. These results suggest that the framework delivers reliable accuracy, efficiency, and transferability, offering a practical approach to parameter inference in large spectroscopic surveys. The code and DESI-based catalog are available via \dataset[DOI: 10.12149/101679]{https://doi.org/10.12149/101679} and \dataset[DOI: 10.12149/101675]{https://doi.org/10.12149/101675}, respectively.

preprint2024arXiv

Projected rotational velocities for LAMOST stars with effective temperature lower than 9000 K

In Data Release 9 of LAMOST, we present measurements of v sin i for a total of 121,698 stars measured using the Medium Resolution Spectrograph (MRS) and 80,108 stars using the Low Resolution Spectrograph (LRS). These values were obtained through a chi^2 minimisation process, comparing LAMOST spectra with corresponding grids of synthetically broadened spectra. Due to the resolution and the spectral range of LAMOST, v sin i measurements are limited to stars with effective temperature (Teff) ranging from 5000 K to 8500 K for MRS and 7000 K to 9000 K for LRS. The detectable v sin i for MRS is set between 27 km/s and 350 km/s , and for LRS between 110 km/s and 350 km/s, This limitation is because the convolved reference spectra become less informative beyond 350 km/s. The intrinsic precisions of v sin i , determined from multi-epoch observations, is approximately 4.0 km/s for MRS and 10.0 km/s for LRS at signal-to-noise ratio (S/N) greater than 50. Our v sin i values show consistence with those from APOGEE17, displaying a scatter of 8.79 km/s. They are also in agreement with measurements from the Gaia DR3 and SUN catalogs. An observed trend in LAMOST MRS data is the decrease in v sin i with dropping Teff, particularly transiting around 7000 K for dwarfs and 6500 K for giants, primarily observed in stars with near-solar abundances.

preprint2022arXiv

Estimating Atmospheric Parameters from LAMOST Low-Resolution Spectra with Low SNR

Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) acquired tens of millions of low-resolution stellar spectra. The large amount of the spectra result in the urgency to explore automatic atmospheric parameter estimation methods. There are lots of LAMOST spectra with low signal-to-noise ratios (SNR), which result in a sharp degradation on the accuracy of their estimations. Therefore, it is necessary to explore better estimation methods for low-SNR spectra. This paper proposed a neural network-based scheme to deliver atmospheric parameters, LASSO-MLPNet. Firstly, we adopt a polynomial fitting method to obtain pseudo-continuum and remove it. Then, some parameter-sensitive features in the existence of high noises were detected using Least Absolute Shrinkage and Selection Operator (LASSO). Finally, LASSO-MLPNet used a Multilayer Perceptron network (MLPNet) to estimate atmospheric parameters $T_{\mathrm{eff}}$, log $g$ and [Fe/H]. The effectiveness of the LASSO-MLPNet was evaluated on some LAMOST stellar spectra of the common star between APOGEE (The Apache Point Observatory Galactic Evolution Experiment) and LAMOST. it is shown that the estimation accuracy is significantly improved on the stellar spectra with $10<\mathrm{SNR}\leq80$. Especially, LASSO-MLPNet reduces the mean absolute error (MAE) of the estimation of $T_{\mathrm{eff}}$, log $g$ and [Fe/H] from (144.59 K, 0.236 dex, 0.108 dex) (LASP) to (90.29 K, 0.152 dex, 0.064 dex) (LASSO-MLPNet) on the stellar spectra with $10<\mathrm{SNR}\leq20$. To facilitate reference, we release the estimates of the LASSO-MLPNet from more than 4.82 million stellar spectra with $10<\mathrm{SNR}\leq80$ and 3500 < SNR$g$ $\leq$ 6500 as a value-added output.

preprint2022arXiv

Strong [OIII]λ5007 emission line compact galaxies in LAMOST DR9: Blueberries, Green Peas and Purple Grapes

Green Pea and Blueberry galaxies are well-known for their compact size, low mass, strong emission lines and analogs to high-z Lyα emitting galaxies. In this study, 1547 strong [OIII]λ5007 emission line compact galaxies with 1694 spectra are selected from LAMOST DR9 at the redshift range from 0.0 to 0.59. According to the redshift distribution, these samples can be separated into three groups: Blueberries, Green Peas and Purple Grapes. Optical [MgII]λ2800 line feature, BPT diagram, multi-wavelength SED fitting, MIR color, and MIR variability are deployed to identify 23 AGN candidates from these samples, which are excluded for the following SFR discussions. We perform the multi-wavelength SED fitting with GALEX UV and WISE MIR data. Color excess from Balmer decrement shows these strong [OIII]λ5007 emission line compact galaxies are not highly reddened. The stellar mass of the galaxies is obtained by fitting LAMOST calibrated spectra with the emission lines masked. We find that the SFR is increasing with the increase of redshift, while for the sources within the same redshift bin, the SFR increases with mass with a similar slope as the SFMS. These samples have a median metallicity of 12+log(O/H) of 8.10. The metallicity increases with mass, and all the sources are below the mass-metallicity relation. The direct-derived Te-based metallicity from the [OIII]λ4363 line agrees with the empirical N2-based empirical gas-phase metallicity. Moreover, these compact strong [OIII]λ5007 are mostly in a less dense environment.

preprint2022arXiv

Ultracool dwarfs identified using spectra in LAMOST DR7

In this work, we identify 734 ultracool dwarfs with a spectral type of M6 or later, including one L0. Of this sample, 625 were studied spectroscopically for the first time. All of these ultracool dwarfs are within 360~pc, with a \textit{Gaia} G magnitude brighter than ~19.2 mag. By studying the spectra and checking their stellar parameters (Teff, logg, and [FeH] derived with the LAMOST pipeline, we found their cool red nature and their metallicity to be consistent with the nature of Galactic thin-disk objects. Furthermore, 77 of them show lithium absorption lines at 6708A, further indicating their young ages and substellar nature. Kinematics obtained through LAMOST radial velocities, along with the proper motion and parallax data from Gaia EDR3, also suggest that the majority of our targets are thin-disk objects. Kinematic ages were estimated through the relationship between the velocity dispersion and the average age for a certain population. Moreover, we identified 35 binaries, with 6 of them reported as binaries for the first time.

preprint2018arXiv

Emergent Quantum Dynamics of Vortex Line under Linear Local Induction Approximation

Using the linear local induction approximation, we investigated the self-induced motion of a vortex line that corresponds to the motion of a particle in quantum mechanics. Concerning Kelvin waves, the effective Schrödinger equation, physical quantities operators, and the corresponding path-integral formula are obtained. The vortex line-particle mapping may help in understanding particle motion in quantum mechanics.