Source author record

Xiao Kong

Xiao Kong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.GA astro-ph.SR astro-ph.IM Artificial Intelligence Computation and Language cond-mat.mes-hall Machine Learning quant-ph

Catalog footprint

What is connected

10works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

Discharge summaries require extracting critical information from lengthy electronic health records (EHRs), a process that is labor-intensive when performed manually. Large language models (LLMs) can improve generation efficiency; however, they are prone to producing faithfulness hallucinations, statements that contradict source records, posing direct risks to patient safety. To address this, we present CuraView, a multi-agent framework for sentence-level detection and evidence-grounded explanation of faithfulness hallucinations in discharge summaries. CuraView constructs a GraphRAG-based knowledge graph from patient-level EHRs and implements a closed-loop generation-detection pipeline with sentence-level evidence retrieval and classification spanning four evidence grades from strong support to direct contradiction (E1-E4), yielding structured and interpretable evidence chains. We evaluate CuraView on a subset of 250 patients from the Discharge-Me benchmark, with 50 patients held out for testing. Our fine-tuned Qwen3-14B detection model achieves an F1 of 0.831 on the safety-critical E4 metric (90.9% recall, 76.5% precision) and an F1 of 0.823 on E3+E4, representing a 50.0% relative improvement over the base model and outperforming RAGTruth-style and QAGS-style baselines. These results demonstrate that evidence-chain-based graph retrieval verification substantially improves the factual reliability of clinical documentation, while simultaneously producing reusable annotated datasets for downstream model training and distillation.

preprint2026arXiv

StellarF: A Physics-Informed LoRA Framework for Stellar Flare Forecasting with Historical & Statistical Data

Stellar flare forecasting represents a critical frontier in astrophysics, offering profound insights into stellar activity mechanisms and exoplanetary habitability assessments. Yet the inherent unpredictability of flare activity, rooted in stellar diversity and evolutionary stages, underpins the field's core challenges: (1) sparse, incomplete, noisy lightcurve data from traditional observations; (2) ineffective multi-scale flare evolution capture via single representations; (3) poor physical interpretability in data-driven models lacking physics-informed priors. To address these challenges, we propose StellarF, a physics-informed framework synergizing general Al with astrophysical domain knowledge via three core components: a unified preprocessing pipeline for lightcurve refinement (missing-value imputation, temporal patch partitioning, adaptive sample filtering); a Low-Rank Adaptation (LoRA)-finetuned large language model (LLM) backbone enhanced by first-order difference augmentation, flare statistical information, and flare historical record modules for multimodal fusion instead of only simple representations; and a novel physics-informed loss embedding a minimum rising rate prior, appended to the cross-entropy loss, to align with flare physics. Extensive experiments on Kepler and TESS datasets show StellarF achieves state-of-the-art performance across key metrics, setting new benchmarks for flare forecasting. This work bridges general AI with astrophysics, offering a practical, physically interpretable paradigm for transient event forecasting in time-domain astronomy.

preprint2025arXiv

Non-Euclidean interfaces decode the continuous landscape of graphene-induced surface reconstructions

Interfacial reconstruction between two-dimensional (2D) materials and metal substrates fundamentally governs heterostructure properties, yet conventional flat substrates fail to capture the continuous crystallographic landscape. Here, we overcome this topological limitation using non-Euclidean interfaces-curved 2D graphene-copper surfaces as a model system-to traverse the infinite spectrum of lattice orientations. By integrating multimodal microscopy with a deep-learning-enhanced dimensional upscaling framework, we translate 2D scanning electron microscopy (SEM) contrast into quantitative three-dimensional (3D) morphologies with accurate facet identification. Coupling these observations with machine-learning-assisted density functional theory, we demonstrate that reconstruction is governed by a unified thermodynamic mechanism where high-index facets correspond to specific local minima in the surface energy landscape. This work resolves the long-standing complexity of graphene-copper faceting and establishes non-Euclidean surface topologies as a generalizable paradigm for decoding and controlling interfacial reconstruction in diverse metal-2D material systems.

preprint2025arXiv

Scalable Stellar Parameter Inference Using Python-based LASP: From CPU Optimization to GPU Acceleration

To enhance the efficiency, scalability, and cross-survey applicability of stellar parameter inference in large spectroscopic datasets, we present a modular, parallelized Python framework with automated error estimation, built on the LAMOST Atmospheric Parameter Pipeline (LASP) originally implemented in IDL. Rather than a direct code translation, this framework refactors LASP with two complementary modules: LASP-CurveFit, a new implementation of the LASP fitting procedure that runs on a CPU, preserving legacy logic while improving data I/O and multithreaded execution efficiency; and LASP-Adam-GPU, a GPU-accelerated method that introduces grouped optimization by constructing a joint residual function over multiple observed and model spectra, enabling high-throughput parameter inference across tens of millions of spectra. Applied to 10 million LAMOST spectra, the framework reduces runtime from 84 to 48 hr on the same CPU platform and to 7 hr on an NVIDIA A100 GPU, while producing results consistent with those from the original pipeline. The inferred errors agree well with the parameter variations from repeat observations of the same target (excluding radial velocities), while the official empirical errors used in LASP are more conservative. When applied to DESI DR1, our effective temperatures and surface gravities agree better with APOGEE than those from the DESI pipeline, particularly for cool giants, while the latter performs slightly better in radial velocity and metallicity. These results suggest that the framework delivers reliable accuracy, efficiency, and transferability, offering a practical approach to parameter inference in large spectroscopic surveys. The code and DESI-based catalog are available via \dataset[DOI: 10.12149/101679]{https://doi.org/10.12149/101679} and \dataset[DOI: 10.12149/101675]{https://doi.org/10.12149/101675}, respectively.

preprint2024arXiv

Projected rotational velocities for LAMOST stars with effective temperature lower than 9000 K

In Data Release 9 of LAMOST, we present measurements of v sin i for a total of 121,698 stars measured using the Medium Resolution Spectrograph (MRS) and 80,108 stars using the Low Resolution Spectrograph (LRS). These values were obtained through a chi^2 minimisation process, comparing LAMOST spectra with corresponding grids of synthetically broadened spectra. Due to the resolution and the spectral range of LAMOST, v sin i measurements are limited to stars with effective temperature (Teff) ranging from 5000 K to 8500 K for MRS and 7000 K to 9000 K for LRS. The detectable v sin i for MRS is set between 27 km/s and 350 km/s , and for LRS between 110 km/s and 350 km/s, This limitation is because the convolved reference spectra become less informative beyond 350 km/s. The intrinsic precisions of v sin i , determined from multi-epoch observations, is approximately 4.0 km/s for MRS and 10.0 km/s for LRS at signal-to-noise ratio (S/N) greater than 50. Our v sin i values show consistence with those from APOGEE17, displaying a scatter of 8.79 km/s. They are also in agreement with measurements from the Gaia DR3 and SUN catalogs. An observed trend in LAMOST MRS data is the decrease in v sin i with dropping Teff, particularly transiting around 7000 K for dwarfs and 6500 K for giants, primarily observed in stars with near-solar abundances.

preprint2022arXiv

Estimating Atmospheric Parameters from LAMOST Low-Resolution Spectra with Low SNR

Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) acquired tens of millions of low-resolution stellar spectra. The large amount of the spectra result in the urgency to explore automatic atmospheric parameter estimation methods. There are lots of LAMOST spectra with low signal-to-noise ratios (SNR), which result in a sharp degradation on the accuracy of their estimations. Therefore, it is necessary to explore better estimation methods for low-SNR spectra. This paper proposed a neural network-based scheme to deliver atmospheric parameters, LASSO-MLPNet. Firstly, we adopt a polynomial fitting method to obtain pseudo-continuum and remove it. Then, some parameter-sensitive features in the existence of high noises were detected using Least Absolute Shrinkage and Selection Operator (LASSO). Finally, LASSO-MLPNet used a Multilayer Perceptron network (MLPNet) to estimate atmospheric parameters $T_{\mathrm{eff}}$, log $g$ and [Fe/H]. The effectiveness of the LASSO-MLPNet was evaluated on some LAMOST stellar spectra of the common star between APOGEE (The Apache Point Observatory Galactic Evolution Experiment) and LAMOST. it is shown that the estimation accuracy is significantly improved on the stellar spectra with $10<\mathrm{SNR}\leq80$. Especially, LASSO-MLPNet reduces the mean absolute error (MAE) of the estimation of $T_{\mathrm{eff}}$, log $g$ and [Fe/H] from (144.59 K, 0.236 dex, 0.108 dex) (LASP) to (90.29 K, 0.152 dex, 0.064 dex) (LASSO-MLPNet) on the stellar spectra with $10<\mathrm{SNR}\leq20$. To facilitate reference, we release the estimates of the LASSO-MLPNet from more than 4.82 million stellar spectra with $10<\mathrm{SNR}\leq80$ and 3500 < SNR$g$ $\leq$ 6500 as a value-added output.

preprint2022arXiv

Strong [OIII]λ5007 emission line compact galaxies in LAMOST DR9: Blueberries, Green Peas and Purple Grapes

Green Pea and Blueberry galaxies are well-known for their compact size, low mass, strong emission lines and analogs to high-z Lyα emitting galaxies. In this study, 1547 strong [OIII]λ5007 emission line compact galaxies with 1694 spectra are selected from LAMOST DR9 at the redshift range from 0.0 to 0.59. According to the redshift distribution, these samples can be separated into three groups: Blueberries, Green Peas and Purple Grapes. Optical [MgII]λ2800 line feature, BPT diagram, multi-wavelength SED fitting, MIR color, and MIR variability are deployed to identify 23 AGN candidates from these samples, which are excluded for the following SFR discussions. We perform the multi-wavelength SED fitting with GALEX UV and WISE MIR data. Color excess from Balmer decrement shows these strong [OIII]λ5007 emission line compact galaxies are not highly reddened. The stellar mass of the galaxies is obtained by fitting LAMOST calibrated spectra with the emission lines masked. We find that the SFR is increasing with the increase of redshift, while for the sources within the same redshift bin, the SFR increases with mass with a similar slope as the SFMS. These samples have a median metallicity of 12+log(O/H) of 8.10. The metallicity increases with mass, and all the sources are below the mass-metallicity relation. The direct-derived Te-based metallicity from the [OIII]λ4363 line agrees with the empirical N2-based empirical gas-phase metallicity. Moreover, these compact strong [OIII]λ5007 are mostly in a less dense environment.

preprint2022arXiv

Ultracool dwarfs identified using spectra in LAMOST DR7

In this work, we identify 734 ultracool dwarfs with a spectral type of M6 or later, including one L0. Of this sample, 625 were studied spectroscopically for the first time. All of these ultracool dwarfs are within 360~pc, with a \textit{Gaia} G magnitude brighter than ~19.2 mag. By studying the spectra and checking their stellar parameters (Teff, logg, and [FeH] derived with the LAMOST pipeline, we found their cool red nature and their metallicity to be consistent with the nature of Galactic thin-disk objects. Furthermore, 77 of them show lithium absorption lines at 6708A, further indicating their young ages and substellar nature. Kinematics obtained through LAMOST radial velocities, along with the proper motion and parallax data from Gaia EDR3, also suggest that the majority of our targets are thin-disk objects. Kinematic ages were estimated through the relationship between the velocity dispersion and the average age for a certain population. Moreover, we identified 35 binaries, with 6 of them reported as binaries for the first time.

preprint2018arXiv

Emergent Quantum Dynamics of Vortex Line under Linear Local Induction Approximation

Using the linear local induction approximation, we investigated the self-induced motion of a vortex line that corresponds to the motion of a particle in quantum mechanics. Concerning Kelvin waves, the effective Schrödinger equation, physical quantities operators, and the corresponding path-integral formula are obtained. The vortex line-particle mapping may help in understanding particle motion in quantum mechanics.

preprint2016arXiv

LAMOST Spectrograph Response Curves: Stability and Application to flux calibration

The task of flux calibration for LAMOST (Large sky Area Multi-Object Spectroscopic Telescope) spectra is difficult due to many factors. For example, the lack of standard stars, flat fielding for large field of view, and variation of reddening between different stars especially at low galactic latitudes etc. Poor selection, bad spectral quality, or extinction uncertainty of standard stars not only might induce errors to the calculated spectral response curve (SRC), but also might lead to failures in producing final 1D spectra. In this paper, we inspected spectra with Galactic latitude |b|>=60 degree and reliable stellar parameters, determined through the LAMOST Stellar Parameter Pipeline (LASP), to study the stability of the spectrograph. To guarantee the selected stars had been observed by each fiber, we selected 37,931 high quality exposures of 29,000 stars from LAMOST DR2, and more than 7 exposures for each fiber. We calculated the SRCs for each fiber for each exposure, and calculated the statistics of SRCs for spectrographs with both the fiber variations and time variations. The result shows that the average response curve of each spectrograph (henceforth ASPSRC) is relatively stable with statistical errors <= 10%. From the comparison between each ASPSRC and the SRCs for the same spectrograph obtained by 2D pipeline, we find that the ASPSRCs are good enough to use for the calibration. The ASPSRCs have been applied to spectra which were abandoned by LAMOST 2D pipeline due to the lack of standard stars, increasing the number of LAMOST spectra by 52,181 in DR2. Comparing those same targets with SDSS, the relative flux differences between SDSS spectra and that of LAMOST spectra with the ASPSRC method are less than 10%, which underlines that the ASPSRC method is feasible for LAMOST flux calibration.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

astro-ph.GA astro-ph.SR astro-ph.IM Artificial Intelligence Computation and Language cond-mat.mes-hall Machine Learning quant-ph

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2512.24840:author:13:xiao-kong

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2512.24220:author:9:xiao-kong

Imported May 21, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.03476:author:2:xiao-kong

Imported May 20, 2026Synced May 20, 2026

5 works

A-Li Luo

Researcher

A-Li Luo contributes to research discovery and scholarly infrastructure.

Open to collaborate

5 works

Bing Du

Researcher

Bing Du contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Fang Zuo

Researcher

Fang Zuo contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Hugh R. A. Jones

Researcher

Hugh R. A. Jones contributes to research discovery and scholarly infrastructure.

Open to collaborate

Xiao Kong

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification

StellarF: A Physics-Informed LoRA Framework for Stellar Flare Forecasting with Historical & Statistical Data

Non-Euclidean interfaces decode the continuous landscape of graphene-induced surface reconstructions

Scalable Stellar Parameter Inference Using Python-based LASP: From CPU Optimization to GPU Acceleration

Projected rotational velocities for LAMOST stars with effective temperature lower than 9000 K

Estimating Atmospheric Parameters from LAMOST Low-Resolution Spectra with Low SNR

Strong [OIII]λ5007 emission line compact galaxies in LAMOST DR9: Blueberries, Green Peas and Purple Grapes

Ultracool dwarfs identified using spectra in LAMOST DR7

Emergent Quantum Dynamics of Vortex Line under Linear Local Induction Approximation

LAMOST Spectrograph Response Curves: Stability and Application to flux calibration