Researcher profile

Weixiang Sun

Weixiang Sun contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

PreScam: A Benchmark for Predicting Scam Progression from Early Conversations

Conversational scams, such as romance and investment scams, are emerging as a major form of online fraud. Unlike one-shot scam lures such as fake lottery or unpaid toll messages, they unfold through multi-turn conversations in which scammers gradually manipulate victims using evolving psychological techniques. However, existing research mainly focuses on static scam detection or synthetic scams, leaving open whether language models can understand how real-world scams progress over time. We introduce PreScam, a benchmark for modeling scam progression from early conversations. Built from user-submitted scam reports, PreScam filters and structures 177,989 raw reports into 11,573 conversational scam instances spanning 20 scam categories. Each instance is hierarchically structured according to the scam lifecycle defined by the proposed scam kill chain, and further annotated at the turn level with scammer psychological actions and victim responses. We benchmark models on two tasks: real-time termination prediction, which estimates whether a conversation is approaching the termination stage, and scammer action prediction, which forecasts the scammer's subsequent actions. Results show a clear gap between surface-level fluency and progression modeling: supervised encoders substantially outperform zero-shot LLMs on real-time termination prediction, while next-action prediction remains only moderately successful even for strong LLMs. Taken together, these results show that current models can capture some scam-related cues, yet still struggle to track how risk escalates and how manipulation unfolds across turns.

preprint2022arXiv

Estimating accurate reddening values of LAMOST M dwarfs

M dwarfs are the dominating type of stars in the solar neighbourhood. They serve as excellent tracers for the study of the distribution and properties of the nearby interstellar dust. In this work, we aim to obtain high accuracy reddening values of M dwarf stars from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Data Release 8 (DR8). Combining the LAMOST spectra with the high-quality optical photometry from the Gaia Early Data Release 3 (Gaia EDR3), we have estimated the reddening values $E(G_{\rm BP}-G_{\rm RP})$ of 641,426 M dwarfs with the machine-learning algorithm Random Forest regression. The typical reddening uncertainty is only 0.03 mag in $E(G_{\rm BP}-G_{\rm RP})$. We have obtained the reddening coefficient $R_{(G_{\rm BP}-G_{\rm RP})}$, which is a function of the stellar intrinsic colour $(G_{\rm BP}-G_{\rm RP})_0$ and reddening value $E(B-V)$. The values of $E(B-V)$ are also provided for the individual stars in our catalogue. Our resultant high accuracy reddening values of M dwarfs, combined with the Gaia parallaxes, will be very powerful to map the fine structures of the dust in the solar neighbourhood.

preprint2020arXiv

Discovery of A candidate Hypervelocity star originated from the Sagittarius Dwarf Spheroidal galaxy

In this letter, we report the discovery of an intriguing HVS (J1443+1453) candidate that is probably from the Sagittarius Dwarf Spheroidal galaxy (Sgr dSph). The star is an old and very metal-poor low-mass main-sequence turn-off star (age $\sim14.0$ Gyr and [Fe/H] $= -2.23$ dex) and has a total velocity of $559.01^{+135.07}_{-87.40}$ km s$^{-1}$ in the Galactic rest-frame and a heliocentric distance of $2.90^{+0.72}_{-0.48}$ kpc. The velocity of J1443+1453 is larger than the escape speed at its position, suggesting it a promising HVS candidate. By reconstructing its trajectory in the Galactic potential, we find that the orbit of J1443+1453 intersects closely with that of the Sgr dSph $37.8^{+4.6}_{-6.0}$ Myr ago, when the latter has its latest pericentric passage through the Milky Way. The encounter occurs at a distance $2.42^{+1.80}_{-0.77}$ kpc from the centre of Sgr dSph, smaller than the size of the Sgr dSph. Chemical properties of this star are also consistent with those of one Sgr dSph associated globular cluster or of the Sgr stream member stars. Our finding suggests that J1443+1453 is an HVS either tidally stripped from the Sgr dSph or ejected from the Sgr dSph by the gravitational slingshot effect, requiring a (central) massive/intermediate-mass black hole or a (central) massive primordial black hole in the Sgr dSph.

preprint2020arXiv

Mapping the Galactic disk with the LAMOST and Gaia Red clump sample: I: precise distances, masses, ages and 3D velocities of $\sim$ 140000 red clump stars

We present a sample of $\sim$ 140,000 primary red clump (RC) stars of spectral signal-to-noise ratios higher than 20 from the LAMOST Galactic spectroscopic surveys, selected based on their positions in the metallicity-dependent effective temperature--surface gravity and color--metallicity diagrams, supervised by high-quality $Kepler$ asteroseismology data. The stellar masses and ages of those stars are further determined from the LAMOST spectra, using the Kernel Principal Component Analysis method, trained with thousands of RCs in the LAMOST-$Kepler$ fields with accurate asteroseismic mass measurements. The purity and completeness of our primary RC sample are generally higher than 80 per cent. For the mass and age, a variety of tests show typical uncertainties of 15 and 30 per cent, respectively. Using over ten thousand primary RCs with accurate distance measurements from the parallaxes of Gaia DR2, we re-calibrate the $K_{\rm s}$ absolute magnitudes of primary RCs by, for the first time, considering both the metallicity and age dependencies. With the the new calibration, distances are derived for all the primary RCs, with a typical uncertainty of 5--10 per cent, even better than the values yielded by the Gaia parallax measurements for stars beyond 3--4 kpc. The sample covers a significant volume of the Galactic disk of $4 \leq R \leq 16$ kpc, $|Z| \leq 5$ kpc, and $-20 \leq ϕ\leq 50^{\circ}$. Stellar atmospheric parameters, line-of-sight velocities and elemental abundances derived from the LAMOST spectra and proper motions of Gaia DR2 are also provided for the sample stars. Finally, the selection function of the sample is carefully evaluated in the color-magnitude plane for different sky areas. The sample is publicly available.