Researcher profile

Rui Xue

Rui Xue contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2025arXiv

Training Report of TeleChat3-MoE

TeleChat3-MoE is the latest series of TeleChat large language models, featuring a Mixture-of-Experts (MoE) architecture with parameter counts ranging from 105 billion to over one trillion,trained end-to-end on Ascend NPU cluster. This technical report mainly presents the underlying training infrastructure that enables reliable and efficient scaling to frontier model sizes. We detail systematic methodologies for operator-level and end-to-end numerical accuracy verification, ensuring consistency across hardware platforms and distributed parallelism strategies. Furthermore, we introduce a suite of performance optimizations, including interleaved pipeline scheduling, attention-aware data scheduling for long-sequence training,hierarchical and overlapped communication for expert parallelism, and DVM-based operator fusion. A systematic parallelization framework, leveraging analytical estimation and integer linear programming, is also proposed to optimize multi-dimensional parallelism configurations. Additionally, we present methodological approaches to cluster-level optimizations, addressing host- and device-bound bottlenecks during large-scale training tasks. These infrastructure advancements yield significant throughput improvements and near-linear scaling on clusters comprising thousands of devices, providing a robust foundation for large-scale language model development on hardware ecosystems.

preprint2022arXiv

Can one-zone hadronuclear model explain the hard-TeV spectrum of BL Lac objects?

Context. The intrinsic TeV emission of some BL Lacs are characterized by a hard spectrum (the hard-TeV spectrum) after correcting for the extragalactic background light. The hard-TeV spectra pose a challenge to conventional one-zone models, including the leptonic model, the photohadronic model, the proton synchrotron model, etc. Aims. In this work, we study if the one-zone hadronuclear (pp) model can be used to interpret the hard-TeV spectra of BL Lacs without introducing extreme parameters. Methods. We give analytical calculations to study if there is a parameter space and the charge neutrality condition of jet can be satisfied when interpreting the hard-TeV spectra of BL Lacs without introducing a super-Eddington jet power. Results. We find that in a sample of hard-TeV BL Lacs collected by Xue et al. (2019a), only the hard-TeV spectrum of 1ES 0229+200 could be explained by gamma-ray from pi-0 decay produced in the pp interactions, but at the cost of setting a small radius of the radiation region that comparable to the Schwarzschild radius of the central black hole. Combining with previous studies of other one-zone models, we suggest that the hard-TeV spectra of BL Lacs cannot be explained by any one-zone models without introducing extreme parameters, and should originate from the multiple radiation regions.

preprint2022arXiv

The Astropy Project: Sustaining and Growing a Community-oriented Open-source Project and the Latest Major Release (v5.0) of the Core Package

The Astropy Project supports and fosters the development of open-source and openly-developed Python packages that provide commonly needed functionality to the astronomical community. A key element of the Astropy Project is the core package $\texttt{astropy}$, which serves as the foundation for more specialized projects and packages. In this article, we summarize key features in the core package as of the recent major release, version 5.0, and provide major updates for the Project. We then discuss supporting a broader ecosystem of interoperable packages, including connections with several astronomical observatories and missions. We also revisit the future outlook of the Astropy Project and the current status of Learn Astropy. We conclude by raising and discussing the current and future challenges facing the Project.

preprint2022arXiv

The Evolution of Molecular Gas Fraction Traced by the CO Tully-Fisher Relation

Carbon monoxide (CO) observations show a luminosity$-$line-width correlation that evolves with redshift. We present a method to use CO measurements alone to infer the molecular gas fraction ($f_{\rm mol}$) and constrain the CO$-$H$_2$ conversion factor ($α_{\rm CO}$). We compile from the literature spatially integrated low-$J$ CO observations of six galaxy populations, including a total of 449 galaxies between $0.01 \leq z \leq 3.26$. The CO data of each population provide an estimate of the $α_{\rm CO}$-normalized mean molecular gas fraction ($f_{\rm mol}/α_{\rm CO}$). The redshift evolution of the luminosity$-$line-width correlation thus indicates an evolution of $f_{\rm mol}/α_{\rm CO}$. We use a Bayesian-based Monte-Carlo Markov Chain sampler to derive the posterior probability distribution functions of $f_{\rm mol}/α_{\rm CO}$ for these galaxy populations, accounting for random inclination angles and measurement errors in the likelihood function. We find that the molecular gas fraction evolves rapidly with redshift, $f_{\rm mol} \propto (1+z)^β$ with $β\simeq 2$, for both normal star-forming and starburst galaxies. Furthermore, the evolution trend agrees well with that inferred from the Kennicutt-Schmidt relation and the star-forming main sequence. Finally, at $z < 0.1$ normal star-forming galaxies require a $\sim5\times$ larger $α_{\rm CO}$ than starburst galaxies to match their molecular gas fractions, but at $z > 1$ both star-forming types exhibit sub-Galactic $α_{\rm CO}$ values and normal star-forming galaxies appear more gas-rich than starbursts. Future applications of this method include calibrating Tully-Fisher relations without inclination correction and inferring the evolution of the atomic gas fraction with HI observations.

preprint2021arXiv

A Long Stream of Metal-Poor Cool Gas around a Massive Starburst Galaxy at z = 2.67

We present the first detailed dissection of the circumgalactic medium (CGM) of massive starburst galaxies at z > 2. Our target is a submillimeter galaxy (SMG) at z = 2.674 that has a star formation rate of 1200 $M_\odot$/yr and a molecular gas reservoir of $1.3\times10^{11} M_\odot$. We characterize its CGM with two background QSOs at impact parameters of 93 kpc and 176 kpc. We detect strong HI and metal-line absorption near the redshift of the SMG towards both QSOs, each consisting of three main subsystems spanning over 1500 km/s. The absorbers show remarkable kinematic and metallicity coherence across a separation of 86 kpc. In particular, the cool gas in the CGM of the SMG exhibits high HI column densities ($\log N_{\rm HI}/{\rm cm}^{-2} = 20.2, 18.6$), low metallicities ([M/H] $\approx$ -2.0), and similar radial velocities ($\approx$ -300 km/s). While the HI column densities match previous results on the CGM around QSOs at z > 2, the metallicities are lower by more than an order of magnitude, making it an outlier in the line width$-$metallicity relation of damped Ly$α$ absorbers. The large physical extent, the velocity coherence, the high surface density, and the low metallicity are all consistent with the cool, inflowing, and near-pristine gas streams predicted to penetrate hot massive halos at z > 1.5. We estimate a total gas accretion rate of ~100 $M_\odot$/yr from three such streams, which falls short of the star formation rate but is consistent with simulations. At this rate, it takes about a gigayear to acquire the molecular gas reservoir of the central starburst.

preprint2021arXiv

A unified model for orphan and multi-wavelength blazar flares

Blazars are a class of active galactic nuclei which host relativistic jets oriented close to the observer&#39;s line of sight. Blazars have very complex variability properties. Flares, namely flux variations around the mean value with a well-defined shape and duration, are one of the identifying properties of the blazar phenomenon. Blazars are known to exhibit multi-wavelength flares, but also &#34;orphan&#34; flares, namely flux changes that appear only in a specific energy range. Various models, sometimes at odds with each other, have been proposed to explain specific flares even for a single source, and cannot be synthesized into a coherent picture. In this paper, we propose a unified model for explaining orphan and multi-wavelength flares from blazars in a common framework. We assume that the blazar emission during a flare consists of two components: (i) a quasi-stable component that arises from the superposition of numerous but comparatively weak dissipation zones along the jet, forming the background (low-state) emission of the blazar, and (ii) a transient component, which is responsible for the sudden enhancement of the blazar flux, forming at a random distance along the jet by a strong energy dissipation event. Whether a multi-wavelength or orphan flare is emitted depends on the distance from the base of the jet where the dissipation occurs. Generally speaking, if the dissipation occurs at a small/large distance from the supermassive black hole, the inverse Compton/synchrotron radiation dominates and an orphan gamma-ray/optical flare tends to appear. On the other hand, we may expect a multi-wavelength flare if the dissipation occurs at an intermediate distance. We show that the model can successfully describe the spectral energy distribution of different flares from the flat spectrum radio quasar 3C 279 and the BL Lac object PKS 2155-304.

preprint2021arXiv

Detection of a possible high-confidence radio quasi-periodic oscillation in the BL Lac PKS J2134-0153

We have searched quasi-periodic oscillations (QPOs) for BL Lac PKS J2134-0153 in the 15 GHz radio light curve announced by the Owens Valley Radio Observatory 40-m telescope during the period from 2008-01-05 to 2019-05-18, utilizing the Lomb-Scargle periodogram (LSP) and the weighted wavelet Z-transform (WWZ) techniques. This is the first time that to search for periodic radio signal in BL Lac PKS J2134-0153 by these two methods. These two methods consistently reveal a QPO of 4.69 $\pm$ 0.14 years (>5 $σ$ confidence level). We discuss possible causes for this QPO, and we expected that the binary black holes scenario, where the QPO is caused by the precession of the binary black holes, is the most likely explanation. BL Lac PKS J2134-0153 thus could be a good binary black hole candidate. In the binary black holes scenario, the distance between the primary black hole and the secondary black hole is 1.83$\times$10$^{16}$ cm.

preprint2020arXiv

A two-zone blazar radiation model for &#34;orphan&#34; neutrino flares

In this work, we investigate the 2014-2015 neutrino flare associated with the blazar TXS 0506+056 and a recently discovered muon neutrino event IceCube-200107A in spatial coincidence with the blazar 4FGL J0955.1+3551, under the framework of a two-zone radiation model of blazars where an inner/outer blob close to/far from the supermassive black hole are invoked. An interesting feature that the two sources share in common is that no evidence of GeV gamma-ray activity is found during the neutrino detection period, probably implying a large opacity for GeV gamma rays in the neutrino production region. In our model, continuous particle acceleration/injection takes place in the inner blob at the jet base, where the hot X-ray corona of the supermassive black hole provides target photon fields for efficient neutrino production and strong GeV gamma-ray absorption. We show that this model can self-consistently interpret the neutrino emission from both two blazars in a large parameter space. In the meantime, the dissipation processes in outer blob are responsible for the simultaneous multi-wavelength emission of both sources. In agreement with previous studies of TXS 0506+056 and, an intense MeV emission from the induced electromagnetic cascade in the inner blob is robustly expected to accompany the neutrino flare in our model could be used to test the model with the next-generation MeV gamma-ray detector in the future.

preprint2020arXiv

Constraints on the intergalactic magnetic field from $γ$-ray observations of GRB 190114C

Very high energy photons from cosmological gamma-ray bursts (GRBs) are expected to interact with extragalactic background light (EBL) and produce electron-positron pairs when they propagate through intergalactic medium (IGM). These relativistic pairs will then up-scatter cosmic microwave background (CMB) photons and emit secondary GeV emission. Meanwhile, the motion of these pairs are deflected by intergalactic magnetic field (IGMF), so the secondary GeV photons arrive later than the primary emission. It has been suggested that the properties of the secondary GeV emission can be used to constrain IGMF. Recently, TeV gamma-ray emission has been detected, for the first time, from a GRB (GRB 190114C) by the MAGIC telescope and its steep ${\rm γ-ray}$ spectrum shows a clear evidence of absorption by EBL. We then constrain the IGMF with the GeV flux limit obtained from the $Fermi$-LAT observations. We find a limit of $>10^{-19.5}$ G for the coherence length of $λ\leq 1$ Mpc. Although this limit is weaker than that obtained by using blazars, it represents the first limit from ${\rm γ-ray}$ observations of GRBs, which provides an independent constraint on IGMF. We also find that, for transient ${\rm γ-ray}$ sources, one can choose a favorable time window to search for the echo emission at a particular energy.

preprint2020arXiv

COVID-19 Docking Server: A meta server for docking small molecules, peptides and antibodies against potential targets of COVID-19

Motivation: The coronavirus disease 2019 (COVID-19) caused by a new type of coronavirus has been emerging from China and led to thousands of death globally since December 2019. Despite many groups have engaged in studying the newly emerged virus and searching for the treatment of COVID-19, the understanding of the COVID-19 target-ligand interactions represents a key chal-lenge. Herein, we introduce COVID-19 Docking Server, a web server that predicts the binding modes between COVID-19 targets and the ligands including small molecules, peptides and anti-bodies. Results: Structures of proteins involved in the virus life cycle were collected or constructed based on the homologs of coronavirus, and prepared ready for docking. The meta platform provides a free and interactive tool for the prediction of COVID-19 target-ligand interactions and following drug discovery for COVID-19.

preprint2020arXiv

Multicolor Optical Monitoring of the Blazar S5 0716+714 from 2017 to 2019

We continuously monitored the blazar S5 0716+714 in the optical $g$, $r$ and $i$ bands from Nov. 10, 2017 to Jun. 06, 2019. The total number of observations is 201 nights including 26973 data points. This is a very large quasi-simultaneous multicolor sample for the blazar. The average time spans and time resolutions are 3.4 hours and 2.9 minutes per night, respectively. During the period of observations, the target source in the $r$ band brightens from $14^{\rm m}.16$ to $12^{\rm m}.29$ together with five prominent sub-flares, and then first becomes fainter to $14^{\rm m}.76$ and again brightens to $12^{\rm m}.94$ with seven prominent sub-flares. For the long-term variations, we find a strong flatter when brighter (FWB) trend at a low flux state and then a weak FWB trend at a higher flux state. A weak FWB trend at a low flux state and then a strong FWB trend at a higher flux state are also reported. Most of sub-flares show the strong FWB trends, except for two flares with a weak FWB trend. The particle acceleration and cooling mechanisms together with the superposition of different FWB-slopes from sub-flares are likely to explain the optical color behaviours. A scenario of bent jet is discussed.

preprint2020arXiv

The physical properties of $Fermi$-4LAC flat spectrum radio quasars

In this work, we collect quasi-simultaneous infrared, optical, X-ray and $γ$-ray data of 60 $Fermi$-4LAC flat spectrum radio quasars (FSRQs). In the framework of the conventional one-zone leptonic model, we investigate the physical properties of $Fermi$-4LAC FSRQs&#39; jets by modeling their quasi-simultaneous spectral energy distributions (SEDs). Our main results are summarized as follows. (1) There is a linear correlation between synchrotron peak frequency and curvature of the electron energy distribution. As suggested by previous works, the slope of the best linear fitting equation of this correlation is consistent with statistic acceleration which needs a fluctuation of fractional acceleration gain. (2) The gamma-ray dissipation regions are located at the range from 0.1 to 10 pc away from the super-massive black hole, and located outside the broad-line region (BLR) and within the dusty torus (DT). (3) A size relation $P_{\rm e}$ (the kinetic power carried in relativistic electrons) $\sim$ $P_{\rm B}$ (Poynting flux) $\leq$ $P_{\rm r}$ (the radiative power ) $<$ $P_{\rm p}$ (the kinetic power in cold protons) is found in our modeling. Among them, $P_{\rm e}\sim P_{\rm B}$ suggests that SEDs of almost all FSRQs with parameters are close to equipartition between the magnetic field and the relativistic electrons. The $P_{\rm e} < P_{\rm r}$ suggest that the most energy of the relativistic electrons are dissipated by EC radiation for FSRQs. (4) There is an anti-correlation between the peak energy of SEDs ($γ_{\rm peak}$) and the jet power ($P_{\rm jet}$), which is consistent with the blazar sequence.

preprint2019arXiv

A two-zone model for blazar emission: implications for TXS 0506+056 and the neutrino event IceCube-170922A

A high-energy muon neutrino event, IceCube-170922A, was recently discovered in both spatial and temporal coincidence with a gamma-ray flare of the blazar TXS 0506+056. It has been shown, with standard one-zone models, that neutrinos can be produced in the blazar jet via hadronic interactions, but with a flux which is mostly limited by the X-ray data. In this work, we explore the neutrino production from TXS 0506+056 by invoking two physically distinct emission zones in the jet, separated by the broad line region (BLR). Using the Doppler-boosted radiation of the BLR as the target photon field, the inner zone accounts for the neutrino and gamma-ray emission via $pγ$ interactions and inverse Compton scattering respectively, while the outer zone produces the optical and X-ray emission via synchrotron and synchrotron self-Compton processes. The different conditions of the two zones allow us to suppress the X-ray emission from the electromagnetic cascade, and set a much higher upper limit on the muon neutrino flux (i.e., $\sim 10^{-11}\rm erg~cm^{-2}s^{-1}$) than in one-zone models. We compare, in detail, our scenario with one-zone models discussed in the literature, and argue that differentiating between such scenarios will become possible with next generation neutrino telescopes, such as IceCube-Gen2.