Researcher profile

Ji Xu

Ji Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Testing a Linear Relation: Short-Range Correlations and the EMC Effect for Gluons and Quarks in Nuclei

In this work, we focus on the possible linear relation between short-range correlations (SRCs) and the EMC effect for partons in nuclei. First, we test a linear relationship pertaining to gluons in bound nuclei; it is manifested as a correlation between the slope of the reduced cross section ratio in deep inelastic scattering (DIS) and the cross section of sub-threshold $J/ψ$ photoproduction. For comparison, the results from four different global analyses groups of nuclear parton distribution functions (nPDFs) are utilized. These results show a good linear correlation between the gluons in bound nuclei and the slope of the reduced cross section ratio, consistent with the possible presence of nuclear effects in the gluon distributions. Second, we investigate the linear relationship of quarks in the proton-induced Drell-Yan process. The corresponding results for quarks show strong sensitivity to the parameterization forms adopted by the different groups. These findings enhance our understanding of the substructure in bound nuclei and provide valuable reference for future global fitting of nPDFs.

preprint2023arXiv

Determinate Node Selection for Semi-supervised Classification Oriented Graph Convolutional Networks

Graph Convolutional Networks (GCNs) have been proved successful in the field of semi-supervised node classification by extracting structural information from graph data. However, the random selection of labeled nodes used by GCNs may lead to unstable generalization performance of GCNs. In this paper, we propose an efficient method for the deterministic selection of labeled nodes: the Determinate Node Selection (DNS) algorithm. The DNS algorithm identifies two categories of representative nodes in the graph: typical nodes and divergent nodes. These labeled nodes are selected by exploring the structure of the graph and determining the ability of the nodes to represent the distribution of data within the graph. The DNS algorithm can be applied quite simply on a wide range of semi-supervised graph neural network models for node classification tasks. Through extensive experimentation, we have demonstrated that the incorporation of the DNS algorithm leads to a remarkable improvement in the average accuracy of the model and a significant decrease in the standard deviation, as compared to the original method.

preprint2023arXiv

Symmetry breaking of Pancharatnam-Berry phase using non-axisymmetric meta-atoms

The Pancharatnam-Berry (PB) phase in metasurfaces obeys the symmetry restriction, according to which the PB phases of two orthogonal circularly polarized waves are the same but with opposite signs. Here, we reveal a general mechanism to break the axisymmetry of meta-atoms in order to break the PB-phase symmetry restriction. As a proof of concept, we designed a novel meta-atom with a QR-code structure and successfully demonstrated circular-polarization multiplexing metasurface holography. This study provides a fundamentally new understanding of the PB phase and opens a path for arbitrary wavefront engineering using asymmetric electromagnetic structures.

preprint2022arXiv

Distribution Amplitudes of $K^*$ and $ϕ$ at Physical Pion Mass from Lattice QCD

We present the first lattice QCD calculation of the distribution amplitudes of longitudinally and transversely polarized vector mesons $K^*$ and $ϕ$ using large momentum effective theory. We use the clover fermion action on three ensembles with 2+1+1 flavors of highly improved staggered quarks (HISQ) action, generated by MILC collaboration, at physical pion mass and \{0.06, 0.09, 0.12\} fm lattice spacings, and choose three different hadron momenta $P_z=\{1.29, 1.72, 2.15\}$ GeV. The resulting lattice matrix elements are nonperturbatively renormalized in a hybrid scheme proposed recently. An extrapolation to the continuum and infinite momentum limit is carried out. We find that while the longitudinal distribution amplitudes tend to be close to the asymptotic form, the transverse ones deviate rather significantly from the asymptotic form. Our final results provide crucial {\it ab initio} theory inputs for analyzing pertinent exclusive processes.

preprint2022arXiv

Improving CTC-based speech recognition via knowledge transferring from pre-trained language models

Recently, end-to-end automatic speech recognition models based on connectionist temporal classification (CTC) have achieved impressive results, especially when fine-tuned from wav2vec2.0 models. Due to the conditional independence assumption, CTC-based models are always weaker than attention-based encoder-decoder models and require the assistance of external language models (LMs). To solve this issue, we propose two knowledge transferring methods that leverage pre-trained LMs, such as BERT and GPT2, to improve CTC-based models. The first method is based on representation learning, in which the CTC-based models use the representation produced by BERT as an auxiliary learning target. The second method is based on joint classification learning, which combines GPT2 for text modeling with a hybrid CTC/attention architecture. Experiment on AISHELL-1 corpus yields a character error rate (CER) of 4.2% on the test set. When compared to the vanilla CTC-based models fine-tuned from the wav2vec2.0 models, our knowledge transferring method reduces CER by 16.1% relatively without external LMs.

preprint2022arXiv

Inverse moment of the $B$-meson quasi distribution amplitude

We perform a study on the structure of inverse moment (IM) of quasi distributions, by taking $B$-meson quasi distribution amplitude (quasi-DA) as an example. Based on a one-loop calculation, we derive the renormalization group equation and velocity evolution equation for the first IM of quasi-DA. We find that, in the large velocity limit, the first IM of $B$-meson quasi-DA can be factorized into IM as well as logarithmic moments of light-cone distribution amplitude (LCDA), accompanied by short distance coefficients. Our results can be useful either in understanding the patterns of perturbative matching in Large Momentum Effective Theory or evaluating inverse moment of $B$-meson LCDA on the lattice.

preprint2022arXiv

Matching the B-meson quasidistribution amplitude in the RI/MOM scheme

Within the framework of large momentum effective theory (LaMET), the light-cone distribution amplitude of $B$-meson in heavy-quark effective theory (HQET) can be extracted from lattice calculations of quasidistribution amplitude through hard-collinear factorization formula. This quasiquantity can be renormalized in a regularization-independent momentum subtraction scheme (RI/MOM). In this work, we derive the matching coefficient which connects the renormalized quasiditribution amplitude in the RI/MOM scheme and standard LCDA in the $\overline{\textrm{MS}}$ scheme at one-loop accuracy. Our numerical analysis approves of the feasibility of RI/MOM scheme for renormalizing $B$-meson quasidistribution amplitude. These results will be crucial for exploring the partonic structure of heavy-quark hadrons.

preprint2022arXiv

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset

This paper introduces a high-quality rich annotated Mandarin conversational (RAMC) speech dataset called MagicData-RAMC. The MagicData-RAMC corpus contains 180 hours of conversational speech data recorded from native speakers of Mandarin Chinese over mobile phones with a sampling rate of 16 kHz. The dialogs in MagicData-RAMC are classified into 15 diversified domains and tagged with topic labels, ranging from science and technology to ordinary life. Accurate transcription and precise speaker voice activity timestamps are manually labeled for each sample. Speakers' detailed information is also provided. As a Mandarin speech dataset designed for dialog scenarios with high quality and rich annotations, MagicData-RAMC enriches the data diversity in the Mandarin speech community and allows extensive research on a series of speech-related tasks, including automatic speech recognition, speaker diarization, topic detection, keyword search, text-to-speech, etc. We also conduct several relevant tasks and provide experimental results to help evaluate the dataset.

preprint2021arXiv

Consistent Risk Estimation in Moderately High-Dimensional Linear Regression

Risk estimation is at the core of many learning systems. The importance of this problem has motivated researchers to propose different schemes, such as cross validation, generalized cross validation, and Bootstrap. The theoretical properties of such estimates have been extensively studied in the low-dimensional settings, where the number of predictors $p$ is much smaller than the number of observations $n$. However, a unifying methodology accompanied with a rigorous theory is lacking in high-dimensional settings. This paper studies the problem of risk estimation under the moderately high-dimensional asymptotic setting $n,p \rightarrow \infty$ and $n/p \rightarrow δ>1$ ($δ$ is a fixed number), and proves the consistency of three risk estimates that have been successful in numerical studies, i.e., leave-one-out cross validation (LOOCV), approximate leave-one-out (ALO), and approximate message passing (AMP)-based techniques. A corner stone of our analysis is a bound that we obtain on the discrepancy of the `residuals' obtained from AMP and LOOCV. This connection not only enables us to obtain a more refined information on the estimates of AMP, ALO, and LOOCV, but also offers an upper bound on the convergence rate of each estimate.

preprint2021arXiv

Weak decays of doubly heavy baryons: four-body nonleptonic decay channels

The LHCb Collaboration announced the observation of doubly charmed baryon through $Ξ_{c c}^{++} \rightarrow Λ_{c}^{+} K^{-} π^{+} π^{+}$ in 2017. Since then, a series of studies of doubly heavy baryons have been presented. $Ξ_{cc}^{++}$ was discovered through nonleptonic four-body decay mode, and experimental data has indicated that the decay modes of $Ξ_{c c}^{++}$ are not saturated by two and three-body intermediate states. In this work, we analyze the four-body weak decays of doubly heavy baryons $Ξ_{cc}^{++}, Ξ_{cc}^+$, and $Ω_{cc}^+$. Decay amplitudes for various channels are parametrized in terms of SU(3) irreducible amplitudes. We point out that branching fractions for Cabibbo-allowed processes $Ξ_{cc}^{+}\toΛ_c^+π^+ π^0 K^-$, $Ω_{cc}^{+}\toΛ_c^+π^+ \overline K^0 K^-$ would be helpful to search for $Ξ_{cc}^+$ and $Ω_{cc}^+$ in future measurements at experimental facilities like LHC, Belle II, and CEPC.

preprint2020arXiv

$B$-meson light-cone distribution amplitude from the Euclidean quantity

A new method for the model-independent determination of the light-cone distribution amplitude (LCDA) of the $B$-meson in heavy quark effective theory (HQET) is proposed by combining the large momentum effective theory (LaMET) and the numerical simulation technique on the Euclidean lattice. We demonstrate the autonomous scale dependence of the non-local quasi-HQET operator with the aid of the auxiliary field approach, and further determine the perturbative matching coefficient entering the hard-collinear factorization formula for the $B$-meson quasi-distribution amplitude at the one-loop accuracy. These results will be crucial to explore the partonic structure of heavy-quark hadrons in the static limit and to improve the theory description of exclusive $B$-meson decay amplitudes based upon perturbative QCD factorization theorems.

preprint2020arXiv

Analysis of $B_c \to τν_τ$ at CEPC

The precise determination of the $B_c \to τν_τ$ branching ratio provides an advantageous opportunity for understanding the electroweak structure of the Standard Model, measuring the CKM matrix element $|V_{cb}|$ and probing new physics models. In this paper, we discuss the potential of measuring the processes of $B_c \to τν_τ$ with $τ$ decaying leptonically at the proposed Circular Electron Positron Collider (CEPC). We conclude that during the $Z$ pole operation, the channel signal can achieve five $σ$ significance with $\sim 10^9$ $Z$ decays, and the signal strength accuracies for $B_c \to τν_τ$ can reach around 1% level at the nominal CEPC $Z$ pole statistics of one trillion $Z$ decays assuming the total $B_c \to τν_τ$ yield is $3.6 \times 10^6$. Our theoretical analysis indicates the accuracy could provide a strong constraint on the general effective Hamiltonian for the $b \to cτν$ transition. If the total $B_c$ yield can be determined to $\mathcal{O}(1\%)$ level of accuracy in the future, these results also imply $|V_{cb}|$ could be measured up to $\mathcal{O}(1\%)$ level of accuracy.

preprint2020arXiv

Spectral Method for Phase Retrieval: an Expectation Propagation Perspective

Phase retrieval refers to the problem of recovering a signal $\mathbf{x}_{\star}\in\mathbb{C}^n$ from its phaseless measurements $y_i=|\mathbf{a}_i^{\mathrm{H}}\mathbf{x}_{\star}|$, where $\{\mathbf{a}_i\}_{i=1}^m$ are the measurement vectors. Many popular phase retrieval algorithms are based on the following two-step procedure: (i) initialize the algorithm based on a spectral method, (ii) refine the initial estimate by a local search algorithm (e.g., gradient descent). The quality of the spectral initialization step can have a major impact on the performance of the overall algorithm. In this paper, we focus on the model where the measurement matrix $\mathbf{A}=[\mathbf{a}_1,\ldots,\mathbf{a}_m]^{\mathrm{H}}$ has orthonormal columns, and study the spectral initialization under the asymptotic setting $m,n\to\infty$ with $m/n\toδ\in(1,\infty)$. We use the expectation propagation framework to characterize the performance of spectral initialization for Haar distributed matrices. Our numerical results confirm that the predictions of the EP method are accurate for not-only Haar distributed matrices, but also for realistic Fourier based models (e.g. the coded diffraction model). The main findings of this paper are the following: (1) There exists a threshold on $δ$ (denoted as $δ_{\mathrm{weak}}$) below which the spectral method cannot produce a meaningful estimate. We show that $δ_{\mathrm{weak}}=2$ for the column-orthonormal model. In contrast, previous results by Mondelli and Montanari show that $δ_{\mathrm{weak}}=1$ for the i.i.d. Gaussian model. (2) The optimal design for the spectral method coincides with that for the i.i.d. Gaussian model, where the latter was recently introduced by Luo, Alghamdi and Lu.

preprint2020arXiv

The Multi-granularity in Graph Revealed by a Generalized Leading Tree

There are hierarchical characteristics in the network and how to effectively reveal the hierarchical characteristics in the network is a problem in the research of network structure. If a node is assigned to the community to which it belongs, how to assign the community to a higher level of community to which it belongs is a problem. In this paper, the density of data points is investigated based on the clustering task. By forming the density of data points, the hierarchical difference of data points is constructed. In combination with the distance between data points, a density-based leading tree can be constructed. But in a graph structure, it is a problem to build a lead tree that reveals the hierarchical relationships of the nodes on the graph. Based on the method of tree formation based on density, this paper extends the model of leading tree to the hierarchical structure of graph nodes, discusses the importance of graph nodes, and forms a leading tree that can reveal the hierarchical structure of graph nodes and the dependency of community. Experiments were carried out on real data sets, and a tree structure was formed in the experiment. This graph leading tree can well reveal the hierarchical relationships in the graph structure.

preprint2019arXiv

Gluonic Probe for the Short Range Correlation in Nucleus

We investigate the gluonic probe to the nucleon-nucleon short range correlation (SRC) in nucleus through heavy flavor production in deep inelastic scattering (DIS). The relevant EMC effects of $F_2^{c\bar c}$ structure function will provide a universality test of the SRCs which have been extensively studied in the quark-channel. These SRCs can also be studied through the sub-threshold production of heavy flavor in $eA$ collisions at the intermediate energy electron-ion collider, including open Charm and $J/ψ$ ($Υ$) production.

preprint2019arXiv

Sub-threshold $J/ψ$ and $Υ$ Production in $γA$ Collisions

We study sub-threshold heavy quarkonium ($J/ψ$ and $Υ$) photo-productions in $γA$ collisions as an independent test of the universality of the nucleon-nucleon short range correlation (SRC) in nuclear scattering processes. Just below the $γp$ threshold, the cross section is dominated by the mean field contribution of nucleons inside the nucleus. The SRC contributions start to dominate at lower photon energies, depending on the fraction of the SRC pairs in the target nucleus. We give an estimate of the cross sections in the sub-threshold region both for $J/ψ$ and $Υ$. This may be helpful for future measurements at JLab as well as at the Electron-Ion Collider in the U.S., and especially in China.