Researcher profile

Ya Li

Ya Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2025arXiv

Cabibbo-suppressed charged-current semileptonic decays of $Ξ_b$ baryons

We present the first perturbative QCD calculations of the $Ξ_b \to (Λ, Σ)$ transition form factors at leading order in $α_s$, which govern the Cabibbo-suppressed semileptonic decays $Ξ_b \to (Λ, Σ)\ell ν_\ell$ with $\ell = e, μ, τ$. Using these form factors, we evaluate differential and integrated branching fractions and angular observables within the helicity formalism. The branching ratios are predicted to be of order $10^{-4}$ for $Σ$ final states and $10^{-5}$ for $Λ$ final states, making them accessible to ongoing experiments such as LHCb. Ratios of decay rates between $τ$ and $e$ channels are also provided, offering new probes of lepton-flavor universality. Lepton-mass effects are found to significantly impact the integrated angular observables. Furthermore, a combined analysis of $b \to u$ and $b \to c$ transitions in $Ξ_b$ decays yields sub-percent precision for the ratios $\mathcal{R}_\ell(Σ/Ξ_c)$, enabling an independent determination of $|V_{ub}/V_{cb}|$ once the relevant decay-rate measurements become available.

preprint2024arXiv

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

Recent advancements in diffusion models and large language models (LLMs) have significantly propelled the field of AIGC. Text-to-Audio (TTA), a burgeoning AIGC application designed to generate audio from natural language prompts, is attracting increasing attention. However, existing TTA studies often struggle with generation quality and text-audio alignment, especially for complex textual inputs. Drawing inspiration from state-of-the-art Text-to-Image (T2I) diffusion models, we introduce Auffusion, a TTA system adapting T2I model frameworks to TTA task, by effectively leveraging their inherent generative strengths and precise cross-modal alignment. Our objective and subjective evaluations demonstrate that Auffusion surpasses previous TTA approaches using limited data and computational resource. Furthermore, previous studies in T2I recognizes the significant impact of encoder choice on cross-modal alignment, like fine-grained details and object bindings, while similar evaluation is lacking in prior TTA works. Through comprehensive ablation studies and innovative cross-attention map visualizations, we provide insightful assessments of text-audio alignment in TTA. Our findings reveal Auffusion's superior capability in generating audios that accurately match textual descriptions, which further demonstrated in several related tasks, such as audio style transfer, inpainting and other manipulations. Our implementation and demos are available at https://auffusion.github.io.

preprint2024arXiv

CRB Minimization for RIS-aided mmWave Integrated Sensing and Communications

In this paper, reconfigurable intelligent surface (RIS) is employed in a millimeter wave (mmWave) integrated sensing and communications (ISAC) system. To alleviate the multi-hop attenuation, the semi-self sensing RIS approach is adopted, wherein sensors are configured at the RIS to receive the radar echo signal. Focusing on the estimation accuracy, the Cramer-Rao bound (CRB) for estimating the direction-of-the-angles is derived as the metric for sensing performance. A joint optimization problem on hybrid beamforming and RIS phaseshifts is proposed to minimize the CRB, while maintaining satisfactory communication performance evaluated by the achievable data rate. The CRB minimization problem is first transformed as a more tractable form based on Fisher information matrix (FIM). To solve the complex non-convex problem, a double layer loop algorithm is proposed based on penalty concave-convex procedure (penalty-CCCP) and block coordinate descent (BCD) method with two sub-problems. Successive convex approximation (SCA) algorithm and second order cone (SOC) constraints are employed to tackle the non-convexity in the hybrid beamforming optimization. To optimize the unit modulus constrained analog beamforming and phase shifts, manifold optimization (MO) is adopted. Finally, the numerical results verify the effectiveness of the proposed CRB minimization algorithm, and show the performance improvement compared with other baselines. Additionally, the proposed hybrid beamforming algorithm can achieve approximately 96% of the sensing performance exhibited by the full digital approach within only a limited number of radio frequency (RF) chains.

preprint2022arXiv

$CP$-violating observables in four-body $B\rightarrow ϕ(\rightarrow K\bar K)K^*(\rightarrow Kπ)$ decays

We analyse the four-body $B\rightarrow ϕ(\rightarrow K\bar K)K^*(\rightarrow Kπ)$ decays in the perturbative QCD approach,where the invariant mass of $K\bar K$($Kπ$) system is limited in a window of $\pm 15$ MeV ($\pm150$ MeV) around the $ϕ(K^*(892))$ mass.In addition to the P wave resonances,two important S wave backgrounds in the selected invariant mass region are also accounted for. Angular momentum conservation allows six helicity amplitudes to contribute,including three P waves, two single S waves,and one double S wave. We calculated the branching ratio for each component and found sizable S wave contributions,coincide with the experimental observation.The obtained branching ratios of $B^{0(+)}\rightarrow ϕK^{*0(+)}$ are comparable with the previous predictions and support the measurements, whereas the predicted $\mathcal{B}(B^0_s\rightarrow ϕ\bar K^{*0})$ is smaller than the world average. The longitudinal polarizations are predicted to be around 0.7,consistent with previous PQCD results but larger than the data. Aside from the direct CP asymmetries,the true and fake triple product asymmetries(TPAs) are calculated in this work. In the case of neutral modes, both direct CP asymmetries and true TPAs are expected to be zero due to the vanishing weak phase difference. The direct CP asymmetries for the $B^+$ mode are predicted to be tiny,since the tree contributions are suppressed with respect to the penguin ones. The true asymmetries have shown no significant deviations from zero.In contrast,large fake asymmetries are observed in these decays,indicating the presence of significant final state interactions.We give the predictions of the S wave induced TPAs for the first time,which is consistent with LHCb data and would be checked with future measurements from Belle and BABAR experiments if the S wave components can be properly taken into account in angular analysis.

preprint2022arXiv

$Λ_b\to p$ transition form factors in perturbative QCD

We reanalyze the $Λ_b\to p$ transition form factors in the perturbative QCD (PQCD) approach by including higher-twist light-cone distribution amplitudes (LCDAs) of a $Λ_b$ baryon and a proton. The previous PQCD evaluation performed decades ago with only the leading-twist $Λ_b$ baryon and proton LCDAs gave the form factors, which are two orders of magnitude smaller than indicated by experimental data. We find that the twist-4 $Λ_b$ baryon LCDAs and the twist-4 and -5 proton LCDAs contribute dominantly, and the enhanced form factors become consistent with those from lattice QCD and other nonperturbative methods. The estimated branching ratios of the semileptonic decays $Λ_b\to p\ell\barν_\ell$ and the hadronic decay $Λ_b\to pπ$ are also close to the data. It implies that the $b$ quark mass is not really heavy enough, and higher-power contributions play a crucial role, similar to the observation made in analyses of $B$ meson transition form factors. With the formalism established in this work, we are ready to study various exclusive heavy baryon decays systematically in the PQCD approach.

preprint2022arXiv

ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis

In recent years, neural network based methods for multi-speaker text-to-speech synthesis (TTS) have made significant progress. However, the current speaker encoder models used in these methods still cannot capture enough speaker information. In this paper, we focus on accurate speaker encoder modeling and propose an end-to-end method that can generate high-quality speech and better similarity for both seen and unseen speakers. The proposed architecture consists of three separately trained components: a speaker encoder based on the state-of-the-art ECAPA-TDNN model which is derived from speaker verification task, a FastSpeech2 based synthesizer, and a HiFi-GAN vocoder. The comparison among different speaker encoder models shows our proposed method can achieve better naturalness and similarity. To efficiently evaluate our synthesized speech, we are the first to adopt deep learning based automatic MOS evaluation methods to assess our results, and these methods show great potential in automatic speech quality assessment.

preprint2022arXiv

Study of $B_{(s)}^0 \to ϕϕ\to (K^+K^-)(K^+K^-)$ decays in the perturbative QCD approach

In this work, we make a detailed analysis on the penguin-dominant processes $B_{(s)}^0 \to ϕϕ\to (K^+K^-)(K^+K^-)$ in the perturbative QCD (PQCD) approach. In addition to the dominant $P$-wave resonance, the scalar background $f_0(980) \to K^+K^-$ is also accounted for. We improve the Gegenbauer moments in $KK$ two-meson distribution amplitudes by fitting the PQCD factorization formulas to measured branching ratios of three-body and four-body $B$ decays. We extract the branching ratios of two-body $B_{(s)}^0 \to ϕϕ$ decays from the corresponding four-body decay modes and calculate the relevant polarization fractions together with two relative phases $ϕ_{\parallel,\perp}$, which are consistent with the previous theoretical predictions. The PQCD predictions for the "true" triple product asymmetries (TPAs) are zero which are expected in the standard model due to the vanishing weak phase difference, and support the current data reported by the CDF and LHCb Collaborations. A large "fake" TPA $\mathcal{A}_\text{T-fake}^1=30.4\%$ of the decay $B^0_s \to ϕϕ\to (K^+K^-)(K^+K^-)$ is predicted for the first time, which indicates the presence of the significant final-state interactions. The TPAs of the rare decay channel $B^0 \to ϕϕ\to (K^+K^-)(K^+K^-)$ are also predicted and can be tested in the near future.

preprint2021arXiv

Generation of entanglement between a highly wave-packet-tunable photon and a spin-wave memory in cold atoms

Controls of waveforms (pulse durations) of single photons are important tasks for effectively interconnecting disparate atomic memories in hybrid quantum networks. So far, the waveform control of single photon that is entangled with an atomic memory remains unexplored. Here, we demonstrated control of waveform length of the photon that is entangled with an atomic spin-wave memory by varying light-atom interaction time in cold atoms. The Bell parameter S as a function of the duration of photon pulse is measured, which shows that violations of Bell equality can be achieved for the photon pulse in the duration range from 40 ns to 50 us, where, S=2.64+/-0.02 and S=2.26+/-0.05 for the 40-ns and 50-μs durations, respectively. The measured results show that S parameter decreases with the increase in the pulse duration. We confirm that the increase in photon noise probability per pulse with the pulse-duration is responsible for the S decrease.

preprint2021arXiv

Noise suppression in a temporal-multimode quantum memory entangled with a photon via asymmetrical photon-collection channel

Quantum interfaces (QIs) that generate entanglement between a multimode atomic memory and a photon forms a multiplexed repeater node and hold promise to greatly improve quantum repeater rates. Recently, the temporal multimode spin-wave memory that is entangled with a photon has been demonstrated with cold atoms. However, due to additional noise generated in multimode operation, the fidelity of spin-wave-photon entanglement significantly decreases with the mode number. So far, the improvement on temporal-multimode entanglement fidelity via suppressing the additional noise remains unexplored. Here, we propose and experimentally demonstrate a scheme that can suppress the additional noise of a temporally-multiplexed QI. The scheme uses an asymmetric channel to collect the photons coming and retrieving from the temporally-multiplexed QI. For making comparisons, we also set up a QI that uses symmetric channel for the photon collections. When the QIs store 14 modes, the measured Bell parameter S for the QIs using the asymmetric and the symmetric photon-collection channels are 2.36+/-0.03 and 2.24+/-0.04, respectively, showing that the QI using the asymmetric channel gives rise to a 3% increase in entanglement fidelity, i.e., a 1.7-fold decrease in the additional noise, compared with the QI using the symmetric one. On the other hand, the 14-mode entanglement QIs that use the asymmetric and symmetric collections preserve the violation of a Bell inequality for storage times up to 25 us and 20 us, respectively, showing that the asymmetric QI has a higher entanglement storage performance.

preprint2020arXiv

$P$-wave contributions to $B_{(s)}\toψKπ$ decays in perturbative QCD approach

In this work, we studied the quasi-two-body decays $B_{(s)} \to ψ[K^*(892), K^*(1410),$ $K^*(1680)] \to ψKπ$ by employing the perturbative QCD (PQCD) factorization approach, where the charmonia $ψ$ represents $J/ψ$ and $ψ(2S)$. The corresponding decay channels are studied by constructing the kaon-pion distribution amplitude (DA) $Φ_{K π}^{\rm P}$, which contained the important final state interactions between the kaon and pion in the resonant region. The relativistic Breit-Wigner formulas are adopted to parameterize the time-like form factor $F_{Kπ}$ appeared in the kaon-pion DAs. The SU(3) flavor symmetry breaking effect resulting from the mass difference between kaon and pion is taken into account, which makes significant contributions to the longitudinal polarizations. We accommodate well the observed branching ratios and the polarization fractions of the $B_{(s)} \to ψK^*(892) \to ψKπ$ by tuning the hadronic parameters for the kaon-pion DAs. The PQCD predictions for the $B_{(s)} \to ψ[K^*(1410), K^*(1680)] \to ψKπ$ modes from the same set of parameters can be tested by the future precise data from the LHCb and the Belle II experiments.

preprint2020arXiv

$S$, $P$ and $D$-wave resonance contributions to $B_{(s)} \to η_c(1S,2S) Kπ$ decays in the perturbative QCD approach

In this work, we analyze the three-body $B_{(s)} \to η_c(1S,2S) K π$ decays within the framework of the perturbative QCD approach (PQCD) under the quasi-two-body approximation, where the kaon-pion invariant mass spectra are dominated by the $K_0^*(1430)^0,K_0^*(1950)^0,K^*(892)^0,K^*(1410)^0,K^*(1680)^0$ and $K_2^*(1430)^0$ resonances. The time-like form factors are adopted to parametrize the corresponding $S$, $P$, $D$-wave kaon-pion distribution amplitudes for the concerned decay modes, which describe the final-state interactions between the kaon and pion in the resonant region. The $Kπ$ $S$-wave component at low $Kπ$ mass is described by the LASS line shape, while the time-like form factors of other resonances are modeled by the relativistic Breit-Wigner function. We find the following main points: (a) the PQCD predictions of the branching ratios for most considered $B \to η_c(1S)(K^{*0}\to )K^+π^-$ decays agree well with the currently available data within errors; (b) for ${\cal B}(B^0 \to η_c (K_0^*(1430)\to )K^+π^-)$ and ${\cal B}(B^0 \to η_c K^+π^-({\rm NR}))$ (here NR means nonresonant), our predictions of the branching ratios are a bit smaller than the measured ones; and (c) the PQCD results for the $D$-wave contributions considered in this work can be tested once the precise data from the future LHCb and Belle-II experiments are available.

preprint2020arXiv

Resonant contributions to three-body $B_{(s)} \to [ D^{(*)}, \bar{D}^{(*)} ] K^+K^-$ decays in the perturbative QCD approach

In this work, we study the $S$, $P$ and $D$ wave resonance contributions to three-body decays $B_{(s)} \to [ D^{(*)}, \bar{D}^{(*)} ] K^+K^-$ by employing the perturbative QCD (PQCD) approach, where the kaon-kaon invariant mass spectra are dominated by the $f_0(980),f_0(1370),$ $ϕ(1020),ϕ(1680),f_2(1270),f^{\prime}_2(1525),f_2(1750)$ and $f_2(1950)$ resonances. The $KK$ $S$-wave component $f_0(980)$ is modeled with the Flatté formalism, while other resonances are described by the relativistic Breit-Wigner (BW) line shape. The corresponding decay channels are studied by constructing the kaon-kaon distribution amplitude $Φ_{KK}$, which captures important final state interactions between the kaon pair in the resonant region. We found that the PQCD predictions for the branching ratios for most considered decays agree with currently available data within errors. The associated polarization fractions of those vector-vector and vector-tensor decay modes are also predicted, which are expected to be tested in the near future experiments. The invariant mass spectra for the corresponding resonances in the $B_{(s)} \to [D^{(*)}, \bar{D}^{(*)} ] K^+K^-$ decays are well established, which can be confronted with the precise data from the LHCb and Belle II experiments.

preprint2020arXiv

Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking

The success of DNNs has driven the extensive applications of person re-identification (ReID) into a new era. However, whether ReID inherits the vulnerability of DNNs remains unexplored. To examine the robustness of ReID systems is rather important because the insecurity of ReID systems may cause severe losses, e.g., the criminals may use the adversarial perturbations to cheat the CCTV systems. In this work, we examine the insecurity of current best-performing ReID models by proposing a learning-to-mis-rank formulation to perturb the ranking of the system output. As the cross-dataset transferability is crucial in the ReID domain, we also perform a back-box attack by developing a novel multi-stage network architecture that pyramids the features of different levels to extract general and transferable features for the adversarial perturbations. Our method can control the number of malicious pixels by using differentiable multi-shot sampling. To guarantee the inconspicuousness of the attack, we also propose a new perception loss to achieve better visual quality. Extensive experiments on four of the largest ReID benchmarks (i.e., Market1501 [45], CUHK03 [18], DukeMTMC [33], and MSMT17 [40]) not only show the effectiveness of our method, but also provides directions of the future improvement in the robustness of ReID systems. For example, the accuracy of one of the best-performing ReID systems drops sharply from 91.8% to 1.4% after being attacked by our method. Some attack results are shown in Fig. 1. The code is available at https://github.com/whj363636/Adversarial-attack-on-Person-ReID-With-Deep-Mis-Ranking.