Source author record

Zi-Qiang Zhang

Zi-Qiang Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

hep-ph hep-th eess.AS nucl-th Sound eess.IV

Catalog footprint

What is connected

13works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition

Wav2vec2.0 is a popular self-supervised pre-training framework for learning speech representations in the context of automatic speech recognition (ASR). It was shown that wav2vec2.0 has a good robustness against the domain shift, while the noise robustness is still unclear. In this work, we therefore first analyze the noise robustness of wav2vec2.0 via experiments. We observe that wav2vec2.0 pre-trained on noisy data can obtain good representations and thus improve the ASR performance on the noisy test set, which however brings a performance degradation on the clean test set. To avoid this issue, in this work we propose an enhanced wav2vec2.0 model. Specifically, the noisy speech and the corresponding clean version are fed into the same feature encoder, where the clean speech provides training targets for the model. Experimental results reveal that the proposed method can not only improve the ASR performance on the noisy test set which surpasses the original wav2vec2.0, but also ensure a tiny performance decrease on the clean test set. In addition, the effectiveness of the proposed method is demonstrated under different types of noise conditions.

preprint2022arXiv

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

Speech enhancement (SE) is usually required as a front end to improve the speech quality in noisy environments, while the enhanced speech might not be optimal for automatic speech recognition (ASR) systems due to speech distortion. On the other hand, it was shown that self-supervised pre-training enables the utilization of a large amount of unlabeled noisy data, which is rather beneficial for the noise robustness of ASR. However, the potential of the (optimal) integration of SE and self-supervised pre-training still remains unclear. In order to find an appropriate combination and reduce the impact of speech distortion caused by SE, in this paper we therefore propose a joint pre-training approach for the SE module and the self-supervised model. First, in the pre-training phase the original noisy waveform or the waveform obtained by SE is fed into the self-supervised model to learn the contextual representation, where the quantified clean speech acts as the target. Second, we propose a dual-attention fusion method to fuse the features of noisy and enhanced speeches, which can compensate the information loss caused by separately using individual modules. Due to the flexible exploitation of clean/noisy/enhanced branches, the proposed method turns out to be a generalization of some existing noise-robust ASR models, e.g., enhanced wav2vec2.0. Finally, experimental results on both synthetic and real noisy datasets show that the proposed joint training approach can improve the ASR performance under various noisy settings, leading to a stronger noise robustness.

preprint2022arXiv

Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

With the advance in self-supervised learning for audio and visual modalities, it has become possible to learn a robust audio-visual speech representation. This would be beneficial for improving the audio-visual speech recognition (AVSR) performance, as the multi-modal inputs contain more fruitful information in principle. In this paper, based on existing self-supervised representation learning methods for audio modality, we therefore propose an audio-visual representation learning approach. The proposed approach explores both the complementarity of audio-visual modalities and long-term context dependency using a transformer-based fusion module and a flexible masking strategy. After pre-training, the model is able to extract fused representations required by AVSR. Without loss of generality, it can be applied to single-modal tasks, e.g. audio/visual speech recognition by simply masking out one modality in the fusion module. The proposed pre-trained model is evaluated on speech recognition and lipreading tasks using one or two modalities, where the superiority is revealed.

preprint2020arXiv

Effect of gluon condensate on holographic Schwinger effect

We perform the potential analysis in holographic Schwinger effect in a deformed anti-de Sitter (AdS) background with backreaction due to the gluon condensate. We determine the potential by analyzing the classical string action attaching on a probe D3-brane sitting at an intermediate position in the bulk AdS space. It is found that the inclusion of the gluon condensate reduces the production rate, reverse to the effect of the temperature. Also, we evaluate the critical electric field by Dirac-Born-Infeld (DBI) action.

preprint2020arXiv

Entropic destruction of heavy quarkonium in heavy quark cloud

Previous research has shown that the peak of the quarkonium entropy at the deconfinement transition would be related to the entropic force which induces the melting of quarkonium. In this article, we study the effect of backreaction on the entropic force in a strongly coupled plasma of adjoint matter. The backreaction covered here comes from the presence of static heavy quarks evenly distributed over such a plasma. It is found that the inclusion of backreaction increases the entropic force thus enhancing the quarkonium dissociation, in accord with the findings of the imaginary potential.

preprint2020arXiv

Holographic Schwinger effect in a soft wall AdS/QCD model

We perform the potential analysis for the holographic Schwinger effect in a deformed $AdS_5$ model with conformal invariance broken by a background dilaton. We evaluate the static potential by analyzing the classical action of a string attaching the rectangular Wilson loop on a probe D3 brane sitting at an intermediate position in the bulk AdS space. We observe that the inclusion of chemical potential tends to enhance the production rate, reverse to the effect of confining scale. Also, we calculate the critical electric field by Dirac-Born-Infeld (DBI) action.

preprint2020arXiv

Jet quenching parameter from a soft wall AdS/QCD model

We study the effect of chemical potential and nonconformality on the jet quenching parameter in a holographic QCD model with conformal invariance broken by a background dilaton. It turns out that the presence of chemical potential and nonconformality both increase the jet quenching parameter thus enhancing the energy loss, consistently with the findings of the drag force.

preprint2016arXiv

Entropic destruction of a rotating heavy quarkonium

Using the AdS/CFT duality, we study the destruction of a rotating heavy quarkonium due to the entropice force in $\mathcal{N}=4$ SYM theory and a confining YM theory. It is shown that in both theories increasing the angular velocity leads to decreasing the entropic force. This result implies that the rotating quarkonium dissociates harder than the static case.

preprint2016arXiv

Holographic Schwinger effect in a confining D3-brane background with chemical potential

Using the AdS/CFT correspondence, we investigate the Schwinger effect in a confining D3-brane background with chemical potential. The potential between a test particle pair on the D3-brane in an external electric field is obtained. The critical field $E_c$ in this case is calculated. Also, we apply numerical method to evaluate the production rate for various cases. The results imply that the presence of chemical potential tends to suppress the pair production effect.

preprint2016arXiv

R^2 corrections to the jet quenching parameter

A calculation of the $R^2$ corrections to the jet quenching parameter from AdS/CFT correspondence is presented. These corrections are related to curvature-squared corrections in the corresponding gravity dual. It is shown that the corrections will increase or decrease the jet quenching parameter depending on the coefficients of the high curvature terms.

preprint2015arXiv

Melting temperature of heavy quarkonium with a holographic potential up to sub-leading order

A calculation of the melting temperatures of heavy quarkonium states with the holographic potential was introduced in a previous work. In this paper, we consider the holographic potential at sub-leading order, which permits finite coupling corrections to be taken into account. It is found that this correction lowers the dissociation temperatures of heavy quarkonium.

preprint2012arXiv

The finite 't Hooft coupling correction on the jet quenching parameter in a $\mathcal N=4$ Super Yang-Mills Plasma

We derive the quadratic action of the fluctuations around the classical world sheet underlying the jet quenching from AdS/CFT. After obtaining the correspondence partition function, the expansion of the jet quenching parameter of $\mathcal N=4$ super symmetric Yang-Mills theory is carried out to the sub-leading term in the large 't Hooft coupling $λ$ at a nonzero temperature. The strong coupling corresponds to the semi-classical expansion of the string-sigma model, the gravity dual of the Wilson loop operator, with the sub-leading term expressed in terms of functional determinants of fluctuations. The contribution of these determinants are evaluated numerically. We find the jet quenching parameter is reduced due to world sheet fluctuations by a factor $(1-1.97λ^{-1/2}) $

preprint2011arXiv

The Subleading Term of the Strong Coupling Expansion of the Heavy-Quark Potential in a $\mathcal N=4$ Super Yang-Mills Plasma

Applying the AdS/CFT correspondence, the expansion of the heavy-quark potential of the ${\cal N}$ supersymmetric Yang-Mills theory at large $N_c$ is carried out to the sub-leading term in the large 't Hooft coupling at nonzero temperatures. The strong coupling corresponds to the semi-classical expansion of the string-sigma model, the gravity dual of the Wilson loop operator, with the sub-leading term expressed in terms of functional determinants of fluctuations. The contributions of these determinants are evaluated numerically.

Zi-Qiang Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition

Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR

Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition

Effect of gluon condensate on holographic Schwinger effect

Entropic destruction of heavy quarkonium in heavy quark cloud

Holographic Schwinger effect in a soft wall AdS/QCD model

Jet quenching parameter from a soft wall AdS/QCD model

Entropic destruction of a rotating heavy quarkonium

Holographic Schwinger effect in a confining D3-brane background with chemical potential

R^2 corrections to the jet quenching parameter

Melting temperature of heavy quarkonium with a holographic potential up to sub-leading order

The finite 't Hooft coupling correction on the jet quenching parameter in a $\mathcal N=4$ Super Yang-Mills Plasma

The Subleading Term of the Strong Coupling Expansion of the Heavy-Quark Potential in a $\mathcal N=4$ Super Yang-Mills Plasma