Researcher profile

Guoqing Zhao

Guoqing Zhao contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 15 - Unverified
Verification L1
Unclaimed author
3 works
0 followers
6 topics
4 close collaborators

Published work

3 published items

Preprint, 2024, arXiv

SELM: Speech Enhancement Using Discrete Tokens and Language Models

Language models (LMs) have shown superior performances in various speech generation tasks recently, demonstrating their powerful ability for semantic context modeling. Given the intrinsic similarity between speech generation and speech enhancement, harnessing semantic information holds potential advantages for speech enhancement tasks. In light of this, we propose SELM, a novel paradigm for speech enhancement, which integrates discrete tokens and leverages language models. SELM comprises three stages: encoding, modeling, and decoding. We transform continuous waveform signals into discrete tokens using pre-trained self-supervised learning (SSL) models and a k-means tokenizer. Language models then capture comprehensive contextual information within these tokens. Finally, a detokenizer and HiFi-GAN restore them into enhanced speech. Experimental results demonstrate that SELM achieves comparable performance in objective metrics alongside superior results in subjective perception. Our demos are available at https://honee-w.github.io/SELM/.

Preprint, 2022, arXiv

TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation

The research focus of scene text detection and recognition has shifted to arbitrary shape text in recent years, where the text shape representation is a fundamental problem. An ideal representation should be compact, complete, efficient, and reusable for subsequent recognition in our opinion. However, previous representations have flaws in one or more aspects. Thin-Plate-Spline (TPS) transformation has achieved great success in scene text recognition. Inspired by this, we reversely think of its usage and sophisticatedly take TPS as an exquisite representation for arbitrary shape text representation. The TPS representation is compact, complete, and efficient. With the predicted TPS parameters, the detected text region can be directly rectified to a near-horizontal one to assist the subsequent recognition. To further exploit the potential of the TPS representation, the Border Alignment Loss is proposed. Based on these designs, we implement the text detector TPSNet, which can be extended to a text spotter conveniently. Extensive evaluation and ablation on several public benchmarks demonstrate the effectiveness and superiority of the proposed method for text representation and spotting. Particularly, TPSNet achieves the detection F-Measure improvement of 4.4% (78.4% vs. 74.0%) on Art dataset and the end-to-end spotting F-Measure improvement of 5.0% (78.5% vs. 73.5%) on Total-Text, which are large margins with no bells and whistles.

Preprint, 2021, arXiv

Growth of Outward Propagating Fast-Magnetosonic/Whistler Waves in the Inner Heliosphere Observed by Parker Solar Probe

The solar wind in the inner heliosphere has been observed by Parker Solar Probe (PSP) to exhibit abundant wave activities. The cyclotron wave modes in the sense of ions or electrons are among the most crucial wave components. However, their origin and evolution in the inner heliosphere close to the Sun remain mysteries. Specifically, it remains unknown whether it is an emitted signal from the solar atmosphere or an eigenmode growing locally in the heliosphere due to plasma instability. To address and resolve this controversy, we must investigate the key quantity of the energy change rate of the wave mode. We develop a new technique to measure the energy change rate of plasma waves, and apply this technique to the wave electromagnetic fields measured by PSP. We provide the wave Poynting flux in the solar wind frame, identify the wave nature to be the outward propagating fast-magnetosonic/whistler wave mode instead of the sunward propagating waves. We provide the first evidence for growth of the fast-magnetosonic/whistler wave mode in the inner heliosphere based on the derived spectra of the real and imaginary parts of the wave frequencies. The energy change rate rises and stays at a positive level in the same wavenumber range as the bumps of the electromagnetic field power spectral densities, clearly manifesting that the observed fast-magnetosonic/whistler waves are locally growing to a large amplitude.