Researcher profile

Haitao Chen

Haitao Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion

Prosody modeling is important, but still challenging in expressive voice conversion. As prosody is difficult to model, and other factors, e.g., speaker, environment and content, which are entangled with prosody in speech, should be removed in prosody modeling. In this paper, we present IQDubbing to solve this problem for expressive voice conversion. To model prosody, we leverage the recent advances in discrete self-supervised speech representation (DSSR). Specifically, prosody vector is first extracted from pre-trained VQ-Wav2Vec model, where rich prosody information is embedded while most speaker and environment information are removed effectively by quantization. To further filter out the redundant information except prosody, such as content and partial speaker information, we propose two kinds of prosody filters to sample prosody from the prosody vector. Experiments show that IQDubbing is superior to baseline and comparison systems in terms of speech quality while maintaining prosody consistency and speaker similarity.

preprint2020arXiv

Exciton interaction induced spin splitting in MoS$_2$ monolayer

By pumping nonresonantly a MoS$_2$ monolayer at $13$ K under a circularly polarized cw laser, we observe exciton energy redshifts that break the degeneracy between B excitons with opposite spin. The energy splitting increases monotonically with the laser power reaching as much as $18$ meV, while it diminishes with the temperature. The phenomenon can be explained theoretically by considering simultaneously the bandgap renormalization which gives rise to the redshift and exciton-exciton Coulomb exchange interaction which is responsible for the spin-dependent splitting. Our results offer a simple scheme to control the valley degree of freedom in MoS$_2$ monolayer and provide an accessible method in investigating many-body exciton exciton interaction in such materials.