Source author record

Peng-Jen Chen

Peng-Jen Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Topics: Computation and Language, eess.AS, cond-mat.mes-hall, cond-mat.str-el, Machine Learning, Sound, Artificial Intelligence, cond-mat.mtrl-sci, cond-mat.supr-con, physics.chem-ph, quant-ph

Catalog footprint

What is connected

11 works

11 topics

4 close collaborators

Preprint, 2022, arXiv

Accurate and Efficient Quantum Computations of Molecular Properties Using Daubechies Wavelet Molecular Orbitals: A Benchmark Study against Experimental Data

Although quantum computation (QC) is regarded as a promising numerical method for computational quantum chemistry, current applications of quantum-chemistry calculations on quantum computers are limited to small molecules. This limitation can be ascribed to technical problems in building and manipulating more qubits and the associated complicated operations of quantum gates in a quantum circuit when the size of the molecular system becomes large. As a result, reducing the number of required qubits is necessary to make QC practical. Currently, the minimal STO-3G basis set is commonly used in benchmark studies because it requires the minimum number of spin orbitals. Nonetheless, the accuracy of using STO-3G is generally low and thus cannot provide useful predictions. We propose to adopt Daubechies wavelet functions as an accurate and efficient method for QCs of molecular electronic properties. We demonstrate that a minimal basis set constructed from a Daubechies wavelet basis can yield accurate results through a better description of the molecular Hamiltonian, while keeping the number of spin orbitals minimal. With the improved Hamiltonian through Daubechies wavelets, we calculate vibrational frequencies for H$_2$ and LiH using a quantum-computing algorithm to show that the results are in excellent agreement with experimental data. As a result, we achieve quantum calculations in which accuracy is comparable with that of the full configuration interaction calculation using the cc-pVDZ basis set, whereas the computational cost is the same as that of an STO-3G calculation. Thus, our work provides a more efficient and accurate representation of the molecular Hamiltonian for efficient QCs of molecular systems, and for the first time demonstrates that predictions in agreement with experimental measurements can be achieved with quantum resources available in near-term quantum computers.

Preprint, 2022, arXiv

Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention

We present a direct simultaneous speech-to-speech translation (Simul-S2ST) model in which the generation of translation is independent from intermediate text representations. Our approach leverages recent progress on direct speech-to-speech translation with discrete units, in which a sequence of discrete representations, instead of continuous spectrogram features, learned in an unsupervised manner, is predicted by the model and passed directly to a vocoder for on-the-fly speech synthesis. We also introduce variational monotonic multihead attention (V-MMA) to handle the challenge of inefficient policy learning in simultaneous speech translation. The simultaneous policy then operates on source speech features and target discrete units. We carry out empirical studies to compare cascaded and direct approaches on the Fisher Spanish-English and MuST-C English-Spanish datasets. The direct simultaneous model is shown to outperform the cascaded model by achieving a better tradeoff between translation quality and latency.

Preprint, 2022, arXiv

Direct speech-to-speech translation with discrete units

We present a direct speech-to-speech translation (S2ST) model that translates speech from one language to speech in another language without relying on intermediate text generation. We tackle the problem by first applying a self-supervised discrete speech encoder on the target speech and then training a sequence-to-sequence speech-to-unit translation (S2UT) model to predict the discrete representations of the target speech. When target text transcripts are available, we design a joint speech and text training framework that enables the model to generate dual modality output (speech and text) simultaneously in the same inference pass. Experiments on the Fisher Spanish-English dataset show that the proposed framework yields improvement of 6.7 BLEU compared with a baseline direct S2ST model that predicts spectrogram features. When trained without any text transcripts, our model performance is comparable to models that predict spectrograms and are trained with text supervision, showing the potential of our system for translation between unwritten languages. Audio samples are available at https://facebookresearch.github.io/speech_translation/direct_s2st_units/index.html .

Preprint, 2022, arXiv

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Direct speech-to-speech translation (S2ST) models suffer from data scarcity issues as there exists little parallel S2ST data, compared to the amount of data available for conventional cascaded systems that consist of automatic speech recognition (ASR), machine translation (MT), and text-to-speech (TTS) synthesis. In this work, we explore self-supervised pre-training with unlabeled speech data and data augmentation to tackle this issue. We take advantage of a recently proposed speech-to-unit translation (S2UT) framework that encodes target speech into discrete representations, and transfer pre-training and efficient partial finetuning techniques that work well for speech-to-text translation (S2T) to the S2UT domain by studying both speech encoder and discrete unit decoder pre-training. Our experiments on Spanish-English translation show that self-supervised pre-training consistently improves model performance compared with multitask learning with an average 6.6-12.1 BLEU gain, and it can be further combined with data augmentation techniques that apply MT to create weakly supervised training data. Audio samples are available at: https://facebookresearch.github.io/speech_translation/enhanced_direct_s2st_units/index.html .

Preprint, 2022, arXiv

Textless Speech-to-Speech Translation on Real Data

We present a textless speech-to-speech translation (S2ST) system that can translate speech from one language into another language and can be built without any text data. Unlike existing work in the literature, we tackle the challenge of modeling multi-speaker target speech and train the systems with real-world S2ST data. The key to our approach is a self-supervised unit-based speech normalization technique, which finetunes a pre-trained speech encoder with paired audio from multiple speakers and a single reference speaker to reduce the variations due to accents, while preserving the lexical content. With only 10 minutes of paired data for speech normalization, we obtain on average a 3.2 BLEU gain when training the S2ST model on the VoxPopuli S2ST dataset, compared to a baseline trained on un-normalized speech targets. We also incorporate automatically mined S2ST data and show an additional 2.0 BLEU gain. To our knowledge, we are the first to establish a textless S2ST technique that can be trained with real-world data and works for multiple language pairs. Audio samples are available at https://facebookresearch.github.io/speech_translation/textless_s2st_real_data/index.html .

Preprint, 2020, arXiv

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

Recent work demonstrates the potential of multilingual pretraining for creating one model that can be used for various tasks in different languages. Previous work in multilingual pretraining has demonstrated that machine translation systems can be created by finetuning on bitext. In this work, we show that multilingual translation models can be created through multilingual finetuning. Instead of finetuning on one direction, a pretrained model is finetuned on many directions at the same time. Compared to multilingual models trained from scratch, starting from pretrained models incorporates the benefits of large quantities of unlabeled monolingual data, which is particularly important for low resource languages where bitext is not available. We demonstrate that pretrained models can be extended to incorporate additional languages without loss of performance. We double the number of languages in mBART to support multilingual machine translation models of 50 languages. Finally, we create the ML50 benchmark, covering low, mid, and high resource languages, to facilitate reproducible research by standardizing training and evaluation data. On ML50, we demonstrate that multilingual finetuning improves on average by 1 BLEU over the strongest baselines (being either multilingual from scratch or bilingual finetuning) while improving by 9.3 BLEU on average over bilingual baselines from scratch.

Preprint, 2020, arXiv

The Source-Target Domain Mismatch Problem in Machine Translation

While we live in an increasingly interconnected world, different places still exhibit strikingly different cultures, and many events we experience in our everyday life pertain only to the specific place we live in. As a result, people often talk about different things in different parts of the world. In this work we study the effect of local context in machine translation and postulate that, particularly in low resource settings, this causes the domains of the source and target language to greatly mismatch, as the two languages are often spoken in regions of the world that are farther apart, with more distinctive cultural traits and unrelated local events. We first formalize the concept of source-target domain mismatch, propose a metric to quantify it, and provide empirical evidence corroborating our intuition that organic text produced by people speaking very different languages exhibits the most dramatic differences. We conclude with an empirical study of how source-target domain mismatch affects training of machine translation systems for low resource language pairs. In particular, we find that it severely affects back-translation, but the degradation can be alleviated by combining back-translation with self-training and by increasing the relative amount of target side monolingual data.

Preprint, 2016, arXiv

Superconducting Topological Surface States in Non-centrosymmetric Bulk Superconductor PbTaSe2

The search for topological superconductors (TSCs) is one of the most urgent contemporary problems in condensed matter systems. TSCs are characterized by a full superconducting gap in the bulk and topologically protected gapless surface (or edge) states. Within each vortex core of TSCs, there exist zero-energy Majorana bound states, which are predicted to exhibit non-Abelian statistics and to form the basis of fault-tolerant quantum computation. So far, no stoichiometric bulk material exhibits the required topological surface states (TSSs) at the Fermi level combined with fully gapped bulk superconductivity. Here, we report atomic scale visualization of the TSSs of the non-centrosymmetric fully-gapped superconductor, PbTaSe2. Using quasiparticle scattering interference (QPI) imaging, we find two TSSs with a Dirac point at E ~ 1.0 eV, of which the inner TSS and partial outer TSS cross the Fermi level, on the Pb-terminated surface of this fully gapped superconductor. This discovery reveals PbTaSe2 as a promising candidate as a TSC.

Preprint, 2016, arXiv

Topological Dirac States and Pairing Correlations in the Non-Centrosymmetric Superconductor PbTaSe2

Superconductivity in topological band structures is a platform for many novel exotic quantum phenomena such as emergent supersymmetry. This potential nourishes the search for topological materials with intrinsic superconducting instabilities, in which Cooper pairing is introduced to electrons with helical spin texture such as the Dirac states of topological insulators and Dirac semimetals, forming a natural topological superconductor of the helical kind. We employ first-principles calculations, ARPES experiments and new theoretical analysis to reveal that PbTaSe2, a non-centrosymmetric superconductor, possesses a nonzero Z2 topological invariant and fully spin-polarized Dirac states. Moreover, we analyze the phonon spectrum of PbTaSe2 to show how superconductivity can emerge due to a stiffening of phonons by the Pb intercalation, which diminishes a competing charge-density-wave instability. Our work establishes PbTaSe2 as a stoichiometric superconductor with nontrivial Z2 topological band structure, and shows that it holds great promise for studying novel forms of topological superconductivity not realized previously.

Preprint, 2015, arXiv

Drumhead Surface States and Topological Nodal-Line Fermions in TlTaSe2

A topological nodal-line semimetal is a new condensed matter state with one-dimensional bulk nodal lines and two-dimensional drumhead surface bands. Based on first-principles calculations and our effective k·p model, we propose the existence of topological nodal-line fermions in the ternary transition-metal chalcogenide TlTaSe2. The noncentrosymmetric structure and strong spin-orbit coupling give rise to spinful nodal-line bulk states which are protected by a mirror reflection symmetry of this compound. This is remarkably distinguished from other proposed nodal-line semimetals such as Cu3NPb(Zn) in which nodal lines exist only in the limit of vanishing spin-orbit coupling. We show that the drumhead surface states in TlTaSe2, which are associated with the topological nodal lines, exhibit an unconventional chiral spin texture and an exotic Lifshitz transition as a consequence of the linkage among multiple drumhead surface-state pockets.

Preprint, 2015, arXiv

Two distinct topological phases in the mixed valence compound YbB6 and its differences from SmB6

We discuss the evolution of topological states and their orbital textures in the mixed valence compounds SmB6 and YbB6 within the framework of the generalized gradient approximation plus onsite Coulomb interaction (GGA+U) scheme for a wide range of values of U. In SmB6, the topological Kondo insulator (TKI) gap is found to be insensitive to the value of U, but in sharp contrast, Kondo physics in isostructural YbB6 displays a surprising sensitivity to U. In particular, as U is increased in YbB6, the correlated TKI state in the weak-coupling regime transforms into a d-p-type topological insulator phase with a band inversion between Yb-5d and B-2p orbitals in the intermediate coupling range, without closing the insulating energy gap throughout this process. Our theoretical predictions related to the TKI and non-TKI phases in SmB6 and YbB6 are in substantial accord with recent angle-resolved photoemission spectroscopy (ARPES) experiments.
