Source author record

Xiang Kong

Xiang Kong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Computation and Language cond-mat.mes-hall

Catalog footprint

What is connected

6 works

2 topics

4 close collaborators

preprint, 2022, arXiv

Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders

Recent work in multilingual translation advances translation quality surpassing bilingual baselines using deep transformer models with increased capacity. However, the extra latency and memory costs introduced by this approach may make it unacceptable for efficiency-constrained applications. It has recently been shown for bilingual translation that using a deep encoder and shallow decoder (DESD) can reduce inference latency while maintaining translation quality, so we study similar speed-accuracy trade-offs for multilingual translation. We find that for many-to-one translation we can indeed increase decoder speed without sacrificing quality using this approach, but for one-to-many translation, shallow decoders cause a clear quality drop. To ameliorate this drop, we propose a deep encoder with multiple shallow decoders (DEMSD) where each shallow decoder is responsible for a disjoint subset of target languages. Specifically, the DEMSD model with 2-layer decoders is able to obtain a 1.8x speedup on average compared to a standard transformer model with no drop in translation quality.

preprint, 2020, arXiv

Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade

Fully non-autoregressive neural machine translation (NAT) is proposed to simultaneously predict tokens with a single forward pass of neural networks, which significantly reduces the inference latency at the expense of a quality drop compared to the Transformer baseline. In this work, we aim to close the performance gap while maintaining the latency advantage. We first inspect the fundamental issues of fully NAT models, and adopt dependency reduction in the learning space of output tokens as the basic guidance. Then, we revisit methods in four different aspects that have been proven effective for improving NAT models, and carefully combine these techniques with necessary modifications. Our extensive experiments on three translation benchmarks show that the proposed system achieves new state-of-the-art results for fully NAT models, and obtains comparable performance with the autoregressive and iterative NAT systems. For instance, one of the proposed models achieves 27.49 BLEU points on WMT14 En-De with an approximately 16.5x speedup at inference time.

preprint, 2020, arXiv

SCDE: Sentence Cloze Dataset with High Quality Distractors From Examinations

We introduce SCDE, a dataset to evaluate the performance of computational models through sentence prediction. SCDE is a human-created sentence cloze dataset, collected from public school English examinations. Our task requires a model to fill in multiple blanks in a passage from a shared candidate set with distractors designed by English teachers. Experimental results demonstrate that this task requires the use of non-local, discourse-level context beyond the immediate sentence neighborhood. The blanks require joint solving and significantly impair each other's context. Furthermore, through ablations, we show that the distractors are of high quality and make the task more challenging. Our experiments show that there is a significant performance gap between advanced models (72%) and humans (87%), encouraging future models to bridge this gap.

preprint, 2016, arXiv

Evaluating Automatic Speech Recognition Systems in Comparison With Human Perception Results Using Distinctive Feature Measures

This paper describes methods for evaluating automatic speech recognition (ASR) systems in comparison with human perception results, using measures derived from linguistic distinctive features. Error patterns in terms of manner, place and voicing are presented, along with an examination of confusion matrices via a distinctive-feature-distance metric. These evaluation methods contrast with conventional performance criteria that focus on the phone or word level, and are intended to provide a more detailed profile of ASR system performance, as well as a means for direct comparison with human perception results at the sub-phonemic level.

preprint, 2016, arXiv

Nature of excitons bound to inversion domain boundaries: Origin of the 3.45-eV luminescence lines in spontaneously formed GaN nanowires on Si(111)

We investigate the 3.45-eV luminescence band of spontaneously formed GaN nanowires on Si(111) by photoluminescence and cathodoluminescence spectroscopy. This band is found to be particularly prominent for samples synthesized at comparatively low temperatures. At the same time, these samples exhibit a peculiar morphology, namely, isolated long nanowires are interspersed within a dense matrix of short ones. Cathodoluminescence intensity maps reveal the 3.45-eV band to originate primarily from the long nanowires. Transmission electron microscopy shows that these long nanowires are either Ga polar and are joined by an inversion domain boundary with their short N-polar neighbors, or exhibit a Ga-polar core surrounded by a N-polar shell with a tubular inversion domain boundary at the core/shell interface. For samples grown at high temperatures, which exhibit a uniform nanowire morphology, the 3.45-eV band is also found to originate from particular nanowires in the ensemble and thus presumably from inversion domain boundaries stemming from the coexistence of N- and Ga-polar nanowires. For several of the investigated samples, the 3.45-eV band splits into a doublet. We demonstrate that the higher-energy component of this doublet arises from the recombination of two-dimensional excitons free to move in the plane of the inversion domain boundary. In contrast, the lower-energy component of the doublet originates from excitons localized in the plane of the inversion domain boundary. We propose that this in-plane localization is due to shallow donors in the vicinity of the inversion domain boundaries.

preprint, 2012, arXiv

Residual disorder and diffusion in thin Heusler alloy films

Co2FeSi/GaAs(110) and Co2FeSi/GaAs(111)B hybrid structures were grown by molecular-beam epitaxy and characterized by transmission electron microscopy (TEM) and X-ray diffraction. The films contained inhomogeneous distributions of ordered L2_1 and B2 phases. The average stoichiometry was controlled by lattice parameter measurements; however, diffusion processes lead to inhomogeneities of the atomic concentrations and to degradation of the interface, influencing long-range order. An average long-range order of 30-60% was measured by grazing-incidence X-ray diffraction, i.e., the as-grown Co2FeSi films were highly but not fully ordered. Lateral inhomogeneities of the spatial distribution of long-range order in Co2FeSi were found using dark-field TEM images taken with superlattice reflections.