Source author record

Xin Qi

Xin Qi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher Unclaimed source record

math.ST Methodology Statistics Theory Artificial Intelligence Computer Vision eess.AS eess.IV physics.plasm-ph Sound

Catalog footprint

What is connected

7 works

9 topics

4 close collaborators

preprint 2022 arXiv

Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection

Although modern automatic speech recognition (ASR) systems can achieve high performance, they may produce errors that weaken readers' experience and harm downstream tasks. To improve the accuracy and reliability of ASR hypotheses, we propose a cross-modal post-processing system for speech recognizers, which 1) fuses acoustic features and textual features from different modalities, 2) joins a confidence estimator and an error corrector in a multi-task learning fashion and 3) unifies error correction and utterance rejection modules. Compared with single-modal or single-task models, our proposed system proves to be more effective and efficient. Experimental results show that our post-processing system leads to more than 10% relative reduction of character error rate (CER) for both single-speaker and multi-speaker speech on our industrial ASR system, with about 1.7 ms latency for each token, which ensures that the extra latency introduced by post-processing is acceptable in streaming speech recognition.
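The headline metric in this abstract, a relative reduction in character error rate, can be illustrated with a minimal sketch. It assumes the common definition of CER as character-level edit distance divided by reference length; the function names are hypothetical and this is not the authors' evaluation code.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(baseline: float, improved: float) -> float:
    """Relative reduction of an error rate, e.g. 0.10 for a 10% relative drop."""
    return (baseline - improved) / baseline
```

Under this definition, moving from a baseline CER of 0.10 to 0.09 is a 10% relative reduction, even though the absolute change is only one percentage point.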

preprint 2020 arXiv

AE-OT-GAN: Training GANs from data specific latent distribution

Though generative adversarial networks (GANs) are prominent models to generate realistic and crisp images, they often encounter mode collapse problems and are hard to train, which comes from approximating the intrinsic discontinuous distribution transform map with continuous DNNs. The recently proposed AE-OT model addresses this problem by explicitly computing the discontinuous distribution transform map through solving a semi-discrete optimal transport (OT) map in the latent space of the autoencoder. However, the generated images are blurry. In this paper, we propose the AE-OT-GAN model to utilize the advantages of both models: generate high quality images and at the same time overcome the mode collapse/mixture problems. Specifically, we first faithfully embed the low dimensional image manifold into the latent space by training an autoencoder (AE). Then we compute the optimal transport (OT) map that pushes forward the uniform distribution to the latent distribution supported on the latent manifold. Finally, our GAN model is trained to generate high quality images from the latent distribution, the distribution transform map from which to the empirical data distribution will be continuous. The paired data between the latent code and the real images gives us a further constraint on the generator. Experiments on the simple MNIST dataset and complex datasets like Cifar-10 and CelebA show the efficacy and efficiency of our proposed method.

preprint 2016 arXiv

Effects of beam velocity and density on an ion-beam pulse moving in magnetized plasmas

The wakefield and stopping power of an ion-beam pulse moving in magnetized plasmas are investigated by particle-in-cell (PIC) simulations. The effects of beam velocity and density on the wake and stopping power are discussed. In the presence of a magnetic field, it is found that besides the longitudinal conversed V-shaped wakes, strong whistler waves are observed when low-density and low-velocity pulses move in plasmas. The corresponding stopping powers are enhanced due to the drag of these whistler waves. As beam velocities increase, the whistler waves disappear, and only conversed V-shaped wakes are observed. The corresponding stopping powers are reduced compared with those in isotropic plasmas. When high-density pulses travel in the magnetized plasmas, the whistler waves are greatly inhibited for low-velocity pulses and disappear for high-velocity pulses. Additionally, the magnetic field reduces the stopping powers for all high-density cases.

preprint 2015 arXiv

Asymptotic optimality of sparse linear discriminant analysis with arbitrary number of classes

Many sparse linear discriminant analysis (LDA) methods have been proposed to overcome the major problems of the classic LDA in high-dimensional settings. However, the asymptotic optimality results are limited to the case that there are only two classes, which is due to the fact that the classification boundary of LDA is a hyperplane and explicit formulas exist for the classification error in this case. In the situation where there are more than two classes, the classification boundary is usually complicated and no explicit formulas for the classification errors exist. In this paper, we consider the asymptotic optimality in the high-dimensional settings for a large family of linear classification rules with arbitrary number of classes under the situation of multivariate normal distribution. Our main theorem provides easy-to-check criteria for the asymptotic optimality of a general classification rule in this family as dimensionality and sample size both go to infinity and the number of classes is arbitrary. We establish the corresponding convergence rates. The general theory is applied to the classic LDA and the extensions of two recently proposed sparse LDA methods to obtain the asymptotic optimality. We conduct simulation studies on the extended methods in various settings.
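For orientation, the classic LDA rule that this abstract starts from can be written in standard textbook notation. The symbols below (class means \mu_k, common covariance \Sigma, prior probabilities \pi_k, number of classes K) are assumptions for illustration and are not taken from the paper.

```latex
% Classic LDA with K classes: assign x to the class with the largest linear score.
\[
  \hat{k}(x) \;=\; \arg\max_{1 \le k \le K}
  \Big\{ x^{\top} \Sigma^{-1} \mu_k
         \;-\; \tfrac{1}{2}\, \mu_k^{\top} \Sigma^{-1} \mu_k
         \;+\; \log \pi_k \Big\}.
\]
% Two classes with equal priors: the boundary is a hyperplane and the
% misclassification probability has a closed form in the Mahalanobis distance.
\[
  \text{error} \;=\; \Phi\!\left(-\tfrac{\Delta}{2}\right),
  \qquad
  \Delta^{2} \;=\; (\mu_1 - \mu_2)^{\top} \Sigma^{-1} (\mu_1 - \mu_2),
\]
% where \Phi is the standard normal distribution function.
```

With K > 2 the decision regions are intersections of several half-spaces, so no comparably simple expression for the error is available, which is the difficulty the abstract describes.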

preprint 2015 arXiv

Signal extraction approach for sparse multivariate response regression

In this paper, we consider multivariate response regression models with high dimensional predictor variables. One way to model the correlation among the response variables is through the low rank decomposition of the coefficient matrix, which has been considered by several papers for the high dimensional predictors. However, all these papers focus on the singular value decomposition of the coefficient matrix. Our target is the decomposition of the coefficient matrix which leads to the best lower rank approximation to the regression function, the signal part in the response. Given any rank, this decomposition has nearly the smallest expected prediction error among all approximations to the coefficient matrix with the same rank. To estimate the decomposition, we formulate a penalized generalized eigenvalue problem to obtain the first matrix in the decomposition and then obtain the second one by a least squares method. In the high-dimensional setting, we establish the oracle inequalities for the estimates. Compared to the existing theoretical results, we have fewer restrictions on the distribution of the noise vector in each observation and allow correlations among its coordinates. Our theoretical results do not depend on the dimension of the multivariate response. Therefore, the dimension is arbitrary and can be larger than the sample size and the dimension of the predictor. Simulation studies and application to real data show that the proposed method has good prediction performance and is efficient in dimension reduction for various reduced rank models.

preprint 2015 arXiv

Sparse Fisher's discriminant analysis with thresholded linear constraints

Various regularized linear discriminant analysis (LDA) methods have been proposed to address the problems of the classic methods in high-dimensional settings. Asymptotic optimality has been established for some of these methods in high dimension when there are only two classes. A major difficulty in proving asymptotic optimality for multiclass classification is that the classification boundary is typically complicated and no explicit formula for classification error generally exists when the number of classes is greater than two. For Fisher's LDA, one additional difficulty is that the covariance matrix is also involved in the linear constraints. The main purpose of this paper is to establish asymptotic consistency and asymptotic optimality for our sparse Fisher's LDA with thresholded linear constraints in high-dimensional settings for an arbitrary number of classes. To address the first difficulty above, we provide asymptotic optimality and the corresponding convergence rates in high-dimensional settings for a large family of linear classification rules with an arbitrary number of classes, and apply them to our method. To overcome the second difficulty, we propose a thresholding approach to avoid estimating the covariance matrix. We apply the method to classification problems for multivariate functional data through wavelet transformations.

preprint 2010 arXiv

Asymptotic efficiency and finite-sample properties of the generalized profiling estimation of parameters in ordinary differential equations

Ordinary differential equations (ODEs) are commonly used to model dynamic behavior of a system. Because many parameters are unknown and have to be estimated from the observed data, there is growing interest in statistics to develop efficient estimation procedures for these parameters. Among the proposed methods in the literature, the generalized profiling estimation method developed by Ramsay and colleagues is particularly promising for its computational efficiency and good performance. In this approach, the ODE solution is approximated with a linear combination of basis functions. The coefficients of the basis functions are estimated by a penalized smoothing procedure with an ODE-defined penalty. However, the statistical properties of this procedure are not known. In this paper, we first give an upper bound on the uniform norm of the difference between the true solutions and their approximations. Then we use this bound to prove the consistency and asymptotic normality of this estimation procedure. We show that the asymptotic covariance matrix is the same as that of the maximum likelihood estimation. Therefore, this procedure is asymptotically efficient. For a fixed sample and fixed basis functions, we study the limiting behavior of the approximation when the smoothing parameter tends to infinity. We propose an algorithm to choose the smoothing parameters and a method to compute the deviation of the spline approximation from the solution without solving the ODEs.
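The penalized smoothing step described in this abstract is often written along the following lines. This is a sketch for a single scalar ODE dx/dt = f(x, t; \theta) observed with noise at times t_1, ..., t_n; the notation (basis vector \phi, coefficients c, smoothing parameter \lambda) is assumed for illustration and may differ from the paper's.

```latex
% Basis expansion of the ODE solution.
\[
  x(t) \;\approx\; c^{\top} \phi(t).
\]
% Inner step: for fixed \theta, fit the coefficients by penalized least squares,
% where the penalty measures how far the fitted curve is from satisfying the ODE.
\[
  \hat{c}(\theta) \;=\; \arg\min_{c}\;
  \sum_{i=1}^{n} \big( y_i - c^{\top} \phi(t_i) \big)^{2}
  \;+\; \lambda \int \Big( \tfrac{d}{dt}\, c^{\top} \phi(t)
        \;-\; f\big( c^{\top} \phi(t),\, t;\, \theta \big) \Big)^{2} dt .
\]
% Outer step: profile out c and estimate \theta from the data-fitting term alone.
\[
  \hat{\theta} \;=\; \arg\min_{\theta}\;
  \sum_{i=1}^{n} \big( y_i - \hat{c}(\theta)^{\top} \phi(t_i) \big)^{2} .
\]
```

As \lambda grows, the fitted curve is forced closer to an exact ODE solution, which is the regime the abstract refers to when the smoothing parameter tends to infinity.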
