Source author record

Lixing Zhu

Lixing Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Methodology, math.ST, Statistics Theory, Computation and Language, physics.flu-dyn, physics.med-ph, Social and Information Networks

Catalog footprint

What is connected

37 works

7 topics

4 close collaborators

preprint2026arXiv

Asymptotic Distribution-Free Tests for Ultra-high Dimensional Parametric Regressions via Projected Empirical Processes and $p$-value Combination

This paper develops a novel methodology for testing the goodness-of-fit of sparse parametric regression models based on projected empirical processes and p-value combination, where the covariate dimension may substantially exceed the sample size. In such ultra-high dimensional settings, traditional empirical process-based tests often fail due to the curse of dimensionality or their reliance on the asymptotic linearity and normality of parameter estimators, properties that may not hold under ultra-high dimensional scenarios. To overcome these challenges, we first extend the classic martingale transformation to ultra-high dimensional settings under mild conditions and construct a Cramér-von Mises type test based on a martingale-transformed, projected residual-marked empirical process for any projection on the unit sphere. The martingale transformation renders this projected test asymptotically distribution-free and enables us to derive its limiting distribution using only standard convergence rates of parameter estimators. While the projected test is consistent for almost all projections on the unit sphere under mild conditions, it may still suffer from power loss for specific projections. Therefore, we further employ powerful p-value combination procedures, such as the Cauchy combination, to aggregate p-values across multiple projections, thereby enhancing overall robustness. Furthermore, recognizing that empirical process-based tests excel at detecting low-frequency signals while local smoothing tests are generally superior for high-frequency alternatives, we propose a novel hybrid test that aggregates both approaches using Cauchy combination. The resulting hybrid test is powerful against both low-frequency and high-frequency alternatives. $\cdots$
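The Cauchy combination mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard form of the rule, in which each p-value is mapped to a Cauchy variate and a weighted sum is converted back to a p-value; the function name `cauchy_combination` and the equal default weights are illustrative choices, not taken from the paper.

```python
import math

def cauchy_combination(p_values, weights=None):
    """Combine p-values with the Cauchy combination rule (illustrative sketch)."""
    n = len(p_values)
    if n == 0:
        raise ValueError("need at least one p-value")
    if weights is None:
        # Assumption: equal nonnegative weights summing to one.
        weights = [1.0 / n] * n
    # Map each p-value to a standard Cauchy variate and take the weighted sum.
    stat = sum(w * math.tan((0.5 - p) * math.pi)
               for w, p in zip(weights, p_values))
    # Upper-tail probability of a standard Cauchy distribution at `stat`.
    return 0.5 - math.atan(stat) / math.pi

# Hypothetical p-values from three projections.
combined = cauchy_combination([0.01, 0.8, 0.6])
```

With a single p-value the rule returns that p-value unchanged, which is a convenient sanity check; one small p-value among several large ones still yields a small combined p-value.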

preprint2023arXiv

Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding

Narrative understanding involves capturing the author's cognitive processes, providing insights into their knowledge, intentions, beliefs, and desires. Although large language models (LLMs) excel in generating grammatically coherent text, their ability to comprehend the author's thoughts remains uncertain. This limitation hinders the practical applications of narrative understanding. In this paper, we conduct a comprehensive survey of narrative understanding tasks, thoroughly examining their key features, definitions, taxonomy, associated datasets, training objectives, evaluation metrics, and limitations. Furthermore, we explore the potential of expanding the capabilities of modularized LLMs to address novel narrative understanding tasks. By framing narrative understanding as the retrieval of the author's imaginative cues that outline the narrative structure, our study introduces a fresh perspective on enhancing narrative comprehension.

preprint2022arXiv

A general Monte Carlo method for multivariate goodness-of-fit testing applied to elliptical families

A general and relatively simple method for construction of multivariate goodness-of-fit tests is introduced. The proposed test is applied to elliptical distributions. The method is based on a characterization of probability distributions via their characteristic function. The consistency and other limit properties of the new test statistics are studied. Also in a simulation study the proposed tests are compared with earlier as well as more recent competitors.

preprint2022arXiv

Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media

Building models to detect vaccine attitudes on social media is challenging because of the composite, often intricate aspects involved, and the limited availability of annotated data. Existing approaches have relied heavily on supervised training that requires abundant annotations and pre-defined aspect categories. Instead, with the aim of leveraging the large amount of unannotated data now available on vaccination, we propose a novel semi-supervised approach for vaccine attitude detection, called VADet. A variational autoencoding architecture based on language models is employed to learn from unlabelled data the topical information of the domain. Then, the model is fine-tuned with a few manually annotated examples of user attitudes. We validate the effectiveness of VADet on our annotated data and also on an existing vaccination corpus annotated with opinions on vaccines. Our results show that VADet is able to learn disentangled stance and aspect topics, and outperforms existing aspect-based sentiment analysis models on both stance detection and tweet clustering.

preprint2020arXiv

A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

We propose a novel generative model to explore both local and global context for joint learning topics and topic-specific word embeddings. In particular, we assume that global latent topics are shared across documents, a word is generated by a hidden semantic vector encoding its contextual semantic meaning, and its context words are generated conditional on both the hidden semantic vector and global latent topics. Topics are trained jointly with the word embeddings. The trained model maps words to topic-dependent embeddings, which naturally addresses the issue of word polysemy. Experimental results show that the proposed model outperforms the word-level embedding methods in both word similarity evaluation and word sense disambiguation. Furthermore, the model also extracts more coherent topics compared with existing neural topic models or other models for joint learning of topics and word embeddings. Finally, the model can be easily integrated with existing deep contextualized word embedding learning methods to further improve the performance of downstream tasks such as sentiment classification.

preprint2020arXiv

Detecting multiple change points: a PULSE criterion

This research investigates the detection of change points in the means and variances of a sequence of observations. The number of change points may diverge at a certain rate as the sample size goes to infinity. We define a MOSUM-based objective function for this purpose. Unlike all existing MOSUM-based methods, the novel objective function exhibits a useful "PULSE" pattern near change points in the following sense: at the population level, the value at any change point plus 2 times the segment length of the moving average attains a local minimum tending to zero, followed by a local maximum going to infinity. This feature provides an efficient way to simultaneously identify all change points at the sample level. In theory, the number of change points can be consistently estimated and the locations can also be consistently estimated in a certain sense. Further, because the criterion is visual in nature, the locations can in practice be identified from plots more easily than with existing methods in the literature. The method can also handle the case in which the signals of some change points are very weak in the sense that those changes go to zero. Further, the computational cost is low. The numerical studies we conduct validate its good performance.
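For readers unfamiliar with MOSUM statistics, the sketch below computes a classical moving-sum contrast of means. It is not the PULSE objective function of the paper, whose exact form is not given in the abstract; the function name `mosum_mean` and the window parameter `h` are assumptions made for illustration.

```python
import numpy as np

def mosum_mean(x, h):
    """Absolute difference between right and left window means (classical MOSUM sketch)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    csum = np.concatenate(([0.0], np.cumsum(x)))
    stat = np.full(n, np.nan)  # undefined near the boundaries
    for t in range(h, n - h + 1):
        left = (csum[t] - csum[t - h]) / h    # mean of x[t-h:t]
        right = (csum[t + h] - csum[t]) / h   # mean of x[t:t+h]
        stat[t] = abs(right - left)
    return stat

# Noise-free toy sequence with one mean shift at index 20.
x = [0.0] * 20 + [1.0] * 20
stat = mosum_mean(x, h=5)
peak = int(np.nanargmax(stat))
```

In this toy sequence the statistic peaks at the change point; with noisy data a threshold or a criterion such as the one proposed in the paper would be needed to separate true peaks from fluctuations.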

preprint2020arXiv

Doubly robust estimation for conditional treatment effect: a study on asymptotics

In this paper, we apply a doubly robust approach to estimate, when some covariates are given, the conditional average treatment effect under parametric, semiparametric and nonparametric structures of the nuisance propensity score and outcome regression models. We then conduct a systematic study on the asymptotic distributions of nine estimators with different combinations of estimated propensity score and outcome regressions. The study covers the asymptotic properties with all models correctly specified; with either propensity score or outcome regressions locally / globally misspecified; and with all models locally / globally misspecified. The asymptotic variances are compared and the asymptotic bias correction under model-misspecification is discussed. The phenomenon that the asymptotic variance, with model-misspecification, could sometimes be even smaller than that with all models correctly specified is explored. We also conduct a numerical study to examine the theoretical results.

preprint2020arXiv

Doubly robust estimation of average treatment effect revisited

This research revisits the classical doubly robust estimation of the average treatment effect by conducting a systematic comparison, in the sense of asymptotic efficiency, among all possible combinations of the estimated propensity score and outcome regression. To this end, we consider all nine combinations under, respectively, parametric, nonparametric and semiparametric structures. The comparisons provide useful information on when and how to efficiently utilize the model structures in practice. Further, when either the propensity score or the outcome regression is misspecified, we also give the corresponding comparisons. Three phenomena are observed. Firstly, when all models are correctly specified, any combination can achieve the same semiparametric efficiency bound, which coincides with the existing results of some combinations. Secondly, when the propensity score is correctly modeled and estimated, but the outcome regression is misspecified parametrically or semiparametrically, the asymptotic variance is always larger than or equal to the semiparametric efficiency bound. Thirdly, in contrast, when the propensity score is misspecified parametrically or semiparametrically, while the outcome regression is correctly modeled and estimated, the asymptotic variance is not necessarily larger than the semiparametric efficiency bound. In some cases, the "super-efficiency" phenomenon occurs. We also conduct a small numerical study.
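The classical doubly robust estimator discussed in this abstract is commonly written in augmented inverse probability weighting (AIPW) form. The sketch below assumes that form and takes already-fitted nuisance values as inputs; the function name `aipw_ate` and the argument names are hypothetical, and how the propensity score and outcome regressions are estimated (parametrically, nonparametrically or semiparametrically) is left outside the sketch.

```python
import numpy as np

def aipw_ate(y, d, e_hat, m1_hat, m0_hat):
    """AIPW (doubly robust) estimate of the average treatment effect.

    y: outcomes; d: 0/1 treatment indicators;
    e_hat: fitted propensity scores P(D=1 | X);
    m1_hat, m0_hat: fitted outcome regressions E[Y | X, D=1] and E[Y | X, D=0].
    """
    y = np.asarray(y, dtype=float)
    d = np.asarray(d, dtype=float)
    e_hat = np.asarray(e_hat, dtype=float)
    m1_hat = np.asarray(m1_hat, dtype=float)
    m0_hat = np.asarray(m0_hat, dtype=float)
    # Outcome-regression prediction plus inverse-probability-weighted residual.
    treated = m1_hat + d * (y - m1_hat) / e_hat
    control = m0_hat + (1.0 - d) * (y - m0_hat) / (1.0 - e_hat)
    return float(np.mean(treated - control))

# Toy data with a constant treatment effect of 2 and known propensity 0.5.
y = [3.0, 1.0, 4.0, 2.0]
d = [1, 0, 1, 0]
e = [0.5, 0.5, 0.5, 0.5]
ate_good_outcome_model = aipw_ate(y, d, e, [3, 3, 4, 4], [1, 1, 2, 2])
ate_zero_outcome_model = aipw_ate(y, d, e, [0, 0, 0, 0], [0, 0, 0, 0])
```

The two calls illustrate the double robustness idea on toy data: with the propensity score correct, the estimate is the same whether the outcome model fits perfectly or is set to zero.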

preprint2020arXiv

Integrated conditional moment test and beyond: when the number of covariates is divergent

The classic integrated conditional moment test is a promising method for testing regression model misspecification. However, it severely suffers from the curse of dimensionality. To extend it to handle the testing problem for parametric multi-index models with a diverging number of covariates, we investigate three issues in inference in this paper. First, we study the consistency and asymptotically linear representation of the least squares estimator of the parameter matrix at faster rates of divergence than those in the literature for nonlinear models. Second, we propose, via sufficient dimension reduction techniques, an adaptive-to-model version of the integrated conditional moment test. We study the asymptotic properties of the new test under both the null and alternative hypotheses to examine its ability to maintain the significance level and its sensitivity to the global and local alternatives that are distinct from the null at the fastest possible rate in hypothesis testing. Third, we derive the consistency of the bootstrap approximation for the new test in the diverging dimension setting. The numerical studies show that the new test can very much enhance the performance of the original ICM test in high-dimensional scenarios. We also apply the test to a real data set for illustration.

preprint2020arXiv

Limiting laws for extreme eigenvalues of large-dimensional spiked Fisher matrices with a divergent number of spikes

Consider the $p\times p$ matrix that is the product of a population covariance matrix and the inverse of another population covariance matrix. Suppose that their difference has a divergent rank with respect to $p$. When two samples of sizes $n$ and $T$ from the two populations are available, we construct its corresponding sample version. In the regime of high dimension where both $n$ and $T$ are proportional to $p$, we investigate the limiting laws for extreme (spiked) eigenvalues of the sample (spiked) Fisher matrix when the number of spikes is divergent and these spikes are unbounded.
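As a concrete reading of the sample version described here, the sketch below forms a sample Fisher matrix as one sample covariance matrix times the inverse of another and returns its eigenvalues. The function name `fisher_matrix_eigenvalues` and the use of `np.cov` with its default normalization are assumptions for illustration; the paper may center or scale differently.

```python
import numpy as np

def fisher_matrix_eigenvalues(x, y):
    """Eigenvalues of S1 @ inv(S2), sorted in decreasing order.

    x: (n, p) sample from the first population;
    y: (T, p) sample from the second population, with T > p so S2 is invertible.
    """
    s1 = np.cov(x, rowvar=False)
    s2 = np.cov(y, rowvar=False)
    # inv(S2) @ S1 is similar to S1 @ inv(S2), so the eigenvalues coincide;
    # solving a linear system avoids forming the inverse explicitly.
    eig = np.linalg.eigvals(np.linalg.solve(s2, s1))
    return np.sort(eig.real)[::-1]

# Toy example with identical populations (no spikes).
rng = np.random.default_rng(0)
x = rng.normal(size=(200, 5))
y = rng.normal(size=(300, 5))
ev = fisher_matrix_eigenvalues(x, y)
```

The largest entries of `ev` are the extreme eigenvalues whose limiting laws the paper studies in the spiked setting.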

preprint2020arXiv

Model Checking for Parametric Ordinary Differential Equations System

Ordinary differential equations have been used to model dynamical systems in a broad range. Model checking for parametric ordinary differential equations is a necessary step to check whether the assumed models are plausible. In this paper we introduce three test statistics for their different purposes. We first give a trajectory matching-based test for the whole system. To further identify which component function(s) would be wrongly modelled, we introduce two test statistics that are based on integral matching and gradient matching respectively. We investigate the asymptotic properties of the three test statistics under the null, global and local alternative hypothesis. To achieve these purposes, we also investigate the asymptotic properties of nonlinear least squares estimation and two-step collocation estimation under both the null and alternatives. The results about the estimations are also new in the literature. To examine the performances of the tests, we conduct several numerical simulations. A real data example about immune cell kinetics and trafficking for influenza infection is analyzed for illustration.

preprint2020arXiv

Neural Temporal Opinion Modelling for Opinion Prediction on Twitter

Opinion prediction on Twitter is challenging due to the transient nature of tweet content and neighbourhood context. In this paper, we model users' tweet posting behaviour as a temporal point process to jointly predict the posting time and the stance label of the next tweet given a user's historical tweet sequence and tweets posted by their neighbours. We design a topic-driven attention mechanism to capture the dynamic topic shifts in the neighbourhood context. Experimental results show that the proposed model predicts both the posting time and the stance labels of future tweets more accurately compared to a number of competitive baselines.

preprint2020arXiv

Outcome regression-based estimation of conditional average treatment effect

This research systematically investigates the following issues. First, we construct different outcome regression-based estimators for the conditional average treatment effect under, respectively, true (oracle), parametric, nonparametric and semiparametric dimension reduction structures. Second, according to the corresponding asymptotic variance functions, we answer the following questions, supposing the models are correctly specified: what is the asymptotic efficiency ranking of the four estimators in general; how is the efficiency related to the affiliation of the given covariates in the set of arguments of the regression functions; what roles do the bandwidth and kernel function selections play in the estimation efficiency; and in which scenarios should the estimator under the semiparametric dimension reduction regression structure be used in practice? As a by-product, the results show that any outcome regression-based estimation should be asymptotically more efficient than any inverse probability weighting-based estimation. All these results give a relatively complete picture of the outcome regression-based estimation, so that the theoretical conclusions can provide guidance for practical use when more than one estimator can be applied to the same problem. Several simulation studies are conducted to examine the performances of these estimators in finite sample cases and a real dataset is analyzed for illustration.

preprint2020arXiv

The motion of respiratory droplets produced by coughing

Coronavirus disease 2019 (COVID-19) has become a global pandemic infectious respiratory disease with high mortality and infectiousness. This paper investigates respiratory droplet transmission, which is critical to understanding, modeling and controlling epidemics. In the present work, we implemented flow visualization, particle image velocimetry (PIV) and particle shadow tracking velocimetry (PSTV) to measure the velocity of the airflow and droplets involved in coughing and then constructed a physical model considering the evaporation effect to predict the motion of droplets under different weather conditions. The experimental results indicate that the convection velocity of cough airflow presents the relationship $t^{-0.7}$ with time; hence, the distance from the cougher increases by $t^{0.3}$ in the range of our measurement domain. Substituting these experimental results into the physical model reveals that the small droplets (initial diameter $D \leq 100\ \mu$m) evaporate to droplet nuclei and that the large droplets with $D \geq 500\ \mu$m and initial velocity $u_0 \geq 5$ m/s travel more than 2 m. Winter conditions of low temperature and high relative humidity can cause more droplets to settle to the ground, which may be a possible driver of a second pandemic wave in the autumn and winter seasons.
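The step from the velocity scaling to the distance scaling in this abstract follows from integrating the velocity in time. A short worked version, assuming for illustration that the power law $u(t) = C\,t^{-0.7}$ holds from $t = 0$ with a hypothetical constant $C > 0$ (the abstract only states the scaling within the measurement domain):

```latex
u(t) = C\,t^{-0.7}
\quad\Longrightarrow\quad
x(t) = \int_0^t C\,\tau^{-0.7}\,\mathrm{d}\tau
     = \frac{C}{0.3}\,t^{0.3} \;\propto\; t^{0.3}.
```

The integral converges at $\tau = 0$ because the exponent $-0.7$ is greater than $-1$.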

preprint2020arXiv

The Role of Propensity Score Structure in Asymptotic Efficiency of Estimated Conditional Quantile Treatment Effect

When a strict subset of covariates is given, we propose the conditional quantile treatment effect to capture the heterogeneity of treatment effects via the quantile sheet, which is a function of the given covariates and the quantile. We focus on deriving the asymptotic normality of propensity score-based estimators under parametric, nonparametric and semiparametric structures. We make a systematic study of the estimation efficiency to check the importance of propensity score structure and the essential differences from the unconditional counterparts. The derived properties can answer the following questions: what is the general ranking of these estimators; how does the affiliation of the given covariates to the set of covariates of the propensity score affect the efficiency; how does the convergence rate of the estimated propensity score affect the efficiency; and why would semiparametric estimation be worth recommending in practice? We also give a brief discussion on the extension of the methods to handle large-dimensional scenarios and on the estimation of the asymptotic variances. Simulation studies are conducted to examine the performances of these estimators. A real data example is analyzed for illustration and some new findings are obtained.

preprint2016arXiv

A projection-based adaptive-to-model test for regressions

A longstanding problem of existing empirical process-based tests for regressions is that when the number of covariates is greater than one, they either have no tractable limiting null distributions or are not omnibus. To attack this problem, we propose in this paper a projection-based adaptive-to-model approach. When the hypothetical model is parametric single-index, the method can fully utilize the dimension reduction model structure under the null hypothesis as if the covariate were one-dimensional, such that the martingale transformation-based test can be asymptotically distribution-free. Further, the test can automatically adapt to the underlying model structure such that the test can be omnibus and thus detect alternative models distinct from the hypothetical model at the fastest possible rate in hypothesis testing. The method is examined through simulation studies and is illustrated by a real data analysis.

preprint2016arXiv

An Adaptive-to-Model Test for Parametric Single-Index Errors-in-Variables Models

This paper provides some useful tests for fitting a parametric single-index regression model when covariates are measured with error and validation data is available. We propose two tests whose consistency rates do not depend on the dimension of the covariate vector when an adaptive-to-model strategy is applied. One of these tests has a bias term that becomes arbitrarily large with increasing sample size but its asymptotic variance is smaller, and the other is asymptotically unbiased with larger asymptotic variance. Compared with the existing local smoothing tests, the new tests behave like a classical local smoothing test with only one covariate, and still are omnibus against general alternatives. This avoids the difficulty associated with the curse of dimensionality. Further, a systematic study is conducted to give an insight on the effect of the values of the ratio between the sample size and the size of validation data on the asymptotic behavior of these tests. Simulations are conducted to examine the performance in several finite sample scenarios.

preprint2016arXiv

Dimensionality determination: a thresholding double ridge ratio criterion

Widely used eigendecomposition-based criteria, such as BIC-type, ratio estimation and principal component-based criteria, often underdetermine model dimensionality for regressions or the number of factors for factor models. This longstanding problem is caused by the existence of one or two dominating eigenvalues compared to other nonzero eigenvalues. To alleviate this difficulty, we propose a thresholding double ridge ratio criterion such that the true dimension can be better identified and is less underdetermined. Unlike all existing eigendecomposition-based criteria, this criterion can define a consistent estimate without requiring the uniqueness of the minimum and can then handle possible multiple local minima scenarios. This generic strategy can be readily applied to other dimensionality or order determination problems. In this paper, we systematically investigate, for general sufficient dimension reduction theory, the dimensionality determination with fixed and divergent dimensions; for local alternative models that converge to their limiting model with fewer projected covariates, we discuss when the number of projected covariates can be consistently estimated and when it cannot; and for ultra-high dimensional factor models, we study the estimation consistency for the number of common factors. Numerical studies are conducted to examine the finite sample performance of the method.

preprint2016arXiv

Penalized Maximum Likelihood Estimator for Skew Normal Mixtures

Skew normal mixture models provide a more flexible framework than the popular normal mixtures for modelling heterogeneous data with asymmetric behaviors. Due to the unboundedness of the likelihood function and the divergence of shape parameters, the maximum likelihood estimators of the parameters of interest are often not well defined, leading to unsatisfactory inference. We put forward a proposal to deal with these issues simultaneously by penalizing the likelihood function. The resulting penalized maximum likelihood estimator is proved to be strongly consistent when the putative order of the mixture is equal to or larger than the true one. We also provide penalized EM-type algorithms to compute penalized estimators. Finite sample performance is examined through simulations, real data applications and comparison with existing methods.

preprint2015arXiv

A robust adaptive-to-model enhancement test for parametric single-index models

In research on checking whether the underlying model has a parametric single-index structure when there are outliers in the observations, the purpose of this paper is two-fold. First, a test that is robust against outliers is suggested. Hampel's second-order influence function of the test statistic is proved to be bounded. Second, the test fully uses the dimension reduction structure of the hypothetical model and automatically adapts to alternative models when the null hypothesis is false. Thus, the test can greatly overcome the dimensionality problem and is still omnibus against general alternative models. The performance of the test is demonstrated by both Monte Carlo simulation studies and an application to a real dataset.

preprint2015arXiv

An adaptive-to-model test for partially parametric single-index models

Residual marked empirical process-based tests are commonly used in regression models. However, they suffer from data sparseness in high-dimensional space when there are many covariates. This paper has three purposes. First, we suggest a partial dimension reduction adaptive-to-model testing procedure that can be omnibus against general global alternative models although it fully uses the dimension reduction structure under the null hypothesis. This is because the procedure can automatically adapt to the null and alternative models, and thus greatly overcomes the dimensionality problem. Second, to achieve the above goal, we propose a ridge-type eigenvalue ratio estimate to automatically determine the number of linear combinations of the covariates under the null and alternatives. Third, a Monte Carlo approximation to the sampling null distribution is suggested. Unlike existing bootstrap approximation methods, this gives an approximation as close to the sampling null distribution as possible by fully utilising the dimension reduction model structure under the null. Simulation studies and real data analysis are then conducted to illustrate the performance of the new test and compare it with existing tests.
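The idea behind a ridge-type eigenvalue ratio estimate can be sketched generically: add a small ridge to consecutive eigenvalues before taking their ratio, so that ratios between near-zero eigenvalues stay close to one instead of becoming unstable. The sketch below is an illustration under that assumption only; the function name `ridge_ratio_dimension`, the use of raw eigenvalues and the ridge value are hypothetical, and the paper's exact estimator may differ.

```python
import numpy as np

def ridge_ratio_dimension(eigenvalues, ridge):
    """Estimate a structural dimension by minimizing ridge-stabilized eigenvalue ratios."""
    lam = np.sort(np.asarray(eigenvalues, dtype=float))[::-1]
    # Ratio of each eigenvalue to its predecessor, both shifted by the ridge.
    ratios = (lam[1:] + ridge) / (lam[:-1] + ridge)
    # The sharpest drop marks the last "large" eigenvalue (1-based count).
    return int(np.argmin(ratios)) + 1

# Two dominant eigenvalues followed by two near-zero ones.
lam = [5.0, 4.0, 1e-3, 1e-9]
with_ridge = ridge_ratio_dimension(lam, ridge=0.1)
without_ridge = ridge_ratio_dimension(lam, ridge=0.0)
```

In this toy spectrum the plain ratio is misled by the drop between the two near-zero eigenvalues, while the ridge-stabilized ratio picks out the two dominant ones.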

preprint2015arXiv

Dimension reduction-based significance testing in nonparametric regression

A dimension reduction-based adaptive-to-model test is proposed for significance of a subset of covariates in the context of a nonparametric regression model. Unlike existing local smoothing significance tests, the new test behaves like a local smoothing test as if the number of covariates were just that under the null hypothesis and it can detect local alternatives distinct from the null at the rate that is only related to the number of covariates under the null hypothesis. Thus, the curse of dimensionality is largely alleviated when nonparametric estimation is inevitably required. In the cases where there are many insignificant covariates, the improvement of the new test is very significant over existing local smoothing tests on the significance level maintenance and power enhancement. Simulation studies and a real data analysis are conducted to examine the finite sample performance of the proposed test.

preprint2015arXiv

Enhancements of nonparametric generalized likelihood ratio test: Bias-correction and dimension reduction

The nonparametric generalized likelihood ratio test is widely used for model checking in regressions. However, two issues may limit its power. First, the bias term in its limiting null distribution prevents the test from controlling the type I error well, and thus Monte Carlo approximation is required to determine critical values. Second, it severely suffers from the curse of dimensionality due to the use of multivariate nonparametric function estimation. The purpose of this paper is thus two-fold: a bias correction is suggested for this test and a dimension reduction-based model-adaptive enhancement is recommended to improve the power performance. The proposed test still possesses the Wilks phenomenon, and the test statistic can converge to its limit at a much faster rate and is much more sensitive to alternative models than the original nonparametric generalized likelihood ratio test, as if the dimension of covariates were one. Simulation studies are conducted to evaluate the finite sample performance and to compare with other widely used tests. A real data analysis is conducted for illustration.

preprint2015arXiv

Heteroscedasticity Testing for Regression Models: A Dimension Reduction-based Model Adaptive

Heteroscedasticity testing is of importance in regression analysis. Existing local smoothing tests suffer severely from the curse of dimensionality even when the number of covariates is moderate because of the use of nonparametric estimation. In this paper, a dimension reduction-based model adaptive test is proposed which behaves like a local smoothing test as if the number of covariates were equal to the number of their linear combinations in the mean regression function, in particular, equal to 1 when the mean function contains a single index. The test statistic is asymptotically normal under the null hypothesis such that critical values are easily determined. The finite sample performances of the test are examined by simulations and a real data analysis.

preprint2015arXiv

Variable selection and estimation for semi-parametric multiple-index models

In this paper, we propose a novel method to select significant variables and estimate the corresponding coefficients in multiple-index models with a group structure. All existing approaches for single-index models cannot be extended directly to handle this issue with several indices. This method integrates a popularly used shrinkage penalty such as LASSO with the group-wise minimum average variance estimation. It is capable of simultaneous dimension reduction and variable selection, while incorporating the group structure in predictors. Interestingly, the proposed estimator with the LASSO penalty then behaves like an estimator with an adaptive LASSO penalty. The estimator achieves consistency of variable selection without sacrificing the root-$n$ consistency of basis estimation. Simulation studies and a real-data example illustrate the effectiveness and efficiency of the new method.

preprint2014arXiv

Estimation for ultra-high dimensional factor model: a pivotal variable detection based approach

For factor models, the covariance matrix involved often has no row-sparse structure because the common factors may lead some variables to be strongly associated with many others. Under the ultra-high dimensional paradigm, this feature makes existing methods for sparse covariance matrices in the literature not directly applicable. In this paper, for general covariance matrices, a novel approach is suggested to detect these variables, which are called pivotal variables. Then, two-stage estimation procedures are proposed to handle ultra-high dimensionality in factor models. In these procedures, pivotal variable detection is performed as a screening step and then existing approaches are applied to refine the working model. The estimation efficiency can be improved under weaker assumptions on the model structure. Simulations are conducted to examine the performance of the new method and a real dataset is analysed for illustration.

preprint2014arXiv

Inference for biased models: a quasi-instrumental variable approach

For linear regression models that are not exactly sparse, in the sense that the coefficients of the insignificant variables are not exactly zero, the working models obtained by variable selection are often biased. Even in sparse cases, when some significant variables are missed by variable selection, the working models are biased as well. Thus, in such situations, root-n consistent estimation and accurate prediction cannot be expected. In this paper, a novel remodelling method is proposed to produce an unbiased model when quasi-instrumental variables are introduced. The root-n estimation consistency and the asymptotic normality can be achieved, and the prediction accuracy can be improved as well. The performance of the new method is examined through simulation studies.

preprint2014arXiv

Model checking for generalized linear models: a dimension-reduction model-adaptive approach

Local smoothing testing, which is based on multivariate nonparametric regression estimation, is one of the main model checking methodologies in the literature. However, the relevant tests suffer from the typical curse of dimensionality, which results in slow convergence rates to their limits under the null hypothesis and smaller deviations from the null under alternatives. As a result, the tests maintain the significance level poorly and are less sensitive to alternatives. In this paper, a dimension-reduction model-adaptive test is proposed for generalized linear models. The test behaves like a local smoothing test as if the model were univariate: it is consistent against any global alternative and can detect local alternatives distinct from the null at a rate that existing local smoothing tests achieve only when the model is univariate. Simulations are carried out to examine the performance of the methodology, and a real data analysis is conducted for illustration. The method can readily be extended to global smoothing methodology and other testing problems.

preprint2014arXiv

Transformed sufficient dimension reduction

A novel general framework is proposed in this paper for dimension reduction in regression to fill the gap between linear and fully nonlinear dimension reduction. The main idea is to transform first each of the raw predictors monotonically, and then search for a low-dimensional projection in the space defined by the transformed variables. Both user-specified and data-driven transformations are suggested. In each case, the methodology is discussed first in a general manner, and a representative method, as an example, is then proposed and evaluated by simulation. The proposed methods are applied to a real data set for illustration.

preprint2014arXiv

Upper expectation parametric regression

Every observation may follow a distribution that is randomly selected from a class of distributions. This is called distribution uncertainty, and it is acknowledged in some research fields such as financial risk measurement. As a consequence, the classical expectation is not identifiable in general. In this paper, distribution uncertainty is defined, and then an upper expectation regression is proposed, which can describe the relationship between extreme events and relevant covariates under the framework of distribution uncertainty. As there are no classical methods available to estimate the parameters in the upper expectation regression, a two-step penalized maximum least squares procedure is proposed to estimate the mean function and the upper expectation of the error. The resulting estimators are consistent and asymptotically normal in a certain sense. Simulation studies and a real data example show that classical least squares estimation does not work and that penalized maximum least squares performs well.

preprint2013arXiv

Asymptotic Composite Estimation

Composition methodologies in the current literature mainly aim to improve estimation efficiency via direct composition of either initial estimators or objective functions. In this paper, composite estimation is investigated for both estimation efficiency and bias reduction. To this end, a novel method is proposed that utilizes a regression relationship, in an asymptotic sense, between initial estimators and values of a model-independent parameter. The resulting estimators can have smaller limiting variances than those of the initial estimators and, for nonparametric regression estimation, can also have a faster convergence rate than the classical optimal rate that the corresponding initial estimators can achieve. Simulations are carried out to examine the performance in finite sample situations.

preprint2012arXiv

Robust rank correlation based screening

Independence screening is a variable selection method that uses a ranking criterion to select significant variables, particularly for statistical models with nonpolynomial dimensionality or "large p, small n" paradigms when p can be as large as an exponential of the sample size n. In this paper we propose a robust rank correlation screening (RRCS) method to deal with ultra-high dimensional data. The new procedure is based on the Kendall τ correlation coefficient between response and predictor variables rather than the Pearson correlation used by existing methods. The new method has four desirable features compared with existing independence screening methods. First, the sure independence screening property holds under only the existence of a second-order moment of the predictor variables, rather than exponential tails or similar conditions, even when the number of predictor variables grows exponentially with the sample size. Second, it can be used to deal with semiparametric models, such as transformation regression models and single-index models with a monotonic constraint on the link function, without involving nonparametric estimation even when there are nonparametric functions in the models. Third, the procedure is robust against outliers and influential points in the observations. Last, the use of indicator functions in rank correlation screening greatly simplifies the theoretical derivation due to the boundedness of the resulting statistics, compared with previous studies on variable screening. Simulations are carried out for comparisons with existing methods and a real data example is analyzed.
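The ranking idea in this abstract can be illustrated with a minimal sketch. It is not the paper's exact RRCS statistic or thresholding rule: it uses the standard Kendall tau-a coefficient, and the function names `kendall_tau` and `rank_screen`, the `keep` parameter, and the toy data are all assumptions made for illustration.

```python
from itertools import combinations


def kendall_tau(x, y):
    """Kendall tau-a: (concordant pairs - discordant pairs) / total pairs."""
    n = len(x)
    score = 0
    for i, j in combinations(range(n), 2):
        prod = (x[i] - x[j]) * (y[i] - y[j])
        if prod > 0:
            score += 1      # pair ordered the same way in x and y
        elif prod < 0:
            score -= 1      # pair ordered in opposite ways
    return score / (n * (n - 1) / 2)


def rank_screen(columns, y, keep):
    """Rank predictors by |tau| with the response and keep the top `keep`.

    `columns` is a list of predictor columns. Keeping a fixed number of
    predictors is an assumed rule, chosen only to keep the sketch short.
    """
    scores = [abs(kendall_tau(col, y)) for col in columns]
    order = sorted(range(len(scores)), key=lambda k: scores[k], reverse=True)
    return order[:keep], scores


# Toy data: the response has one gross outlier (100), which changes no
# pairwise ordering and therefore leaves the rank correlation untouched.
y = [1, 2, 3, 4, 5, 6, 7, 100]
noise = [5, 1, 4, 2, 8, 3, 7, 6]          # unrelated predictor
signal = [2, 4, 6, 8, 10, 12, 14, 16]     # monotone in y
selected, scores = rank_screen([noise, signal], y, keep=1)
print(selected, scores)
```

Because the statistic depends only on pairwise orderings, replacing `signal` or `y` by any strictly increasing transformation of itself gives the same scores, which is the property that makes rank-based screening attractive for transformation and monotone single-index models.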

preprint2012arXiv

The EFM approach for single-index models

Single-index models are natural extensions of linear models and circumvent the so-called curse of dimensionality. They are becoming increasingly popular in many scientific fields including biostatistics, medicine, economics and financial econometrics. Estimating and testing the model index coefficients $\boldsymbol{\beta}$ is one of the most important objectives in the statistical analysis. However, the commonly used assumption on the index coefficients, $\|\boldsymbol{\beta}\|=1$, represents a nonregular problem: the true index is on the boundary of the unit ball. In this paper we introduce the EFM approach, a method of estimating functions, to study the single-index model. The procedure is to first relax the equality constraint to one with $(d-1)$ components of $\boldsymbol{\beta}$ lying in an open unit ball, and then to construct the associated $(d-1)$ estimating functions by projecting the score function to the linear space spanned by the residuals with the unknown link being estimated by kernel estimating functions. The root-n consistency and asymptotic normality for the estimator obtained from solving the resulting estimating equations are achieved, and a Wilks type theorem for testing the index is demonstrated. A noticeable result we obtain is that our estimator for $\boldsymbol{\beta}$ has smaller or equal limiting variance than the estimator of Carroll et al. [J. Amer. Statist. Assoc. 92 (1997) 447-489]. A fixed-point iterative scheme for computing this estimator is proposed. This algorithm only involves one-dimensional nonparametric smoothers, thereby avoiding the data sparsity problem caused by high model dimensionality. Numerical studies based on simulation and on applications suggest that this new estimating system is quite powerful and easy to implement.
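One way to write the relaxation of the unit-norm constraint described in this abstract is sketched below. The sign convention (first component positive) and the notation $\boldsymbol{\beta}^{(1)}$ are assumptions introduced here for illustration; the abstract itself does not state them.

```latex
% Assumed parametrization: drop the first component and recover it from the norm constraint.
\boldsymbol{\beta}^{(1)} = (\beta_2, \dots, \beta_d)^{\top}, \qquad
\|\boldsymbol{\beta}^{(1)}\| < 1, \qquad
\boldsymbol{\beta} = \Bigl( \sqrt{1 - \|\boldsymbol{\beta}^{(1)}\|^{2}},\; \beta_2, \dots, \beta_d \Bigr)^{\top}.
```

Under this parametrization $\|\boldsymbol{\beta}\| = 1$ holds automatically, and the free parameter $\boldsymbol{\beta}^{(1)}$ is an interior point of the open unit ball in $\mathbb{R}^{d-1}$, so only $(d-1)$ estimating functions are needed.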

preprint2011arXiv

Estimation and inference for high-dimensional non-sparse models

For variable selection to work successfully, a sparse model structure has become a basic assumption for all existing methods. However, this assumption is questionable as it is unlikely to hold in most cases, and none of the existing methods may provide consistent estimation and accurate model prediction in non-sparse scenarios. In this paper, we propose semiparametric re-modeling and inference when the linear regression model under study is possibly non-sparse. After an initial working model is selected by a method such as the Dantzig selector adopted in this paper, we re-construct a globally unbiased semiparametric model by use of suitable instrumental variables and nonparametric adjustment. The newly defined model is identifiable, and the estimator of the parameter vector is asymptotically normal. The consistency, together with the re-built model, improves model prediction. This method naturally works when the model is indeed sparse and is thus robust against non-sparseness in a certain sense. Simulation studies show that the new approach yields, particularly when $p$ is much larger than $n$, a significant improvement in estimation and prediction accuracy over the Gaussian Dantzig selector and other classical methods. Even when the model under study is sparse, our method is comparable to the existing methods designed for sparse models.

preprint2010arXiv

Adaptive post-Dantzig estimation and prediction for non-sparse "large $p$ and small $n$" models

For consistency (even oracle properties) of estimation and model prediction, almost all existing methods of variable/feature selection critically depend on sparsity of models. However, for "large $p$ and small $n$" models the sparsity assumption is hard to check and, in particular, when this assumption is violated, consistency of existing estimators is usually impossible because working models selected by existing methods such as the LASSO and the Dantzig selector are usually biased. To attack this problem, we propose adaptive post-Dantzig estimation and model prediction. Here adaptability means that the consistency based on the newly proposed method is adaptive to non-sparsity of the model, the choice of shrinkage tuning parameter and the dimension of the predictor vector. The idea is that after a sub-model is determined as a working model by the Dantzig selector, we construct a globally unbiased sub-model by choosing suitable instrumental variables and nonparametric adjustment. The new estimator of the parameters in the sub-model can be asymptotically normal. The consistent estimator, together with the selected sub-model and adjusted model, improves model predictions. Simulation studies show that the new approach yields a significant improvement in estimation and prediction accuracy over the Gaussian Dantzig selector and other classical methods.

preprint2010arXiv

Bounds smaller than the Fisher information for generalized linear models

In this paper, we propose a parameter space augmentation approach that is based on "intentionally" introducing a pseudo-nuisance parameter into generalized linear models for the purpose of variance reduction. We first consider a parameter whose norm is equal to one. By introducing a pseudo-nuisance parameter into the models to be estimated, an extra estimator is obtained that is asymptotically normal and, more importantly, non-positively correlated with the estimator that asymptotically achieves the Fisher/quasi-Fisher information. As such, the resulting estimator asymptotically has a smaller variance-covariance matrix than that given by the Fisher/quasi-Fisher information. For general cases where the norm of the parameter is not necessarily equal to one, two-stage quasi-likelihood procedures that separately estimate the scale and direction of the parameter are proposed. The traces of the limiting variance-covariance matrices are in general smaller than or equal to that of the Fisher/quasi-Fisher information. We also discuss the pros and cons of the new methodology, and possible extensions. As this methodology of parameter space augmentation is general, it may be readily extended to handle, say, cluster data and correlated data, and other models.

preprint2010arXiv

Component Selection in the Additive Regression Model

Similar to variable selection in the linear regression model, selecting significant components in the popular additive regression model is of great interest. However, such components are unknown smooth functions of independent variables, which are unobservable. As such, some approximation is needed. In this paper, we suggest a combination of penalized regression spline approximation and group variable selection, called the lasso-type spline method (LSM), to handle this component selection problem with a diverging number of strongly correlated variables in each group. It is shown that the proposed method can select significant components and estimate nonparametric additive function components simultaneously with an optimal convergence rate. To make the LSM stable in computation and able to adapt its estimators to the level of smoothness of the component functions, weighted power spline bases and projected weighted power spline bases are proposed. Their performance is examined by simulation studies across two set-ups with independent predictors and correlated predictors, respectively, and appears superior to the performance of competing methods. The proposed method is extended to a partial linear regression model analysis with real data, and gives reliable results.

37 published item(s)

Asymptotic Distribution-Free Tests for Ultra-high Dimensional Parametric Regressions via Projected Empirical Processes and $p$-value Combination

Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding

A general Monte Carlo method for multivariate goodness-of-fit testing applied to elliptical families

Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media

A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings

Detecting multiple change points: a PULSE criterion

Doubly robust estimation for conditional treatment effect: a study on asymptotics

Doubly robust estimation of average treatment effect revisited

Integrated conditional moment test and beyond: when the number of covariates is divergent

Limiting laws for extreme eigenvalues of large-dimensional spiked Fisher matrices with a divergent number of spikes

Model Checking for Parametric Ordinary Differential Equations System

Neural Temporal Opinion Modelling for Opinion Prediction on Twitter

Outcome regression-based estimation of conditional average treatment effect

The motion of respiratory droplets produced by coughing

The Role of Propensity Score Structure in Asymptotic Efficiency of Estimated Conditional Quantile Treatment Effect

A projection-based adaptive-to-model test for regressions

An Adaptive-to-Model Test for Parametric Single-Index Errors-in-Variables Models

Dimensionality determination: a thresholding double ridge ratio criterion

Penalized Maximum Likelihood Estimator for Skew Normal Mixtures

A robust adaptive-to-model enhancement test for parametric single-index models

An adaptive-to-model test for partially parametric single-index models

Dimension reduction-based significance testing in nonparametric regression

Enhancements of nonparametric generalized likelihood ratio test: Bias-correction and dimension reduction

Heteroscedasticity Testing for Regression Models: A Dimension Reduction-based Model Adaptive

Variable selection and estimation for semi-parametric multiple-index models

Estimation for ultra-high dimensional factor model: a pivotal variable detection based approach

Inference for biased models: a quasi-instrumental variable approach

Model checking for generalized linear models: a dimension-reduction model-adaptive approach

Transformed sufficient dimension reduction

Upper expectation parametric regression

Asymptotic Composite Estimation

Robust rank correlation based screening

The EFM approach for single-index models

Estimation and inference for high-dimensional non-sparse models

Adaptive post-Dantzig estimation and prediction for non-sparse "large $p$ and small $n$" models

Bounds smaller than the Fisher information for generalized linear models

Component Selection in the Additive Regression Model