Source author record

Jingxiao Zhang

Jingxiao Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology

Catalog footprint

What is connected

5works

1topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Successive classification learning for estimating quantile optimal treatment regimes

Quantile optimal treatment regimes (OTRs) aim to assign treatments that maximize a specified quantile of patients' outcomes. Compared to treatment regimes that target the mean outcomes, quantile OTRs offer fairer regimes when a lower quantile is selected, as it improves outcomes for vulnerable patients. In this paper, we propose a novel method for estimating quantile OTRs by reformulating the problem as a successive classification task, solvable via training a sequence of classifiers, each successive classifier built on the output of its predecessors. This reformulation enables us to leverage the powerful machine learning technique to enhance computational efficiency and handle complex decision boundaries. We also investigate the estimation of quantile OTRs when outcomes are discrete, a setting that has received limited attention in the literature. A key challenge is that direct extensions of existing methods to discrete outcomes often lead to inconsistency and ineffectiveness issues. To overcome this, we introduce a smoothing technique that maps discrete outcomes to continuous surrogates, enabling consistent and effective estimation. We provide theoretical guarantees to support our methodology, and demonstrate its superior performance through comprehensive simulation studies and real-data analysis.

preprint2023arXiv

Kernel partial least squares regression for functional nonlinear models

Functional regression is very crucial in functional data analysis and a linear relationship between scalar response and functional predictor is often assumed. However, the linear assumption may not hold in practice, which makes the methods for linear models invalid. To gain more flexibility, we focus on functional nonlinear models and aim to develop new method that requires no strict constraint on the nonlinear structure. Inspired by the idea of the kernel method in machine learning, we propose a kernel functional partial least squares (KFPLS) method for the functional nonlinear models. The innovative algorithm works on the prediction of the scalar response and is accompanied by R package KFPLS for implementation. The simulation study demonstrates the effectiveness of the proposed method for various types of nonlinear models. Moreover, the real world application also shows the superiority of the proposed KFPLS method.

preprint2022arXiv

Locally sparse estimator of generalized varying coefficient model for asynchronous longitudinal data

In longitudinal study, it is common that response and covariate are not measured at the same time, which complicates the analysis to a large extent. In this paper, we take into account the estimation of generalized varying coefficient model with such asynchronous observations. A penalized kernel-weighted estimating equation is constructed through kernel technique in the framework of functional data analysis. Moreover, local sparsity is also considered in the estimating equation to improve the interpretability of the estimate. We extend the iteratively reweighted least squares (IRLS) algorithm in our computation. The theoretical properties are established in terms of both consistency and sparsistency, and the simulation studies further verify the satisfying performance of our method when compared with existing approaches. The method is applied to an AIDS study to reveal its practical merits.

preprint2022arXiv

Partial Replacement Imputation Estimation Method for Complex Missing Covariates in Additive Partially Linear Models

Missing data is a common problem in clinical data collection, which causes difficulty in the statistical analysis of such data. In this article, we consider the problem under a framework of a semiparametric partially linear model when observations are subject to missingness with complex patterns. If the correct model structure of the additive partially linear model is available, we propose to use a new imputation method called Partial Replacement IMputation Estimation (PRIME), which can overcome problems caused by incomplete data in the partially linear model. Also, we use PRIME in conjunction with model averaging (PRIME-MA) to tackle the problem of unknown model structure in the partially linear model. In simulation studies, we use various error distributions, sample sizes, missing data rates, covariate correlations, and noise levels, and PRIME outperforms other methods in almost all cases. With an unknown correct model structure, PRIME-MA has satisfactory performance in terms of prediction, while slightly worse than PRIME. Moreover, we conduct a study of influential factors in Pima Indians Diabetes data, which shows that our method performs better than the other models.

preprint2021arXiv

Robust Functional Principal Component Analysis for Non-Gaussian Longitudinal Data

Functional principal component analysis is essential in functional data analysis, but the inferences will become unconvincing when some non-Gaussian characteristics occur, such as heavy tail and skewness. The focus of this paper is to develop a robust functional principal component analysis methodology in dealing with non-Gaussian longitudinal data, for which sparsity and irregularity along with non-negligible measurement errors must be considered. We introduce a Kendall's $τ$ function whose particular properties make it a nice proxy for the covariance function in the eigenequation when handling non-Gaussian cases. Moreover, the estimation procedure is presented and the asymptotic theory is also established. We further demonstrate the superiority and robustness of our method through simulation studies and apply the method to the longitudinal CD4 cell count data in an AIDS study.