Source author record

Aijun Zhang

Aijun Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.DS Computation Methodology math-ph math.MP math.OC

Catalog footprint

What is connected

9works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Model-free Subsampling Method Based on Uniform Designs

Subsampling or subdata selection is a useful approach in large-scale statistical learning. Most existing studies focus on model-based subsampling methods which significantly depend on the model assumption. In this paper, we consider the model-free subsampling strategy for generating subdata from the original full data. In order to measure the goodness of representation of a subdata with respect to the original data, we propose a criterion, generalized empirical F-discrepancy (GEFD), and study its theoretical properties in connection with the classical generalized L2-discrepancy in the theory of uniform designs. These properties allow us to develop a kind of low-GEFD data-driven subsampling method based on the existing uniform designs. By simulation examples and a real case study, we show that the proposed subsampling method is superior to the random sampling method. Moreover, our method keeps robust under diverse model specifications while other popular subsampling methods are under-performing. In practice, such a model-free property is more appealing than the model-based subsampling methods, where the latter may have poor performance when the model is misspecified, as demonstrated in our simulation studies.

preprint2022arXiv

Traversing the Local Polytopes of ReLU Neural Networks: A Unified Approach for Network Verification

Although neural networks (NNs) with ReLU activation functions have found success in a wide range of applications, their adoption in risk-sensitive settings has been limited by the concerns on robustness and interpretability. Previous works to examine robustness and to improve interpretability partially exploited the piecewise linear function form of ReLU NNs. In this paper, we explore the unique topological structure that ReLU NNs create in the input space, identifying the adjacency among the partitioned local polytopes and developing a traversing algorithm based on this adjacency. Our polytope traversing algorithm can be adapted to verify a wide range of network properties related to robustness and interpretability, providing an unified approach to examine the network behavior. As the traversing algorithm explicitly visits all local polytopes, it returns a clear and full picture of the network behavior within the traversed region. The time and space complexity of the traversing algorithm is determined by the number of a ReLU NN's partitioning hyperplanes passing through the traversing region.

preprint2020arXiv

Adaptive Iterative Hessian Sketch via A-Optimal Subsampling

Iterative Hessian sketch (IHS) is an effective sketching method for modeling large-scale data. It was originally proposed by Pilanci and Wainwright (2016; JMLR) based on randomized sketching matrices. However, it is computationally intensive due to the iterative sketch process. In this paper, we analyze the IHS algorithm under the unconstrained least squares problem setting, then propose a deterministic approach for improving IHS via A-optimal subsampling. Our contributions are three-fold: (1) a good initial estimator based on the A-optimal design is suggested; (2) a novel ridged preconditioner is developed for repeated sketching; and (3) an exact line search method is proposed for determining the optimal step length adaptively. Extensive experimental results demonstrate that our proposed A-optimal IHS algorithm outperforms the existing accelerated IHS methods.

preprint2020arXiv

An Effective and Efficient Initialization Scheme for Training Multi-layer Feedforward Neural Networks

Network initialization is the first and critical step for training neural networks. In this paper, we propose a novel network initialization scheme based on the celebrated Stein's identity. By viewing multi-layer feedforward neural networks as cascades of multi-index models, the projection weights to the first hidden layer are initialized using eigenvectors of the cross-moment matrix between the input's second-order score function and the response. The input data is then forward propagated to the next layer and such a procedure can be repeated until all the hidden layers are initialized. Finally, the weights for the output layer are initialized by generalized linear modeling. Such a proposed SteinGLM method is shown through extensive numerical results to be much faster and more accurate than other popular methods commonly used for training neural networks.

preprint2020arXiv

Balance-Subsampled Stable Prediction

In machine learning, it is commonly assumed that training and test data share the same population distribution. However, this assumption is often violated in practice because the sample selection bias may induce the distribution shift from training data to test data. Such a model-agnostic distribution shift usually leads to prediction instability across unknown test data. In this paper, we propose a novel balance-subsampled stable prediction (BSSP) algorithm based on the theory of fractional factorial design. It isolates the clear effect of each predictor from the confounding variables. A design-theoretic analysis shows that the proposed method can reduce the confounding effects among predictors induced by the distribution shift, hence improve both the accuracy of parameter estimation and prediction stability. Numerical experiments on both synthetic and real-world data sets demonstrate that our BSSP algorithm significantly outperforms the baseline methods for stable prediction across unknown test data.

preprint2020arXiv

BeSS: An R Package for Best Subset Selection in Linear, Logistic and CoxPH Models

We introduce a new R package, BeSS, for solving the best subset selection problem in linear, logistic and Cox's proportional hazard (CoxPH) models. It utilizes a highly efficient active set algorithm based on primal and dual variables, and supports sequential and golden search strategies for best subset selection. We provide a C++ implementation of the algorithm using Rcpp interface. We demonstrate through numerical experiments based on enormous simulation and real datasets that the new BeSS package has competitive performance compared to other R packages for best subset selection purpose.

preprint2014arXiv

Spreading Speeds and Traveling Waves of Nonlocal Monostable Equations in Time and Space Periodic Habitats

This paper is devoted to the investigation of spatial spreading speeds and traveling wave solutions of monostable evolution equations with nonlocal dispersal in time and space periodic habitats. It has been shown in an earlier work by the first two authors of the current paper that such an equation has a unique time and space periodic positive stable solution $u^*(t,x)$. In this paper, we show that such an equation has a spatial spreading speed $c^*(ξ)$ in the direction of any given unit vector $ξ$. A variational characterization of $c^*(ξ)$ is given. Under the assumption that the nonlocal dispersal operator associated to the linearization of the monostable equation at the trivial solution $0$ has a principal eigenvalue, we also show that the monostable equation has a continuous periodic traveling wave solution connecting $u^*(\cdot,\cdot)$ and $0$ propagating in any given direction of $ξ$ with speed $c>c^*(ξ)$.

preprint2013arXiv

Competing Interactions and Traveling Wave Solutions in Lattice Differential Equations

The existence of traveling front solutions to bistable lattice differential equations in the absence of a comparison principle is studied. The results are in the spirit of those in Bates, Chen, and Chmaj in[1], but are applicable to vector equations and to more general limiting systems. An abstract result on the persistence of traveling wave solutions is obtained and is then applied to lattice differential equations with repelling first and/or second neighbor interactions and to some problems with infinite range interactions.

preprint2012arXiv

Traveling Wave Solutions of Spatially Periodic Nonlocal Monostable Equations

This paper deals with front propagation dynamics of monostable equations with nonlocal dispersal in spatially periodic habitats. In the authors' earlier works, it is shown that a general spatially periodic monostable equation with nonlocal dispersal has a unique spatially periodic positive stationary solution and has a spreading speed in every direction. In this paper, we show that a spatially periodic nonlocal monostable equation with certain spatial homogeneity or small nonlocal dispersal distance has a unique stable periodic traveling wave solutions connecting its unique spatially periodic positive stationary solution and the trivial solution in every direction for all speeds greater than the spreading speed in that direction.

Aijun Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Model-free Subsampling Method Based on Uniform Designs

Traversing the Local Polytopes of ReLU Neural Networks: A Unified Approach for Network Verification

Adaptive Iterative Hessian Sketch via A-Optimal Subsampling

An Effective and Efficient Initialization Scheme for Training Multi-layer Feedforward Neural Networks

Balance-Subsampled Stable Prediction

BeSS: An R Package for Best Subset Selection in Linear, Logistic and CoxPH Models

Spreading Speeds and Traveling Waves of Nonlocal Monostable Equations in Time and Space Periodic Habitats

Competing Interactions and Traveling Wave Solutions in Lattice Differential Equations

Traveling Wave Solutions of Spatially Periodic Nonlocal Monostable Equations