Researcher profile

Jon A. Wellner

Jon A. Wellner contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2013arXiv

On the Hermite spline conjecture and its connection to k-monotone densities

The k-monotone classes of densities defined on (0, \infty) have been known in the mathematical literature but were for the first time considered from a statistical point of view by Balabdaoui and Wellner (2007, 2010). In these works, the authors generalized the results established for monotone (k=1) and convex (k=2) densities by giving a characterization of the Maximum Likelihood and Least Square estimators (MLE and LSE) and deriving minimax bounds for rates of convergence. For k strictly larger than 2, the pointwise asymptotic behavior of the MLE and LSE studied by Balabdaoui and Wellner (2007) would show that the MLE and LSE attain the minimax lower bounds in a local pointwise sense. However, the theory assumes that a certain conjecture about the approximation error of a Hermite spline holds true. The main goal of the present note is to show why such a conjecture cannot be true. We also suggest how to bypass the conjecture and rebuild the key proofs in the limit theory of the estimators.

preprint2013arXiv

Weighted likelihood estimation under two-phase sampling

We develop asymptotic theory for weighted likelihood estimators (WLE) under two-phase stratified sampling without replacement. We also consider several variants of WLEs involving estimated weights and calibration. A set of empirical process tools are developed including a Glivenko-Cantelli theorem, a theorem for rates of convergence of M-estimators, and a Donsker theorem for the inverse probability weighted empirical processes under two-phase sampling and sampling without replacement at the second phase. Using these general results, we derive asymptotic distributions of the WLE of a finite-dimensional parameter in a general semiparametric model where an estimator of a nuisance parameter is estimable either at regular or nonregular rates. We illustrate these results and methods in the Cox model with right censoring and interval censoring. We compare the methods via their asymptotic variances under both sampling without replacement and the more usual (and easier to analyze) assumption of Bernoulli sampling at the second phase.

preprint2012arXiv

A general semiparametric Z-estimation approach for case-cohort studies

Case-cohort design, an outcome-dependent sampling design for censored survival data, is increasingly used in biomedical research. The development of asymptotic theory for a case-cohort design in the current literature primarily relies on counting process stochastic integrals. Such an approach, however, is rather limited and lacks theoretical justification for outcome-dependent weighted methods due to non-predictability. Instead of stochastic integrals, we derive asymptotic properties for case-cohort studies based on a general Z-estimation theory for semiparametric models with bundled parameters using modern empirical processes. Both the Cox model and the additive hazards model with time-dependent covariates are considered.

preprint2012arXiv

Nonparametric estimation of multivariate convex-transformed densities

We study estimation of multivariate densities $p$ of the form $p(x)=h(g(x))$ for $x\in \mathbb {R}^d$ and for a fixed monotone function $h$ and an unknown convex function $g$. The canonical example is $h(y)=e^{-y}$ for $y\in \mathbb {R}$; in this case, the resulting class of densities [\mathcal {P}(e^{-y})={p=\exp(-g):g is convex}] is well known as the class of log-concave densities. Other functions $h$ allow for classes of densities with heavier tails than the log-concave class. We first investigate when the maximum likelihood estimator $\hat{p}$ exists for the class $\mathcal {P}(h)$ for various choices of monotone transformations $h$, including decreasing and increasing functions $h$. The resulting models for increasing transformations $h$ extend the classes of log-convex densities studied previously in the econometrics literature, corresponding to $h(y)=\exp(y)$. We then establish consistency of the maximum likelihood estimator for fairly general functions $h$, including the log-concave class $\mathcal {P}(e^{-y})$ and many others. In a final section, we provide asymptotic minimax lower bounds for the estimation of $p$ and its vector of derivatives at a fixed point $x_0$ under natural smoothness hypotheses on $h$ and $g$. The proofs rely heavily on results from convex analysis.

preprint2010arXiv

A local maximal inequality under uniform entropy

We derive an upper bound for the mean of the supremum of the empirical process indexed by a class of functions that are known to have variance bounded by a small constant $δ$. The bound is expressed in the uniform entropy integral of the class at $δ$. The bound yields a rate of convergence of minimum contrast estimators when applied to the modulus of continuity of the contrast functions.

preprint2010arXiv

Nonparametric estimation of a convex bathtub-shaped hazard function

In this paper, we study the nonparametric maximum likelihood estimator (MLE) of a convex hazard function. We show that the MLE is consistent and converges at a local rate of $n^{2/5}$ at points $x_0$ where the true hazard function is positive and strictly convex. Moreover, we establish the pointwise asymptotic distribution theory of our estimator under these same assumptions. One notable feature of the nonparametric MLE studied here is that no arbitrary choice of tuning parameter (or complicated data-adaptive selection of the tuning parameter) is required.

preprint2010arXiv

Nonparametric estimation of multivariate scale mixtures of uniform densities

Suppose that $\m{U} = (U_1, \ldots , U_d) $ has a Uniform$([0,1]^d)$ distribution, that $\m{Y} = (Y_1 , \ldots , Y_d) $ has the distribution $G$ on $\RR_+^d$, and let $\m{X} = (X_1 , \ldots , X_d) = (U_1 Y_1 , \ldots , U_d Y_d )$. The resulting class of distributions of $\m{X}$ (as $G$ varies over all distributions on $\RR_+^d$) is called the {\sl Scale Mixture of Uniforms} class of distributions, and the corresponding class of densities on $\RR_+^d$ is denoted by $\{\cal F}_{SMU}(d)$. We study maximum likelihood estimation in the family ${\cal F}_{SMU}(d)$. We prove existence of the MLE, establish Fenchel characterizations, and prove strong consistency of the almost surely unique maximum likelihood estimator (MLE) in ${\cal F}_{SMU}(d)$. We also provide an asymptotic minimax lower bound for estimating the functional $f \mapsto f(\m{x})$ under reasonable differentiability assumptions on $f\in{\cal F}_{SMU} (d)$ in a neighborhood of $\m{x}$. We conclude the paper with discussion, conjectures and open problems pertaining to global and local rates of convergence of the MLE.

preprint2010arXiv

Squaring the Circle and Cubing the Sphere: Circular and Spherical Copulas

Do there exist circular and spherical copulas in $R^d$? That is, do there exist circularly symmetric distributions on the unit disk in $R^2$ and spherically symmetric distributions on the unit ball in $R^d$, $d\ge3$, whose one-dimensional marginal distributions are uniform? The answer is yes for $d=2$ and 3, where the circular and spherical copulas are unique and can be determined explicitly, but no for $d\ge4$. A one-parameter family of elliptical bivariate copulas is obtained from the unique circular copula in $R^2$ by oblique coordinate transformations. Copulas obtained by a non-linear transformation of a uniform distribution on the unit ball in $R^d$ are also described, and determined explicitly for $d=2$.

preprint2009arXiv

Nemirovski's Inequalities Revisited

An important tool for statistical research are moment inequalities for sums of independent random vectors. Nemirovski and coworkers (1983, 2000) derived one particular type of such inequalities: For certain Banach spaces $(\B,\|\cdot\|)$ there exists a constant $K = K(\B,\|\cdot\|)$ such that for arbitrary independent and centered random vectors $X_1, X_2, ..., X_n \in \B$, their sum $S_n$ satisfies the inequality $ E \|S_n \|^2 \le K \sum_{i=1}^n E \|X_i\|^2$. We present and compare three different approaches to obtain such inequalities: Nemirovski's results are based on deterministic inequalities for norms. Another possible vehicle are type and cotype inequalities, a tool from probability theory on Banach spaces. Finally, we use a truncation argument plus Bernstein's inequality to obtain another version of the moment inequality above. Interestingly, all three approaches have their own merits.

preprint2008arXiv

Inconsistency of the MLE for the joint distribution of interval censored survival times and continuous marks

This paper considers the nonparametric maximum likelihood estimator (MLE) for the joint distribution function of an interval censored survival time and a continuous mark variable. We provide a new explicit formula for the MLE in this problem. We use this formula and the mark specific cumulative hazard function of Huang and Louis (1998) to obtain the almost sure limit of the MLE. This result leads to necessary and sufficient conditions for consistency of the MLE which imply that the MLE is inconsistent in general. We show that the inconsistency can be repaired by discretizing the marks. Our theoretical results are supported by simulations.