Source author record

Ryan Martin

Ryan Martin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.ST Methodology Statistics Theory Computation math.CO Machine Learning physics.ins-det math.PR nucl-ex

Catalog footprint

What is connected

48 works

9 topics

4 close collaborators

preprint2025arXiv

No-prior Bayes reIMagined: probabilistic approximations of inferential models

When prior information is lacking, the go-to strategy for probabilistic inference is to combine a "default prior" and the likelihood via Bayes's theorem. Objective Bayes, (generalized) fiducial inference, etc. fall under this umbrella. This construction is natural, but the corresponding posterior distributions generally only offer limited, approximately valid uncertainty quantification. The present paper takes a reimagined approach that yields posterior distributions with stronger reliability properties. The proposed construction starts with an inferential model (IM), one that takes the mathematical form of a data-driven possibility measure and features exactly valid uncertainty quantification, and then returns a so-called inner probabilistic approximation thereof. This inner probabilistic approximation inherits many of the original IM's desirable properties, including credible sets with exact coverage and asymptotic efficiency. The approximation also agrees with the familiar Bayes/fiducial solution in applications where the model has a group invariance structure. A Monte Carlo method for evaluating the probabilistic approximation is presented, along with numerical illustrations.

preprint2023arXiv

Elucidating Inferential Models with the Cauchy Distribution

Statistical inference as a formal scientific method to convert experience to knowledge has proven to be elusively difficult. While frequentist and Bayesian methodologies have been accepted in the contemporary era as two dominant schools of thought, it has been a good part of the last hundred years to see growing interests in development of more sound methods, both philosophically, in terms of scientific meaning of inference, and mathematically, in terms of exactness and efficiency. These include Fisher's fiducial argument, the Dempster-Shafer theory of belief functions, generalized fiducial, Confidence Distributions, and the most recently proposed inferential framework, called Inferential Models. Since it is notoriously challenging to make exact and efficient inference about the Cauchy distribution, this article takes it as an example to elucidate different schools of thought on statistical inference. It is shown that the standard approach of Inferential Models produces exact and efficient prior-free probabilistic inference on the location and scale parameters of the Cauchy distribution, whereas all other existing methods suffer from various difficulties.

preprint2022arXiv

Direct Gibbs posterior inference on risk minimizers: construction, concentration, and calibration

Real-world problems, often couched as machine learning applications, involve quantities of interest that have real-world meaning, independent of any statistical model. To avoid potential model misspecification bias or over-complicating the problem formulation, a direct, model-free approach is desired. The traditional Bayesian framework relies on a model for the data-generating process so, apparently, the desired direct, model-free, posterior-probabilistic inference is out of reach. Fortunately, likelihood functions are not the only means of linking data and quantities of interest. Loss functions provide an alternative link, where the quantity of interest is defined, or at least could be defined, as a minimizer of the corresponding risk, or expected loss. In this case, one can obtain what is commonly referred to as a Gibbs posterior distribution by using the empirical risk function directly. This manuscript explores the Gibbs posterior construction, its asymptotic concentration properties, and the frequentist calibration of its credible regions. By being free from the constraints of model specification, Gibbs posteriors create new opportunities for probabilistic inference in modern statistical learning problems.
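
As a minimal sketch of the construction this abstract describes, a Gibbs posterior has density proportional to exp{-w n R_n(theta)} times a prior, where R_n is the empirical risk and w is a learning rate. The example below assumes absolute-error loss (whose risk minimizer is a median), a flat prior on a grid, and w = 1. The function names, the simulated data and these tuning choices are illustrative assumptions, not taken from the paper.

```python
import math
import random


def empirical_risk(theta, data):
    """Average absolute-error loss; it is minimized at a sample median."""
    return sum(abs(x - theta) for x in data) / len(data)


def gibbs_posterior(data, grid, learning_rate=1.0):
    """Normalized Gibbs posterior weights on a grid, assuming a flat prior."""
    n = len(data)
    log_w = [-learning_rate * n * empirical_risk(t, data) for t in grid]
    shift = max(log_w)  # subtract the max before exponentiating, for stability
    w = [math.exp(lw - shift) for lw in log_w]
    total = sum(w)
    return [wi / total for wi in w]


random.seed(1)
data = [random.gauss(2.0, 1.0) for _ in range(200)]  # hypothetical sample
grid = [i / 100 for i in range(0, 401)]              # theta in [0, 4]
post = gibbs_posterior(data, grid)

# Grid point with the largest posterior weight, and an upper sample median.
post_mode = grid[max(range(len(grid)), key=post.__getitem__)]
sample_median = sorted(data)[len(data) // 2]
```

No likelihood appears anywhere in the sketch: the loss function alone links the data to the quantity of interest. In practice the learning rate would be tuned, for example so that credible regions are calibrated, rather than fixed at 1.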

preprint2022arXiv

Generalized Bayes inference on a linear personalized minimum clinically important difference

Inference on the minimum clinically important difference, or MCID, is an important practical problem in medicine. The basic idea is that a treatment being statistically significant may not lead to an improvement in the patients' well-being. The MCID is defined as a threshold such that, if a diagnostic measure exceeds this threshold, then the patients are more likely to notice an improvement. Typical formulations use an underspecified model, which makes a genuine Bayesian solution out of reach. Here, for a challenging personalized MCID problem, where the practically-significant threshold depends on patients' profiles, we develop a novel generalized posterior distribution, based on a working binary quantile regression model, that can be used for estimation and inference. The advantage of this formulation is two-fold: we can theoretically control the bias of the misspecified model and it has a latent variable representation which we can leverage for efficient Gibbs sampling. To ensure that the generalized Bayes inferences achieve a level of frequentist reliability, we propose a variation on the so-called generalized posterior calibration algorithm to suitably tune the spread of our proposed posterior.

preprint2022arXiv

Powers of Hamiltonian cycles in multipartite graphs

We prove that if $G$ is a $k$-partite graph on $n$ vertices in which all of the parts have order at most $n/r$ and every vertex is adjacent to at least a $1-1/r+o(1)$ proportion of the vertices in every other part, then $G$ contains the $(r-1)$-st power of a Hamiltonian cycle.

preprint2022arXiv

Validity, consonant plausibility measures, and conformal prediction

Prediction of future observations is an important and challenging problem. The two mainstream approaches for quantifying prediction uncertainty use prediction regions and predictive distributions, respectively, with the latter believed to be more informative because it can perform other prediction-related tasks. The standard notion of validity, what we refer to here as Type-1 validity, focuses on coverage probability of prediction regions, while a notion of validity relevant to the other prediction-related tasks performed by predictive distributions is lacking. Here we present a new notion, called Type-2 validity, relevant to these other prediction tasks. We establish connections between Type-2 validity and coherence properties, and show that imprecise probability considerations are required in order to achieve it. We go on to show that both types of prediction validity can be achieved by interpreting the conformal prediction output as the contour function of a consonant plausibility measure. We also offer an alternative characterization of conformal prediction, based on a new nonparametric inferential model construction, wherein the appearance of consonance is natural, and prove its validity.
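
The idea of reading conformal prediction output as a plausibility contour can be sketched as follows. This is an illustrative example only: the nonconformity score (absolute distance to the mean of the augmented sample), the data and the function names are assumptions, not the paper's choices. The conformal p-value of a candidate future value y serves as the contour, a prediction region is the set of candidates whose contour exceeds alpha, and the plausibility of a set is the supremum of the contour over that set.

```python
def conformal_contour(data, y):
    """Conformal p-value (plausibility contour) for a candidate future value y."""
    augmented = list(data) + [y]
    center = sum(augmented) / len(augmented)
    scores = [abs(z - center) for z in augmented]  # nonconformity scores
    # Fraction of scores at least as large as the candidate's own score.
    return sum(s >= scores[-1] for s in scores) / len(augmented)


def plausibility(data, candidates):
    """Consonant plausibility of a set: supremum of the contour over the set."""
    return max(conformal_contour(data, y) for y in candidates)


data = [4.1, 5.3, 4.8, 5.9, 5.0, 4.6, 5.4, 5.1, 4.9]  # hypothetical observations
grid = [i / 10 for i in range(0, 101)]                # candidates 0.0, 0.1, ..., 10.0
alpha = 0.2
region = [y for y in grid if conformal_contour(data, y) > alpha]
```

The contour never drops below 1/(n+1), since the candidate always counts itself. Thresholding it gives prediction regions, while the supremum rule assigns an upper probability to any assertion about the future observation.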

preprint2021arXiv

Asymptotically optimal inference in sparse sequence models with a simple data-dependent measure

For high-dimensional inference problems, statisticians have a number of competing interests. On the one hand, procedures should provide accurate estimation, reliable structure learning, and valid uncertainty quantification. On the other hand, procedures should be computationally efficient and able to scale to very high dimensions. In this note, I show that a very simple data-dependent measure can achieve all of these desirable properties simultaneously, along with some robustness to the error distribution, in sparse sequence models.

preprint2021arXiv

Gibbs posterior inference on multivariate quantiles

Bayesian and other likelihood-based methods require specification of a statistical model and may not be fully satisfactory for inference on quantities, such as quantiles, that are not naturally defined as model parameters. In this paper, we construct a direct and model-free Gibbs posterior distribution for multivariate quantiles. Being model-free means that inferences drawn from the Gibbs posterior are not subject to model misspecification bias, and being direct means that no priors for or marginalization over nuisance parameters are required. We show here that the Gibbs posterior enjoys a root-$n$ convergence rate and a Bernstein--von Mises property, i.e., for large $n$, the Gibbs posterior distribution can be approximated by a Gaussian. Moreover, we present numerical results showing the validity and efficiency of credible sets derived from a suitably scaled Gibbs posterior.

preprint2021arXiv

Stochastic optimization for numerical evaluation of imprecise probabilities

In applications of imprecise probability, analysts must compute lower (or upper) expectations, defined as the infimum of an expectation over a set of parameter values. Monte Carlo methods consistently approximate expectations at fixed parameter values, but can be costly to implement in grid search to locate minima over large subsets of the parameter space. We investigate the use of stochastic iterative root-finding methods for efficiently computing lower expectations. In two examples we illustrate the use of various stochastic approximation methods, and demonstrate their superior performance in comparison to grid search.

preprint2020arXiv

Empirical priors for prediction in sparse high-dimensional linear regression

In this paper we adopt the familiar sparse, high-dimensional linear regression model and focus on the important but often overlooked task of prediction. In particular, we consider a new empirical Bayes framework that incorporates data in the prior in two ways: one is to center the prior for the non-zero regression coefficients and the other is to provide some additional regularization. We show that, in certain settings, the asymptotic concentration of the proposed empirical Bayes posterior predictive distribution is very fast, and we establish a Bernstein--von Mises theorem which ensures that the derived empirical Bayes prediction intervals achieve the targeted frequentist coverage probability. The empirical prior has a convenient conjugate form, so posterior computations are relatively simple and fast. Finally, our numerical results demonstrate the proposed method's strong finite-sample performance in terms of prediction accuracy, uncertainty quantification, and computation time compared to existing Bayesian methods.

preprint2020arXiv

Variational approximations of empirical Bayes posteriors in high-dimensional linear models

In high dimensions, the prior tails can have a significant effect on both posterior computation and asymptotic concentration rates. To achieve optimal rates while keeping the posterior computations relatively simple, an empirical Bayes approach has recently been proposed, featuring thin-tailed conjugate priors with data-driven centers. While conjugate priors ease some of the computational burden, Markov chain Monte Carlo methods are still needed, which can be expensive when dimension is high. In this paper, we develop a variational approximation to the empirical Bayes posterior that is fast to compute and retains the optimal concentration rate properties of the original. In simulations, our method is shown to have superior performance compared to existing variational approximations in the literature across a wide range of high-dimensional settings.

preprint2019arXiv

Model-free posterior inference on the area under the receiver operating characteristic curve

The area under the receiver operating characteristic curve (AUC) serves as a summary of a binary classifier's performance. Methods for estimating the AUC have been developed under a binormality assumption which restricts the distribution of the score produced by the classifier. However, this assumption introduces an infinite-dimensional nuisance parameter and can be inappropriate, especially in the context of machine learning. This motivates us to adopt a model-free Gibbs posterior distribution for the AUC. We present the asymptotic Gibbs posterior concentration rate, and a strategy for tuning the learning rate so that the corresponding credible intervals achieve the nominal frequentist coverage probability. Simulation experiments and a real data analysis demonstrate the Gibbs posterior's strong performance compared to existing methods based on a rank likelihood.

preprint2018arXiv

Bayesian inference in high-dimensional linear models using an empirical correlation-adaptive prior

In the context of a high-dimensional linear regression model, we propose the use of an empirical correlation-adaptive prior that makes use of information in the observed predictor variable matrix to adaptively address high collinearity, determining if parameters associated with correlated predictors should be shrunk together or kept apart. Under suitable conditions, we prove that this empirical Bayes posterior concentrates around the true sparse parameter at the optimal rate asymptotically. A simplified version of a shotgun stochastic search algorithm is employed to implement the variable selection procedure, and we show, via simulation experiments across different settings and a real-data application, the favorable performance of the proposed method compared to existing methods.

preprint2018arXiv

Empirical priors and posterior concentration rates for a monotone density

In a Bayesian context, prior specification for inference on monotone densities is conceptually straightforward, but proving posterior convergence theorems is complicated by the fact that desirable prior concentration properties often are not satisfied. In this paper, I first develop a new prior designed specifically to satisfy an empirical version of the prior concentration property, and then I give sufficient conditions on the prior inputs such that the corresponding empirical Bayes posterior concentrates around the true monotone density at nearly the optimal minimax rate. Numerical illustrations also reveal the practical benefits of the proposed empirical Bayes approach compared to Dirichlet process mixtures.

preprint2018arXiv

On nonparametric estimation of a mixing density via the predictive recursion algorithm

Nonparametric estimation of a mixing density based on observations from the corresponding mixture is a challenging statistical problem. This paper surveys the literature on a fast, recursive estimator based on the predictive recursion algorithm. After introducing the algorithm and giving a few examples, I summarize the available asymptotic convergence theory, describe an important semiparametric extension, and highlight two interesting applications. I conclude with a discussion of several recent developments in this area and some open problems.
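
For readers unfamiliar with predictive recursion, a single pass of the update can be sketched on a grid. Starting from an initial guess f_0, each observation x_i updates the mixing density by f_i(u) = (1 - w_i) f_{i-1}(u) + w_i k(x_i | u) f_{i-1}(u) / m_{i-1}(x_i), where m_{i-1}(x_i) is the integral of k(x_i | v) f_{i-1}(v) over v. The sketch assumes a unit-variance normal kernel, a uniform initial guess, weights w_i = (i + 1)^(-0.67), and a Riemann-sum approximation of the integral. These choices, the simulated data and the function names are illustrative assumptions.

```python
import math
import random


def normal_kernel(x, u):
    """Density of N(u, 1) evaluated at x."""
    return math.exp(-0.5 * (x - u) ** 2) / math.sqrt(2.0 * math.pi)


def predictive_recursion(data, grid, gamma=0.67):
    """One pass of predictive recursion; returns the mixing density on the grid."""
    du = grid[1] - grid[0]
    f = [1.0 / (du * len(grid))] * len(grid)  # uniform initial guess
    for i, x in enumerate(data, start=1):
        w = (i + 1) ** (-gamma)               # decreasing weight sequence
        k = [normal_kernel(x, u) for u in grid]
        marginal = sum(kj * fj for kj, fj in zip(k, f)) * du  # Riemann sum
        f = [(1.0 - w) * fj + w * kj * fj / marginal for kj, fj in zip(k, f)]
    return f


random.seed(7)
# Hypothetical data: latent means at -2 or 2, observed with N(0, 1) noise.
data = [random.choice([-2.0, 2.0]) + random.gauss(0.0, 1.0) for _ in range(500)]
grid = [-5.0 + 0.1 * j for j in range(101)]
f_hat = predictive_recursion(data, grid)
total_mass = sum(f_hat) * (grid[1] - grid[0])
```

Each observation is processed once, so the cost is linear in the sample size. The estimate depends on the order of the data, which is commonly addressed by averaging over several random permutations.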

preprint2016arXiv

Avoiding rainbow induced subgraphs in vertex-colorings

For a fixed graph $H$ on $k$ vertices, and a graph $G$ on at least $k$ vertices, we write $G\rightarrow H$ if in any vertex-coloring of $G$ with $k$ colors, there is an induced subgraph isomorphic to $H$ whose vertices have distinct colors. In other words, if $G\rightarrow H$ then a totally multicolored induced copy of $H$ is unavoidable in any vertex-coloring of $G$ with $k$ colors. In this paper, we show that, with a few notable exceptions, for any graph $H$ on $k$ vertices and for any graph $G$ which is not isomorphic to $H$, $G\not\!\rightarrow H$. We explicitly describe all exceptional cases. This determines the induced vertex-anti-Ramsey number for all graphs and shows that totally multicolored induced subgraphs are, in most cases, easily avoidable.

preprint2016arXiv

Exact prior-free probabilistic inference in a class of non-regular models

The use of standard statistical methods, such as maximum likelihood, is often justified based on their asymptotic properties. For suitably regular models, this theory is standard but, when the model is non-regular, e.g., the support depends on the parameter, these asymptotic properties may be difficult to assess. Recently, an inferential model (IM) framework has been developed that provides valid prior-free probabilistic inference without the need for asymptotic justification. In this paper, we construct an IM for a class of highly non-regular models with parameter-dependent support. This construction requires conditioning, which is facilitated through the solution of a particular differential equation. We prove that the plausibility intervals derived from this IM are exact confidence intervals, and we demonstrate their efficiency in a simulation study.

preprint2016arXiv

On weighted Ramsey numbers

The weighted Ramsey number, ${\rm wR}(n,k)$, is the minimum $q$ such that there is an assignment of nonnegative real numbers (weights) to the edges of $K_n$ with the total sum of the weights equal to ${n\choose 2}$ and there is a Red/Blue coloring of edges of the same $K_n$, such that in any complete $k$-vertex subgraph $H$, of $K_n$, the sum of the weights on Red edges in $H$ is at most $q$ and the sum of the weights on Blue edges in $H$ is at most $q$. This concept was introduced recently by Fujisawa and Ota. We provide new bounds on ${\rm wR}(n,k)$, for $k\geq 4$ and $n$ large enough and show that determining ${\rm wR}(n,3)$ is asymptotically equivalent to the problem of finding the fractional packing number of monochromatic triangles in colorings of edges of complete graphs with two colors.

preprint2016arXiv

Valid uncertainty quantification about the model in a linear regression setting

In scientific applications, there often are several competing models that could be fit to the observed data, so quantification of the model uncertainty is of fundamental importance. In this paper, we develop an inferential model (IM) approach for simultaneously valid probabilistic inference over a collection of assertions of interest without requiring any prior input. Our construction guarantees that the approach is optimal in the sense that it is the most efficient among those which are valid. Connections between the IM's simultaneous validity and post-selection inference are also made. We apply the general results to obtain valid uncertainty quantification about the set of predictor variables to be included in a linear regression model.

preprint2015arXiv

A semiparametric scale-mixture regression model and predictive recursion maximum likelihood

To avoid specification of the error distribution in a regression model, we propose a general nonparametric scale mixture model for the error distribution. For fitting such mixtures, the predictive recursion method is a simple and computationally efficient alternative to existing methods. We define a predictive recursion-based marginal likelihood function, and estimation of the regression parameters proceeds by maximizing this function. A hybrid predictive recursion--EM algorithm is proposed for this purpose. The method's performance is compared with that of existing methods in simulations and real data analyses.

preprint2015arXiv

On Posterior Concentration in Misspecified Models

We investigate the asymptotic behavior of Bayesian posterior distributions under independent and identically distributed ($i.i.d.$) misspecified models. More specifically, we study the concentration of the posterior distribution on neighborhoods of $f^{\star}$, the density that is closest in the Kullback--Leibler sense to the true model $f_0$. We note, through examples, the need for assumptions beyond the usual Kullback--Leibler support assumption. We then investigate consistency with respect to a general metric under three assumptions, each based on a notion of divergence measure, and then apply these to a weighted $L_1$-metric in convex models and non-convex models. Although a few results on this topic are available, we believe that these are somewhat inaccessible due, in part, to the technicalities and the subtle differences compared to the more familiar well-specified model case. One of our goals is to make some of the available results, especially that of , more accessible. Unlike their paper, our approach does not require construction of test sequences. We also discuss a preliminary extension of the $i.i.d.$ results to the independent but not identically distributed ($i.n.i.d.$) case.

preprint2015arXiv

Simulating from a gamma distribution with small shape parameter

Simulating from a gamma distribution with small shape parameter is a challenging problem. Towards an efficient method, we obtain a limiting distribution for a suitably normalized gamma distribution when the shape parameter tends to zero. Then this limiting distribution provides insight to the construction of a new, simple, and highly efficient acceptance--rejection algorithm. Comparisons based on acceptance rates show that the proposed procedure is more efficient than existing acceptance--rejection methods.
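
To illustrate why small shape parameters are awkward, and what a normalized gamma variate looks like in that regime, here is a short sketch. It does not implement the acceptance-rejection algorithm proposed in the paper. Instead it uses the well-known identity that if G ~ Gamma(a + 1, 1) and U ~ Uniform(0, 1) are independent, then G U^(1/a) ~ Gamma(a, 1). Because such draws underflow to zero for tiny a, the sketch works with log X directly. For small a, the quantity -a log X is then approximately Exponential(1), so its sample mean should be close to 1. The function name, seed and sample size are illustrative assumptions.

```python
import math
import random


def log_gamma_variate(shape, rng):
    """Draw log(X) for X ~ Gamma(shape, 1) using log G + log(U) / shape."""
    g = rng.gammavariate(shape + 1.0, 1.0)  # shape + 1 > 1 is easy to sample
    u = 1.0 - rng.random()                  # lies in (0, 1], so log(u) is finite
    return math.log(g) + math.log(u) / shape


rng = random.Random(2015)
shape = 0.001
draws = [-shape * log_gamma_variate(shape, rng) for _ in range(20000)]
mean_normalized = sum(draws) / len(draws)   # should be close to 1 for small shape
```

Staying on the log scale matters here: with shape 0.001, a typical draw of X is far smaller than the smallest positive double, so exponentiating would return zero.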

preprint2015arXiv

Status Update of the MAJORANA DEMONSTRATOR Neutrinoless Double Beta Decay Experiment

Neutrinoless double beta decay searches play a major role in determining neutrino properties, in particular the Majorana or Dirac nature of the neutrino and the absolute scale of the neutrino mass. The consequences of these searches go beyond neutrino physics, with implications for Grand Unification and leptogenesis. The MAJORANA Collaboration is assembling a low-background array of high purity Germanium (HPGe) detectors to search for neutrinoless double-beta decay in $^{76}$Ge. The MAJORANA DEMONSTRATOR, which is currently being constructed and commissioned at the Sanford Underground Research Facility in Lead, South Dakota, will contain 44 kg (30 kg enriched in $^{76}$Ge) of HPGe detectors. Its primary goal is to demonstrate the scalability and background required for a tonne-scale Ge experiment. This is accomplished via a modular design and projected background of less than 3 counts/tonne-yr in the region of interest. The experiment is currently taking data with the first of its enriched detectors.

preprint2015arXiv

Ultra-Low Noise Mechanically Cooled Germanium Detector

Low capacitance, large volume, high purity germanium (HPGe) radiation detectors have been successfully employed in low-background physics experiments. However, some physical processes may not be detectable with existing detectors whose energy thresholds are limited by electronic noise. In this paper, methods are presented which can lower the electronic noise of these detectors. Through ultra-low vibration mechanical cooling and wire bonding of a CMOS charge sensitive preamplifier to a sub-pF p-type point contact HPGe detector, we demonstrate electronic noise levels below 40 eV-FWHM.

preprint2014arXiv

A note on p-values interpreted as plausibilities

P-values are a mainstay in statistics but are often misinterpreted. We propose a new interpretation of p-value as a meaningful plausibility, where this is to be interpreted formally within the inferential model framework. We show that, for most practical hypothesis testing problems, there exists an inferential model such that the corresponding plausibility function, evaluated at the null hypothesis, is exactly the p-value. The advantages of this representation are that the notion of plausibility is consistent with the way practitioners use and interpret p-values, and the plausibility calculation avoids the troublesome conditioning on the truthfulness of the null. This connection with plausibilities also reveals a shortcoming of standard p-values in problems with non-trivial parameter constraints.

preprint2014arXiv

Asymptotically minimax empirical Bayes estimation of a sparse normal mean vector

For the important classical problem of inference on a sparse high-dimensional normal mean vector, we propose a novel empirical Bayes model that admits a posterior distribution with desirable properties under mild conditions. In particular, our empirical Bayes posterior distribution concentrates on balls, centered at the true mean vector, with squared radius proportional to the minimax rate, and its posterior mean is an asymptotically minimax estimator. We also show that, asymptotically, the support of our empirical Bayes posterior has roughly the same effective dimension as the true sparse mean vector. Simulation from our empirical Bayes posterior is straightforward, and our numerical results demonstrate the quality of our method compared to others having similar large-sample properties.

preprint2014arXiv

Conditional inferential models: combining information for prior-free probabilistic inference

The inferential model (IM) framework provides valid prior-free probabilistic inference by focusing on predicting unobserved auxiliary variables. But, efficient IM-based inference can be challenging when the auxiliary variable is of higher dimension than the parameter. Here we show that features of the auxiliary variable are often fully observed and, in such cases, a simultaneous dimension reduction and information aggregation can be achieved by conditioning. This proposed conditioning strategy leads to efficient IM inference, and casts new light on Fisher's notions of sufficiency, conditioning, and also Bayesian inference. A differential equation-driven selection of a conditional association is developed, and validity of the conditional IM is proved under some conditions. For problems that do not admit a valid conditional IM of the standard form, we propose a more flexible class of conditional IMs based on localization. Examples of local conditional IMs in a bivariate normal model and a normal variance components model are also given.

preprint2014arXiv

Discussion: Foundations of Statistical Inference, Revisited

This is an invited contribution to the discussion on Professor Deborah Mayo's paper, "On the Birnbaum argument for the strong likelihood principle," to appear in Statistical Science. Mayo clearly demonstrates that statistical methods violating the likelihood principle need not violate either the sufficiency or conditionality principle, thus refuting Birnbaum's claim. With the constraints of Birnbaum's theorem lifted, we revisit the foundations of statistical inference, focusing on some new foundational principles, the inferential model framework, and connections with sufficiency and conditioning. [arXiv:1302.7021]

preprint2014arXiv

Exact prior-free probabilistic inference on the heritability coefficient in a linear mixed model

Linear mixed-effect models with two variance components are often used when variability comes from two sources. In genetics applications, variation in observed traits can be attributed to biological and environmental effects, and the heritability coefficient is a fundamental quantity that measures the proportion of total variability due to the biological effect. We propose a new inferential model approach which yields exact prior-free probabilistic inference on the heritability coefficient. In particular we construct exact confidence intervals and demonstrate numerically our method's efficiency compared to that of existing methods.

preprint2014arXiv

Frameworks for prior-free posterior probabilistic inference

The development of statistical methods for valid and efficient probabilistic inference without prior distributions has a long history. Fisher's fiducial inference is perhaps the most famous of these attempts. We argue that, despite its seemingly prior-free formulation, fiducial and its various extensions are not prior-free and, therefore, do not meet the requirements for prior-free probabilistic inference. In contrast, the inferential model (IM) framework is genuinely prior-free and is shown to be a promising new method for generating both valid and efficient probabilistic inference. With a brief introduction to the two fundamental principles, namely, the validity and efficiency principles, the three-step construction of the basic IM framework is discussed in the context of the validity principle. Efficient IM methods, based on conditioning and marginalization, are illustrated with two benchmark examples, namely, the bivariate normal with unknown correlation coefficient and the Behrens--Fisher problem.

preprint2014arXiv

Marginal inferential models: prior-free probabilistic inference on interest parameters

The inferential models (IM) framework provides prior-free, frequency-calibrated, posterior probabilistic inference. The key is the use of random sets to predict unobservable auxiliary variables connected to the observable data and unknown parameters. When nuisance parameters are present, a marginalization step can reduce the dimension of the auxiliary variable which, in turn, leads to more efficient inference. For regular problems, exact marginalization can be achieved, and we give conditions for marginal IM validity. We show that our approach provides exact and efficient marginal inference in several challenging problems, including a many-normal-means problem. In non-regular problems, we propose a generalized marginalization technique and prove its validity. Details are given for two benchmark examples, namely, the Behrens--Fisher and gamma mean problems.

preprint2014arXiv

Plausibility functions and exact frequentist inference

In the frequentist program, inferential methods with exact control on error rates are a primary focus. The standard approach, however, is to rely on asymptotic approximations, which may not be suitable. This paper presents a general framework for the construction of exact frequentist procedures based on plausibility functions. It is shown that the plausibility function-based tests and confidence regions have the desired frequentist properties in finite samples---no large-sample justification needed. An extension of the proposed method is also given for problems involving nuisance parameters. Examples demonstrate that the plausibility function-based method is both exact and efficient in a wide variety of problems.

preprint2014arXiv

Prior-free probabilistic prediction of future observations

Prediction of future observations is a fundamental problem in statistics. Here we present a general approach based on the recently developed inferential model (IM) framework. We employ an IM-based technique to marginalize out the unknown parameters, yielding prior-free probabilistic prediction of future observables. Verifiable sufficient conditions are given for validity of our IM for prediction, and a variety of examples demonstrate the proposed method's performance. Thanks to its generality and ease of implementation, we expect that our IM-based method for prediction will be a useful tool for practitioners.

preprint2013arXiv

A note on Bayesian convergence rates under local prior support conditions

Bounds on Bayesian posterior convergence rates, assuming the prior satisfies both local and global support conditions, are now readily available. In this paper we explore, in the context of density estimation, Bayesian convergence rates assuming only local prior support conditions. Our results give optimal rates under minimal conditions using very simple arguments.

preprint2013arXiv

Inferential models: A framework for prior-free posterior probabilistic inference

Posterior probabilistic statistical inference without priors is an important but so far elusive goal. Fisher's fiducial inference, Dempster-Shafer theory of belief functions, and Bayesian inference with default priors are attempts to achieve this goal but, to date, none has given a completely satisfactory picture. This paper presents a new framework for probabilistic inference, based on inferential models (IMs), which not only provides data-dependent probabilistic measures of uncertainty about the unknown parameter, but does so with an automatic long-run frequency calibration property. The key to this new approach is the identification of an unobservable auxiliary variable associated with observable data and unknown parameter, and the prediction of this auxiliary variable with a random set before conditioning on data. Here we present a three-step IM construction, and prove a frequency-calibration property of the IM's belief function under mild conditions. A corresponding optimality theory is developed, which helps to resolve the non-uniqueness issue. Several examples are presented to illustrate this new approach.
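
A toy version of the three-step construction can be written out for a single observation X ~ N(theta, 1). The sketch assumes the association X = theta + Phi^{-1}(U) with U ~ Uniform(0, 1), and the symmetric random set S = {u : |u - 0.5| <= |U' - 0.5|} with U' ~ Uniform(0, 1) for predicting U. Under these assumptions the plausibility of the point assertion {theta} is 1 - |2 Phi(x - theta) - 1|, which the code checks by Monte Carlo. The function names and the observed value are hypothetical.

```python
import math
import random


def std_normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def plausibility_exact(x, theta):
    """Closed form under the assumed random set: 1 - |2 * Phi(x - theta) - 1|."""
    return 1.0 - abs(2.0 * std_normal_cdf(x - theta) - 1.0)


def plausibility_mc(x, theta, rng, draws=50000):
    """Monte Carlo version: fraction of random sets containing u = Phi(x - theta)."""
    u = std_normal_cdf(x - theta)         # association: solve x = theta + Phi^{-1}(u)
    hits = 0
    for _ in range(draws):
        radius = abs(rng.random() - 0.5)  # prediction: S = {u': |u' - 0.5| <= radius}
        hits += abs(u - 0.5) <= radius    # combination: does S contain u?
    return hits / draws


rng = random.Random(0)
x_obs = 1.3                               # hypothetical observed value
pl_at_edge = plausibility_exact(x_obs, x_obs - 1.959964)  # about 0.05
pl_mc = plausibility_mc(x_obs, 0.0, rng)
```

Thresholding the plausibility at 0.05 recovers the familiar interval x plus or minus 1.96 in this example, which gives a concrete sense of the frequency calibration described in the abstract.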

preprint, 2013, arXiv

Random sets and exact confidence regions

An important problem in statistics is the construction of confidence regions for unknown parameters. In most cases, asymptotic distribution theory is used to construct confidence regions, so any coverage probability claims only hold approximately, for large samples. This paper describes a new approach, using random sets, which allows users to construct exact confidence regions without appeal to asymptotic theory. In particular, if the user-specified random set satisfies a certain validity property, confidence regions obtained by thresholding the induced data-dependent plausibility function are shown to have the desired coverage probability.
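The thresholding step described above can be sketched for one simple case. This is only an illustration under stated assumptions: a single observation x from N(θ, 1) and the plausibility function pl_x(θ) = 1 - |2Φ(x - θ) - 1|. The model, the plausibility function, and the helper names `norm_cdf`, `plausibility` and `confidence_region` are chosen here for illustration and are not taken from the paper.

```python
import math


def norm_cdf(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def plausibility(theta, x):
    """Assumed plausibility of the point assertion {theta} given x ~ N(theta, 1)."""
    return 1.0 - abs(2.0 * norm_cdf(x - theta) - 1.0)


def confidence_region(x, alpha, grid):
    """Keep the grid points whose plausibility is at least alpha."""
    return [t for t in grid if plausibility(t, x) >= alpha]


x = 0.3                                        # hypothetical observation
grid = [i / 100.0 for i in range(-400, 401)]   # candidate values of theta
region = confidence_region(x, 0.05, grid)
lo, hi = min(region), max(region)
print(lo, hi)
```

In this assumed model the thresholded region is, up to the grid resolution, the familiar interval x ± 1.96, so the 95% coverage claim holds exactly rather than asymptotically.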

preprint, 2012, arXiv

An approximate Bayesian marginal likelihood approach for estimating finite mixtures

Estimation of finite mixture models when the mixing distribution support is unknown is an important problem. This paper gives a new approach based on a marginal likelihood for the unknown support. Motivated by a Bayesian Dirichlet prior model, a computationally efficient stochastic approximation version of the marginal likelihood is proposed and large-sample theory is presented. By restricting the support to a finite grid, a simulated annealing method is employed to maximize the marginal likelihood and estimate the support. Real and simulated data examples show that this novel stochastic approximation--simulated annealing procedure compares favorably to existing methods.

preprint, 2012, arXiv

Asymptotically optimal nonparametric empirical Bayes via predictive recursion

An empirical Bayes problem has an unknown prior to be estimated from data. The predictive recursion (PR) algorithm provides fast nonparametric estimation of mixing distributions and is ideally suited for empirical Bayes applications. This paper presents a general notion of empirical Bayes asymptotic optimality, and it is shown that PR-based procedures satisfy this property under certain conditions. As an application, the problem of in-season prediction of baseball batting averages is considered. There the PR-based empirical Bayes rule performs well in terms of prediction error and ability to capture the distribution of the latent features.

preprint, 2012, arXiv

On convergence rates of Bayesian predictive densities and posterior distributions

Frequentist-style large-sample properties of Bayesian posterior distributions, such as consistency and convergence rates, are important considerations in nonparametric problems. In this paper we give an analysis of Bayesian asymptotics based primarily on predictive densities. Our analysis is unified in the sense that essentially the same approach can be taken to develop convergence rate results in iid, mis-specified iid, independent non-iid, and dependent data cases.

preprint, 2012, arXiv

On epsilon-optimality of the pursuit learning algorithm

Estimator algorithms in learning automata are useful tools for adaptive, real-time optimization in computer science and engineering applications. This paper investigates theoretical convergence properties for a special case of estimator algorithms: the pursuit learning algorithm. In this note, we identify and fill a gap in existing proofs of probabilistic convergence for pursuit learning. It is traditional to take the pursuit learning tuning parameter to be fixed in practical applications, but our proof sheds light on the importance of a vanishing sequence of tuning parameters in a theoretical convergence analysis.

preprint, 2012, arXiv

Optimal inferential models for a Poisson mean

Statistical inference on the mean of a Poisson distribution is a fundamentally important problem with modern applications in, e.g., particle physics. The discreteness of the Poisson distribution makes this problem surprisingly challenging, even in the large-sample case. Here we propose a new approach, based on the recently developed framework of inferential models (IMs). Specifically, we construct optimal, or at least approximately optimal, IMs for two important classes of assertions/hypotheses about the Poisson mean. For point assertions, we develop a novel recursive sorting algorithm to construct this optimal IM. Numerical comparisons of the proposed method to existing methods are given, for both the mean and the more challenging mean-plus-background problem.

preprint, 2011, arXiv

A nonparametric empirical Bayes framework for large-scale multiple testing

We propose a flexible and identifiable version of the two-groups model, motivated by hierarchical Bayes considerations, that features an empirical null and a semiparametric mixture model for the non-null cases. We use a computationally efficient predictive recursion marginal likelihood procedure to estimate the model parameters, even the nonparametric mixing distribution. This leads to a nonparametric empirical Bayes testing procedure, which we call PRtest, based on thresholding the estimated local false discovery rates. Simulations and real-data examples demonstrate that, compared to existing approaches, PRtest's careful handling of the non-null density can give a much better fit in the tails of the mixture distribution which, in turn, can lead to more realistic conclusions.

preprint, 2011, arXiv

Convergence rate for predictive recursion estimation of finite mixtures

Predictive recursion (PR) is a fast stochastic algorithm for nonparametric estimation of mixing distributions in mixture models. It is known that the PR estimates of both the mixing and mixture densities are consistent under fairly mild conditions, but currently very little is known about the rate of convergence. Here I first investigate asymptotic convergence properties of the PR estimate under model misspecification in the special case of finite mixtures with known support. Tools from stochastic approximation theory are used to prove that the PR estimates converge, to the best Kullback--Leibler approximation, at a nearly root-$n$ rate. When the support is unknown, PR can be used to construct an objective function which, when optimized, yields an estimate of the support. I apply the known-support results to derive a rate of convergence for this modified PR estimate in the unknown support case, which compares favorably to known optimal rates.
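For readers unfamiliar with PR, a minimal sketch of the recursion in the finite, known-support case may help. It assumes a unit-variance normal kernel, a uniform initial guess, and weights w_i = (i + 1)^(-gamma) with gamma = 0.67. These choices, the simulated data, and the function names are illustrative assumptions and do not reproduce the paper's setup.

```python
import math
import random


def normal_pdf(x, mean, sd=1.0):
    """Density of N(mean, sd^2) at x (assumed kernel for this sketch)."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def predictive_recursion(data, support, gamma=0.67):
    """One pass of predictive recursion over `data` on a finite support.

    Returns the estimated mixing probabilities, one per support point.
    """
    m = len(support)
    p = [1.0 / m] * m                        # uniform initial guess (assumption)
    for i, x in enumerate(data, start=1):
        w = (i + 1) ** (-gamma)              # vanishing weight sequence (assumption)
        lik = [normal_pdf(x, u) for u in support]
        marginal = sum(l * q for l, q in zip(lik, p))
        # Convex combination of the current estimate and its one-point
        # Bayes update given the new observation x.
        p = [(1.0 - w) * q + w * l * q / marginal for l, q in zip(lik, p)]
    return p


# Hypothetical data: a 0.3 / 0.7 mixture of N(-2, 1) and N(2, 1).
rng = random.Random(0)
data = [rng.gauss(-2.0 if rng.random() < 0.3 else 2.0, 1.0) for _ in range(500)]
estimate = predictive_recursion(data, support=[-2.0, 0.0, 2.0])
print(estimate)
```

Each update costs one pass over the support, and the estimate depends on the order in which the observations are processed.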

preprint, 2011, arXiv

On the edit distance from $K_{2,t}$-free graphs II: Cases $t\geq 5$

The edit distance between two graphs on the same vertex set is defined to be the size of the symmetric difference of their edge sets. The edit distance function of a hereditary property, $\mathcal{H}$, is a function of $p$ and measures, asymptotically, the furthest graph with edge density $p$ from $\mathcal{H}$ under this metric. The edit distance function has proven to be difficult to compute for many hereditary properties. Some surprising connections to extremal graph theory problems, such as strongly regular graphs and the problem of Zarankiewicz, have been uncovered in attempts to compute various edit distance functions. In this paper, we address the hereditary property $\operatorname{Forb}(K_{2,t})$ when $t\geq 5$, the property of having no induced copy of the complete bipartite graph with 2 vertices in one class and $t$ in the other. This work continues from a prior paper by the authors. Employing an assortment of techniques and colored regularity graph constructions, we are able to extend the interval over which the edit distance function for this hereditary property is generally known and determine its maximum value for all odd $t$. We also explore several constructions to improve upon known upper bounds for the function.
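The basic definition in the first sentence can be made concrete with a short sketch. It covers only the raw edit distance between two graphs on a shared vertex set, not the asymptotic edit distance function studied in the paper; the function name and the example graphs are illustrative.

```python
def edit_distance(edges_g, edges_h):
    """Size of the symmetric difference of two undirected edge sets."""
    def normalize(edges):
        # Treat (u, v) and (v, u) as the same undirected edge.
        return {frozenset(e) for e in edges}

    return len(normalize(edges_g) ^ normalize(edges_h))


# A path and a cycle on the vertex set {0, 1, 2, 3}.
path = [(0, 1), (1, 2), (2, 3)]
cycle = [(1, 0), (1, 2), (2, 3), (3, 0)]
print(edit_distance(path, cycle))  # 1: the graphs differ only in the edge {0, 3}
```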

preprint, 2011, arXiv

Semiparametric inference in mixture models with predictive recursion marginal likelihood

Predictive recursion is an accurate and computationally efficient algorithm for nonparametric estimation of mixing densities in mixture models. In semiparametric mixture models, however, the algorithm fails to account for any uncertainty in the additional unknown structural parameter. As an alternative to existing profile likelihood methods, we treat predictive recursion as a filter approximation to fitting a fully Bayes model, whereby an approximate marginal likelihood of the structural parameter emerges and can be used for inference. We call this the predictive recursion marginal likelihood. Convergence properties of predictive recursion under model mis-specification also lead to an attractive construction of this new procedure. We show pointwise convergence of a normalized version of this marginal likelihood function. Simulations compare the performance of this new marginal likelihood approach to that of existing profile likelihood methods as well as Dirichlet process mixtures in density estimation. Mixed-effects models and an empirical Bayes multiple testing application in time series analysis are also considered.

preprint, 2011, arXiv

Stochastic Approximation and Newton's Estimate of a Mixing Distribution

Many statistical problems involve mixture models and the need for computationally efficient methods to estimate the mixing distribution has increased dramatically in recent years. Newton [Sankhya Ser. A 64 (2002) 306--322] proposed a fast recursive algorithm for estimating the mixing distribution, which we study as a special case of stochastic approximation (SA). We begin with a review of SA, some recent statistical applications, and the theory necessary for analysis of an SA algorithm, which includes Lyapunov functions and ODE stability theory. Then standard SA results are used to prove consistency of Newton's estimate in the case of a finite mixture. We also propose a modification of Newton's algorithm that allows for estimation of an additional unknown parameter in the model, and prove its consistency.

preprint, 2010, arXiv

Dempster--Shafer Theory and Statistical Inference with Weak Beliefs

The Dempster--Shafer (DS) theory is a powerful tool for probabilistic reasoning based on a formal calculus for combining evidence. DS theory has been widely used in computer science and engineering applications, but has yet to reach the statistical mainstream, perhaps because the DS belief functions do not satisfy long-run frequency properties. Recently, two of the authors proposed an extension of DS, called the weak belief (WB) approach, that can incorporate desirable frequency properties into the DS framework by systematically enlarging the focal elements. The present paper reviews and extends this WB approach. We present a general description of WB in the context of inferential models, its interplay with the DS calculus, and the maximal belief solution. New applications of the WB method in two high-dimensional hypothesis testing problems are given. Simulations show that the WB procedures, suitably calibrated, perform well compared to popular classical methods. Most importantly, the WB approach combines the probabilistic reasoning of DS with the desirable frequency properties of classical statistics.

preprint, 2010, arXiv

Lower bounds for identifying codes in some infinite grids

An $r$-identifying code on a graph $G$ is a set $C\subset V(G)$ such that for every vertex in $V(G)$, the intersection of the radius-$r$ closed neighborhood with $C$ is nonempty and unique. On a finite graph, the density of a code is $|C|/|V(G)|$, which naturally extends to a definition of density in certain infinite graphs which are locally finite. We present new lower bounds for densities of codes for some small values of $r$ in both the square and hexagonal grids.
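The definition above can be checked mechanically on a small finite graph. The sketch below uses a 6-cycle as a stand-in example; the paper itself concerns infinite square and hexagonal grids, and the function names and example code sets here are illustrative assumptions.

```python
from collections import deque


def closed_ball(adj, v, r):
    """Vertices within graph distance r of v, including v (breadth-first search)."""
    seen = {v}
    frontier = deque([(v, 0)])
    while frontier:
        u, d = frontier.popleft()
        if d == r:
            continue
        for w in adj[u]:
            if w not in seen:
                seen.add(w)
                frontier.append((w, d + 1))
    return seen


def is_identifying_code(adj, code, r=1):
    """True if every vertex has a nonempty, distinct radius-r trace on `code`."""
    code = set(code)
    signatures = set()
    for v in adj:
        sig = frozenset(closed_ball(adj, v, r) & code)
        if not sig or sig in signatures:
            return False
        signatures.add(sig)
    return True


def density(adj, code):
    """Density |C| / |V(G)| of a code on a finite graph."""
    return len(set(code)) / len(adj)


n = 6
cycle = {v: [(v - 1) % n, (v + 1) % n] for v in range(n)}
print(is_identifying_code(cycle, [0, 2, 4]), density(cycle, [0, 2, 4]))
```

On the 6-cycle with r = 1, the set {0, 2, 4} gives the traces {0}, {0, 2}, {2}, {2, 4}, {4}, {0, 4}, which are nonempty and pairwise distinct, so it is a 1-identifying code of density 1/2.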

48 published items

No-prior Bayes reIMagined: probabilistic approximations of inferential models

Elucidating Inferential Models with the Cauchy Distribution

Direct Gibbs posterior inference on risk minimizers: construction, concentration, and calibration

Generalized Bayes inference on a linear personalized minimum clinically important difference

Powers of Hamiltonian cycles in multipartite graphs

Validity, consonant plausibility measures, and conformal prediction

Asymptotically optimal inference in sparse sequence models with a simple data-dependent measure

Gibbs posterior inference on multivariate quantiles

Stochastic optimization for numerical evaluation of imprecise probabilities

Empirical priors for prediction in sparse high-dimensional linear regression

Variational approximations of empirical Bayes posteriors in high-dimensional linear models

Model-free posterior inference on the area under the receiver operating characteristic curve

Bayesian inference in high-dimensional linear models using an empirical correlation-adaptive prior

Empirical priors and posterior concentration rates for a monotone density

On nonparametric estimation of a mixing density via the predictive recursion algorithm

Avoiding rainbow induced subgraphs in vertex-colorings

Exact prior-free probabilistic inference in a class of non-regular models

On weighted Ramsey numbers

Valid uncertainty quantification about the model in a linear regression setting

A semiparametric scale-mixture regression model and predictive recursion maximum likelihood

On Posterior Concentration in Misspecified Models

Simulating from a gamma distribution with small shape parameter

Status Update of the MAJORANA DEMONSTRATOR Neutrinoless Double Beta Decay Experiment

Ultra-Low Noise Mechanically Cooled Germanium Detector

A note on p-values interpreted as plausibilities

Asymptotically minimax empirical Bayes estimation of a sparse normal mean vector

Conditional inferential models: combining information for prior-free probabilistic inference

Discussion: Foundations of Statistical Inference, Revisited

Exact prior-free probabilistic inference on the heritability coefficient in a linear mixed model

Frameworks for prior-free posterior probabilistic inference

Marginal inferential models: prior-free probabilistic inference on interest parameters

Plausibility functions and exact frequentist inference

Prior-free probabilistic prediction of future observations

A note on Bayesian convergence rates under local prior support conditions

Inferential models: A framework for prior-free posterior probabilistic inference

Random sets and exact confidence regions

An approximate Bayesian marginal likelihood approach for estimating finite mixtures

Asymptotically optimal nonparametric empirical Bayes via predictive recursion

On convergence rates of Bayesian predictive densities and posterior distributions

On epsilon-optimality of the pursuit learning algorithm

Optimal inferential models for a Poisson mean

A nonparametric empirical Bayes framework for large-scale multiple testing

Convergence rate for predictive recursion estimation of finite mixtures

On the edit distance from $K_{2,t}$-free graphs II: Cases $t\geq 5$

Semiparametric inference in mixture models with predictive recursion marginal likelihood

Stochastic Approximation and Newton's Estimate of a Mixing Distribution

Dempster--Shafer Theory and Statistical Inference with Weak Beliefs

Lower bounds for identifying codes in some infinite grids