Source author record

Peter McCullagh

Peter McCullagh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.ST · Statistics Theory · Methodology · math.CO · Applications · Computation

Catalog footprint

What is connected

10 works

6 topics

4 close collaborators


preprint · 2020 · arXiv

A likelihood analysis of quantile-matching transformations

Quantile matching is a strictly monotone transformation that sends the observed response values $\{y_1, \ldots, y_n\}$ to the quantiles of a given target distribution. A likelihood-based criterion is developed for comparing one target distribution with another in a linear-model setting.
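The transformation described in this abstract can be illustrated with a minimal sketch. It assumes distinct observations, a standard normal target, and plotting positions $(r - 1/2)/n$ for the value of rank $r$; the function name `quantile_match` and these choices are illustrative assumptions, not details taken from the paper.

```python
from statistics import NormalDist


def quantile_match(y, inv_cdf=NormalDist().inv_cdf):
    """Send each value in y to a target quantile according to its rank.

    Assumptions (not from the paper): values are distinct, and the value
    of rank r among n is sent to the target quantile at (r - 0.5) / n.
    """
    n = len(y)
    # Indices of y ordered from the smallest value to the largest.
    order = sorted(range(n), key=lambda i: y[i])
    z = [0.0] * n
    for rank, i in enumerate(order, start=1):
        z[i] = inv_cdf((rank - 0.5) / n)
    return z


y = [3.2, 0.5, 7.1, 1.8, 2.4]
z = quantile_match(y)
```

Because the map depends on the data only through ranks, it is strictly monotone on the observed values: sorting by `y` and sorting by `z` give the same order.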

preprint · 2016 · arXiv

Survival models and health sequences

Medical investigations focusing on patient survival often generate not only a failure time for each patient but also a sequence of measurements on patient health at annual or semi-annual check-ups while the patient remains alive. Such a sequence of random length accompanied by a survival time is called a survival process. Ordinarily, robust health is associated with longer survival, so the two parts of a survival process cannot be assumed independent. This paper is concerned with a general technique, time reversal, for constructing statistical models for survival processes. A revival model is a regression model in the sense that it incorporates covariate and treatment effects into both the distribution of survival times and the joint distribution of health outcomes. It also allows individual health outcomes to be used clinically for predicting the subsequent survival time.

preprint · 2016 · arXiv

Vital variables and survival processes

The focus of a survival study is partly on the distribution of survival times, and partly on the health or quality of life of patients while they live. Health varies over time, and survival is the most basic aspect of health, so the two aspects are closely intertwined. Depending on the nature of the study, a range of variables may be measured; some constant in time, others not; some regarded as responses, others as explanatory risk factors; some directly and personally health-related, others less directly so. This paper begins by classifying variables that may arise in such a setting, emphasizing, in particular, the mathematical distinction between vital and non-vital variables. We also examine various types of probabilistic relationships that may exist among variables. Independent evolution is an asymmetric relation, which is intended to encapsulate the notion of one process driving the other; $X$ is a driver of $Y$ if $X$ evolves independently of the history of $Y$. This concept arises in several places in the study of survival processes.

preprint · 2015 · arXiv

Reversible Markov structures on divisible set partitions

We study $k$-divisible partition structures, which are families of random set partitions whose block sizes are divisible by an integer $k=1,2,\ldots$. In this setting, exchangeability corresponds to the usual invariance under relabeling by arbitrary permutations; however, for $k>1$, the ordinary deletion maps on partitions no longer preserve divisibility, and so a random deletion procedure is needed to obtain a partition structure. We describe explicit Chinese restaurant-type seating rules for generating families of exchangeable $k$-divisible partitions that are consistent under random deletion. We further introduce the notion of Markovian partition structures, which are ensembles of exchangeable Markov chains on $k$-divisible partitions that are consistent under a random process of Markovian deletion. The Markov chains we study are reversible and refine the class of Markov chains introduced in J. Appl. Probab. 48(3):778-791.

preprint · 2015 · arXiv

The pilgrim process

Pilgrim's monopoly is a probabilistic process giving rise to a non-negative sequence $T_1, T_2,\ldots$ that is infinitely exchangeable, a natural model for time-to-event data. The one-dimensional marginal distributions are exponential. The rules are simple, the process is easy to generate sequentially, and a simple expression is available for both the joint density and the multivariate survivor function. There is a close connection with the Kaplan-Meier estimator of the survival distribution. Embedded within the process is an infinitely exchangeable ordered partition process connected to Markov branching processes in neutral evolutionary theory. Some aspects of the process, such as the distribution of the number of blocks, can be investigated analytically and confirmed by simulation. By ignoring the order, the embedded process can be considered as an infinitely exchangeable partition process, shown to be closely related to the Chinese restaurant process. A further connection to the Indian buffet process is also provided. Thus we establish a previously unknown link between the well-known Kaplan-Meier estimator and the important Ewens sampling formula.

preprint · 2015 · arXiv

Weak continuity of predictive distribution for Markov survival processes

We explore the concept of a consistent exchangeable survival process: a joint distribution of survival times in which the risk set evolves as a continuous-time Markov process with homogeneous transition rates. We show a correspondence with the de Finetti approach of constructing an exchangeable survival process by generating iid survival times conditional on a completely independent hazard measure. We describe several specific processes, showing how the number of blocks of tied failure times grows asymptotically with the number of individuals in each case. In particular, we show that the set of Markov survival processes with weakly continuous predictive distributions can be characterized by a two-dimensional family called the harmonic process. We end by applying these methods to data, showing how they can be easily extended to handle censoring.

preprint · 2014 · arXiv

A characterization of a Cauchy family on the complex space

It is shown that a family of distributions on the complex space is characterized as the only family such that the orbit of one distribution under a certain group of transformations on the complex space is the same as that under the group of affine transformations. The resulting family is compared with some existing families.

preprint · 2012 · arXiv

An asymptotic approximation for the permanent of a doubly stochastic matrix

A determinantal approximation is obtained for the permanent of a doubly stochastic matrix. For moderate-deviation matrix sequences, the asymptotic relative error is of order $O(n^{-1})$.

preprint · 2012 · arXiv

Classification based on a permanental process with cyclic approximation

We introduce a doubly stochastic marked point process model for supervised classification problems. Regardless of the number of classes or the dimension of the feature space, the model requires only 2 to 3 parameters for the covariance function. The classification criterion involves a permanental ratio for which an approximation using a polynomial-time cyclic expansion is proposed. The approximation is effective even if the feature region occupied by one class is a patchwork interlaced with regions occupied by other classes. An application to DNA microarray analysis indicates that the cyclic approximation is effective even for high-dimensional data. It can employ feature variables in an efficient way to reduce the prediction error significantly. This is critical when the true classification relies on non-reducible high-dimensional features.

preprint · 2011 · arXiv

On Bayes' theorem for improper mixtures

Although Bayes's theorem demands a prior that is a probability distribution on the parameter space, the calculus associated with Bayes's theorem sometimes generates sensible procedures from improper priors, Pitman's estimator being a good example. However, improper priors may also lead to Bayes procedures that are paradoxical or otherwise unsatisfactory, prompting some authors to insist that all priors be proper. This paper begins with the observation that an improper measure on Theta satisfying Kingman's countability condition is in fact a probability distribution on the power set. We show how to extend a model in such a way that the extended parameter space is the power set. Under an additional finiteness condition, which is needed for the existence of a sampling region, the conditions for Bayes's theorem are satisfied by the extension. Lack of interference ensures that the posterior distribution in the extended space is compatible with the original parameter space. Provided that the key finiteness condition is satisfied, this probabilistic analysis of the extended model may be interpreted as a vindication of improper Bayes procedures derived from the original model.
