Source author record

A. Philip Dawid

A. Philip Dawid appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory Methodology Applications Artificial Intelligence Genomics Machine Learning math.PR q-fin.PR q-fin.TR

Catalog footprint

What is connected

22works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

On Learnability under General Stochastic Processes

Statistical learning theory under independent and identically distributed (iid) sampling and online learning theory for worst case individual sequences are two of the best developed branches of learning theory. Statistical learning under general non-iid stochastic processes is less mature. We provide two natural notions of learnability of a function class under a general stochastic process. We show that both notions are in fact equivalent to online learnability. Our results hold for both binary classification and regression.

preprint2020arXiv

Decision-theoretic foundations for statistical causality

We develop a mathematical and interpretative foundation for the enterprise of decision-theoretic statistical causality (DT), which is a straightforward way of representing and addressing causal questions. DT reframes causal inference as "assisted decision-making", and aims to understand when, and how, I can make use of external data, typically observational, to help me solve a decision problem by taking advantage of assumed relationships between the data and my problem. The relationships embodied in any representation of a causal problem require deeper justification, which is necessarily context-dependent. Here we clarify the considerations needed to support applications of the DT methodology. Exchangeability considerations are used to structure the required relationships, and a distinction drawn between intention to treat and intervention to treat forms the basis for the enabling condition of "ignorability". We also show how the DT perspective unifies and sheds light on other popular formalisations of statistical causality, including potential responses and directed acyclic graphs.

preprint2017arXiv

A Note on Bayesian Model Selection for Discrete Data Using Proper Scoring Rules

We consider the problem of choosing between parametric models for a discrete observable, taking a Bayesian approach in which the within-model prior distributions are allowed to be improper. In order to avoid the ambiguity in the marginal likelihood function in such a case, we apply a homogeneous scoring rule. For the particular case of distinguishing between Poisson and Negative Binomial models, we conduct simulations that indicate that, applied prequentially, the method will consistently select the true model.

preprint2017arXiv

A Note on Prediction Markets

In a prediction market, individuals can sequentially place bets on the outcome of a future event. This leaves a trail of personal probabilities for the event, each being conditional on the current individual's private background knowledge and on the previously announced probabilities of other individuals, which give partial information about their private knowledge. By means of theory and examples, we revisit some results in this area. In particular, we consider the case of two individuals, who start with the same overall probability distribution but different private information, and then take turns in updating their probabilities. We note convergence of the announced probabilities to a limiting value, which may or may not be the same as that based on pooling their private information.

preprint2015arXiv

A Commentary on Statistical Assessment of Violence Recidivism Risk

Increasing integration and availability of data on large groups of persons has been accompanied by proliferation of statistical and other algorithmic prediction tools in banking, insurance, marketiNg, medicine, and other FIelds (see e.g., Steyerberg (2009a;b)). Controversy may ensue when such tools are introduced to fields traditionally reliant on individual clinical evaluations. Such controversy has arisen about "actuarial" assessments of violence recidivism risk, i.e., the probability that someone found to have committed a violent act will commit another during a specified period. Recently Hart et al. (2007a) and subsequent papers from these authors in several reputable journals have claimed to demonstrate that statistical assessments of such risks are inherently too imprecise to be useful, using arguments that would seem to apply to statistical risk prediction quite broadly. This commentary examines these arguments from a technical statistical perspective, and finds them seriously mistaken in many particulars. They should play no role in reasoned discussions of violence recidivism risk assessment.

preprint2015arXiv

Bayesian Model Selection Based on Proper Scoring Rules

Bayesian model selection with improper priors is not well-defined because of the dependence of the marginal likelihood on the arbitrary scaling constants of the within-model prior densities. We show how this problem can be evaded by replacing marginal log-likelihood by a homogeneous proper scoring rule, which is insensitive to the scaling constants. Suitably applied, this will typically enable consistent selection of the true model.

preprint2015arXiv

Extended Conditional Independence and Applications in Causal Inference

The goal of this paper is to integrate the notions of stochastic conditional independence and variation conditional independence under a more general notion of extended conditional independence. We show that under appropriate assumptions the calculus that applies for the two cases separately (axioms of a separoid) still applies for the extended case. These results provide a rigorous basis for a wide range of statistical concepts, including ancillarity and sufficiency, and, in particular, the Decision Theoretic framework for statistical causality, which uses the language and calculus of conditional independence in order to express causal properties and make causal inferences.

preprint2015arXiv

Rejoinder to "Bayesian Model Selection Based on Proper Scoring Rules"

We are deeply appreciative of the initiative of the editor, Marina Vanucci, in commissioning a discussion of our paper, and extremely grateful to all the discussants for their insightful and thought-provoking comments. We respond to the discussions in alphabetical order [arXiv:1409.5291].

preprint2015arXiv

Structural Markov graph laws for Bayesian model uncertainty

This paper considers the problem of defining distributions over graphical structures. We propose an extension of the hyper Markov properties of Dawid and Lauritzen [Ann. Statist. 21 (1993) 1272-1317], which we term structural Markov properties, for both undirected decomposable and directed acyclic graphs, which requires that the structure of distinct components of the graph be conditionally independent given the existence of a separating component. This allows the analysis and comparison of multiple graphical structures, while being able to take advantage of the common conditional independence constraints. Moreover, we show that these properties characterise exponential families, which form conjugate priors under sampling from compatible Markov distributions.

preprint2014arXiv

Comparisons of Hyvärinen and pairwise estimators in two simple linear time series models

The aim of this paper is to compare numerically the performance of two estimators based on Hyvärinen's local homogeneous scoring rule with that of the full and the pairwise maximum likelihood estimators. In particular, two different model settings, for which both full and pairwise maximum likelihood estimators can be obtained, have been considered: the first order autoregressive model (AR(1)) and the moving average model (MA(1)). Simulation studies highlight very different behaviours for the Hyvärinen scoring rule estimators relative to the pairwise likelihood estimators in these two settings.

preprint2014arXiv

On Individual Risk

We survey a variety of possible explications of the term "Individual Risk." These in turn are based on a variety of interpretations of "Probability," including Classical, Enumerative, Frequency, Formal, Metaphysical, Personal, Propensity, Chance and Logical conceptions of Probability, which we review and compare. We distinguish between "groupist" and "individualist" understandings of Probability, and explore both "group to individual" (G2i) and "individual to group" (i2G) approaches to characterising Individual Risk. Although in the end that concept remains subtle and elusive, some pragmatic suggestions for progress are made.

preprint2014arXiv

Statistical Causality from a Decision-Theoretic Perspective

We present an overview of the decision-theoretic framework of statistical causality, which is well-suited for formulating and solving problems of determining the effects of applied causes. The approach is described in detail, and is related to and contrasted with other current formulations, such as structural equation models and potential responses. Topics and applications covered include confounding, the effect of treatment on the treated, instrumental variables, and dynamic treatment strategies.

preprint2014arXiv

Stochastic Mechanistic Interaction

We propose a fully probabilistic formulation of the notion of mechanistic interaction (interaction in some fundamental mechanistic sense) between the effects of putative (possibly continuous) causal factors A and B on a binary outcome variable Y indicating 'survival' vs 'failure'. We define mechanistic interaction in terms of departure from a generalized 'noisy OR' model, under which the multiplicative causal effect of A (resp., B) on the probability of failure cannot be enhanced by manipulating B (resp., A). We present conditions under which mechanistic interaction in the above sense can be assessed via simple tests on excess risk or superadditivity, in a possibly retrospective regime of observation. These conditions are defined in terms of generalized conditional independence relationships (generalised because they may involve non-stochastic 'regime indicators') that can often be checked on a graphical representation of the problem. Inference about mechanistic interaction between direct, or path-specific, causal effects can be accommodated in the proposed framework. The method is illustrated with the aid of a study in experimental psychology.

preprint2014arXiv

Theory and Applications of Proper Scoring Rules

We give an overview of some uses of proper scoring rules in statistical inference, including frequentist estimation theory and Bayesian model selection with improper priors.

preprint2013arXiv

A Formal Treatment of Sequential Ignorability

Taking a rigorous formal approach, we consider sequential decision problems involving observable variables, unobservable variables, and action variables. We can typically assume the property of extended stability, which allows identification (by means of G-computation) of the consequence of a specified treatment strategy if the unobserved variables are, in fact, observed - but not generally otherwise. However, under certain additional special conditions we can infer simple stability (or sequential ignorability), which supports G-computation based on the observed variables alone. One such additional condition is sequential randomization, where the unobserved variables essentially behave as random noise in their effects on the actions. Another is sequential irrelevance, where the unobserved variables do not influence future observed variables. In the latter case, to deduce sequential ignorability in full generality requires additional positivity conditions. We show here that these positivity conditions are not required when all variables are discrete.

preprint2013arXiv

Retrospective-prospective symmetry in the likelihood and Bayesian analysis of case-control studies

Prentice & Pyke (1979) established that the maximum likelihood estimate of an odds-ratio in a case-control study is the same as would be found by running a logistic regression: in other words, for this specific target the incorrect prospective model is inferentially equivalent to the correct retrospective model. Similar results have been obtained for other models, and conditions have also been identified under which the corresponding Bayesian property holds, namely that the posterior distribution of the odds-ratio be the same, whether computed using the prospective or the retrospective likelihood. Here we demonstrate how these results follow directly from certain parameter independence properties of the models and priors, and identify prior laws that support such reverse analysis, for both standard and stratified designs.

preprint2012arXiv

Proper local scoring rules

We investigate proper scoring rules for continuous distributions on the real line. It is known that the log score is the only such rule that depends on the quoted density only through its value at the outcome that materializes. Here we allow further dependence on a finite number $m$ of derivatives of the density at the outcome, and describe a large class of such $m$-local proper scoring rules: these exist for all even $m$ but no odd $m$. We further show that for $m\geq2$ all such $m$-local rules can be computed without knowledge of the normalizing constant of the distribution.

preprint2012arXiv

Proper local scoring rules on discrete sample spaces

A scoring rule is a loss function measuring the quality of a quoted probability distribution $Q$ for a random variable $X$, in the light of the realized outcome $x$ of $X$; it is proper if the expected score, under any distribution $P$ for $X$, is minimized by quoting $Q=P$. Using the fact that any differentiable proper scoring rule on a finite sample space ${\mathcal{X}}$ is the gradient of a concave homogeneous function, we consider when such a rule can be local in the sense of depending only on the probabilities quoted for points in a nominated neighborhood of $x$. Under mild conditions, we characterize such a proper local scoring rule in terms of a collection of homogeneous functions on the cliques of an undirected graph on the space ${\mathcal{X}}$. A useful property of such rules is that the quoted distribution $Q$ need only be known up to a scale factor. Examples of the use of such scoring rules include Besag's pseudo-likelihood and Hyvärinen's method of ratio matching.

preprint2011arXiv

Probability-free pricing of adjusted American lookbacks

Consider an American option that pays G(X^*_t) when exercised at time t, where G is a positive increasing function, X^*_t := \sup_{s\le t}X_s, and X_s is the price of the underlying security at time s. Assuming zero interest rates, we show that the seller of this option can hedge his position by trading in the underlying security if he begins with initial capital X_0\int_{X_0}^{\infty}G(x)x^{-2}dx (and this is the smallest initial capital that allows him to hedge his position). This leads to strategies for trading that are always competitive both with a given strategy's current performance and, to a somewhat lesser degree, with its best performance so far. It also leads to methods of statistical testing that avoid sacrificing too much of the maximum statistical significance that they achieve in the course of accumulating data.

preprint2010arXiv

Deep determinism and the assessment of mechanistic interaction between categorical and continuous variables

Our aim is to detect mechanistic interaction between the effects of two causal factors on a binary response, as an aid to identifying situations where the effects are mediated by a common mechanism. We propose a formalization of mechanistic interaction which acknowledges asymmetries of the kind "factor A interferes with factor B, but not viceversa". A class of tests for mechanistic interaction is proposed, which works on discrete or continuous causal variables, in any combination. Conditions under which these tests can be applied under a generic regime of data collection, be it interventional or observational, are discussed in terms of conditional independence assumptions within the framework of Augmented Directed Graphs. The scientific relevance of the method and the practicality of the graphical framework are illustrated with the aid of two studies in coronary artery disease. Our analysis relies on the "deep determinism" assumption that there exists some relevant set V - possibly unobserved - of "context variables", such that the response Y is a deterministic function of the values of V and of the causal factors of interest. Caveats regarding this assumption in real studies are discussed.

preprint2010arXiv

Identifying the consequences of dynamic treatment strategies: A decision-theoretic overview

We consider the problem of learning about and comparing the consequences of dynamic treatment strategies on the basis of observational data. We formulate this within a probabilistic decision-theoretic framework. Our approach is compared with related work by Robins and others: in particular, we show how Robins's 'G-computation' algorithm arises naturally from this decision-theoretic perspective. Careful attention is paid to the mathematical and substantive conditions required to justify the use of this formula. These conditions revolve around a property we term stability, which relates the probabilistic behaviours of observational and interventional regimes. We show how an assumption of 'sequential randomization' (or 'no unmeasured confounders'), or an alternative assumption of 'sequential irrelevance', can be used to infer stability. Probabilistic influence diagrams are used to simplify manipulations, and their power and limitations are discussed. We compare our approach with alternative formulations based on causal DAGs or potential response models. We aim to show that formulating the problem of assessing dynamic treatment strategies as a problem of decision analysis brings clarity, simplicity and generality.

preprint2010arXiv

Insuring against loss of evidence in game-theoretic probability

We consider the game-theoretic scenario of testing the performance of Forecaster by Sceptic who gambles against the forecasts. Sceptic's current capital is interpreted as the amount of evidence he has found against Forecaster. Reporting the maximum of Sceptic's capital so far exaggerates the evidence. We characterize the set of all increasing functions that remove the exaggeration. This result can be used for insuring against loss of evidence.

A. Philip Dawid

What is connected

Connect this record

See the researcher in context

Building this map preview

22 published item(s)

On Learnability under General Stochastic Processes

Decision-theoretic foundations for statistical causality

A Note on Bayesian Model Selection for Discrete Data Using Proper Scoring Rules

A Note on Prediction Markets

A Commentary on Statistical Assessment of Violence Recidivism Risk

Bayesian Model Selection Based on Proper Scoring Rules

Extended Conditional Independence and Applications in Causal Inference

Rejoinder to "Bayesian Model Selection Based on Proper Scoring Rules"

Structural Markov graph laws for Bayesian model uncertainty

Comparisons of Hyvärinen and pairwise estimators in two simple linear time series models

On Individual Risk

Statistical Causality from a Decision-Theoretic Perspective

Stochastic Mechanistic Interaction

Theory and Applications of Proper Scoring Rules

A Formal Treatment of Sequential Ignorability

Retrospective-prospective symmetry in the likelihood and Bayesian analysis of case-control studies

Proper local scoring rules

Proper local scoring rules on discrete sample spaces

Probability-free pricing of adjusted American lookbacks

Deep determinism and the assessment of mechanistic interaction between categorical and continuous variables

Identifying the consequences of dynamic treatment strategies: A decision-theoretic overview

Insuring against loss of evidence in game-theoretic probability