Researcher profile

Mike Baiocchi

Mike Baiocchi contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

Statistical matching and subclassification with a continuous dose: characterization, algorithm, and application to a health outcomes study

Subclassification and matching are often used in empirical studies to adjust for observed covariates; however, they are largely restricted to relatively simple study designs with a binary treatment and less developed for designs with a continuous exposure. Matching with exposure doses is particularly useful in instrumental variable designs and in understanding the dose-response relationships. In this article, we propose two criteria for optimal subclassification based on subclass homogeneity in the context of having a continuous exposure dose, and propose an efficient polynomial-time algorithm that is guaranteed to find an optimal subclassification with respect to one criterion and serves as a 2-approximation algorithm for the other criterion. We discuss how to incorporate dose and use appropriate penalties to control the number of subclasses in the design. Via extensive simulations, we systematically compare our proposed design to optimal non-bipartite pair matching, and demonstrate that combining our proposed subclassification scheme with regression adjustment helps reduce model dependence for parametric causal inference with a continuous dose. We apply the new design and associated randomization-based inferential procedure to study the effect of transesophageal echocardiography (TEE) monitoring during coronary artery bypass graft (CABG) surgery on patients' post-surgery clinical outcomes using Medicare and Medicaid claims data, and find evidence that TEE monitoring lowers patients' all-cause $30$-day mortality rate.

preprint2020arXiv

A Causal Machine Learning Framework for Predicting Preventable Hospital Readmissions

Clinical predictive algorithms are increasingly being used to form the basis for optimal treatment policies--that is, to enable interventions to be targeted to the patients who will presumably benefit most. Despite taking advantage of recent advances in supervised machine learning, these algorithms remain, in a sense, blunt instruments--often being developed and deployed without a full accounting of the causal aspects of the prediction problems they are intended to solve. Indeed, in many settings, including among patients at risk of readmission, the riskiest patients may derive less benefit from a preventative intervention compared to those at lower risk. Moreover, targeting an intervention to a population, rather than limiting it to a small group of high-risk patients, may lead to far greater overall utility if the patients with the most modifiable (or preventable) outcomes across the population could be identified. Based on these insights, we introduce a causal machine learning framework that decouples this prediction problem into causal and predictive parts, which clearly delineates the complementary roles of causal inference and prediction in this problem. We estimate treatment effects using causal forests, and characterize treatment effect heterogeneity across levels of predicted risk using these estimates. Furthermore, we show how these effect estimates could be used in concert with the modeled "payoffs" associated with successful prevention of individual readmissions to maximize overall utility. Based on data taken from before and after the implementation of a readmissions prevention intervention at Kaiser Permanente Northern California, our results suggest that nearly four times as many readmissions could be prevented annually with this approach compared to targeting this intervention using predicted risk.