Researcher profile

Jonathan R. Walsh

Jonathan R. Walsh contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Digital Twins as Synthetic Controls in Single-Arm Trials

Single-arm trials are an important study design for evaluating drug efficacy and safety without enrolling patients into a control arm. Although they do not provide the gold-standard evidence of randomized controlled trials, they are increasingly used in clinical development as they offer an efficient, ethical, and practical alternative. A wide variety of approaches can be used to construct control comparators and estimate treatment effects, from fixed comparators informed by clinical knowledge to data-based and model-based patient-level comparators, also known as synthetic controls. Powerful and flexible machine learning models can allow outcome-model-based synthetic controls to overcome key limitations of direct data-based approaches, yield more robust estimates of treatment effects, and provide a principled way to incorporate corrections or encode additional assumptions when external data are not directly comparable. In this work, we argue that outcome-model-based synthetic control arms are an important tool for single-arm trials. We focus on digital twins, personalized predictions of disease progression generated from machine learning models trained on historical datasets, which naturally leverage these flexible approaches. We review doubly robust estimators, present power and sample size formulas, and discuss trade-offs in selecting historical data for training and analysis. We also outline practical considerations for deploying digital twins within the framework of recent FDA draft guidance on the use of artificial intelligence in drug development. Finally, we reanalyze data from trials in amyotrophic lateral sclerosis and Huntington's disease to demonstrate the proposed methods.

preprint2020arXiv

Generating Digital Twins with Multiple Sclerosis Using Probabilistic Neural Networks

Multiple Sclerosis (MS) is a neurodegenerative disorder characterized by a complex set of clinical assessments. We use an unsupervised machine learning model called a Conditional Restricted Boltzmann Machine (CRBM) to learn the relationships between covariates commonly used to characterize subjects and their disease progression in MS clinical trials. A CRBM is capable of generating digital twins, which are simulated subjects having the same baseline data as actual subjects. Digital twins allow for subject-level statistical analyses of disease progression. The CRBM is trained using data from 2395 subjects enrolled in the placebo arms of clinical trials across the three primary subtypes of MS. We discuss how CRBMs are trained and show that digital twins generated by the model are statistically indistinguishable from their actual subject counterparts along a number of measures.

preprint2012arXiv

Disentangling Clustering Effects in Jet Algorithms

Clustering algorithms build jets though the iterative application of single particle and pairwise metrics. This leads to phase space constraints that are extremely complicated beyond the lowest orders in perturbation theory, and in practice they must be implemented numerically. This complication presents a significant barrier to gaining an analytic understanding of the perturbative structure of jet cross sections. We present a novel framework to express the jet algorithm's phase space constraints as a function of clustered groups of particles, which are the possible outcomes of the algorithm. This approach highlights the analytic properties of jet observables, rather than the explicit constraints on individual final state momenta, which can be unwieldy at higher orders. We derive the form of the n-particle phase space constraints for a jet algorithm with any measurement. We provide an expression for the measurement that makes clustering effects manifest and relates them to constraints from clustering at lower orders. The utility of this framework is demonstrated by using it to understand clustering effects for a large class of jet shape observables in the soft/collinear limit. We apply this framework to isolate divergences and analyze the logarithmic structure of the Abelian terms in the soft function, providing the all-orders form of these terms and showing that corrections from clustering start at next-to-leading logarithmic order in the exponent of the cross section.

preprint2010arXiv

Consistent Factorization of Jet Observables in Exclusive Multijet Cross-Sections

We demonstrate the consistency at the next-to-leading-logarithmic (NLL) level of a factorization theorem based on Soft-Collinear Effective Theory (SCET) for jet shapes in e+e- collisions. We consider measuring jet observables in exclusive multijet final states defined with cone and k_T-type jet algorithms. Consistency of the factorization theorem requires that the renormalization group evolution of hard, jet, and soft functions is such that the physical cross-section is independent of the factorization scale mu. The anomalous dimensions of the various factorized pieces, however, depend on the color representation of jets, choice of jet observable, the number of jets whose shapes are measured, and the jet algorithm, making it highly nontrivial to satisfy the consistency condition. We demonstrate the intricate cancellations between anomalous dimensions that occur at the NLL level, so that, up to power corrections that we identify, our factorization of the jet shape distributions is consistent for any number of quark and gluon jets, for any number of jets whose shapes are measured or unmeasured, for any angular size R of the jets, and for any of the algorithms we consider. Corrections to these results are suppressed by the SCET expansion parameter lambda (the ratio of soft to collinear or collinear to hard scales) and in the jet separation measure 1/t^2 = tan^2(R/2)/tan^2(psi/2), where psi is the angular separation between jets. Our results can be used to calculate a wide variety of jet observables in multijet final states to NLL accuracy.

preprint2010arXiv

Jet Shapes and Jet Algorithms in SCET

Jet shapes are weighted sums over the four-momenta of the constituents of a jet and reveal details of its internal structure, potentially allowing discrimination of its partonic origin. In this work we make predictions for quark and gluon jet shape distributions in N-jet final states in e+e- collisions, defined with a cone or recombination algorithm, where we measure some jet shape observable on a subset of these jets. Using the framework of Soft-Collinear Effective Theory, we prove a factorization theorem for jet shape distributions and demonstrate the consistent renormalization-group running of the functions in the factorization theorem for any number of measured and unmeasured jets, any number of quark and gluon jets, and any angular size R of the jets, as long as R is much smaller than the angular separation between jets. We calculate the jet and soft functions for angularity jet shapes τ_a to one-loop order (O(alpha_s)) and resum a subset of the large logarithms of τ_a needed for next-to-leading logarithmic (NLL) accuracy for both cone and kT-type jets. We compare our predictions for the resummed τ_a distribution of a quark or a gluon jet produced in a 3-jet final state in e+e- annihilation to the output of a Monte Carlo event generator and find that the dependence on a and R is very similar.