Researcher profile

Daniele Bertolini

Daniele Bertolini contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Digital Twins as Synthetic Controls in Single-Arm Trials

Single-arm trials are an important study design for evaluating drug efficacy and safety without enrolling patients into a control arm. Although they do not provide the gold-standard evidence of randomized controlled trials, they are increasingly used in clinical development as they offer an efficient, ethical, and practical alternative. A wide variety of approaches can be used to construct control comparators and estimate treatment effects, from fixed comparators informed by clinical knowledge to data-based and model-based patient-level comparators, also known as synthetic controls. Powerful and flexible machine learning models can allow outcome-model-based synthetic controls to overcome key limitations of direct data-based approaches, yield more robust estimates of treatment effects, and provide a principled way to incorporate corrections or encode additional assumptions when external data are not directly comparable. In this work, we argue that outcome-model-based synthetic control arms are an important tool for single-arm trials. We focus on digital twins, personalized predictions of disease progression generated from machine learning models trained on historical datasets, which naturally leverage these flexible approaches. We review doubly robust estimators, present power and sample size formulas, and discuss trade-offs in selecting historical data for training and analysis. We also outline practical considerations for deploying digital twins within the framework of recent FDA draft guidance on the use of artificial intelligence in drug development. Finally, we reanalyze data from trials in amyotrophic lateral sclerosis and Huntington's disease to demonstrate the proposed methods.

preprint2026arXiv

FRESH: Information-Geometric Calibration of Patient-Level Models to Aggregate Evidence

This note introduces FRESH (Fusion of Recent Evidence and Subject Histories), a method for incorporating population-level summary results -- published clinical trials, registry summaries, prior natural-history studies, and peer-reviewed indirect comparisons -- into predictive models trained on patient-level data. This method provides a principled means of combining both patient-level and aggregate-level data types into a unified data-efficient model for clinical decision making. FRESH assumes access to a generative model trained on patient-level data sources (e.g. clinical trial or real-world data). The method produces patient-level predictions from a re-calibrated model that matches a set of specified aggregate statistics for a target population. This can be understood as a patient-level recapitulation of the aggregate source -- with the key property that the recalibration is a minimal perturbation of the original joint distribution in a specific information-geometric sense. The resulting samples can be analyzed directly or combined into a post-training procedure to update the original generative model. This approach enables several applications where rigorously incorporating patient-level data with summary information is valuable, including (i) contextualizing single-arm trial results with respect to recent standard-of-care, (ii) clinical-trial simulations for design and probability-of-technical-success estimation, and (iii) comparative-effectiveness analyses of on-market therapies.

preprint2020arXiv

Incidence Networks for Geometric Deep Learning

Sparse incidence tensors can represent a variety of structured data. For example, we may represent attributed graphs using their node-node, node-edge, or edge-edge incidence matrices. In higher dimensions, incidence tensors can represent simplicial complexes and polytopes. In this paper, we formalize incidence tensors, analyze their structure, and present the family of equivariant networks that operate on them. We show that any incidence tensor decomposes into invariant subsets. This decomposition, in turn, leads to a decomposition of the corresponding equivariant linear maps, for which we prove an efficient pooling-and-broadcasting implementation.

preprint2016arXiv

Non-Gaussian Covariance of the Matter Power Spectrum in the Effective Field Theory of Large Scale Structure

We compute the non-Gaussian contribution to the covariance of the matter power spectrum at one-loop order in Standard Perturbation Theory (SPT), and using the framework of the effective field theory (EFT) of large scale structure (LSS). The complete one-loop contributions are evaluated for the first time, including the leading EFT corrections that involve seven independent operators, of which four appear in the power spectrum and bispectrum. We compare the non-Gaussian part of the one-loop covariance computed with both SPT and EFT of LSS to two separate simulations. In one simulation, we find that the one-loop prediction from SPT reproduces the simulation well to $k_i + k_j \sim$ 0.25 h/Mpc, while in the other simulation we find a substantial improvement of EFT of LSS (with one free parameter) over SPT, more than doubling the range of $k$ where the theory accurately reproduces the simulation. The disagreement between these two simulations points to unaccounted for systematics, highlighting the need for improved numerical and analytic understanding of the covariance.

preprint2016arXiv

The Trispectrum in the Effective Field Theory of Large Scale Structure

We compute the connected four point correlation function (the trispectrum in Fourier space) of cosmological density perturbations at one-loop order in Standard Perturbation Theory (SPT) and the Effective Field Theory of Large Scale Structure (EFT of LSS). This paper is a companion to our earlier work on the non-Gaussian covariance of the matter power spectrum, which corresponds to a particular wavenumber configuration of the trispectrum. In the present calculation, we highlight and clarify some of the subtle aspects of the EFT framework that arise at third order in perturbation theory for general wavenumber configurations of the trispectrum. We consistently incorporate vorticity and non-locality in time into the EFT counterterms and lay out a complete basis of building blocks for the stress tensor. We show predictions for the one-loop SPT trispectrum and the EFT contributions, focusing on configurations which have particular relevance for using LSS to constrain primordial non-Gaussianity.

preprint2015arXiv

The First Calculation of Fractional Jets

In collider physics, jet algorithms are a ubiquitous tool for clustering particles into discrete jet objects. Event shapes offer an alternative way to characterize jets, and one can define a jet multiplicity event shape, which can take on fractional values, using the framework of "jets without jets". In this paper, we perform the first analytic studies of fractional jet multiplicity $\tilde{N}_{\rm jet}$ in the context of $e^+e^-$ collisions. We use fixed-order QCD to understand the $\tilde{N}_{\rm jet}$ cross section at order $α_s^2$, and we introduce a candidate factorization theorem to capture certain higher-order effects. The resulting distributions have a hybrid jet algorithm/event shape behavior which agrees with parton shower Monte Carlo generators. The $\tilde{N}_{\rm jet}$ observable does not satisfy ordinary soft-collinear factorization, and the $\tilde{N}_{\rm jet}$ cross section exhibits a number of unique features, including the absence of collinear logarithms and the presence of soft logarithms that are purely non-global. Additionally, we find novel divergences connected to the energy sharing between emissions, which are reminiscent of rapidity divergences encountered in other applications. Given these interesting properties of fractional jet multiplicity, we advocate for future measurements and calculations of $\tilde{N}_{\rm jet}$ at hadron colliders like the LHC.

preprint2014arXiv

Jet Observables Without Jet Algorithms

We introduce a new class of event shapes to characterize the jet-like structure of an event. Like traditional event shapes, our observables are infrared/collinear safe and involve a sum over all hadrons in an event, but like a jet clustering algorithm, they incorporate a jet radius parameter and a transverse momentum cut. Three of the ubiquitous jet-based observables---jet multiplicity, summed scalar transverse momentum, and missing transverse momentum---have event shape counterparts that are closely correlated with their jet-based cousins. Due to their "local" computational structure, these jet-like event shapes could potentially be used for trigger-level event selection at the LHC. Intriguingly, the jet multiplicity event shape typically takes on non-integer values, highlighting the inherent ambiguity in defining jets. By inverting jet multiplicity, we show how to characterize the transverse momentum of the n-th hardest jet without actually finding the constituents of that jet. Since many physics applications do require knowledge about the jet constituents, we also build a hybrid event shape that incorporates (local) jet clustering information. As a straightforward application of our general technique, we derive an event-shape version of jet trimming, allowing event-wide jet grooming without explicit jet identification. Finally, we briefly mention possible applications of our method for jet substructure studies.

preprint2014arXiv

Pileup Per Particle Identification

We propose a new method for pileup mitigation by implementing "pileup per particle identification" (PUPPI). For each particle we first define a local shape $α$ which probes the collinear versus soft diffuse structure in the neighborhood of the particle. The former is indicative of particles originating from the hard scatter and the latter of particles originating from pileup interactions. The distribution of $α$ for charged pileup, assumed as a proxy for all pileup, is used on an event-by-event basis to calculate a weight for each particle. The weights describe the degree to which particles are pileup-like and are used to rescale their four-momenta, superseding the need for jet-based corrections. Furthermore, the algorithm flexibly allows combination with other, possibly experimental, probabilistic information associated with particles such as vertexing and timing performance. We demonstrate the algorithm improves over existing methods by looking at jet $p_T$ and jet mass. We also find an improvement on non-jet quantities like missing transverse energy.

preprint2013arXiv

TASI 2012: Super-Tricks for Superspace

These lectures from the TASI 2012 summer school outline the basics of supersymmetry (SUSY) in 3+1 dimensions. Starting from a ground-up development of superspace, we develop all of the tools necessary to construct SUSY lagrangians. While aimed at an introductory level, these lectures incorporate a number of "super-tricks" for SUSY aficionados, including SUSY-covariant derivatives, equations of motion in superspace, background field methods, and non-linear realizations of goldstinos.

preprint2012arXiv

The Social Higgs

Using published Higgs search data we investigate whether any evidence supports the possibility that the Higgs may be mixed with other neutral scalars. We combine the positive evidence for the Higgs at 125.5 GeV with search constraints at other masses to explore the viability of two simple models. The first Higgs 'friend' model is simply a neutral scalar mixed with the Higgs. In the second Higgs 'accomplice' model the new scalar has an enhanced coupling to photons due to couplings to additional charged fields. We find that the latter scenario allows improvement in fitting the data by accommodating enhanced diphoton rates and suppression in other channels for a Higgs mass of 125.5 GeV. Small excesses at other masses allow the additional scalar to further improve the fit to the data, particularly if it has mass in the vicinity of 210 GeV. Due to observed event rates at 125.5 GeV and strong limits in high mass Higgs searches, mixing angles greater than pi/4 are typically disfavored at the 95% confidence level, depending on the mass of the scalar.

preprint2012arXiv

Visible Supersymmetry Breaking and an Invisible Higgs

If there are multiple hidden sectors which independently break supersymmetry, then the spectrum will contain multiple goldstini. In this paper, we explore the possibility that the visible sector might also break supersymmetry, giving rise to an additional pseudo-goldstino. By the standard lore, visible sector supersymmetry breaking is phenomenologically excluded by the supertrace sum rule, but this sum rule is relaxed with multiple supersymmetry breaking. However, we find that visible sector supersymmetry breaking is still phenomenologically disfavored, not because of a sum rule, but because the visible sector pseudo-goldstino is generically overproduced in the early universe. A way to avoid this cosmological bound is to ensure that an R symmetry is preserved in the visible sector up to supergravity effects. A key expectation of this R-symmetric case is that the Higgs boson will dominantly decay invisibly at the LHC.