Researcher profile

Rob Phillips

Rob Phillips contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Conditioning as a route to stereotyped behavior in growing populations

Biological systems perform complex multi-step processes in a reproducible way despite underlying stochasticity. The standard explanation is micromanagement by molecular machinery that recognizes and corrects specific errors. Here we study conditioning, a qualitatively different strategy in which attempts failing a coarse criterion are destroyed and do not leave a physical record. The surviving, i.e., conditioned, ensemble is narrower and therefore more ordered. We model conditioning through stochastic resets in a ''socks-before-shoes'' model of a growing population, where $n$ actions must be completed in any order to replicate and any replication attempt not finished by a threshold time is discarded. We find that resets impose hierarchical temporal ordering of the $n$ actions without microscopic control over which action happens when. When disorder carries a sufficient time penalty, this ordering is free: the fastest-growing population is automatically the most ordered, with no direct selection for order required. Save points, at which verified progress is preserved across resets, allow conditioning to scale to complex multi-step processes. Conditioning provides a minimal route to reliable behavior, requiring only a clock rather than molecular machinery that recognizes specific errors. For the right class of processes, it pays for itself.

preprint2026arXiv

Informational blueprints reveal condition-dependent gene regulatory architectures

While coding regions in the genome have a direct interpretation in terms of protein products, significant fractions are non-coding and yet control essential biological functions. Unlike the genetic code, there is no "lookup table" that identifies where regulatory proteins, known as transcription factors (TFs), bind. Here, we extract these binding sites by distilling sequences of nucleotide letters into collective coordinates (hyperletters) representing the binding sites that are active under specific environmental conditions. Going beyond local information footprints between individual bases and expression levels, our $\textit{information blueprint}$ algorithm compresses the global information by optimising filters that simultaneously scan an entire promoter sequence. Inspired by renormalisation-group techniques, we identify TF binding sites as coarse-grained variables combining groups of correlated mutations with the highest collective impact on gene expression. We validate our approach on experimental data for $\textit{E. coli}$ and discover novel regulatory elements illustrating its deployment at scale across growth conditions.

preprint2022arXiv

Modeling and mechanical perturbations reveal how spatially regulated anchorage gives rise to spatially distinct mechanics across the mammalian spindle

During cell division, the spindle generates force to move chromosomes. In mammals, microtubule bundles called kinetochore-fibers (k-fibers) attach to and segregate chromosomes. To do so, k-fibers must be robustly anchored to the dynamic spindle. We previously developed microneedle manipulation to mechanically challenge k-fiber anchorage, and observed spatially distinct response features revealing the presence of heterogeneous anchorage (Suresh et al. 2020). How anchorage is precisely spatially regulated, and what forces are necessary and sufficient to recapitulate the k-fiber's response to force remain unclear. Here, we develop a coarse-grained k-fiber model and combine with manipulation experiments to infer underlying anchorage using shape analysis. By systematically testing different anchorage schemes, we find that forces solely at k-fiber ends are sufficient to recapitulate unmanipulated k-fiber shapes, but not manipulated ones for which lateral anchorage over a 3 $μ$m length scale near chromosomes is also essential. Such anchorage robustly preserves k-fiber orientation near chromosomes while allowing pivoting around poles. Anchorage over a shorter length scale cannot robustly restrict pivoting near chromosomes, while anchorage throughout the spindle obstructs pivoting at poles. Together, this work reveals how spatially regulated anchorage gives rise to spatially distinct mechanics in the mammalian spindle, which we propose are key for function.

preprint2021arXiv

Programming Boundary Deformation Patterns in Active Networks

Active materials take advantage of their internal sources of energy to self-organize in an automated manner. This feature provides a novel opportunity to design micron-scale machines with minimal required control. However, self-organization goes hand in hand with predetermined dynamics that are hardly susceptible to environmental perturbations. Therefore utilizing this feature of active systems requires harnessing and directing the macroscopic dynamics to achieve specific functions; which in turn necessitates understanding the underlying mechanisms of active forces. Here we devise an optical control protocol to engineer the dynamics of active networks composed of microtubules and light-activatable motor proteins. The protocol enables carving activated networks of different shapes, and isolating them from the embedding solution. Studying a large set of shapes, we observe that the active networks contract in a shape-preserving manner that persists over the course of contraction. We formulate a coarse-grained theory and demonstrate that self-similarity of contraction is associated with viscous-like active stresses. These findings help us program the dynamics of the network through manipulating the light intensity in space and time, and maneuver the network into bending in specific directions, as well as temporally alternating directions. Our work improves understanding the active dynamics in contractile networks, and paves a new path towards engineering the dynamics of a large class of active materials.

preprint2021arXiv

Schrödinger's "What is Life?" at 75

2019 marked the 75th anniversary of the publication of Erwin Schrödinger's "What is Life?", a short book described by Roger Penrose in his preface to a reprint of this classic as "among the most influential scientific writings of the 20th century." In this article, I review the long argument made by Schrödinger as he mused on how the laws of physics could help us understand "the events in space and time which take place within the spatial boundary of a living organism." Though Schrödinger's book is often hailed for its influence on some of the titans who founded molecular biology, this article takes a different tack. Instead of exploring the way the book touched biologists such as James Watson and Francis Crick, as well as its critical reception by others such as Linus Pauling and Max Perutz, I argue that Schrödinger's classic is a timeless manifesto, rather than a dated historical curiosity. "What is Life?" is full of timely outlooks and approaches to understanding the mysterious living world that includes and surrounds us and can instead be viewed as a call to arms to tackle the great unanswered challenges in the study of living matter that remain for 21$^{st}$ century science.

preprint2021arXiv

The Anthropocene by the Numbers: A Quantitative Snapshot of Humanity's Influence on the Planet

The presence and action of humans on Earth has exerted a strong influence on the evolution of the planet over the past $\approx$ 10,000 years, the consequences of which are now becoming broadly evident. Despite a deluge of tightly-focused and necessarily technical studies exploring each facet of "human impacts" on the planet, their integration into a complete picture of the human-Earth system lags far behind. Here, we quantify twelve dimensionless ratios which put the magnitude of human impacts in context, comparing the magnitude of anthropogenic processes to their natural analogues. These ratios capture the extent to which humans alter the terrestrial surface, hydrosphere, biosphere, atmosphere, and biogeochemistry of Earth. In almost all twelve cases, the impact of human processes rivals or exceeds their natural counterparts. The values and corresponding uncertainties for these impacts at global and regional resolution are drawn from the primary scientific literature, governmental and international databases, and industry reports. We present this synthesis of the current "state of affairs" as a graphical snapshot designed to be used as a reference. Furthermore, we establish a searchable database termed the Human Impacts Database (www.anthroponumbers.org) which houses all quantities reported here and many others with extensive curation and annotation. While necessarily incomplete, this work collates and contextualizes a set of essential numbers summarizing the broad impacts of human activities on Earth's atmosphere, land, water, and biota.

preprint2020arXiv

A quantitative compendium of COVID-19 epidemiology

Accurate numbers are needed to understand and predict viral dynamics. Curation of high-quality literature values for the infectious period duration or household secondary attack rate, for example, is especially pressing currently because these numbers inform decisions about how and when to lockdown or reopen societies. We aim to provide a curated source for the key numbers that help us understand the virus driving our current global crisis. This compendium focuses solely on COVID-19 epidemiology. The numbers reported in summary format are substantiated by annotated references. For each property, we provide a concise definition, description of measurement and inference methods, and associated caveats. We hope this compendium will make essential numbers more accessible and avoid common sources of confusion for the many newcomers to the field such as using the incubation period to denote and quantify the latent period or using the hospitalization duration for the infectiousness period duration. This document will be repeatedly updated and the community is invited to participate in improving it.

preprint2020arXiv

Allostery and Kinetic Proofreading

Kinetic proofreading is an error correction mechanism present in the processes of the central dogma and beyond, and typically requires the free energy of nucleotide hydrolysis for its operation. Though the molecular players of many biological proofreading schemes are known, our understanding of how energy consumption is managed to promote fidelity remains incomplete. In our work, we introduce an alternative conceptual scheme called 'the piston model of proofreading' where enzyme activation through hydrolysis is replaced with allosteric activation achieved through mechanical work performed by a piston on regulatory ligands. Inspired by Feynman's ratchet and pawl mechanism, we consider a mechanical engine designed to drive the piston actions powered by a lowering weight, whose function is analogous to that of ATP synthase in cells. Thanks to its mechanical design, the piston model allows us to tune the 'knobs' of the driving engine and probe the graded changes and trade-offs between speed, fidelity and energy dissipation. It provides an intuitive explanation of the conditions necessary for optimal proofreading and reveals the unexpected capability of allosteric molecules to beat the Hopfield limit of fidelity by leveraging the diversity of states available to them. The framework that we built for the piston model can also serve as a basis for additional studies of driven biochemical systems.

preprint2020arXiv

Deciphering the regulatory genome of $\textit{Escherichia coli}$, one hundred promoters at a time

Advances in DNA sequencing have revolutionized our ability to read genomes. However, even in the most well-studied of organisms, the bacterium ${\it Escherichia coli}$, for $\approx$ 65$\%$ of the promoters we remain completely ignorant of their regulation. Until we have cracked this regulatory Rosetta Stone, efforts to read and write genomes will remain haphazard. We introduce a new method (Reg-Seq) linking a massively-parallel reporter assay and mass spectrometry to produce a base pair resolution dissection of more than 100 promoters in ${\it E. coli}$ in 12 different growth conditions. First, we show that our method recapitulates regulatory information from known sequences. Then, we examine the regulatory architectures for more than 80 promoters in the ${\it E. coli}$ genome which previously had no known regulation. In many cases, we also identify which transcription factors mediate their regulation. The method introduced here clears a path for fully characterizing the regulatory genome of model organisms, with the potential of moving on to an array of other microbes of ecological and medical relevance.

preprint2020arXiv

First-principles prediction of the information processing capacity of a simple genetic circuit

Given the stochastic nature of gene expression, genetically identical cells exposed to the same environmental inputs will produce different outputs. This heterogeneity has been hypothesized to have consequences for how cells are able to survive in changing environments. Recent work has explored the use of information theory as a framework to understand the accuracy with which cells can ascertain the state of their surroundings. Yet the predictive power of these approaches is limited and has not been rigorously tested using precision measurements. To that end, we generate a minimal model for a simple genetic circuit in which all parameter values for the model come from independently published data sets. We then predict the information processing capacity of the genetic circuit for a suite of biophysical parameters such as protein copy number and protein-DNA affinity. We compare these parameter-free predictions with an experimental determination of protein expression distributions and the resulting information processing capacity of E. coli cells. We find that our minimal model captures the scaling of the cell-to-cell variability in the data and the inferred information processing capacity of our simple genetic circuit up to a systematic deviation.

preprint2020arXiv

Proofreading through spatial gradients

Key enzymatic processes in biology use the nonequilibrium error correction mechanism called kinetic proofreading to enhance their specificity. Kinetic proofreading typically requires several dedicated structural features in the enzyme, such as a nucleotide hydrolysis site and multiple enzyme-substrate conformations that delay product formation. Such requirements limit the applicability and the adaptability of traditional proofreading schemes. Here, we explore an alternative conceptual mechanism of error correction that achieves delays between substrate binding and subsequent product formation by having these events occur at distinct physical locations. The time taken by the enzyme-substrate complex to diffuse from one location to another is leveraged to discard wrong substrates. This mechanism does not require dedicated structural elements on the enzyme, making it easier to overlook in experiments but also making proofreading tunable on the fly. We discuss how tuning the length scales of enzyme or substrate concentration gradients changes the fidelity, speed and energy dissipation, and quantify the performance limitations imposed by realistic diffusion and reaction rates in the cell. Our work broadens the applicability of kinetic proofreading, and sets the stage for the study of spatial gradients as a possible route to specificity.

preprint2020arXiv

Quantitative clarification of key questions about COVID-19 epidemiology

Modeling the spread of COVID-19 is crucial for informing public health policy. All models for COVID-19 epidemiology rely on parameters describing the dynamics of the infection process. The meanings of epidemiological parameters like R_0, R_t, the "serial interval" and "generation interval" can be challenging to understand, especially as these and other parameters are conceptually overlapping and sometimes confusingly named. Moreover, the procedures used to estimate these parameters make various assumptions and use different mathematical approaches that should be understood and accounted for when relying on parameter values and reporting them to the public. Here, we offer several insights regarding the derivation of commonly-reported epidemiological parameters, and describe how mitigation measures like lockdown are expected to affect their values. We aim to present these quantitative relationships in a manner that is accessible to the widest audience possible. We hope that better communicating the intricacies of epidemiological models will improve our collective understanding of their strengths and weaknesses, and will help avoid possible pitfalls when using them.

preprint2020arXiv

SARS-CoV-2 (COVID-19) by the numbers

The current SARS-CoV-2 pandemic is a harsh reminder of the fact that, whether in a single human host or a wave of infection across continents, viral dynamics is often a story about the numbers. In this snapshot, our aim is to provide a one-stop, curated graphical source for the key numbers that help us understand the virus driving our current global crisis. The discussion is framed around two broad themes: 1) the biology of the virus itself and 2) the characteristics of the infection of a single human host. Our one-page summary provides the key numbers pertaining to SARS-CoV-2, based mostly on peer-reviewed literature. The numbers reported in summary format are substantiated by the annotated references below. Readers are urged to remember that much uncertainty remains and knowledge of this pandemic and the virus driving it is rapidly evolving. In the paragraphs below we provide "back of the envelope" calculations that exemplify the insights that can be gained from knowing some key numbers and using quantitative logic. These calculations serve to improve our intuition through sanity checks, but do not replace detailed epidemiological analysis.

preprint2019arXiv

Harnessing Avidity: Quantifying Entropic and Energetic Effects of Linker Length and Rigidity Required for Multivalent Binding of Antibodies to HIV-1 Spikes

Due to the low density of envelope (Env) spikes on the surface of HIV-1, neutralizing IgG antibodies rarely bind bivalently using both antigen-binding arms (Fabs) to crosslink between spikes (inter-spike crosslinking), instead resorting to weaker monovalent binding that is more sensitive to Env mutations. Synthetic antibodies designed to bivalently bind a single Env trimer (intra-spike crosslinking) were previously shown to exhibit increased neutralization potencies. In initial work, diFabs joined by varying lengths of rigid double-stranded DNA (dsDNA) were considered. Anticipating future experiments to improve synthetic antibodies, we investigate whether linkers with different rigidities could enhance diFab potency by modeling DNA-Fabs containing different combinations of rigid dsDNA and flexible single-stranded DNA (ssDNA) and characterizing their neutralization potential. Model predictions suggest that while a long flexible polymer may be capable of bivalent binding, it exhibits weak neutralization due to the large loss in entropic degrees of freedom when both Fabs are bound. In contrast, the strongest neutralization potencies are predicted to require a rigid linker that optimally spans the distance between two Fab binding sites on an Env trimer, and avidity can be further boosted by incorporating more Fabs into these constructs. These results inform the design of multivalent anti-HIV-1 therapeutics that utilize avidity effects to remain potent against HIV-1 in the face of the rapid mutation of Env spikes.

preprint2019arXiv

The Energetics of Molecular Adaptation in Transcriptional Regulation

Mutation is a critical mechanism by which evolution explores the functional landscape of proteins. Despite our ability to experimentally inflict mutations at will, it remains difficult to link sequence-level perturbations to systems-level responses. Here, we present a framework centered on measuring changes in the free energy of the system to link individual mutations in an allosteric transcriptional repressor to the parameters which govern its response. We find the energetic effects of the mutations can be categorized into several classes which have characteristic curves as a function of the inducer concentration. We experimentally test these diagnostic predictions using the well-characterized LacI repressor of Escherichia coli, probing several mutations in the DNA binding and inducer binding domains. We find that the change in gene expression due to a point mutation can be captured by modifying only a subset of the model parameters that describe the respective domain of the wild-type protein. These parameters appear to be insulated, with mutations in the DNA binding domain altering only the DNA affinity and those in the inducer binding domain altering only the allosteric parameters. Changing these subsets of parameters tunes the free energy of the system in a way that is concordant with theoretical expectations. Finally, we show that the induction profiles and resulting free energies associated with pairwise double mutants can be predicted with quantitative accuracy given knowledge of the single mutants, providing an avenue for identifying and quantifying epistatic interactions.

preprint2017arXiv

The Energetic Cost of Building a Virus

Viruses are incapable of autonomous energy production. Although many experimental studies make it clear that viruses are parasitic entities that hijack the host's molecular resources, a detailed estimate for the energetic cost of viral synthesis is largely lacking. To quantify the energetic cost of viruses to their hosts, we enumerated the costs associated with two very distinct but representative DNA and RNA viruses, namely T4 and influenza. We found that for these viruses, translation of viral proteins is the most energetically expensive process. Interestingly, the cost of building a T4 phage and a single influenza virus are nearly the same. Due to influenza's higher burst size, however, the overall cost of a T4 phage infection is only 2-3% of the cost of an influenza infection. The costs of these infections relative to their host's estimated energy budget during the infection reveal that a T4 infection consumes about a third of its host's energy budget, where as an influenza infection consumes only 1%. Building on our estimates for T4, we show how the energetic costs of double-stranded DNA viruses scale with virus size, revealing that the dominant cost of building a virus can switch from translation to genome replication above a critical virus size. Lastly, using our predictions for the energetic cost of viruses, we provide estimates for the strengths of selection and genetic drift acting on newly incorporated genetic elements in viral genomes, under conditions of energy limitation.