Researcher profile

Stephen Whitelam

Stephen Whitelam contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Nonlinear thermodynamic computing out of equilibrium

We present the design for a thermodynamic computer that can perform arbitrary nonlinear calculations in or out of equilibrium. Simple thermodynamic circuits, fluctuating degrees of freedom in contact with a thermal bath and confined by a quartic potential, display an activity that is a nonlinear function of their input. Such circuits can therefore be regarded as thermodynamic neurons, and can serve as the building blocks of networked structures that act as thermodynamic neural networks, universal function approximators whose operation is powered by thermal fluctuations. We simulate a digital model of a thermodynamic neural network, and show that its parameters can be adjusted by genetic algorithm to perform nonlinear calculations at specified observation times, regardless of whether the system has attained thermal equilibrium. This work expands the field of thermodynamic computing beyond the regime of thermal equilibrium, enabling fully nonlinear computations, analogous to those performed by classical neural networks, at specified observation times.

preprint2022arXiv

Learning stochastic dynamics and predicting emergent behavior using transformers

We show that a neural network originally designed for language processing can learn the dynamical rules of a stochastic system by observation of a single dynamical trajectory of the system, and can accurately predict its emergent behavior under conditions not observed during training. We consider a lattice model of active matter undergoing continuous-time Monte Carlo dynamics, simulated at a density at which its steady state comprises small, dispersed clusters. We train a neural network called a transformer on a single trajectory of the model. The transformer, which we show has the capacity to represent dynamical rules that are numerous and nonlocal, learns that the dynamics of this model consists of a small number of processes. Forward-propagated trajectories of the trained transformer, at densities not encountered during training, exhibit motility-induced phase separation and so predict the existence of a nonequilibrium phase transition. Transformers have the flexibility to learn dynamical rules from observation without explicit enumeration of rates or coarse-graining of configuration space, and so the procedure used here can be applied to a wide range of physical systems, including those with large and complex dynamical generators.

preprint2022arXiv

Training neural networks using Metropolis Monte Carlo and an adaptive variant

We examine the zero-temperature Metropolis Monte Carlo algorithm as a tool for training a neural network by minimizing a loss function. We find that, as expected on theoretical grounds and shown empirically by other authors, Metropolis Monte Carlo can train a neural net with an accuracy comparable to that of gradient descent, if not necessarily as quickly. The Metropolis algorithm does not fail automatically when the number of parameters of a neural network is large. It can fail when a neural network's structure or neuron activations are strongly heterogenous, and we introduce an adaptive Monte Carlo algorithm, aMC, to overcome these limitations. The intrinsic stochasticity and numerical stability of the Monte Carlo method allow aMC to train deep neural networks and recurrent neural networks in which the gradient is too small or too large to allow training by gradient descent. Monte Carlo methods offer a complement to gradient-based methods for training neural networks, allowing access to a distinct set of network architectures and principles.

preprint2021arXiv

Correspondence between neuroevolution and gradient descent

We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerical simulation to show that this correspondence can be observed for finite mutations,for shallow and deep neural networks. Our results provide a connection between two families of neural-network training methods that are usually considered to be fundamentally different.

preprint2021arXiv

Improving the accuracy of nearest-neighbor classification using principled construction and stochastic sampling of training-set centroids

A conceptually simple way to classify images is to directly compare test-set data and training-set data. The accuracy of this approach is limited by the method of comparison used, and by the extent to which the training-set data cover configuration space. Here we show that this coverage can be substantially increased using coarse graining (replacing groups of images by their centroids) and stochastic sampling (using distinct sets of centroids in combination). We use the MNIST and Fashion-MNIST data sets to show that a principled coarse-graining algorithm can convert training images into fewer image centroids without loss of accuracy of classification of test-set images by nearest-neighbor classification. Distinct batches of centroids can be used in combination as a means of stochastically sampling configuration space, and can classify test-set data more accurately than can the unaltered training set. On the MNIST and Fashion-MNIST data sets this approach converts nearest-neighbor classification from a mid-ranking- to an upper-ranking member of the set of classical machine-learning techniques.

preprint2020arXiv

Evolutionary reinforcement learning of dynamical large deviations

We show how to calculate the likelihood of dynamical large deviations using evolutionary reinforcement learning. An agent, a stochastic model, propagates a continuous-time Monte Carlo trajectory and receives a reward conditioned upon the values of certain path-extensive quantities. Evolution produces progressively fitter agents, eventually allowing the calculation of a piece of a large-deviation rate function for a particular model and path-extensive quantity. For models with small state spaces the evolutionary process acts directly on rates, and for models with large state spaces the process acts on the weights of a neural network that parameterizes the model's rates. This approach shows how path-extensive physics problems can be considered within a framework widely used in machine learning.

preprint2020arXiv

Learning to grow: control of material self-assembly using evolutionary reinforcement learning

We show that neural networks trained by evolutionary reinforcement learning can enact efficient molecular self-assembly protocols. Presented with molecular simulation trajectories, networks learn to change temperature and chemical potential in order to promote the assembly of desired structures or choose between competing polymorphs. In the first case, networks reproduce in a qualitative sense the results of previously-known protocols, but faster and with higher fidelity; in the second case they identify strategies previously unknown, from which we can extract physical insight. Networks that take as input the elapsed time of the simulation or microscopic information from the system are both effective, the latter more so. The evolutionary scheme we have used is simple to implement and can be applied to a broad range of examples of experimental self-assembly, whether or not one can monitor the experiment as it proceeds. Our results have been achieved with no human input beyond the specification of which order parameter to promote, pointing the way to the design of synthesis protocols by artificial intelligence.

preprint2020arXiv

Long-Range Exciton Diffusion in Two-Dimensional Assemblies of Cesium Lead Bromide Perovskite Nanocrystals

Förster Resonant Energy Transfer (FRET)-mediated exciton diffusion through artificial nanoscale building block assemblies could be used as a new optoelectronic design element to transport energy. However, so far nanocrystal (NC) systems supported only diffusion length of 30 nm, which are too small to be useful in devices. Here, we demonstrate a FRET-mediated exciton diffusion length of 200 nm with 0.5 cm2/s diffusivity through an ordered, two-dimensional assembly of cesium lead bromide perovskite nanocrystals (PNC). Exciton diffusion was directly measured via steady-state and time-resolved photoluminescence (PL) microscopy, with physical modeling providing deeper insight into the transport process. This exceptionally efficient exciton transport is facilitated by PNCs high PL quantum yield, large absorption cross-section, and high polarizability, together with minimal energetic and geometric disorder of the assembly. This FRET-mediated exciton diffusion length matches perovskites optical absorption depth, opening the possibility to design new optoelectronic device architectures with improved performances, and providing insight into the high conversion efficiencies of PNC-based optoelectronic devices.

preprint2010arXiv

Nonclassical assembly pathways of anisotropic particles

Advances in synthetic methods have spawned an array of nanoparticles and bio-inspired molecules of diverse shapes and interaction geometries. Recent experiments indicate that such anisotropic particles exhibit a variety of 'nonclassical' self-assembly pathways, forming ordered assemblies via intermediates that do not share the architecture of the bulk material. Here we apply mean field theory to a prototypical model of interacting anisotropic particles, and find a clear thermodynamic impetus for nonclassical ordering in certain regimes of parameter space. In other parameter regimes, by contrast, assembly pathways are selected by dynamics. This approach suggests a means of predicting when anisotropic particles might assemble in a manner more complicated than that assumed by classical nucleation theory.

preprint2010arXiv

Self-assembly of amphiphilic peanut-shaped nanoparticles

We use computer simulation to investigate the self-assembly of Janus-like amphiphilic peanut-shaped nanoparticles, finding phases of clusters, bilayers and micelles in accord with ideas of packing familiar from the study of molecular surfactants. However, packing arguments do not explain the hierarchical self-assembly dynamics that we observe, nor the coexistence of bilayers and faceted polyhedra. This coexistence suggests that experimental realizations of our model can achieve multipotent assembly of either of two competing ordered structures.