Source author record

Alpha A. Lee

Alpha A. Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.soft Machine Learning physics.comp-ph cond-mat.mtrl-sci cond-mat.stat-mech physics.chem-ph Biological Physics cond-mat.dis-nn

Catalog footprint

What is connected

13works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Achieving Robustness to Aleatoric Uncertainty with Heteroscedastic Bayesian Optimisation

Bayesian optimisation is a sample-efficient search methodology that holds great promise for accelerating drug and materials discovery programs. A frequently-overlooked modelling consideration in Bayesian optimisation strategies however, is the representation of heteroscedastic aleatoric uncertainty. In many practical applications it is desirable to identify inputs with low aleatoric noise, an example of which might be a material composition which consistently displays robust properties in response to a noisy fabrication process. In this paper, we propose a heteroscedastic Bayesian optimisation scheme capable of representing and minimising aleatoric noise across the input space. Our scheme employs a heteroscedastic Gaussian process (GP) surrogate model in conjunction with two straightforward adaptations of existing acquisition functions. First, we extend the augmented expected improvement (AEI) heuristic to the heteroscedastic setting and second, we introduce the aleatoric noise-penalised expected improvement (ANPEI) heuristic. Both methodologies are capable of penalising aleatoric noise in the suggestions and yield improved performance relative to homoscedastic Bayesian optimisation and random sampling on toy problems as well as on two real-world scientific datasets. Code is available at: \url{https://github.com/Ryan-Rhys/Heteroscedastic-BO}

preprint2022arXiv

Inferring global dynamics from local structure in liquid electrolytes

Ion transport in concentrated electrolytes plays a fundamental role in electrochemical systems such as lithium ion batteries. Nonetheless, the mechanism of transport amid strong ion-ion interactions remains enigmatic. A key question is whether the dynamics of ion transport can be predicted by the local static structure alone, and if so what are the key structural motifs that determine transport. In this paper, we show that machine learning can successfully decompose global conductivity into the spatio-temporal average of local, instantaneous ionic contributions, and relate this ``local molar conductivity" field to the local ionic environment. Our machine learning model accurately predicts the molar conductivity of electrolyte systems that were not part of the training set, suggesting that the dynamics of ion transport is predictable from local static structure. Further, through analysing this machine-learned local conductivity field, we observe that fluctuations in local conductivity at high concentration are negatively correlated with total molar conductivity. Surprisingly, these fluctuations arise due to a long tail distribution of low conductivity ions, rather than distinct ion pairs, and are spatially correlated through both like- and unlike-charge interactions. More broadly, our approach shows how machine learning can aid the understanding of complex soft matter systems, by learning a function that attributes global collective properties to local, atomistic contributions.

preprint2022arXiv

Rapid Discovery of Stable Materials by Coordinate-free Coarse Graining

A fundamental challenge in materials science pertains to elucidating the relationship between stoichiometry, stability, structure, and property. Recent advances have shown that machine learning can be used to learn such relationships, allowing the stability and functional properties of materials to be accurately predicted. However, most of these approaches use atomic coordinates as input and are thus bottle-necked by crystal structure identification when investigating novel materials. Our approach solves this bottleneck by coarse-graining the infinite search space of atomic coordinates into a combinatorially enumerable search space. The key idea is to use Wyckoff representations -- coordinate-free sets of symmetry-related positions in a crystal -- as the input to a machine learning model. Our model demonstrates exceptionally high precision in discovering new theoretically stable materials, identifying 1,569 materials that lie below the known convex hull of previously calculated materials from just 5,675 ab-initio calculations. Our approach opens up fundamental advances in computational materials discovery.

preprint2021arXiv

Machine learnt approximations to the bridge function yield improved closures for the Ornstein-Zernike equation

A key challenge for soft materials design and coarse-graining simulations is determining interaction potentials between components that give rise to desired condensed-phase structures. In theory, the Ornstein-Zernike equation provides an elegant framework for solving this inverse problem. Pioneering work in liquid state theory derived analytical closures for the framework. However, these analytical closures are approximations, valid only for specific classes of interaction potentials. In this work, we combine the physics of liquid state theory with machine learning to infer a closure directly from simulation data. The resulting closure is more accurate than commonly used closures across a broad range of interaction potentials. We show for two examples of a prototypical inverse design problem, fitting a coarse-grained simulation potential, that our approach leads to improved one-step inversion.

preprint2020arXiv

Materials Graph Transformer predicts the outcomes of inorganic reactions with reliable uncertainties

A common bottleneck for materials discovery is synthesis. While recent methodological advances have resulted in major improvements in the ability to predicatively design novel materials, researchers often still rely on trial-and-error approaches for determining synthesis procedures. In this work, we develop a model that predicts the major product of solid-state reactions. The cardinal feature of this approach is the construction of fixed-length, learned representations of reactions. Precursors are represented as nodes on a `reaction graph', and message-passing operations between nodes are used to embody the interactions between precursors in the reaction mixture. Through an ablation study, it is shown that this framework not only outperforms less physically-motivated baseline methods but also more reliably assesses the uncertainty in its predictions.

preprint2020arXiv

Predicting materials properties without crystal structure: Deep representation learning from stoichiometry

Machine learning has the potential to accelerate materials discovery by accurately predicting materials properties at a low computational cost. However, the model inputs remain a key stumbling block. Current methods typically use descriptors constructed from knowledge of either the full crystal structure -- therefore only applicable to materials with already characterised structures -- or structure-agnostic fixed-length representations hand-engineered from the stoichiometry. We develop a machine learning approach that takes only the stoichiometry as input and automatically learns appropriate and systematically improvable descriptors from data. Our key insight is to treat the stoichiometric formula as a dense weighted graph between elements. Compared to the state of the art for structure-agnostic methods, our approach achieves lower errors with less data.

preprint2019arXiv

Validating the Validation: Reanalyzing a large-scale comparison of Deep Learning and Machine Learning models for bioactivity prediction

Machine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity prediction and arrive at a somewhat different conclusion. We show that the performance of support vector machines is competitive with that of deep learning methods. Additionally, using a series of numerical experiments, we question the relevance of area under the receiver operating characteristic curve as a metric in virtual screening, and instead suggest that area under the precision-recall curve should be used in conjunction with the receiver operating characteristic. Our numerical experiments also highlight challenges in estimating the uncertainty in model performance via scaffold-split nested cross validation.

preprint2018arXiv

Geometry of energy landscapes and the optimizability of deep neural networks

Deep neural networks are workhorse models in machine learning with multiple layers of non-linear functions composed in series. Their loss function is highly non-convex, yet empirically even gradient descent minimisation is sufficient to arrive at accurate and predictive models. It is hitherto unknown why are deep neural networks easily optimizable. We analyze the energy landscape of a spin glass model of deep neural networks using random matrix theory and algebraic geometry. We analytically show that the multilayered structure holds the key to optimizability: Fixing the number of parameters and increasing network depth, the number of stationary points in the loss function decreases, minima become more clustered in parameter space, and the tradeoff between the depth and width of minima becomes less severe. Our analytical results are numerically verified through comparison with neural networks trained on a set of classical benchmark datasets. Our model uncovers generic design principles of machine learning models.

preprint2017arXiv

Fluctuation Spectra and Force Generation in Non-equilibrium Systems

Many biological systems are appropriately viewed as passive inclusions immersed in an active bath: from proteins on active membranes to microscopic swimmers confined by boundaries. The non-equilibrium forces exerted by the active bath on the inclusions or boundaries often regulate function, and such forces may also be exploited in artificial active materials. Nonetheless, the general phenomenology of these active forces remains elusive. We show that the fluctuation spectrum of the active medium, the partitioning of energy as a function of wavenumber, controls the phenomenology of force generation. We find that for a narrow, unimodal spectrum, the force exerted by a non-equilibrium system on two embedded walls depends on the width and the position of the peak in the fluctuation spectrum, and oscillates between repulsion and attraction as a function of wall separation. We examine two apparently disparate examples: the Maritime Casimir effect and recent simulations of active Brownian particles. A key implication of our work is that important non-equilibrium interactions are encoded within the fluctuation spectrum. In this sense the noise becomes the signal.

preprint2016arXiv

The Electrostatic Screening Length in Concentrated Electrolytes Increases with Concentration

According to classical electrolyte theories interactions in dilute (low ion density) electrolytes decay exponentially with distance, with the Debye screening length the characteristic length-scale. This decay length decreases monotonically with increasing ion concentration, due to effective screening of charges over short distances. Thus within the Debye model no long-range forces are expected in concentrated electrolytes. Here we reveal, using experimental detection of the interaction between two planar charged surfaces across a wide range of electrolytes, that beyond the dilute (Debye-Huuckel) regime the screening length increases with increasing concentration. The screening lengths for all electrolytes studied - including aqueous NaCl solutions, ionic liquids diluted with propylene carbonate, and pure ionic liquids - collapse onto a single curve when scaled by the dielectric constant. This non-monotonic variation of the screening length with concentration, and its generality across ionic liquids and aqueous salt solutions, demonstrates an important characteristic of concentrated electrolytes of substantial relevance from biology to energy storage.

preprint2015arXiv

Dynamics of Ion Transport in Ionic Liquids

A gap in understanding the link between continuum theories of ion transport in ionic liquids and the underlying microscopic dynamics has hindered the development of frameworks for transport phenomena in these concentrated electrolytes. Here, we construct a continuum theory for ion transport in ionic liquids by coarse graining a simple exclusion process of interacting particles on a lattice. The resulting dynamical equations can be written as a gradient flow with a mobility matrix that vanishes at high densities. This form of the mobility matrix gives rise to a charging behaviour that is different to the one known for electrolytic solutions, but which agrees qualitatively with the phenomenology observed in experiments and simulations.

preprint2015arXiv

The role of extensibility in the birth of a ruck in a rug

Everyday experience suggests that a `ruck' forms when the two ends of a heavy carpet or rug are brought closer together. Classical analysis, however, shows that the horizontal compressive force needed to create such a ruck should be infinite. We show that this apparent paradox is due to the assumption of inextensibility of the rug. By accounting for a finite extensibility, we show that rucks appear with a finite, non-zero end-shortening and confirm our theoretical results with simple experiments. Finally, we note that the appropriate measure of extensibility, the stretchability, is in this case not determined purely by geometry, but incorporates the mechanics of the sheet.

preprint2012arXiv

Electroactuation with Single Charge Carrier Ionomers

A simple theory of electromechanical transduction for single-charge-carrier double-layer electroactuators is developed, in which the ion distribution and curvature are mutually coupled. The obtained expressions for the dependence of curvature and charge accumulation on the applied voltage, as well as the electroactuation dynamics, are compared with literature data. The mechanical- or sensor- performance of such electroactuators appears to be determined by just three cumulative parameters, with all of their constituents measurable, permitting a scaling approach to their design.

Alpha A. Lee

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Achieving Robustness to Aleatoric Uncertainty with Heteroscedastic Bayesian Optimisation

Inferring global dynamics from local structure in liquid electrolytes

Rapid Discovery of Stable Materials by Coordinate-free Coarse Graining

Machine learnt approximations to the bridge function yield improved closures for the Ornstein-Zernike equation

Materials Graph Transformer predicts the outcomes of inorganic reactions with reliable uncertainties

Predicting materials properties without crystal structure: Deep representation learning from stoichiometry

Validating the Validation: Reanalyzing a large-scale comparison of Deep Learning and Machine Learning models for bioactivity prediction

Geometry of energy landscapes and the optimizability of deep neural networks

Fluctuation Spectra and Force Generation in Non-equilibrium Systems

The Electrostatic Screening Length in Concentrated Electrolytes Increases with Concentration

Dynamics of Ion Transport in Ionic Liquids

The role of extensibility in the birth of a ruck in a rug

Electroactuation with Single Charge Carrier Ionomers