Source author record

Aidan P. Thompson

Aidan P. Thompson appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: physics.comp-ph; cond-mat.mtrl-sci; Computational Engineering, Finance, and Science; Machine Learning; physics.atom-ph; physics.chem-ph

Catalog footprint

What is connected

6 works

6 topics

4 close collaborators


Preprint, 2022, arXiv

Training Data Selection for Accuracy and Transferability of Interatomic Potentials

Advances in machine learning (ML) techniques have enabled the development of interatomic potentials that promise both the accuracy of first principles methods and the low-cost, linear scaling, and parallel efficiency of empirical potentials. Despite rapid progress in the last few years, ML-based potentials often struggle to achieve transferability, that is, to provide consistent accuracy across configurations that significantly differ from those used to train the model. In order to truly realize the promise of ML-based interatomic potentials, it is therefore imperative to develop systematic and scalable approaches for the generation of diverse training sets that ensure broad coverage of the space of atomic environments. This work explores a diverse-by-construction approach that leverages the optimization of the entropy of atomic descriptors to create a very large ($>2\cdot10^{5}$ configurations, $>7\cdot10^{6}$ atomic environments) training set for tungsten in an automated manner, i.e., without any human intervention. This dataset is used to train polynomial as well as multiple neural network potentials with different architectures. For comparison, a corresponding family of potentials was also trained on an expert-curated dataset for tungsten. The models trained on entropy-optimized data exhibited vastly superior transferability compared to the expert-curated models. Furthermore, while the models trained with heavy user input (i.e., domain expertise) yield the lowest errors when tested on similar configurations, out-of-sample predictions are dramatically more robust when the models are trained on a deliberately diverse set of training data. Herein we demonstrate the development of both accurate and transferable ML potentials using automated and data-driven approaches for generating large and diverse training sets.
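As a rough illustration of what "entropy of atomic descriptors" can mean in practice, the sketch below scores a set of descriptor vectors by the mean log nearest-neighbor distance. Up to an additive constant that depends only on the number of points and the dimension, this matches a Kozachenko-Leonenko style entropy estimate. The estimator, the function name `nn_entropy_proxy`, and the random toy descriptors are all assumptions made for illustration; the abstract does not state the objective that was actually optimized.

```python
import numpy as np

def nn_entropy_proxy(descriptors):
    """Entropy proxy for a point cloud of descriptor vectors.

    Returns d * mean(log r_i), where r_i is the distance from point i to its
    nearest neighbour and d is the descriptor dimension. Larger values mean
    the points are more spread out in descriptor space.
    """
    x = np.asarray(descriptors, dtype=float)
    n, d = x.shape
    diff = x[:, None, :] - x[None, :, :]          # pairwise differences
    dist = np.sqrt((diff ** 2).sum(axis=-1))      # pairwise distances
    np.fill_diagonal(dist, np.inf)                # ignore self-distances
    nearest = dist.min(axis=1)
    return d * np.mean(np.log(nearest + 1e-12))   # small offset avoids log(0)

# Hypothetical toy data: a tightly clustered set versus a spread-out set.
rng = np.random.default_rng(0)
clustered = rng.normal(scale=0.05, size=(200, 3))
diverse = rng.uniform(-1.0, 1.0, size=(200, 3))

score_clustered = nn_entropy_proxy(clustered)
score_diverse = nn_entropy_proxy(diverse)
```

In this toy setting the spread-out set receives the higher score, which is the property a diversity-seeking sampler would try to increase.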

Preprint, 2020, arXiv

Explicit Multi-element Extension of the Spectral Neighbor Analysis Potential for Chemically Complex Systems

A natural extension of the descriptors used in the Spectral Neighbor Analysis Potential (SNAP) method is derived to treat atomic interactions in chemically complex systems. Atomic environment descriptors within SNAP are obtained from a basis function expansion of the weighted density of neighboring atoms. This new formulation instead partitions the neighbor density into partial densities for each chemical element, thus leading to explicit multi-element descriptors. For $N_{elem}$ chemical elements, the number of descriptors increases as $\mathcal{O}(N_{elem}^3)$, while the computational cost of the force calculation as implemented in LAMMPS is limited to $\mathcal{O}(N_{elem}^2)$ and the favorable linear scaling in the number of atoms is retained. We demonstrate these chemically aware descriptors by producing an interatomic potential for indium phosphide capable of capturing high-energy defects that result from radiation damage cascades. This new explicit multi-element SNAP method reproduces the relaxed defect formation energies with substantially greater accuracy than weighted-density SNAP, while retaining accurate representation of the bulk indium phosphide properties.
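To make the $\mathcal{O}(N_{elem}^3)$ growth concrete, the sketch below counts bispectrum components for a given band limit and multiplies by $N_{elem}^3$. The index enumeration (triples `j1, j2, j` in doubled-index units with `j2 <= j1 <= j`) follows the convention commonly used for SNAP and is an assumption here, as are the function names and the `twojmax` parameter; the abstract itself only states the cubic scaling.

```python
def num_bispectrum(twojmax):
    """Count single-element bispectrum components for band limit twojmax.

    Indices are in doubled units (2j), and only triples with
    j2 <= j1 <= j that satisfy the triangle rule are counted.
    """
    count = 0
    for j1 in range(twojmax + 1):
        for j2 in range(j1 + 1):
            # triangle rule: |j1 - j2| <= j <= j1 + j2, stepping by 2
            for j in range(j1 - j2, min(twojmax, j1 + j2) + 1, 2):
                if j >= j1:
                    count += 1
    return count

def num_multielement_descriptors(twojmax, n_elem):
    """Descriptor count when every component is resolved by element triple."""
    return num_bispectrum(twojmax) * n_elem ** 3

# Growth with the number of elements at a fixed, illustrative band limit.
counts = {n: num_multielement_descriptors(6, n) for n in (1, 2, 3, 4)}
```

For a two-element system such as indium phosphide, the descriptor count under these assumptions is eight times the single-element count at the same band limit.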

Preprint, 2020, arXiv

Multi-fidelity machine-learning with uncertainty quantification and Bayesian optimization for materials design: Application to ternary random alloys

We present a scale-bridging approach based on a multi-fidelity (MF) machine-learning (ML) framework leveraging Gaussian processes (GP) to fuse atomistic computational model predictions across multiple levels of fidelity. Through the posterior variance of the MFGP, our framework naturally enables uncertainty quantification, providing estimates of confidence in the predictions. Density Functional Theory serves as the high-fidelity prediction, while an ML interatomic potential serves as the low-fidelity prediction. Practical materials design efficiency is demonstrated by reproducing the ternary composition dependence of a quantity of interest (bulk modulus) across the full aluminum-niobium-titanium ternary random alloy composition space. The MFGP is then coupled to a Bayesian optimization procedure and the computational efficiency of this approach is demonstrated by performing an on-the-fly search for the global optimum of bulk modulus in the ternary composition space. The framework presented in this manuscript is the first application of MFGP to atomistic materials simulations fusing predictions between Density Functional Theory and classical interatomic potential calculations.
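The idea of fusing two fidelities with Gaussian processes can be sketched with a simple two-step scheme: fit a GP to plentiful low-fidelity data, then model the high-fidelity data as a scaled low-fidelity prediction plus a GP-modeled correction. This is a generic autoregressive construction assumed for illustration, not the formulation used in the paper. The kernel, the toy functions `f_lo` and `f_hi`, and all names below are hypothetical.

```python
import numpy as np

def rbf(a, b, length=0.25):
    """Squared-exponential kernel between 1-D input arrays a and b."""
    return np.exp(-0.5 * (a[:, None] - b[None, :]) ** 2 / length ** 2)

def gp_predict(x_train, y_train, x_test, jitter=1e-6):
    """Zero-mean GP regression: posterior mean and variance at x_test."""
    K = rbf(x_train, x_train) + jitter * np.eye(len(x_train))
    Ks = rbf(x_train, x_test)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    v = np.linalg.solve(L, Ks)
    mean = Ks.T @ alpha
    var = 1.0 - np.sum(v ** 2, axis=0)   # prior variance is 1 for this kernel
    return mean, np.maximum(var, 0.0)    # clip tiny negative round-off

def fuse(x_lo, y_lo, x_hi, y_hi, x_test):
    """Two-fidelity prediction assuming f_hi(x) ~ rho * f_lo(x) + delta(x)."""
    mu_lo_at_hi, _ = gp_predict(x_lo, y_lo, x_hi)
    # least-squares scale factor between the two fidelities
    rho = (mu_lo_at_hi @ y_hi) / (mu_lo_at_hi @ mu_lo_at_hi)
    mu_lo, var_lo = gp_predict(x_lo, y_lo, x_test)
    mu_d, var_d = gp_predict(x_hi, y_hi - rho * mu_lo_at_hi, x_test)
    # variances are added as if the two GPs were independent (a simplification)
    return rho * mu_lo + mu_d, rho ** 2 * var_lo + var_d

# Hypothetical toy functions standing in for the cheap and expensive models.
def f_lo(x):
    return np.sin(2 * np.pi * x)

def f_hi(x):
    return 1.5 * np.sin(2 * np.pi * x) + 0.3 * x

x_lo = np.linspace(0.0, 1.0, 25)   # many cheap evaluations
x_hi = np.linspace(0.0, 1.0, 6)    # few expensive evaluations
x_test = np.linspace(0.0, 1.0, 101)
mean, var = fuse(x_lo, f_lo(x_lo), x_hi, f_hi(x_hi), x_test)
```

The returned variance is the quantity that an acquisition function in Bayesian optimization would use to balance exploring uncertain compositions against exploiting promising ones.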

Preprint, 2020, arXiv

Simple and efficient algorithms for training machine learning potentials to force data

Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be expensive to obtain. A quantum simulation often provides all atomic forces, in addition to the total energy of the system. These forces provide much more information than the energy alone. It may appear that training a model to this large quantity of force data would introduce significant computational costs. Actually, training to all available force data should only be a few times more expensive than training to energies alone. Here, we present a new algorithm for efficient force training, and benchmark its accuracy by training to forces from real-world datasets for organic chemistry and bulk aluminum.

Preprint, 2019, arXiv

A Performance and Cost Assessment of Machine Learning Interatomic Potentials

Machine learning of the quantitative relationship between local environment descriptors and the potential energy surface of a system of atoms has emerged as a new frontier in the development of interatomic potentials (IAPs). Here, we present a comprehensive evaluation of ML-IAPs based on four local environment descriptors, namely Behler-Parrinello symmetry functions, smooth overlap of atomic positions (SOAP), the Spectral Neighbor Analysis Potential (SNAP) bispectrum components, and moment tensors, using a diverse data set generated using high-throughput density functional theory (DFT) calculations. The data set, comprising bcc (Li, Mo) and fcc (Cu, Ni) metals and diamond group IV semiconductors (Si, Ge), is chosen to span a range of crystal structures and bonding. All descriptors studied show excellent performance in predicting energies and forces, far surpassing that of classical IAPs, as well as predicting properties such as elastic constants and phonon dispersion curves. We observe a general trade-off between accuracy and the degrees of freedom of each model, and consequently computational cost. We will discuss these trade-offs in the context of model selection for molecular dynamics and other applications.

Preprint, 2014, arXiv

A Spectral Analysis Method for Automated Generation of Quantum-Accurate Interatomic Potentials

We present a new interatomic potential for solids and liquids called Spectral Neighbor Analysis Potential (SNAP). The SNAP potential has a very general form and uses machine-learning techniques to reproduce the energies, forces, and stress tensors of a large set of small configurations of atoms, which are obtained using high-accuracy quantum electronic structure (QM) calculations. The local environment of each atom is characterized by a set of bispectrum components of the local neighbor density projected onto a basis of hyperspherical harmonics in four dimensions. The bispectrum components are the same bond-orientational order parameters employed by the GAP potential [arXiv:0910.1019]. The SNAP potential, unlike GAP, assumes a linear relationship between atom energy and bispectrum components. The linear SNAP coefficients are determined using weighted least-squares linear regression against the full QM training set. This allows the SNAP potential to be fit in a robust, automated manner to large QM data sets using many bispectrum coefficients.
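Because the atom energy is assumed linear in the descriptors, the total energy of a configuration is linear in the coefficients, and the fit reduces to weighted least squares. The sketch below shows that reduction for energies only, using random numbers in place of real bispectrum components. The array shapes, weights, and the helper `fit_linear_potential` are illustrative assumptions, and force and stress rows (which are also linear in the coefficients) are omitted for brevity.

```python
import numpy as np

def fit_linear_potential(A, y, w):
    """Weighted least squares: minimise sum_i w_i * (A_i . beta - y_i)^2.

    Scaling each row by sqrt(w_i) turns the weighted problem into an
    ordinary least-squares problem.
    """
    sw = np.sqrt(w)
    beta, *_ = np.linalg.lstsq(A * sw[:, None], y * sw, rcond=None)
    return beta

rng = np.random.default_rng(0)
n_configs, n_atoms, n_desc = 40, 8, 5

# Hypothetical per-atom descriptors standing in for bispectrum components.
per_atom = rng.normal(size=(n_configs, n_atoms, n_desc))

# Total energy is the sum of atom energies, so each design-matrix row is
# the sum of that configuration's per-atom descriptor vectors.
A = per_atom.sum(axis=1)

beta_true = np.array([0.5, -1.0, 2.0, 0.0, 0.25])   # made-up coefficients
y = A @ beta_true                                    # synthetic "QM" energies
w = rng.uniform(0.5, 2.0, size=n_configs)            # made-up training weights

beta_fit = fit_linear_potential(A, y, w)
```

With noise-free synthetic energies the fitted coefficients reproduce `beta_true`; with real QM data the weights control how strongly each configuration or data type influences the fit.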
