Researcher profile

Michele Ceriotti

Michele Ceriotti contributes to research discovery and scholarly infrastructure.

Researcher - Affiliation not imported - Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging - Verification L1 - Unclaimed author
19 works
0 followers
7 topics
4 close collaborators

Published work

19 published item(s)

preprint - 2026 - arXiv

A universal machine learning model for the electronic density of states

In the last few years several "universal" interatomic potentials have appeared, using machine-learning approaches to predict energy and forces of atomic configurations with arbitrary composition and structure, with an accuracy often comparable with that of the electronic-structure calculations they are trained on. Here we demonstrate that these generally-applicable models can also be built to predict explicitly the electronic structure of materials and molecules. We focus on the electronic density of states (DOS), and develop PET-MAD-DOS, a rotationally unconstrained transformer model built on the Point Edge Transformer (PET) architecture, and trained on the Massive Atomistic Diversity (MAD) dataset. We demonstrate our model's predictive abilities on samples from diverse external datasets, showing also that the DOS can be further manipulated to obtain accurate band gap predictions. A fast evaluation of the DOS is especially useful in combination with molecular simulations probing matter in finite-temperature thermodynamic conditions. To assess the accuracy of PET-MAD-DOS in this context, we evaluate the ensemble-averaged DOS and the electronic heat capacity of three technologically relevant systems: lithium thiophosphate (LPS), gallium arsenide (GaAs), and a high entropy alloy (HEA). By comparing with bespoke models, trained exclusively on system-specific datasets, we show that our universal model achieves semi-quantitative agreement for all these tasks. Furthermore, we demonstrate that fine-tuning can be performed using a small fraction of the bespoke data, yielding models that are comparable to, and sometimes better than, fully-trained bespoke models.

preprint - 2022 - arXiv

Beyond potentials: integrated machine-learning models for materials

Over the past decade inter-atomic potentials based on machine-learning (ML) techniques have become an indispensable tool in the atomic-scale modeling of materials. Trained on energies and forces obtained from electronic-structure calculations, they inherit their predictive accuracy, and extend greatly the length and time scales that are accessible to explicit atomistic simulations. Inexpensive predictions of the energetics of individual configurations have facilitated greatly the calculation of the thermodynamics of materials, including finite-temperature effects and disorder. More recently, machine-learning models have been closing the gap with first-principles calculations in another area: the prediction of arbitrarily complicated functional properties, from vibrational and optical spectroscopies to electronic excitations. The implementation of integrated machine-learning models, that combine energetic and functional predictions with statistical and dynamical sampling of atomic-scale properties is bringing the promise of predictive, uncompromising simulations of existing and novel materials closer to its full realisation.

preprint - 2022 - arXiv

Electronic-structure properties from atom-centered predictions of the electron density

The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multi-centered atomic basis analogous to that routinely used in density fitting approximations. However, the non-orthogonality of the basis poses challenges for the learning exercise, as it requires accounting for all the atomic density components at once. We devise a gradient-based approach to directly minimize the loss function of the regression problem in an optimized and highly sparse feature space. In so doing, we overcome the limitations associated with adopting an atom-centered model to learn the electron density over arbitrarily complex datasets, obtaining extremely accurate predictions. The enhanced framework is tested on 32-molecule periodic cells of liquid water, presenting enough complexity to require an optimal balance between accuracy and computational efficiency. We show that starting from the predicted density a single Kohn-Sham diagonalization step can be performed to access total energy components that carry an error of just 0.1 meV/atom with respect to the reference density functional calculations. Finally, we test our method on the highly heterogeneous QM9 benchmark dataset, showing that a small fraction of the training data is enough to derive ground-state total energies within chemical accuracy.

preprint - 2022 - arXiv

Modeling the Ga/As binary system across temperatures and compositions from first principles

Materials composed of elements from the third and fifth columns of the periodic table display a very rich behavior, with the phase diagram usually containing a metallic liquid phase and a polar semiconducting solid. As a consequence, it is very hard to achieve transferable empirical models of interactions between the atoms that can reliably predict their behavior across the temperature and composition range that is relevant to the study of the synthesis and properties of III/V nanostructures and devices. We present a machine-learning potential trained on density functional theory reference data that provides a general-purpose model for the Ga$_x$As$_{1-x}$ system. We provide a series of stringent tests that showcase the accuracy of the potential, and its applicability across the whole binary phase space, computing with ab initio accuracy a large number of finite-temperature properties as well as the location of phase boundaries. We also show how a committee model can be used to reliably determine the uncertainty induced by the limitations of the ML model on its predictions, to identify regions of phase space that are predicted with insufficient accuracy, and to iteratively refine the training set to achieve consistent, reliable modeling.
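
As a rough illustration of the committee idea mentioned in the abstract above, the following Python sketch takes predictions from several independently trained models and uses their spread as an uncertainty estimate. The function names, the toy numbers and the threshold are hypothetical and are not taken from the paper, whose actual uncertainty calibration may well differ.

```python
import numpy as np

def committee_prediction(member_predictions):
    """Return the committee mean and spread for each sample.

    member_predictions: array of shape (n_members, n_samples),
    one row per committee member (hypothetical layout).
    """
    preds = np.asarray(member_predictions, dtype=float)
    mean = preds.mean(axis=0)
    # ddof=1 gives the sample standard deviation across members
    spread = preds.std(axis=0, ddof=1)
    return mean, spread

def flag_uncertain(spread, threshold):
    """Indices of samples whose spread exceeds an assumed threshold."""
    return np.flatnonzero(np.asarray(spread) > threshold)

# Toy data: three committee members, three structures.
preds = np.array([[1.0, 2.0, 3.0],
                  [1.1, 2.0, 5.0],
                  [0.9, 2.0, 1.0]])
mean, spread = committee_prediction(preds)
uncertain = flag_uncertain(spread, threshold=0.5)
```

In an active-learning loop of the kind the abstract hints at, the flagged structures would be candidates for new reference calculations before retraining.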

preprint - 2022 - arXiv

Optimal radial basis for density-based atomic representations

The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort has been dedicated to the optimization of the basis set, typically driven by heuristic considerations on the behavior of the regression target. Here we take a different, unsupervised viewpoint, aiming to determine the basis that encodes in the most compact way possible the structural information that is relevant for the dataset at hand. For each training dataset and number of basis functions, one can determine a unique basis that is optimal in this sense, and can be computed at no additional cost with respect to the primitive basis by approximating it with splines. We demonstrate that this construction yields representations that are accurate and computationally efficient, particularly when constructing representations that correspond to high-body order correlations. We present examples that involve both molecular and condensed-phase machine-learning models.

preprint - 2022 - arXiv

The importance of nuclear quantum effects for NMR crystallography

The resolving power of solid-state nuclear magnetic resonance (NMR) crystallography depends heavily on the accuracy of computational predictions of NMR chemical shieldings of candidate structures, which are usually taken to be local minima in the potential energy. To test the limits of this approximation, we systematically study the importance of finite-temperature and quantum nuclear fluctuations for $^1$H, $^{13}$C, and $^{15}$N shieldings in polymorphs of three paradigmatic molecular crystals -- benzene, glycine, and succinic acid. The effect of quantum fluctuations is comparable to the typical errors of shielding predictions for static nuclei with respect to experiments, and their inclusion improves the agreement with measurements, translating to more reliable assignment of the NMR spectra to the correct candidate structure. The use of integrated machine-learning models, trained on first-principles energies and shieldings, renders rigorous sampling of nuclear fluctuations affordable, setting a new standard for the calculations underlying NMR structure determinations.

preprint - 2022 - arXiv

Unified theory of atom-centered representations and message-passing machine-learning schemes

Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks in this class can be understood in terms of atom-centered density correlations (ACDC), that are used as a basis for a body-ordered, symmetry-adapted expansion of the targets. Several other schemes, that gather information on the relationship between neighboring atoms using "message-passing" ideas, cannot be directly mapped to correlations centered around a single atom. We generalize the ACDC framework to include multi-centered information, generating representations that provide a complete linear basis to regress symmetric functions of atomic coordinates, and provide a coherent foundation to systematize our understanding of both atom-centered and message-passing, invariant and equivariant machine-learning schemes.

preprint - 2020 - arXiv

Comparing molecules and solids across structural and alchemical space

Evaluating the (dis)similarity of crystalline, disordered and molecular compounds is a critical step in the development of algorithms to navigate automatically the configuration space of complex materials. For instance, a structural similarity metric is crucial for classifying structures, searching chemical space for better compounds and materials, and driving the next generation of machine-learning techniques for predicting the stability and properties of molecules and materials. In the last few years several strategies have been designed to compare atomic coordination environments. In particular, the Smooth Overlap of Atomic Positions (SOAP) has emerged as an elegant framework to obtain translation, rotation and permutation-invariant descriptors of groups of atoms, driven by the design of various classes of machine-learned inter-atomic potentials. Here we discuss how one can combine such local descriptors using a Regularized Entropy Match (REMatch) approach to describe the similarity of both whole molecular and bulk periodic structures, introducing powerful metrics that enable the navigation of alchemical and structural complexity within a unified framework. Furthermore, using this kernel and a ridge regression method we can predict atomization energies for a database of small organic molecules with a mean absolute error below 1 kcal/mol, reaching an important milestone in the application of machine-learning techniques to the evaluation of molecular properties.

preprint - 2020 - arXiv

Learning the electronic density of states in condensed matter

The electronic density of states (DOS) quantifies the distribution of the energy levels that can be occupied by electrons in a quasiparticle picture, and is central to modern electronic structure theory. It also underpins the computation and interpretation of experimentally observable material properties such as optical absorption and electrical conductivity. We discuss the challenges inherent in the construction of a machine-learning (ML) framework aimed at predicting the DOS as a combination of local contributions that depend in turn on the geometric configuration of neighbours around each atom, using quasiparticle energy levels from density functional theory as training data. We present a challenging case study that includes configurations of silicon spanning a broad set of thermodynamic conditions, ranging from bulk structures to clusters, and from semiconducting to metallic behavior. We compare different approaches to represent the DOS, and the accuracy of predicting quantities such as the Fermi level, the DOS at the Fermi level, or the band energy, either directly or as a side-product of the evaluation of the DOS. The performance of the model depends crucially on the smoothening of the DOS, and there is a tradeoff to be made between the systematic error associated with the smoothening and the error in the ML model for a specific structure. We demonstrate the usefulness of this approach by computing the density of states of a large amorphous silicon sample, for which it would be prohibitively expensive to compute the DOS by direct electronic structure calculations, and show how the atom-centred decomposition of the DOS that is obtained through our model can be used to extract physical insights into the connections between structural and electronic features.
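
To make the role of smoothening in the abstract above concrete, here is a minimal Python sketch that builds a DOS by placing a Gaussian on each energy level. The Gaussian form, the width `sigma`, the toy eigenvalues and the function name are all assumptions made for illustration; the abstract does not specify how the smoothening is done.

```python
import numpy as np

def smeared_dos(eigenvalues, energy_grid, sigma):
    """Gaussian-broadened DOS on a grid (illustrative, not the paper's code).

    Each energy level contributes a normalized Gaussian of width sigma,
    so the DOS integrates to the number of levels.
    """
    eig = np.asarray(eigenvalues, dtype=float)
    grid = np.asarray(energy_grid, dtype=float)
    diff = grid[:, None] - eig[None, :]  # shape (n_grid, n_levels)
    gauss = np.exp(-0.5 * (diff / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))
    return gauss.sum(axis=1)

# Toy spectrum with three levels (arbitrary energy units).
eigenvalues = np.array([-1.0, 0.0, 0.5])
grid = np.linspace(-6.0, 6.0, 2401)
dos = smeared_dos(eigenvalues, grid, sigma=0.2)

# Simple Riemann sum: should recover the number of levels.
n_states = np.sum(dos) * (grid[1] - grid[0])
```

A larger `sigma` gives a smoother curve that is presumably easier to regress but blurs sharp features such as band edges, which is one way to picture the tradeoff the abstract describes.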

preprint - 2020 - arXiv

Machine learning force fields and coarse-grained variables in molecular dynamics: application to materials and biological systems

Machine learning encompasses a set of tools and algorithms which are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals, benefits, and limitations of machine learning techniques for computational studies on atomistic systems, focusing on the construction of empirical force fields from ab-initio databases and the determination of reaction coordinates for free energy computation and enhanced sampling.

preprint - 2020 - arXiv

Multi-scale approach for the prediction of atomic scale properties

Electronic nearsightedness is one of the fundamental principles governing the behavior of condensed matter and supporting its description in terms of local entities such as chemical bonds. Locality also underlies the tremendous success of machine-learning schemes that predict quantum mechanical observables -- such as the cohesive energy, the electron density, or a variety of response properties -- as a sum of atom-centred contributions, based on a short-range representation of atomic environments. One of the main shortcomings of these approaches is their inability to capture physical effects, ranging from electrostatic interactions to quantum delocalization, which have a long-range nature. Here we show how to build a multi-scale scheme that combines in the same framework local and non-local information, overcoming such limitations. We show that the simplest version of such features can be put in formal correspondence with a multipole expansion of permanent electrostatics. The data-driven nature of the model construction, however, makes this simple form suitable to tackle also different types of delocalized and collective effects. We present several examples that range from molecular physics, to surface science and biophysics, demonstrating the ability of this multi-scale approach to model interactions driven by electrostatics, polarization and dispersion, as well as the cooperative behavior of dielectric response functions.

preprint - 2020 - arXiv

Quantum kinetic energy and isotope fractionation in aqueous ionic solutions

At room temperature, the quantum contribution to the kinetic energy of a water molecule exceeds the classical contribution by an order of magnitude. The quantum kinetic energy (QKE) of a water molecule is modulated by its local chemical environment and leads to uneven partitioning of isotopes between different phases in thermal equilibrium, which would not occur if the nuclei behaved classically. In this work, we use ab initio path integral simulations to show that QKEs of the water molecules and the equilibrium isotope fractionation ratios of the oxygen and hydrogen isotopes are sensitive probes of the hydrogen bonding structures in aqueous ionic solutions. In particular, we demonstrate how the QKE of water molecules in path integral simulations can be decomposed into translational, rotational and vibrational degrees of freedom, and use them to determine the impact of solvation on different molecular motions. By analyzing the QKEs and isotope fractionation ratios, we show how the addition of the Na$^+$, Cl$^-$ and HPO$_4^{2-}$ ions perturbs the competition between quantum effects in liquid water and impacts their local solvation structures.

preprint - 2020 - arXiv

Structure-Property Maps with Kernel Principal Covariates Regression

Data analyses based on linear methods constitute the simplest, most robust, and transparent approaches to the automatic processing of large amounts of data for building supervised or unsupervised machine learning models. Principal covariates regression (PCovR) is an underappreciated method that interpolates between principal component analysis and linear regression, and can be used to conveniently reveal structure-property relations in terms of simple-to-interpret, low-dimensional maps. Here we provide a pedagogic overview of these data analysis schemes, including the use of the kernel trick to introduce an element of non-linearity, while maintaining most of the convenience and the simplicity of linear approaches. We then introduce a kernelized version of PCovR and a sparsified extension, and demonstrate the performance of this approach in revealing and predicting structure-property relations in chemistry and materials science, showing a variety of examples including elemental carbon, porous silicate frameworks, organic molecules, amino acid conformers, and molecular materials.
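
The interpolation between principal component analysis and linear regression described in the abstract above can be sketched in a few lines of Python. This is a simplified sample-space version of linear PCovR under assumed conventions (centering, Frobenius-norm scaling, and a mixing parameter `alpha`); the function name and toy data are hypothetical, and the paper's kernelized and sparsified variants are not reproduced here.

```python
import numpy as np

def pcovr_latent(X, Y, alpha, n_components):
    """Latent-space projection for a simplified linear PCovR.

    alpha = 1 reduces to PCA scores; alpha = 0 keeps only the part of X
    that linearly predicts Y. Scaling conventions are an assumption.
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    X = X / np.linalg.norm(X)  # Frobenius norm
    Y = Y / np.linalg.norm(Y)
    # Least-squares approximation of Y from X
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    Y_hat = X @ W
    # Mixed Gram matrix combining structure and property information
    K = alpha * (X @ X.T) + (1.0 - alpha) * (Y_hat @ Y_hat.T)
    evals, evecs = np.linalg.eigh(K)
    order = np.argsort(evals)[::-1][:n_components]
    return evecs[:, order] * np.sqrt(np.clip(evals[order], 0.0, None))

# Toy data: 30 samples, 5 features, one property driven by feature 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 5))
Y = 2.0 * X[:, :1] + 0.1 * rng.normal(size=(30, 1))

T_pca_like = pcovr_latent(X, Y, alpha=1.0, n_components=2)
T_mixed = pcovr_latent(X, Y, alpha=0.5, n_components=2)
```

Plotting the two columns of `T_mixed`, colored by the property, would give the kind of low-dimensional structure-property map the abstract refers to.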

preprint - 2019 - arXiv

A New Kind of Atlas of Zeolite Building Blocks

We have analysed structural motifs in the Deem database of hypothetical zeolites, to investigate whether the structural diversity found in this database can be well-represented by classical descriptors such as distances, angles, and ring sizes, or whether a more general representation of atomic structure, furnished by the smooth overlap of atomic positions (SOAP) method, is required to capture accurately structure-property relations. We assessed the quality of each descriptor by machine-learning the molar energy and volume for each hypothetical framework in the dataset. We have found that SOAP with a cutoff-length of 6 Å, which goes beyond near-neighbor tetrahedra, best describes the structural diversity in the Deem database by capturing relevant inter-atomic correlations. Kernel principal component analysis shows that SOAP maintains its superior performance even when reducing its dimensionality to those of the classical descriptors, and that the first three kernel principal components capture the main variability in the data set, allowing a 3D point cloud visualization of local environments in the Deem database. This "cloud atlas" of local environments was found to show good correlations with the contribution of a given motif to the density and stability of its parent framework. Local volume and energy maps constructed from the SOAP/machine-learning analyses provide new images of zeolites that reveal smooth variations of local volumes and energies across a given framework, and correlations between local volume and energy in a given framework.

preprint - 2019 - arXiv

Classical nucleation theory predicts the shape of the nucleus in homogeneous solidification

Macroscopic models of nucleation provide powerful tools for understanding activated phase transition processes. These models do not provide atomistic insights and can thus sometimes lack material-specific descriptions. Here we provide a comprehensive framework for constructing a continuum picture from an atomistic simulation of homogeneous nucleation. We use this framework to determine the shape of the equilibrium solid nucleus that forms inside bulk liquid for a Lennard-Jones potential. From this shape, we then extract the anisotropy of the solid-liquid interfacial free energy, by performing a reverse Wulff construction in the space of spherical harmonic expansions. We find that the shape of the nucleus is nearly spherical and that its anisotropy can be perfectly described using classical models.

preprint - 2019 - arXiv

Evidence for supercritical behavior of high-pressure liquid hydrogen

Hydrogen exhibits unusual behaviors at megabar pressures, with consequences for planetary science, condensed matter physics and materials science. Experiments at such extreme conditions are challenging, often resulting in hard-to-interpret and controversial observations. We present a theoretical study of the phase diagram of dense hydrogen, using machine learning to overcome time and length scale limitations while describing accurately interatomic forces. We reproduce the re-entrant melting behavior and the polymorphism of the solid phase. In simulations based on the machine learning potential we find evidence for continuous metallization in the liquid, as a first-order liquid-liquid transition is pre-empted by freezing. This suggests a smooth transition between insulating and metallic layers in giant gas planets, and reconciles existing discrepancies between experiments as a manifestation of supercritical behavior.

preprint - 2019 - arXiv

Feature Optimization for Atomistic Machine Learning Yields A Data-Driven Construction of the Periodic Table of the Elements

Machine-learning of atomic-scale properties amounts to extracting correlations between structure, composition and the quantity that one wants to predict. Representing the input structure in a way that best reflects such correlations makes it possible to improve the accuracy of the model for a given amount of reference data. When using a description of the structures that is transparent and well-principled, optimizing the representation might reveal insights into the chemistry of the data set. Here we show how one can generalize the SOAP kernel to introduce a distance-dependent weight that accounts for the multi-scale nature of the interactions, and a description of correlations between chemical species. We show that this improves substantially the performance of ML models of molecular and materials stability, while making it easier to work with complex, multi-component systems and to extend SOAP to coarse-grained intermolecular potentials. The element correlations that give the best performing model show striking similarities with the conventional periodic table of the elements, providing an inspiring example of how machine learning can rediscover, and generalize, intuitive concepts that constitute the foundations of chemistry.

preprint - 2019 - arXiv

Incorporating long-range physics in atomic-scale machine learning

The most successful and popular machine learning models of atomic-scale properties derive their transferability from a locality ansatz. The properties of a large molecule or a bulk material are written as a sum over contributions that depend on the configurations within finite atom-centered environments. The obvious downside of this approach is that it cannot capture non-local, non-additive effects such as those arising due to long-range electrostatics or quantum interference. We propose a solution to this problem by introducing non-local representations of the system that are remapped as feature vectors that are defined locally and are equivariant in O(3). We consider in particular one form that has the same asymptotic behavior as the electrostatic potential. We demonstrate that this framework can capture non-local, long-range physics by building a model for the electrostatic energy of randomly distributed point-charges, for the unrelaxed binding curves of charged organic molecular dimers, and for the electronic dielectric response of liquid water. By combining a representation of the system that is sensitive to long-range correlations with the transferability of an atom-centered additive model, this method outperforms current state-of-the-art machine-learning schemes, and provides a conceptual framework to incorporate non-local physics into atomistic machine learning.

preprint - 2019 - arXiv

Inexpensive modelling of quantum dynamics using path integral generalized Langevin equation thermostats

The properties of molecules and materials containing light nuclei are affected by their quantum mechanical nature. Modelling these quantum nuclear effects accurately requires computationally demanding path integral techniques. Considerable success has been achieved in reducing the cost of such simulations by using generalized Langevin dynamics to induce frequency-dependent fluctuations. Path integral generalized Langevin equation methods, however, have thus far been limited to the study of static, thermodynamic properties due to the large perturbation to the system's dynamics induced by the aggressive thermostatting. Here we introduce a post-processing scheme, based on analytical estimates of the dynamical perturbation induced by the generalized Langevin dynamics, that makes it possible to recover meaningful time correlation properties from a thermostatted trajectory. We show that this approach yields spectroscopic observables for model and realistic systems which have an accuracy comparable to much more demanding approximate quantum dynamics techniques based on full path integral simulations.