Researcher profile

Oliver T. Unke

Oliver T. Unke contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2023arXiv

So3krates: Equivariant attention for interactions on arbitrary length-scales in molecular systems

The application of machine learning methods in quantum chemistry has enabled the study of numerous chemical phenomena, which are computationally intractable with traditional ab-initio methods. However, some quantum mechanical properties of molecules and materials depend on non-local electronic effects, which are often neglected due to the difficulty of modeling them efficiently. This work proposes a modified attention mechanism adapted to the underlying physics, which allows to recover the relevant non-local effects. Namely, we introduce spherical harmonic coordinates (SPHCs) to reflect higher-order geometric information for each atom in a molecule, enabling a non-local formulation of attention in the SPHC space. Our proposed model So3krates - a self-attention based message passing neural network - uncouples geometric information from atomic features, making them independently amenable to attention mechanisms. Thereby we construct spherical filters, which extend the concept of continuous filters in Euclidean space to SPHC space and serve as foundation for a spherical self-attention mechanism. We show that in contrast to other published methods, So3krates is able to describe non-local quantum mechanical effects over arbitrary length scales. Further, we find evidence that the inclusion of higher-order geometric correlations increases data efficiency and improves generalization. So3krates matches or exceeds state-of-the-art performance on popular benchmarks, notably, requiring a significantly lower number of parameters (0.25 - 0.4x) while at the same time giving a substantial speedup (6 - 14x for training and 2 - 11x for inference) compared to other models.

preprint2022arXiv

Accurate Machine Learned Quantum-Mechanical Force Fields for Biomolecular Simulations

Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations, offering similar accuracy as ab initio methods at orders-of-magnitude speedup. Until now, MLFFs mainly capture short-range interactions in small molecules or periodic materials, due to the increased complexity of constructing models and obtaining reliable reference data for large molecules, where long-ranged many-body effects become important. This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations (GEMS) by training on "bottom-up" and "top-down" molecular fragments of varying size, from which the relevant physicochemical interactions can be learned. GEMS is applied to study the dynamics of alanine-based peptides and the 46-residue protein crambin in aqueous solution, allowing nanosecond-scale MD simulations of >25k atoms at essentially ab initio quality. Our findings suggest that structural motifs in peptides and proteins are more flexible than previously thought, indicating that simulations at ab initio accuracy might be necessary to understand dynamic biomolecular processes such as protein (mis)folding, drug-protein binding, or allosteric regulation.

preprint2021arXiv

SpookyNet: Learning Force Fields with Electronic Degrees of Freedom and Nonlocal Effects

Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This work introduces SpookyNet, a deep neural network for constructing ML-FFs with explicit treatment of electronic degrees of freedom and quantum nonlocality. Chemically meaningful inductive biases and analytical corrections built into the network architecture allow it to properly model physical limits. SpookyNet improves upon the current state-of-the-art (or achieves similar performance) on popular quantum chemistry data sets. Notably, it is able to generalize across chemical and conformational space and can leverage the learned chemical insights, e.g. by predicting unknown spin states, thus helping to close a further important remaining gap for today's machine learning models in quantum chemistry.

preprint2020arXiv

Isomerization and Decomposition Reactions of Acetaldehyde Relevant to Atmospheric Processes from Dynamics Simulations on Neural Network-Based Potential Energy Surfaces

Acetaldehyde (AA) isomerization (to vinylalcohol, VA) and decomposition (into either CO+CH$_4$ and H$_2$+H$_2$CCO) is studied using a fully dimensional, reactive potential energy surface represented as a neural network (NN). The NN, trained on 432'399 reference structures from MP2/aug-cc-pVTZ calculations has a MAE of 0.0453 kcal/mol and an RMSE of 1.186 kcal/mol for a test set of 27'399 structures. For the isomerization process AA $\rightarrow$ VA the minimum dynamical path implies that the C-H vibration, and the C-C-H (with H being the transferring H-atom) and the C-C-O angles are involved to surmount the 68.2 kcal/mol barrier. Using an excess energy of 93.6 kcal/mol - the energy available in the solar spectrum and sufficient to excite to the first electronically excited state - to initialize the molecular dynamics, no isomerization to VA is observed on the 500 ns time scale. Only with excess energies of $\sim$ 127.6 kcal/mol (including the zero point energy of the AA molecule), isomerization occurs on the nanosecond time scale. Given that collisional de-excitation at atmospheric conditions in the stratosphere occurs on the 100 ns time scale, it is concluded that formation of VA following photoexcitation of AA from actinic photons is unlikely. This also limits the relevance of this reaction pathway to be a source for formic acid.

preprint2020arXiv

Thermal Activation of Methane by MgO$^+$: Temperature Dependent Kinetics, Reactive Molecular Dynamics Simulations and Statistical Modeling

The kinetics of MgO$^+$ + CH$_4$ was studied experimentally using the variable ion source, temperature adjustable selected ion flow tube (VISTA-SIFT) apparatus from 300 $-$ 600 K and computationally by running and analyzing reactive atomistic simulations. Rates and product branching fractions were determined as a function of temperature. The reaction proceeded with a rate of $k = 5.9 \pm 1.5 10^{-10}(T/300 $ K$)^{-0.5 \pm 0.2}$ cm$^3$ s$^{-1}$. MgOH$^+$ was the dominant product at all temperatures, but Mg$^+$, the co-product of oxygen-atom transfer to form methanol, was observed with a product branching fraction of $0.08 \pm 0.03 (T / 300 $ K$)^{-0.8 \pm 0.7}$. Reactive molecular dynamics simulations using a reactive force field, as well as a neural network yield rate coefficients about one order of magnitude lower. This underestimation of the rates is traced back to the multireference character of the transition state [MgOCH$_4$]$^+$. Statistical modeling of the temperature-dependent kinetics provides further insight into the reactive potential surface. The rate limiting step was found to be consistent with a four-centered activation of the C-H bond, consistent with previous calculations. The product branching was modeled as a competition between dissociation of an insertion intermediate directly after the rate-limiting transition state, and traversing a transition state corresponding to a methyl migration leading to a Mg-CH$_3$OH$^+$ complex, though only if this transition state is stabilized significantly relative to the dissociated MgOH$^+$ + CH$_3$ product channel. An alternative non-statistical mechanism is discussed, whereby a post-transition state bifurcation in the potential surface could allow the reaction to proceed directly from the four-centered TS to the Mg-CH$_3$OH$^+$ complex thereby allowing a more robust competition between the product channels.

preprint2019arXiv

High-Dimensional Potential Energy Surfaces for Molecular Simulations

An overview of computational methods to describe high-dimensional potential energy surfaces suitable for atomistic simulations is given. Particular emphasis is put on accuracy, computability, transferability and extensibility of the methods discussed. They include empirical force fields, representations based on reproducing kernels, using permutationally invariant polynomials, and neural network-learned representations and combinations thereof. Future directions and potential improvements are discussed primarily from a practical, application-oriented perspective.

preprint2019arXiv

Reactive Dynamics and Spectroscopy of Hydrogen Transfer from Neural Network-Based Reactive Potential Energy Surfaces

The in silico exploration of chemical, physical and biological systems requires accurate and efficient energy functions to follow their nuclear dynamics at a molecular and atomistic level. Recently, machine learning tools gained a lot of attention in the field of molecular sciences and simulations and are increasingly used to investigate the dynamics of such systems. Among the various approaches, artificial neural networks (NNs) are one promising tool to learn a representation of potential energy surfaces. This is done by formulating the problem as a mapping from a set of atomic positions $\mathbf{x}$ and nuclear charges $Z_i$ to a potential energy $V(\mathbf{x})$. Here, a fully-dimensional, reactive neural network representation for malonaldehyde (MA), acetoacetaldehyde (AAA) and acetylacetone (AcAc) is learned. It is used to run finite-temperature molecular dynamics simulations, and to determine the infrared spectra and the hydrogen transfer rates for the three molecules. The finite-temperature infrared spectrum for MA based on the NN learned on MP2 reference data provides a realistic representation of the low-frequency modes and the H-transfer band whereas the CH vibrations are somewhat too high in frequency. For AAA it is demonstrated that the IR spectroscopy is sensitive to the position of the transferring hydrogen at either the OCH- or OCCH$_3$ end of the molecule. For the hydrogen transfer rates it is demonstrated that the O-O vibration is a gating mode and largely determines the rate at which the hydrogen is transferred between the donor and acceptor. Finally, possibilities to further improve such NN-based potential energy surfaces are explored. They include the transferability of an NN-learned energy function across chemical species (here methylation) and transfer learning from a lower level of reference data (MP2) to a higher level of theory (pair natural orbital-LCCSD(T)).