Researcher profile

Anders M. N. Niklasson

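Anders M. N. Niklasson is listed as an author of five arXiv preprints (2020 to 2022) on topics including extended Lagrangian Born-Oppenheimer molecular dynamics, density-matrix methods, and electronic structure solvers.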

Researcher. Affiliation not imported. Open to collaborate.

Trust snapshot

Quick read

Trust 19 - Unverified
Verification L1
Unclaimed author
5 works
0 followers
8 topics
4 close collaborators

Published work

5 published items

Preprint, 2022, arXiv

Quantum perturbation theory using Tensor cores and a deep neural network

Time-independent quantum response calculations are performed using Tensor cores. This is achieved by mapping density matrix perturbation theory onto the computational structure of a deep neural network. The computational cost of each deep layer is dominated by tensor contractions, i.e. dense matrix-matrix multiplications, in mixed-precision arithmetic, which achieves close to peak performance. Quantum response calculations are demonstrated and analyzed using self-consistent charge density-functional tight-binding theory as well as coupled-perturbed Hartree-Fock theory. For linear response calculations, a novel parameter-free convergence criterion is presented that is well suited for numerically noisy low-precision floating-point operations. A peak performance of almost 200 Tflops is demonstrated using the Tensor cores of two Nvidia A100 GPUs.
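
The layered structure described in this abstract can be illustrated with a small sketch. The Python below is a toy, not the preprint's method: it assumes a second-order spectral projection (SP2) style recursion for the unperturbed density matrix only, uses plain lists rather than Tensor cores or mixed precision, and leaves out the perturbation (response) terms entirely. The function names, the 4-site test Hamiltonian, and the spectral bounds are hypothetical choices made for illustration.

```python
# Toy sketch (assumption: an SP2-style recursion; not the preprint's code).
# Each "layer" applies one of two polynomials and costs one matrix product.

def matmul(a, b):
    """Dense product of two square matrices stored as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    """Sum of the diagonal elements."""
    return sum(a[i][i] for i in range(len(a)))


def sp2_density_matrix(h, n_occ, e_min, e_max, layers=40):
    """Approximate the projector onto the n_occ lowest eigenstates of h.

    e_min and e_max are assumed lower and upper bounds on the spectrum of h.
    """
    n = len(h)
    # Input layer: map the spectrum linearly into [0, 1], reversed so that
    # low-energy (occupied) states end up close to 1.
    x = [[((e_max if i == j else 0.0) - h[i][j]) / (e_max - e_min)
          for j in range(n)] for i in range(n)]
    for _ in range(layers):
        x2 = matmul(x, x)  # the single matrix product of this layer
        tr_x, tr_x2 = trace(x), trace(x2)
        # x^2 lowers the trace and 2x - x^2 raises it; keep whichever
        # result has a trace closer to the target occupation n_occ.
        if abs(tr_x2 - n_occ) < abs(2.0 * tr_x - tr_x2 - n_occ):
            x = x2
        else:
            x = [[2.0 * x[i][j] - x2[i][j] for j in range(n)]
                 for i in range(n)]
    return x


# Hypothetical test system: 4-site chain with nearest-neighbour hopping -1.
h_chain = [[0.0, -1.0, 0.0, 0.0],
           [-1.0, 0.0, -1.0, 0.0],
           [0.0, -1.0, 0.0, -1.0],
           [0.0, 0.0, -1.0, 0.0]]
d_chain = sp2_density_matrix(h_chain, n_occ=1, e_min=-2.0, e_max=2.0)
```

Each pass through the loop costs one dense matrix product, which is the operation the abstract identifies as dominating the cost of a layer.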

Preprint, 2021, arXiv

Performance Optimizations of Recursive Electronic Structure Solvers targeting Multi-Core Architectures (LA-UR-20-26665)

As we rapidly approach the frontiers of ultra-large computing resources, software optimization is becoming of paramount interest to scientific application developers who want to efficiently leverage all available on-node computing capabilities and thereby improve a requisite science-per-watt metric. The scientific application of interest here is the Basic Math Library (BML), which provides a single interface for linear algebra operations frequently used in the Quantum Molecular Dynamics (QMD) community. The provisioning of a single interface indicates the presence of an abstraction layer, which in turn suggests commonalities in the code base; any optimization or tuning introduced in the core of the code base can therefore positively affect the performance of the library as a whole. With that in mind, we proceed with this investigation by surveying the entire BML code base and extracting common snippets of code in the form of micro-kernels. We introduce several optimization strategies into these micro-kernels, including (1) strength reduction, (2) memory alignment for large arrays, (3) Non-Uniform Memory Access (NUMA) aware allocations to enforce data locality, and (4) appropriate thread affinity and bindings to enhance the overall multi-threaded performance. After introducing these optimizations, we benchmark the micro-kernels and compare the run time before and after optimization for several target architectures. Finally, we use the results as a guide to propagating the optimization strategies into the BML code base. As a demonstration, we test the efficacy of these optimization strategies by comparing the benchmark and optimized versions of the code.
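
Of the four strategies named in this abstract, strength reduction is the easiest to show in a few lines. The sketch below is a generic illustration and is not taken from the BML code base: both function names and the row-sum kernel are hypothetical. It is written in Python for readability only; the performance benefit of the transformation is expected in compiled kernels, not in interpreted code.

```python
# Generic illustration of strength reduction (hypothetical kernel).

def row_sums_naive(flat, n_rows, n_cols):
    """Row sums of a row-major matrix, multiplying on every access."""
    sums = []
    for i in range(n_rows):
        s = 0.0
        for j in range(n_cols):
            s += flat[i * n_cols + j]  # multiply inside the inner loop
        sums.append(s)
    return sums


def row_sums_reduced(flat, n_rows, n_cols):
    """Same result, with the multiply replaced by a running offset."""
    sums = []
    base = 0  # always equals i * n_cols, maintained by addition
    for _ in range(n_rows):
        s = 0.0
        for j in range(n_cols):
            s += flat[base + j]
        sums.append(s)
        base += n_cols  # cheaper addition instead of i * n_cols
    return sums
```

Both functions return the same list for any row-major input, which is the property a before-and-after micro-kernel benchmark relies on.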

Preprint, 2021, arXiv

Shadow Lagrangian dynamics for superfluidity

Motivated by a similar approach for Born-Oppenheimer molecular dynamics, this paper proposes an extended "shadow" Lagrangian density for quantum states of superfluids. The extended Lagrangian contains an additional field variable that is forced to follow the wave function of the quantum state through a rapidly oscillating extended harmonic oscillator. By considering the adiabatic limit for large frequencies of the harmonic oscillator, we can derive the two equations of motion: a Schrödinger-type equation for the quantum state and a wave equation for the extended field variable. The equations are coupled in a nonlinear way, but each equation individually is linear with respect to the variable that it defines. The computational advantage of this new system is that it can be easily discretized using linear time-stepping methods, where we propose to use a Crank-Nicolson-type approach for the Schrödinger equation and an extended leapfrog scheme for the wave equation. Furthermore, the difference between the quantum state and the extended field variable defines a consistency error that should go to zero as the frequency tends to infinity. By coupling the time-step size in our discretization to the frequency of the harmonic oscillator, we can extract an easily computable consistency error indicator that can be used to estimate the numerical error at no additional cost. The findings are illustrated in numerical experiments.
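
The idea of an auxiliary variable that follows a target through a stiff harmonic oscillator, with the oscillator frequency tied to the time step, can be sketched with a scalar toy model. This is not the paper's discretization: the prescribed target `sin(t)`, the function name `track`, and the parameter `kappa` are assumptions introduced for illustration, and the scheme is a plain Verlet (leapfrog-type) update rather than the extended leapfrog scheme the abstract refers to.

```python
import math

# Scalar toy model (assumption, not the paper's PDE scheme): an auxiliary
# variable n follows a prescribed target q(t) through a harmonic oscillator.


def track(dt, kappa=1.0, omega_target=1.0, t_end=2.0):
    """Integrate n'' = w^2 (q(t) - n) with a Verlet update.

    q(t) = sin(omega_target * t) stands in for the quantum state and n for
    the extended variable. The oscillator frequency w is tied to the time
    step through kappa = (dt * w)^2, so a smaller dt means a larger w.
    Returns the largest consistency error |q - n| seen along the run.
    """
    def q(t):
        return math.sin(omega_target * t)

    n_prev, n = q(-dt), q(0.0)  # start on the target trajectory
    worst = 0.0
    steps = int(round(t_end / dt))
    for k in range(steps):
        t = k * dt
        # Verlet step: kappa plays the role of dt^2 * w^2.
        n_next = 2.0 * n - n_prev + kappa * (q(t) - n)
        n_prev, n = n, n_next
        worst = max(worst, abs(q(t + dt) - n))
    return worst


error_coarse = track(dt=0.02)  # lower oscillator frequency
error_fine = track(dt=0.01)    # higher oscillator frequency
```

In this toy setting the gap between `n` and `q` shrinks as the time step decreases and the coupled frequency grows, which mirrors the consistency error behaviour described in the abstract. The gap is available at every step without extra work, so it can serve as a cheap error indicator.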

Preprint, 2020, arXiv

Density-matrix based Extended Lagrangian Born-Oppenheimer Molecular Dynamics

Extended Lagrangian Born-Oppenheimer molecular dynamics [Phys. Rev. Lett. 2008, 100, 123004] is presented for Hartree-Fock theory, where the extended electronic degrees of freedom are represented by a density matrix, including fractional occupation numbers at elevated electronic temperatures. In contrast to regular direct Born-Oppenheimer molecular dynamics simulations, no iterative self-consistent field optimization is required prior to the force evaluations. To sample regions of the potential energy landscape where the gap is small or vanishing, which leads to particular convergence problems in regular direct Born-Oppenheimer molecular dynamics simulations, an adaptive integration scheme for the extended electronic degrees of freedom is presented. The integration scheme is based on a tunable, low-rank approximation of a fourth-order kernel, $\mathcal{K}$, that determines the metric tensor, $\mathcal{T} \equiv \mathcal{K}^T \mathcal{K}$, used in the extended harmonic oscillator of the Lagrangian that generates the dynamics of the electronic degrees of freedom. The formulation and algorithms provide a general guide to implement extended Lagrangian Born-Oppenheimer molecular dynamics for quantum chemistry, density functional theory, and semiempirical methods using a density matrix formalism.

Preprint, 2020, arXiv

Modeling solid-liquid interface reactions with next generation extended Lagrangian quantum-based molecular dynamics

We demonstrate the applicability of extended Lagrangian Born-Oppenheimer quantum-based molecular dynamics (XL-BOMD) to model electron transfer reactions occurring on solid-liquid interfaces. Specifically, we consider the reduction of O$_2$ as catalyzed at the interface of an N-doped graphene sheet and H$_2$O at fuel cell cathodes. This system is a good testbed for next-generation computational chemistry methods since the electrochemical functionalities strongly depend on atomic-scale quantum mechanics. As opposed to prior iterations of first principles molecular dynamics, XL-BOMD only requires a full self-consistent-charge relaxation during the initial time step. The electronic ground state and total energy are stabilized thereafter through nuclear and electronic equations of motion assisted by an inner-product kernel updated with low-rank approximations. A species charge analysis reveals that the kernel-based XL-BOMD simulation can capture an electron transfer between the PGM-free catalyst and a solvated O$_2$ molecule mediated by H$_2$O, which results in the molecular dissociation of O$_2$.