Researcher profile

Nicholas Lubbers

Nicholas Lubbers contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Effects of Structural Allocation of Geometric Task Diversity in Linear Meta-Learning Models

Meta-learning aims to leverage information across related tasks to improve prediction on unlabeled data for new tasks when only a small number of labeled observations are available ("few-shot" learning). Increased task diversity is often believed to enhance meta-learning by providing richer information across tasks. However, recent work by Kumar et al. (2022) shows that increasing task diversity, quantified through the overall geometric spread of task representations, can in fact degrade meta-learning prediction performance across a range of models and datasets. In this work, we build on this observation by showing that meta-learning performance is affected not only by the overall geometric variability of task parameters, but also by how this variability is allocated relative to an underlying low-dimensional structure. Similar to Pimonova et al. (2025), we decompose task-specific regression effects into a structurally informative component and an orthogonal, non-informative component. We show theoretically and through simulation that meta-learning prediction degrades when a larger fraction of between-task variability lies in orthogonal, non-informative directions, even when the overall geometric variability of tasks is held fixed.

preprint2022arXiv

Machine Learning of consistent thermodynamic models using automatic differentiation

We propose a data-driven method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS are that the relationships between state variables are given by derivatives of the system free energy. In this work, we model the free energy with an artificial neural network, and utilize automatic differentiation to directly learn the derivatives of the free energy. We demonstrate this approach on two different systems, the analytic van der Waals EOS, and published data for the Lennard-Jones fluid, and show that it is advantageous over direct learning of thermodynamic properties (i.e. not as derivatives of the free energy, but as independent properties), in terms of both accuracy and the exact preservation of the Maxwell relations. Furthermore, the method implicitly provides the free energy of a system without explicit integration.

preprint2022arXiv

Training Data Selection for Accuracy and Transferability of Interatomic Potentials

Advances in machine learning (ML) techniques have enabled the development of interatomic potentials that promise both the accuracy of first principles methods and the low-cost, linear scaling, and parallel efficiency of empirical potentials. Despite rapid progress in the last few years, ML-based potentials often struggle to achieve transferability, that is, to provide consistent accuracy across configurations that significantly differ from those used to train the model. In order to truly realize the promise of ML-based interatomic potentials, it is therefore imperative to develop systematic and scalable approaches for the generation of diverse training sets that ensure broad coverage of the space of atomic environments. This work explores a diverse-by-construction approach that leverages the optimization of the entropy of atomic descriptors to create a very large ($>2\cdot10^{5}$ configurations, $>7\cdot10^{6}$ atomic environments) training set for tungsten in an automated manner, i.e., without any human intervention. This dataset is used to train polynomial as well as multiple neural network potentials with different architectures. For comparison, a corresponding family of potentials were also trained on an expert-curated dataset for tungsten. The models trained to entropy-optimized data exhibited vastly superior transferability compared to the expert-curated models. Furthermore, while the models trained with heavy user input (i.e., domain expertise) yield the lowest errors when tested on similar configurations, out-sample predictions are dramatically more robust when the models are trained on a deliberately diverse set of training data. Herein we demonstrate the development of both accurate and transferable ML potentials using automated and data-driven approaches for generating large and diverse training sets.

preprint2020arXiv

Embedding Hard Physical Constraints in Neural Network Coarse-Graining of 3D Turbulence

In the recent years, deep learning approaches have shown much promise in modeling complex systems in the physical sciences. A major challenge in deep learning of PDEs is enforcing physical constraints and boundary conditions. In this work, we propose a general framework to directly embed the notion of an incompressible fluid into Convolutional Neural Networks, and apply this to coarse-graining of turbulent flow. These physics-embedded neural networks leverage interpretable strategies from numerical methods and computational fluid dynamics to enforce physical laws and boundary conditions by taking advantage the mathematical properties of the underlying equations. We demonstrate results on three-dimensional fully-developed turbulence, showing that this technique drastically improves local conservation of mass, without sacrificing performance according to several other metrics characterizing the fluid flow.

preprint2020arXiv

Simple and efficient algorithms for training machine learning potentials to force data

Abstract Machine learning models, trained on data from ab initio quantum simulations, are yielding molecular dynamics potentials with unprecedented accuracy. One limiting factor is the quantity of available training data, which can be expensive to obtain. A quantum simulation often provides all atomic forces, in addition to the total energy of the system. These forces provide much more information than the energy alone. It may appear that training a model to this large quantity of force data would introduce significant computational costs. Actually, training to all available force data should only be a few times more expensive than training to energies alone. Here, we present a new algorithm for efficient force training, and benchmark its accuracy by training to forces from real-world datasets for organic chemistry and bulk aluminum.