Researcher profile

Anubhav Jain

Anubhav Jain contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2022arXiv

Machine-learning rationalization and prediction of solid-state synthesis conditions

There currently exist no quantitative methods to determine the appropriate conditions for solid-state synthesis. This not only hinders the experimental realization of novel materials but also complicates the interpretation and understanding of solid-state reaction mechanisms. Here, we demonstrate a machine-learning approach that predicts synthesis conditions using large solid-state synthesis datasets text-mined from scientific journal articles. Using feature importance ranking analysis, we discovered that optimal heating temperatures have strong correlations with the stability of precursor materials quantified using melting points and formation energies ($ΔG_f$, $ΔH_f$). In contrast, features derived from the thermodynamics of synthesis-related reactions did not directly correlate to the chosen heating temperatures. This correlation between optimal solid-state heating temperature and precursor stability extends Tamman's rule from intermetallics to oxide systems, suggesting the importance of reaction kinetics in determining synthesis conditions. Heating times are shown to be strongly correlated with the chosen experimental procedures and instrument setups, which may be indicative of human bias in the dataset. Using these predictive features, we constructed machine-learning models with good performance and general applicability to predict the conditions required to synthesize diverse chemical systems. Codes and data used in this work can be found at: https://github.com/CederGroupHub/s4.

preprint2022arXiv

Text-mined dataset of gold nanoparticle synthesis procedures, morphologies, and size entities

Gold nanoparticles are highly desired for a range of technological applications due to their tunable properties, which are dictated by the size and shape of the constituent particles. Many heuristic methods for controlling the morphological characteristics of gold nanoparticles are well known. However, the underlying mechanisms controlling their size and shape remain poorly understood, partly due to the immense range of possible combinations of synthesis parameters. Data-driven methods can offer insight to help guide understanding of these underlying mechanisms, so long as sufficient synthesis data are available. To facilitate data mining in this direction, we have constructed and made publicly available a dataset of codified gold nanoparticle synthesis protocols and outcomes extracted directly from the nanoparticle materials science literature using natural language processing and text-mining techniques. This dataset contains 5,154 data records, each representing a single gold nanoparticle synthesis article, filtered from a database of 4,973,165 publications. Each record contains codified synthesis protocols and extracted morphological information from a total of 7,608 experimental and 12,519 characterization paragraphs.

preprint2021arXiv

Non-destructive Characterization of Anti-Reflective Coatings on PV Modules

Anti-reflective coatings (ARCs) are used on the vast majority of solar photovoltaic (PV) modules to increase power production. However, ARC longevity can vary from less than 1 year to over 15 years depending on coating quality and deployment conditions. A technique that can quantify ARC degradation non-destructively on commercial modules would be useful both for in-field diagnostics and accelerated aging tests. In this paper, we demonstrate that accurate measurements of ARC spectral reflectance can be performed using a modified commercially-available integrating-sphere probe. The measurement is fast, accurate, non-destructive and can be performed outdoors in full-sun conditions. We develop an interferometric model that estimates coating porosity, thickness and fractional area coverage from the measured reflectance spectrum for a uniform single-layer coating. We demonstrate the measurement outdoors on an active PV installation, identify the presence of an ARC and estimate the properties of the coating.

preprint2021arXiv

Optimal Band Structure for Thermoelectrics with Realistic Scattering and Bands

Understanding how to optimize electronic band structures for thermoelectrics is a topic of long-standing interest in the community. Prior models have been limited to simplified bands and/or scattering models. In this study, we apply more rigorous scattering treatments to more realistic model band structures - upward-parabolic bands that inflect to an inverted parabolic behavior - including cases of multiple bands. In contrast to common descriptors (e.g., quality factor and complexity factor), the degree to which multiple pockets improve thermoelectric performance is bounded by interband scattering and the relative shapes of the bands. We establish that extremely anisotropic `flat-and-dispersive' bands, although best-performing in theory, may not represent a promising design strategy in practice. Critically, we determine optimum bandwidth, dependent on temperature and lattice thermal conductivity, from perfect transport cutoffs that can in theory significantly boost $zT$ beyond the values attainable through intrinsic band structures alone. Our analysis should be widely useful as the thermoelectric research community eyes $zT>3$.

preprint2021arXiv

Recent Advances and Applications of Deep Learning Methods in Materials Science

Deep learning (DL) is one of the fastest growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. Recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular. In contrast, advances in image and spectral data have largely leveraged synthetic data enabled by high quality forward models as well as by generative unsupervised DL methods. In this article, we present a high-level overview of deep-learning methods followed by a detailed discussion of recent developments of deep learning in atomistic simulation, materials imaging, spectral analysis, and natural language processing. For each modality we discuss applications involving both theoretical and experimental data, typical modeling approaches with their strengths and limitations, and relevant publicly available software and datasets. We conclude the review with a discussion of recent cross-cutting work related to uncertainty quantification in this field and a brief perspective on limitations, challenges, and potential growth areas for DL methods in materials science. The application of DL methods in materials science presents an exciting avenue for future materials discovery and design.

preprint2020arXiv

A critical examination of compound stability predictions from machine-learned formation energies

Machine learning has emerged as a novel tool for the efficient prediction of materials properties, and claims have been made that machine-learned models for the formation energy of compounds can approach the accuracy of Density Functional Theory (DFT). The models tested in this work include five recently published compositional models, a baseline model using stoichiometry alone, and a structural model. By testing seven machine learning models for formation energy on stability predictions using the Materials Project database of DFT calculations for 85,014 unique chemical compositions, we show that while formation energies can indeed be predicted well, all compositional models perform poorly on predicting the stability of compounds, making them considerably less useful than DFT for the discovery and design of new solids. Most critically, in sparse chemical spaces where few stoichiometries have stable compounds, only the structural model is capable of efficiently detecting which materials are stable. The non-incremental improvement of structural models compared with compositional models is noteworthy and encourages the use of structural models for materials discovery, with the constraint that for any new composition, the ground-state structure is not known a priori. This work demonstrates that accurate predictions of formation energy do not imply accurate predictions of stability, emphasizing the importance of assessing model performance on stability predictions, for which we provide a set of publicly available tests.

preprint2020arXiv

Benchmarking Materials Property Prediction Methods: The Matbench Test Set and Automatminer Reference Algorithm

We present a benchmark test suite and an automated machine learning procedure for evaluating supervised machine learning (ML) models for predicting properties of inorganic bulk materials. The test suite, Matbench, is a set of 13 ML tasks that range in size from 312 to 132k samples and contain data from 10 density functional theory-derived and experimental sources. Tasks include predicting optical, thermal, electronic, thermodynamic, tensile, and elastic properties given a materials composition and/or crystal structure. The reference algorithm, Automatminer, is a highly-extensible, fully-automated ML pipeline for predicting materials properties from materials primitives (such as composition and crystal structure) without user intervention or hyperparameter tuning. We test Automatminer on the Matbench test suite and compare its predictive power with state-of-the-art crystal graph neural networks and a traditional descriptor-based Random Forest model. We find Automatminer achieves the best performance on 8 of 13 tasks in the benchmark. We also show our test suite is capable of exposing predictive advantages of each algorithm - namely, that crystal graph methods appear to outperform traditional machine learning methods given ~10^4 or greater data points. The pre-processed, ready-to-use Matbench tasks and the Automatminer source code are open source and available online (http://hackingmaterials.lbl.gov/automatminer/). We encourage evaluating new materials ML algorithms on the MatBench benchmark and comparing them against the latest version of Automatminer.

preprint2020arXiv

High Thermoelectric Performance and Defect Energetics of Multi-pocketed Full-Heusler Compounds

We report first-principles density-functional study of electron-phonon interactions and thermoelectric transport properties of full-Heusler compounds Sr$_{2}$BiAu and Sr$_{2}$SbAu. Our results show that ultrahigh intrinsic bulk thermoelectric performance across a wide range of temperatures is physically possible and point to the presence of multiply degenerate and highly dispersive carrier pockets as the key factor for achieving it. Sr$_{2}$BiAu, which features ten energy-aligned low effective mass pockets (six along $Γ-X$ and four at $L$), is predicted to deliver $n$-type $zT=0.4-4.9$ at $T=100-700$~K. Comparison with the previously investigated Ba$_{2}$BiAu compound shows that the additional $L$-pockets in Sr$_{2}$BiAu significantly increase its low-temperature power factor to a maximum value of $12$~mW~m$^{-1}$~K$^{-2}$ near $T=300$~K. However, at high temperatures the power factor of Sr$_{2}$BiAu drops below that of Ba$_{2}$BiAu because the $L$ states are heavier and subject to strong scattering by phonon deformation as opposed to the lighter $Γ-X$ states that are limited by polar-optical scattering. Sr$_{2}$SbAu is predicted to deliver lower $n$-type of $zT=3.4$ at $T=750$~K due to appreciable misalignment between the $L$ and $Γ-X$ carrier pockets, generally heavier scattering, and slightly higher lattice thermal conductivity. Soft acoustic modes, responsible for low lattice thermal conductivity, also increase vibrational entropies and high-temperature stability of the Heusler compounds, suggesting that their experimental synthesis may be feasible. The dominant intrinsic defects are found to be Au vacancies, which drive the Fermi level towards the conduction band and work in favor of $n$-doping.

preprint2019arXiv

A transferable machine-learning framework linking interstice distribution and plastic heterogeneity in metallic glasses

When metallic glasses (MGs) are subjected to mechanical loads, the plastic response of atoms is non-uniform. However, the extent and manner in which atomic environment signatures present in the undeformed structure determine this plastic heterogeneity remain elusive. Here, we demonstrate that novel site environment features that characterize interstice distributions around atoms combined with machine learning (ML) can reliably identify plastic sites in several Cu-Zr compositions. Using only quenched structural information as input, the ML-based plastic probability estimates ("quench-in softness" metric) can identify plastic sites that could activate at high strains, losing predictive power only upon the formation of shear bands. Moreover, we reveal that a quench-in softness model trained on a single composition and quenching rate substantially improves upon previous models in generalizing to different compositions and completely different MG systems (Ni62Nb38, Al90Sm10 and Fe80P20). Our work presents a general, data-centric framework that could potentially be used to address the structural origin of any site-specific property in MGs.