Researcher profile

Entao Yang

Entao Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

Is Grokking a Computational Glass Relaxation?

Understanding neural network's (NN) generalizability remains a central question in deep learning research. The special phenomenon of grokking, where NNs abruptly generalize long after the training performance reaches a near-perfect level, offers a unique window to investigate the underlying mechanisms of NNs' generalizability. Here we propose an interpretation for grokking by framing it as a computational glass relaxation: viewing NNs as a physical system where parameters are the degrees of freedom and train loss is the system energy, we find memorization process resembles a rapid cooling of liquid into non-equilibrium glassy state at low temperature and the later generalization is like a slow relaxation towards a more stable configuration. This mapping enables us to sample NNs' Boltzmann entropy (states of density) landscape as a function of training loss and test accuracy. Our experiments in transformers on arithmetic tasks suggests that there is NO entropy barrier in the memorization-to-generalization transition of grokking, challenging previous theory that defines grokking as a first-order phase transition. We identify a high-entropy advantage under grokking, an extension of prior work linking entropy to generalizability but much more significant. Inspired by grokking's far-from-equilibrium nature, we develop a toy optimizer WanD based on Wang-landau molecular dynamics, which can eliminate grokking without any constraints and find high-norm generalizing solutions. This provides strictly-defined counterexamples to theory attributing grokking solely to weight norm evolution towards the Goldilocks zone and also suggests new potential ways for optimizer design.

preprint2022arXiv

Structuro-elasto-plasticity (StEP) model for plasticity in disordered solids

Elastoplastic lattice models for the response of solids to deformation typically incorporate structure only implicitly via a local yield strain that is assigned to each site. However, the local yield strain can change in response to a nearby or even distant plastic event in the system. This interplay is key to understanding phenomena such as avalanches in which one plastic event can trigger another, leading to a cascade of events, but typically is neglected in elastoplastic models. To include the interplay one could calculate the local yield strain for a given particulate system and follow its evolution, but this is expensive and requires knowledge of particle interactions, which is often hard to extract from experiments. Instead, we introduce a structural quantity, "softness," obtained using machine learning to correlate with imminent plastic rearrangements. We show that softness also correlates with local yield strain. We incorporate softness to construct a "structuro-elasto-plasticity" model that reproduces particle simulation results quantitatively for several observable quantities, confirming that we capture the influence of the interplay of local structure, plasticity, and elasticity on material response.

preprint2022arXiv

Understanding Creep Suppression Mechanism in Polymer Nanocomposites through Machine Learning

While recent efforts have shown how local structure plays an essential role in the dynamic heterogeneity of homogeneous glass-forming materials, systems containing interfaces such as thin films or composite materials remain poorly understood. It is known that interfaces perturb the molecular packing nearby, however, numerous studies show the dynamics are modified over a much larger range. Here, we examine the dynamics in polymer nanocomposites (PNCs) using a combination of simulations and experiments and quantitatively separate the role of polymer packing from other effects on the dynamics, as a function of distance from the nanoparticle surfaces. After showing good qualitative agreement between the simulations and experiments in glassy structure and creep compliance, we use a recently developed machine learning technique to decompose polymer dynamics in our simulated PNCs into structure-dependent and structure-independent processes. With this decomposition, the free energy barrier for polymer rearrangement can be described as a combination of packing-dependent and packing-independent barriers. We find both barriers are higher near nanoparticles and decrease with applied stress, quantitatively demonstrating that the slow interfacial dynamics is not solely due to polymer packing differences, but also the change of structure-dynamics relationships. Finally, we present how this decomposition can be used to accurately predict strain-time creep curves for PNCs from their static configuration, providing additional insights into the effects of polymer-nanoparticle interfaces on creep suppression in PNCs.

preprint2021arXiv

The Role of Local Structure in the Enhanced Dynamics of Deformed Glasses

External stress can accelerate molecular mobility of amorphous solids by several orders of magnitude. The changes in mobility are commonly interpreted through the Eyring model, which invokes an empirical activation volume whose origin remains poorly understood. Here, we analyze constant-stress molecular dynamics simulations and propose an extension of the Eyring model with a machine-learned field, softness. Our model connects the activation volume, an empirical parameter, to a structural property (softness). We show that stress has an inhomogeneous effect on the mobility that depends on local structure, which explains the narrower distribution of relaxation time observed under stress.