Researcher profile

Jason W. Rocks

Jason W. Rocks contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 19 (Unverified) | Verification L1 | Unclaimed author
5 works
0 followers
5 topics
4 close collaborators

Published work

5 published items

Preprint, 2022, arXiv

Bias-variance decomposition of overparameterized regression with random linear features

In classical statistics, the bias-variance trade-off describes how varying a model's complexity (e.g., number of fit parameters) affects its ability to make accurate predictions. According to this trade-off, optimal performance is achieved when a model is expressive enough to capture trends in the data, yet not so complex that it overfits idiosyncratic features of the training data. Recently, it has become clear that this classic understanding of the bias-variance trade-off must be fundamentally revisited in light of the incredible predictive performance of "overparameterized models": models that avoid overfitting even when the number of fit parameters is large enough to perfectly fit the training data. Here, we present results for one of the simplest examples of an overparameterized model: regression with random linear features (i.e., a two-layer neural network with a linear activation function). Using the zero-temperature cavity method, we derive analytic expressions for the training error, test error, bias, and variance. We show that the linear random features model exhibits three phase transitions: two different transitions to an interpolation regime where the training error is zero, along with an additional transition between regimes with large bias and minimal bias. Using random matrix theory, we show how each transition arises due to small nonzero eigenvalues in the Hessian matrix. Finally, we compare and contrast the phase diagram of the random linear features model to the random nonlinear features model and ordinary regression, highlighting the new phase transitions that result from the use of linear basis functions.

Preprint, 2022, arXiv

Memorizing without overfitting: Bias, variance, and interpolation in over-parameterized models

The bias-variance trade-off is a central concept in supervised learning. In classical statistics, increasing the complexity of a model (e.g., number of parameters) reduces bias but also increases variance. Until recently, it was commonly believed that optimal performance is achieved at intermediate model complexities which strike a balance between bias and variance. Modern deep learning methods flout this dogma, achieving state-of-the-art performance using "over-parameterized models" where the number of fit parameters is large enough to perfectly fit the training data. As a result, understanding bias and variance in over-parameterized models has emerged as a fundamental problem in machine learning. Here, we use methods from statistical physics to derive analytic expressions for bias and variance in two minimal models of over-parameterization (linear regression and two-layer neural networks with nonlinear data distributions), allowing us to disentangle properties stemming from the model architecture and random sampling of data. In both models, increasing the number of fit parameters leads to a phase transition where the training error goes to zero and the test error diverges as a result of the variance (while the bias remains finite). Beyond this threshold, the test error of the two-layer neural network decreases due to a monotonic decrease in both the bias and variance, in contrast with the classical bias-variance trade-off. We also show that, in contrast with classical intuition, over-parameterized models can overfit even in the absence of noise and exhibit bias even if the student and teacher models match. We synthesize these results to construct a holistic understanding of generalization error and the bias-variance trade-off in over-parameterized models and relate our results to random matrix theory.

Preprint, 2020, arXiv

Correlation of plastic events with local structure in jammed packings across spatial dimensions

In jammed packings, it is usually thought that local structure only plays a significant role in specific regimes. The standard deviation of the relative excess coordination, $\sigma_Z / Z_\mathrm{c}$, decays like $1/\sqrt{d}$, so that local structure should play no role in high spatial dimensions. Furthermore, in any fixed dimension $d \geq 2$, there are diverging length scales as the pressure vanishes approaching the unjamming transition, again suggesting that local structure should not be sufficient to describe response. Here we challenge the assumption that local structure does not matter in these cases. In simulations of jammed packings under athermal, quasistatic shear, we use machine learning to identify a local structural variable, softness, that correlates with rearrangements in dimensions $d=2$ to $d=5$. We find that softness, and even just the coordination number $Z$, are quite predictive of rearrangements over a wide range of pressures, all the way down to unjamming, in all $d$ studied. This result provides direct evidence that local structure can play a role in higher spatial dimensions.

Preprint, 2019, arXiv

Revealing structure-function relationships in functional flow networks via persistent homology

Complex networks encountered in biology are often characterized by significant structural diversity. Whether it be differences in the three-dimensional structure of allosteric proteins, or the variation among the micro-scale structures of organisms' cerebral vasculature systems, identifying relationships between structure and function often poses a difficult challenge. Here we showcase an approach to characterizing structure-function relationships in complex networks applied in the context of flow networks tuned to perform specific functions. Using persistent homology, we analyze flow networks tuned to perform complex multifunctional tasks, answering the question of how local changes in the network structure coordinate to create functionality at the scale of the entire network. We find that the response of such networks encodes hidden topological features (sectors of uniform pressure) that are not apparent in the underlying network architectures. Regardless of differences in local connectivity, these features provide a universal topological description for all networks that perform these types of functions. We show that these features correlate strongly with the tuned response, providing a clear topological relationship between structure and function and structural insight into the limits of multifunctionality.

Preprint, 2019, arXiv

The hidden topological structure of flow network functionality

The ability to reroute and control flow is vital to the function of venation networks across a wide range of organisms. By modifying individual edges in these networks, either by adjusting edge conductances or creating and destroying edges, organisms can robustly control the propagation of inputs to perform specific tasks. However, a fundamental disconnect exists between the structure and function of these networks: networks with different local architectures can perform the same functions. Here we answer the question of how structural changes at the microscopic level are able to collectively create functionality at the scale of an entire network. Using persistent homology, we analyze networks tuned to perform complex multifunctional tasks. We find that the responses of such networks encode a hidden topological structure composed of sectors of uniform pressure. Although these sectors are not apparent in the underlying network architectures, we find that they nonetheless correlate strongly with the tuned function. We conclude that the connectivity of these sectors, rather than that of the individual nodes, provides a quantitative relationship between structure and function in flow networks. Finally, we use this topological description to place a bound on the limits of task complexity.