Source author record

David Rolnick

David Rolnick appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.CO Artificial Intelligence Computational Complexity Computer Vision cs.CY math.AC math.AG math.MG Neural and Evolutionary Computing Symbolic Computation

Catalog footprint

What is connected

11works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior

In-context reinforcement learning (ICRL) promises fast adaptation to unseen environments without parameter updates, but current methods either cannot improve beyond the training distribution or require near-optimal data, limiting practical adoption. We introduce SPICE, a Bayesian ICRL method that learns a prior over Q-values via deep ensemble and updates this prior at test-time using in-context information through Bayesian updates. To recover from poor priors resulting from training on sub-optimal data, our online inference follows an Upper-Confidence Bound rule that favours exploration and adaptation. We prove that SPICE achieves regret-optimal behaviour in both stochastic bandits and finite-horizon MDPs, even when pretrained only on suboptimal trajectories. We validate these findings empirically across bandit and control benchmarks. SPICE achieves near-optimal decisions on unseen tasks, substantially reduces regret compared to prior ICRL and meta-RL approaches while rapidly adapting to unseen tasks and remaining robust under distribution shift.

preprint2024arXiv

Dataset Difficulty and the Role of Inductive Bias

Motivated by the goals of dataset pruning and defect identification, a growing body of methods have been developed to score individual examples within a dataset. These methods, which we call "example difficulty scores", are typically used to rank or categorize examples, but the consistency of rankings between different training runs, scoring methods, and model architectures is generally unknown. To determine how example rankings vary due to these random and controlled effects, we systematically compare different formulations of scores over a range of runs and model architectures. We find that scores largely share the following traits: they are noisy over individual runs of a model, strongly correlated with a single notion of difficulty, and reveal examples that range from being highly sensitive to insensitive to the inductive biases of certain model architectures. Drawing from statistical genetics, we develop a simple method for fingerprinting model architectures using a few sensitive examples. These findings guide practitioners in maximizing the consistency of their scores (e.g. by choosing appropriate scoring methods, number of runs, and subsets of examples), and establishes comprehensive baselines for evaluating scores in the future.

preprint2022arXiv

Bugs in the Data: How ImageNet Misrepresents Biodiversity

ImageNet-1k is a dataset often used for benchmarking machine learning (ML) models and evaluating tasks such as image recognition and object detection. Wild animals make up 27% of ImageNet-1k but, unlike classes representing people and objects, these data have not been closely scrutinized. In the current paper, we analyze the 13,450 images from 269 classes that represent wild animals in the ImageNet-1k validation set, with the participation of expert ecologists. We find that many of the classes are ill-defined or overlapping, and that 12% of the images are incorrectly labeled, with some classes having >90% of images incorrect. We also find that both the wildlife-related labels and images included in ImageNet-1k present significant geographical and cultural biases, as well as ambiguities such as artificial animals, multiple species in the same image, or the presence of humans. Our findings highlight serious issues with the extensive use of this dataset for evaluating ML systems, the use of such algorithms in wildlife-related tasks, and more broadly the ways in which ML datasets are commonly created and curated.

preprint2022arXiv

On Neural Architecture Inductive Biases for Relational Tasks

Current deep learning approaches have shown good in-distribution generalization performance, but struggle with out-of-distribution generalization. This is especially true in the case of tasks involving abstract relations like recognizing rules in sequences, as we find in many intelligence tests. Recent work has explored how forcing relational representations to remain distinct from sensory representations, as it seems to be the case in the brain, can help artificial systems. Building on this work, we further explore and formalize the advantages afforded by 'partitioned' representations of relations and sensory details, and how this inductive bias can help recompose learned relational structure in newly encountered settings. We introduce a simple architecture based on similarity scores which we name Compositional Relational Network (CoRelNet). Using this model, we investigate a series of inductive biases that ensure abstract relations are learned and represented distinctly from sensory data, and explore their effects on out-of-distribution generalization for a series of relational psychophysics tasks. We find that simple architectural choices can outperform existing models in out-of-distribution generalization. Together, these results show that partitioning relational representations from other information streams may be a simple way to augment existing network architectures' robustness when performing out-of-distribution relational computations.

preprint2022arXiv

TIML: Task-Informed Meta-Learning for Agriculture

Labeled datasets for agriculture are extremely spatially imbalanced. When developing algorithms for data-sparse regions, a natural approach is to use transfer learning from data-rich regions. While standard transfer learning approaches typically leverage only direct inputs and outputs, geospatial imagery and agricultural data are rich in metadata that can inform transfer learning algorithms, such as the spatial coordinates of data-points or the class of task being learned. We build on previous work exploring the use of meta-learning for agricultural contexts in data-sparse regions and introduce task-informed meta-learning (TIML), an augmentation to model-agnostic meta-learning which takes advantage of task-specific metadata. We apply TIML to crop type classification and yield estimation, and find that TIML significantly improves performance compared to a range of benchmarks in both contexts, across a diversity of model architectures. While we focus on tasks from agriculture, TIML could offer benefits to any meta-learning setup with task-specific metadata, such as classification of geo-tagged images and species distribution modelling.

preprint2020arXiv

Reverse-Engineering Deep ReLU Networks

It has been widely assumed that a neural network cannot be recovered from its outputs, as the network depends on its parameters in a highly nonlinear way. Here, we prove that in fact it is often possible to identify the architecture, weights, and biases of an unknown deep ReLU network by observing only its output. Every ReLU network defines a piecewise linear function, where the boundaries between linear regions correspond to inputs for which some neuron in the network switches between inactive and active ReLU states. By dissecting the set of region boundaries into components associated with particular neurons, we show both theoretically and empirically that it is possible to recover the weights of neurons and their arrangement within the network, up to isomorphism.

preprint2015arXiv

Quantitative $(p,q)$ theorems in combinatorial geometry

We show quantitative versions of classic results in discrete geometry, where the size of a convex set is determined by some non-negative function. We give versions of this kind for the selection theorem of Bárány, the existence of weak epsilon-nets for convex sets and the $(p,q)$ theorem of Alon and Kleitman. These methods can be applied to functions such as the volume, surface area or number of points of a discrete set. We also give general quantitative versions of the colorful Helly theorem for continuous functions.

preprint2014arXiv

Acyclic Subgraphs of Planar Digraphs

An acyclic set in a digraph is a set of vertices that induces an acyclic subgraph. In 2011, Harutyunyan conjectured that every planar digraph on $n$ vertices without directed 2-cycles possesses an acyclic set of size at least $3n/5$. We prove this conjecture for digraphs where every directed cycle has length at least 8. More generally, if $g$ is the length of the shortest directed cycle, we show that there exists an acyclic set of size at least $(1 - 3/g)n$.

preprint2014arXiv

Gröbner Bases and Nullstellensätze for Graph-Coloring Ideals

We revisit a well-known family of polynomial ideals encoding the problem of graph-$k$-colorability. Our paper describes how the inherent combinatorial structure of the ideals implies several interesting algebraic properties. Specifically, we provide lower bounds on the difficulty of computing Gröbner bases and Nullstellensatz certificates for the coloring ideals of general graphs. For chordal graphs, however, we explicitly describe a Gröbner basis for the coloring ideal, and provide a polynomial-time algorithm.

preprint2014arXiv

On the classification of Stanley sequences

An integer sequence is said to be 3-free if no three elements form an arithmetic progression. Following the greedy algorithm, the Stanley sequence $S(a_0,a_1,\ldots,a_k)$ is defined to be the 3-free sequence $\{a_n\}$ having initial terms $a_0,a_1,\ldots,a_k$ and with each subsequent term $a_n>a_{n-1}$ chosen minimally such that the 3-free condition is not violated. Odlyzko and Stanley conjectured that Stanley sequences divide into two classes based on asymptotic growth patterns, with one class of highly structured sequences satisfying $a_n\approx Θ(n^{\log_2 3})$ and another class of seemingly chaotic sequences obeying $a_n=Θ(n^2/\log n)$. We propose a rigorous definition of regularity in Stanley sequences based on local structure rather than asymptotic behavior and show that our definition implies the corresponding asymptotic property proposed by Odlyzko and Stanley. We then construct many classes of regular Stanley sequences, which include as special cases all such sequences previously identified. We show how two regular sequences may be combined into another regular sequence, and how parts of a Stanley sequence may be translated while preserving regularity. Finally, we demonstrate that certain Stanley sequences possess proper subsets that are also Stanley sequences, a situation that appears previously to have been assumed impossible.

preprint2014arXiv

On the growth of Stanley sequences

A set is said to be \emph{3-free} if no three elements form an arithmetic progression. Given a 3-free set $A$ of integers $0=a_0<a_1<\cdots<a_t$, the \emph{Stanley sequence} $S(A)=\{a_n\}$ is defined using the greedy algorithm: For each successive $n>t$, we pick the smallest possible $a_n$ so that $\{a_0,a_1,\ldots,a_n\}$ is 3-free and increasing. Work by Odlyzko and Stanley indicates that Stanley sequences may be divided into two classes. Sequences of Type 1 are highly structured and satisfy $αn^{\log_2 3}/2\le a_n\le αn^{\log_2 3}$, for some constant $α$, while those of Type 2 are chaotic and satisfy $Θ(n^2/\log n)$. In this paper, we consider the possible values for $α$ in the growth of Type 1 Stanley sequences. Whereas Odlyzko and Stanley assumed $α=1$, we show that $α$ can be any rational number which is at least 1 and for which the denominator, in lowest terms, is a power of 3.

David Rolnick

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior

Dataset Difficulty and the Role of Inductive Bias

Bugs in the Data: How ImageNet Misrepresents Biodiversity

On Neural Architecture Inductive Biases for Relational Tasks

TIML: Task-Informed Meta-Learning for Agriculture

Reverse-Engineering Deep ReLU Networks

Quantitative $(p,q)$ theorems in combinatorial geometry

Acyclic Subgraphs of Planar Digraphs

Gröbner Bases and Nullstellensätze for Graph-Coloring Ideals

On the classification of Stanley sequences

On the growth of Stanley sequences