Source author record

Andrzej Banburski

Andrzej Banburski appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

gr-qc hep-th Machine Learning Computer Vision math-ph math.MP Artificial Intelligence hep-ex hep-ph nucl-ex

Catalog footprint

What is connected

9 works

10 topics

4 close collaborators

preprint, 2022, arXiv

Non-local Field Theory from Matrix Models

We show that a class of matrix theories can be understood as an extension of quantum field theory which has non-local interactions. This reformulation is based on the Wigner-Weyl transformation, and the interactions take the form of a Moyal product on a doubled geometry. We recover local dynamics on the spacetime as a low-energy limit. This framework opens up the possibility of studying novel high-energy phenomena, including the unification of gauge and geometric symmetries in a gauge theory.

preprint, 2020, arXiv

Biologically Inspired Mechanisms for Adversarial Robustness

A convolutional neural network strongly robust to adversarial perturbations at reasonable computational and performance cost has not yet been demonstrated. The primate visual ventral stream seems to be robust to small perturbations in visual stimuli but the underlying mechanisms that give rise to this robust perception are not understood. In this work, we investigate the role of two biologically plausible mechanisms in adversarial robustness. We demonstrate that the non-uniform sampling performed by the primate retina and the presence of multiple receptive fields with a range of receptive field sizes at each eccentricity improve the robustness of neural networks to small adversarial perturbations. We verify that these two mechanisms do not suffer from gradient obfuscation and study their contribution to adversarial robustness through ablation studies.

preprint, 2020, arXiv

Double descent in the condition number

In solving a system of $n$ linear equations in $d$ variables $Ax=b$, the condition number of the $n \times d$ matrix $A$ measures how much errors in the data $b$ affect the solution $x$. Estimates of this type are important in many inverse problems. An example is machine learning, where the key task is to estimate an underlying function from a set of measurements at random points in a high dimensional space and where low sensitivity to error in the data is a requirement for good predictive performance. Here we discuss the simple observation, which is known but surprisingly little quoted (see Theorem 4.2 in \cite{Brgisser:2013:CGN:2526261}): when the columns of $A$ are random vectors, the condition number of $A$ is highest if $d=n$, that is when the inverse of $A$ exists. An overdetermined system ($n>d$) as well as an underdetermined system ($n<d$), for which the pseudoinverse must be used instead of the inverse, typically have significantly better, that is lower, condition numbers. Thus the condition number of $A$ plotted as a function of $d$ shows a double descent behavior with a peak at $d=n$.
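The peak at $d=n$ described in this abstract can be checked numerically. The sketch below is not taken from the paper: it assumes i.i.d. standard Gaussian entries for $A$, fixes $n=50$, and measures the condition number as the ratio of the largest to the smallest singular value (the quantity relevant for the pseudoinverse). The helper names `condition_number` and `median_condition` are illustrative only.

```python
import numpy as np


def condition_number(A):
    """Ratio of largest to smallest singular value of a (possibly rectangular) matrix."""
    s = np.linalg.svd(A, compute_uv=False)  # singular values, sorted descending
    return s[0] / s[-1]


def median_condition(n, d, trials=20, seed=0):
    """Median condition number over several random n x d Gaussian matrices.

    Assumption: entries are i.i.d. standard normal. The median is used because
    the distribution at d == n is heavy-tailed.
    """
    rng = np.random.default_rng(seed)
    values = [condition_number(rng.standard_normal((n, d))) for _ in range(trials)]
    return float(np.median(values))


n = 50
# Sweep the number of variables d through the square case d == n.
curve = {d: median_condition(n, d) for d in (10, 25, 50, 100, 200)}

for d, kappa in curve.items():
    print(f"d = {d:4d}  median condition number = {kappa:.1f}")
```

Under these assumptions the printed values should rise towards $d=n$ and fall again on either side, which is the shape the abstract describes.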

preprint, 2020, arXiv

Theory III: Dynamics and Generalization in Deep Networks

The key to generalization is controlling the complexity of the network. However, there is no obvious control of complexity -- such as an explicit regularization term -- in the training of deep networks for classification. We will show that a classical form of norm control -- but kind of hidden -- is present in deep networks trained with gradient descent techniques on exponential-type losses. In particular, gradient descent induces a dynamics of the normalized weights which converges for $t \to \infty$ to an equilibrium which corresponds to a minimum norm (or maximum margin) solution. For sufficiently large but finite $\rho$ -- and thus finite $t$ -- the dynamics converges to one of several margin maximizers, with the margin monotonically increasing towards a limit stationary point of the flow. In the usual case of stochastic gradient descent, most of the stationary points are likely to be convex minima corresponding to a constrained minimizer -- the network with normalized weights -- which corresponds to vanishing regularization. The solution has zero generalization gap, for fixed architecture, asymptotically for $N \to \infty$, where $N$ is the number of training examples. Our approach extends some of the original results of Srebro from linear networks to deep networks and provides a new perspective on the implicit bias of gradient descent. We believe that the elusive complexity control we describe is responsible for the puzzling empirical finding of good predictive performance by deep networks, despite overparametrization.
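The implicit norm control mentioned in this abstract can be illustrated in the simplest setting, a linear classifier, which is the case the abstract says it extends. The sketch below is not the paper's deep-network analysis: it assumes a hand-picked, linearly separable four-point dataset, the exponential loss, and plain gradient descent, and it tracks the margin of the normalized weights $w/\|w\|$. The dataset, step size and function names are all illustrative assumptions.

```python
import numpy as np

# Toy separable dataset (an assumption for illustration): the maximum-margin
# direction is (1, 1) / sqrt(2), with margin sqrt(2).
X = np.array([[2.0, 0.0], [0.0, 2.0], [-2.0, 0.0], [0.0, -2.0]])
y = np.array([1.0, 1.0, -1.0, -1.0])


def normalized_margin(w):
    """Smallest signed distance of any training point to the separating hyperplane."""
    return np.min(y * (X @ w)) / np.linalg.norm(w)


def run_gd(w0, lr=0.1, steps=5000):
    """Gradient descent on the mean exponential loss exp(-y * <w, x>)."""
    w = np.array(w0, dtype=float)
    margins = [normalized_margin(w)]
    for _ in range(steps):
        losses = np.exp(-y * (X @ w))                      # per-example loss
        grad = -(X * (y * losses)[:, None]).mean(axis=0)   # gradient of the mean loss
        w -= lr * grad
        margins.append(normalized_margin(w))
    return w, margins


# Start from a direction that separates the data but with a poor margin.
w_final, margins = run_gd([1.0, 0.1])

print("initial normalized margin:", margins[0])
print("final normalized margin:  ", margins[-1])
print("maximum possible margin:  ", np.sqrt(2.0))
print("weight norm (keeps growing):", np.linalg.norm(w_final))
```

In this toy case the weight norm grows without bound while the direction $w/\|w\|$ drifts towards the maximum-margin separator, with no explicit regularization term in the loss.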

preprint, 2015, arXiv

A simpler way of imposing simplicity constraints

We investigate a way of imposing simplicity constraints in a holomorphic Spin Foam model that we recently introduced. Rather than imposing the constraints on the boundary spin network, as is usually done, one can impose the constraints directly on the Spin Foam propagator. We find that the two approaches have the same leading asymptotic behaviour, with differences appearing at higher order. This allows us to obtain a model that greatly simplifies calculations, but still has Regge Calculus as its semi-classical limit.

preprint, 2014, arXiv

Pachner moves in a 4d Riemannian holomorphic Spin Foam model

In this work we study a Spin Foam model for 4d Riemannian gravity, and propose a new way of imposing the simplicity constraints that uses the recently developed holomorphic representation. Using the power of the holomorphic integration techniques, and with the introduction of two new tools: the homogeneity map and the loop identity, for the first time we give the analytic expressions for the behaviour of the Spin Foam amplitudes under 4-dimensional Pachner moves. It turns out that this behaviour is controlled by an insertion of nonlocal mixing operators. In the case of the 5-1 move, the expression governing the change of the amplitude can be interpreted as a vertex renormalisation equation. We find a natural truncation scheme that allows us to get an invariance up to an overall factor for the 4-2 and 5-1 moves, but not for the 3-3 move. The study of the divergences shows that there is a range of parameter space for which the 4-2 move is finite while the 5-1 move diverges. This opens up the possibility to recover diffeomorphism invariance in the continuum limit of Spin Foam models for 4D Quantum Gravity.

preprint, 2014, arXiv

Snyder Momentum Space in Relative Locality

Standard approaches to the phenomenology of Quantum Gravity have usually explicitly violated Lorentz invariance, either in the dispersion relation or in the addition rule for momenta. We investigate whether it is possible in 3+1 dimensions to have a non-local deformation that fully preserves Lorentz invariance, as is the case in 2+1D Quantum Gravity. We answer this question in the affirmative and show for the first time how to construct a homogeneously curved momentum space preserving the full action of the Lorentz group in dimension 4 and higher, despite relaxing locality. We study the properties of this relative locality deformation and show that this space leads to a noncommutativity related to Snyder spacetime.

preprint, 2013, arXiv

Twisting loops and global momentum non-conservation in Relative Locality

Recent work in Relative Locality has shown that the theory allows for a solution with an on-shell causal loop. We show that the theory contains a different type of loop in which momenta are conserved locally, but there is no global momentum conservation. Thus a freely propagating particle can decay into two particles, which later recombine to give a particle with momentum and mass different from those of the original one.

preprint, 2012, arXiv

The Production and Discovery of True Muonium in Fixed-Target Experiments

Upcoming fixed-target experiments designed to search for new sub-GeV forces will also have sensitivity to the never before observed True Muonium atom, a bound state of a muon and anti-muon. We describe the production and decay characteristics of True Muonium relevant to these experiments. Importantly, we find that secondary production mechanisms dominate over primary production for the long-lived 2S and 2P states, leading to total yields an order of magnitude larger than naive estimates previously suggested. We present yield estimates for True Muonium as a function of energy fraction and decay length, useful for guiding future experimental studies. Discovery and measurement prospects appear very favorable.
