Source author record

Jay Roberts

Jay Roberts appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computer Vision math.AP

Catalog footprint

What is connected

3works

3topics

3close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training

Deep neural networks are easily fooled by small perturbations known as adversarial attacks. Adversarial Training (AT) is a technique that approximately solves a robust optimization problem to minimize the worst-case loss and is widely regarded as the most effective defense. Due to the high computation time for generating strong adversarial examples in the AT process, single-step approaches have been proposed to reduce training time. However, these methods suffer from catastrophic overfitting where adversarial accuracy drops during training, and although improvements have been proposed, they increase training time and robustness is far from that of multi-step AT. We develop a theoretical framework for adversarial training with FW optimization (FW-AT) that reveals a geometric connection between the loss landscape and the $\ell_2$ distortion of $\ell_\infty$ FW attacks. We analytically show that high distortion of FW attacks is equivalent to small gradient variation along the attack path. It is then experimentally demonstrated on various deep neural network architectures that $\ell_\infty$ attacks against robust models achieve near maximal distortion, while standard networks have lower distortion. It is experimentally shown that catastrophic overfitting is strongly correlated with low distortion of FW attacks. This mathematical transparency differentiates FW from Projected Gradient Descent (PGD) optimization. To demonstrate the utility of our theoretical framework we develop FW-AT-Adapt, a novel adversarial training algorithm which uses a simple distortion measure to adapt the number of attack steps during training to increase efficiency without compromising robustness. FW-AT-Adapt provides training time on par with single-step fast AT methods and closes the gap between fast AT methods and multi-step PGD-AT with minimal loss in adversarial accuracy in white-box and black-box settings.

preprint2020arXiv

Affine motion of 2d incompressible fluids surrounded by vacuum and flows in ${\rm SL}(2,{\mathbb R})$

The affine motion of two-dimensional (2d) incompressible fluids surrounded by vacuum can be reduced to a completely integrable and globally solvable Hamiltonian system of ordinary differential equations for the deformation gradient in ${\rm SL}(2,{\mathbb R})$. In the case of perfect fluids, the motion is given by geodesic flow in ${\rm SL}(2,{\mathbb R})$ with the Euclidean metric, while for magnetically conducting fluids (MHD), the motion is governed by a harmonic oscillator in ${\rm SL}(2,{\mathbb R})$. A complete classification of the dynamics is given including rigid motions, rotating eddies with stable and unstable manifolds, and solutions with vanishing pressure. For perfect fluids, the displacement generically becomes unbounded, as $t\to\pm\infty$. For MHD, solutions are bounded and generically quasi-periodic and recurrent.

preprint2020arXiv

Second Order Optimization for Adversarial Robustness and Interpretability

Deep neural networks are easily fooled by small perturbations known as adversarial attacks. Adversarial Training (AT) is a technique aimed at learning features robust to such attacks and is widely regarded as a very effective defense. However, the computational cost of such training can be prohibitive as the network size and input dimensions grow. Inspired by the relationship between robustness and curvature, we propose a novel regularizer which incorporates first and second order information via a quadratic approximation to the adversarial loss. The worst case quadratic loss is approximated via an iterative scheme. It is shown that using only a single iteration in our regularizer achieves stronger robustness than prior gradient and curvature regularization schemes, avoids gradient obfuscation, and, with additional iterations, achieves strong robustness with significantly lower training time than AT. Further, it retains the interesting facet of AT that networks learn features which are well-aligned with human perception. We demonstrate experimentally that our method produces higher quality human-interpretable features than other geometric regularization techniques. These robust features are then used to provide human-friendly explanations to model predictions.

Jay Roberts

What is connected

Connect this record

See the researcher in context

Building this map preview

3 published item(s)

Understanding and Increasing Efficiency of Frank-Wolfe Adversarial Training

Affine motion of 2d incompressible fluids surrounded by vacuum and flows in ${\rm SL}(2,{\mathbb R})$

Second Order Optimization for Adversarial Robustness and Interpretability