Source author record

Pan Kessel

Pan Kessel appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning hep-th Artificial Intelligence cond-mat.stat-mech hep-lat physics.comp-ph

Catalog footprint

What is connected

9works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Diffeomorphic Counterfactuals with Generative Models

Counterfactuals can explain classification decisions of neural networks in a human interpretable way. We propose a simple but effective method to generate such counterfactuals. More specifically, we perform a suitable diffeomorphic coordinate transformation and then perform gradient ascent in these coordinates to find counterfactuals which are classified with great confidence as a specified target class. We propose two methods to leverage generative models to construct such suitable coordinate systems that are either exactly or approximately diffeomorphic. We analyze the generation process theoretically using Riemannian differential geometry and validate the quality of the generated counterfactuals using various qualitative and quantitative measures.

preprint2022arXiv

Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows

We propose an algorithm to estimate the path-gradient of both the reverse and forward Kullback-Leibler divergence for an arbitrary manifestly invertible normalizing flow. The resulting path-gradient estimators are straightforward to implement, have lower variance, and lead not only to faster convergence of training but also to better overall approximation results compared to standard total gradient estimators. We also demonstrate that path-gradient training is less susceptible to mode-collapse. In light of our results, we expect that path-gradient estimators will become the new standard method to train normalizing flows for variational inference.

preprint2022arXiv

Path-Gradient Estimators for Continuous Normalizing Flows

Recent work has established a path-gradient estimator for simple variational Gaussian distributions and has argued that the path-gradient is particularly beneficial in the regime in which the variational distribution approaches the exact target distribution. In many applications, this regime can however not be reached by a simple Gaussian variational distribution. In this work, we overcome this crucial limitation by proposing a path-gradient estimator for the considerably more expressive variational family of continuous normalizing flows. We outline an efficient algorithm to calculate this estimator and establish its superior performance empirically.

preprint2021arXiv

Estimation of Thermodynamic Observables in Lattice Field Theories with Deep Generative Models

In this work, we demonstrate that applying deep generative machine learning models for lattice field theory is a promising route for solving problems where Markov Chain Monte Carlo (MCMC) methods are problematic. More specifically, we show that generative models can be used to estimate the absolute value of the free energy, which is in contrast to existing MCMC-based methods which are limited to only estimate free energy differences. We demonstrate the effectiveness of the proposed method for two-dimensional $ϕ^4$ theory and compare it to MCMC-based methods in detailed numerical experiments.

preprint2020arXiv

Asymptotically unbiased estimation of physical observables with neural samplers

We propose a general framework for the estimation of observables with generative neural samplers focusing on modern deep generative neural networks that provide an exact sampling probability. In this framework, we present asymptotically unbiased estimators for generic observables, including those that explicitly depend on the partition function such as free energy or entropy, and derive corresponding variance estimators. We demonstrate their practical applicability by numerical experiments for the 2d Ising model which highlight the superiority over existing methods. Our approach greatly enhances the applicability of generative neural samplers to real-world physical systems.

preprint2020arXiv

Fairwashing Explanations with Off-Manifold Detergent

Explanation methods promise to make black-box classifiers more transparent. As a result, it is hoped that they can act as proof for a sensible, fair and trustworthy decision-making process of the algorithm and thereby increase its acceptance by the end-users. In this paper, we show both theoretically and experimentally that these hopes are presently unfounded. Specifically, we show that, for any classifier $g$, one can always construct another classifier $\tilde{g}$ which has the same behavior on the data (same train, validation, and test error) but has arbitrarily manipulated explanation maps. We derive this statement theoretically using differential geometry and demonstrate it experimentally for various explanation methods, architectures, and datasets. Motivated by our theoretical insights, we then propose a modification of existing explanation methods which makes them significantly more robust.

preprint2015arXiv

Higher Spin Interactions in Four Dimensions: Vasiliev vs. Fronsdal

We consider four-dimensional Higher-Spin Theory at the first nontrivial order corresponding to the cubic action. All Higher-Spin interaction vertices are explicitly obtained from Vasiliev's equations. In particular, we obtain the vertices that are not determined solely by the Higher-Spin algebra structure constants. The dictionary between the Fronsdal fields and Higher-Spin connections is found and the corrections to the Fronsdal equations are derived. These corrections turn out to involve derivatives of arbitrary order. We observe that the vertices not determined by the Higher-Spin algebra produce naked infinities, when decomposed into the minimal derivative vertices and improvements. Therefore, standard methods can only be used to check a rather limited number of correlation functions within the HS AdS/CFT duality. A possible resolution of the puzzle is discussed.

preprint2015arXiv

Higher Spins and Matter Interacting in Dimension Three

The spectrum of Prokushkin--Vasiliev Theory is puzzling in light of the Gaberdiel--Gopakumar conjecture because it generically contains an additional sector besides higher-spin gauge and scalar fields. We find the unique truncation of the theory avoiding this problem to order 2 in perturbations around AdS$_3$. The second-order backreaction on the physical gauge sector induced by the scalars is computed explicitly. The cubic action for the physical fields is determined completely. We comment on a different higher-spin theory without such additional fields at $λ=1$.

preprint2014arXiv

Metric- and frame-like higher-spin gauge theories in three dimensions

We study the relation between the frame-like and metric-like formulation of higher-spin gauge theories in three space-time dimensions. We concentrate on the theory that is described by an SL(3) x SL(3) Chern-Simons theory in the frame-like formulation. The metric-like theory is obtained by eliminating the generalised spin connection by its equation of motion, and by expressing everything in terms of the metric and a spin-3 Fronsdal field. We give an exact map between fields and gauge parameters in both formulations. To work out the gauge transformations explicitly in terms of metric-like variables, we have to make a perturbative expansion in the spin-3 field. We describe an algorithm how to do this systematically, and we work out the gauge transformations to cubic order in the spin-3 field. We use these results to determine the gauge algebra to this order, and explain why the commutator of two spin-3 transformations only closes on-shell.

Pan Kessel

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Diffeomorphic Counterfactuals with Generative Models

Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows

Path-Gradient Estimators for Continuous Normalizing Flows

Estimation of Thermodynamic Observables in Lattice Field Theories with Deep Generative Models

Asymptotically unbiased estimation of physical observables with neural samplers

Fairwashing Explanations with Off-Manifold Detergent

Higher Spin Interactions in Four Dimensions: Vasiliev vs. Fronsdal

Higher Spins and Matter Interacting in Dimension Three

Metric- and frame-like higher-spin gauge theories in three dimensions