Researcher profile

Martin Burger

Martin Burger contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2026arXiv

Multi-Dimensional Opinion Formation

In this paper we propose and investigate a multi-dimensional opinion dynamics model where people are characterised by both opinions and importance weights across these opinions. Opinion changes occur through binary interactions, with a novel coupling mechanism: the change in one topic depends on the weighted similarity across the full opinion vector. We state the kinetic equation for this process and derive its mean-field partial differential equation to describe the overall dynamics. Analytical computations and numerical simulations confirm that this model generates complex stationary states, and we demonstrate that the final opinion structures are critically determined by the peoples' opinion weights.

preprint2022arXiv

A Bregman Learning Framework for Sparse Neural Networks

We propose a learning framework based on stochastic Bregman iterations, also known as mirror descent, to train sparse neural networks with an inverse scale space approach. We derive a baseline algorithm called LinBreg, an accelerated version using momentum, and AdaBreg, which is a Bregmanized generalization of the Adam algorithm. In contrast to established methods for sparse training the proposed family of algorithms constitutes a regrowth strategy for neural networks that is solely optimization-based without additional heuristics. Our Bregman learning framework starts the training with very few initial parameters, successively adding only significant ones to obtain a sparse and expressive network. The proposed approach is extremely easy and efficient, yet supported by the rich mathematical theory of inverse scale space methods. We derive a statistically profound sparse parameter initialization strategy and provide a rigorous stochastic convergence analysis of the loss decay and additional convergence proofs in the convex regime. Using only 3.4% of the parameters of ResNet-18 we achieve 90.2% test accuracy on CIFAR-10, compared to 93.6% using the dense network. Our algorithm also unveils an autoencoder architecture for a denoising task. The proposed framework also has a huge potential for integrating sparse backpropagation and resource-friendly training.

preprint2022arXiv

On multi-species diffusion with size exclusion

We revisit a classical continuum model for the diffusion of multiple species with size-exclusion constraint, which leads to a degenerate nonlinear cross-diffusion system. The purpose of this article is twofold: first, it aims at a systematic study of the question of existence of weak solutions and their long-time asymptotic behaviour. Second, it provides a weak-strong stability estimate for a wide range of coefficients, which had been missing so far. In order to achieve the results mentioned above, we exploit the formal gradient-flow structure of the model with respect to a logarithmic entropy, which leads to best estimates in the full-interaction case, where all cross-diffusion coefficients are non-zero. Those are crucial to obtain the minimal Sobolev regularity needed for a weak-strong stability result. For meaningful cases when some of the coefficients vanish, we provide a novel existence result based on approximation by the full-interaction case.

preprint2022arXiv

Region-of-Interest Prioritised Sampling for Constrained Autonomous Exploration Systems

Goal oriented autonomous operation of space rovers has been known to increase scientific output of a mission. In this work we present an algorithm, called the RoI Prioritised Sampling (RPS), that prioritises Region-of-Interests (RoIs) in an exploration scenario in order to utilise the limited resources of the imaging instrument on the rover effectively. This prioritisation is based on an estimator that evaluates the change in information content at consecutive spatial scales of the RoIs without calculating the finer scale reconstruction. The estimator, called the Refinement Indicator (RI), is motivated and derived. Multi-scale acquisition approaches, based on classical and multilevel compressed sensing, with respect to the single pixel camera architecture are discussed. The performance of the algorithm is verified on remote sensing images and compared with the state-of-the-art multi-resolution reconstruction algorithms. At the considered sub-sampling rates the RPS is shown to better utilise the system resources for reconstructing the RoIs.

preprint2022arXiv

Well-posedness of an integro-differential model for active Brownian particles

We propose a general strategy for solving nonlinear integro-differential evolution problems with periodic boundary conditions, where no direct maximum/minimum principle is available. This is motivated by the study of recent macroscopic models for active Brownian particles with repulsive interactions, consisting of advection-diffusion processes in the space of particle position and orientation. We focus on one of such models, namely a semilinear parabolic equation with a nonlinear active drift term, whereby the velocity depends on the particle orientation and angle-independent overall particle density (leading to a nonlocal term by integrating out the angular variable). The main idea of the existence analysis is to exploit a-priori estimates from (approximate) entropy dissipation. The global existence and uniqueness of weak solutions is shown using a two-step Galerkin approximation with appropriate cutoff in order to obtain nonnegativity, an upper bound on the overall density and preserve a-priori estimates. Our anyalysis naturally includes the case of finite systems, corresponding to the case of a finite number of directions. The Duhamel principle is then used to obtain additional regularity of the solution, namely continuity in time-space. Motivated by the class of initial data relevant for the application, which includes perfectly aligned particles (same orientation), we extend the well-posedness result to very weak solutions allowing distributional initial data with low regularity.

preprint2021arXiv

Gradient Flows and Nonlinear Power Methods for the Computation of Nonlinear Eigenfunctions

This chapter describes how gradient flows and nonlinear power methods in Banach spaces can be used to solve nonlinear eigenvector-dependent eigenvalue problems, and how convergence of (discretized) approximations can be verified. We review several flows from literature, which were proposed to compute nonlinear eigenfunctions, and show that they all relate to normalized gradient flows. Furthermore, we show that the implicit Euler discretization of gradient flows gives rise to a nonlinear power method of the proximal operator and prove their convergence to nonlinear eigenfunctions. Finally, we prove that $Γ$-convergence of functionals implies convergence of their ground states, which is important for discrete approximations.

preprint2021arXiv

Identifying Untrustworthy Predictions in Neural Networks by Geometric Gradient Analysis

The susceptibility of deep neural networks to untrustworthy predictions, including out-of-distribution (OOD) data and adversarial examples, still prevent their widespread use in safety-critical applications. Most existing methods either require a re-training of a given model to achieve robust identification of adversarial attacks or are limited to out-of-distribution sample detection only. In this work, we propose a geometric gradient analysis (GGA) to improve the identification of untrustworthy predictions without retraining of a given model. GGA analyzes the geometry of the loss landscape of neural networks based on the saliency maps of their respective input. To motivate the proposed approach, we provide theoretical connections between gradients' geometrical properties and local minima of the loss function. Furthermore, we demonstrate that the proposed method outperforms prior approaches in detecting OOD data and adversarial attacks, including state-of-the-art and adaptive attacks.

preprint2020arXiv

Coarse graining of a Fokker-Planck equation with excluded volume effects preserving the gradient-flow structure

The propagation of gradient flow structures from microscopic to macroscopic models is a topic of high current interest. In this paper we discuss this propagation in a model for the diffusion of particles interacting via hard-core exclusion or short-range repulsive potentials. We formulate the microscopic model as a high-dimensional gradient flow in the Wasserstein metric for an appropriate free-energy functional. Then we use the JKO approach to identify the asymptotics of the metric and the free-energy functional beyond the lowest order for single particle densities in the limit of small particle volumes by matched asymptotic expansions. While we use a propagation of chaos assumption at far distances, we consider correlations at small distance in the expansion. In this way we obtain a clear picture of the emergence of a macroscopic gradient structure incorporating corrections in the free energy functional due to the volume exclusion.

preprint2020arXiv

Data assimilation in price formation

We consider the problem of estimating the density of buyers and vendors in a nonlinear parabolic price formation model using measurements of the price and the transaction rate. Our approach is based on a work by Puel et al., see \cite{Puel2002}, and results in a optimal control problem. We analyse this problems and provide stability estimates for the controls as well as the unknown density in the presence of measurement errors. Our analytic findings are supported with numerical experiments.

preprint2020arXiv

Delayed Blow-Up for Chemotaxis Models with Local Sensing

The aim of this paper is to analyze a model for chemotaxis based on a local sensing mechanism instead of the gradient sensing mechanism used in the celebrated minimal Keller-Segel model. The model we study has the same entropy as the minimal Keller-Segel model, but a different dynamics to minimize this entropy. Consequently, the conditions on the mass for the existence of stationary solutions or blow-up are the same, however we make the interesting observation that with the local sensing mechanism the blow-up in the case of supercritical mass is delayed to infinite time. Our observation is made rigorous from a mathematical point via a proof of global existence of weak solutions for arbitrary large masses and space dimension. The key difference of our model to the minimal Keller-Segel model is that the structure of the equation allows for a duality estimate that implies a bound on the $(H^1)'$-norm of the solutions, which can only grow with a square-root law in time. This additional $(H^1)'$-bound implies a lower bound on the entropy, which contrasts markedly with the minimal Keller-Segel model for which it is unbounded from below in the supercritical case. Besides, regularity and uniqueness of solutions are also studied.

preprint2020arXiv

Mean-field optimal control and optimality conditions in the space of probability measures

We derive a framework to compute optimal controls for problems with states in the space of probability measures. Since many optimal control problems constrained by a system of ordinary differential equations (ODE) modelling interacting particles converge to optimal control problems constrained by a partial differential equation (PDE) in the mean-field limit, it is interesting to have a calculus directly on the mesoscopic level of probability measures which allows us to derive the corresponding first-order optimality system. In addition to this new calculus, we provide relations for the resulting system to the first-order optimality system derived on the particle level, and the first-order optimality system based on $L^2$-calculus under additional regularity assumptions. We further justify the use of the $L^2$-adjoint in numerical simulations by establishing a link between the adjoint in the space of probability measures and the adjoint corresponding to $L^2$-calculus. Moreover, we prove a convergence rate for the convergence of the optimal controls corresponding to the particle formulation to the optimal controls of the mean-field problem as the number of particles tends to infinity.

preprint2020arXiv

Network structured kinetic models of social interactions

The aim of this paper is to study the derivation of appropriate meso- and macroscopic models for interactions as appearing in social processes. There are two main characteristics the models take into account, namely a network structure of interactions, which we treat by an appropriate mesoscopic description, and a different role of interacting agents. The latter differs from interactions treated in classical statistical mechanics in the sense that the agents do not have symmetric roles, but there is rather an active and a passive agent. We will demonstrate how a certain form of kinetic equations can be obtained to describe such interactions at a mesoscopic level and moreover obtain macroscopic models from monokinetics solutions of those. The derivation naturally leads to systems of nonlocal reaction-diffusion equations (or in a suitable limit local versions thereof), which can explain spatial phase separation phenomena found to emerge from the microscopic interactions. We will highlight the approach in three examples, namely the evolution and coarsening of dialects in human language, the construction of social norms, and the spread of an epidemic.

preprint2019arXiv

An entropic Landweber method for linear ill-posed problems

The aim of this paper is to investigate the use of an entropic projection method for the iterative regularization of linear ill-posed problems. We derive a closed form solution for the iterates and analyze their convergence behaviour both in a case of reconstructing general nonnegative unknowns as well as for the sake of recovering probability distributions. Moreover, we discuss several variants of the algorithm and relations to other methods in the literature. The effectiveness of the approach is studied numerically in several examples.

preprint2019arXiv

Instantaneous control of interacting particle systems in the mean-field limit

Controlling large particle systems in collective dynamics by a few agents is a subject of high practical importance, e.g., in evacuation dynamics. In this paper we study an instantaneous control approach to steer an interacting particle system into a certain spatial region by repulsive forces from a few external agents, which might be interpreted as shepherd dogs leading sheep to their home. We introduce an appropriate mathematical model and the corresponding optimization problem. In particular, we are interested in the interaction of numerous particles, which can be approximated by a mean-field equation. Due to the high-dimensional phase space this will require a tailored optimization strategy. The arising control problems are solved using adjoint information to compute the descent directions. Numerical results on the microscopic and the macroscopic level indicate the convergence of optimal controls and optimal states in the mean-field limit,i.e., for an increasing number of particles.