Source author record

Martin Burger

Martin Burger appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

53works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Multi-Dimensional Opinion Formation

In this paper we propose and investigate a multi-dimensional opinion dynamics model where people are characterised by both opinions and importance weights across these opinions. Opinion changes occur through binary interactions, with a novel coupling mechanism: the change in one topic depends on the weighted similarity across the full opinion vector. We state the kinetic equation for this process and derive its mean-field partial differential equation to describe the overall dynamics. Analytical computations and numerical simulations confirm that this model generates complex stationary states, and we demonstrate that the final opinion structures are critically determined by the peoples' opinion weights.

preprint2022arXiv

A Bregman Learning Framework for Sparse Neural Networks

We propose a learning framework based on stochastic Bregman iterations, also known as mirror descent, to train sparse neural networks with an inverse scale space approach. We derive a baseline algorithm called LinBreg, an accelerated version using momentum, and AdaBreg, which is a Bregmanized generalization of the Adam algorithm. In contrast to established methods for sparse training the proposed family of algorithms constitutes a regrowth strategy for neural networks that is solely optimization-based without additional heuristics. Our Bregman learning framework starts the training with very few initial parameters, successively adding only significant ones to obtain a sparse and expressive network. The proposed approach is extremely easy and efficient, yet supported by the rich mathematical theory of inverse scale space methods. We derive a statistically profound sparse parameter initialization strategy and provide a rigorous stochastic convergence analysis of the loss decay and additional convergence proofs in the convex regime. Using only 3.4% of the parameters of ResNet-18 we achieve 90.2% test accuracy on CIFAR-10, compared to 93.6% using the dense network. Our algorithm also unveils an autoencoder architecture for a denoising task. The proposed framework also has a huge potential for integrating sparse backpropagation and resource-friendly training.

preprint2022arXiv

On multi-species diffusion with size exclusion

We revisit a classical continuum model for the diffusion of multiple species with size-exclusion constraint, which leads to a degenerate nonlinear cross-diffusion system. The purpose of this article is twofold: first, it aims at a systematic study of the question of existence of weak solutions and their long-time asymptotic behaviour. Second, it provides a weak-strong stability estimate for a wide range of coefficients, which had been missing so far. In order to achieve the results mentioned above, we exploit the formal gradient-flow structure of the model with respect to a logarithmic entropy, which leads to best estimates in the full-interaction case, where all cross-diffusion coefficients are non-zero. Those are crucial to obtain the minimal Sobolev regularity needed for a weak-strong stability result. For meaningful cases when some of the coefficients vanish, we provide a novel existence result based on approximation by the full-interaction case.

preprint2022arXiv

Region-of-Interest Prioritised Sampling for Constrained Autonomous Exploration Systems

Goal oriented autonomous operation of space rovers has been known to increase scientific output of a mission. In this work we present an algorithm, called the RoI Prioritised Sampling (RPS), that prioritises Region-of-Interests (RoIs) in an exploration scenario in order to utilise the limited resources of the imaging instrument on the rover effectively. This prioritisation is based on an estimator that evaluates the change in information content at consecutive spatial scales of the RoIs without calculating the finer scale reconstruction. The estimator, called the Refinement Indicator (RI), is motivated and derived. Multi-scale acquisition approaches, based on classical and multilevel compressed sensing, with respect to the single pixel camera architecture are discussed. The performance of the algorithm is verified on remote sensing images and compared with the state-of-the-art multi-resolution reconstruction algorithms. At the considered sub-sampling rates the RPS is shown to better utilise the system resources for reconstructing the RoIs.

preprint2022arXiv

Well-posedness of an integro-differential model for active Brownian particles

We propose a general strategy for solving nonlinear integro-differential evolution problems with periodic boundary conditions, where no direct maximum/minimum principle is available. This is motivated by the study of recent macroscopic models for active Brownian particles with repulsive interactions, consisting of advection-diffusion processes in the space of particle position and orientation. We focus on one of such models, namely a semilinear parabolic equation with a nonlinear active drift term, whereby the velocity depends on the particle orientation and angle-independent overall particle density (leading to a nonlocal term by integrating out the angular variable). The main idea of the existence analysis is to exploit a-priori estimates from (approximate) entropy dissipation. The global existence and uniqueness of weak solutions is shown using a two-step Galerkin approximation with appropriate cutoff in order to obtain nonnegativity, an upper bound on the overall density and preserve a-priori estimates. Our anyalysis naturally includes the case of finite systems, corresponding to the case of a finite number of directions. The Duhamel principle is then used to obtain additional regularity of the solution, namely continuity in time-space. Motivated by the class of initial data relevant for the application, which includes perfectly aligned particles (same orientation), we extend the well-posedness result to very weak solutions allowing distributional initial data with low regularity.

preprint2021arXiv

Gradient Flows and Nonlinear Power Methods for the Computation of Nonlinear Eigenfunctions

This chapter describes how gradient flows and nonlinear power methods in Banach spaces can be used to solve nonlinear eigenvector-dependent eigenvalue problems, and how convergence of (discretized) approximations can be verified. We review several flows from literature, which were proposed to compute nonlinear eigenfunctions, and show that they all relate to normalized gradient flows. Furthermore, we show that the implicit Euler discretization of gradient flows gives rise to a nonlinear power method of the proximal operator and prove their convergence to nonlinear eigenfunctions. Finally, we prove that $Γ$-convergence of functionals implies convergence of their ground states, which is important for discrete approximations.

preprint2021arXiv

Identifying Untrustworthy Predictions in Neural Networks by Geometric Gradient Analysis

The susceptibility of deep neural networks to untrustworthy predictions, including out-of-distribution (OOD) data and adversarial examples, still prevent their widespread use in safety-critical applications. Most existing methods either require a re-training of a given model to achieve robust identification of adversarial attacks or are limited to out-of-distribution sample detection only. In this work, we propose a geometric gradient analysis (GGA) to improve the identification of untrustworthy predictions without retraining of a given model. GGA analyzes the geometry of the loss landscape of neural networks based on the saliency maps of their respective input. To motivate the proposed approach, we provide theoretical connections between gradients' geometrical properties and local minima of the loss function. Furthermore, we demonstrate that the proposed method outperforms prior approaches in detecting OOD data and adversarial attacks, including state-of-the-art and adaptive attacks.

preprint2020arXiv

Coarse graining of a Fokker-Planck equation with excluded volume effects preserving the gradient-flow structure

The propagation of gradient flow structures from microscopic to macroscopic models is a topic of high current interest. In this paper we discuss this propagation in a model for the diffusion of particles interacting via hard-core exclusion or short-range repulsive potentials. We formulate the microscopic model as a high-dimensional gradient flow in the Wasserstein metric for an appropriate free-energy functional. Then we use the JKO approach to identify the asymptotics of the metric and the free-energy functional beyond the lowest order for single particle densities in the limit of small particle volumes by matched asymptotic expansions. While we use a propagation of chaos assumption at far distances, we consider correlations at small distance in the expansion. In this way we obtain a clear picture of the emergence of a macroscopic gradient structure incorporating corrections in the free energy functional due to the volume exclusion.

preprint2020arXiv

Data assimilation in price formation

We consider the problem of estimating the density of buyers and vendors in a nonlinear parabolic price formation model using measurements of the price and the transaction rate. Our approach is based on a work by Puel et al., see \cite{Puel2002}, and results in a optimal control problem. We analyse this problems and provide stability estimates for the controls as well as the unknown density in the presence of measurement errors. Our analytic findings are supported with numerical experiments.

preprint2020arXiv

Delayed Blow-Up for Chemotaxis Models with Local Sensing

The aim of this paper is to analyze a model for chemotaxis based on a local sensing mechanism instead of the gradient sensing mechanism used in the celebrated minimal Keller-Segel model. The model we study has the same entropy as the minimal Keller-Segel model, but a different dynamics to minimize this entropy. Consequently, the conditions on the mass for the existence of stationary solutions or blow-up are the same, however we make the interesting observation that with the local sensing mechanism the blow-up in the case of supercritical mass is delayed to infinite time. Our observation is made rigorous from a mathematical point via a proof of global existence of weak solutions for arbitrary large masses and space dimension. The key difference of our model to the minimal Keller-Segel model is that the structure of the equation allows for a duality estimate that implies a bound on the $(H^1)'$-norm of the solutions, which can only grow with a square-root law in time. This additional $(H^1)'$-bound implies a lower bound on the entropy, which contrasts markedly with the minimal Keller-Segel model for which it is unbounded from below in the supercritical case. Besides, regularity and uniqueness of solutions are also studied.

preprint2020arXiv

Mean-field optimal control and optimality conditions in the space of probability measures

We derive a framework to compute optimal controls for problems with states in the space of probability measures. Since many optimal control problems constrained by a system of ordinary differential equations (ODE) modelling interacting particles converge to optimal control problems constrained by a partial differential equation (PDE) in the mean-field limit, it is interesting to have a calculus directly on the mesoscopic level of probability measures which allows us to derive the corresponding first-order optimality system. In addition to this new calculus, we provide relations for the resulting system to the first-order optimality system derived on the particle level, and the first-order optimality system based on $L^2$-calculus under additional regularity assumptions. We further justify the use of the $L^2$-adjoint in numerical simulations by establishing a link between the adjoint in the space of probability measures and the adjoint corresponding to $L^2$-calculus. Moreover, we prove a convergence rate for the convergence of the optimal controls corresponding to the particle formulation to the optimal controls of the mean-field problem as the number of particles tends to infinity.

preprint2020arXiv

Network structured kinetic models of social interactions

The aim of this paper is to study the derivation of appropriate meso- and macroscopic models for interactions as appearing in social processes. There are two main characteristics the models take into account, namely a network structure of interactions, which we treat by an appropriate mesoscopic description, and a different role of interacting agents. The latter differs from interactions treated in classical statistical mechanics in the sense that the agents do not have symmetric roles, but there is rather an active and a passive agent. We will demonstrate how a certain form of kinetic equations can be obtained to describe such interactions at a mesoscopic level and moreover obtain macroscopic models from monokinetics solutions of those. The derivation naturally leads to systems of nonlocal reaction-diffusion equations (or in a suitable limit local versions thereof), which can explain spatial phase separation phenomena found to emerge from the microscopic interactions. We will highlight the approach in three examples, namely the evolution and coarsening of dialects in human language, the construction of social norms, and the spread of an epidemic.

preprint2019arXiv

An entropic Landweber method for linear ill-posed problems

The aim of this paper is to investigate the use of an entropic projection method for the iterative regularization of linear ill-posed problems. We derive a closed form solution for the iterates and analyze their convergence behaviour both in a case of reconstructing general nonnegative unknowns as well as for the sake of recovering probability distributions. Moreover, we discuss several variants of the algorithm and relations to other methods in the literature. The effectiveness of the approach is studied numerically in several examples.

preprint2019arXiv

Instantaneous control of interacting particle systems in the mean-field limit

Controlling large particle systems in collective dynamics by a few agents is a subject of high practical importance, e.g., in evacuation dynamics. In this paper we study an instantaneous control approach to steer an interacting particle system into a certain spatial region by repulsive forces from a few external agents, which might be interpreted as shepherd dogs leading sheep to their home. We introduce an appropriate mathematical model and the corresponding optimization problem. In particular, we are interested in the interaction of numerous particles, which can be approximated by a mean-field equation. Due to the high-dimensional phase space this will require a tailored optimization strategy. The arising control problems are solved using adjoint information to compute the descent directions. Numerical results on the microscopic and the macroscopic level indicate the convergence of optimal controls and optimal states in the mean-field limit,i.e., for an increasing number of particles.

preprint2016arXiv

A Variational Model for Joint Motion Estimation and Image Reconstruction

The aim of this paper is to derive and analyze a variational model for the joint estimation of motion and reconstruction of image sequences, which is based on a time-continuous Eulerian motion model. The model can be set up in terms of the continuity equation or the brightness constancy equation. The analysis in this paper focuses on the latter for robust motion estimation on sequences of two-dimensional images. We rigorously prove the existence of a minimizer in a suitable function space setting. Moreover, we discuss the numerical solution of the model based on primal-dual algorithms and investigate several examples. Finally, the benefits of our model compared to existing techniques, such as sequential image reconstruction and motion estimation, are shown.

preprint2016arXiv

An optimization approach for well-targeted transcranial direct current stimulation

Transcranial direct current stimulation is a non-invasive brain stimulation technique which modifies neural excitability by providing weak currents through scalp electrodes. The aim of this study is to introduce and analyze a novel optimization method for safe and well-targeted multi-array tDCS. For optimization, we consider an optimal control problem for a Laplace equation with Neumann boundary conditions with control and point-wise gradient state constraints. We prove existence and residual and objective convergence results for the proposed methods and provide computer simulation results in a highly realistic six-compartment geometry-adapted hexahedral head model. For discretization of the proposed minimization problem the finite element method is employed and the existence of at least one minimizer to the discretized optimization problem is shown. For numerical solution of the corresponding discretized problem we employ the alternating direction method of multipliers and comprehensively examine the cortical current flow field with regard to focality, target intensity and orientation. The numerical results reveal that the optimized current flow fields show significantly higher focality and, in most cases, higher directional agreement to the target vector in comparison to standard bipolar electrode montages.

preprint2016arXiv

Balanced growth path solutions of a Boltzmann mean field game model for knowledge growth

In this paper we study balanced growth path solutions of a Boltzmann mean field game model proposed by Lucas et al [13] to model knowledge growth in an economy. Agents can either increase their knowledge level by exchanging ideas in learning events or by producing goods with the knowledge they already have. The existence of balanced growth path solutions implies exponential growth of the overall production in time. We proof existence of balanced growth path solutions if the initial distribution of individuals with respect to their knowledge level satisfies a Pareto-tail condition. Furthermore we give first insights into the existence of such solutions if in addition to production and knowledge exchange the knowledge level evolves by geometric Brownian motion.

preprint2016arXiv

Bregman Cost for Non-Gaussian Noise

One of the tasks of the Bayesian inverse problem is to find a good estimate based on the posterior probability density. The most common point estimators are the conditional mean (CM) and maximum a posteriori (MAP) estimates, which correspond to the mean and the mode of the posterior, respectively. From a theoretical point of view it has been argued that the MAP estimate is only in an asymptotic sense a Bayes estimator for the uniform cost function, while the CM estimate is a Bayes estimator for the means squared cost function. Recently, it has been proven that the MAP estimate is a proper Bayes estimator for the Bregman cost if the image is corrupted by Gaussian noise. In this work we extend this result to other noise models with log-concave likelihood density, by introducing two related Bregman cost functions for which the CM and the MAP estimates are proper Bayes estimators. Moreover, we also prove that the CM estimate outperforms the MAP estimate, when the error is measured in a certain Bregman distance, a result previously unknown also in the case of additive Gaussian noise.

preprint2016arXiv

Simultaneous Reconstruction and Segmentation for Dynamic SPECT Imaging

This work deals with the reconstruction of dynamic images that incorporate characteristic dynamics in certain subregions, as arising for the kinetics of many tracers in emission tomography (SPECT, PET). We make use of a basis function approach for the unknown tracer concentration by assuming that the region of interest can be divided into subregions with spatially constant concentration curves. Applying a regularized variational framework reminiscent of the Chan-Vese model for image segmentation we simultaneously reconstruct both the labelling functions of the subregions as well as the subconcentrations within each region. Our particular focus is on applications in SPECT with Poisson noise model, resulting in a Kullback-Leibler data fidelity in the variational approach. We present a detailed analysis of the proposed variational model and prove existence of minimizers as well as error estimates. The latter apply to a more general class of problems and generalize existing results in literature since we deal with a nonlinear forward operator and a nonquadratic data fidelity. A computational algorithm based on alternating minimization and splitting techniques is developed for the solution of the problem and tested on appropriately designed synthetic data sets. For those we compare the results to those of standard EM reconstructions and investigate the effects of Poisson noise in the data.

preprint2016arXiv

Spectral Decompositions using One-Homogeneous Functionals

This paper discusses the use of absolutely one-homogeneous regularization functionals in a variational, scale space, and inverse scale space setting to define a nonlinear spectral decomposition of input data. We present several theoretical results that explain the relation between the different definitions. Additionally, results on the orthogonality of the decomposition, a Parseval-type identity and the notion of generalized (nonlinear) eigenvectors closely link our nonlinear multiscale decompositions to the well-known linear filtering theory. Numerical results are used to illustrate our findings.

Martin Burger

What is connected

Connect this record

See the researcher in context

Building this map preview

53 published item(s)

Multi-Dimensional Opinion Formation

A Bregman Learning Framework for Sparse Neural Networks

On multi-species diffusion with size exclusion

Region-of-Interest Prioritised Sampling for Constrained Autonomous Exploration Systems

Well-posedness of an integro-differential model for active Brownian particles

Gradient Flows and Nonlinear Power Methods for the Computation of Nonlinear Eigenfunctions

Identifying Untrustworthy Predictions in Neural Networks by Geometric Gradient Analysis

Coarse graining of a Fokker-Planck equation with excluded volume effects preserving the gradient-flow structure

Data assimilation in price formation

Delayed Blow-Up for Chemotaxis Models with Local Sensing

Mean-field optimal control and optimality conditions in the space of probability measures

Network structured kinetic models of social interactions

An entropic Landweber method for linear ill-posed problems

Instantaneous control of interacting particle systems in the mean-field limit

A Variational Model for Joint Motion Estimation and Image Reconstruction

An optimization approach for well-targeted transcranial direct current stimulation

Balanced growth path solutions of a Boltzmann mean field game model for knowledge growth

Bregman Cost for Non-Gaussian Noise

Simultaneous Reconstruction and Segmentation for Dynamic SPECT Imaging

Spectral Decompositions using One-Homogeneous Functionals

A Nonlinear Variational Approach to Motion-Corrected Reconstruction of Density Images

Bregman Distances in Inverse Problems and Partial Differential Equation

Diffuse Interface Methods for Inverse Problems: Case Study for an Elliptic Cauchy Problem

Flow Characteristics in a Crowded Transport Model

Infimal convolution regularisation functionals of BV and $\mathrm{L}^{p}$ spaces. Part I: The finite $p$ case

Infimal Convolution Regularisation Functionals of BV and $\mathrm{L}^{p}$ Spaces. The Case p$=\infty$

Lane formation by side-stepping

Locally Sparse Reconstruction Using the $\ell^{1,\infty}$-Norm

Maximum a posteriori probability estimates in infinite-dimensional Bayesian inverse problems

Nonlinear Spectral Analysis via One-homogeneous Functionals - Overview and Future Prospects

On a Boltzmann mean field model for knowledge growth

On Optical Flow Models for Variational Motion Estimation

Regularization with Sparse Vector Fields: From Image Compression to TV-type Reconstruction

Second-order edge-penalization in the Ambrosio-Tortorelli functional

Spectral Representations of One-Homogeneous Functionals

A Stochastic Model for the Normal Tissue Complication Probability (NTCP) in Radiation Treatment of Cancer

Analysis of the Diffuse Domain Method for second order elliptic boundary value problems

Color Bregman TV

First order algorithms in variational image processing

Maximum-A-Posteriori Estimates in Linear Inverse Problems with Log-concave Priors are Proper Bayes Estimators

Total Variation Regularisation in Measurement and Image space for PET reconstruction

Towards Dynamic PET Reconstruction under Flow Conditions: Parameter Identification in a PDE Model

Mean field games with nonlinear mobilities in pedestrian dynamics

On a Boltzmann type price formation model

On the asymptotic behavior of a Boltzmann-type price formation model

Stationary States and Asymptotic Behaviour of Aggregation Models with Nonlinear Local Repulsion

A Framework for Automated Cell Tracking in Phase Contrast Microscopic Videos based on Normal Velocities

Convergence rates in $\mathbf{\ell^1}$-regularization if the sparsity assumption fails

Exact Relaxation for Classes of Minimization Problems with Binary Constraints

Ground States and Singular Vectors of Convex Variational Regularization Methods

Individual based and mean-field modelling of direct aggregation

Mathematical Modelling of Polarizing GTPases in Developing Axons

The Iteratively Regularized Gauß-Newton Method with Convex Constraints and Applications in 4Pi-Microscopy