Source author record

Steffen Dereich

Steffen Dereich appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR Machine Learning math.CO math.LO math.OC

Catalog footprint

What is connected

12works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures

We study gradient flows for loss landscapes of fully connected feedforward neural networks with commonly used continuously differentiable activation functions such as the logistic, hyperbolic tangent, softplus or GELU function. We prove that the gradient flow either converges to a critical point or diverges to infinity while the loss converges to an asymptotic critical value. Moreover, we prove the existence of a threshold $\varepsilon>0$ such that the loss value of any gradient flow initialized at most $\varepsilon$ above the optimal level converges to it. For polynomial target functions and sufficiently big architecture and data set, we prove that the optimal loss value is zero and can only be realized asymptotically. From this setting, we deduce our main result that any gradient flow with sufficiently good initialization diverges to infinity. Our proof heavily relies on the geometry of o-minimal structures. We confirm these theoretical findings with numerical experiments and extend our investigation to more realistic scenarios, where we observe an analogous behavior.

preprint2016arXiv

Multilevel Monte Carlo for Lévy-driven SDEs: Central limit theorems for adaptive Euler schemes

In this article, we consider multilevel Monte Carlo for the numerical computation of expectations for stochastic differential equations driven by Lévy processes. The underlying numerical schemes are based on jump-adapted Euler schemes. We prove stable convergence of an idealised scheme. Further, we deduce limit theorems for certain classes of functionals depending on the whole trajectory of the process. In particular, we allow dependence on marginals, integral averages and the supremum of the process. The idealised scheme is related to two practically implementable schemes and corresponding central limit theorems are given. In all cases, we obtain errors of order $N^{-1/2}(\log N)^{1/2}$ in the computational time $N$ which is the same order as obtained in the classical set-up analysed by Giles [Oper. Res. 56 (2008) 607-617]. Finally, we use the central limit theorems to optimise the parameters of the multilevel scheme.

preprint2015arXiv

Random Interlacements via Kuznetsov Measures

The aim of this note is to give an alternative construction of interlacements - as introduced by Sznitman - which makes use of classical probabilistic potential theory. In particular, we outline that the intensity measure of an interlacement is known in probabilistic potential theory under the name "approximate Markov chain" or "quasi-process". We provide a simple construction of random interlacements through (unconditioned) two-sided Brownian motions (resp. two-sided random walks) involving Mitro's general construction of Kuznetsov measures and a Palm measures relation due to Fitzsimmons. In particular, we show that random interlacement is a Poisson cloud (`soup') of two-sided random walks (or Brownian motions) started in Lebesgue measure and restricted on being closest to the origin at time between 0 and 1 - modulus time-shift.

preprint2015arXiv

Real Self-Similar Processes Started from the Origin

Since the seminal work of Lamperti there is a lot of interest in the understanding of the general structure of self-similar Markov processes. Lamperti gave a representation of positive self-similar Markov processes with initial condition strictly larger than 0 which subsequently was extended to zero initial condition. For real self-similar Markov processes (rssMps) there is a generalization of Lamperti's representation giving a one-to-one correspondence between Markov additive processes and rssMps with initial condition different from the origin. We develop fluctuation theory for Markov additive processes and use Kuznetsov measures to construct the law of transient real self-similar Markov processes issued from the origin. The construction gives a pathwise representation through two-sided Markov additive processes extending the Lamperti-Kiu representation to the origin.

preprint2013arXiv

Random networks with sublinear preferential attachment: The giant component

We study a dynamical random network model in which at every construction step a new vertex is introduced and attached to every existing vertex independently with a probability proportional to a concave function f of its current degree. We give a criterion for the existence of a giant component, which is both necessary and sufficient, and which becomes explicit when f is linear. Otherwise it allows the derivation of explicit necessary and sufficient conditions, which are often fairly close. We give an explicit criterion to decide whether the giant component is robust under random removal of edges. We also determine asymptotically the size of the giant component and the empirical distribution of component sizes in terms of the survival probability and size distribution of a multitype branching random walk associated with f.

preprint2012arXiv

Emergence of condensation in Kingman's model of selection and mutation

We describe the onset of condensation in the simple model for the balance between selection and mutation given by Kingman in terms of a scaling limit theorem. Loosely speaking, this shows that the wave moving towards genes of maximal fitness has the shape of a gamma distribution. We conjecture that this wave shape is a universal phenomenon that can also be found in a variety of more complex models, well beyond the genetics context, and provide some further evidence for this.

preprint2012arXiv

Persistence probabilities for an integrated random walk bridge

We prove that an integrated simple random walk, where random walk and integrated random walk are conditioned to return to zero, has asymptotic probability $n^{-1/2}$ to stay positive. This question is motivated by so-called random polymer models and proves a conjecture by Caravenna and Deuschel.

preprint2011arXiv

Constructive quantization: approximation by empirical measures

In this article, we study the approximation of a probability measure $μ$ on $\mathbb{R}^{d}$ by its empirical measure $\hatμ_{N}$ interpreted as a random quantization. As error criterion we consider an averaged $p$-th moment Wasserstein metric. In the case where $2p<d$, we establish refined upper and lower bounds for the error, a high-resolution formula. Moreover, we provide a universal estimate based on moments, a so-called Pierce type estimate. In particular, we show that quantization by empirical measures is of optimal order under weak assumptions.

preprint2011arXiv

Multilevel Monte Carlo algorithms for Lévy-driven SDEs with Gaussian correction

We introduce and analyze multilevel Monte Carlo algorithms for the computation of $\mathbb {E}f(Y)$, where $Y=(Y_t)_{t\in[0,1]}$ is the solution of a multidimensional Lévy-driven stochastic differential equation and $f$ is a real-valued function on the path space. The algorithm relies on approximations obtained by simulating large jumps of the Lévy process individually and applying a Gaussian approximation for the small jump part. Upper bounds are provided for the worst case error over the class of all measurable real functions $f$ that are Lipschitz continuous with respect to the supremum norm. These upper bounds are easily tractable once one knows the behavior of the Lévy measure around zero. In particular, one can derive upper bounds from the Blumenthal--Getoor index of the Lévy process. In the case where the Blumenthal--Getoor index is larger than one, this approach is superior to algorithms that do not apply a Gaussian approximation. If the Lévy process does not incorporate a Wiener process or if the Blumenthal--Getoor index $β$ is larger than $\frac{4}{3}$, then the upper bound is of order $τ^{-({4-β})/({6β})}$ when the runtime $τ$ tends to infinity. Whereas in the case, where $β$ is in $[1,\frac{4}{3}]$ and the Lévy process has a Gaussian component, we obtain bounds of order $τ^{-β/(6β-4)}$. In particular, the error is at most of order $τ^{-1/6}$.

preprint2011arXiv

Typical distances in ultrasmall random networks

We show that in preferential attachment models with power-law exponent $τ\in(2,3)$ the distance between randomly chosen vertices in the giant component is asymptotically equal to $(4+o(1))\, \frac{\log\log N}{-\log (τ-2)}$, where $N$ denotes the number of nodes. This is twice the value obtained for several types of configuration models with the same power-law exponent. The extra factor reveals the different structure of typical shortest paths in preferential attachment graphs.

preprint2010arXiv

The high resolution vector quantization problem with Orlicz norm distortion

We derive a high-resolution formula for the quantization problem under Orlicz norm distortion. In this setting, the optimal point density solves a variational problem which comprises a function $g:\mathbb{R}_+\to[0,\infty)$ characterizing the quantization complexity of the underlying Orlicz space. Moreover, asymptotically optimal codebooks induce a tight sequence of empirical measures. The set of possible accumulation points is characterized and in most cases it consists of a single element. In that case, we find convergence as in the classical setting.

preprint2010arXiv

Universality of the asymptotics of the one-sided exit problem for integrated processes

We consider the one-sided exit problem for (fractionally) integrated random walks and Lévy processes. We prove that the rate of decrease of the non-exit probability -- the so-called survival exponent -- is universal in this class of processes. In particular, the survival exponent can be inferred from the (fractionally) integrated Brownian motion. This, in particular, extends Sinai's result on the survival exponent for the integrated simple random walk to general random walks with some finite exponential moment. Further, we prove existence and monotonicity of the survival exponent of fractionally integrated processes. We show that this exponent is related to a constant appearing in the study of random polynomials.

Steffen Dereich

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

SAD Neural Networks: Divergent Gradient Flows and Asymptotic Optimality via o-minimal Structures

Multilevel Monte Carlo for Lévy-driven SDEs: Central limit theorems for adaptive Euler schemes

Random Interlacements via Kuznetsov Measures

Real Self-Similar Processes Started from the Origin

Random networks with sublinear preferential attachment: The giant component

Emergence of condensation in Kingman's model of selection and mutation

Persistence probabilities for an integrated random walk bridge

Constructive quantization: approximation by empirical measures

Multilevel Monte Carlo algorithms for Lévy-driven SDEs with Gaussian correction

Typical distances in ultrasmall random networks

The high resolution vector quantization problem with Orlicz norm distortion

Universality of the asymptotics of the one-sided exit problem for integrated processes