Source author record

Harald Oberhauser

Harald Oberhauser appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR Machine Learning math.AP math.ST Statistics Theory math.FA math.NA Numerical Analysis Computational Engineering, Finance, and Science Discrete Mathematics math.AT Methodology

Catalog footprint

What is connected

21works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Grid-Free Computation of Probabilistic Safety with Malliavin Calculus

This work concerns continuous-time, continuous-space stochastic dynamical systems described by stochastic differential equations (SDE). It presents a new approach to compute probabilistic safety regions, namely sets of initial conditions of the SDE associated to trajectories that are safe with a probability larger than a given threshold. The approach introduces a functional that is minimised at the border of the probabilistic safety region, then solves an optimisation problem using techniques from Malliavin Calculus, which computes such region. Unlike existing results in the literature, the new approach allows one to compute probabilistic safety regions without gridding the state space of the SDE.

preprint2023arXiv

Nonlinear Independent Component Analysis for Discrete-Time and Continuous-Time Signals

We study the classical problem of recovering a multidimensional source signal from observations of nonlinear mixtures of this signal. We show that this recovery is possible (up to a permutation and monotone scaling of the source's original component signals) if the mixture is due to a sufficiently differentiable and invertible but otherwise arbitrarily nonlinear function and the component signals of the source are statistically independent with 'non-degenerate' second-order statistics. The latter assumption requires the source signal to meet one of three regularity conditions which essentially ensure that the source is sufficiently far away from the non-recoverable extremes of being deterministic or constant in time. These assumptions, which cover many popular time series models and stochastic processes, allow us to reformulate the initial problem of nonlinear blind source separation as a simple-to-state problem of optimisation-based function approximation. We propose to solve this approximation problem by minimizing a novel type of objective function that efficiently quantifies the mutual statistical dependence between multiple stochastic processes via cumulant-like statistics. This yields a scalable and direct new method for nonlinear Independent Component Analysis with widely applicable theoretical guarantees and for which our experiments indicate good performance.

preprint2022arXiv

A Topological Approach to Mapping Space Signatures

A common approach for describing classes of functions and probability measures on a topological space $\mathcal{X}$ is to construct a suitable map $Φ$ from $\mathcal{X}$ into a vector space, where linear methods can be applied to address both problems. The case where $\mathcal{X}$ is a space of paths $[0,1] \to \mathbb{R}^n$ and $Φ$ is the path signature map has received much attention in stochastic analysis and related fields. In this article we develop a generalized $Φ$ for the case where $\mathcal{X}$ is a space of maps $[0,1]^d \to \mathbb{R}^n$ for any $d \in \mathbb{N}$, and show that the map $Φ$ generalizes many of the desirable algebraic and analytic properties of the path signature to $d \ge 2$. The key ingredient to our approach is topological; in particular, our starting point is a generalisation of K-T Chen's path space cochain construction to the setting of cubical mapping spaces.

preprint2022arXiv

Capturing Graphs with Hypo-Elliptic Diffusions

Convolutional layers within graph neural networks operate by aggregating information about local neighbourhood structures; one common way to encode such substructures is through random walks. The distribution of these random walks evolves according to a diffusion equation defined using the graph Laplacian. We extend this approach by leveraging classic mathematical results about hypo-elliptic diffusions. This results in a novel tensor-valued graph operator, which we call the hypo-elliptic graph Laplacian. We provide theoretical guarantees and efficient low-rank approximation algorithms. In particular, this gives a structured approach to capture long-range dependencies on graphs that is robust to pooling. Besides the attractive theoretical properties, our experiments show that this method competes with graph transformers on datasets requiring long-range reasoning but scales only linearly in the number of edges as opposed to quadratically in nodes.

preprint2022arXiv

Signature moments to characterize laws of stochastic processes

The sequence of moments of a vector-valued random variable can characterize its law. We study the analogous problem for path-valued random variables, that is stochastic processes, by using so-called robust signature moments. This allows us to derive a metric of maximum mean discrepancy type for laws of stochastic processes and study the topology it induces on the space of laws of stochastic processes. This metric can be kernelized using the signature kernel which allows to efficiently compute it. As an application, we provide a non-parametric two-sample hypothesis test for laws of stochastic processes.

preprint2021arXiv

Estimating the probability that a given vector is in the convex hull of a random sample

For a $d$-dimensional random vector $X$, let $p_{n, X}(θ)$ be the probability that the convex hull of $n$ independent copies of $X$ contains a given point $θ$. We provide several sharp inequalities regarding $p_{n, X}(θ)$ and $N_X(θ)$ denoting the smallest $n$ for which $p_{n, X}(θ)\ge1/2$. As a main result, we derive the totally general inequality $1/2 \le α_X(θ)N_X(θ)\le 3d + 1$, where $α_X(θ)$ (a.k.a. the Tukey depth) is the minimum probability that $X$ is in a fixed closed halfspace containing the point $θ$. We also show several applications of our general results: one is a moment-based bound on $N_X(\mathbb{E}[X])$, which is an important quantity in randomized approaches to cubature construction or measure reduction problem. Another application is the determination of the canonical convex body included in a random convex polytope given by independent copies of $X$, where our combinatorial approach allows us to generalize existing results in random matrix community significantly.

preprint2021arXiv

The shifted ODE method for underdamped Langevin MCMC

In this paper, we consider the underdamped Langevin diffusion (ULD) and propose a numerical approximation using its associated ordinary differential equation (ODE). When used as a Markov Chain Monte Carlo (MCMC) algorithm, we show that the ODE approximation achieves a $2$-Wasserstein error of $\varepsilon$ in $\mathcal{O}\big(d^{\frac{1}{3}}/\varepsilon^{\frac{2}{3}}\big)$ steps under the standard smoothness and strong convexity assumptions on the target distribution. This matches the complexity of the randomized midpoint method proposed by Shen and Lee [NeurIPS 2019] which was shown to be order optimal by Cao, Lu and Wang. However, the main feature of the proposed numerical method is that it can utilize additional smoothness of the target log-density $f$. More concretely, we show that the ODE approximation achieves a $2$-Wasserstein error of $\varepsilon$ in $\mathcal{O}\big(d^{\frac{2}{5}}/\varepsilon^{\frac{2}{5}}\big)$ and $\mathcal{O}\big(\sqrt{d}/\varepsilon^{\frac{1}{3}}\big)$ steps when Lipschitz continuity is assumed for the Hessian and third derivative of $f$. By discretizing this ODE using a third order Runge-Kutta method, we can obtain a practical MCMC method that uses just two additional gradient evaluations per step. In our experiment, where the target comes from a logistic regression, this method shows faster convergence compared to other unadjusted Langevin MCMC algorithms.

preprint2020arXiv

An optimal polynomial approximation of Brownian motion

In this paper, we will present a strong (or pathwise) approximation of standard Brownian motion by a class of orthogonal polynomials. The coefficients that are obtained from the expansion of Brownian motion in this polynomial basis are independent Gaussian random variables. Therefore it is practical (requires $N$ independent Gaussian coefficients) to generate an approximate sample path of Brownian motion that respects integration of polynomials with degree less than $N$. Moreover, since these orthogonal polynomials appear naturally as eigenfunctions of an integral operator defined by the Brownian bridge covariance function, the proposed approximation is optimal in a certain weighted $L^{2}(\mathbb{P})$ sense. In addition, discretizing Brownian paths as piecewise parabolas gives a locally higher order numerical method for stochastic differential equations (SDEs) when compared to the standard piecewise linear approach. We shall demonstrate these ideas by simulating Inhomogeneous Geometric Brownian Motion (IGBM). This numerical example will also illustrate the deficiencies of the piecewise parabola approximation when compared to a new version of the asymptotically efficient log-ODE (or Castell-Gaines) method.

preprint2020arXiv

Bayesian Learning from Sequential Data using Gaussian Processes with Signature Covariances

We develop a Bayesian approach to learning from sequential data by using Gaussian processes (GPs) with so-called signature kernels as covariance functions. This allows to make sequences of different length comparable and to rely on strong theoretical results from stochastic analysis. Signatures capture sequential structure with tensors that can scale unfavourably in sequence length and state space dimension. To deal with this, we introduce a sparse variational approach with inducing tensors. We then combine the resulting GP with LSTMs and GRUs to build larger models that leverage the strengths of each of these approaches and benchmark the resulting GPs on multivariate time series (TS) classification datasets. Code available at https://github.com/tgcsaba/GPSig.

preprint2020arXiv

Signature Cumulants, Ordered Partitions, and Independence of Stochastic Processes

The sequence of so-called signature moments describes the laws of many stochastic processes in analogy with how the sequence of moments describes the laws of vector-valued random variables. However, even for vector-valued random variables, the sequence of cumulants is much better suited for many tasks than the sequence of moments. This motivates us to study so-called signature cumulants. To do so, we develop an elementary combinatorial approach and show that in the same way that cumulants relate to the lattice of partitions, signature cumulants relate to the lattice of so-called "ordered partitions". We use this to give a new characterisation of independence of multivariate stochastic processes; finally we construct a family of unbiased minimum-variance estimators of signature cumulants.

preprint2016arXiv

Kernels for sequentially ordered data

We present a novel framework for kernel learning with sequential data of any kind, such as time series, sequences of graphs, or strings. Our approach is based on signature features which can be seen as an ordered variant of sample (cross-)moments; it allows to obtain a "sequentialized" version of any static kernel. The sequential kernels are efficiently computable for discrete sequences and are shown to approximate a continuous moment form in a sampling sense. A number of known kernels for sequences arise as "sequentializations" of suitable static kernels: string kernels may be obtained as a special case, and alignment kernels are closely related up to a modification that resolves their open non-definiteness issue. Our experiments indicate that our signature-based sequential kernel framework may be a promising approach to learning with sequential data, such as time series, that allows to avoid extensive manual pre-processing.

preprint2015arXiv

An integral equation for Root's barrier and the generation of Brownian increments

We derive a nonlinear integral equation to calculate Root's solution of the Skorokhod embedding problem for atom-free target measures. We then use this to efficiently generate bounded time-space increments of Brownian motion and give a parabolic version of Muller's classic "Random walk over spheres" algorithm.

preprint2014arXiv

A Levy-area between Brownian motion and rough paths with applications to robust non-linear filtering and RPDEs

We give meaning to differential equations with a rough path term and a Brownian noise term as driving signals. Such differential equations as well as the question of regularity of the solution map arise naturally and we discuss two applications: one revisits Clark's robustness problem in nonlinear filtering, the other is a Feynman--Kac type representation of linear RPDEs. En passant, we give a short and direct argument that implies integrability estimates for rough differential equations with Gaussian driving signals which is of independent interest.

preprint2014arXiv

Root's barrier, viscosity solutions of obstacle problems and reflected FBSDEs

We revisit work of Rost, Dupire and Cox--Wang on connections between Root's solution of the Skorokhod embedding problem and obstacle problems. We develop an approach based on viscosity sub- and supersolutions and an accompanying comparison principle. This gives a complete characterization of (reversed) Root barriers and leads to new proofs of existence as well as minimality of such barrier solutions by pure PDE methods. The approach is self-contained and general enough to cover martingale diffusions with degenerate elliptic or time-dependent volatility; it also provides insights about the dynamics of general Skorokhod embeddings.

preprint2013arXiv

Rough path stability of (semi-)linear SPDEs

We give meaning to linear and semi-linear (possibly degenerate) parabolic partial differential equations with (affine) linear rough path noise and establish stability in a rough path metric. In the case of enhanced Brownian motion (Brownian motion with its Lévy area) as rough path noise the solution coincides with the standard variational solution of the SPDE.

preprint2012arXiv

An extension of the functional Ito formula under a family of non-dominated measures

Motivated by questions arising in financial mathematics, Dupire introduced a notion of smoothness for functionals of paths (different from the usual Fréchet--Gatéaux derivatives) and arrived at a generalization of Itō's formula applicable to functionals which have a pathwise continuous dependence on the trajectories of the underlying process. We study nonlinear functionals which do not have such pathwise continuity and further work simultaneously under the family of continuous semimartingale measures on path-space. We do this without introducing a second component, as carried out by Cont--Fournie but by using old work of Bichteler which allows to keep a pathwise picture even for complex functionals

preprint2011arXiv

A Chen-Fliess approximation for diffusion functionals

We show that an interesting class of functionals of stochastic differential equations can be approximated by a Chen-Fliess series of iterated stochastic integrals and give a L^{2} error estimate, thus generalizing the standard stochastic Taylor expansion. The coefficients in this series are given a very intuitive meaning by using functional derivatives, recently introduced by B. Dupire.

preprint2011arXiv

Parabolic comparison revisited and applications

We consider the Cauchy-Dirichlet problem $\partial_t u - F(t,x,u,Du,D^2 u) = 0 on (0,T)\times \R^n$ in viscosity sense. Comparison is established for bounded semi-continuous (sub-/super-)solutions under structural assumption (3.14) of the User's Guide plus a mild condition on $F$ such as to cope with the unbounded domain. Comparison on $(0,T]$, space-time regularity and existence are also discussed. Our analysis passes through an extension of the parabolic theorem of sums which appears to be useful in its own right.

preprint2010arXiv

A (rough) pathwise approach to a class of non-linear stochastic partial differential equations

We consider nonlinear parabolic evolution equations of the form $\partial_{t}u=F(t,x,Du,D^{2}u) $, subject to noise of the form $H(x,Du) \circ dB$ where $H$ is linear in $Du$ and $\circ dB$ denotes the Stratonovich differential of a multidimensional Brownian motion. Motivated by the essentially pathwise results of [Lions, P.-L. and Souganidis, P.E.; Fully nonlinear stochastic partial differential equations. C. R. Acad. Sci. Paris Sér. I Math. 326 (1998), no. 9] we propose the use of rough path analysis [Lyons, T. J.; Differential equations driven by rough signals. Rev. Mat. Iberoamericana 14 (1998), no. 2, 215--310] in this context. Although the core arguments are entirely deterministic, a continuity theorem allows for various probabilistic applications (limit theorems, support, large deviations, ...).

preprint2010arXiv

A generalized Fernique theorem and applications

We prove a generalisation of Fernique's theorem which applies to a class of (measurable) functionals on abstract Wiener spaces by using the isoperimetric inequality. Our motivation comes from rough path theory where one deals with iterated integrals of Gaussian processes (which are generically not Gaussian). Gaussian integrability with explicitly given constants for variation and Hölder norms of the (fractional) Brownian rough path, Gaussian rough paths and the Banach space valued Wiener process enhanced with its Lévy area [Ledoux, Lyons, Quian. "Lévy area of Wiener processes in Banach spaces". Ann. Probab., 30(2):546--578, 2002] then all follow from applying our main theorem.

preprint2010arXiv

On the splitting-up method for rough (partial) differential equations

This article introduces the splitting method to systems responding to rough paths as external stimuli. The focus is on nonlinear partial differential equations with rough noise but we also cover rough differential equations. Applications to stochastic partial differential equations arising in control theory and nonlinear filtering are given.

Harald Oberhauser

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

Grid-Free Computation of Probabilistic Safety with Malliavin Calculus

Nonlinear Independent Component Analysis for Discrete-Time and Continuous-Time Signals

A Topological Approach to Mapping Space Signatures

Capturing Graphs with Hypo-Elliptic Diffusions

Signature moments to characterize laws of stochastic processes

Estimating the probability that a given vector is in the convex hull of a random sample

The shifted ODE method for underdamped Langevin MCMC

An optimal polynomial approximation of Brownian motion

Bayesian Learning from Sequential Data using Gaussian Processes with Signature Covariances

Signature Cumulants, Ordered Partitions, and Independence of Stochastic Processes

Kernels for sequentially ordered data

An integral equation for Root's barrier and the generation of Brownian increments

A Levy-area between Brownian motion and rough paths with applications to robust non-linear filtering and RPDEs

Root's barrier, viscosity solutions of obstacle problems and reflected FBSDEs

Rough path stability of (semi-)linear SPDEs

An extension of the functional Ito formula under a family of non-dominated measures

A Chen-Fliess approximation for diffusion functionals

Parabolic comparison revisited and applications

A (rough) pathwise approach to a class of non-linear stochastic partial differential equations

A generalized Fernique theorem and applications

On the splitting-up method for rough (partial) differential equations