Source author record

Patrick Cheridito

Patrick Cheridito appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR Machine Learning math.NA math.OC math.ST Numerical Analysis q-fin.PM Statistics Theory eess.SY math.AP math.FA math.SP q-fin.CP q-fin.PR Systems and Control

Catalog footprint

What is connected

13works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Deep Learning for Continuous-Time Stochastic Control with Jumps

In this paper, we introduce a model-based deep-learning approach to solve finite-horizon continuous-time stochastic control problems with jumps. We iteratively train two neural networks: one to represent the optimal policy and the other to approximate the value function. Leveraging a continuous-time version of the dynamic programming principle, we derive two different training objectives based on the Hamilton-Jacobi-Bellman equation, ensuring that the networks capture the underlying stochastic dynamics. Empirical evaluations on different problems illustrate the accuracy and scalability of our approach, demonstrating its effectiveness in solving complex high-dimensional stochastic control tasks.

preprint2026arXiv

Deep Legendre Transform

We introduce a novel deep learning algorithm for computing convex conjugates of differentiable convex functions, a fundamental operation in convex analysis with various applications in different fields such as optimization, control theory, physics and economics. While traditional numerical methods suffer from the curse of dimensionality and become computationally intractable in high dimensions, more recent neural network--based approaches scale better, but have mostly been studied with the aim of solving optimal transport problems and require the solution of complicated optimization or max--min problems. Using an implicit Fenchel formulation of convex conjugation, our approach facilitates an efficient gradient--based framework for the minimization of approximation errors and, as a byproduct, also provides a posteriori estimates of the approximation accuracy. Numerical experiments demonstrate our method's ability to deliver accurate results across different high-dimensional examples. Moreover, by employing symbolic regression with Kolmogorov--Arnold networks, it is able to obtain the exact convex conjugates of specific convex functions.

preprint2026arXiv

INEUS: Iterative Neural Solver for High-Dimensional PIDEs

In this paper, we introduce INEUS, a meshfree iterative neural solver for partial integro-differential equations (PIDEs). The method replaces the explicit evaluation of nonlocal jump integrals with single-jump sampling and reformulates PIDE solving as a sequence of recursive regression problems. Like Physics-Informed Neural Networks (PINNs), INEUS learns global solutions over the entire space-time domain, yet it offers a more efficient treatment of nonlocal terms and avoids the computationally expensive differentiation of full PIDE residuals. These features make INEUS particularly well suited for high-dimensional PDEs and PIDEs. Supported by a contraction-based convergence proof for linear PIDEs, our numerical experiments show that INEUS delivers accurate and scalable solutions for various high-dimensional linear and nonlinear examples.

preprint2022arXiv

Landscape analysis for shallow neural networks: complete classification of critical points for affine target functions

In this paper, we analyze the landscape of the true loss of neural networks with one hidden layer and ReLU, leaky ReLU, or quadratic activation. In all three cases, we provide a complete classification of the critical points in the case where the target function is affine and one-dimensional. In particular, we show that there exist no local maxima and clarify the structure of saddle points. Moreover, we prove that non-global local minima can only be caused by `dead' ReLU neurons. In particular, they do not appear in the case of leaky ReLU or quadratic activation. Our approach is of a combinatorial nature and builds on a careful analysis of the different types of hidden neurons that can occur.

preprint2021arXiv

A proof of convergence for gradient descent in the training of artificial neural networks for constant target functions

Gradient descent optimization algorithms are the standard ingredients that are used to train artificial neural networks (ANNs). Even though a huge number of numerical simulations indicate that gradient descent optimization methods do indeed convergence in the training of ANNs, until today there is no rigorous theoretical analysis which proves (or disproves) this conjecture. In particular, even in the case of the most basic variant of gradient descent optimization algorithms, the plain vanilla gradient descent method, it remains an open problem to prove or disprove the conjecture that gradient descent converges in the training of ANNs. In this article we solve this problem in the special situation where the target function under consideration is a constant function. More specifically, in the case of constant target functions we prove in the training of rectified fully-connected feedforward ANNs with one-hidden layer that the risk function of the gradient descent method does indeed converge to zero. Our mathematical analysis strongly exploits the property that the rectifier function is the activation function used in the considered ANNs. A key contribution of this work is to explicitly specify a Lyapunov function for the gradient flow system of the ANN parameters. This Lyapunov function is the central tool in our convergence proof of the gradient descent method.

preprint2021arXiv

On non-local ergodic Jacobi semigroups: spectral theory, convergence-to-equilibrium and contractivity

In this paper, we introduce and study non-local Jacobi operators, which generalize the classical (local) Jacobi operators. We show that these operators extend to generators of ergodic Markov semigroups with unique invariant probability measures and study their spectral and convergence properties. In particular, we derive a series expansion of the semigroup in terms of explicitly defined polynomials, which generalize the classical Jacobi orthogonal polynomials. In addition, we give a complete characterization of the spectrum of the non-self-adjoint generator and semigroup. We show that the variance decay of the semigroup is hypocoercive with explicit constants, which provides a natural generalization of the spectral gap estimate. After a random warm-up time, the semigroup also decays exponentially in entropy and is both hypercontractive and ultracontractive. Our proofs hinge on the development of commutation identities, known as intertwining relations, between local and non-local Jacobi operators and semigroups, with the local objects serving as reference points for transferring properties from the local to the non-local case.

preprint2015arXiv

Multidimensional quadratic and subquadratic BSDEs with special structure

We study multidimensional BSDEs of the form $$ Y_t = ξ+ \int_t^T f(s,Y_s,Z_s)ds - \int_t^T Z_s dW_s $$ with bounded terminal conditions $ξ$ and drivers $f$ that grow at most quadratically in $Z_s$. We consider three different cases. In the first one the BSDE is Markovian, and a solution can be obtained from a solution to a related FBSDE. In the second case, the BSDE becomes a one-dimensional quadratic BSDE when projected to a one-dimensional subspace, and a solution can be derived from a solution of the one-dimensional equation. In the third case, the growth of the driver $f$ in $Z_s$ is strictly subquadratic, and the existence and uniqueness of a solution can be shown by first solving the BSDE on a short time interval and then extending the solution recursively.

preprint2014arXiv

Conditional Analysis on R^d

This paper provides versions of classical results from linear algebra, real analysis and convex analysis in a free module of finite rank over the ring $L^0$ of measurable functions on a $σ$-finite measure space. We study the question whether a submodule is finitely generated and introduce the more general concepts of $L^0$-affine sets, $L^0$-convex sets, $L^0$-convex cones, $L^0$-hyperplanes, $L^0$-half-spaces and $L^0$-convex polyhedral sets. We investigate orthogonal complements, orthogonal decompositions and the existence of orthonormal bases. We also study $L^0$-linear, $L^0$-affine, $L^0$-convex and $L^0$-sublinear functions and introduce notions of continuity, differentiability, directional derivatives and subgradients. We use a conditional version of the Bolzano-Weierstrass theorem to show that conditional Cauchy sequences converge and give conditions under which conditional optimization problems have optimal solutions. We prove results on the separation of $L^0$-convex sets by $L^0$-hyperplanes and study $L^0$-convex conjugate functions. We provide a result on the existence of $L^0$-subgradients of $L^0$-convex functions, prove a conditional version of the Fenchel-Moreau theorem and study conditional inf-convolutions.

preprint2013arXiv

BSDEs with terminal conditions that have bounded Malliavin derivative

We show existence and uniqueness of solutions to BSDEs of the form $$ Y_t = ξ+ \int_t^T f(s,Y_s,Z_s)ds - \int_t^T Z_s dW_s$$ in the case where the terminal condition $ξ$ has bounded Malliavin derivative. The driver $f(s,y,z)$ is assumed to be Lipschitz continuous in $y$ but only locally Lipschitz continuous in $z$. In particular, it can grow arbitrarily fast in $z$. If in addition to having bounded Malliavin derivative, $ξ$ is bounded, the driver needs only be locally Lipschitz continuous in $y$. In the special case where the BSDE is Markovian, we obtain existence and uniqueness results for semilinear parabolic PDEs with non-Lipschitz nonlinearities. We discuss the case where there is no lateral boundary as well as lateral boundary conditions of Dirichlet and Neumann type.

preprint2013arXiv

BSΔEs and BSDEs with non-Lipschitz drivers: Comparison, convergence and robustness

We provide existence results and comparison principles for solutions of backward stochastic difference equations (BS$Δ$Es) and then prove convergence of these to solutions of backward stochastic differential equations (BSDEs) when the mesh size of the time-discretizaton goes to zero. The BS$Δ$Es and BSDEs are governed by drivers $f^N(t,ω,y,z)$ and $f(t,ω,y,z),$ respectively. The new feature of this paper is that they may be non-Lipschitz in z. For the convergence results it is assumed that the BS$Δ$Es are based on d-dimensional random walks $W^N$ approximating the d-dimensional Brownian motion W underlying the BSDE and that $f^N$ converges to f. Conditions are given under which for any bounded terminal condition $ξ$ for the BSDE, there exist bounded terminal conditions $ξ^N$ for the sequence of BS$Δ$Es converging to $ξ$, such that the corresponding solutions converge to the solution of the limiting BSDE. An important special case is when $f^N$ and f are convex in z. We show that in this situation, the solutions of the BS$Δ$Es converge to the solution of the BSDE for every uniformly bounded sequence $ξ^N$ converging to $ξ$. As a consequence, one obtains that the BSDE is robust in the sense that if $(W^N,ξ^N)$ is close to $(W,ξ)$ in distribution, then the solution of the Nth BS$Δ$E is close to the solution of the BSDE in distribution too.

preprint2011arXiv

Existence, minimality and approximation of solutions to BSDEs with convex drivers

We study the existence of solutions to backward stochastic differential equations with drivers f(t,W,y,z) that are convex in z. We assume f to be Lipschitz in y and W but do not make growth assumptions with respect to z. We first show the existence of a unique solution (Y,Z) with bounded Z if the terminal condition is Lipschitz in W and that it can be approximated by the solutions to properly discretized equations. If the terminal condition is bounded and uniformly continuous in W, we show the existence of a minimal continuous supersolution by uniformly approximating the terminal condition with Lipschitz terminal conditions. Finally, we prove existence of a minimal RCLL supersolution for bounded lower semicontinuous terminal conditions by approximating the terminal condition pointwise from below with Lipschitz terminal conditions.

preprint2011arXiv

Pricing and Hedging in Affine Models with Possibility of Default

We propose a general framework for the simultaneous modeling of equity, government bonds, corporate bonds and derivatives. Uncertainty is generated by a general affine Markov process. The setting allows for stochastic volatility, jumps, the possibility of default and correlation between different assets. We show how to calculate discounted complex moments by solving a coupled system of generalized Riccati equations. This yields an efficient method to compute prices of power payoffs. European calls and puts as well as binaries and asset-or-nothing options can be priced with the fast Fourier transform methods of Carr and Madan (1999) and Lee (2005). Other European payoffs can be approximated with a linear combination of government bonds, power payoffs and vanilla options. We show the results to be superior to using only government bonds and power payoffs or government bonds and vanilla options. We also give conditions for European continent claims in our framework to be replicable if enough financial instruments are liquidly tradable and study dynamic hedging strategies. As an example we discuss a Heston-type stochastic volatility model with possibility of default and stochastic interest rates.

preprint2010arXiv

Optimal consumption and investment in incomplete markets with general constraints

We study an optimal consumption and investment problem in a possibly incomplete market with general, not necessarily convex, stochastic constraints. We give explicit solutions for investors with exponential, logarithmic and power utility. Our approach is based on martingale methods which rely on recent results on the existence and uniqueness of solutions to BSDEs with drivers of quadratic growth.

Patrick Cheridito

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Deep Learning for Continuous-Time Stochastic Control with Jumps

Deep Legendre Transform

INEUS: Iterative Neural Solver for High-Dimensional PIDEs

Landscape analysis for shallow neural networks: complete classification of critical points for affine target functions

A proof of convergence for gradient descent in the training of artificial neural networks for constant target functions

On non-local ergodic Jacobi semigroups: spectral theory, convergence-to-equilibrium and contractivity

Multidimensional quadratic and subquadratic BSDEs with special structure

Conditional Analysis on R^d

BSDEs with terminal conditions that have bounded Malliavin derivative

BSΔEs and BSDEs with non-Lipschitz drivers: Comparison, convergence and robustness

Existence, minimality and approximation of solutions to BSDEs with convex drivers

Pricing and Hedging in Affine Models with Possibility of Default

Optimal consumption and investment in incomplete markets with general constraints