Researcher profile

Gilles Pagès

Gilles Pagès contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
13 works
0 followers
7 topics
4 close collaborators

Published work

13 published item(s)

preprint · 2026 · arXiv

Strong Solutions and Quantization-Based Numerical Schemes for a Class of Non-Markovian Volatility Models

We investigate a class of non-Markovian processes that hold particular relevance in the realm of mathematical finance. This family encompasses path-dependent volatility models, including those pioneered by [Platen and Rendek, 2018] and, more recently, by [Guyon and Lekeufack, 2023]. Our study unfolds in two principal phases. In the first phase, we introduce a functional quantization scheme based on an extended version of the Lamperti transformation that we propose to handle the presence of a memory term incorporated into the diffusion coefficient. In the second phase, we study the problem of existence and uniqueness of a strong solution for the SDEs related to the examples that motivate our study, in order to provide a theoretical basis to correctly apply the proposed numerical schemes.

preprint · 2023 · arXiv

Langevin algorithms for Markovian Neural Networks and Deep Stochastic control

Stochastic Gradient Descent Langevin Dynamics (SGLD) algorithms, which add noise to the classic gradient descent, are known to improve the training of neural networks in some cases where the neural network is very deep. In this paper we study the possibilities of training acceleration for the numerical resolution of stochastic control problems through gradient descent, where the control is parametrized by a neural network. If the control is applied at many discretization times then solving the stochastic control problem reduces to minimizing the loss of a very deep neural network. We numerically show that Langevin algorithms improve the training on various stochastic control problems like hedging and resource management, and for different choices of gradient descent methods.

preprint · 2022 · arXiv

Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise

We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplicative) noise, where $a : \mathbb{R}^+ \to \mathbb{R}^+$ is a function decreasing to $0$ and where $Υ$ is a correction term. This setting can be applied to optimization problems arising in Machine Learning. The case where $σ$ is a constant matrix has been extensively studied; however, little attention has been paid to the general case. We prove the convergence for the $L^1$-Wasserstein distance of $Y_t$ and of the associated Euler scheme $\bar{Y}_t$ to some measure $ν^\star$ which is supported by $\text{argmin}(V)$ and give rates of convergence to the instantaneous Gibbs measure $ν_{a(t)}$ of density $\propto \exp(-2V(x)/a(t)^2)$. To do so, we first consider the case where $a$ is a piecewise constant function. We recover the classical schedule $a(t) = A\log^{-1/2}(t)$. We then prove the convergence for the general case by giving bounds for the Wasserstein distance to the stepwise constant case using ergodicity properties.
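
As a rough illustration of the Euler scheme mentioned in this abstract, the sketch below simulates the simplest special case only: dimension one with $σ$ constant and equal to 1, where the correction term is assumed to vanish. The function name `langevin_annealing`, the quadratic potential, the step size and the shift of the schedule by $e$ (to keep the logarithm positive at $t = 0$) are illustrative assumptions, not taken from the paper.

```python
import math
import random


def langevin_annealing(grad_v, y0, A=0.5, h=0.01, n_steps=20000, seed=0):
    """Euler scheme for dY = -grad V(Y) dt + a(t) dW in dimension one.

    Assumptions (not from the paper): sigma = 1, no correction term,
    and the shifted schedule a(t) = A / sqrt(log(t + e)).
    """
    rng = random.Random(seed)
    y = y0
    for k in range(n_steps):
        t = k * h
        a = A / math.sqrt(math.log(t + math.e))  # slowly decreasing noise level
        y = y - h * grad_v(y) + a * math.sqrt(h) * rng.gauss(0.0, 1.0)
    return y


# Hypothetical potential V(y) = (y - 2)^2 / 2, minimized at y = 2.
y_final = langevin_annealing(lambda y: y - 2.0, y0=-3.0)
print(y_final)
```

With these settings the iterate should end up close to the minimizer 2, with residual fluctuations of the order of the current noise level $a(t)$.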

preprint · 2022 · arXiv

Convergence of Langevin-Simulated Annealing algorithms with multiplicative noise II: Total Variation

We study the convergence of Langevin-Simulated Annealing type algorithms with multiplicative noise, i.e. for $V : \mathbb{R}^d \to \mathbb{R}$ a potential function to minimize, we consider the stochastic differential equation $dY_t = - σσ^\top \nabla V(Y_t) dt + a(t)σ(Y_t)dW_t + a(t)^2Υ(Y_t)dt$, where $(W_t)$ is a Brownian motion, where $σ: \mathbb{R}^d \to \mathcal{M}_d(\mathbb{R})$ is an adaptive (multiplicative) noise, where $a : \mathbb{R}^+ \to \mathbb{R}^+$ is a function decreasing to $0$ and where $Υ$ is a correction term. Allowing $σ$ to depend on the position brings faster convergence in comparison with the classical Langevin equation $dY_t = -\nabla V(Y_t)dt + σdW_t$. In a previous paper we established the convergence in $L^1$-Wasserstein distance of $Y_t$ and of its associated Euler scheme $\bar{Y}_t$ to $\text{argmin}(V)$ with the classical schedule $a(t) = A\log^{-1/2}(t)$. In the present paper we prove the convergence in total variation distance. The total variation case appears more demanding to deal with and requires regularization lemmas.

preprint · 2022 · arXiv

Functional convex order for the scaled McKean-Vlasov processes

We establish the functional convex order results for two scaled McKean-Vlasov processes $X=(X_{t})_{t\in[0, T]}$ and $Y=(Y_{t})_{t\in[0, T]}$ defined on a filtered probability space $(Ω, \mathcal{F}, (\mathcal{F}_{t})_{t\geq0}, \mathbb{P})$ by \[\begin{cases} dX_{t}= b(t, X_{t}, μ_{t})dt+σ(t, X_{t}, μ_{t})dB_{t}, \;\;X_{0}\in L^{p}(\mathbb{P}),\\ dY_{t}\,= b(t, \,Y_{t}\,,\, ν_{t})dt+θ(t, \,Y_{t}\,,\, ν_{t})dB_{t}, \;\;Y_{0}\in L^{p}(\mathbb{P}), \end{cases}\] where $p\geq2$, for every $ t\in[0, T]$, $μ_t$, $ν_t$ denote the probability distribution of $X_t$, $Y_t$ respectively and the drift coefficient $b(t, x, μ)$ is affine in $x$ (scaled). If we make the convexity and monotony assumption (only) on $σ$ and if $σ\preceqθ$ with respect to the partial matrix order, the convex order for the initial random variable $X_0 \preceq_{\,cv} Y_0$ can be propagated to the whole path of process $X$ and $Y$. That is, if we consider a convex functional $F$ defined on the path space with polynomial growth, we have $\mathbb{E}F(X)\leq\mathbb{E}F(Y)$; for a convex functional $G$ defined on the product space involving the path space and its marginal distribution space, we have $\mathbb{E}\,G\big(X, (μ_t)_{t\in[0, T]}\big)\leq \mathbb{E}\,G\big(Y, (ν_t)_{t\in[0, T]}\big)$ under appropriate conditions. The symmetric setting is also valid, that is, if $θ\preceq σ$ and $Y_0 \leq X_0$ with respect to the convex order, then $\mathbb{E}\,F(Y) \leq \mathbb{E}\,F(X)$ and $\mathbb{E}\,G\big(Y, (ν_t)_{t\in[0, T]}\big)\leq \mathbb{E}\,G(X, (μ_t)_{t\in[0, T]})$. The proof is based on several forward and backward dynamic programming principles and the convergence of the Euler scheme of the McKean-Vlasov equation.

preprint · 2020 · arXiv

Convergence rate of optimal quantization grids and application to empirical measure

We study the convergence rate of the optimal quantization for a probability measure sequence $(μ_{n})_{n\in\mathbb{N}^{*}}$ on $\mathbb{R}^{d}$ converging in the Wasserstein distance in two aspects: the first one is the convergence rate of optimal quantizer $x^{(n)}\in(\mathbb{R}^{d})^{K}$ of $μ_{n}$ at level $K$; the other one is the convergence rate of the distortion function valued at $x^{(n)}$, called the "performance" of $x^{(n)}$. Moreover, we also study the mean performance of the optimal quantization for the empirical measure of a distribution $μ$ with finite second moment but possibly unbounded support. As an application, we show that the mean performance for the empirical measure of the multidimensional normal distribution $\mathcal{N}(m, Σ)$ and of distributions with hyper-exponential tails behaves like $\mathcal{O}(\frac{\log n}{\sqrt{n}})$. This extends the results from [BDL08] obtained for compactly supported distributions. We also derive an upper bound which is sharper in the quantization level $K$ but suboptimal in $n$ by applying results in [FG15].

preprint · 2020 · arXiv

New approach to greedy vector quantization

We extend some rate of convergence results of greedy quantization sequences already investigated in arXiv:1409.0732 [math.PR]. We show, for a more general class of distributions satisfying a certain control, that the quantization error of these sequences has an $n^{-\frac1d}$ rate of convergence and that the distortion mismatch property is satisfied. We will give some non-asymptotic Pierce-type estimates. The recursive character of greedy vector quantization allows some improvements to the algorithm of computation of these sequences and the implementation of a recursive formula to quantization-based numerical integration. Furthermore, we establish further properties of sub-optimality of greedy quantization sequences.
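
To make the recursive character of greedy quantization concrete, here is a minimal sketch in dimension one. It works on an empirical sample and a finite candidate grid, both of which are simplifying assumptions made for illustration; the helper names `distortion` and `greedy_quantization` are hypothetical and this is not the authors' algorithm.

```python
import random


def distortion(points, samples):
    """Empirical quadratic distortion: mean squared distance to the nearest point."""
    return sum(min((x - p) ** 2 for p in points) for x in samples) / len(samples)


def greedy_quantization(samples, candidates, n_points):
    """Add, at each step, the candidate that minimizes the distortion
    given the points already chosen (earlier points are never moved)."""
    points = []
    best_d2 = [float("inf")] * len(samples)  # squared distance to nearest chosen point
    for _ in range(n_points):
        best_c, best_cost = None, float("inf")
        for c in candidates:
            cost = sum(min(d, (x - c) ** 2) for d, x in zip(best_d2, samples))
            if cost < best_cost:
                best_c, best_cost = c, cost
        points.append(best_c)
        best_d2 = [min(d, (x - best_c) ** 2) for d, x in zip(best_d2, samples)]
    return points


rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(1000)]   # stand-in for N(0, 1)
candidates = [-4.0 + 0.2 * i for i in range(41)]       # assumed search grid
points = greedy_quantization(samples, candidates, 6)
errors = [distortion(points[:n], samples) for n in range(1, 7)]
print(points)
print(errors)
```

Because each quantizer is a prefix of the next, the distortions in `errors` cannot increase with the number of points, and cached distances in `best_d2` are reused from one step to the next.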

preprint · 2020 · arXiv

New Weak Error bounds and expansions for Optimal Quantization

We propose new weak error bounds and expansions in dimension one for optimal quantization-based cubature formulas for different classes of functions, such as piecewise affine functions, Lipschitz convex functions or differentiable functions with piecewise-defined locally Lipschitz or $α$-Hölder derivatives. These new results rest on the local behaviors of optimal quantizers, the $L^r$-$L^s$ distribution mismatch problem and Zador's Theorem. This new expansion supports the definition of a Richardson-Romberg extrapolation yielding a better rate of convergence for the cubature formula. An extension of this expansion is then proposed in higher dimension for the first time. We then propose a novel variance reduction method for Monte Carlo estimators, based on one-dimensional optimal quantizers.

preprint · 2020 · arXiv

Quantization-based Bermudan option pricing in the $FX$ world

This paper proposes two numerical solutions based on Product Optimal Quantization for the pricing of Foreign Exchange (FX) linked long-term Bermudan options, e.g. Bermudan Power Reverse Dual Currency options, where we take into account stochastic domestic and foreign interest rates on top of the stochastic FX rate; hence we consider a 3-factor model. For these two numerical methods, we give an estimate of the $L^2$-error induced by such approximations and we illustrate them with market-based examples that highlight the speed of such methods.

preprint · 2020 · arXiv

Stationary Heston model: Calibration and Pricing of exotics using Product Recursive Quantization

A major drawback of the Standard Heston model is that its implied volatility surface does not produce a steep enough smile when looking at short maturities. For that reason, we introduce the Stationary Heston model where we replace the deterministic initial condition of the volatility by its invariant measure and show, based on calibrated parameters, that this model produces a steeper smile for short maturities than the Standard Heston model. We also present a numerical solution based on Product Recursive Quantization for the evaluation of exotic options (Bermudan and Barrier options).
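
The idea of a random initial condition can be sketched as follows, assuming the variance follows the usual square-root (CIR) dynamics $dv_t = κ(θ - v_t)dt + ξ\sqrt{v_t}\,dW_t$, whose invariant law is a Gamma distribution with shape $2κθ/ξ^2$ and scale $ξ^2/(2κ)$. The symbols $κ$, $θ$, $ξ$, the parameter values and the function name are assumptions made for this example; they are not calibrated values from the paper.

```python
import random


def sample_stationary_variance(kappa, theta, xi, n, seed=0):
    """Draw n initial variances from the Gamma invariant law of a CIR process."""
    rng = random.Random(seed)
    shape = 2.0 * kappa * theta / xi**2
    scale = xi**2 / (2.0 * kappa)
    # random.gammavariate takes (shape, scale); the mean is shape * scale = theta.
    return [rng.gammavariate(shape, scale) for _ in range(n)]


# Hypothetical parameters, chosen only for illustration.
v0 = sample_stationary_variance(kappa=2.0, theta=0.04, xi=0.3, n=20000)
mean_v0 = sum(v0) / len(v0)
print(mean_v0)
```

The sample mean should be close to the long-run variance level `theta`, while individual draws spread around it instead of all starting from one deterministic value.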

preprint · 2019 · arXiv

Characterization of probability distribution convergence in Wasserstein distance by $L^{p}$-quantization error function

We establish conditions to characterize probability measures by their $L^{p}$-quantization error functions in both $\mathbb{R}^{d}$ and Hilbert settings. This characterization is two-fold: static (identity of two distributions) and dynamic (convergence for the $L^p$-Wasserstein distance). We first propose a criterion on the quantization level $N$, valid for any norm on $\mathbb{R}^{d}$ and any order $p$ based on a geometrical approach involving the Voronoï diagram. Then, we prove that in the $L^2$-case on a (separable) Hilbert space, the condition on the level $N$ can be reduced to $N=2$, which is optimal. More quantization-based characterization cases in dimension 1 and a discussion of the completeness of a distance defined by the quantization error function can be found at the end of this paper.

preprint · 2018 · arXiv

Weak error for nested Multilevel Monte Carlo

This article discusses MLMC estimators with and without weights, applied to nested expectations of the form $\mathbb{E}[f(\mathbb{E}[F(Y, Z) \mid Y])]$. More precisely, we are interested in the assumptions needed to comply with the MLMC framework, depending on whether the payoff function $f$ is smooth or not. A new result to our knowledge is given when $f$ is not smooth in the development of the weak error at an order higher than 1, which is needed for a successful use of MLMC estimators with weights.
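
For readers unfamiliar with nested expectations, the sketch below shows a plain (single-level) nested Monte Carlo estimator of a quantity of this form. It is not the weighted MLMC estimator studied in the paper, and the toy choice of a non-smooth $f$ (a positive part), $F(y, z) = y + z$ and Gaussian $Y$, $Z$ is a hypothetical example for which the exact value $1/\sqrt{2π}$ is known.

```python
import math
import random


def nested_mc(f, F, sample_y, sample_z, n_outer, n_inner, seed=0):
    """Plain nested Monte Carlo: average f over outer draws of Y, where the
    inner conditional expectation is itself replaced by an inner average over Z."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_outer):
        y = sample_y(rng)
        inner = sum(F(y, sample_z(rng)) for _ in range(n_inner)) / n_inner
        total += f(inner)
    return total / n_outer


# Toy example: the inner expectation equals Y, so the target is E[max(Y, 0)].
estimate = nested_mc(
    f=lambda x: max(x, 0.0),
    F=lambda y, z: y + z,
    sample_y=lambda rng: rng.gauss(0.0, 1.0),
    sample_z=lambda rng: rng.gauss(0.0, 1.0),
    n_outer=4000,
    n_inner=50,
)
exact = 1.0 / math.sqrt(2.0 * math.pi)
print(estimate, exact)
```

The inner average introduces a bias that shrinks as `n_inner` grows. The expansion of that weak error is what multilevel constructions exploit.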

preprint · 2016 · arXiv

Multilevel Richardson-Romberg extrapolation

We propose and analyze a Multilevel Richardson-Romberg (MLRR) estimator which combines the higher order bias cancellation of the Multistep Richardson-Romberg method introduced in [Pa07] and the variance control resulting from the stratification introduced in the Multilevel Monte Carlo (MLMC) method (see [Hei01, Gi08]). Thus, in standard frameworks like discretization schemes of diffusion processes, the root mean squared error (RMSE) $\varepsilon > 0$ can be achieved with our MLRR estimator with a global complexity of $\varepsilon^{-2} \log(1/\varepsilon)$ instead of $\varepsilon^{-2} (\log(1/\varepsilon))^2$ with the standard MLMC method, at least when the weak error $\mathbf{E}[Y_h]-\mathbf{E}[Y_0]$ of the biased implemented estimator $Y_h$ can be expanded at any order in $h$ and $\|Y_h - Y_0\|_2 = O(h^{\frac{1}{2}})$. The MLRR estimator is then halfway between a regular MLMC and a virtual unbiased Monte Carlo. When the strong error $\|Y_h - Y_0\|_2 = O(h^{\frac{β}{2}})$, $β< 1$, the gain of MLRR over MLMC becomes even more striking. We carry out numerical simulations to compare these estimators in two settings: vanilla and path-dependent option pricing by Monte Carlo simulation and the less classical Nested Monte Carlo simulation.
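
The two complexity orders quoted in this abstract can be compared numerically. The sketch below evaluates only the leading-order expressions, with all multiplicative constants set to 1 (an assumption, since the abstract gives orders of magnitude and not constants); the function names are illustrative.

```python
import math


def mlmc_order(eps):
    """Leading-order MLMC complexity eps^-2 * log(1/eps)^2, constant set to 1."""
    return eps**-2 * math.log(1.0 / eps) ** 2


def mlrr_order(eps):
    """Leading-order MLRR complexity eps^-2 * log(1/eps), constant set to 1."""
    return eps**-2 * math.log(1.0 / eps)


for eps in (1e-1, 1e-2, 1e-3):
    ratio = mlmc_order(eps) / mlrr_order(eps)
    print(f"eps={eps:g}  MLMC/MLRR order ratio = {ratio:.3f}")
```

Under this simplification the ratio is exactly $\log(1/\varepsilon)$, so the gain grows slowly as the target error decreases.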