Source author record

Lukasz Szpruch

Lukasz Szpruch appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math.PR, math.NA, q-fin.CP, Machine Learning, q-fin.MF, Artificial Intelligence, econ.GN, math.CA, math.OC, Numerical Analysis, q-fin.EC, q-fin.PR, q-fin.TR

Catalog footprint

What is connected

18 works

13 topics

4 close collaborators

preprint, 2022, arXiv

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

We study the global convergence of policy gradient for infinite-horizon, continuous state and action space, and entropy-regularized Markov decision processes (MDPs). We consider a softmax policy with (one-hidden layer) neural network approximation in a mean-field regime. Additional entropic regularization in the associated mean-field probability measure is added, and the corresponding gradient flow is studied in the 2-Wasserstein metric. We show that the objective function is increasing along the gradient flow. Further, we prove that if the regularization in terms of the mean-field measure is sufficient, the gradient flow converges exponentially fast to the unique stationary solution, which is the unique maximizer of the regularized MDP objective. Lastly, we study the sensitivity of the value function along the gradient flow with respect to regularization parameters and the initial condition. Our results rely on the careful analysis of the non-linear Fokker-Planck-Kolmogorov equation and extend the pioneering work of Mei et al. 2020 and Agarwal et al. 2020, which quantify the global convergence rate of policy gradient for entropy-regularized MDPs in the tabular setting.

preprint, 2022, arXiv

Synthetic Data -- what, why and how?

This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy. The article is intended for a non-technical audience, though some formal definitions have been given to provide clarity to specialists. This article is intended to enable the reader to quickly become familiar with the notion of synthetic data, as well as understand some of the subtle intricacies that come with it. We do believe that synthetic data is a very useful tool, and our hope is that this report highlights that, while drawing attention to nuances that can easily be overlooked in its deployment.

preprint, 2022, arXiv

Unbiased deep solvers for linear parametric PDEs

We develop several deep learning algorithms for approximating families of parametric PDE solutions. The proposed algorithms approximate solutions together with their gradients, which in the context of mathematical finance means that the derivative prices and hedging strategies are computed simultaneously. Having approximated the gradient of the solution, one can combine it with a Monte Carlo simulation to remove the bias in the deep network approximation of the PDE solution (derivative price). This is achieved by leveraging the Martingale Representation Theorem and combining the Monte Carlo simulation with the neural network. The resulting algorithm is robust with respect to the quality of the neural network approximation and consequently can be used as a black box when only limited a priori information about the underlying problem is available. We believe this is important, as neural-network-based algorithms often require a fair amount of tuning to produce satisfactory results. The methods are empirically shown to work for high-dimensional problems (e.g. 100 dimensions). We provide diagnostics that shed light on appropriate network architectures.
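The bias-removal idea in this abstract can be illustrated with a small sketch. It is not the paper's algorithm: the neural network is replaced by a hypothetical stand-in, `approx_delta`, which is a Black-Scholes delta evaluated with a deliberately wrong volatility, and the model is assumed to be a one-dimensional geometric Brownian motion with zero interest rate. Because the asset is a martingale under these assumptions, subtracting the discrete stochastic integral of any adapted approximate gradient leaves the Monte Carlo estimator unbiased and only changes its variance.

```python
import math
import random

def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def approx_delta(s, tau, strike, vol_guess):
    """Hypothetical stand-in for a learned gradient:
    Black-Scholes delta computed with a mis-specified volatility."""
    d1 = (math.log(s / strike) + 0.5 * vol_guess ** 2 * tau) / (vol_guess * math.sqrt(tau))
    return norm_cdf(d1)

def price_call(n_paths=2000, n_steps=20, s0=1.0, strike=1.0, sigma=0.2,
               T=1.0, vol_guess=0.25, seed=0):
    """Return per-path payoffs without and with the martingale control variate."""
    rng = random.Random(seed)
    h = T / n_steps
    plain, corrected = [], []
    for _ in range(n_paths):
        s, hedge = s0, 0.0
        for k in range(n_steps):
            tau = T - k * h  # time to maturity at the start of the step
            dw = rng.gauss(0.0, math.sqrt(h))
            # Exact GBM step with zero rate, so s is a discrete-time martingale.
            s_next = s * math.exp(-0.5 * sigma ** 2 * h + sigma * dw)
            # Mean-zero increment: approximate gradient times asset increment.
            hedge += approx_delta(s, tau, strike, vol_guess) * (s_next - s)
            s = s_next
        payoff = max(s - strike, 0.0)
        plain.append(payoff)
        corrected.append(payoff - hedge)
    return plain, corrected

def mean_and_variance(xs):
    """Sample mean and unbiased sample variance."""
    m = sum(xs) / len(xs)
    v = sum((x - m) ** 2 for x in xs) / (len(xs) - 1)
    return m, v
```

Comparing `mean_and_variance(plain)` with `mean_and_variance(corrected)` shows both estimators targeting the same price, with the corrected one having a smaller sample variance even though the gradient used is imperfect.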

preprint, 2021, arXiv

Black-box model risk in finance

Machine learning models are increasingly used in a wide variety of financial settings. The difficulty of understanding the inner workings of these systems, combined with their wide applicability, has the potential to lead to significant new risks for users; these risks need to be understood and quantified. In this sub-chapter, we will focus on a well studied application of machine learning techniques, to pricing and hedging of financial options. Our aim will be to highlight the various sources of risk that the introduction of machine learning emphasises or de-emphasises, and the possible risk mitigation and management strategies that are available.

preprint, 2020, arXiv

Robust pricing and hedging via neural SDEs

Mathematical modelling is ubiquitous in the financial industry and drives key decision processes. Any given model provides only a crude approximation to reality and the risk of using an inadequate model is hard to detect and quantify. By contrast, modern data science techniques are opening the door to more robust and data-driven model selection mechanisms. However, most machine learning models are "black-boxes" as individual parameters do not have meaningful interpretation. The aim of this paper is to combine the above approaches achieving the best of both worlds. Combining neural networks with risk models based on classical stochastic differential equations (SDEs), we find robust bounds for prices of derivatives and the corresponding hedging strategies while incorporating relevant market data. The resulting model called neural SDE is an instantiation of generative models and is closely linked with the theory of causal optimal transport. Neural SDEs allow consistent calibration under both the risk-neutral and the real-world measures. Thus the model can be used to simulate market scenarios needed for assessing risk profiles and hedging strategies. We develop and analyse novel algorithms needed for efficient use of neural SDEs. We validate our approach with numerical experiments using both local and stochastic volatility models.

preprint, 2020, arXiv

Sig-SDEs model for quantitative finance

Mathematical models, calibrated to data, have become ubiquitous in key decision processes in modern quantitative finance. In this work, we propose a novel framework for data-driven model selection by integrating a classical quantitative setup with a generative modelling approach. Leveraging the properties of the signature, a well-known path-transform from stochastic analysis that recently emerged as a leading machine learning technology for learning time-series data, we develop the Sig-SDE model. Sig-SDE provides a new perspective on neural SDEs and can be calibrated to exotic financial products that depend, in a non-linear way, on the whole trajectory of asset prices. Furthermore, our approach enables consistent calibration under the pricing measure $\mathbb Q$ and the real-world measure $\mathbb P$. Finally, we demonstrate the ability of Sig-SDE to simulate future possible market scenarios needed for computing risk profiles or hedging strategies. Importantly, this new model is underpinned by rigorous mathematical analysis, which under appropriate conditions provides theoretical guarantees for convergence of the presented algorithms.

preprint, 2019, arXiv

Weak quantitative propagation of chaos via differential calculus on the space of measures

Consider the metric space $(\mathcal{P}_2(\mathbb{R}^d),W_2)$ of square integrable laws on $\mathbb{R}^d$ with the topology induced by the 2-Wasserstein distance $W_2$. Let $Φ: \mathcal{P}_2( \mathbb{R}^d) \to \mathbb{R}$ be a function and $μ_N$ be the empirical measure of a sample of $N$ random variables distributed as $μ$. The main result of this paper is to show that under suitable regularity conditions, we have \[ |Φ(μ) - \mathbb{E}Φ(μ_N)|= \sum_{j=1}^{k-1}\frac{C_j}{N^j} + O(\frac{1}{N^k}), \] for some positive constants $C_1, \ldots, C_{k-1}$ that do not depend on $N$, where $k$ corresponds to the degree of smoothness. We distinguish two cases: a) $μ_N$ is the empirical measure of $N$-samples from $μ$; b) $μ$ is a marginal law of a McKean-Vlasov stochastic differential equation, in which case $μ_N$ is an empirical law of marginal laws of the corresponding particle system. The first case is studied using functional derivatives on the space of measures. The second case relies on an Itô-type formula for the flow of probability measures and is intimately connected to PDEs on the space of measures, called the master equation in the literature of mean-field games. We state the general regularity conditions required for each case and analyse the regularity in the case of functionals of the laws of McKean-Vlasov SDEs. Ultimately, this work reveals quantitative estimates of propagation of chaos for interacting particle systems. Furthermore, we are able to provide weak propagation of chaos estimates for ensembles of interacting particles and show that these may have some remarkable properties.
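A one-line worked instance of case a) may help fix ideas. It is an illustration chosen here, not an example taken from the paper: assume $d = 1$, let $μ$ have mean $m$ and variance $σ^2$ (symbols introduced for this example), and take the squared-mean functional. The expansion then terminates after the first term, with $C_1 = σ^2$.

```latex
\[
  \Phi(\mu) = \Big( \int_{\mathbb{R}} x \, \mu(dx) \Big)^2 = m^2,
  \qquad
  \Phi(\mu_N) = \Big( \frac{1}{N} \sum_{i=1}^{N} X_i \Big)^2,
  \quad X_1, \ldots, X_N \ \text{i.i.d.} \sim \mu,
\]
\[
  \mathbb{E}\,\Phi(\mu_N)
  = \operatorname{Var}\Big( \frac{1}{N} \sum_{i=1}^{N} X_i \Big)
    + \Big( \mathbb{E}\,\frac{1}{N} \sum_{i=1}^{N} X_i \Big)^2
  = \frac{\sigma^2}{N} + m^2,
\]
\[
  |\Phi(\mu) - \mathbb{E}\,\Phi(\mu_N)| = \frac{\sigma^2}{N}.
\]
```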

preprint, 2016, arXiv

Convergence and qualitative properties of modified explicit schemes for BSDEs with polynomial growth

The theory of Forward-Backward Stochastic Differential Equations (FBSDEs) paves a way to probabilistic numerical methods for nonlinear parabolic PDEs. The majority of the results on the numerical methods for FBSDEs relies on the global Lipschitz assumption, which is not satisfied for a number of important cases such as the Fisher-KPP or the FitzHugh-Nagumo equations. Furthermore, it has been shown in Lionnet, Reis and Szpruch (2015) that for BSDEs with monotone drivers having polynomial growth in the primary variable $y$, only the (sufficiently) implicit schemes converge. But these require an additional computational effort compared to explicit schemes. This article develops a general framework that allows the analysis, in a systematic fashion, of the integrability properties, convergence and qualitative properties (e.g. comparison theorem) for whole families of modified explicit schemes. The framework yields the convergence of some modified explicit scheme with the same rate as implicit schemes and with the computational cost of the standard explicit scheme. To illustrate our theory, we present several classes of easily implementable modified explicit schemes that can computationally outperform the implicit one and preserve the qualitative properties of the solution to the BSDE. These classes fit into our developed framework and are tested in computational experiments.

preprint, 2016, arXiv

Multilevel Monte Carlo for Scalable Bayesian Computations

Markov chain Monte Carlo (MCMC) algorithms are ubiquitous in Bayesian computations. However, they need to access the full data set in order to evaluate the posterior density at every step of the algorithm. This results in a great computational burden in big data applications. In contrast to MCMC methods, Stochastic Gradient MCMC (SGMCMC) algorithms such as the Stochastic Gradient Langevin Dynamics (SGLD) only require access to a batch of the data set at every step. This drastically improves the computational performance and scales well to large data sets. However, the difficulty with SGMCMC algorithms comes from their sensitivity to their parameters, which are notoriously difficult to tune. Moreover, the Root Mean Square Error (RMSE) scales as $\mathcal{O}(c^{-\frac{1}{3}})$ as opposed to standard MCMC $\mathcal{O}(c^{-\frac{1}{2}})$ where $c$ is the computational cost. We introduce a new class of Multilevel Stochastic Gradient Markov chain Monte Carlo algorithms that are able to mitigate the problem of tuning the step size and, more importantly, to recover the $\mathcal{O}(c^{-\frac{1}{2}})$ convergence of standard Markov chain Monte Carlo methods without the need to introduce Metropolis-Hastings steps. A further advantage of this new class of algorithms is that it can easily be parallelised over a heterogeneous computer architecture. We illustrate our methodology using Bayesian logistic regression and provide numerical evidence that for a prescribed relative RMSE the computational cost is sublinear in the number of data items.
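For readers unfamiliar with the baseline, the following is a minimal sketch of plain, single-level SGLD, not the multilevel algorithm the abstract introduces. The model is an assumption made for illustration: observations $x_i \sim N(θ, 1)$ with a Gaussian prior on $θ$, so the posterior mean is known in closed form and can be used as a check. The function name and all parameter values are hypothetical.

```python
import math
import random

def sgld_gaussian_mean(data, n_steps=6000, burn_in=1000, batch_size=10,
                       step=1e-4, prior_var=100.0, seed=1):
    """Plain SGLD for the posterior of theta, assuming
    x_i ~ N(theta, 1) and the prior theta ~ N(0, prior_var)."""
    rng = random.Random(seed)
    n = len(data)
    theta = 0.0
    samples = []
    for k in range(n_steps):
        # Each step touches only a small random batch of the data set.
        batch = [data[rng.randrange(n)] for _ in range(batch_size)]
        grad_prior = -theta / prior_var
        # Rescaled batch gradient: an unbiased estimate of the full-data gradient.
        grad_lik = (n / batch_size) * sum(x - theta for x in batch)
        noise = math.sqrt(step) * rng.gauss(0.0, 1.0)
        theta += 0.5 * step * (grad_prior + grad_lik) + noise
        if k >= burn_in:
            samples.append(theta)
    return samples

# Usage: synthetic data and the closed-form posterior mean for comparison.
data_rng = random.Random(0)
data = [data_rng.gauss(2.0, 1.0) for _ in range(1000)]
samples = sgld_gaussian_mean(data)
chain_mean = sum(samples) / len(samples)
posterior_mean = sum(data) / (len(data) + 1.0 / 100.0)
```

In this sketch the step size `step` has to be chosen by hand, which is the tuning difficulty the abstract mentions.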

preprint, 2015, arXiv

$V$-Integrability, Asymptotic Stability And Comparison Theorem of Explicit Numerical Schemes for SDEs

Khasminski (1980) showed that many of the asymptotic stability and integrability properties of the solutions to Stochastic Differential Equations (SDEs) can be obtained using Lyapunov function techniques. These properties are rarely inherited by standard numerical integrators. In this article we introduce a family of explicit numerical approximations for SDEs and derive conditions that allow the use of Khasminski's techniques in the context of numerical approximations, particularly in the case where the SDEs have non-globally Lipschitz coefficients. Consequently, we show that it is possible to construct a numerical scheme that is bounded in expectation with respect to a Lyapunov function and/or inherits the asymptotic stability property from the SDEs. Finally, we show that using suitable schemes it is possible to recover the comparison theorem for scalar SDEs.

preprint, 2015, arXiv

Time discretization of FBSDE with polynomial growth drivers and reaction-diffusion PDEs

In this paper, we undertake the error analysis of the time discretization of systems of Forward-Backward Stochastic Differential Equations (FBSDEs) with drivers having polynomial growth and that are also monotone in the state variable. We show with a counter-example that the natural explicit Euler scheme may diverge, unlike in the canonical Lipschitz driver case. This is due to the lack of a certain stability property of the Euler scheme which is essential to obtain convergence. However, a thorough analysis of the family of $θ$-schemes reveals that this required stability property can be recovered if the scheme is sufficiently implicit. As a by-product of our analysis, we shed some light on higher order approximation schemes for FBSDEs under non-Lipschitz condition. We then return to fully explicit schemes and show that an appropriately tamed version of the explicit Euler scheme enjoys the required stability property and as a consequence converges. In order to establish convergence of the several discretizations, we extend the canonical path- and first-order variational regularity results to FBSDEs with polynomial growth drivers which are also monotone. These results are of independent interest for the theory of FBSDEs.

preprint, 2014, arXiv

Antithetic multilevel Monte Carlo estimation for multi-dimensional SDEs without Lévy area simulation

In this paper we introduce a new multilevel Monte Carlo (MLMC) estimator for multi-dimensional SDEs driven by Brownian motions. Giles has previously shown that if we combine a numerical approximation with strong order of convergence $O(Δt)$ with MLMC we can reduce the computational complexity to estimate expected values of functionals of SDE solutions with a root-mean-square error of $ε$ from $O(ε^{-3})$ to $O(ε^{-2})$. However, in general, to obtain a rate of strong convergence higher than $O(Δt^{1/2})$ requires simulation, or approximation, of Lévy areas. In this paper, through the construction of a suitable antithetic multilevel correction estimator, we are able to avoid the simulation of Lévy areas and still achieve an $O(Δt^2)$ multilevel correction variance for smooth payoffs, and almost an $O(Δt^{3/2})$ variance for piecewise smooth payoffs, even though there is only $O(Δt^{1/2})$ strong convergence. This results in an $O(ε^{-2})$ complexity for estimating the value of European and Asian put and call options.

preprint, 2012, arXiv

Convergence, Non-negativity and Stability of a New Milstein Scheme with Applications to Finance

We propose and analyse a new Milstein type scheme for simulating stochastic differential equations (SDEs) with highly nonlinear coefficients. Our work is motivated by the need to justify multi-level Monte Carlo simulations for mean-reverting financial models with polynomial growth in the diffusion term. We introduce a double implicit Milstein scheme and show that it possesses desirable properties. It converges strongly and preserves non-negativity for a rich family of financial models and can reproduce linear and nonlinear stability behaviour of the underlying SDE without severe restriction on the time step. Although the scheme is implicit, we point out examples of financial models where an explicit formula for the solution to the scheme can be found.

preprint, 2012, arXiv

First order strong approximations of scalar SDEs with values in a domain

We are interested in strong approximations of one-dimensional SDEs which have non-Lipschitz coefficients and which take values in a domain. Under a set of general assumptions we derive an implicit scheme that preserves the domain of the SDEs and is strongly convergent with rate one. Moreover, we show that this general result can be applied to many SDEs we encounter in mathematical finance and bio-mathematics. We demonstrate the flexibility of our approach by analysing classical examples of SDEs with sublinear coefficients (CIR, CEV models and Wright-Fisher diffusion) and also with superlinear coefficients (3/2-volatility, Ait-Sahalia model). Our goal is to justify an efficient Multi-Level Monte Carlo (MLMC) method for a rich family of SDEs, which relies on good strong convergence properties.
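A domain-preserving implicit step can be sketched for the CIR model $dX = κ(θ - X)\,dt + σ\sqrt{X}\,dW$. The sketch applies a drift-implicit Euler step to $Y = \sqrt{X}$, which reduces to a quadratic equation with one positive root. Whether this matches the paper's exact construction is an assumption. The function name, the parameter values and the restriction $4κθ > σ^2$ are choices made for this example.

```python
import math
import random

def cir_implicit_paths(n_paths=1000, n_steps=50, x0=0.04, kappa=2.0, theta=0.04,
                       sigma=0.3, T=1.0, seed=0):
    """Drift-implicit Euler on Y = sqrt(X) for
    dX = kappa*(theta - X) dt + sigma*sqrt(X) dW.
    Returns terminal values of X and the smallest X seen on any path."""
    # Ito's formula gives dY = (alpha / Y + beta * Y) dt + gamma dW.
    alpha = (4.0 * kappa * theta - sigma ** 2) / 8.0
    if alpha <= 0.0:
        raise ValueError("this sketch assumes 4*kappa*theta > sigma**2")
    beta = -0.5 * kappa
    gamma = 0.5 * sigma
    h = T / n_steps
    a = 1.0 - beta * h
    rng = random.Random(seed)
    terminal = []
    smallest = float("inf")
    for _ in range(n_paths):
        y = math.sqrt(x0)
        for _step in range(n_steps):
            b = y + gamma * rng.gauss(0.0, math.sqrt(h))
            # Implicit step: a*y_new**2 - b*y_new - alpha*h = 0; take the positive root.
            y = (b + math.sqrt(b * b + 4.0 * a * alpha * h)) / (2.0 * a)
            smallest = min(smallest, y * y)
        terminal.append(y * y)
    return terminal, smallest
```

Since `alpha * h > 0`, the square root is strictly larger than `abs(b)`, so the new `y` is positive for every Brownian increment, which is the domain-preservation property in this special case.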

preprint, 2012, arXiv

Multilevel Monte Carlo methods for applications in finance

Since Giles introduced the multilevel Monte Carlo path simulation method [18], there has been rapid development of the technique for a variety of applications in computational finance. This paper surveys the progress so far, highlights the key features in achieving a high rate of multilevel variance convergence, and suggests directions for future research.
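The basic multilevel construction can be sketched as follows. This is the standard (non-antithetic) estimator based on the telescoping sum $\mathbb{E}[P_L] = \mathbb{E}[P_0] + \sum_{l=1}^{L} \mathbb{E}[P_l - P_{l-1}]$, applied to an illustrative problem chosen here: estimating $\mathbb{E}[S_T]$ for a geometric Brownian motion with Euler-Maruyama time stepping. Function names, model parameters and the fixed sample sizes per level are assumptions. A practical implementation would choose the sample sizes from estimated level variances.

```python
import math
import random

def coupled_euler_gbm(level, rng, s0=1.0, r=0.05, sigma=0.2, T=1.0):
    """Return (fine, coarse) Euler-Maruyama endpoints on 2**level and
    2**(level-1) steps, driven by the same Brownian increments.
    On level 0 there is no coarse path, so coarse is 0.0."""
    n_fine = 2 ** level
    h = T / n_fine
    if level == 0:
        dw = rng.gauss(0.0, math.sqrt(h))
        return s0 * (1.0 + r * h + sigma * dw), 0.0
    fine = coarse = s0
    for _ in range(n_fine // 2):
        dw1 = rng.gauss(0.0, math.sqrt(h))
        dw2 = rng.gauss(0.0, math.sqrt(h))
        # Two fine steps of size h.
        fine *= 1.0 + r * h + sigma * dw1
        fine *= 1.0 + r * h + sigma * dw2
        # One coarse step of size 2h using the summed increment.
        coarse *= 1.0 + r * 2.0 * h + sigma * (dw1 + dw2)
    return fine, coarse

def mlmc_estimate(samples_per_level, seed=0):
    """Sum of per-level sample means of (fine - coarse)."""
    rng = random.Random(seed)
    total = 0.0
    for level, n in enumerate(samples_per_level):
        acc = 0.0
        for _ in range(n):
            fine, coarse = coupled_euler_gbm(level, rng)
            acc += fine - coarse
        total += acc / n
    return total

# Usage: most samples go to the cheap coarse levels.
estimate = mlmc_estimate([4000, 2000, 1000, 500, 250])
exact = math.exp(0.05)  # E[S_T] = s0 * exp(r * T) for this model
```

The coupling through shared increments is what makes the variance of each correction term small, so few samples are needed on the expensive fine levels.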

preprint, 2012, arXiv

Strong convergence and stability of implicit numerical methods for stochastic differential equations with non-globally Lipschitz continuous coefficients

We are interested in the strong convergence and almost sure stability of Euler-Maruyama (EM) type approximations to the solutions of stochastic differential equations (SDEs) with non-linear and non-Lipschitzian coefficients. Motivation comes from finance and biology where many widely applied models do not satisfy the standard assumptions required for the strong convergence. In addition we examine the globally almost surely asymptotic stability in this non-linear setting for EM type schemes. In particular, we present a stochastic counterpart of the discrete LaSalle principle from which we deduce stability properties for numerical methods.

preprint, 2011, arXiv

A limit order book model for latency arbitrage

We consider a single security market based on a limit order book and two investors, with different speeds of trade execution. If the fast investor can front-run the slower investor, we show that this allows the fast trader to obtain risk free profits, but that these profits cannot be scaled. We derive the fast trader's optimal behaviour when she has only distributional knowledge of the slow trader's actions, with few restrictions on the possible prior distributions. We also consider the slower trader's response to the presence of a fast trader in a market, and the effects of the introduction of a 'Tobin tax' on financial transactions. We show that such a tax can lead to the elimination of profits from front-running strategies. Consequently, a Tobin tax can both increase market efficiency and attract traders to a market.

preprint, 2011, arXiv

On Markovian solutions to Markov Chain BSDEs

We study (backward) stochastic differential equations with noise coming from a finite state Markov chain. We show that, for the solutions of these equations to be 'Markovian', in the sense that they are deterministic functions of the state of the underlying chain, the integrand must be of a specific form. This allows us to connect these equations to coupled systems of ODEs, and hence to give fast numerical methods for the evaluation of Markov-Chain BSDEs.
