Source author record

David Šiška

David Šiška appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math.PR, math.NA, math.OC, Machine Learning, Artificial Intelligence, math.AP, math.FA, q-fin.CP, q-fin.MF, q-fin.PR, Systems and Control

Catalog footprint

What is connected

8 works

11 topics

4 close collaborators

preprint, 2022, arXiv

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

We study the global convergence of policy gradient for infinite-horizon, entropy-regularized Markov decision processes (MDPs) with continuous state and action spaces. We consider a softmax policy with a one-hidden-layer neural network approximation in a mean-field regime. We add entropic regularization of the associated mean-field probability measure and study the corresponding gradient flow in the 2-Wasserstein metric. We show that the objective function is increasing along the gradient flow. Further, we prove that if the regularization in terms of the mean-field measure is sufficient, the gradient flow converges exponentially fast to the unique stationary solution, which is the unique maximizer of the regularized MDP objective. Lastly, we study the sensitivity of the value function along the gradient flow with respect to the regularization parameters and the initial condition. Our results rely on a careful analysis of the non-linear Fokker-Planck-Kolmogorov equation and extend the pioneering work of Mei et al. (2020) and Agarwal et al. (2020), which quantifies the global convergence rate of policy gradient for entropy-regularized MDPs in the tabular setting.
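
As a point of orientation for the tabular setting that this abstract extends, the following is a minimal sketch of softmax policy gradient with entropy regularization on a single-state problem (a bandit). It is not the paper's mean-field neural network scheme. The rewards, the temperature `tau`, the step size, and all function names are illustrative assumptions. In this toy case the maximizer is known in closed form, with the policy proportional to `exp(reward / tau)`, so the iterates can be compared against it.

```python
import math

def softmax(theta):
    """Numerically stable softmax over a list of logits."""
    m = max(theta)
    exps = [math.exp(t - m) for t in theta]
    total = sum(exps)
    return [e / total for e in exps]

def regularized_objective(policy, rewards, tau):
    """Expected reward plus tau times the Shannon entropy of the policy."""
    return sum(p * (r - tau * math.log(p)) for p, r in zip(policy, rewards))

def policy_gradient_ascent(rewards, tau, step_size, n_steps):
    """Gradient ascent on the softmax logits (single-state toy problem)."""
    theta = [0.0] * len(rewards)  # start from the uniform policy
    history = []
    for _ in range(n_steps):
        policy = softmax(theta)
        history.append(regularized_objective(policy, rewards, tau))
        # Entropy-adjusted reward r_a - tau * log(pi_a), centred by its mean under pi.
        soft_reward = [r - tau * math.log(p) for p, r in zip(policy, rewards)]
        baseline = sum(p * g for p, g in zip(policy, soft_reward))
        grad = [p * (g - baseline) for p, g in zip(policy, soft_reward)]
        theta = [t + step_size * g for t, g in zip(theta, grad)]
    return softmax(theta), history

# Hypothetical three-action example.
rewards = [1.0, 0.5, 0.0]
tau = 0.5
policy, history = policy_gradient_ascent(rewards, tau, step_size=0.1, n_steps=5000)
target = softmax([r / tau for r in rewards])  # closed-form maximizer for this toy case
print(policy, target)
```

The list `history` records the regularized objective at each iterate, which makes it possible to check in this toy example that the objective does not decrease along the iteration.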

preprint, 2022, arXiv

Decaying derivative estimates for functions of solutions to non-autonomous SDEs

We produce uniform and decaying bounds in time for derivatives of the solution to the backward Kolmogorov equation associated with a stochastic process governed by time-dependent dynamics. These bounds hold under assumptions on the finite-time integrability of the derivatives of the transition density associated with the process, together with the assumption that the process remains close over all of $[0,\infty)$, or decays in time, to some static measure. We also provide examples that satisfy this set of assumptions. Finally, the results are interpreted in the McKean-Vlasov context for monotonic coefficients by introducing an auxiliary non-autonomous stochastic process.

preprint, 2020, arXiv

Robust pricing and hedging via neural SDEs

Mathematical modelling is ubiquitous in the financial industry and drives key decision processes. Any given model provides only a crude approximation to reality, and the risk of using an inadequate model is hard to detect and quantify. By contrast, modern data science techniques are opening the door to more robust and data-driven model selection mechanisms. However, most machine learning models are "black boxes", as individual parameters do not have a meaningful interpretation. The aim of this paper is to combine the above approaches, achieving the best of both worlds. Combining neural networks with risk models based on classical stochastic differential equations (SDEs), we find robust bounds for prices of derivatives and the corresponding hedging strategies while incorporating relevant market data. The resulting model, called a neural SDE, is an instantiation of generative models and is closely linked with the theory of causal optimal transport. Neural SDEs allow consistent calibration under both the risk-neutral and the real-world measures. Thus the model can be used to simulate market scenarios needed for assessing risk profiles and hedging strategies. We develop and analyse novel algorithms needed for efficient use of neural SDEs. We validate our approach with numerical experiments using both local and stochastic volatility models.

preprint, 2020, arXiv

Weak Existence and Uniqueness for McKean-Vlasov SDEs with Common Noise

This paper concerns the McKean-Vlasov stochastic differential equation (SDE) with common noise. An appropriate definition of a weak solution to such an equation is developed. The importance of the notion of compatibility in this definition is highlighted by a demonstration of its rôle in connecting weak solutions to McKean-Vlasov SDEs with common noise and solutions to corresponding stochastic partial differential equations (SPDEs). By keeping track of the dependence structure between all components in a sequence of approximating processes, a compactness argument is employed to prove the existence of a weak solution assuming boundedness and joint continuity of the coefficients (allowing for degenerate diffusions). Weak uniqueness is established when the private (idiosyncratic) noise's diffusion coefficient is non-degenerate and the drift is regular in the total variation distance. This seems sharp when one considers using finite-dimensional noise to regularise an infinite-dimensional problem. The proof relies on a suitably tailored cost function in the Monge-Kantorovich problem and on a representation of weak solutions via Girsanov transformations.

preprint, 2016, arXiv

Nonlinear stochastic evolution equations of second order with damping

Convergence of a full discretization of a second-order stochastic evolution equation with nonlinear damping is shown, and thus the existence of a solution is established. The discretization scheme combines an implicit time-stepping scheme with an internal approximation. Uniqueness is proved as well.

preprint, 2015, arXiv

Convergence of tamed Euler schemes for a class of stochastic evolution equations

We prove stability and convergence of a full discretization for a class of stochastic evolution equations with super-linearly growing operators appearing in the drift term. This is done using the recently developed tamed Euler method, which uses fully explicit time stepping, coupled with a Galerkin scheme for the spatial discretization.
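
To illustrate the taming idea in the simplest possible setting, here is a minimal sketch for a scalar SDE with a super-linearly growing drift. It uses the taming form commonly seen in the SDE literature, in which the drift increment `h * f` is divided by `1 + h * |f|`. This is an assumption for illustration only, not necessarily the scheme analysed in the paper, which treats stochastic evolution equations with a Galerkin spatial discretization. The cubic drift, the constant noise coefficient, and all function names are hypothetical.

```python
import math
import random

def tamed_euler(x0, drift, diffusion, t_end, n_steps, rng):
    """Simulate one path of dX = drift(X) dt + diffusion(X) dW with a tamed explicit step."""
    h = t_end / n_steps
    x = x0
    for _ in range(n_steps):
        f = drift(x)
        dw = rng.gauss(0.0, math.sqrt(h))  # Brownian increment over one step
        # Taming: the drift increment h*f is divided by 1 + h*|f|,
        # so its magnitude stays below 1 however large f is.
        x = x + h * f / (1.0 + h * abs(f)) + diffusion(x) * dw
    return x

def cubic_drift(x):
    """Super-linearly growing, dissipative drift (illustrative choice)."""
    return -x ** 3

def constant_noise(x):
    """Additive noise with unit intensity (illustrative choice)."""
    return 1.0

rng = random.Random(0)  # fixed seed so the run is reproducible
x_final = tamed_euler(10.0, cubic_drift, constant_noise, t_end=1.0, n_steps=100, rng=rng)
print(x_final)
```

The step remains fully explicit: no nonlinear equation is solved per step, and the only change from the standard explicit Euler update is the rescaling of the drift term.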

preprint, 2014, arXiv

On finite-difference approximations for normalized Bellman equations

A class of stochastic optimal control problems involving optimal stopping is considered. Methods of Krylov are adapted to investigate the numerical solutions of the corresponding normalized Bellman equations and to estimate the rate of convergence of finite difference approximations for the optimal reward functions.

preprint, 2011, arXiv

Error estimates for finite difference approximations of American put option price

Finite difference approximations to the price of a multi-asset American put option are considered. The assets are modelled as a multi-dimensional diffusion process with variable drift and volatility. An approximation error of order one quarter with respect to the time discretisation parameter and one half with respect to the space discretisation parameter is proved by reformulating the corresponding optimal stopping problem as a solution of a degenerate Hamilton-Jacobi-Bellman equation. Furthermore, the error arising from restricting the discrete problem to a finite grid by reducing the original problem to a bounded domain is estimated.
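
For readers who want a concrete picture of a finite difference approximation to an American put on a bounded grid, the following is a minimal single-asset sketch. It assumes constant interest rate and volatility (a Black-Scholes model), an explicit time step, and an early-exercise check at every grid point. None of this is taken from the paper, which treats the multi-asset case with variable coefficients, and the sketch does not demonstrate the stated error orders. All parameter values and names are hypothetical. The explicit step is only stable when the time step is small enough, roughly `dt <= 1 / (sigma**2 * n_space**2 + rate)`, which the chosen values satisfy.

```python
def american_put_explicit_fd(strike, rate, sigma, maturity, s_max, n_space, n_time):
    """Explicit finite difference scheme for an American put on the grid [0, s_max]."""
    ds = s_max / n_space
    dt = maturity / n_time
    grid = [i * ds for i in range(n_space + 1)]
    payoff = [max(strike - s, 0.0) for s in grid]
    values = payoff[:]  # option value at maturity
    for _ in range(n_time):
        new = values[:]
        for i in range(1, n_space):
            # Coefficients of the explicit step for the Black-Scholes operator at S = i * ds.
            a = 0.5 * dt * (sigma ** 2 * i ** 2 - rate * i)
            b = 1.0 - dt * (sigma ** 2 * i ** 2 + rate)
            c = 0.5 * dt * (sigma ** 2 * i ** 2 + rate * i)
            continuation = a * values[i - 1] + b * values[i] + c * values[i + 1]
            # Early exercise: the holder takes the better of continuing and exercising.
            new[i] = max(continuation, payoff[i])
        new[0] = strike      # at S = 0 immediate exercise pays the strike
        new[n_space] = 0.0   # truncation: the put is taken to be worthless at s_max
        values = new
    return grid, values

# Hypothetical parameters; with sigma = 0.3 and rate = 0.05 all three
# coefficients a, b, c are nonnegative on this grid.
grid, values = american_put_explicit_fd(
    strike=100.0, rate=0.05, sigma=0.3, maturity=1.0,
    s_max=200.0, n_space=100, n_time=1000,
)
price_at_the_money = values[50]  # grid[50] == 100.0
print(price_at_the_money)
```

Setting the value to zero at `s_max` is the simplest way to restrict the problem to a bounded domain. The resulting truncation error is the kind of quantity the abstract's final sentence refers to.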