Source author record

Denis Belomestny

Denis Belomestny appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.ST, Statistics Theory, math.PR, Methodology, q-fin.CP, math.OC, Applications, Machine Learning, math.NA, Numerical Analysis, Computation, econ.EM, q-fin.MF, stat.OT

Catalog footprint

What is connected

22 works

14 topics

4 close collaborators

preprint · 2022 · arXiv

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

We propose the Bayes-UCBVI algorithm for reinforcement learning in tabular, stage-dependent, episodic Markov decision processes: a natural extension of the Bayes-UCB algorithm by Kaufmann et al. (2012) for multi-armed bandits. Our method uses the quantile of a Q-value function posterior as an upper confidence bound on the optimal Q-value function. For Bayes-UCBVI, we prove a regret bound of order $\widetilde{O}(\sqrt{H^3SAT})$, where $H$ is the length of one episode, $S$ is the number of states, $A$ the number of actions, and $T$ the number of episodes, which matches the lower bound of $\Omega(\sqrt{H^3SAT})$ up to poly-$\log$ terms in $H,S,A,T$ for a large enough $T$. To the best of our knowledge, this is the first algorithm that obtains an optimal dependence on the horizon $H$ (and $S$) without the need for an involved Bernstein-like bonus or noise. Crucial to our analysis is a new fine-grained anti-concentration bound for a weighted Dirichlet sum that can be of independent interest. We then explain how Bayes-UCBVI can be easily extended beyond the tabular setting, exhibiting a strong link between our algorithm and the Bayesian bootstrap (Rubin, 1981).
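To illustrate the idea of using a posterior quantile as an upper confidence bound, here is a minimal sketch of the bandit-level Bayes-UCB rule that the abstract says Bayes-UCBVI extends. It is not the paper's algorithm: the Bernoulli rewards, the uniform Beta(1, 1) prior, the quantile level $1 - 1/t$, the Monte Carlo quantile estimate, and all function names are assumptions made for this illustration.

```python
import random


def posterior_quantile(successes, failures, level, rng, n_draws=200):
    """Monte Carlo estimate of the `level`-quantile of Beta(1 + successes, 1 + failures)."""
    draws = sorted(
        rng.betavariate(1 + successes, 1 + failures) for _ in range(n_draws)
    )
    index = min(int(level * n_draws), n_draws - 1)
    return draws[index]


def bayes_ucb(arm_means, horizon, seed=0):
    """Run a Bayes-UCB-style rule on Bernoulli arms; return the pull counts per arm."""
    rng = random.Random(seed)
    k = len(arm_means)
    successes = [0] * k
    failures = [0] * k
    pulls = [0] * k
    for t in range(1, horizon + 1):
        # Assumed schedule: the quantile level 1 - 1/t grows towards 1 over time.
        level = 1.0 - 1.0 / t
        indices = [
            posterior_quantile(successes[a], failures[a], level, rng)
            for a in range(k)
        ]
        # Play the arm whose posterior quantile (the optimistic index) is largest.
        arm = max(range(k), key=lambda a: indices[a])
        reward = 1 if rng.random() < arm_means[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1
    return pulls


if __name__ == "__main__":
    print(bayes_ucb([0.2, 0.8], horizon=500))
```

No explicit exploration bonus appears in the index: optimism comes only from reading the posterior at a high quantile, which is the feature the abstract emphasizes.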

preprint · 2022 · arXiv

Reinforced optimal control

Least squares Monte Carlo methods are a popular numerical approximation method for solving stochastic control problems. Based on dynamic programming, their key feature is the approximation of the conditional expectation of future rewards by linear least squares regression. Hence, the choice of basis functions is crucial for the accuracy of the method. Earlier work by some of us (Belomestny, Schoenmakers, Spokoiny, Zharkynbay, Commun. Math. Sci., 18(1):109-121, 2020; arXiv:1808.02341) proposes to reinforce the basis functions in the case of optimal stopping problems by already computed value functions for later times, thereby considerably improving the accuracy with limited additional computational cost. We extend the reinforced regression method to a general class of stochastic control problems, while considerably improving the method's efficiency, as demonstrated by substantial numerical examples as well as theoretical analysis.

preprint · 2022 · arXiv

Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization

Policy-gradient methods in Reinforcement Learning (RL) are very general and widely applied in practice, but their performance suffers from the high variance of the gradient estimate. Several procedures have been proposed to reduce it, including actor-critic (AC) and advantage actor-critic (A2C) methods. Recently these approaches have gained a new perspective due to the introduction of Deep RL: both new control variates (CV) and new sub-sampling procedures became available in the setting of complex models like neural networks. The vital part of CV-based methods is the goal functional for the training of the CV, the most popular one being the least-squares criterion of A2C. Despite its practical success, this criterion is not the only one possible. In this paper we investigate, for the first time, the performance of the criterion called Empirical Variance (EV). We observe in the experiments that the EV criterion not only performs no worse than A2C but can sometimes be considerably better. Apart from that, we also prove some theoretical guarantees of the actual variance reduction under very general assumptions and show that the A2C least-squares goal functional is an upper bound for the EV goal. Our experiments indicate that in terms of variance reduction EV-based methods are much better than A2C and allow stronger variance reduction.

preprint · 2021 · arXiv

From optimal martingales to randomized dual optimal stopping

In this article we study and classify optimal martingales in the dual formulation of optimal stopping problems. In this respect we distinguish between weakly optimal and surely optimal martingales. It is shown that the family of weakly optimal and surely optimal martingales may be quite large. On the other hand it is shown that the Doob martingale, that is, the martingale part of the Snell envelope, is in a certain sense the most robust surely optimal martingale under random perturbations. This new insight leads to a novel randomized dual martingale minimization algorithm that doesn't require nested simulation. As a main feature, in a possibly large family of optimal martingales the algorithm efficiently selects a martingale that is as close as possible to the Doob martingale. As a result, one obtains the dual upper bound for the optimal stopping problem with low variance.

preprint · 2020 · arXiv

Density deconvolution under general assumptions on the distribution of measurement errors

In this paper we study the problem of density deconvolution under general assumptions on the measurement error distribution. Typically deconvolution estimators are constructed using Fourier transform techniques, and it is assumed that the characteristic function of the measurement errors does not have zeros on the real line. This assumption is rather strong and is not fulfilled in many cases of interest. In this paper we develop a methodology for constructing optimal density deconvolution estimators in the general setting that covers vanishing and non-vanishing characteristic functions of the measurement errors. We derive upper bounds on the risk of the proposed estimators and provide sufficient conditions under which zeros of the corresponding characteristic function have no effect on estimation accuracy. Moreover, we show that the derived conditions are also necessary in some specific problem instances.

preprint · 2020 · arXiv

Estimating TVP-VAR models with time invariant long-run multipliers

The main goal of this paper is to develop a methodology for estimating time-varying parameter vector auto-regression (TVP-VAR) models with a time-invariant long-run relationship between endogenous variables and changes in exogenous variables. We propose a Gibbs sampling scheme for estimation of model parameters as well as time-invariant long-run multiplier parameters. Further, we demonstrate the applicability of the proposed method by analyzing examples of the Norwegian and Russian economies based on data on real GDP, real exchange rate and real oil prices. Our results show that incorporating the time invariance constraint on the long-run multipliers in the TVP-VAR model helps to significantly improve the forecasting performance.

preprint · 2020 · arXiv

Randomized optimal stopping algorithms and their convergence analysis

In this paper we study randomized optimal stopping problems and consider corresponding forward and backward Monte Carlo based optimisation algorithms. In particular we prove the convergence of the proposed algorithms and derive the corresponding convergence rates.

preprint · 2019 · arXiv

Fourier transform MCMC, heavy tailed distributions and geometric ergodicity

Markov Chain Monte Carlo methods have become increasingly popular in applied mathematics as a tool for numerical integration with respect to complex and high-dimensional distributions. However, application of MCMC methods to heavy tailed distributions and distributions with analytically intractable densities turns out to be rather problematic. In this paper, we propose a novel approach towards the use of MCMC algorithms for distributions with analytically known Fourier transforms and, in particular, heavy tailed distributions. The main idea of the proposed approach is to use MCMC methods in the Fourier domain to sample from a density proportional to the absolute value of the underlying characteristic function. A subsequent application of Parseval's formula leads to an efficient algorithm for the computation of integrals with respect to the underlying density. We show that the resulting Markov chain in the Fourier domain may be geometrically ergodic even in the case of heavy tailed original distributions. We illustrate our approach by several numerical examples including multivariate elliptically contoured stable distributions.

preprint · 2016 · arXiv

Low frequency estimation of continuous-time moving average Lévy processes

In this paper we study the problem of statistical inference for a continuous-time moving average Lévy process of the form $$Z_{t} = \int_{\mathbb{R}}\mathcal{K}(t-s)\, dL_{s},\quad t\in\mathbb{R}$$ with a deterministic kernel $\mathcal{K}$ and a Lévy process $L$. In particular, we consider the estimation of the Lévy measure $\nu$ of $L$ from low-frequency observations of the process $Z$. We construct a consistent estimator, derive its convergence rates and illustrate its performance by a numerical example. On the technical level, the main challenge is to establish a kind of exponential mixing for continuous-time moving average Lévy processes.

preprint · 2015 · arXiv

Generalized Post-Widder inversion formula with application to statistics

In this work we derive an inversion formula for the Laplace transform of a density observed on a curve in the complex domain, which generalizes the well known Post-Widder formula. We establish convergence of our inversion method and derive the corresponding convergence rates for the case of a Laplace transform of a smooth density. As an application we consider the problem of statistical inference for variance-mean mixture models. We construct a nonparametric estimator for the mixing density based on the generalized Post-Widder formula, derive bounds for its root mean square error and give a brief numerical example.
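For orientation, the classical Post-Widder formula that this abstract generalizes can be written out together with a short worked check. The notation below ($F$ for the Laplace transform of a density $f$, and the rate $a$ in the example) is chosen for this illustration and is not taken from the paper.

```latex
% Classical Post-Widder inversion of F(s) = \int_0^\infty e^{-st} f(t)\,dt
f(t) = \lim_{k\to\infty} \frac{(-1)^k}{k!}
       \left(\frac{k}{t}\right)^{k+1} F^{(k)}\!\left(\frac{k}{t}\right),
       \qquad t > 0.

% Worked check with the exponential density f(t) = a e^{-at}, a > 0:
F(s) = \frac{a}{s+a}, \qquad
F^{(k)}(s) = \frac{(-1)^k\, k!\, a}{(s+a)^{k+1}},

\frac{(-1)^k}{k!}\left(\frac{k}{t}\right)^{k+1} F^{(k)}\!\left(\frac{k}{t}\right)
  = a\left(\frac{k}{k+at}\right)^{k+1}
  = a\left(1+\frac{at}{k}\right)^{-(k+1)}
  \xrightarrow[k\to\infty]{} a e^{-at}.
```

The classical formula only uses values of $F$ and its derivatives on the positive real axis; the abstract's contribution concerns transforms observed on a curve in the complex domain.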

preprint · 2015 · arXiv

Optimal stopping under probability distortions and law invariant coherent risk measures

In this paper we study optimal stopping problems with respect to distorted expectations of the form $$\mathcal{E}(X)=\int_{-\infty}^{\infty} x\,dG(F_X(x)),$$ where $F_X$ is the distribution function of $X$ and $G$ is a convex distribution function on $[0,1]$. As a matter of fact, except for $G$ being the identity on $[0,1]$, dynamic versions of $\mathcal{E}(X)$ do not have the so-called time-consistency property necessary for the dynamic programming approach. So the standard approaches are not applicable to optimal stopping under $\mathcal{E}(X)$. In this paper, we prove a novel representation, which relates the solution of an optimal stopping problem under distorted expectation to the sequence of standard optimal stopping problems and hence makes the application of the standard dynamic programming-based approaches possible. Furthermore, by means of the well known Kusuoka representation, we extend our results to optimal stopping under general law invariant coherent risk measures. Finally, based on our novel representations, we develop several Monte Carlo approximation algorithms and illustrate their power for optimal stopping under Average Value at Risk and the absolute semideviation risk measures.
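As a small illustration of the distorted expectation $\int x\,dG(F_X(x))$, the sketch below evaluates it for the empirical distribution of a sample, where the $i$-th order statistic receives weight $G(i/n) - G((i-1)/n)$. The function names and the particular piecewise-linear distortion are assumptions for this example; the paper's optimal stopping algorithms are not reproduced here.

```python
def distorted_expectation(sample, G):
    """Empirical version of E(X) = integral of x dG(F_X(x)).

    With the empirical distribution function, the i-th smallest observation
    carries the mass G(i/n) - G((i-1)/n).
    """
    xs = sorted(sample)
    n = len(xs)
    return sum(
        x * (G(i / n) - G((i - 1) / n)) for i, x in enumerate(xs, start=1)
    )


def upper_tail_distortion(alpha):
    """Convex distortion G(u) = max(0, (u - alpha) / (1 - alpha)) on [0, 1]."""
    def G(u):
        return max(0.0, (u - alpha) / (1.0 - alpha))
    return G


if __name__ == "__main__":
    data = [4.0, 1.0, 3.0, 2.0]
    # The identity distortion recovers the ordinary sample mean.
    print(distorted_expectation(data, lambda u: u))
    # This convex distortion averages only the upper half of the sample.
    print(distorted_expectation(data, upper_tail_distortion(0.5)))
```

With the identity distortion the result is the plain average, while the convex distortion shifts all weight onto the largest observations, giving an Average-Value-at-Risk-type functional under one common sign convention.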

preprint · 2015 · arXiv

Statistical inference for generalized Ornstein-Uhlenbeck processes

In this paper, we consider the problem of statistical inference for generalized Ornstein-Uhlenbeck processes of the type \[ X_{t} = e^{-\xi_{t}} \left( X_{0} + \int_{0}^{t} e^{\xi_{u-}} d u \right), \] where $\xi_s$ is a Lévy process. Our primary goal is to estimate the characteristics of the Lévy process $\xi$ from the low-frequency observations of the process $X$. We present a novel approach towards estimating the Lévy triplet of $\xi$, which is based on the Mellin transform technique. It is shown that the resulting estimates attain optimal minimax convergence rates. The suggested algorithms are illustrated by numerical simulations.

preprint · 2014 · arXiv

Multilevel path simulation for weak approximation schemes

In this paper we discuss the possibility of using multilevel Monte Carlo (MLMC) methods for weak approximation schemes. It turns out that by means of a simple coupling between consecutive time discretisation levels, one can achieve the same complexity gain as in the presence of strong convergence. We exemplify this general idea in the case of the weak Euler scheme for Lévy driven stochastic differential equations, and show that, given a weak convergence of order $\alpha\geq 1/2$, the complexity of the corresponding "weak" MLMC estimate is of order $\varepsilon^{-2}\log ^{2}(\varepsilon)$. The numerical performance of the new "weak" MLMC method is illustrated by several numerical examples.
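The multilevel idea rests on the telescoping sum $E[P_L] = E[P_0] + \sum_{l=1}^{L} E[P_l - P_{l-1}]$, where each correction is simulated with coupled fine and coarse paths. The sketch below shows the classical setting only: a geometric Brownian motion, the Euler scheme, and pathwise coupling through shared Brownian increments. The model, parameters, sample sizes, and function names are assumptions for illustration; the coupling for weak schemes proposed in the paper is different and is not implemented here.

```python
import math
import random


def level_correction(level, n_samples, rng, s0=100.0, r=0.05, sigma=0.2, T=1.0):
    """Sample mean of P_fine - P_coarse at `level` (only P_fine at level 0).

    The payoff is P = S_T; the fine grid has 2**level Euler steps and the
    coarse grid half as many, driven by the same Brownian increments.
    """
    n_fine = 2 ** level
    h = T / n_fine
    total = 0.0
    for _ in range(n_samples):
        s_fine, s_coarse, dw_pair = s0, s0, 0.0
        for step in range(n_fine):
            dw = rng.gauss(0.0, math.sqrt(h))
            s_fine += r * s_fine * h + sigma * s_fine * dw
            dw_pair += dw
            if level > 0 and step % 2 == 1:
                # One coarse step uses the sum of two fine increments.
                s_coarse += r * s_coarse * 2 * h + sigma * s_coarse * dw_pair
                dw_pair = 0.0
        total += s_fine - (s_coarse if level > 0 else 0.0)
    return total / n_samples


def mlmc_estimate(max_level=4, n0=20000, seed=0):
    """Telescoping MLMC estimate of E[S_T], with fewer samples on finer levels."""
    rng = random.Random(seed)
    return sum(
        level_correction(level, max(n0 // 2 ** level, 100), rng)
        for level in range(max_level + 1)
    )


if __name__ == "__main__":
    print(mlmc_estimate(), 100.0 * math.exp(0.05))
```

Because coupled fine and coarse paths stay close, the corrections have small variance and need few samples, which is where the complexity gain comes from.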

preprint · 2014 · arXiv

Optimal stopping under model uncertainty: randomized stopping times approach

In this work we consider optimal stopping problems with conditional convex risk measures called optimised certainty equivalents. Without assuming any kind of time-consistency for the underlying family of risk measures, we derive a novel representation for the solution of the optimal stopping problem. In particular, we generalise the additive dual representation of Rogers (2002) to the case of optimal stopping under uncertainty. Finally, we develop several Monte Carlo algorithms and illustrate their power for optimal stopping under Average Value at Risk.
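For reference, the additive dual representation mentioned here can be stated in its standard discrete-time form for the ordinary expectation. The notation ($Z_j$ for the adapted reward process, $Y^*$ for its Snell envelope, $\mathcal{M}_0$ for martingales started at zero) is an assumption of this sketch; the generalization to optimised certainty equivalents derived in the paper is not reproduced.

```latex
% Primal problem and additive dual (Rogers, 2002), exercise dates j = 0, ..., J
Y_0^* = \sup_{\tau \in \{0,\dots,J\}} \mathbb{E}\left[ Z_\tau \right]
      = \inf_{M \in \mathcal{M}_0}
        \mathbb{E}\left[ \max_{0 \le j \le J} \left( Z_j - M_j \right) \right],

% The infimum is attained by the martingale part M^* of the Snell envelope:
Y_j^* = Y_0^* + M_j^* - A_j^*, \qquad M_0^* = A_0^* = 0,
\qquad A^* \text{ predictable and nondecreasing}.
```

Any martingale $M$ with $M_0 = 0$ therefore yields an upper bound on $Y_0^*$, which is what Monte Carlo dual algorithms exploit.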

preprint · 2014 · arXiv

Statistical Skorohod embedding problem and its generalizations

Given a Lévy process $L$, we consider the so-called statistical Skorohod embedding problem of recovering the distribution of an independent random time $T$ based on an i.i.d. sample from $L_{T}$. Our approach is based on the genuine use of the Mellin and Laplace transforms. We propose a consistent estimator for the density of $T$, derive its convergence rates and prove their optimality. It turns out that the convergence rates heavily depend on the decay of the Mellin transform of $T$. We also consider the application of our results to the problem of statistical inference for variance-mean mixture models and for time-changed Lévy processes.

preprint · 2013 · arXiv

Concentration inequalities for smooth random fields

In this note we derive a sharp concentration inequality for the supremum of a smooth random field over a finite dimensional set. It is shown that this supremum can be bounded with high probability by the value of the field at some deterministic point plus an intrinsic dimension of the optimisation problem. As an application we prove an exponential inequality for a function of the maximal eigenvalue of a random matrix.

preprint · 2013 · arXiv

Pricing American options via multi-level approximation methods

In this article we propose a novel approach to reduce the computational complexity of various approximation methods for pricing discrete time American options. Given a sequence of continuation value estimates corresponding to different levels of spatial approximation and time discretization, we propose a multi-level low-biased estimate for the price of an American option. It turns out that the resulting complexity gain can be rather high and can even reach the order $\varepsilon^{-1}$ with $\varepsilon$ denoting the desired precision. The performance of the proposed multilevel algorithm is illustrated by a numerical example of pricing Bermudan max-call options.

preprint · 2013 · arXiv

Solving optimal stopping problems via empirical dual optimization

In this paper we consider a method of solving optimal stopping problems in discrete and continuous time based on their dual representation. A novel and generic simulation-based optimization algorithm not involving nested simulations is proposed and studied. The algorithm involves the optimization of a genuinely penalized dual objective functional over a class of adapted martingales. We prove the convergence of the proposed algorithm and demonstrate its efficiency for optimal stopping problems arising in option pricing.

preprint · 2013 · arXiv

Statistical inference for exponential functionals of Lévy processes

In this paper, we consider the exponential functional $A_{\infty}=\int_0^\infty e^{-\xi_s}ds$ of a Lévy process $\xi_s$ and aim to estimate the characteristics of $\xi_{s}$ from the distribution of $A_{\infty}$. We present a new approach, which allows statistical inference on the Lévy triplet of $\xi_{t}$, and study the theoretical properties of the proposed estimators. The suggested algorithms are illustrated with numerical simulations.

preprint · 2012 · arXiv

Statistical inference for time-changed Lévy processes via composite characteristic function estimation

In this article, we study the problem of semi-parametric inference on the parameters of a multidimensional Lévy process $L_t$ with independent components based on the low-frequency observations of the corresponding time-changed Lévy process $L_{\mathcal{T}(t)}$, where $\mathcal{T}$ is a nonnegative, nondecreasing real-valued process independent of $L_t$. We show that this problem is closely related to the problem of composite function estimation that has recently received much attention in the statistical literature. Under suitable identifiability conditions, we propose a consistent estimate for the Lévy density of $L_t$ and derive the uniform as well as the pointwise convergence rates of the estimate proposed. Moreover, we prove that the rates obtained are optimal in a minimax sense over suitable classes of time-changed Lévy models. Finally, we present a simulation study showing the performance of our estimation algorithm in the case of time-changed Normal Inverse Gaussian (NIG) Lévy processes.

preprint · 2011 · arXiv

Spectral estimation of the Lévy density in partially observed affine models

The problem of estimating the Lévy density of a partially observed multidimensional affine process from low-frequency and mixed-frequency data is considered. The estimation methodology is based on the log-affine representation of the conditional characteristic function of an affine process and local linear smoothing in time. We derive almost sure uniform rates of convergence for the estimated Lévy density both in mixed-frequency and low-frequency setups and prove that these rates are optimal in the minimax sense. Finally, the performance of the estimation algorithms is illustrated in the case of the Bates stochastic volatility model.

preprint · 2010 · arXiv

Spectral estimation of the fractional order of a Lévy process

We consider the problem of estimating the fractional order of a Lévy process from low frequency historical and options data. An estimation methodology is developed which allows us to treat both estimation and calibration problems in a unified way. The corresponding procedure consists of two steps: the estimation of a conditional characteristic function and the weighted least squares estimation of the fractional order in spectral domain. While the second step is identical for both calibration and estimation, the first one depends on the problem at hand. Minimax rates of convergence for the fractional order estimate are derived, the asymptotic normality is proved and a data-driven algorithm based on aggregation is proposed. The performance of the estimator in both estimation and calibration setups is illustrated by a simulation study.
