Source author record

Sotirios Sabanis

Sotirios Sabanis appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.NA Machine Learning math.OC math.ST Statistics Theory Numerical Analysis q-fin.MF q-fin.PM

Catalog footprint

What is connected

13works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Kinetic Langevin MCMC Sampling Without Gradient Lipschitz Continuity -- the Strongly Convex Case

In this article we consider sampling from log concave distributions in Hamiltonian setting, without assuming that the objective gradient is globally Lipschitz. We propose two algorithms based on monotone polygonal (tamed) Euler schemes, to sample from a target measure, and provide non-asymptotic 2-Wasserstein distance bounds between the law of the process of each algorithm and the target measure. Finally, we apply these results to bound the excess risk optimization error of the associated optimization problem.

preprint2023arXiv

Taming neural networks with TUSLA: Non-convex learning via adaptive stochastic gradient Langevin algorithms

Artificial neural networks (ANNs) are typically highly nonlinear systems which are finely tuned via the optimization of their associated, non-convex loss functions. In many cases, the gradient of any such loss function has superlinear growth, making the use of the widely-accepted (stochastic) gradient descent methods, which are based on Euler numerical schemes, problematic. We offer a new learning algorithm based on an appropriately constructed variant of the popular stochastic gradient Langevin dynamics (SGLD), which is called tamed unadjusted stochastic Langevin algorithm (TUSLA). We also provide a nonasymptotic analysis of the new algorithm's convergence properties in the context of non-convex learning problems with the use of ANNs. Thus, we provide finite-time guarantees for TUSLA to find approximate minimizers of both empirical and population risks. The roots of the TUSLA algorithm are based on the taming technology for diffusion processes with superlinear coefficients as developed in \citet{tamed-euler, SabanisAoAP} and for MCMC algorithms in \citet{tula}. Numerical experiments are presented which confirm the theoretical findings and illustrate the need for the use of the new algorithm in comparison to vanilla SGLD within the framework of ANNs.

preprint2022arXiv

Existence, uniqueness and approximation of solutions of SDEs with superlinear coefficients in the presence of discontinuities of the drift coefficient

Existence, uniqueness, and $L_p$-approximation results are presented for scalar stochastic differential equations (SDEs) by considering the case where, the drift coefficient has finitely many spatial discontinuities while both coefficients can grow superlinearly (in the space variable). These discontinuities are described by a piecewise local Lipschitz continuity and a piecewise monotone-type condition while the diffusion coefficient is assumed to be locally Lipschitz continuous and non-degenerate at the discontinuity points of the drift coefficient. Moreover, the superlinear nature of the coefficients is dictated by a suitable coercivity condition and a polynomial growth of the (local) Lipschitz constants of the coefficients. Existence and uniqueness of strong solutions of such SDEs are obtained. Furthermore, the classical $L_p$-error rate $1/2$, for a suitable range of values of $p$, is recovered for a tamed Euler scheme which is used for approximating these solutions. To the best of the authors' knowledge, these are the first existence, uniqueness and approximation results for this class of SDEs.

preprint2021arXiv

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

We consider the problem of sampling from a target distribution, which is \emph {not necessarily logconcave}, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic analysis results are established in the $L^1$-Wasserstein distance for the behaviour of Stochastic Gradient Langevin Dynamics (SGLD) algorithms. We allow the estimation of gradients to be performed even in the presence of \emph{dependent} data streams. Our convergence estimates are sharper and \emph{uniform} in the number of iterations, in contrast to those in previous studies.

preprint2020arXiv

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

A new approach in stochastic optimization via the use of stochastic gradient Langevin dynamics (SGLD) algorithms, which is a variant of stochastic gradient decent (SGD) methods, allows us to efficiently approximate global minimizers of possibly complicated, high-dimensional landscapes. With this in mind, we extend here the non-asymptotic analysis of SGLD to the case of discontinuous stochastic gradients. We are thus able to provide theoretical guarantees for the algorithm's convergence in (standard) Wasserstein distances for both convex and non-convex objective functions. We also provide explicit upper estimates of the expected excess risk associated with the approximation of global minimizers of these objective functions. All these findings allow us to devise and present a fully data-driven approach for the optimal allocation of weights for the minimization of CVaR of portfolio of assets with complete theoretical guarantees for its performance. Numerical results illustrate our main findings.

preprint2016arXiv

Euler approximations with varying coefficients: The case of superlinearly growing diffusion coefficients

A new class of explicit Euler schemes, which approximate stochastic differential equations (SDEs) with superlinearly growing drift and diffusion coefficients, is proposed in this article. It is shown, under very mild conditions, that these explicit schemes converge in probability and in $\mathcal{L}^p$ to the solution of the corresponding SDEs. Moreover, rate of convergence estimates are provided for $\mathcal{L}^p$ and almost sure convergence. In particular, the strong order $1/2$ is recovered in the case of uniform $\mathcal{L}^p$-convergence.

preprint2016arXiv

On Explicit Approximations for Lévy Driven SDEs with Super-linear Diffusion Coefficients

Motivated by the results of \cite{sabanis2015}, we propose explicit Euler-type schemes for SDEs with random coefficients driven by Lévy noise when the drift and diffusion coefficients can grow super-linearly. As an application of our results, one can construct explicit Euler-type schemes for SDEs with delays (SDDEs) which are driven by Lévy noise and have super-linear coefficients. Strong convergence results are established and their rate of convergence is shown to be equal to that of the classical Euler scheme. It is proved that the optimal rate of convergence is achieved for $\mathcal{L}^2$-convergence which is consistent with the corresponding results available in the literature.

preprint2016arXiv

On Milstein approximations with varying coefficients: the case of super-linear diffusion coefficients

A new class of explicit Milstein schemes, which approximate stochastic differential equations (SDEs) with superlinearly growing drift and diffusion coefficients, is proposed in this article. It is shown, under very mild conditions, that these explicit schemes converge in $\mathcal L^p$ to the solution of the corresponding SDEs with optimal rate.

preprint2015arXiv

Convergence of tamed Euler schemes for a class of stochastic evolution equations

We prove stability and convergence of a full discretization for a class of stochastic evolution equations with super-linearly growing operators appearing in the drift term. This is done using the recently developed tamed Euler method, which uses a fully explicit time stepping, coupled with a Galerkin scheme for the spatial discretization.

preprint2015arXiv

On Tamed Euler Approximations of SDEs Driven by Lévy Noise with Applications to Delay Equations

We extend the taming techniques for explicit Euler approximations of stochastic differential equations (SDEs) driven by Lévy noise with super-linearly growing drift coefficients. Strong convergence results are presented for the case of locally Lipschitz coefficients. Moreover, rate of convergence results are obtained in agreement with classical literature when the local Lipschitz continuity assumptions are replaced by global and, in addition, the drift coefficients satisfy polynomial Lipschitz continuity. Finally, we further extend these techniques to the case of delay equations.

preprint2013arXiv

A note on tamed Euler approximations

Strong convergence results on tamed Euler schemes, which approximate stochastic differential equations with superlinearly growing drift coefficients that are locally one-sided Lipschitz continuous, are presented in this article. The diffusion coefficients are assumed to be locally Lipschitz continuous and have at most linear growth. Furthermore, the classical rate of convergence, i.e. one--half, for such schemes is recovered when the local Lipschitz continuity assumptions are replaced by global and, in addition, it is assumed that the drift coefficients satisfy polynomial Lipschitz continuity.

preprint2013arXiv

Strong Convergence of Euler Approximations of Stochastic Differential Equations with Delay under Local Lipschitz Condition

The strong convergence of Euler approximations of stochastic delay differential equations is proved under general conditions. The assumptions on drift and diffusion coefficients have been relaxed to include polynomial growth and only continuity in the arguments corresponding to delays. Furthermore, the rate of convergence is obtained under one-sided and polynomial Lipschitz conditions. Finally, our findings are demonstrated with the help of numerical simulations.

preprint2012arXiv

A note on Euler approximations for stochastic differential equations with delay

An existence and uniqueness theorem for a class of stochastic delay differential equations is presented, and the convergence of Euler approximations for these equations is proved under general conditions. Moreover, the rate of almost sure convergence is obtained under local Lipschitz and also under monotonicity conditions.

Sotirios Sabanis

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Kinetic Langevin MCMC Sampling Without Gradient Lipschitz Continuity -- the Strongly Convex Case

Taming neural networks with TUSLA: Non-convex learning via adaptive stochastic gradient Langevin algorithms

Existence, uniqueness and approximation of solutions of SDEs with superlinear coefficients in the presence of discontinuities of the drift coefficient

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

Euler approximations with varying coefficients: The case of superlinearly growing diffusion coefficients

On Explicit Approximations for Lévy Driven SDEs with Super-linear Diffusion Coefficients

On Milstein approximations with varying coefficients: the case of super-linear diffusion coefficients

Convergence of tamed Euler schemes for a class of stochastic evolution equations

On Tamed Euler Approximations of SDEs Driven by Lévy Noise with Applications to Delay Equations

A note on tamed Euler approximations

Strong Convergence of Euler Approximations of Stochastic Differential Equations with Delay under Local Lipschitz Condition

A note on Euler approximations for stochastic differential equations with delay