Source author record

Konstantinos Spiliopoulos

Konstantinos Spiliopoulos appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.OC Machine Learning Methodology Applications math-ph math.MP math.ST q-fin.MF q-fin.RM Statistics Theory math.AP Computation math.DS q-fin.CP Systems and Control

Catalog footprint

What is connected

34works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Convergence Analysis of Newton's Method for Neural Networks in the Overparameterized Limit

A convergence analysis is developed for the regularized Newton method for training neural networks (NNs) in the overparameterized limit. As the number of hidden units tends to infinity, the NN training dynamics converge in probability to the solution of a deterministic limit equation involving a ``Newton neural tangent kernel'' (NNTK). Explicit rates characterizing this convergence are provided and, in the infinite-width limit, we prove that the NN converges exponentially fast to the target data (i.e., a global minimizer with zero loss). We show that this convergence is uniform across the frequency spectrum, addressing the spectral bias inherent in gradient descent. The eigenvalues of the NTK for gradient descent accumulate at zero, leading to slow convergence for target data with high-frequency components. In contrast, the NNTK has uniformly lower bounded eigenvalues if the regularization parameter is selected appropriately, allowing Newton's method to converge more quickly for data with high-frequency components. Mathematical challenges that need to be addressed in our analysis include the implicit parameter update of the Newton method with a potentially indefinite Hessian matrix and the fact that the dimension of this linear system of equations tends to infinity as the NN width grows. This complicates deriving the training dynamics in the overparameterized limit as well as proving the convergence of the finite-width dynamics thereto. The analysis identifies a scaling formula for selecting the regularization parameter, which we show can vanish at a suitable rate as the number of hidden units becomes larger. We prove that, for sufficiently large numbers of hidden units, the regularized Hessian remains positive definite during training and the Newton updates for individual NN parameters converge to zero, showing that the model behaves as a linearization around the initialization.

preprint2026arXiv

Kernel Limit for a Class of Recurrent Neural Networks Trained on Ergodic Data Sequences

Mathematical methods are developed to characterize the asymptotics of recurrent neural networks (RNN) as the number of hidden units, data samples in the sequence, hidden state updates, and training steps simultaneously grow to infinity. In the case of an RNN with a simplified weight matrix, we prove the convergence of the RNN to the solution of an infinite-dimensional ODE coupled with the fixed point of a random algebraic equation. The analysis requires addressing several challenges which are unique to RNNs. In typical mean-field applications (e.g., feedforward neural networks), discrete updates are of magnitude $\mathcal{O}(1/N)$ and the number of updates is $\mathcal{O}(N)$. Therefore, the system can be represented as an Euler approximation of an appropriate ODE/PDE, which it will converge to as $N \rightarrow \infty$. However, the RNN hidden layer updates are $\mathcal{O}(1)$. Therefore, RNNs cannot be represented as a discretization of an ODE/PDE and standard mean-field techniques cannot be applied. Instead, we develop a fixed point analysis for the evolution of the RNN memory states, with convergence estimates in terms of the number of update steps and the number of hidden units. The RNN hidden layer is studied as a function in a Sobolev space, whose evolution is governed by the data sequence (a Markov chain), the parameter updates, and its dependence on the RNN hidden layer at the previous time step. Due to the strong correlation between updates, a Poisson equation must be used to bound the fluctuations of the RNN around its limit equation. These mathematical methods give rise to the neural tangent kernel (NTK) limits for RNNs trained on data sequences as the number of data samples and size of the neural network grow to infinity.

preprint2022arXiv

Disentangling positive and negative partisanship in social media interactions using a coevolving latent space network with attractors model

We develop a broadly applicable class of coevolving latent space network with attractors (CLSNA) models, where nodes represent individual social actors assumed to lie in an unknown latent space, edges represent the presence of a specified interaction between actors, and attractors are added in the latent level to capture the notion of attractive and repulsive forces. We apply the CLSNA models to understand the dynamics of partisan polarization on social media, where we expect Republicans and Democrats to increasingly interact with their own party and disengage with the opposing party. Using longitudinal social networks from the social media platforms Twitter and Reddit, we investigate the relative contributions of positive (attractive) and negative (repulsive) forces among political elites and the public, respectively. Our goals are to disentangle the positive and negative forces within and between parties and explore if and how they change over time. Our analysis confirms the existence of partisan polarization in social media interactions among both political elites and the public. Moreover, while positive partisanship is the driving force of interactions across the full periods of study for both the public and Democratic elites, negative partisanship has come to dominate Republican elites' interactions since the run-up to the 2016 presidential election.

preprint2022arXiv

Geometry-informed irreversible perturbations for accelerated convergence of Langevin dynamics

We introduce a novel geometry-informed irreversible perturbation that accelerates convergence of the Langevin algorithm for Bayesian computation. It is well documented that there exist perturbations to the Langevin dynamics that preserve its invariant measure while accelerating its convergence. Irreversible perturbations and reversible perturbations (such as Riemannian manifold Langevin dynamics (RMLD)) have separately been shown to improve the performance of Langevin samplers. We consider these two perturbations simultaneously by presenting a novel form of irreversible perturbation for RMLD that is informed by the underlying geometry. Through numerical examples, we show that this new irreversible perturbation can improve estimation performance over irreversible perturbations that do not take the geometry into account. Moreover we demonstrate that irreversible perturbations generally can be implemented in conjunction with the stochastic gradient version of the Langevin algorithm. Lastly, while continuous-time irreversible perturbations cannot impair the performance of a Langevin estimator, the situation can sometimes be more complicated when discretization is considered. To this end, we describe a discrete-time example in which irreversibility increases both the bias and variance of the resulting estimator.

preprint2022arXiv

Moderate deviations for systems of slow-fast stochastic reaction-diffusion equations

The goal of this paper is to study the Moderate Deviation Principle (MDP) for a system of stochastic reaction-diffusion equations with a time-scale separation in slow and fast components and small noise in the slow component. Based on weak convergence methods in infinite dimensions and related stochastic control arguments, we obtain an exact form for the moderate deviations rate function in different regimes as the small noise and time-scale separation parameters vanish. Many issues that appear due to the infinite dimensionality of the problem are completely absent in their finite-dimensional counterpart. In comparison to corresponding Large Deviation Principles, the moderate deviation scaling necessitates a more delicate approach to establishing tightness and properly identifying the limiting behavior of the underlying controlled problem. The latter involves regularity properties of a solution of an associated elliptic Kolmogorov equation on Hilbert space along with a finite-dimensional approximation argument.

preprint2022arXiv

Normalization effects on deep neural networks

We study the effect of normalization on the layers of deep neural networks of feed-forward type. A given layer $i$ with $N_{i}$ hidden units is allowed to be normalized by $1/N_{i}^{γ_{i}}$ with $γ_{i}\in[1/2,1]$ and we study the effect of the choice of the $γ_{i}$ on the statistical behavior of the neural network's output (such as variance) as well as on the test accuracy on the MNIST data set. We find that in terms of variance of the neural network's output and test accuracy the best choice is to choose the $γ_{i}$'s to be equal to one, which is the mean-field scaling. We also find that this is particularly true for the outer layer, in that the neural network's behavior is more sensitive in the scaling of the outer layer as opposed to the scaling of the inner layers. The mechanism for the mathematical analysis is an asymptotic expansion for the neural network's output. An important practical consequence of the analysis is that it provides a systematic and mathematically informed way to choose the learning rate hyperparameters. Such a choice guarantees that the neural network behaves in a statistically robust way as the $N_i$ grow to infinity.

preprint2022arXiv

Normalization effects on shallow neural networks and related asymptotic expansions

We consider shallow (single hidden layer) neural networks and characterize their performance when trained with stochastic gradient descent as the number of hidden units $N$ and gradient descent steps grow to infinity. In particular, we investigate the effect of different scaling schemes, which lead to different normalizations of the neural network, on the network's statistical output, closing the gap between the $1/\sqrt{N}$ and the mean-field $1/N$ normalization. We develop an asymptotic expansion for the neural network's statistical output pointwise with respect to the scaling parameter as the number of hidden units grows to infinity. Based on this expansion, we demonstrate mathematically that to leading order in $N$, there is no bias-variance trade off, in that both bias and variance (both explicitly characterized) decrease as the number of hidden units increases and time grows. In addition, we show that to leading order in $N$, the variance of the neural network's statistical output decays as the implied normalization by the scaling parameter approaches the mean field normalization. Numerical studies on the MNIST and CIFAR10 datasets show that test and train accuracy monotonically improve as the neural network's normalization gets closer to the mean field normalization.

preprint2022arXiv

Online Adjoint Methods for Optimization of PDEs

We present and mathematically analyze an online adjoint algorithm for the optimization of partial differential equations (PDEs). Traditional adjoint algorithms would typically solve a new adjoint PDE at each optimization iteration, which can be computationally costly. In contrast, an online adjoint algorithm updates the design variables in continuous-time and thus constantly makes progress towards minimizing the objective function. The online adjoint algorithm we consider is similar in spirit to the the pseudo-time-stepping, one-shot method which has been previously proposed. Motivated by the application of such methods to engineering problems, we mathematically study the convergence of the online adjoint algorithm. The online adjoint algorithm relies upon a time-relaxed adjoint PDE which provides an estimate of the direction of steepest descent. The algorithm updates this estimate continuously in time, and it asymptotically converges to the exact direction of steepest descent as $t \rightarrow \infty$. We rigorously prove that the online adjoint algorithm converges to a critical point of the objective function for optimizing the PDE. Under appropriate technical conditions, we also prove a convergence rate for the algorithm. A crucial step in the convergence proof is a multi-scale analysis of the coupled system for the forward PDE, adjoint PDE, and the gradient descent ODE for the design variables.

preprint2022arXiv

Rate of homogenization for fully-coupled McKean-Vlasov SDEs

We consider a fully-coupled slow-fast system of McKean-Vlasov SDEs with full dependence on the slow and fast component and on the law of the slow component and derive convergence rates to its homogenized limit. We do not make periodicity assumptions, but we impose conditions on the fast motion to guarantee ergodicity. In the course of the proof we obtain related ergodic theorems and we gain results on the regularity of Poisson type of equations and of the associated Cauchy-Problem on the Wasserstein space that are of independent interest.

preprint2022arXiv

Scaling Limit of Neural Networks with the Xavier Initialization and Convergence to a Global Minimum

We analyze single-layer neural networks with the Xavier initialization in the asymptotic regime of large numbers of hidden units and large numbers of stochastic gradient descent training steps. The evolution of the neural network during training can be viewed as a stochastic system and, using techniques from stochastic analysis, we prove the neural network converges in distribution to a random ODE with a Gaussian distribution. The limit is completely different than in the typical mean-field results for neural networks due to the $\frac{1}{\sqrt{N}}$ normalization factor in the Xavier initialization (versus the $\frac{1}{N}$ factor in the typical mean-field framework). Although the pre-limit problem of optimizing a neural network is non-convex (and therefore the neural network may converge to a local minimum), the limit equation minimizes a (quadratic) convex objective function and therefore converges to a global minimum. Furthermore, under reasonable assumptions, the matrix in the limiting quadratic objective function is positive definite and thus the neural network (in the limit) will converge to a global minimum with zero loss on the training set.

preprint2020arXiv

Importance sampling for slow-fast diffusions based on moderate deviations

We consider systems of slow--fast diffusions with small noise in the slow component. We construct provably logarithmic asymptotically optimal importance schemes for the estimation of rare events based on the moderate deviations principle. Using the subsolution approach we construct schemes and identify conditions under which the schemes will be asymptotically optimal. Moderate deviations--based importance sampling offers a viable alternative to large deviations importance sampling when the events are not too rare. In particular, in many cases of interest one can indeed construct the required change of measure in closed form, a task which is more complicated using the large deviations--based importance sampling, especially when it comes to multiscale dynamically evolving processes. The presence of multiple scales and the fact that we do not make any periodicity assumptions for the coefficients driving the processes, complicates the design and the analysis of efficient importance sampling schemes. Simulation studies illustrate the theory.

preprint2020arXiv

Network effects in default clustering for large systems

We consider a large collection of dynamically interacting components defined on a weighted directed graph determining the impact of default of one component to another one. We prove a law of large numbers for the empirical measure capturing the evolution of the different components in the pool and from this we extract important information for quantities such as the loss rate in the overall pool as well as the mean impact on a given component from system wide defaults. A singular value decomposition of the adjacency matrix of the graph allows to coarse-grain the system by focusing on the highest eigenvalues which also correspond to the components with the highest contagion impact on the pool. Numerical simulations demonstrate the theoretical findings.

preprint2020arXiv

Selection of quasi-stationary states in the stochastically forced Navier-Stokes equation on the torus

The stochastically forced vorticity equation associated with the two dimensional incompressible Navier-Stokes equation on $D_δ:=[0,2πδ]\times [0,2π]$ is considered for $δ\approx 1$, periodic boundary conditions, and viscocity $0<ν\ll 1$. An explicit family of quasi-stationary states of the deterministic vorticity equation is known to play an important role in the long-time evolution of solutions both in the presence of and without noise. Recent results show the parameter $δ$ plays a central role in selecting which of the quasi-stationary states is most important. In this paper, we aim to develop a finite dimensional model that captures this selection mechanism for the stochastic vorticity equation. This is done by projecting the vorticity equation in Fourier space onto a center manifold corresponding to the lowest eight Fourier modes. Through Monte Carlo simulation, the vorticity equation and the model are shown to be in agreement regarding key aspects of the long-time dynamics. Following this comparison, perturbation analysis is performed on the model via averaging and homogenization techniques to determine the leading order dynamics for statistics of interest for $δ\approx1$.

preprint2020arXiv

Typical dynamics and fluctuation analysis of slow-fast systems driven by fractional Brownian motion

This article studies typical dynamics and fluctuations for a slow-fast dynamical system perturbed by a small fractional Brownian noise. Based on an ergodic theorem with explicit rates of convergence, which may be of independent interest, we characterize the asymptotic dynamics of the slow component to two orders (i.e., the typical dynamics and the fluctuations). The limiting distribution of the fluctuations turns out to depend upon the manner in which the small-noise parameter is taken to zero relative to the scale-separation parameter. We study also an extension of the original model in which the relationship between the two small parameters leads to a qualitative difference in limiting behavior. The results of this paper provide an approximation, to two orders, to dynamical systems perturbed by small fractional Brownian noise and subject to multiscale effects.

preprint2016arXiv

Improving the convergence of reversible samplers

In Monte-Carlo methods the Markov processes used to sample a given target distribution usually satisfy detailed balance, i.e. they are time-reversible. However, relatively recent results have demonstrated that appropriate reversible and irreversible perturbations can accelerate convergence to equilibrium. In this paper we present some general design principles which apply to general Markov processes. Working with the generator of Markov processes, we prove that for some of the most commonly used performance criteria, i.e., spectral gap, asymptotic variance and large deviation functionals, sampling is improved for appropriate reversible and irreversible perturbations of some initially given reversible sampler. Moreover we provide specific constructions for such reversible and irreversible perturbations for various commonly used Markov processes, such as Markov chains and diffusions. In the case of diffusions, we make the discussion more specific using the large deviations rate function as a measure of performance.

preprint2016arXiv

Indifference pricing for Contingent Claims: Large Deviations Effects

We study utility indifference prices and optimal purchasing quantities for a non-traded contingent claim in an incomplete semi-martingale market with vanishing hedging errors. We make connections with the theory of large deviations. We concentrate on sequences of semi-complete markets where in the $n^{th}$ market, the claim $B_n$ admits the decomposition $B_n = D_n+Y_n$. Here, $D_n$ is replicable by trading in the underlying assets $S_n$, but $Y_n$ is independent of $S_n$. Under broad conditions, we may assume that $Y_n$ vanishes in accordance with a large deviations principle as $n$ grows. In this setting, for an exponential investor, we identify the limit of the average indifference price $p_n(q_n)$, for $q_n$ units of $B_n$, as $n\rightarrow \infty$. We show that if $|q_n|\rightarrow\infty$, the limiting price typically differs from the price obtained by assuming bounded positions $\sup_n|q_n|<\infty$, and the difference is explicitly identifiable using large deviations theory. Furthermore, we show that optimal purchase quantities occur at the large deviations scaling, and hence large positions arise endogenously in this setting.

preprint2016arXiv

Markov processes with spatial delay: path space characterization, occupation time and properties

In this paper, we study one dimensional Markov processes with spatial delay. Since the seminal work of Feller, we know that virtually any one dimensional, strong, homogeneous, continuous Markov process can be uniquely characterized via its infinitesimal generator and the generator's domain of definition. Unlike standard diffusions like Brownian motion, processes with spatial delay spend positive time at a single point of space. Interestingly, the set of times that a delay process spends at its delay point is nowhere dense and forms a positive measure Cantor set. The domain of definition of the generator has restrictions involving second derivatives. In this article we provide a pathwise characterization for processes with delay in terms of an SDE and an occupation time formula involving the symmetric local time. This characterization provides an explicit Doob-Meyer decomposition, demonstrating that such processes are semi-martingales and that all of stochastic calculus including Itô formula and Girsanov formula applies. We also establish an occupation time formula linking the time that the process spends at a delay point with its symmetric local time there. A physical example of a stochastic dynamical system with delay is lastly presented and analyzed.

preprint2016arXiv

Statistical Inference for Perturbed Multiscale Dynamical Systems

We study statistical inference for small-noise-perturbed multiscale dynamical systems. We prove consistency, asymptotic normality, and convergence of all scaled moments of an appropriately-constructed maximum likelihood estimator (MLE) for a parameter of interest, identifying precisely its limiting variance. We allow full dependence of coefficients on both slow and fast processes, which take values in the full Euclidean space; coefficients in the equation for the slow process need not be bounded and there is no assumption of periodic dependence. The results provide a theoretical basis for calibration of small-noise-perturbed multiscale dynamical systems. Data from numerical simulations are presented to illustrate the theory.

preprint2016arXiv

The pricing of contingent claims and optimal positions in asymptotically complete markets

We study utility indifference prices and optimal purchasing quantities for a contingent claim, in an incomplete semi-martingale market, in the presence of vanishing hedging errors and/or risk aversion. Assuming that the average indifference price converges to a well defined limit, we prove that optimally taken positions become large in absolute value at a specific rate. We draw motivation from and make connections to Large Deviations theory, and in particular, the celebrated Gärtner-Ellis theorem. We analyze a series of well studied examples where this limiting behavior occurs, such as fixed markets with vanishing risk aversion, the basis risk model with high correlation, models of large markets with vanishing trading restrictions and the Black-Scholes-Merton model with either vanishing default probabilities or vanishing transaction costs. Lastly, we show that the large claim regime could naturally arise in partial equilibrium models.

preprint2015arXiv

Escaping from an attractor: Importance sampling and rest points I

We discuss importance sampling schemes for the estimation of finite time exit probabilities of small noise diffusions that involve escape from an equilibrium. A factor that complicates the analysis is that rest points are included in the domain of interest. We build importance sampling schemes with provably good performance both pre-asymptotically, that is, for fixed size of the noise, and asymptotically, that is, as the size of the noise goes to zero, and that do not degrade as the time horizon gets large. Simulation studies demonstrate the theoretical results.

preprint2015arXiv

Quenched Large Deviations for Multiscale Diffusion Processes in Random Environments

We consider multiple time scales systems of stochastic differential equations with small noise in random environments. We prove a quenched large deviations principle with explicit characterization of the action functional. The random medium is assumed to be stationary and ergodic. In the course of the proof we also prove related quenched ergodic theorems for controlled diffusion processes in random environments that are of independent interest. The proof relies entirely on probabilistic arguments, allowing to obtain detailed information on how the rare event occurs. We derive a control, equivalently a change of measure, that leads to the large deviations lower bound. This information on the change of measure can motivate the design of asymptotically efficient Monte Carlo importance sampling schemes for multiscale systems in random environments.

preprint2015arXiv

Rare event simulation for multiscale diffusions in random environments

We consider systems of stochastic differential equations with multiple scales and small noise and assume that the coefficients of the equations are ergodic and stationary random fields. Our goal is to construct provably-efficient importance sampling Monte Carlo methods that allow efficient computation of rare event probabilities or expectations of functionals that can be associated with rare events. Standard Monte Carlo algorithms perform poorly in the small noise limit and hence fast simulations algorithms become relevant. The presence of multiple scales complicates the design and the analysis of efficient importance sampling schemes. An additional complication is the randomness of the environment. We construct explicit changes of measures that are proven to be logarithmic asymptotically efficient with probability one with respect to the random environment (i.e., in the quenched sense). Numerical simulations support the theoretical results.

preprint2014arXiv

Filtering the Maximum Likelihood for Multiscale Problems

Filtering and parameter estimation under partial information for multiscale problems is studied in this paper. After proving mean square convergence of the nonlinear filter to a filter of reduced dimension, we establish that the conditional (on the observations) log-likelihood process has a correction term given by a type of central limit theorem. To achieve this we assume that the operator of the (hidden) fast process has a discrete spectrum and an orthonormal basis of eigenfunctions. Based on these results, we then propose to estimate the unknown parameters of the model based on the limiting log-likelihood, which is an easier function to optimize because it of reduced dimension. We also establish consistency and asymptotic normality of the maximum likelihood estimator based on the reduced log-likelihood. Simulation results illustrate our theoretical findings.

preprint2014arXiv

Non-asymptotic performance analysis of importance sampling schemes for small noise diffusions

In this note we develop a prelimit analysis of performance measures for importance sampling schemes related to small noise diffusion processes. In importance sampling the performance of any change of measure is characterized by its second moment. For a given change of measure, we characterize the second moment of the corresponding estimator as the solution to a PDE, which we analyze via a full asymptotic expansion with respect to the size of the noise and obtain a precise statement on its accuracy. The main correction term to the decay rate of the second moment solves a transport equation that can be solved explicitly. The asymptotic expansion that we obtain identifies the source of possible poor performance of nevertheless asymptotically optimal importance sampling schemes and allows for more accurate comparison among competing importance sampling schemes.

preprint2013arXiv

Default clustering in large portfolios: Typical events

We develop a dynamic point process model of correlated default timing in a portfolio of firms, and analyze typical default profiles in the limit as the size of the pool grows. In our model, a firm defaults at a stochastic intensity that is influenced by an idiosyncratic risk process, a systematic risk process common to all firms, and past defaults. We prove a law of large numbers for the default rate in the pool, which describes the "typical" behavior of defaults.

preprint2012arXiv

Large Deviations and Importance Sampling for Systems of Slow-Fast Motion

In this paper we develop the large deviations principle and a rigorous mathematical framework for asymptotically efficient importance sampling schemes for general, fully dependent systems of stochastic differential equations of slow and fast motion with small noise in the slow component. We assume periodicity with respect to the fast component. Depending on the interaction of the fast scale with the smallness of the noise, we get different behavior. We examine how one range of interaction differs from the other one both for the large deviations and for the importance sampling. We use the large deviations results to identify asymptotically optimal importance sampling schemes in each case. Standard Monte Carlo schemes perform poorly in the small noise limit. In the presence of multiscale aspects one faces additional difficulties and straightforward adaptation of importance sampling schemes for standard small noise diffusions will not produce efficient schemes. It turns out that one has to consider the so called cell problem from the homogenization theory for Hamilton-Jacobi-Bellman equations in order to guarantee asymptotic optimality. We use stochastic control arguments.

preprint2011arXiv

Importance Sampling for Multiscale Diffusions

We construct importance sampling schemes for stochastic differential equations with small noise and fast oscillating coefficients. Standard Monte Carlo methods perform poorly for these problems in the small noise limit. With multiscale processes there are additional complications, and indeed the straightforward adaptation of methods for standard small noise diffusions will not produce efficient schemes. Using the subsolution approach we construct schemes and identify conditions under which the schemes will be asymptotically optimal. Examples and simulation results are provided.

preprint2011arXiv

Large Deviations Principle for a Large Class of One-Dimensional Markov Processes

We study the large deviations principle for one dimensional, continuous, homogeneous, strong Markov processes that do not necessarily behave locally as a Wiener process. Any strong Markov process $X_{t}$ in $\mathbb{R}$ that is continuous with probability one, under some minimal regularity conditions, is governed by a generalized elliptic operator $D_{v}D_{u}$, where $v$ and $u$ are two strictly increasing functions, $v$ is right continuous and $u$ is continuous. In this paper, we study large deviations principle for Markov processes whose infinitesimal generator is $εD_{v}D_{u}$ where $0<ε\ll 1$. This result generalizes the classical large deviations results for a large class of one dimensional "classical" stochastic processes. Moreover, we consider reaction-diffusion equations governed by a generalized operator $D_{v}D_{u}$. We apply our results to the problem of wave front propagation for these type of reaction-diffusion equations.

preprint2011arXiv

Recovery Rates in investment-grade pools of credit assets: A large deviations analysis

We consider the effect of recovery rates on a pool of credit assets. We allow the recovery rate to depend on the defaults in a general way. Using the theory of large deviations, we study the structure of losses in a pool consisting of a continuum of types. We derive the corresponding rate function and show that it has a natural interpretation as the favored way to rearrange recoveries and losses among the different types. Numerical examples are also provided.

preprint2010arXiv

A Note on the Smoluchowski-Kramers Approximation for the Langevin Equation with Reflection

According to the Smoluchowski-Kramers approximation, the solution of the equation $μ\ddot{q}^μ_t=b(q^μ_t)-\dot{q}^μ_t+Σ(q^μ_t)\dot{W}_t, q^μ_0=q, \dot{q}^μ_0=p$ converges to the solution of the equation $\dot{q}_t=b(q_t)+Σ(q_t)\dot{W}_t, q_0=q$ as μ->0. We consider here a similar result for the Langevin process with elastic reflection on the boundary.

preprint2010arXiv

Large Deviations for Multiscale Diffusions via Weak Convergence Methods

We study the large deviations principle for locally periodic stochastic differential equations with small noise and fast oscillating coefficients. There are three possible regimes depending on how fast the intensity of the noise goes to zero relative to the homogenization parameter. We use weak convergence methods which provide convenient representations for the action functional for all three regimes. Along the way we study weak limits of related controlled SDEs with fast oscillating coefficients and derive, in some cases, a control that nearly achieves the large deviations lower bound at the prelimit level. This control is useful for designing efficient importance sampling schemes for multiscale diffusions driven by small noise.

preprint2010arXiv

Method of Moments Estimation of Ornstein-Uhlenbeck Processes Driven by General Lévy Process

Ornstein-Uhlenbeck processes driven by general Lévy process are considered in this paper. We derive strongly consistent estimators for the moments of the underlying Lévy process and for the mean reverting parameter of the Ornstein-Uhlenbeck process. Moreover, we prove that the estimators are asymptotically normal. Finally, we test the empirical performance of our estimators in a simulation study and we fit the model to real data.

preprint2010arXiv

Reaction Diffusion Equations with Nonlinear Boundary Conditions in Narrow Domains

Second initial boundary problem in narrow domains of width $ε\ll 1$ for linear second order differential equations with nonlinear boundary conditions is considered in this paper. Using probabilistic methods we show that the solution of such a problem converges as $ε\downarrow 0$ to the solution of a standard reaction-diffusion equation in a domain of reduced dimension. This reduction allows to obtain some results concerning wave front propagation in narrow domains. In particular, we describe conditions leading to jumps of the wave front.

preprint2010arXiv

Wiener Process with Reflection in Non-Smooth Narrow Tubes

Wiener process with instantaneous reflection in narrow tubes of width ε<<1 around axis x is considered in this paper. The tube is assumed to be (asymptotically) non-smooth in the following sense. Let $V^ε(x)$ be the volume of the cross-section of the tube. We assume that $V^ε(x)/ε$ converges in an appropriate sense to a non-smooth function as ε->0. This limiting function can be composed by smooth functions, step functions and also the Dirac delta distribution. Under this assumption we prove that the x-component of the Wiener process converges weakly to a Markov process that behaves like a standard diffusion process away from the points of discontinuity and has to satisfy certain gluing conditions at the points of discontinuity.

Konstantinos Spiliopoulos

What is connected

Connect this record

See the researcher in context

Building this map preview

34 published item(s)

Convergence Analysis of Newton's Method for Neural Networks in the Overparameterized Limit

Kernel Limit for a Class of Recurrent Neural Networks Trained on Ergodic Data Sequences

Disentangling positive and negative partisanship in social media interactions using a coevolving latent space network with attractors model

Geometry-informed irreversible perturbations for accelerated convergence of Langevin dynamics

Moderate deviations for systems of slow-fast stochastic reaction-diffusion equations

Normalization effects on deep neural networks

Normalization effects on shallow neural networks and related asymptotic expansions

Online Adjoint Methods for Optimization of PDEs

Rate of homogenization for fully-coupled McKean-Vlasov SDEs

Scaling Limit of Neural Networks with the Xavier Initialization and Convergence to a Global Minimum

Importance sampling for slow-fast diffusions based on moderate deviations

Network effects in default clustering for large systems

Selection of quasi-stationary states in the stochastically forced Navier-Stokes equation on the torus

Typical dynamics and fluctuation analysis of slow-fast systems driven by fractional Brownian motion

Improving the convergence of reversible samplers

Indifference pricing for Contingent Claims: Large Deviations Effects

Markov processes with spatial delay: path space characterization, occupation time and properties

Statistical Inference for Perturbed Multiscale Dynamical Systems

The pricing of contingent claims and optimal positions in asymptotically complete markets

Escaping from an attractor: Importance sampling and rest points I

Quenched Large Deviations for Multiscale Diffusion Processes in Random Environments

Rare event simulation for multiscale diffusions in random environments

Filtering the Maximum Likelihood for Multiscale Problems

Non-asymptotic performance analysis of importance sampling schemes for small noise diffusions

Default clustering in large portfolios: Typical events

Large Deviations and Importance Sampling for Systems of Slow-Fast Motion

Importance Sampling for Multiscale Diffusions

Large Deviations Principle for a Large Class of One-Dimensional Markov Processes

Recovery Rates in investment-grade pools of credit assets: A large deviations analysis

A Note on the Smoluchowski-Kramers Approximation for the Langevin Equation with Reflection

Large Deviations for Multiscale Diffusions via Weak Convergence Methods

Method of Moments Estimation of Ornstein-Uhlenbeck Processes Driven by General Lévy Process

Reaction Diffusion Equations with Nonlinear Boundary Conditions in Narrow Domains

Wiener Process with Reflection in Non-Smooth Narrow Tubes