Source author record

George Yin

George Yin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

math.PR, math.OC, eess.SY, Machine Learning, math.DS, Systems and Control, eess.SP, Information Theory, math.IT, math.ST, q-fin.CP, q-fin.PR, q-fin.RM, Statistics Theory

Catalog footprint

What is connected

22 works

14 topics

4 close collaborators

preprint, 2022, arXiv

On an Ergodic Two-Sided Singular Control Problem

Motivated by applications in natural resource management, risk management, and finance, this paper is focused on an ergodic two-sided singular control problem for a general one-dimensional diffusion process. The control is given by a bounded variation process. Under some mild conditions, the optimal reward value as well as an optimal control policy are derived by the vanishing discount method. Moreover, the Abelian and Cesàro limits are established. Then a direct solution approach is provided at the end of the paper.

preprint, 2021, arXiv

Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms

Inverse reinforcement learning (IRL) aims to estimate the reward function of optimizing agents by observing their response (estimates or actions). This paper considers IRL when noisy estimates of the gradient of a reward function generated by multiple stochastic gradient agents are observed. We present a generalized Langevin dynamics algorithm to estimate the reward function $R(θ)$; specifically, the resulting Langevin algorithm asymptotically generates samples from the distribution proportional to $\exp(R(θ))$. The proposed IRL algorithms use kernel-based passive learning schemes. We also construct multi-kernel passive Langevin algorithms for IRL which are suitable for high dimensional data. The performance of the proposed IRL algorithms is illustrated on examples in adaptive Bayesian learning, logistic regression (high dimensional problem) and constrained Markov decision processes. We prove weak convergence of the proposed IRL algorithms using martingale averaging methods. We also analyze the tracking performance of the IRL algorithms in non-stationary environments where the utility function $R(θ)$ jumps over time according to a slow Markov chain.
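As a rough illustration of what "generates samples from the distribution proportional to $\exp(R(θ))$" means, the sketch below runs the classical unadjusted Langevin iteration on a toy reward. It is not the paper's kernel-based passive algorithm: the gradient is evaluated exactly at the current iterate, and the reward $R(θ) = -θ^2/2$, the step size, the burn-in length and all function names are assumptions chosen for the example.

```python
import math
import random


def langevin_samples(grad_R, theta0, step, n_steps, seed=0):
    """Classical unadjusted Langevin iteration targeting a density ~ exp(R(theta))."""
    rng = random.Random(seed)
    theta = theta0
    samples = []
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        # Gradient ascent on R plus injected Gaussian noise of variance 2 * step.
        theta = theta + step * grad_R(theta) + math.sqrt(2.0 * step) * noise
        samples.append(theta)
    return samples


def grad_R(theta):
    # Toy reward (assumption): R(theta) = -theta^2 / 2, so exp(R) is Gaussian-shaped.
    return -theta


samples = langevin_samples(grad_R, theta0=3.0, step=0.05, n_steps=50_000)
kept = samples[5_000:]  # discard burn-in
mean = sum(kept) / len(kept)
var = sum((s - mean) ** 2 for s in kept) / len(kept)
print(f"mean={mean:.3f} var={var:.3f}")
```

For this toy reward the target is a standard normal, so the empirical mean should be near 0 and the variance near 1, up to the discretization bias of the finite step size.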

preprint, 2021, arXiv

Multi-kernel Passive Stochastic Gradient Algorithms and Transfer Learning

This paper develops a novel passive stochastic gradient algorithm. In passive stochastic approximation, the stochastic gradient algorithm does not have control over the location where noisy gradients of the cost function are evaluated. Classical passive stochastic gradient algorithms use a kernel that approximates a Dirac delta to weigh the gradients based on how far they are evaluated from the desired point. In this paper we construct a multi-kernel passive stochastic gradient algorithm. The algorithm performs substantially better in high dimensional problems and incorporates variance reduction. We analyze the weak convergence of the multi-kernel algorithm and its rate of convergence. In numerical examples, we study the multi-kernel version of the passive least mean squares (LMS) algorithm for transfer learning to compare the performance with the classical passive version.
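The classical single-kernel scheme described above can be sketched on a one-dimensional toy problem. This is a minimal illustration under stated assumptions, not the paper's multi-kernel algorithm: the quadratic cost, the uniform distribution of evaluation points, the Gaussian kernel, the bandwidth and the step size are all hypothetical choices.

```python
import math
import random


def gaussian_kernel(u):
    """Standard Gaussian density, used as a smooth stand-in for a Dirac delta."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def passive_sgd(n_steps=40_000, step=0.02, bandwidth=0.3, seed=1):
    rng = random.Random(seed)
    theta = 0.0
    target = 2.0  # minimizer of the toy cost C(x) = (x - target)^2 / 2 (assumption)
    for _ in range(n_steps):
        # Passive setting: the evaluation point x is drawn externally,
        # the algorithm cannot choose it.
        x = rng.uniform(-1.0, 5.0)
        noisy_grad = (x - target) + rng.gauss(0.0, 0.5)
        # Weight the gradient by how close x is to the current estimate.
        weight = gaussian_kernel((x - theta) / bandwidth) / bandwidth
        theta -= step * weight * noisy_grad
    return theta


estimate = passive_sgd()
print(f"estimate={estimate:.3f}")
```

Gradients observed far from the current estimate receive almost no weight, so only a small fraction of the observations drive each update. That inefficiency grows with dimension, which is the issue the abstract says the multi-kernel construction addresses.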

preprint, 2021, arXiv

Optimal Control and Numerical Methods for Hybrid Stochastic SIS Models

This work focuses on optimal controls of a class of stochastic SIS epidemic models under regime switching. By assuming that a decision maker can either influence the infectivity period or isolate infected individuals, our aim is to minimize the expected discounted cost due to illness, medical treatment, and the adverse effect on society. In addition, a model that incorporates vaccination is proposed. Numerical schemes are developed by approximating the continuous-time dynamics using Markov chain approximation methods. It is demonstrated that the approximation schemes converge to the optimal strategy as the mesh size goes to zero. Numerical examples are provided to illustrate our results.

preprint, 2020, arXiv

Deep Filtering

This paper develops a deep learning method for linear and nonlinear filtering. The idea is to start with a nominal dynamic model and generate Monte Carlo sample paths. Then these samples are used to train a deep neural network. A least square error is used as a loss function for network training. Then the resulting weights are applied to Monte Carlo samples from an actual dynamic model. The deep filter obtained in such a way compares favorably to the traditional Kalman filter in linear cases and the extended Kalman filter in nonlinear cases. Moreover, a switching model with jumps is studied to show the adaptiveness and power of our deep filtering method. A main advantage of deep filtering is its robustness when the nominal model and actual model differ. Another advantage of deep filtering is that real data can be used directly to train the deep neural network. Therefore, one does not need to calibrate the model.

preprint, 2020, arXiv

General Nonlinear Stochastic Systems Motivated by Chemostat Models: Complete Characterization of Long-Time Behavior, Optimal Controls, and Applications to Wastewater Treatment

The paper considers a chemostat model describing an activated sludge process in wastewater treatment. The model is assumed to be subject to environmental noise in terms of both white noise and colored noise. The paper fully characterizes the asymptotic behavior of the model, which is a hybrid switching diffusion. We show that the long-term properties of the system can be classified using a value $λ$. More precisely, if $λ\leq 0$, the bacteria in the sewage will die out, which means that the process does not operate. If $λ>0$, the system has an invariant probability measure to which the transition probability of the solution process converges exponentially fast. One of the distinctive contributions of this paper is that the critical case $λ=0$ is considered. Numerical examples are given to illustrate our results.

preprint, 2020, arXiv

Solving A Class of Mean-Field LQG Problems

In this work, we study a class of mean-field linear quadratic Gaussian (LQG) problems. Under suitable conditions, explicit solutions of the distribution-dependent optimal control problems are obtained. Riccati systems are derived by directly solving the associated master equations. Some extensions on controls with partial observations are also considered.

preprint, 2019, arXiv

Analysis of A Spatially Inhomogeneous Stochastic Partial Differential Equation Epidemic Model

This work proposes and analyzes a family of spatially inhomogeneous epidemic models. This is our first effort to use stochastic partial differential equations (SPDEs) to model epidemic dynamics with spatial variations and environmental noise. After setting up the problem, existence and uniqueness of solutions of the underlying SPDEs are examined. Then definitions of permanence and extinction are given. Certain sufficient conditions are provided for the permanence and extinction. Our hope is that this paper will open up windows for investigation of epidemic models from a new angle.

preprint, 2016, arXiv

Coexistence and Exclusion of Stochastic Competitive Lotka-Volterra Models

This work derives sufficient conditions for the coexistence and exclusion of a stochastic competitive Lotka-Volterra model. The conditions obtained are close to the necessary conditions. In addition, convergence in distribution of positive solutions of the model is also established. A number of numerical examples are given to illustrate our results.

preprint, 2016, arXiv

Conditions for Permanence and Ergodicity of Certain Stochastic Predator-Prey Models

This work derives sufficient conditions for the permanence and ergodicity of a stochastic predator-prey model with Beddington-DeAngelis functional response. The conditions obtained are in fact very close to the necessary conditions. Both non-degenerate and degenerate diffusions are considered. One of the distinctive features of our results is that they enable characterization of the support of a unique invariant probability measure. Convergence in total variation norm of the transition probability to the invariant measure is also proved. Comparisons to the existing literature and related matters for other stochastic predator-prey models are also given.

preprint, 2016, arXiv

Two-time-scale stochastic partial differential equations driven by $α$-stable noises: Averaging principles

This paper focuses on stochastic partial differential equations (SPDEs) under a two-time-scale formulation. Distinct from the work in the existing literature, the systems are driven by $α$-stable processes with $α\in(1,2)$. In addition, the SPDEs are either modulated by a continuous-time Markov chain with a finite state space or have an additional fast jump component. The Markov chain is included to treat random environments, whereas the addition of the fast jump process enables the consideration of discontinuity in the sample paths of the fast processes. Assuming either a fast changing Markov switching or an additional fast-varying jump process, this work aims to obtain the averaging principles for such systems. There are several distinct difficulties. First, the noise is not square integrable. Second, in our setup, the underlying SPDE has only a unique mild solution, and as a result only a mild Itô formula can be used. Moreover, another new aspect is the addition of the fast regime switching and of the fast varying jump processes in the formulation, which enlarges the applicability of the underlying systems. To overcome these difficulties, a semigroup approach is taken. Under suitable conditions, it is proved that the $p$th moment convergence takes place with $p\in(1,α)$, which is stronger than the usual weak convergence approaches.

preprint, 2015, arXiv

A Strong Limit Theorem for Two-Time-Scale Functional Stochastic Differential Equations

This paper focuses on a class of two-time-scale functional stochastic differential equations, where the phase space of the segment processes is infinite-dimensional. It develops ergodicity of the fast component and obtains a strong limit theorem for the averaging principle in the spirit of Khasminskii's averaging approach for the slow component.

preprint, 2014, arXiv

Adaptive Search Algorithms for Discrete Stochastic Optimization: A Smooth Best-Response Approach

This paper considers simulation-based optimization of the performance of a regime-switching stochastic system over a finite set of feasible configurations. Inspired by the stochastic fictitious play learning rules in game theory, we propose an adaptive simulation-based search algorithm that uses a smooth best-response sampling strategy and tracks the set of global optima, yet distributes the search so that most of the effort is spent on simulating the system performance at the global optima. The algorithm converges weakly to the set of global optima even when the observation data is correlated (as long as a weak law of large numbers holds). Numerical examples show that the proposed scheme yields a faster convergence for finite sample lengths compared with several existing random search and pure exploration methods in the literature.

preprint, 2014, arXiv

Analyzing Convergence and Rates of Convergence of Particle Swarm Optimization Algorithms Using Stochastic Approximation Methods

Recently, much progress has been made on particle swarm optimization (PSO). A number of works have been devoted to analyzing the convergence of the underlying algorithms. Nevertheless, in most cases, rather simplified hypotheses are used. For example, it is often assumed that the swarm has only one particle. In addition, more often than not, the variables and the points of attraction are assumed to remain constant throughout the optimization process. In reality, such assumptions are often violated. Moreover, there are no rigorous rates of convergence results available to date for the particle swarm, to the best of our knowledge. In this paper, we consider a general form of PSO algorithms, and analyze asymptotic properties of the algorithms using stochastic approximation methods. We introduce four coefficients and rewrite the PSO procedure as a stochastic approximation type iterative algorithm. Then we analyze its convergence using weak convergence methods. It is proved that a suitably scaled sequence of swarms converges to the solution of an ordinary differential equation. We also establish certain stability results. Moreover, convergence rates are ascertained by using weak convergence methods. A centered and scaled sequence of the estimation errors is shown to have a diffusion limit.
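For readers unfamiliar with the procedure being analyzed, a textbook PSO update can be sketched as follows. This is the common inertia-weight form with multiple particles and moving points of attraction, not the paper's four-coefficient reformulation; the sphere cost, the swarm size and the coefficient values are assumptions picked for illustration.

```python
import random


def sphere(x):
    """Toy cost (assumption): squared Euclidean norm, minimized at the origin."""
    return sum(v * v for v in x)


def pso(cost, dim=2, n_particles=20, n_iters=200,
        inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    rng = random.Random(seed)
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]          # each particle's best point so far
    best_val = [cost(p) for p in pos]
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g][:], best_val[g]  # swarm-wide best so far
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity mixes momentum with random pulls toward the
                # personal best and the global best (the points of attraction).
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (best_pos[i][d] - pos[i][d])
                             + c_global * r2 * (g_pos[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = cost(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val


best_x, best_f = pso(sphere)
print(best_x, best_f)
```

Note that the points of attraction (`best_pos` and `g_pos`) change as the run proceeds, which is exactly the feature the abstract says simplified analyses tend to assume away.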

preprint, 2014, arXiv

Mean-Variance Type Controls Involving a Hidden Markov Chain: Models and Numerical Approximation

Motivated by applications arising in networked systems, this work examines controlled regime-switching systems that stem from a mean-variance formulation. A main point is that the switching process is a hidden Markov chain. An additional piece of information, namely, a noisy observation of the switching process corrupted by white noise, is available. We focus on minimizing the variance subject to a fixed terminal expectation. Using the Wonham filter, we first convert the partially observed system to a completely observable one. Since closed-form solutions are virtually impossible to obtain, a Markov chain approximation method is used to devise a computational scheme. Convergence of the algorithm is obtained. A numerical example is provided to demonstrate the results.

preprint, 2013, arXiv

Exponential Mixing for Retarded Stochastic Differential Equations

In this paper, we discuss the exponential mixing property for Markovian semigroups generated by segment processes associated with several classes of retarded stochastic differential equations (SDEs), which cover SDEs with constant/variable/distributed time-lags. In particular, we investigate the exponential mixing property for (a) non-autonomous retarded SDEs by the Arzelà-Ascoli tightness characterization of the space $\C$ equipped with the uniform topology, (b) neutral SDEs with continuous sample paths by a generalized Razumikhin-type argument and a stability-in-distribution approach, and (c) jump-diffusion retarded SDEs by the Kurtz criterion of tightness for the space $\D$ endowed with the Skorohod topology.

preprint, 2013, arXiv

Stationary Distributions for Retarded Stochastic Differential Equations without Dissipativity

Retarded stochastic differential equations (SDEs) constitute a large collection of systems arising in various real-life applications. Most of the existing results make crucial use of dissipative conditions. For "pure delay" systems, in which both the drift and the diffusion coefficients depend only on the arguments with delays, the existing results are no longer applicable. This work uses a variation-of-constants formula to overcome the difficulties due to the lack of information at the current time. This paper establishes existence and uniqueness of stationary distributions for retarded SDEs that need not satisfy dissipative conditions. The retarded SDEs considered in this paper also cover SDEs of neutral type and SDEs driven by Lévy processes that might not admit finite second moments.

preprint, 2013, arXiv

Tracking the Empirical Distribution of a Markov-modulated Duplication-Deletion Random Graph

This paper considers a Markov-modulated duplication-deletion random graph where at each time instant, one node can either join or leave the network; the probabilities of joining or leaving evolve according to the realization of a finite state Markov chain. The paper comprises two results. First, motivated by social network applications, we analyze the asymptotic behavior of the degree distribution of the Markov-modulated random graph. Using the asymptotic degree distribution, an expression is obtained for the delay in searching such graphs. Second, a stochastic approximation algorithm is presented to track the empirical degree distribution as it evolves over time. The tracking performance of the algorithm is analyzed in terms of mean square error and a functional central limit theorem is presented for the asymptotic tracking error.

preprint, 2013, arXiv

UAV Circumnavigation of an Unknown Target Without Location Information Using Noisy Range-based Measurements

This paper proposes a control algorithm for a UAV to circumnavigate an unknown target at a fixed radius when the location information of the UAV is unavailable. By assuming that the UAV has a constant velocity, the control algorithm makes adjustments to the heading angle of the UAV based on range and range rate measurements from the target, which may be corrupted by additive measurement noise. The control algorithm has the added benefit of being globally smooth and bounded. Exploiting the relationship between range rate and bearing angle, we transform the system dynamics from Cartesian coordinates in terms of location and heading to polar coordinates in terms of range and bearing angle. We then formulate the addition of measurement errors as a stochastic differential equation. A recurrence result is established showing that the UAV will reach a neighborhood of the desired orbit in finite time. Some statistical measures of performance are obtained to support the technical analysis.

preprint, 2013, arXiv

Weak Convergence Methods for Approximation of Path-dependent Functionals

This paper provides convergence analysis for the approximation of a class of path-dependent functionals underlying a continuous stochastic process. In the first part, given a sequence of weakly convergent processes, we provide a sufficient condition for the convergence of the path-dependent functional underlying the weakly convergent processes to the functional of the original process. In the second part, we study the weak convergence of the Markov chain approximation to the underlying process when it is given by a solution of a stochastic differential equation. Finally, we combine the results of the two parts to approximate the price of a discretely monitored barrier option under a stochastic volatility model. Different from the existing literature, the weak convergence analysis is obtained by means of metric computations in the Skorohod topology together with the continuous mapping theorem. The advantage of this approach is that the functional under study may be a function of stopping times, projection of the underlying diffusion on a sequence of random times, or maximum/minimum of the underlying diffusion.

preprint, 2011, arXiv

Indifference Pricing of American Option Underlying Illiquid Stock under Exponential Forward Performance

This work focuses on the indifference pricing of an American call option underlying a non-traded stock, which may be partially hedgeable by another traded stock. Under the exponential forward measure, the indifference price is formulated as a stochastic singular control problem. The value function is characterized as the unique solution of a partial differential equation in a Sobolev space. Together with some regularities and estimates of the value function, the existence of the optimal strategy is also obtained. The applications of the characterization result include a derivation of a dual representation and the indifference pricing of employee stock options. As a byproduct, a generalized Itô's formula is obtained for functions in a Sobolev space.

preprint, 2011, arXiv

Numerical Solutions of Optimal Risk Control and Dividend Optimization Policies under A Generalized Singular Control Formulation

This paper develops numerical methods for finding optimal dividend pay-out and reinsurance policies. A generalized singular control formulation of the surplus and a discounted payoff function are introduced, where the surplus is modeled by a regime-switching process subject to both regular and singular controls. To approximate the value function and optimal controls, Markov chain approximation techniques are used to construct a discrete-time controlled Markov chain with two components. Proofs of the convergence of the approximation sequence to the surplus process and the value function are given. Examples of proportional and excess-of-loss reinsurance are presented to illustrate the applicability of the numerical methods.
