Researcher profile

George Yin

George Yin contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2022arXiv

On an Ergodic Two-Sided Singular Control Problem

Motivated by applications in natural resource management, risk management, and finance, this paper is focused on an ergodic two-sided singular control problem for a general one-dimensional diffusion process. The control is given by a bounded variation process. Under some mild conditions, the optimal reward value as well as an optimal control policy are derived by the vanishing discount method. Moreover, the Abelian and Cesàro limits are established. Then a direct solution approach is provided at the end of the paper.

preprint2021arXiv

Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms

Inverse reinforcement learning (IRL) aims to estimate the reward function of optimizing agents by observing their response (estimates or actions). This paper considers IRL when noisy estimates of the gradient of a reward function generated by multiple stochastic gradient agents are observed. We present a generalized Langevin dynamics algorithm to estimate the reward function $R(θ)$; specifically, the resulting Langevin algorithm asymptotically generates samples from the distribution proportional to $\exp(R(θ))$. The proposed IRL algorithms use kernel-based passive learning schemes. We also construct multi-kernel passive Langevin algorithms for IRL which are suitable for high dimensional data. The performance of the proposed IRL algorithms are illustrated on examples in adaptive Bayesian learning, logistic regression (high dimensional problem) and constrained Markov decision processes. We prove weak convergence of the proposed IRL algorithms using martingale averaging methods. We also analyze the tracking performance of the IRL algorithms in non-stationary environments where the utility function $R(θ)$ jump changes over time as a slow Markov chain.

preprint2021arXiv

Multi-kernel Passive Stochastic Gradient Algorithms and Transfer Learning

This paper develops a novel passive stochastic gradient algorithm. In passive stochastic approximation, the stochastic gradient algorithm does not have control over the location where noisy gradients of the cost function are evaluated. Classical passive stochastic gradient algorithms use a kernel that approximates a Dirac delta to weigh the gradients based on how far they are evaluated from the desired point. In this paper we construct a multi-kernel passive stochastic gradient algorithm. The algorithm performs substantially better in high dimensional problems and incorporates variance reduction. We analyze the weak convergence of the multi-kernel algorithm and its rate of convergence. In numerical examples, we study the multi-kernel version of the passive least mean squares (LMS) algorithm for transfer learning to compare the performance with the classical passive version.

preprint2021arXiv

Optimal Control and Numerical Methods for Hybrid Stochastic SIS Models

This work focuses on optimal controls of a class of stochastic SIS epidemic models under regime switching. By assuming that a decision maker can either influence the infectivity period or isolate infected individuals, our aim is to minimize the expected discounted cost due to illness, medical treatment, and the adverse effect on the society. In addition, a model with the incorporation of vaccination is proposed. Numerical schemes are developed by approximating the continuous-time dynamics using Markov chain approximation methods. It is demonstrated that the approximation schemes converge to the optimal strategy as the mesh size goes to zero. Numerical examples are provided to illustrate our results.

preprint2020arXiv

Deep Filtering

This paper develops a deep learning method for linear and nonlinear filtering. The idea is to start with a nominal dynamic model and generate Monte Carlo sample paths. Then these samples are used to train a deep neutral network. A least square error is used as a loss function for network training. Then the resulting weights are applied to Monte Carlo sampl\ es from an actual dynamic model. The deep filter obtained in such a way compares favorably to the traditional Kalman filter in linear cases and the extended Kalman filter in nonlinear cases. Moreover, a switching model with jumps is studied to show the adaptiveness and power of our deep filtering method. A main advantage of deep filtering is its robustness when the nominal model and actual model differ. Another advantage of deep filtering is that real data can be used directly to train the deep neutral network. Therefore, one does not need to calibrate the model.

preprint2020arXiv

General Nonlinear Stochastic Systems Motivated by Chemostat Models: Complete Characterization of Long-Time Behavior, Optimal Controls, and Applications to Wastewater Treatment

The paper considers a chemostat model describing an activated sludge process in wastewater treatment. The model is assumed to be subject to environment noise in terms of both white noise and color noise. The paper fully characterizes the asymptotic behavior of the model that is a hybrid switching diffusion. We show that the long-term properties of the system can be classified using a value $λ$. More precisely, if $λ\leq 0$, the bacteria in the sewage will die out, which means that the process does not operate. If $λ>0$, the system has an invariant probability measure to which the transition probability of the solution process converges exponentially fast. One of the distinctive contributions of this paper is that the critical case $λ=0$ is considered. Numerical examples are given to illustrate our results.

preprint2019arXiv

Analysis of A Spatially Inhomogeneous Stochastic Partial Differential Equation Epidemic Model

This work proposes and analyzes a family of spatially inhomogeneous epidemic models. This is our first effort to use stochastic partial differential equations (SPDEs) to model epidemic dynamics with spatial variations and environmental noise. After setting up the problem, existence and uniqueness of solutions of the underlying SPDEs are examined. Then definitions of permanence and extinction are given. Certain sufficient conditions are provided for the permanence and extinction. Our hope is that this paper will open up windows for investigation of epidemic models from a new angle.

preprint2014arXiv

Mean-Variance Type Controls Involving a Hidden Markov Chain: Models and Numerical Approximation

Motivated by applications arising in networked systems, this work examines controlled regime-switching systems that stem from a mean-variance formulation. A main point is that the switching process is a hidden Markov chain. An additional piece of information, namely, a noisy observation of switching process corrupted by white noise is available. We focus on minimizing the variance subject to a fixed terminal expectation. Using the Wonham filter, we convert the partially observed system to a completely observable one first. Since closed-form solutions are virtually impossible be obtained, a Markov chain approximation method is used to devise a computational scheme. Convergence of the algorithm is obtained. A numerical example is provided to demonstrate the results.

preprint2013arXiv

Exponential Mixing for Retarded Stochastic Differential Equations

In this paper, we discuss exponential mixing property for Markovian semigroups generated by segment processes associated with several class of retarded Stochastic Differential Equations (SDEs) which cover SDEs with constant/variable/distributed time-lags. In particular, we investigate the exponential mixing property for (a) non-autonomous retarded SDEs by the Arzelà--Ascoli tightness characterization of the space $\C$ equipped with the uniform topology (b) neutral SDEs with continuous sample paths by a generalized Razumikhin-type argument and a stability-in-distribution approach and (c) jump-diffusion retarded SDEs by the Kurtz criterion of tightness for the space $\D$ endowed with the Skorohod topology.

preprint2013arXiv

Stationary Distributions for Retarded Stochastic Differential Equations without Dissipativity

Retarded stochastic differential equations (SDEs) constitute a large collection of systems arising in various real-life applications. Most of the existing results make crucial use of dissipative conditions. Dealing with "pure delay" systems in which both the drift and the diffusion coefficients depend only on the arguments with delays, the existing results become not applicable. This work uses a variation-of-constants formula to overcome the difficulties due to the lack of the information at the current time. This paper establishes existence and uniqueness of stationary distributions for retarded SDEs that need not satisfy dissipative conditions. The retarded SDEs considered in this paper also cover SDEs of neutral type and SDEs driven by Lévy processes that might not admit finite second moments.

preprint2013arXiv

Tracking the Empirical Distribution of a Markov-modulated Duplication-Deletion Random Graph

This paper considers a Markov-modulated duplication-deletion random graph where at each time instant, one node can either join or leave the network; the probabilities of joining or leaving evolve according to the realization of a finite state Markov chain. The paper comprises of 2 results. First, motivated by social network applications, we analyze the asymptotic behavior of the degree distribution of the Markov-modulated random graph. Using the asymptotic degree distribution, an expression is obtained for the delay in searching such graphs. Second, a stochastic approximation algorithm is presented to track empirical degree distribution as it evolves over time. The tracking performance of the algorithm is analyzed in terms of mean square error and a functional central limit theorem is presented for the asymptotic tracking error.

preprint2013arXiv

UAV Circumnavigation of an Unknown Target Without Location Information Using Noisy Range-based Measurements

This paper proposes a control algorithm for a UAV to circumnavigate an unknown target at a fixed radius when the location information of the UAV is unavailable. By assuming that the UAV has a constant velocity, the control algorithm makes adjustments to the heading angle of the UAV based on range and range rate measurements from the target, which may be corrupted by additive measurement noise. The control algorithm has the added benefit of being globally smooth and bounded. Exploiting the relationship between range rate and bearing angle, we transform the system dynamics from Cartesian coordinate in terms of location and heading to polar coordinate in terms of range and bearing angle. We then formulate the addition of measurement errors as a stochastic differential equation. A recurrence result is established showing that the UAV will reach a neighborhood of the desired orbit in finite time. Some statistical measures of performance are obtained to support the technical analysis.

preprint2013arXiv

Weak Convergence Methods for Approximation of Path-dependent Functionals

This paper provides convergence analysis for the approximation of a class of path-dependent functionals underlying a continuous stochastic process. In the first part, given a sequence of weak convergent processes, we provide a sufficient condition for the convergence of the path-dependent functional underlying weak convergent processes to the functional of the original process. In the second part, we study the weak convergence of Markov chain approximation to the underlying process when it is given by a solution of stochastic differential equation. Finally, we combine the results of the two parts to provide approximation of option pricing for discretely monitoring barrier option underlying stochastic volatility model. Different from the existing literatures, the weak convergence analysis is obtained by means of metric computations in the Skorohod topology together with the continuous mapping theorem. The advantage of this approach is that the functional under study may be a function of stopping times, projection of the underlying diffusion on a sequence of random times, or maximum/minimum of the underlying diffusion.

preprint2011arXiv

Indifference Pricing of American Option Underlying Illiquid Stock under Exponential Forward Performance

This work focuses on the indifference pricing of American call option underlying a non-traded stock, which may be partially hedgeable by another traded stock. Under the exponential forward measure, the indifference price is formulated as a stochastic singular control problem. The value function is characterized as the unique solution of a partial differential equation in a Sobolev space. Together with some regularities and estimates of the value function, the existence of the optimal strategy is also obtained. The applications of the characterization result includes a derivation of a dual representation and the indifference pricing on employee stock option. As a byproduct, a generalized Ito's formula is obtained for functions in a Sobolev space.

preprint2011arXiv

Numerical Solutions of Optimal Risk Control and Dividend Optimization Policies under A Generalized Singular Control Formulation

This paper develops numerical methods for finding optimal dividend pay-out and reinsurance policies. A generalized singular control formulation of surplus and discounted payoff function are introduced, where the surplus is modeled by a regime-switching process subject to both regular and singular controls. To approximate the value function and optimal controls, Markov chain approximation techniques are used to construct a discrete-time controlled Markov chain with two components. The proofs of the convergence of the approximation sequence to the surplus process and the value function are given. Examples of proportional and excess-of-loss reinsurance are presented to illustrate the applicability of the numerical methods.