Researcher profile

Serdar Yuksel

Serdar Yuksel contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 17 (Unverified) | Verification L1 | Unclaimed author
4 works
0 followers
5 topics
4 close collaborators


Published work

4 published items

preprint, 2022, arXiv

Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes

In the theory of Partially Observed Markov Decision Processes (POMDPs), the existence of optimal policies has in general been established by converting the original partially observed stochastic control problem to a fully observed one on the belief space, leading to a belief-MDP. However, computing an optimal policy for this fully observed model, and so for the original POMDP, using classical dynamic or linear programming methods is challenging even if the original system has finite state and action spaces, since the state space of the fully observed belief-MDP model is always uncountable. Furthermore, there exist very few rigorous value function approximation and optimal policy approximation results, as the regularity conditions needed often require a tedious study involving the spaces of probability measures, leading to properties such as Feller continuity. In this paper, we study a planning problem for POMDPs where the system dynamics and measurement channel model are assumed to be known. We construct an approximate belief model by discretizing the belief space using only finite window information variables. We then find optimal policies for the approximate model and we rigorously establish near optimality of the constructed finite window control policies in POMDPs under mild non-linear filter stability conditions and the assumption that the measurement and action sets are finite (and the state space is real vector valued). We also establish a rate of convergence result which relates the finite window memory size and the approximation error bound, where the rate of convergence is exponential under explicit and testable exponential filter stability conditions. While there exist many experimental results and a few rigorous asymptotic convergence results, an explicit rate of convergence result is new in the literature, to our knowledge.
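The finite window idea in this abstract can be illustrated with a minimal sketch. It is not the paper's construction: it assumes a finite state space (the paper allows a real vector valued state), and the kernels `T` and `Q`, the priors, and the action/observation history are all made-up values. The sketch compares the exact belief, computed from the true prior over the full history, with a belief computed from a fixed prior using only the last `window` action/observation pairs.

```python
import numpy as np

def filter_step(belief, action, obs, T, Q):
    """One Bayes filter update: predict with T[action], then correct with Q[:, obs]."""
    predicted = belief @ T[action]
    unnormalized = predicted * Q[:, obs]
    return unnormalized / unnormalized.sum()

def run_filter(prior, history, T, Q):
    """Apply the filter along a list of (action, observation) pairs."""
    belief = np.asarray(prior, dtype=float)
    for action, obs in history:
        belief = filter_step(belief, action, obs, T, Q)
    return belief

def finite_window_belief(fixed_prior, history, window, T, Q):
    """Approximate belief: restart from a fixed prior and use only the last `window` pairs."""
    recent = history[max(len(history) - window, 0):]
    return run_filter(fixed_prior, recent, T, Q)

# Hypothetical model: T[a] is the transition matrix under action a, Q[x, y] = P(y | x).
T = np.array([[[0.7, 0.3], [0.4, 0.6]],
              [[0.5, 0.5], [0.2, 0.8]]])
Q = np.array([[0.9, 0.1],
              [0.2, 0.8]])

history = [(t % 2, (t // 3) % 2) for t in range(30)]  # arbitrary fixed history
true_prior = np.array([0.9, 0.1])
fixed_prior = np.array([0.5, 0.5])

full = run_filter(true_prior, history, T, Q)
# Total variation gap between the exact belief and each finite-window belief.
gaps = {n: 0.5 * np.abs(full - finite_window_belief(fixed_prior, history, n, T, Q)).sum()
        for n in (1, 5, 20)}
print(gaps)
```

For strictly positive kernels such as these, the gap becomes negligible once the window is moderately long, which is the filter stability effect the abstract relies on.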

preprint, 2022, arXiv

Zero-Delay Lossy Coding of Linear Vector Markov Sources: Optimality of Stationary Codes and Near Optimality of Finite Memory Codes

Optimal zero-delay coding (quantization) of $\mathbb{R}^d$-valued linearly generated Markov sources is studied under quadratic distortion. The structure and existence of deterministic and stationary coding policies that are optimal for the infinite horizon average cost (distortion) problem are established. Prior results studying the optimality of zero-delay codes for Markov sources for infinite horizons either considered finite alphabet sources or, for the $\mathbb{R}^d$-valued case, only showed the existence of deterministic and non-stationary Markov coding policies or those which are randomized. In addition to existence results, for finite blocklength (horizon) $T$ the performance of an optimal coding policy is shown to approach the infinite time horizon optimum at a rate $O(\frac{1}{T})$. This gives an explicit rate of convergence that quantifies the near-optimality of finite window (finite-memory) codes among all optimal zero-delay codes.

preprint, 2021, arXiv

Ergodicity Conditions For Controlled Stochastic Non-Linear Systems Under Information Constraints

Consider a stochastic nonlinear system controlled over a possibly noisy communication channel. An important problem is to characterize the largest class of channels for which there exist coding and control policies so that the closed-loop system is stochastically stable. In this paper, we consider the stability notion of (asymptotic) ergodicity. We prove lower bounds on the channel capacity necessary to achieve the stability criterion. Under mild technical assumptions, we obtain that the necessary channel capacity is lower bounded by the log-determinant of the linearization, double-averaged over the state and noise space. We prove this bound by introducing a modified version of invariance entropy and utilizing the almost sure convergence of sample paths guaranteed by the pointwise ergodic theorem. The fundamental bounds obtained generalize well-known formulas for linear systems, and are in some cases more refined than those obtained for nonlinear systems via information-theoretic methods.

preprint, 2020, arXiv

Exponential Filter Stability via Dobrushin's Coefficient

Filter stability is a classical problem in the study of partially observed Markov processes (POMP), also known as hidden Markov models (HMM). For a POMP, an incorrectly initialized non-linear filter is said to be (asymptotically) stable if the filter eventually corrects itself as more measurements are collected. Filter stability results in the literature that provide rates of convergence typically rely on very restrictive mixing conditions on the transition kernel and measurement kernel pair, and do not consider their effects independently. In this paper, we introduce an alternative approach using the Dobrushin coefficients associated with both the transition kernel and the measurement channel. Such a joint study, which seems to have been unexplored, leads to a concise analysis that can be applied to more general system models under relaxed conditions: in particular, we show that if $(1 - \delta(T))(2 - \delta(Q)) < 1$, where $\delta(T)$ and $\delta(Q)$ are the Dobrushin coefficients for the transition and the measurement kernels, then the filter is exponentially stable. Our findings are also applicable to controlled models.
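For finite state and measurement spaces, the condition quoted in this abstract can be checked numerically. The sketch below assumes the standard finite-space form of the Dobrushin coefficient, namely the minimum over pairs of rows of the summed elementwise minimum. The matrices `T` and `Q` and the function names are hypothetical illustrations, not values or code from the paper.

```python
import numpy as np

def dobrushin_coefficient(kernel):
    """Finite-space Dobrushin coefficient: min over row pairs (i, k) of sum_j min(K[i, j], K[k, j])."""
    K = np.asarray(kernel, dtype=float)
    # overlap[i, k] = sum_j min(K[i, j], K[k, j]), computed by broadcasting over row pairs.
    overlap = np.minimum(K[:, None, :], K[None, :, :]).sum(axis=2)
    return float(overlap.min())

def satisfies_stability_condition(T, Q):
    """Check the sufficient condition (1 - delta(T)) * (2 - delta(Q)) < 1 from the abstract."""
    d_T = dobrushin_coefficient(T)
    d_Q = dobrushin_coefficient(Q)
    return (1.0 - d_T) * (2.0 - d_Q) < 1.0

# Hypothetical row-stochastic kernels: T[x, x'] = P(x' | x), Q[x, y] = P(y | x).
T = np.array([[0.7, 0.3],
              [0.4, 0.6]])
Q = np.array([[0.9, 0.1],
              [0.2, 0.8]])

print(dobrushin_coefficient(T), dobrushin_coefficient(Q))
print(satisfies_stability_condition(T, Q))
```

Here the coefficients are 0.7 for `T` and 0.3 for `Q`, so the product is 0.3 × 1.7 = 0.51 and the condition holds. A transition kernel that does not mix at all, such as the identity matrix, has coefficient 0 and fails the condition for any `Q`.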