Source author record

Serdar Yuksel

Serdar Yuksel appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Information Theory math.IT math.PR Systems and Control Machine Learning

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes

In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of optimal policies have in general been established via converting the original partially observed stochastic control problem to a fully observed one on the belief space, leading to a belief-MDP. However, computing an optimal policy for this fully observed model, and so for the original POMDP, using classical dynamic or linear programming methods is challenging even if the original system has finite state and action spaces, since the state space of the fully observed belief-MDP model is always uncountable. Furthermore, there exist very few rigorous value function approximation and optimal policy approximation results, as regularity conditions needed often require a tedious study involving the spaces of probability measures leading to properties such as Feller continuity. In this paper, we study a planning problem for POMDPs where the system dynamics and measurement channel model are assumed to be known. We construct an approximate belief model by discretizing the belief space using only finite window information variables. We then find optimal policies for the approximate model and we rigorously establish near optimality of the constructed finite window control policies in POMDPs under mild non-linear filter stability conditions and the assumption that the measurement and action sets are finite (and the state space is real vector valued). We also establish a rate of convergence result which relates the finite window memory size and the approximation error bound, where the rate of convergence is exponential under explicit and testable exponential filter stability conditions. While there exist many experimental results and few rigorous asymptotic convergence results, an explicit rate of convergence result is new in the literature, to our knowledge.

preprint2022arXiv

Zero-Delay Lossy Coding of Linear Vector Markov Sources: Optimality of Stationary Codes and Near Optimality of Finite Memory Codes

Optimal zero-delay coding (quantization) of $\mathbb{R}^d$-valued linearly generated Markov sources is studied under quadratic distortion. The structure and existence of deterministic and stationary coding policies that are optimal for the infinite horizon average cost (distortion) problem are established. Prior results studying the optimality of zero-delay codes for Markov sources for infinite horizons either considered finite alphabet sources or, for the $\mathbb{R}^d$-valued case, only showed the existence of deterministic and non-stationary Markov coding policies or those which are randomized. In addition to existence results, for finite blocklength (horizon) $T$ the performance of an optimal coding policy is shown to approach the infinite time horizon optimum at a rate $O(\frac{1}{T})$. This gives an explicit rate of convergence that quantifies the near-optimality of finite window (finite-memory) codes among all optimal zero-delay codes.

preprint2021arXiv

Ergodicity Conditions For Controlled Stochastic Non-Linear Systems Under Information Constraints

Consider a stochastic nonlinear system controlled over a possibly noisy communication channel. An important problem is to characterize the largest class of channels for which there exist coding and control policies so that the closed-loop system is stochastically stable. In this paper, we consider the stability notion of (asymptotic) ergodicity. We prove lower bounds on the channel capacity necessary to achieve the stability criterion. Under mild technical assumptions, we obtain that the necessary channel capacity is lower bounded by the log-determinant of the linearization, double-averaged over the state and noise space. We prove this bound by introducing a modified version of invariance entropy and utilizing the almost sure convergence of sample paths guaranteed by the pointwise ergodic theorem. The fundamental bounds obtained generalize well-known formulas for linear systems, and are in some cases more refined than those obtained for nonlinear systems via information-theoretic methods.

preprint2020arXiv

Exponential Filter Stability via Dobrushin's Coefficient

Filter stability is a classical problem in the study of partially observed Markov processes (POMP), also known as hidden Markov models (HMM). For a POMP, an incorrectly initialized non-linear filter is said to be (asymptotically) stable if the filter eventually corrects itself as more measurements are collected. Filter stability results in the literature that provide rates of convergence typically rely on very restrictive mixing conditions on the transition kernel and measurement kernel pair, and do not consider their effects independently. In this paper, we introduce an alternative approach using the Dobrushin coefficients associated with both the transition kernel as well as the measurement channel. Such a joint study, which seems to have been unexplored, leads to a concise analysis that can be applied to more general system models under relaxed conditions: in particular, we show that if $(1 - δ(T))(2-δ(Q)) < 1$, where $δ(T)$ and $δ(Q)$ are the Dobrushin coefficients for the transition and the measurement kernels, then the filter is exponentially stable. Our findings are also applicable for controlled models.

preprint2014arXiv

On the Existence of Optimal Policies for a Class of Static and Sequential Dynamic Teams

In this paper, we identify sufficient conditions under which static teams and a class of sequential dynamic teams admit team-optimal solutions. We first investigate the existence of optimal solutions in static teams where the observations of the decision makers are conditionally independent or satisfy certain regularity conditions. Building on these findings and the static reduction method of Witsenhausen, we then extend the analysis to sequential dynamic teams. In particular, we show that a large class of dynamic LQG team problems, including the vector version of the well-known Witsenhausen's counterexample and the Gaussian relay channel problem viewed as a dynamic team, admit team-optimal solutions. Results in this paper substantially broaden the class of stochastic control and team problems with non-classical information known to have optimal solutions.

preprint2013arXiv

Stabilization of Linear Systems Over Gaussian Networks

The problem of remotely stabilizing a noisy linear time invariant plant over a Gaussian relay network is addressed. The network is comprised of a sensor node, a group of relay nodes and a remote controller. The sensor and the relay nodes operate subject to an average transmit power constraint and they can cooperate to communicate the observations of the plant's state to the remote controller. The communication links between all nodes are modeled as Gaussian channels. Necessary as well as sufficient conditions for mean-square stabilization over various network topologies are derived. The sufficient conditions are in general obtained using delay-free linear policies and the necessary conditions are obtained using information theoretic tools. Different settings where linear policies are optimal, asymptotically optimal (in certain parameters of the system) and suboptimal have been identified. For the case with noisy multi-dimensional sources controlled over scalar channels, it is shown that linear time varying policies lead to minimum capacity requirements, meeting the fundamental lower bound. For the case with noiseless sources and parallel channels, non-linear policies which meet the lower bound have been identified.

preprint2013arXiv

Stochastic Stability of Event-triggered Anytime Control

We investigate control of a non-linear process when communication and processing capabilities are limited. The sensor communicates with a controller node through an erasure channel which introduces i.i.d. packet dropouts. Processor availability for control is random and, at times, insufficient to calculate plant inputs. To make efficient use of communication and processing resources, the sensor only transmits when the plant state lies outside a bounded target set. Control calculations are triggered by the received data. If a plant state measurement is successfully received and while the processor is available for control, the algorithm recursively calculates a sequence of tentative plant inputs, which are stored in a buffer for potential future use. This safeguards for time-steps when the processor is unavailable for control. We derive sufficient conditions on system parameters for stochastic stability of the closed loop and illustrate performance gains through numerical studies.

preprint2007arXiv

On the error exponent of variable-length block-coding schemes over finite-state Markov channels with feedback

The error exponent of Markov channels with feedback is studied in the variable-length block-coding setting. Burnashev's classic result is extended and a single letter characterization for the reliability function of finite-state Markov channels is presented, under the assumption that the channel state is causally observed both at the transmitter and at the receiver side. Tools from stochastic control theory are used in order to treat channels with intersymbol interference. In particular the convex analytical approach to Markov decision processes is adopted to handle problems with stopping time horizons arising from variable-length coding schemes.

Serdar Yuksel

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes

Zero-Delay Lossy Coding of Linear Vector Markov Sources: Optimality of Stationary Codes and Near Optimality of Finite Memory Codes

Ergodicity Conditions For Controlled Stochastic Non-Linear Systems Under Information Constraints

Exponential Filter Stability via Dobrushin's Coefficient

On the Existence of Optimal Policies for a Class of Static and Sequential Dynamic Teams

Stabilization of Linear Systems Over Gaussian Networks

Stochastic Stability of Event-triggered Anytime Control

On the error exponent of variable-length block-coding schemes over finite-state Markov channels with feedback