Source author record

Debraj Chakraborty

Debraj Chakraborty appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Science and Game Theory Formal Languages and Automata Theory Logic in Computer Science math.OC Multiagent Systems Systems and Control

Catalog footprint

What is connected

3works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Synthesizing POMDP Policies: Sampling Meets Model-checking via Learning

Partially Observable Markov Decision Processes (POMDPs) are the standard framework for decision-making under uncertainty. While sampling-based methods scale well, they lack formal correctness guarantees, making them unsuitable for safety-critical applications. Conversely, formal synthesis techniques provide correctness-by-construction but often struggle with scalability, as general POMDP synthesis is undecidable. To bridge this gap, we propose a synthesis framework that integrates sampling, automata learning, and model-checking. Inspired by Angluin's $L^*$ algorithm, our approach utilizes sampling as a membership oracle and model-checking as an equivalence oracle. This enables the synthesis of finite-state controllers with formal guarantees, provided the sampling-induced policy is regular. We establish a relative completeness result for this framework. Experimental results from our prototypical implementation demonstrate that this method successfully solves threshold-safety problems that remain challenging for existing formal synthesis tools. We believe our algorithm serves as a valuable component in a portfolio approach to tackling the inherent difficulty of POMDP synthesis problems.

preprint2020arXiv

Monte Carlo Tree Search guided by Symbolic Advice for MDPs

In this paper, we consider the online computation of a strategy that aims at optimizing the expected average reward in a Markov decision process. The strategy is computed with a receding horizon and using Monte Carlo tree search (MCTS). We augment the MCTS algorithm with the notion of symbolic advice, and show that its classical theoretical guarantees are maintained. Symbolic advice are used to bias the selection and simulation strategies of MCTS. We describe how to use QBF and SAT solvers to implement symbolic advice in an efficient way. We illustrate our new algorithm using the popular game Pac-Man and show that the performances of our algorithm exceed those of plain MCTS as well as the performances of human players.

preprint2013arXiv

Formation control with pole placement for multi-agent systems

The problem of distributed controller synthesis for formation control of multi-agent systems is considered. The agents (single integrators) communicate over a communication graph and a decentralized linear feedback structure is assumed. One of the agents is designated as the leader. If the communication graph contains a directed spanning tree with the leader node as the root, then it is possible to place the poles of the ensemble system with purely local feedback controller gains. Given a desired formation, first one of the poles is placed at the origin. Then it is shown that the inter-agent weights can be independently adjusted to assign an eigenvector corresponding to the formation positions, to the zero eigenvalue. Then, only the leader input is enough to bring the agents to the desired formation and keep it there with no further inputs. Moreover, given a formation, the computation of the inter-agent weights that encode the formation information, can be calculated in a decentralized fashion using only local information.