Source author record

Solmaz S. Kia

Solmaz S. Kia appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


math.OC; Robotics; Systems and Control; Distributed, Parallel, and Cluster Computing; eess.SP; eess.SY; Machine Learning; Multiagent Systems

Catalog footprint

What is connected

9 works

8 topics

4 close collaborators


preprint, 2022, arXiv

Learning Contraction Policies from Offline Data

This paper proposes a data-driven method for learning convergent control policies from offline data using contraction theory. Contraction theory enables constructing a policy that makes the closed-loop system trajectories inherently convergent towards a unique trajectory. At the technical level, identifying the contraction metric, which is the distance metric with respect to which a robot's trajectories exhibit contraction, is often non-trivial. We propose to jointly learn the control policy and its corresponding contraction metric while enforcing contraction. To achieve this, we learn an implicit dynamics model of the robotic system from an offline data set consisting of the robot's state and input trajectories. Using this learned dynamics model, we propose a data augmentation algorithm for learning contraction policies. We randomly generate samples in the state space and propagate them forward in time through the learned dynamics model to generate auxiliary sample trajectories. We then learn both the control policy and the contraction metric such that the distance between the trajectories from the offline data set and our generated auxiliary sample trajectories decreases over time. We evaluate the performance of our proposed framework on simulated robotic goal-reaching tasks and demonstrate that enforcing contraction results in faster convergence and greater robustness of the learned policy.

preprint, 2022, arXiv

Online Target Localization using Adaptive Belief Propagation in the HMM Framework

This paper proposes a novel adaptive sample space-based Viterbi algorithm for online target localization. The method discretizes the target's motion space into cells that represent a finite number of hidden states. The most probable trajectory of the tracked target is then computed via dynamic programming in a Hidden Markov Model (HMM) framework. The proposed method uses a Bayesian estimation framework that is not limited to Gaussian noise models and does not require a linearized target motion model or sensor measurement models. However, an HMM-based approach to localization can suffer from high computational complexity when the number of hidden states grows, either because of high-resolution modeling or because the target is localized in a large space. To reduce this computational cost, the paper proposes belief propagation over the most probable belief space, proceeding sequentially from low to high resolution, which significantly reduces the required resources. The proposed method is inspired by the k-d tree algorithm (e.g., quadtree) commonly used in computer vision. Experimental tests using an ultra-wideband (UWB) sensor network demonstrate the results.
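As a rough illustration of the dynamic-programming step this abstract describes, the sketch below runs the standard Viterbi recursion on a fixed grid of three cells. It is not the paper's adaptive, multi-resolution method: the grid, the transition and sensor probabilities, and the names `viterbi`, `safe_log`, and `path` are all assumptions made up for this example.

```python
import math

def safe_log(p):
    """Log of a probability, mapping zero to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")

def viterbi(prior, trans, emit, observations):
    """Return the most probable hidden-cell sequence for the observations."""
    n = len(prior)
    # score[s]: best log-probability of any path that ends in cell s
    score = [safe_log(prior[s]) + safe_log(emit[s][observations[0]]) for s in range(n)]
    backptrs = []
    for obs in observations[1:]:
        prev, score, ptr = score, [], []
        for s in range(n):
            # best predecessor cell for landing in cell s at this step
            best = max(range(n), key=lambda p: prev[p] + safe_log(trans[p][s]))
            ptr.append(best)
            score.append(prev[best] + safe_log(trans[best][s]) + safe_log(emit[s][obs]))
        backptrs.append(ptr)
    # backtrack from the best final cell
    state = max(range(n), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path

# Hypothetical 1-D track with three cells: the target stays or moves to a neighbor.
prior = [1 / 3, 1 / 3, 1 / 3]
trans = [[0.6, 0.4, 0.0],
         [0.2, 0.6, 0.2],
         [0.0, 0.4, 0.6]]
# Hypothetical sensor: reports the true cell with probability 0.8.
emit = [[0.8, 0.1, 0.1],
        [0.1, 0.8, 0.1],
        [0.1, 0.1, 0.8]]

path = viterbi(prior, trans, emit, observations=[0, 1, 2])
print(path)
```

Each step of this recursion costs on the order of the number of cells squared. That growth with grid resolution is the cost the abstract says the coarse-to-fine belief propagation is meant to reduce.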

preprint, 2022, arXiv

The fastest linearly converging discrete-time average consensus using buffered information

In this letter, we study the problem of accelerating average consensus over connected graphs in a discrete-time communication setting. The literature has shown that consensus algorithms can be accelerated by increasing the graph connectivity or by optimizing the weights agents place on the information received from their neighbors. In this letter, instead of altering the communication graph, we investigate two methods that use buffered states to accelerate average consensus over a given graph. In the first method, we study how the convergence rate of the well-known first-order Laplacian average consensus algorithm changes with delayed feedback and obtain a sufficient condition on the range of delays that leads to faster convergence. In the second proposed method, we show how the average consensus problem can be cast as a convex optimization problem and solved by first-order accelerated optimization algorithms for strongly convex cost functions. We construct the fastest converging average consensus algorithm using the so-called Triple Momentum optimization algorithm. We demonstrate our results using an in-network linear regression problem, which is formulated as two average consensus problems.
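For orientation, here is a minimal sketch of the baseline this abstract refers to, the first-order Laplacian average consensus update, in which each agent moves toward its neighbors' values. It does not implement the delayed-feedback or Triple Momentum variants the letter proposes. The four-agent path graph, the initial values, the step size, and the name `laplacian_consensus_step` are assumptions chosen for illustration.

```python
def laplacian_consensus_step(x, neighbors, step):
    """One synchronous update: x_i <- x_i + step * sum_j (x_j - x_i)."""
    return [
        xi + step * sum(x[j] - xi for j in neighbors[i])
        for i, xi in enumerate(x)
    ]

# Hypothetical undirected path graph 0 - 1 - 2 - 3.
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}

x = [1.0, 5.0, 3.0, 7.0]      # assumed initial agent values
target = sum(x) / len(x)      # the average every agent should reach

# Assumed step size; it is below 1 / (maximum degree) = 0.5 for this graph.
step = 0.3
for _ in range(200):
    x = laplacian_consensus_step(x, neighbors, step)

print(x, target)
```

Because the graph is undirected, each update preserves the sum of the agents' values, so the common limit is the initial average. The convergence rate depends on the graph and the step size, which is the quantity the letter aims to improve without changing the graph.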

preprint, 2020, arXiv

An IMM-based Decentralized Cooperative Localization with LoS and NLoS UWB Inter-agent Ranging

This paper investigates infrastructure-free global localization of a group of communicating mobile agents (e.g., first responders or exploring robots) via ultra-wideband (UWB) inter-agent ranging aided dead-reckoning. We propose a loosely coupled cooperative localization algorithm that acts as an augmentation atop the local dead-reckoning system of each mobile agent. This augmentation becomes active only when an agent wants to process a relative measurement it has taken. The main contribution of this paper is addressing the challenges in the proper processing of UWB range measurements in the framework of loosely coupled cooperative localization. Even though UWB offers decimeter-level accuracy in line-of-sight (LoS) ranging, its accuracy degrades significantly in non-line-of-sight (NLoS) conditions due to a significant unknown positive bias in the measurements. Thus, the measurement models for the UWB LoS and NLoS ranging conditions are different, and proper processing of NLoS measurements requires a bias compensation measure. We also show that, in practice, the measurement modal discriminators that determine the type of UWB range measurement should be probabilistic. To take into account the probabilistic nature of the NLoS identifiers when processing UWB inter-agent ranging feedback, we employ an interacting multiple model (IMM) estimator in our localization filter. We also propose a bias compensation method for NLoS UWB measurements. The effectiveness of our cooperative localization is demonstrated via an experiment with a group of pedestrians who use UWB relative range measurements among themselves to improve their shoe-mounted INS geolocation.

preprint, 2020, arXiv

Dynamic Active Average Consensus and its Application in Containment Control

This paper proposes a continuous-time dynamic active weighted average consensus algorithm in which the agents can alternate between active and passive modes depending on their ability to access their reference input. The objective is to enable all the agents, both active and passive, to track the weighted average of the reference inputs of the active agents. The algorithm is modeled as a switched linear system whose convergence properties are carefully studied considering the agents' piece-wise constant access to the reference signals and possible piece-wise constant weights of the agents. We also study the discrete-time implementation of this algorithm. Next, we show how a containment control problem, in which a group of followers should track the convex hull of a set of observed leaders, can be cast as an active average consensus problem and solved efficiently by our proposed dynamic active average consensus algorithm. Numerical examples demonstrate our results.

preprint, 2015, arXiv

Cooperative localization for mobile agents: a recursive decentralized algorithm based on Kalman filter decoupling

We consider a cooperative localization technique for mobile agents with communication and computation capabilities. We start by providing an overview of different decentralization strategies in the literature, with special focus on how these algorithms maintain an account of intrinsic correlations between the state estimates of team members. Then, we present a novel decentralized cooperative localization algorithm that is a decentralized implementation of a centralized Extended Kalman Filter for cooperative localization. In this algorithm, instead of propagating cross-covariance terms, each agent propagates new intermediate local variables that can be used in an update stage to create the required propagated cross-covariance terms. Whenever there is a relative measurement in the network, the algorithm declares the agent making this measurement as the interim master. By acquiring information from the interim landmark, that is, the agent the relative measurement is taken from, the interim master can calculate and broadcast a set of intermediate variables which each robot can then use to update its estimates to match those of a centralized Extended Kalman Filter for cooperative localization. Once an update is done, no further communication is needed until the next relative measurement.

preprint, 2015, arXiv

Distributed event-triggered communication for dynamic average consensus in networked systems

This paper presents distributed algorithmic solutions that employ opportunistic inter-agent communication to achieve dynamic average consensus. In our solutions, each agent is endowed with a local criterion that enables it to determine whether to broadcast its state to its neighbors. Our starting point is a continuous-time distributed coordination strategy that, under continuous-time communication, achieves practical asymptotic tracking of the dynamic average of the agents' time-varying reference inputs. Then, for this algorithm, depending on the directed or undirected nature of the time-varying interactions and under suitable connectivity conditions, we propose two different distributed event-triggered communication laws that prescribe agent communications at discrete time instants in an opportunistic fashion. In both cases, we establish positive lower bounds on the inter-event times of each agent and characterize their dependence on the algorithm design parameters. This analysis allows us to rule out the presence of Zeno behavior and characterize the asymptotic correctness of the resulting implementations. Several simulations illustrate the results.

preprint, 2014, arXiv

Distributed convex optimization via continuous-time coordination algorithms with discrete-time communication

This paper proposes a novel class of distributed continuous-time coordination algorithms to solve network optimization problems whose cost function is a sum of local cost functions associated with the individual agents. We establish the exponential convergence of the proposed algorithm under (i) strongly connected and weight-balanced digraph topologies when the local costs are strongly convex with globally Lipschitz gradients, and (ii) connected graph topologies when the local costs are strongly convex with locally Lipschitz gradients. When the local cost functions are convex and the global cost function is strictly convex, we establish asymptotic convergence under connected graph topologies. We also characterize the algorithm's correctness under time-varying interaction topologies and study its privacy preservation properties. Motivated by practical considerations, we analyze the algorithm implementation with discrete-time communication. We provide an upper bound on the stepsize that guarantees exponential convergence over connected graphs for implementations with periodic communication. Building on this result, we design a provably correct centralized event-triggered communication scheme that is free of Zeno behavior. Finally, we develop a distributed, asynchronous event-triggered communication scheme that is also free of Zeno behavior, with asymptotic convergence guarantees. Several simulations illustrate our results.

preprint, 2014, arXiv

Dynamic Average Consensus under Limited Control Authority and Privacy Requirements

This paper introduces a novel continuous-time dynamic average consensus algorithm for networks whose interaction is described by a strongly connected and weight-balanced directed graph. The proposed distributed algorithm allows agents to track the average of their dynamic inputs with some steady-state error whose size can be controlled using a design parameter. This steady-state error vanishes for special classes of input signals. We analyze the asymptotic correctness of the algorithm under time-varying interaction topologies and characterize the requirements on the stepsize for discrete-time implementations. We show that our algorithm naturally preserves the privacy of the local input of each agent. Building on this analysis, we synthesize an extension of the algorithm that allows individual agents to control their own rate of convergence towards agreement and handle saturation bounds on the driving command. Finally, we show that the proposed extension additionally preserves the privacy of the transient response of the agreement states and the final agreement value from internal and external adversaries. Numerical examples illustrate the results.
