Source author record

Tatiana Tatarenko

Tatiana Tatarenko appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC eess.SY Systems and Control Computer Science and Game Theory Machine Learning math.PR Multiagent Systems

Catalog footprint

What is connected

11works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Inverse Learning in $2\times2$ Games: From Synthetic Interactions to Traffic Simulation

Understanding how agents coordinate or compete from limited behavioral data is central to modeling strategic interactions in traffic, robotics, and other multi-agent systems. In this work, we investigate the following complementary formulations of inverse game-theoretic learning: (i) a Closed-form Correlated Equilibrium Maximum-Likelihood estimator (CE-ML) specialized for $2\times2$ games; and (ii) a Logit Best Response Maximum-Likelihood estimator (LBR-ML) that captures long-run adaptation dynamics via stochastic response processes. Together, these approaches span the spectrum between static equilibrium consistency and dynamic behavioral realism. We evaluate them on synthetic "chicken-dare" games and traffic-interaction scenarios simulated in SUMO, comparing parameter recovery and distributional fit. Results reveal clear trade-offs between interpretability, computational tractability, and behavioral expressiveness across models.

preprint2022arXiv

On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games

We derive the rate of convergence to Nash equilibria for the payoff-based algorithm proposed in \cite{tat_kam_TAC}. These rates are achieved under the standard assumption of convexity of the game, strong monotonicity and differentiability of the pseudo-gradient. In particular, we show the algorithm achieves $O(\frac{1}{T})$ in the two-point function evaluating setting and $O(\frac{1}{\sqrt{T}})$ in the one-point function evaluation under additional requirement of Lipschitz continuity of the pseudo-gradient. These rates are to our knowledge the best known rates for the corresponding problem classes.

preprint2022arXiv

Projected gradient-tracking in multi-cluster games and its application to power management

We are concerned with a distributed approach to solve multi-cluster games arising in multi-agent systems. In such games, agents are separated into distinct clusters. The agents belonging to the same cluster cooperate with each other to achieve a common cluster goal while a non-cooperative game is played between the clusters. To be able to deal with the sparsity of information, as each agent only knows a specific part of the problem, we combine gradient-tracking and consensus methods for information distribution into an algorithm that can solve both the cooperative and non-cooperative problem in a single run. The constraints of the problem are taken into account by the corresponding projection operators and linear convergence is proven given an appropriate constant step size. The algorithm is applied to a day-ahead power management problem, posed as a multi-cluster game, and its efficiency is demonstrated by simulations.

preprint2021arXiv

Gradient-Tracking over Directed Graphs for solving Leaderless Multi-Cluster Games

We are concerned with finding Nash Equilibria in agent-based multi-cluster games, where agents are separated into distinct clusters. While the agents inside each cluster collaborate to achieve a common goal, the clusters are considered to be virtual players that compete against each other in a non-cooperative game with respect to a coupled cost function. In such scenarios, the inner-cluster problem and the game between the clusters need to be solved simultaneously. Therefore, the resulting inter-cluster Nash Equilibrium should also be a minimizer of the social welfare problem inside the clusters. In this work, this setup is cast as a distributed optimization problem with sparse state information. Hence, critical information, such as the agent's cost functions, remain private. We present a distributed algorithm that converges with a linear rate to the optimal solution. Furthermore, we apply our algorithm to an extended cournot game to verify our theoretical results.

preprint2020arXiv

A Smooth Inexact Penalty Reformulation of Convex Problems with Linear Constraints

In this work, we consider a constrained convex problem with linear inequalities and provide an inexact penalty re-formulation of the problem. The novelty is in the choice of the penalty functions, which are smooth and can induce a non-zero penalty over some points in feasible region of the original constrained problem. The resulting unconstrained penalized problem is parametrized by two penalty parameters which control the slope and the curvature of the penalty function. With a suitable selection of these penalty parameters, we show that the solutions of the resulting penalized unconstrained problem are \emph{feasible} for the original constrained problem, under some assumptions. Also, we establish that, with suitable choices of penalty parameters, the solutions of the penalized unconstrained problem can achieve a suboptimal value which is arbitrarily close to the optimal value of the original constrained problem. For the problems with a large number of linear inequality constraints, a particular advantage of such a smooth penalty-based reformulation is that it renders a penalized problem suitable for the implementation of fast incremental gradient methods, which require only one sample from the inequality constraints at each iteration. We consider applying SAGA proposed in \cite{saga} to solve the resulting penalized unconstrained problem. Moreover, we propose an alternative approach to set up the penalized problem. This approach is based on the time-varying penalty parameters and, thus, does not require knowledge about some problem-specific properties, that might be difficult to estimate. We prove that the single-loop full gradient-based algorithm applied to the corresponding time-varying penalized problem converges to the solution of the original constrained problem in the case of the strongly convex objective function.

preprint2020arXiv

Convergence Rate of a Penalty Method for Strongly Convex Problems with Linear Constraints

We consider an optimization problem with strongly convex objective and linear inequalities constraints. To be able to deal with a large number of constraints we provide a penalty reformulation of the problem. As penalty functions we use a version of the one-sided Huber losses. The smoothness properties of these functions allow us to choose time-varying penalty parameters in such a way that the incremental procedure with the diminishing step-size converges to the exact solution with the rate $O(1/{\sqrt k})$. To the best of our knowledge, we present the first result on the convergence rate for the penalty-based gradient method, in which the penalty parameters vary with time.

preprint2020arXiv

Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces

We consider online convex optimization with a zero-order oracle feedback. In particular, the decision maker does not know the explicit representation of the time-varying cost functions, or their gradients. At each time step, she observes the value of the corresponding cost function evaluated at her chosen action (zero-order oracle). The objective is to minimize the regret, that is, the difference between the sum of the costs she accumulates and that of a static optimal action had she known the sequence of cost functions a priori. We present a novel algorithm to minimize regret in unconstrained action spaces. Our algorithm hinges on a classical idea of one-point estimation of the gradients of the cost functions based on their observed values. The algorithm is independent of problem parameters. Letting $T$ denote the number of queries of the zero-order oracle and $n$ the problem dimension, the regret rate achieved is $O(n^{2/3}T^{2/3})$. Moreover, we adapt the presented algorithm to the setting with two-point feedback and demonstrate that the adapted procedure achieves the theoretical lower bound on the regret of $(n^{1/2}T^{1/2})$.

preprint2020arXiv

Projected Push-Sum Gradient Descent-Ascent for Convex Optimizationwith Application to Economic Dispatch Problems

We propose a novel algorithm for solving convex, constrained and distributed optimization problems defined on multi-agent-networks, where each agent has exclusive access to a part of the global objective function. The agents are able to exchange information over a directed, weighted communication graph, which can be represented as a column-stochastic matrix. The algorithm combines an adjusted push-sum consensus protocol for information diffusion and a gradient descent-ascent on the local cost functions, providing convergence to the optimum of their sum. We provide results on a reformulation of the push-sum into single matrix-updates and prove convergence of the proposed algorithm to an optimal solution, given standard assumptions in distributed optimization. The algorithm is applied to a distributed economic dispatch problem, in which the constraints can be expressed in local and global subsets.

preprint2020arXiv

Revisiting Consensus-Based Energy-Management in Smart Grid with Transmission Losses and Directed Communication

We discovered a deficiency in Algorithm 1 and Theorem 3 of [1]. The algorithm called CEMA aims to solve an energy management problem distributively. However, by means of a counter example, we show that Theorem 2 and 3 of [1] contradict each other in the case of a valid scenario, proving that the suggested algorithm does not always find the optimum. Furthermore, we provide theoretic results, showing that Theorem 3 of [1] does not hold generally. At last, we provide a rectification by adjusting the algorithm and the corresponding proof of Theorem 3.

preprint2016arXiv

Non-Convex Distributed Optimization

We study distributed non-convex optimization on a time-varying multi-agent network. Each node has access to its own smooth local cost function, and the collective goal is to minimize the sum of these functions. We generalize the results obtained previously to the case of non-convex functions. Under some additional technical assumptions on the gradients we prove the convergence of the distributed push-sum algorithm to some critical point of the objective function. By utilizing perturbations on the update process, we show the almost sure convergence of the perturbed dynamics to a local minimum of the global objective function. Our analysis shows that this noised procedure converges at a rate of $O(1/t)$.

preprint2016arXiv

Payoff-Based Approach to Learning Nash Equilibria in Convex Games

We consider multi-agent decision making, where each agent optimizes its cost function subject to constraints. Agents' actions belong to a compact convex Euclidean space and the agents' cost functions are coupled. We propose a distributed payoff-based algorithm to learn Nash equilibria in the game between agents. Each agent uses only information about its current cost value to compute its next action. We prove convergence of the proposed algorithm to a Nash equilibrium in the game leveraging established results on stochastic processes. The performance of the algorithm is analyzed with a numerical case study.

Tatiana Tatarenko

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Inverse Learning in $2\times2$ Games: From Synthetic Interactions to Traffic Simulation

On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games

Projected gradient-tracking in multi-cluster games and its application to power management

Gradient-Tracking over Directed Graphs for solving Leaderless Multi-Cluster Games

A Smooth Inexact Penalty Reformulation of Convex Problems with Linear Constraints

Convergence Rate of a Penalty Method for Strongly Convex Problems with Linear Constraints

Minimizing Regret of Bandit Online Optimization in Unconstrained Action Spaces

Projected Push-Sum Gradient Descent-Ascent for Convex Optimizationwith Application to Economic Dispatch Problems

Revisiting Consensus-Based Energy-Management in Smart Grid with Transmission Losses and Directed Communication

Non-Convex Distributed Optimization

Payoff-Based Approach to Learning Nash Equilibria in Convex Games