Researcher profile

Xiuxian Li

Xiuxian Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2022arXiv

A Survey of Decision Making in Adversarial Games

Game theory has by now found numerous applications in various fields, including economics, industry, jurisprudence, and artificial intelligence, where each player only cares about its own interest in a noncooperative or cooperative manner, but without obvious malice to other players. However, in many practical applications, such as poker, chess, evader pursuing, drug interdiction, coast guard, cyber-security, and national defense, players often have apparently adversarial stances, that is, selfish actions of each player inevitably or intentionally inflict loss or wreak havoc on other players. Along this line, this paper provides a systematic survey on three main game models widely employed in adversarial games, i.e., zero-sum normal-form and extensive-form games, Stackelberg (security) games, zero-sum differential games, from an array of perspectives, including basic knowledge of game models, (approximate) equilibrium concepts, problem classifications, research frontiers, (approximate) optimal strategy seeking techniques, prevailing algorithms, and practical applications. Finally, promising future research directions are also discussed for relevant adversarial games.

preprint2022arXiv

Continuous-Time and Event-Triggered Online Optimization for Linear Multi-Agent Systems

This paper studies the decentralized online convex optimization problem for heterogeneous linear multi-agent systems. Agents have access to their time-varying local cost functions related to their own outputs, and there are also time-varying coupling inequality constraints among them. The goal of each agent is to minimize the global cost function by selecting appropriate local actions only through communication between neighbors. We design a distributed controller based on the saddle-point method which achieves constant regret bound and sublinear fit bound. In addition, to reduce the communication overhead, we propose an event-triggered communication scheme and show that the constant regret bound and sublinear fit bound are still achieved in the case of discrete communications with no Zeno behavior. A numerical example is provided to verify the proposed algorithms.with no Zeno behavior. A numerical example is provided to verify the proposed algorithms.

preprint2022arXiv

Decentralized Nash Equilibria Learning for Online Game with Bandit Feedback

This paper studies distributed online bandit learning of generalized Nash equilibria for online game, where cost functions of all players and coupled constraints are time-varying. The values rather than full information of cost and local constraint functions are revealed to local players gradually. The goal of each player is to selfishly minimize its own cost function with no future information subject to a strategy set constraint and time-varying coupled inequality constraints. To this end, a distributed online algorithm based on mirror descent and one-point bandit feedback is designed for seeking generalized Nash equilibria of the online game. It is shown that the devised online algorithm achieves sublinear expected regrets and accumulated constraint violation if the path variation of the generalized Nash equilibrium sequence is sublinear. Furthermore, the proposed algorithm is extended to the scenario of delayed bandit feedback, that is, the values of cost and constraint functions are disclosed to local players with time delays. It is also demonstrated that the online algorithm with delayed bandit feedback still has sublinear expected regrets and accumulated constraint violation under some conditions on the path variation and delay. Simulations are presented to illustrate the efficiency of theoretical results.

preprint2022arXiv

Linear Last-Iterate Convergence for Continuous Games with Coupled Inequality Constraints

In this paper, the generalized Nash equilibrium (GNE) seeking problem for continuous games with coupled affine inequality constraints is investigated in a partial-decision information scenario, where each player can only access its neighbors' information through local communication although its cost function possibly depends on all other players' strategies. To this end, a novel decentralized primal-dual algorithm based on consensus and dual diffusion methods is devised for seeking the variational GNE of the studied games. This paper also provides theoretical analysis to show that the designed algorithm converges linearly for the last-iterate, which, to our best knowledge, is the first to propose a linearly convergent GNE seeking algorithm under coupled affine inequality constraints. Finally, a numerical example is presented to demonstrate the effectiveness of the obtained theoretical results.

preprint2022arXiv

On the linear convergence of distributed Nash equilibrium seeking for multi-cluster games under partial-decision information

This paper considers the distributed strategy design for Nash equilibrium (NE) seeking in multi-cluster games under a partial-decision information scenario. In the considered game, there are multiple clusters and each cluster consists of a group of agents. A cluster is viewed as a virtual noncooperative player that aims to minimize its local payoff function and the agents in a cluster are the actual players that cooperate within the cluster to optimize the payoff function of the cluster through communication via a connected graph. In our setting, agents have only partial-decision information, that is, they only know local information and cannot have full access to opponents' decisions. To solve the NE seeking problem of this formulated game, a discrete-time distributed algorithm, called distributed gradient tracking algorithm (DGT), is devised based on the inter- and intra-communication of clusters. In the designed algorithm, each agent is equipped with strategy variables including its own strategy and estimates of other clusters' strategies. With the help of a weighted Fronbenius norm and a weighted Euclidean norm, theoretical analysis is presented to rigorously show the linear convergence of the algorithm. Finally, a numerical example is given to illustrate the proposed algorithm.

preprint2022arXiv

Resilient Multi-Dimensional Consensus in Adversarial Environment

This paper considers the multi-dimensional consensus in networked systems, where some of the agents might be misbehaving (or faulty). Despite the influence of these misbehaviors, the benign agents aim to reach an agreement while avoiding being seriously influenced by the faulty ones. To this end, this paper first considers a general class of consensus algorithms, where each benign agent computes an "auxiliary point" based on the received values and moves its state toward this point. Concerning this generic form, we present conditions for achieving resilient consensus and obtain a lower bound on the exponential convergence rate. Assuming that the number of malicious agents is upper bounded, two specific resilient consensus algorithms are further developed based on the obtained conditions. Particularly, the first solution, based on Helly's Theorem, achieves the consensus within the convex hull formed by the benign agents' initial states, where the auxiliary point can be efficiently computed through linear programming. On the other hand, the second algorithm serves as a "built-in" security guarantee for standard average consensus algorithms, in the sense that its performance coincides exactly with that of the standard ones in the absence of faulty nodes while also resisting the serious influence of the misbehaving ones in adversarial environment. Some numerical examples are provided in the end to verify the theoretical results.

preprint2021arXiv

Exponential convergence of distributed optimization for heterogeneous linear multi-agent systems

In this work we study a distributed optimal output consensus problem for heterogeneous linear multi-agent systems where the agents aim to reach consensus with the purpose of minimizing the sum of private convex costs. Based on output feedback, a fully distributed control law is proposed by using the proportional-integral (PI) control technique. For strongly convex cost functions with Lipschitz gradients, the designed controller can achieve convergence exponentially in an undirected and connected network. Furthermore, to remove the requirement of continuous communications, the proposed control law is then extended to periodic and event-triggered communication schemes, which also achieve convergence exponentially. Two simulation examples are given to verify the proposed control algorithms.

preprint2020arXiv

A "Safe Kernel" Approach for Resilient Multi-Dimensional Consensus

This paper considers the resilient multi-dimensional consensus problem in networked systems, where some of the agents might be malicious (or faulty). We propose a multi-dimensional consensus algorithm, where at each time step each healthy agent computes a "safe kernel" based on the information from its neighbors, and modifies its own state towards a point inside the kernel. Assuming that the number of malicious agents is locally (or globally) upper bounded, sufficient conditions on the network topology are presented to guarantee that the benign agents exponentially reach an agreement within the convex hull of their initial states, regardless of the actions of the misbehaving ones. It is also revealed that the graph connectivity and robustness required to achieve the resilient consensus increases linearly with respect to the dimension of the agents' state, indicating the existence of a trade-off between the low communication cost and system security. Numerical examples are provided in the end to validate the theoretical results.

preprint2020arXiv

Distributed Aggregative Optimization over Multi-Agent Networks

This paper proposes a new framework for distributed optimization, called distributed aggregative optimization, which allows local objective functions to be dependent not only on their own decision variables, but also on the average of summable functions of decision variables of all other agents. To handle this problem, a distributed algorithm, called distributed gradient tracking (DGT), is proposed and analyzed, where the global objective function is strongly convex, and the communication graph is balanced and strongly connected. It is shown that the algorithm can converge to the optimal variable at a linear rate. A numerical example is provided to corroborate the theoretical result.

preprint2020arXiv

Distributed Online Convex Optimization with an Aggregative Variable

This paper investigates distributed online convex optimization in the presence of an aggregative variable without any global/central coordinators over a multi-agent network, where each individual agent is only able to access partial information of time-varying global loss functions, thus requiring local information exchanges between neighboring agents. Motivated by many applications in reality, the considered local loss functions depend not only on their own decision variables, but also on an aggregative variable, such as the average of all decision variables. To handle this problem, an Online Distributed Gradient Tracking algorithm (O-DGT) is proposed with exact gradient information and it is shown that the dynamic regret is upper bounded by three terms: a sublinear term, a path variation term, and a gradient variation term. Meanwhile, the O-DGT algorithm is also analyzed with stochastic/noisy gradients, showing that the expected dynamic regret has the same upper bound as the exact gradient case. To our best knowledge, this paper is the first to study online convex optimization in the presence of an aggregative variable, which enjoys new characteristics in comparison with the conventional scenario without the aggregative variable. Finally, a numerical experiment is provided to corroborate the obtained theoretical results.

preprint2020arXiv

Distributed Online Optimization for Multi-Agent Networks with Coupled Inequality Constraints

This paper investigates the distributed online optimization problem over a multi-agent network subject to local set constraints and coupled inequality constraints, which has a lot of applications in many areas, such as wireless sensor networks, power systems and plug-in electric vehicles. In this problem, the cost function at each time step is the sum of local cost functions with each of them being gradually revealed to its corresponding agent, and meanwhile only local functions in coupled inequality constraints are accessible to each agent. To address this problem, a modified primal-dual algorithm, called distributed online primal-dual push-sum algorithm (DOPP), is developed in this paper, which does not rest on any assumption on parameter boundedness and is applicable to unbalanced networks. It is shown that the proposed algorithm is sublinear for both the dynamic regret and the violation of coupled inequality constraints. Finally, the theoretical results are supported by a simulation example.

preprint2020arXiv

Distributed Proximal Algorithms for Multi-Agent Optimization with Coupled Inequality Constraints

This paper aims to address distributed optimization problems over directed and time-varying networks, where the global objective function consists of a sum of locally accessible convex objective functions subject to a feasible set constraint and coupled inequality constraints whose information is only partially accessible to each agent. For this problem, a distributed proximal-based algorithm, called distributed proximal primal-dual (DPPD) algorithm, is proposed based on the celebrated centralized proximal point algorithm. It is shown that the proposed algorithm can lead to the global optimal solution with a general stepsize, which is diminishing and non-summable, but not necessarily square-summable, and the saddle-point running evaluation error vanishes proportionally to $O(1/\sqrt{k})$, where $k>0$ is the iteration number. Finally, a simulation example is presented to corroborate the effectiveness of the proposed algorithm.