Researcher profile

Ming Cao

Ming Cao contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
21 works
0 followers
14 topics
4 close collaborators

Published work

21 published item(s)

preprint, 2026, arXiv

Fairness risk and its privacy-enabled solution in AI-driven robotic applications

Complex decision-making by autonomous machines and algorithms could underpin the foundations of future society. Generative AI is emerging as a powerful engine for such transitions. However, we show that Generative AI-driven developments pose a critical pitfall: fairness concerns. In robotic applications, although intuitions about fairness are common, a precise and implementable definition that captures user utility and inherent data randomness is missing. Here we provide a utility-aware fairness metric for robotic decision making and analyze fairness jointly with user-data privacy, deriving conditions under which privacy budgets govern fairness metrics. This yields a unified framework that formalizes and quantifies fairness and its interplay with privacy, which is tested in a robot navigation task. Given that most robotic systems will enforce user privacy under legal requirements, the approach shows, surprisingly, that such privacy budgets can also be used to meet fairness targets. Addressing fairness concerns jointly with privacy is a step towards ethical use of AI and strengthens trust in autonomous robots deployed in everyday environments.

preprint, 2022, arXiv

Guiding Vector Fields for Following Occluded Paths

Accurately following a geometric desired path in a two-dimensional space is a fundamental task for many engineering systems, in particular mobile robots. When the desired path is occluded by obstacles, it is necessary and crucial to temporarily deviate from the path for obstacle/collision avoidance. In this paper, we develop a composite guiding vector field via the use of smooth bump functions, and provide theoretical guarantees that the integral curves of the vector field can follow an arbitrary sufficiently smooth desired path and avoid collision with obstacles of arbitrary shapes. These two behaviors are reactive since path (re)-planning and global map construction are not involved. To deal with the common deadlock problem, we introduce a switching vector field, and the Zeno behavior is excluded. Simulations are conducted to support the theoretical results.

preprint, 2022, arXiv

Limit Cycles Analysis and Control of Evolutionary Game Dynamics with Environmental Feedback

Recently, an evolutionary game dynamics model that takes environmental feedback into account has been proposed to describe the co-evolution of strategic actions of a population of individuals and the state of the surrounding environment; correspondingly, a range of interesting dynamic behaviors have been reported. In this paper, we provide new theoretical insight into such behaviors and discuss control options. Instead of the standard replicator dynamics, we use a more realistic and comprehensive model of replicator-mutator dynamics to describe the strategic evolution of the population. After integrating the environment feedback, we study the effect of mutations on the resulting closed-loop system dynamics. We prove the conditions for two types of bifurcations, Hopf bifurcation and heteroclinic bifurcation, both of which result in stable limit cycles. These limit cycles have not been identified in existing works, and we further prove that such limit cycles are in fact persistent in a large parameter space and are almost globally stable. In the end, an intuitive control policy based on incentives is applied, and the effectiveness of this control policy is examined by analysis and simulations.

preprint, 2022, arXiv

Robust optimal policies for team Markov games

In stochastic dynamic environments, team Markov games have emerged as a versatile paradigm for studying sequential decision-making problems of fully cooperative multi-agent systems. However, the optimality of the derived policies is usually sensitive to model parameters, which are typically unknown and required to be estimated from noisy data in practice. To mitigate the sensitivity of optimal policies to these uncertain parameters, we propose a robust model of team Markov games in this paper, where agents utilize robust optimization approaches to update strategies. This model extends team Markov games to the scenario of incomplete information and meanwhile provides an alternative solution concept of robust team optimality. To seek such a solution, we develop a robust iterative learning algorithm of team policies and prove its convergence. This algorithm, compared with robust dynamic programming, not only possesses a faster convergence rate, but also allows for using approximation calculations to alleviate the curse of dimensionality. Moreover, some numerical simulations are presented to demonstrate the effectiveness of the algorithm by generalizing the game model of sequential social dilemmas to uncertain scenarios.

preprint, 2021, arXiv

A two-layer model for coevolving opinion dynamics and collective decision-making in complex social systems

Motivated by the literature on opinion dynamics and evolutionary game theory, we propose a novel mathematical framework to model the intertwined coevolution of opinions and decision-making in a complex social system. In the proposed framework, the members of a social community update their opinions and revise their actions as they learn of others' opinions shared on a communication channel, and observe others' actions through an influence channel; these interactions determine a two-layer network structure. We offer an application of the proposed framework by tailoring it to study the adoption of a novel social norm, demonstrating that the model is able to capture the emergence of several real-world collective phenomena such as paradigm shifts and unpopular norms. Through the establishment of analytical conditions and Monte Carlo numerical simulations, we shed light on the role of the coupling between opinion dynamics and decision-making, and of the network structure, in shaping the emergence of complex collective behavior in social systems.

preprint, 2021, arXiv

Convergence Analysis of Dual Decomposition Algorithm in Distributed Optimization: Asynchrony and Inexactness

Dual decomposition is widely utilized in distributed optimization of multi-agent systems. In practice, the dual decomposition algorithm is desired to admit an asynchronous implementation due to imperfect communication, such as time delay and packet drop. In addition, computational errors also exist when individual agents solve their own subproblems. In this paper, we analyze the convergence of the dual decomposition algorithm in distributed optimization when both the asynchrony in communication and the inexactness in solving subproblems exist. We find that the interaction between asynchrony and inexactness slows down the convergence rate from $\mathcal{O} ( 1 / k )$ to $\mathcal{O} ( 1 / \sqrt{k} )$. Specifically, with a constant step size, the value of objective function converges to a neighborhood of the optimal value, and the solution converges to a neighborhood of the exact optimal solution. Moreover, the violation of the constraints diminishes in $\mathcal{O} ( 1 / \sqrt{k} )$. Our result generalizes and unifies the existing ones that only consider either asynchrony or inexactness. Finally, numerical simulations validate the theoretical results.
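
As a rough illustration of the basic scheme this abstract builds on, the sketch below runs plain dual decomposition on a toy resource-sharing problem. It is synchronous and solves each subproblem exactly, so it does not model the asynchrony or inexactness analyzed in the paper. The quadratic costs, the names `local_solve` and `dual_decomposition`, and all numerical values are assumptions made for the example.

```python
def local_solve(target, price):
    """Agent subproblem: minimize (x - target)**2 + price * x over x.

    Setting the derivative 2 * (x - target) + price to zero gives the minimizer.
    """
    return target - price / 2.0


def dual_decomposition(targets, budget, step, iterations):
    """Minimize sum_i (x_i - targets[i])**2 subject to sum_i x_i == budget.

    The coupling constraint is priced by a single dual variable, updated by
    gradient ascent on the dual function with a constant step size.
    """
    price = 0.0
    for _ in range(iterations):
        # Each agent solves its own subproblem given the current price.
        xs = [local_solve(a, price) for a in targets]
        # The dual gradient is the constraint violation.
        price += step * (sum(xs) - budget)
    return [local_solve(a, price) for a in targets], price


xs, price = dual_decomposition(targets=[1.0, 2.0, 3.0], budget=3.0,
                               step=0.5, iterations=100)
print(xs, price)
```

With these hypothetical costs the price error shrinks by the factor |1 - step * n / 2| per iteration, where n is the number of agents, so the constant step size must stay below 4 / n for the iteration to converge.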

preprint, 2021, arXiv

Different Environment Feedback in Fast-slow Eco-evolutionary Dynamics

The fast-slow dynamics of an eco-evolutionary system are studied, where we consider the feedback actions of environmental resources that are classified into those that are self-renewing and those externally supplied. We show that although these two types of resources are drastically different, the resulting closed-loop systems bear close resemblances, which include the same equilibria and their stability conditions on the boundary of the phase space, and similar appearances of equilibria in the interior. After closer examination of specific choices of parameter values, we disclose that the global dynamical behaviors of the two types of closed-loop systems can be fundamentally different in terms of limit cycles: the system with self-renewing resources undergoes a generalized Hopf bifurcation such that one stable limit cycle and one unstable limit cycle can coexist; the system with externally supplied resources can only have the stable limit cycle induced by a supercritical Hopf bifurcation. Finally, an explorative analysis is carried out to show that the discovered dynamic behaviors are robust in an even larger parameter space.

preprint, 2021, arXiv

Highway Traffic Control via Smart e-Mobility -- Part I: Theory

In this paper, we study how to alleviate highway traffic congestion by encouraging plug-in hybrid and electric vehicles to stop at a charging station around peak congestion times. Specifically, we design a pricing policy to make the charging price dynamic and dependent on the traffic congestion, predicted via the cell transmission model, and the availability of charging spots. Furthermore, we develop a novel framework to model how this policy affects the drivers' decisions by formulating a mixed-integer potential game. Technically, we introduce the concept of "road-to-station" (r2s) and "station-to-road" (s2r) flows, and show that the selfish actions of the drivers converge to charging schedules that are individually optimal in the sense of Nash. In the second part of this work, submitted as a separate paper (Part II: Case Study), we validate the proposed strategy on a simulated highway stretch between The Hague and Rotterdam, in The Netherlands.

preprint, 2021, arXiv

Highway Traffic Control via Smart e-Mobility -- Part II: Dutch A13 Case Study

In this paper, we study how to alleviate highway traffic congestion by encouraging plug-in electric and hybrid vehicles to stop at charging stations around peak congestion times. Specifically, we focus on a case study and simulate the adoption of a dynamic charging price depending on the traffic congestion. We use real traffic data of the A13 highway stretch between The Hague and Rotterdam, in The Netherlands, to identify the Cell Transmission Model. Then, we apply the algorithm proposed in (Part I: Theory) to different scenarios, validating the theoretical results and showing the benefits of our strategy in terms of traffic congestion alleviation. Finally, we carry out a sensitivity analysis of the proposed algorithm and discuss how to optimize its performance.

preprint, 2021, arXiv

Modelling the Effect of Vaccination and Human Behaviour on the Spread of Epidemic Diseases on Temporal Networks

Motivated by the increasing number of COVID-19 cases that have been observed in many countries after the vaccination and relaxation of non-pharmaceutical interventions, we propose a mathematical model on time-varying networks for the spread of recurrent epidemic diseases in a partially vaccinated population. The model encapsulates several realistic features, such as the different effectiveness of the vaccine against transmission and development of severe symptoms, testing practices, the possible implementation of non-pharmaceutical interventions to reduce the transmission, isolation of detected individuals, and human behaviour. Using a mean-field approach, we analytically derive the epidemic threshold of the model and, if the system is above such a threshold, we compute the epidemic prevalence at the endemic equilibrium. These theoretical results show that precautious human behaviour and effective testing practices are key toward avoiding epidemic outbreaks. Interestingly, we found that, in many realistic scenarios, vaccination is successful in mitigating the outbreak by reducing the prevalence of seriously ill patients, but it could be a double-edged sword, whereby in some cases it might favour resurgent outbreaks, calling for higher testing rates, more cautiousness and responsibility among the population, or the reintroduction of non-pharmaceutical interventions to achieve complete eradication.

preprint, 2021, arXiv

Stability of Remote Synchronization in Star Networks of Kuramoto Oscillators

Synchrony of neuronal ensembles is believed to facilitate information exchange among cortical regions in the human brain. Recently, it has been observed that distant brain areas which are not directly connected by neural links also experience synchronization. Such synchronization between remote regions is sometimes due to the presence of a mediating region connecting them, e.g., the thalamus. The underlying network structure of this phenomenon is star-like and motivates us to study the remote synchronization of Kuramoto oscillators, modeling neural dynamics, coupled by a directed star network, in which the peripheral oscillators become phase synchronized while the central mediator remains at a different phase. We show that the symmetry of the coupling strengths of the outgoing links from the central oscillator plays a crucial role in enabling stable remote synchronization. We also consider the case when there is a phase shift in the model which results from synaptic and conduction delays. Sufficient conditions on the coupling strengths are obtained to ensure the stability of remotely synchronized states. To validate our obtained results, numerical simulations are also performed.
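
The behavior described in this abstract can be reproduced in a small simulation. The sketch below integrates Kuramoto oscillators on a star with forward Euler, assuming identical natural frequencies for the peripheral oscillators, equal coupling on all outgoing links from the center, and no phase shift. The function name `simulate_star` and every parameter value are hypothetical choices for illustration, not taken from the paper.

```python
import math


def simulate_star(theta, omega_center, omega_leaf, k_out, k_in,
                  dt=0.01, steps=5000):
    """Forward-Euler integration of Kuramoto oscillators on a star network.

    theta[0] is the central oscillator; the remaining entries are peripheral.
    k_out is the (equal) coupling from the center to each peripheral node,
    k_in is the coupling from each peripheral node to the center.
    """
    theta = list(theta)
    for _ in range(steps):
        center, leaves = theta[0], theta[1:]
        d_center = omega_center + k_in * sum(math.sin(p - center) for p in leaves)
        d_leaves = [omega_leaf + k_out * math.sin(center - p) for p in leaves]
        theta = [center + dt * d_center] + [
            p + dt * d for p, d in zip(leaves, d_leaves)
        ]
    return theta


final = simulate_star([0.0, 0.1, -0.2], omega_center=1.5, omega_leaf=1.0,
                      k_out=1.0, k_in=1.0)
print("peripheral phase gap:", abs(final[1] - final[2]))
print("center-to-peripheral phase gap:", final[0] - final[1])
```

In this setup the two peripheral oscillators end up in phase with each other, while the center settles at a constant nonzero offset from them because its natural frequency differs.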

preprint, 2020, arXiv

An asynchronous distributed and scalable generalized Nash equilibrium seeking algorithm for strongly monotone games

In this paper, we present three distributed algorithms to solve a class of generalized Nash equilibrium (GNE) seeking problems in strongly monotone games. The first one (SD-GENO) is based on synchronous updates of the agents, while the second and the third (AD-GEED and AD-GENO) represent asynchronous solutions that are robust to communication delays. AD-GENO can be seen as a refinement of AD-GEED, since it only requires node auxiliary variables, enhancing the scalability of the algorithm. Our main contribution is to prove convergence to a variational GNE of the game via an operator-theoretic approach. Finally, we apply the algorithms to network Cournot games and show how different activation sequences and delays affect convergence. We also compare the proposed algorithms to the only other in the literature (ADAGNES), and observe that AD-GENO outperforms the alternative.

preprint, 2020, arXiv

Analysis of a Nonlinear Opinion Dynamics Model with Biased Assimilation

This paper analyzes a nonlinear opinion dynamics model which generalizes the DeGroot model by introducing a bias parameter for each individual. The original DeGroot model is recovered when the bias parameter is equal to zero. The magnitude of this parameter reflects an individual's degree of bias when assimilating new opinions, and depending on the magnitude, an individual is said to have weak, intermediate, or strong bias. The opinions of the individuals lie between 0 and 1. It is shown that for strongly connected networks, the equilibria with all elements identically equal to the extreme value 0 or 1 are locally exponentially stable, while the equilibrium with all elements equal to the neutral consensus value of 1/2 is unstable. Regions of attraction for the extreme consensus equilibria are given. For the equilibrium consisting of both extreme values 0 and 1, which corresponds to opinion polarization according to the model, it is shown that the equilibrium is unstable for all strongly connected networks if individuals all have weak bias, becomes locally exponentially stable for complete and two-island networks if individuals all have strong bias, and its stability heavily depends on the network topology when individuals have intermediate bias. Analysis on star graphs and simulations show that additional equilibria may exist where individuals form clusters.
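
The abstract does not state the update rule, so the sketch below uses one form that appears in the biased-assimilation literature and is assumed here purely for illustration: each individual weights the neighbors' support for opinion 1 by its own opinion raised to the bias parameter, and the support for opinion 0 by one minus its opinion raised to the same power. The function name `biased_step`, the weight matrix, and the initial opinions are all hypothetical.

```python
def biased_step(x, w, bias):
    """One synchronous update of an assumed biased-assimilation opinion model.

    x[i] in [0, 1] is the opinion of individual i, w[i][j] >= 0 is the weight
    i places on j (with w[i][i] > 0), and bias[i] >= 0 is the bias parameter.
    With bias[i] == 0 the update reduces to a DeGroot weighted average.
    """
    n = len(x)
    new_x = []
    for i in range(n):
        s = sum(w[i][j] * x[j] for j in range(n) if j != i)  # neighbor support for 1
        d = sum(w[i][j] for j in range(n) if j != i)         # total neighbor weight
        pull_up = (x[i] ** bias[i]) * s
        pull_down = ((1.0 - x[i]) ** bias[i]) * (d - s)
        new_x.append((w[i][i] * x[i] + pull_up)
                     / (w[i][i] + pull_up + pull_down))
    return new_x


# Hypothetical complete graph on three individuals with unit weights.
w = [[1.0, 1.0, 1.0] for _ in range(3)]

# Zero bias: each opinion becomes the plain weighted average (DeGroot).
print(biased_step([0.2, 0.4, 0.9], w, bias=[0, 0, 0]))

# Positive bias: opinions starting near 1 are pushed further toward 1.
x = [0.9, 0.9, 0.9]
for _ in range(50):
    x = biased_step(x, w, bias=[2, 2, 2])
print(x)
```

The zero-bias case matches the abstract's statement that the DeGroot model is recovered, and the second run is consistent with the extreme consensus equilibrium at 1 being locally attracting under this assumed rule.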

preprint, 2020, arXiv

Design of Privacy-Preserving Dynamic Controllers

As a quantitative criterion for privacy of "mechanisms" in the form of data-generating processes, the concept of differential privacy was first proposed in computer science and has later been applied to linear dynamical systems. However, differential privacy has not been studied in depth together with other properties of dynamical systems, and it has not been fully utilized for controller design. In this paper, first we clarify that a classical concept in systems and control, input observability (sometimes referred to as left invertibility) has a strong connection with differential privacy. In particular, we show that the Gaussian mechanism can be made highly differentially private by adding small noise if the corresponding system is less input observable. Next, enabled by our new insight into privacy, we develop a method to design dynamic controllers for the classic tracking control problem while addressing privacy concerns. We call the obtained controller through our design method the privacy-preserving controller. The usage of such controllers is further illustrated by an example of tracking the prescribed power supply in a DC microgrid installed with smart meters while keeping the electricity consumers' tracking errors private.

preprint, 2020, arXiv

Mediated Remote Synchronization of Kuramoto-Sakaguchi Oscillators: the Number of Mediators Matters

Cortical regions without direct neuronal connections have been observed to exhibit synchronized dynamics. A recent empirical study has further revealed that such regions that share more common neighbors are more likely to behave coherently. To analytically investigate the underlying mechanisms, we consider that a set of n oscillators, which have no direct connections, are linked through m intermediate oscillators (called mediators), forming a complete bipartite network structure. Modeling the oscillators by the Kuramoto-Sakaguchi model, we rigorously prove that mediated remote synchronization, i.e., synchronization between those n oscillators that are not directly connected, becomes more robust as the number of mediators increases. Simulations are also carried out to show that our theoretical findings can be applied to other general and complex networks.

preprint, 2020, arXiv

Optimal Universal Controllers for Roll Stabilization

Roll stabilization is an important problem of ship motion control. This problem becomes especially difficult if the same set of actuators (e.g. a single rudder) has to be used for roll stabilization and heading control of the vessel, so that the roll stabilizing system interferes with the ship autopilot. Finding the "trade-off" between the concurrent goals of accurate vessel steering and roll stabilization usually reduces to an optimization problem, which has to be solved in the presence of an unknown wave disturbance. Standard approaches to this problem (loop-shaping, LQG, $H_{\infty}$-control etc.) require knowledge of the spectral density of the disturbance, considered to be a "colored noise". In this paper, we propose a novel approach to optimal roll stabilization, approximating the disturbance by a polyharmonic signal with known frequencies yet uncertain amplitudes and phase shifts. Linear quadratic optimization problems in the presence of polyharmonic disturbances can be solved by means of the theory of universal controllers developed by V.A. Yakubovich. An optimal universal controller (OUC) delivers the optimal solution for any uncertain amplitudes and phases. Using the Marine Systems Simulator (MSS) Toolbox, which provides a realistic vessel model, we compare our design method with classical approaches to optimal roll stabilization. Among three controllers providing the same quality of yaw steering, OUC stabilizes the roll motion most efficiently.

preprint, 2020, arXiv

Path Following Control in 3D Using a Vector Field

Using a designed vector field to control a mobile robot to follow a given desired path is intuitive and practical, and to build a rigorous theory to guide its implementation is essential. In this paper, we study the properties of a general 3D vector field for robotic path following. We propose and investigate assumptions that turn out to be crucial for this method, but have been rarely explicitly stated in related works. We derive conditions under which the local path-following error vanishes exponentially in a sufficiently small neighborhood of the desired path, which is key to show the local input-to-state stability (local ISS) property of the path-following error dynamics. The local ISS property then justifies the control algorithm design for a fixed-wing aircraft model. Our approach is effective for any sufficiently smooth desired path in 3D, bounded or unbounded; note that the case for unbounded desired paths has not been sufficiently discussed in the literature. Simulations are conducted to verify the theoretical results.

preprint, 2020, arXiv

Recurrent Averaging Inequalities in Multi-Agent Control and Social Dynamics Modeling

Many multi-agent control algorithms and dynamic agent-based models arising in natural and social sciences are based on the principle of iterative averaging. Each agent is associated with a value of interest, which may represent, for instance, the opinion of an individual in a social group, the velocity vector of a mobile robot in a flock, or the measurement of a sensor within a sensor network. This value is updated, at each iteration, to a weighted average of itself and of the values of the adjacent agents. It is well known that, under natural assumptions on the network's graph connectivity, this local averaging procedure eventually leads to global consensus, or synchronization of the values at all nodes. Applications of iterative averaging include, but are not limited to, algorithms for distributed optimization, for solution of linear and nonlinear equations, for multi-robot coordination and for opinion formation in social groups. Although these algorithms have similar structures, the mathematical techniques used for their analysis are diverse, and conditions for their convergence differ from case to case. In this paper, we review many of these algorithms and we show that their properties can be analyzed in a unified way by using a novel tool based on recurrent averaging inequalities (RAIs). We develop a theory of RAIs and apply it to the analysis of several important multi-agent algorithms recently proposed in the literature.
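
The iterative averaging principle described in this abstract can be sketched in a few lines. The example below is only the basic averaging equality on a fixed graph, not the recurrent averaging inequalities the paper develops. The three-agent path graph, its weights, and the function names are assumptions chosen for illustration.

```python
def averaging_step(values, weights):
    """Each agent replaces its value by a weighted average of itself and its neighbors.

    weights[i][j] is the weight agent i gives to agent j; every row is
    nonnegative and sums to 1, and weights[i][j] == 0 if j is not adjacent to i.
    """
    return [sum(w * v for w, v in zip(row, values)) for row in weights]


def run_consensus(values, weights, steps):
    """Apply the averaging step repeatedly and return the final values."""
    for _ in range(steps):
        values = averaging_step(values, weights)
    return values


# Hypothetical path graph 0 - 1 - 2 with self-weights.
W = [
    [0.5, 0.5, 0.0],
    [0.25, 0.5, 0.25],
    [0.0, 0.5, 0.5],
]
x_final = run_consensus([0.0, 0.5, 1.0], W, steps=200)
print(x_final)
```

Because the graph is connected and every agent keeps a positive self-weight, the values contract to a common number; for this particular weight matrix and initial condition the common value is 0.5.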

preprint, 2020, arXiv

Vector Field Guided Path Following Control: Singularity Elimination and Global Convergence

Vector field guided path following (VF-PF) algorithms are fundamental in robot navigation tasks, but may not deliver the desirable performance when robots encounter singular points where the vector field becomes zero. The existence of singular points prevents the global convergence of the vector field's integral curves to the desired path. Moreover, VF-PF algorithms, as well as most of the existing path following algorithms, fail to enable following a self-intersected desired path. In this paper, we show that such failures are fundamentally related to the mathematical topology of the path, and that by "stretching" the desired path along a virtual dimension, one can remove the topological obstruction. Consequently, this paper proposes a new guiding vector field defined in a higher-dimensional space, in which self-intersected desired paths become free of self-intersections; more importantly, the new guiding vector field does not have any singular points, enabling the integral curves to converge globally to the "stretched" path. We further introduce the extended dynamics to retain this appealing global convergence property for the desired path in the original lower-dimensional space. Both simulations and experiments are conducted to verify the theory.

preprint, 2020, arXiv

Zero-Determinant strategies in repeated multiplayer social dilemmas with discounted payoffs

In two-player repeated games, Zero-Determinant (ZD) strategies enable a player to unilaterally enforce a linear payoff relation between her own and her opponent's payoff irrespective of the opponent's strategy. This manipulative nature of the ZD strategies attracted significant attention from researchers due to its close connection to controlling distributively the outcome of evolutionary games in large populations. In this paper, necessary and sufficient conditions are derived for a payoff relation to be enforceable in multiplayer social dilemmas with a finite expected number of rounds that is determined by a fixed and common discount factor. Thresholds exist for such a discount factor above which desired payoff relations can be enforced. Our results show that depending on the group size and the ZD-strategist's initial probability to cooperate there exist extortionate, generous and equalizer ZD-strategies. The threshold discount factors rely on the desired payoff relation and the variation in the single-round payoffs. To show the utility of our results, we apply them to multiplayer social dilemmas, and show how discounting affects ZD Nash equilibria.