Source author record

Minghui Zhu

Minghui Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


math.OC; Systems and Control; Robotics; math.DS; Computer Science and Game Theory; Cryptography and Security; eess.SY; Machine Learning; Multiagent Systems

Catalog footprint

What is connected

17 works, 9 topics, 4 close collaborators


preprint 2026 arXiv

Differentiation Between Faults and Cyberattacks through Combined Analysis of Cyberspace Logs and Physical Measurements

In recent years, cyberattacks, along with physical faults, have become an increasingly common cause of system failures, especially in DER (Distributed Energy Resources) systems. In addition, according to the literature, a number of faults have been reported to remain undetected. Consequently, differentiating between undetected faults and cyberattacks is a challenging task that goes beyond anomaly detection, which only identifies abnormalities. Although several works have studied this problem, they fall short of an accurate distinction because they rely on physical laws or physical measurements. To resolve this issue, the industry typically conducts an integrated analysis of physical measurements and cyberspace information. Nevertheless, this approach is time-consuming because the analysis requires manual effort. In this work, we address these gaps by proposing an approach for distinguishing between undetected faults and cyberattacks in DER systems. Specifically, first, a special kind of dependency graph is constructed using a novel virtual physical variable-oriented taint analysis (PVOTA) algorithm. Then, the graph is simplified using a node pruning technique based on a set of context-dependent operations. Next, a set of patterns capturing domain-specific knowledge is derived to bridge the semantic gaps between the cyber and physical sides. Finally, these patterns are matched to the relevant events that occurred during failure incidents, and possible root causes are inferred from the pattern matching results. The efficacy of the proposed automatic integrated analysis is evaluated through four case studies covering failure incidents caused by an FDI attack, undetected faults, and memory corruption attacks.

preprint 2026 arXiv

Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization

Inverse reinforcement learning (IRL) learns a reward function and a corresponding policy that best fit the demonstration data of an expert. However, in the current IRL setting, the learner is isolated from the expert and can only passively observe the expert demonstrations. This limits the applicability of IRL to interactive settings, where the learner actively interacts with the expert and needs to infer the expert's reward function from the interactions. To bridge the gap, this paper studies interactive IRL (IIRL), where a learner aims to learn, while interacting with an expert, both the expert's reward function and a policy for that interaction. We formulate IIRL as a stochastic bi-level optimization problem where the lower level learns a reward function to explain the behaviors of the expert, and the upper level learns a policy to interact with the expert. We develop a double-loop algorithm, Bi-level Interactive Scenarios Inverse Reinforcement Learning (BISIRL), which solves the lower-level problem in the inner loop and the upper-level problem in the outer loop. We formally guarantee that BISIRL converges and validate our algorithm through extensive experiments.

preprint 2024 arXiv

iPolicy: Incremental Policy Algorithms for Feedback Motion Planning

This paper presents policy-based motion planning for robotic systems. The motion planning literature has mostly focused on open-loop trajectory planning followed by online tracking. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for motion planning, which integrates sampling-based methods and set-valued optimal control methods to compute feedback controllers for the robotic system. In particular, we use sampling to incrementally construct the state space of the system. Asynchronous value iterations are performed on the sampled state space to synthesize the incremental policy feedback controller. We show the convergence of the estimates to the optimal value function in continuous state space. Numerical results with various dynamical systems (including nonholonomic systems) verify the optimality and effectiveness of iPolicy.
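The idea of running asynchronous value updates on an incrementally sampled state space can be illustrated with a toy sketch. This is not the iPolicy algorithm from the paper: the one-dimensional state space, the unit-speed dynamics, the connection radius, and the names `incremental_value_iteration`, `radius`, and `goal_min` are all assumptions made for illustration.

```python
import math

def incremental_value_iteration(samples, radius, goal_min):
    """Toy minimum-time cost-to-go on a 1D line (hypothetical setup).

    States are added one at a time. After each addition, in-place
    (asynchronous) Bellman backups run until no value changes.
    A state may move to any sampled state within `radius`, and the
    travel time equals the distance (unit speed is assumed).
    """
    states = []
    value = {}
    for new_state in samples:
        states.append(new_state)
        # Goal states cost nothing; all others start at infinity.
        value[new_state] = 0.0 if new_state >= goal_min else math.inf
        changed = True
        while changed:
            changed = False
            for x in states:
                if x >= goal_min:
                    continue
                # Bellman backup over currently sampled neighbours.
                candidates = [
                    abs(x - y) + value[y]
                    for y in states
                    if y != x and abs(x - y) <= radius
                ]
                best = min(candidates, default=math.inf)
                if best < value[x]:
                    value[x] = best  # in-place update, reused immediately
                    changed = True
    return value

cost_to_go = incremental_value_iteration(
    [0.0, 1.0, 0.5, 0.25, 0.75], radius=0.3, goal_min=1.0
)
```

In this toy run the start state at 0.0 has no finite cost-to-go until the last sample (0.75) connects the chain to the goal, which mirrors how estimates improve as more states are sampled.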

preprint 2020 arXiv

Distributed Robust Adaptive Frequency Control of Power Systems with Dynamic Loads

This paper investigates the frequency control of multi-machine power systems subject to uncertain and dynamic net loads. We propose distributed internal model controllers that coordinate synchronous generators and demand response to tackle the unpredictable nature of net loads. Frequency stability is formally guaranteed via Lyapunov analysis. Numerical simulations on the IEEE 68-bus test system demonstrate the effectiveness of the controllers.

preprint 2020 arXiv

Pareto optimal multi-robot motion planning

This paper studies a class of multi-robot coordination problems where a team of robots aims to reach their goal regions in minimum time while avoiding collisions with obstacles and other robots. A novel numerical algorithm is proposed to identify the Pareto optimal solutions where no robot can unilaterally reduce its traveling time without extending others'. The consistent approximation of the algorithm in the epigraphical profile sense is guaranteed using set-valued numerical analysis. Experiments on an indoor multi-robot platform and computer simulations show the anytime property of the proposed algorithm; i.e., it is able to quickly return a feasible control policy that safely steers the robots to their goal regions and it keeps improving policy optimality if more time is given.
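The Pareto optimality notion used here can be made concrete with a short sketch. It only filters a finite list of candidate travel-time tuples and is not the paper's numerical algorithm; the function names and the example times are hypothetical.

```python
def dominates(a, b):
    """True if outcome `a` is no worse than `b` for every robot
    and strictly better for at least one (smaller time is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(
        x < y for x, y in zip(a, b)
    )

def pareto_front(candidates):
    """Keep the travel-time tuples that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]

# Hypothetical travel times (robot 1, robot 2) for five joint plans.
plans = [(4, 6), (5, 5), (6, 4), (6, 6), (5, 7)]
front = pareto_front(plans)
```

Here `(6, 6)` is dropped because `(5, 5)` is faster for both robots, and `(5, 7)` is dropped because `(4, 6)` is faster for both. The three remaining plans trade one robot's time against the other's.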

preprint 2016 arXiv

Simultaneous Input and State Estimation for Linear Time-Varying Continuous-Time Stochastic Systems

In this paper, we present an optimal filter for linear time-varying continuous-time stochastic systems that simultaneously estimates the states and unknown inputs in an unbiased minimum-variance sense. We first show that the unknown inputs cannot be estimated without additional assumptions. Then, we discuss two complementary variants of the filter: (i) for the case when an additional measurement containing information about the state derivative is available, and (ii) for the case without the additional measurement, where the input signals are assumed to be sufficiently smooth with bounded derivatives. Conditions for uniform asymptotic stability and the existence of a steady-state solution for the proposed filter, as well as the convergence rate of the state and input estimate biases, are given. Moreover, we show that a principle of separation of estimation and control holds and that the unknown inputs may be rejected. Two examples, including a nonlinear vehicle reentry example, are given to illustrate that our filter is applicable even when some strong assumptions do not hold.

preprint 2016 arXiv

Simultaneous Mode, Input and State Estimation for Switched Linear Stochastic Systems

In this paper, we propose a filtering algorithm for simultaneously estimating the mode, input and state of hidden mode switched linear stochastic systems with unknown inputs. Using a multiple-model approach with a bank of linear input and state filters for each mode, our algorithm relies on the ability to find the most probable model as a mode estimate, which we show is possible with input and state filters by identifying a key property, that a particular residual signal we call generalized innovation is a Gaussian white noise. We also provide an asymptotic analysis for the proposed algorithm and provide sufficient conditions for asymptotically achieving convergence to the true model (consistency), or to the 'closest' model according to an information-theoretic measure (convergence). A simulation example of intention-aware vehicles at an intersection is given to demonstrate the effectiveness of our approach.

preprint 2015 arXiv

Distributed robust adaptive equilibrium computation for generalized convex games

This paper considers a class of generalized convex games where each player is associated with a convex objective function, a convex inequality constraint and a convex constraint set. The players aim to compute a Nash equilibrium through communicating with neighboring players. The particular challenge we consider is that the component functions are unknown a priori to associated players. We study two distributed computation algorithms and analyze their convergence properties in the presence of data transmission delays and dynamic changes of network topologies. The algorithm performance is verified through demand response on the IEEE 30-bus Test System. Our technical tools integrate convex analysis, variational inequalities and simultaneous perturbation stochastic approximation.

preprint 2014 arXiv

A Unified Filter for Simultaneous Input and State Estimation of Linear Discrete-time Stochastic Systems

In this paper, we present a unified optimal and exponentially stable filter for linear discrete-time stochastic systems that simultaneously estimates the states and unknown inputs in an unbiased minimum-variance sense, without making any assumptions on the direct feedthrough matrix. We also derive input and state observability/detectability conditions, and analyze their connection to the convergence and stability of the estimator. We discuss two variations of the filter and their optimality and stability properties, and show that filters in the literature, including the Kalman filter, are special cases of the filter derived in this paper. Finally, illustrative examples are given to demonstrate the performance of the unified unbiased minimum-variance filter.

preprint 2014 arXiv

Game theoretic controller synthesis for multi-robot motion planning Part I : Trajectory based algorithms

We consider a class of multi-robot motion planning problems where each robot is associated with multiple objectives and decoupled task specifications. The problems are formulated as an open-loop non-cooperative differential game. A distributed anytime algorithm is proposed to compute a Nash equilibrium of the game. The following properties are proven: (i) the algorithm asymptotically converges to the set of Nash equilibria; (ii) for scalar cost functionals, the price of stability equals one; (iii) in the worst case, the computational complexity and communication cost are linear in the number of robots.

preprint 2013 arXiv

Anytime computation algorithms for approach-evasion differential games

This paper studies a class of approach-evasion differential games, in which one player aims to steer the state of a dynamic system to the given target set in minimum time, while avoiding some set of disallowed states, and the other player desires to achieve the opposite. We propose a class of novel anytime computation algorithms, analyze their convergence properties and verify their performance via a number of numerical simulations. Our algorithms significantly outperform the multi-grid method for the approach-evasion differential games both theoretically and numerically. Our technical approach leverages incremental sampling in robotic motion planning and viability theory.

preprint 2013 arXiv

On the performance analysis of resilient networked control systems under replay attacks

This paper studies a resilient control problem for discrete-time, linear time-invariant systems subject to state and input constraints. State measurements and control commands are transmitted over a communication network and could be corrupted by adversaries. In particular, we consider replay attackers who maliciously repeat the messages sent from the operator to the actuator. We propose a variation of the receding-horizon control law to deal with the replay attacks and analyze the resulting system performance degradation. A class of competitive (resp. cooperative) resource allocation problems for resilient networked control systems is also investigated.

preprint 2013 arXiv

Real-time game theoretic coordination of competitive mobility-on-demand systems

This paper considers competitive mobility-on-demand systems where a group of vehicle sharing companies, on the one hand, want to collectively regulate the traffic of the user queueing network and, on the other hand, want to maximize their own profits at each time instant. We formulate the strategic interconnection among the companies as a real-time game theoretic coordination problem. We propose an algorithm to achieve vehicle balance and practical regulation of the user queueing network. We quantify the relation between the regulation error and the system parameters (e.g., the maximum variation of the user arrival rates).

preprint 2012 arXiv

An approximate dual subgradient algorithm for multi-agent non-convex optimization

We consider a multi-agent optimization problem where agents subject to local, intermittent interactions aim to minimize a sum of local objective functions subject to a global inequality constraint and a global state constraint set. In contrast to previous work, we do not require the objective, constraint functions, and state constraint sets to be convex. In order to deal with time-varying network topologies satisfying a standard connectivity assumption, we resort to consensus algorithm techniques and the Lagrangian duality method. We slightly relax the requirement of exact consensus, and propose a distributed approximate dual subgradient algorithm to enable agents to asymptotically converge to a pair of primal-dual solutions to an approximate problem. To guarantee convergence, we assume that Slater's condition is satisfied and the optimal solution set of the dual limit is a singleton. We implement our algorithm over a source localization problem and compare the performance with existing algorithms.

preprint 2011 arXiv

On distributed convex optimization under inequality and equality constraints via primal-dual subgradient methods

We consider a general multi-agent convex optimization problem where the agents are to collectively minimize a global objective function subject to a global inequality constraint, a global equality constraint, and a global constraint set. The objective function is defined by a sum of local objective functions, while the global constraint set is produced by the intersection of local constraint sets. In particular, we study two cases: one where the equality constraint is absent, and the other where the local constraint sets are identical. We devise two distributed primal-dual subgradient algorithms which are based on the characterization of the primal-dual optimal solutions as the saddle points of the Lagrangian and penalty functions. These algorithms can be implemented over networks whose topologies change but satisfy a standard connectivity property, and allow the agents to asymptotically agree on optimal solutions and optimal values of the optimization problem under Slater's condition.
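The saddle-point characterization behind primal-dual subgradient methods can be seen on a very small example. The sketch below is a centralized, single-agent illustration only: it omits the consensus steps and network model of the paper, and the problem (minimize x^2 subject to 1 - x <= 0), the step size, and the function name are assumptions chosen so the answer is easy to check by hand (x = 1 with multiplier mu = 2).

```python
def primal_dual_subgradient(step=0.05, iterations=2000):
    """Seek a saddle point of L(x, mu) = x**2 + mu * (1 - x), mu >= 0.

    The primal variable descends on L and the dual variable ascends,
    with the multiplier projected onto the nonnegative reals.
    """
    x, mu = 0.0, 0.0
    for _ in range(iterations):
        grad_x = 2.0 * x - mu   # partial derivative of L in x
        grad_mu = 1.0 - x       # partial derivative of L in mu (constraint value)
        x = x - step * grad_x                # primal descent step
        mu = max(0.0, mu + step * grad_mu)   # projected dual ascent step
    return x, mu

x_star, mu_star = primal_dual_subgradient()
```

At the saddle point the constraint is active (x = 1) and stationarity in x gives 2x = mu, so mu = 2. A constant step size is enough here because the toy objective is strongly convex; that choice is an assumption of this sketch, not a claim about the paper's step-size rule.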

preprint 2010 arXiv

Distributed coverage games for mobile visual sensor networks

Motivated by current challenges in data-intensive sensor networks, we formulate a coverage optimization problem for mobile visual sensors as a (constrained) repeated multi-player game. Each visual sensor tries to optimize its own coverage while minimizing the processing cost. We present two distributed learning algorithms where each sensor only remembers its own utility values and actions played during the last plays. These algorithms are proven to be convergent in probability to the set of (constrained) Nash equilibria and global optima of a certain coverage performance metric, respectively.

preprint 2010 arXiv

On the convergence time of asynchronous distributed quantized averaging algorithms

We propose a class of distributed quantized averaging algorithms on asynchronous communication networks with fixed, switching and random topologies. The implementation of these algorithms is subject to the realistic constraint that the communication rate, the memory capacities of agents and the computation precision are finite. The focus of this paper is on the study of the convergence time of the proposed quantized averaging algorithms. By appealing to random walks on graphs, we derive polynomial bounds on the expected convergence time of the algorithms presented.
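A minimal sketch of asynchronous quantized averaging is given below. It uses a generic pairwise gossip rule on integer states (split the pair's sum into floor and ceiling halves, assigned at random), which is an assumption for illustration and not necessarily the class of algorithms analyzed in the paper. The ring topology, initial values, and function name are likewise hypothetical.

```python
import random

def quantized_gossip(values, edges, rng, max_steps=100_000):
    """Asynchronous pairwise averaging with integer-valued states.

    At each step one random edge wakes up. Its two nodes replace their
    values with the floor and ceiling of their average, assigned in
    random order, so the network sum is preserved exactly.
    Stops once all values are within 1 of each other.
    """
    x = list(values)
    for step in range(max_steps):
        if max(x) - min(x) <= 1:
            return x, step
        i, j = rng.choice(edges)
        total = x[i] + x[j]
        low = total // 2
        high = total - low
        # The random assignment lets a unit difference move along the graph.
        if rng.random() < 0.5:
            x[i], x[j] = low, high
        else:
            x[i], x[j] = high, low
    return x, max_steps

ring = [(k, (k + 1) % 5) for k in range(5)]
final, steps = quantized_gossip([0, 10, 3, 7, 5], ring, random.Random(0))
```

The number of steps taken before the stopping test passes is the quantity a convergence-time analysis would bound. In this sketch it depends on the random edge sequence, so `steps` varies with the seed.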
