Researcher profile

Di Zhao

Di Zhao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration

Large Language Model (LLM)-based Multi-Agent Systems (MAS) enhance complex problem solving through multi-agent collaboration, but often incur substantially higher costs than single-agent systems. Recent MAS routing methods aim to balance performance and overhead by dynamically selecting agent roles and language models. However, these approaches typically rely on a homogeneous collaboration mode, where all agents follow the same interaction pattern, limiting collaboration flexibility across different roles. Motivated by Social Capital Theory, which emphasizes that different roles benefit from distinct forms of collaboration, we propose SC-MAS, a framework for constructing heterogeneous and cost-efficient multi-agent systems. SC-MAS models MAS as directed graphs, where edges explicitly represent pairwise collaboration strategies, allowing different agent pairs to interact through tailored communication patterns. Given an input query, a unified controller progressively constructs an executable MAS by selecting task-relevant agent roles, assigning edge-level collaboration strategies, and allocating appropriate LLM backbones to individual agents. Experiments on multiple benchmarks demonstrate the effectiveness of SC-MAS. In particular, SC-MAS improves accuracy by 3.35% on MMLU while reducing inference cost by 15.38%, and achieves a 3.53% accuracy gain with a 12.13% cost reduction on MBPP. These results validate the feasibility of SC-MAS and highlight the effectiveness of heterogeneous collaboration in multi-agent systems.

preprint2026arXiv

TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents

Recent breakthroughs in Large Language Models (LLMs) have positioned them as a promising paradigm for agents, with long-term planning and decision-making emerging as core general-purpose capabilities for adapting to diverse scenarios and tasks. Real-time strategy (RTS) games serve as an ideal testbed for evaluating these two capabilities, as their inherent gameplay requires both macro-level strategic planning and micro-level tactical adaptation and action execution. Existing RTS game-based environments either suffer from relatively high computational demands or lack support for textual observations, which has constrained the use of RTS games for LLM evaluation. Motivated by this, we present TowerMind, a novel environment grounded in the tower defense (TD) subgenre of RTS games. TowerMind preserves the key evaluation strengths of RTS games for assessing LLMs, while featuring low computational demands and a multimodal observation space, including pixel-based, textual, and structured game-state representations. In addition, TowerMind supports the evaluation of model hallucination and provides a high degree of customizability. We design five benchmark levels to evaluate several widely used LLMs under different multimodal input settings. The results reveal a clear performance gap between LLMs and human experts across both capability and hallucination dimensions. The experiments further highlight key limitations in LLM behavior, such as inadequate planning validation, a lack of multifinality in decision-making, and inefficient action use. We also evaluate two classic reinforcement learning algorithms: Ape-X DQN and PPO. By offering a lightweight and multimodal design, TowerMind complements the existing RTS game-based environment landscape and introduces a new benchmark for the AI agent field. The source code is publicly available on GitHub(https://github.com/tb6147877/TowerMind).

preprint2023arXiv

Investigation of the laser-induced lineshape change in attosecond transient absorption spectra by employing a time-dependent generalized Floquet approach

We introduce a time-dependent generalized Floquet (TDGF) approach to calculate attosecond transient absorption spectra of helium atoms subjected to the combination of an attosecond extreme ultraviolet (XUV) pulse and a delayed few-cycle infrared (IR) laser pulse. This TDGF approach provides a Floquet understanding of the laser-induced change of resonant absorption lineshape. It is analytically demonstrated that, the phase shift of the time-dependent dipole moment that results in the lineshape changes consists of the \emph{adiabatic} laser-induced phase (LIP) due to the IR-induced stark shifts of adiabatic Floquet states and the \emph{non-adiabatic} phase correction due to the non-adiabatic IR-induced coupling between adiabatic Floquet states. Comparisons of the spectral lineshape calculated based on the TDGF approach with the results obtained with the LIP model [S. Chen \emph{et al.}, Phys. Rev. A \textbf{88}, 033409(2013)] and the rotating-wave approximation (RWA) are made in several typical cases. It is suggested in the picture of adiabatic Floquet states that, the LIP model works as long as the generalized adiabatic theorem [A. Dodin \emph{et al.}, Phys. Rev. X Quantum \textbf{2}, 030302(2021)] fulfils, and the RWA works when the higher-order IR-coupling effect in the formation of adiabatic Floquet states is neglectable.

preprint2022arXiv

Some statistics on generalized Motzkin paths with vertical steps

Recently, several authors have considered lattice paths with various steps, including vertical steps permitted. In this paper, we consider a kind of generalized Motzkin paths, called {\it G-Motzkin paths} for short, that is lattice paths from $(0, 0)$ to $(n, 0)$ in the first quadrant of the $XOY$-plane that consist of up steps $\mathbf{u}=(1, 1)$, down steps $\mathbf{d}=(1, -1)$, horizontal steps $\mathbf{h}=(1, 0)$ and vertical steps $\mathbf{v}=(0, -1)$. We mainly count the number of G-Motzkin paths of length $n$ with given number of $\mathbf{z}$-steps for $\mathbf{z}\in \{\mathbf{u}, \mathbf{h}, \mathbf{v}, \mathbf{d}\}$, and enumerate the statistics "number of $\mathbf{z}$-steps" at given level in G-Motzkin paths for $\mathbf{z}\in \{\mathbf{u}, \mathbf{h}, \mathbf{v}, \mathbf{d}\}$, some explicit formulas and combinatorial identities are given by bijective and algebraic methods, some enumerative results are linked with Riordan arrays according to the structure decompositions of G-Motzkin paths. We also discuss the statistics "number of $\mathbf{z}_1\mathbf{z}_2$-steps" in G-Motzkin paths for $\mathbf{z}_1, \mathbf{z}_2\in \{\mathbf{u}, \mathbf{h}, \mathbf{v}, \mathbf{d}\}$, the exact counting formulas except for $\mathbf{z}_1\mathbf{z}_2=\mathbf{dd}$ are obtained by the Lagrange inversion formula and their generating functions.

preprint2020arXiv

Stabilization of Cascaded Two-Port Networked Systems with Simultaneous Nonlinear Uncertainties

We introduce a versatile framework to model and study networked control systems (NCSs). An NCS is described as a feedback interconnection of a plant and a controller communicating through a bidirectional channel modelled by cascaded nonlinear two-port networks. This model is sufficiently rich to capture various properties of a real-world communication channel, such as distortion, interference, and nonlinearity. Uncertainties in the plant, controller and communication channels can be handled simultaneously in the framework. We provide a necessary and sufficient condition for the robust finite-gain stability of an NCS when the model uncertainties in the plant and controller are measured by the gap metric and those in the nonlinear communication channels are measured by operator norms of the uncertain elements. This condition is given by an inequality involving "arcsine" of the uncertainty bounds and is derived from novel geometric insights underlying the robustness of a standard closed-loop system in the presence of conelike nonlinear perturbations on the system graphs.