Source author record

Ziming Wang

Ziming Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision gr-qc Artificial Intelligence astro-ph.HE astro-ph.IM eess.SY Neural and Evolutionary Computing Systems and Control

Catalog footprint

What is connected

9works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

An agentic framework for gravitational-wave counterpart association in the multi-messenger era

With the detection of gravitational waves (GWs), multi-messenger astronomy has opened a new window for advancing our understanding of astrophysics, dense matter, gravitation, and cosmology. The GW sources detected to date are from mergers of compact object binaries, which possess the potential to generate detectable electromagnetic (EM) counterparts. Searching for associations between GW signals and their EM counterparts is an essential step toward enabling subsequent multi-messenger studies. In the era of next-generation GW and EM detectors, the rapid increase in the number of events brings not only unprecedented scientific opportunities, but also substantial challenges to the existing data analysis paradigm. To help address these challenges, we develop GW-Eyes, an agentic framework powered by large language models (LLMs). For the first time, GW-Eyes integrates domain-specific tools and autonomously performs counterpart association tasks between GW and candidate EM events. It supports natural language interaction to assist human experts with auxiliary tasks such as catalog management, skymap visualization, and rapid verification. Our framework leverages the complex decision-making capabilities of LLMs and their traceable reasoning processes, offering a new perspective to the multi-messenger astronomy.

preprint2026arXiv

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games

Recent advances in Vision-Language-Action models (VLAs) have expanded the capabilities of embodied intelligence. However, significant challenges remain in real-time decision-making in complex 3D environments, which demand second-level responses, high-resolution perception, and tactical reasoning under dynamic conditions. To advance the field, we introduce CombatVLA, an efficient VLA model optimized for combat tasks in 3D action role-playing games(ARPGs). Specifically, our CombatVLA is a 3B model trained on video-action pairs collected by an action tracker, where the data is formatted as action-of-thought (AoT) sequences. Thereafter, CombatVLA seamlessly integrates into an action execution framework, allowing efficient inference through our truncated AoT strategy. Experimental results demonstrate that CombatVLA not only outperforms all existing models on the combat understanding benchmark but also achieves a 50-fold acceleration in game combat. Moreover, it has a higher task success rate than human players. We will open-source all resources, including the action tracker, dataset, benchmark, model weights, training code, and the implementation of the framework at https://combatvla.github.io/.

preprint2026arXiv

Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models

Large vision-language models (LVLMs) excel at visual understanding, but face efficiency challenges due to quadratic complexity in processing long multi-modal contexts. While token compression can reduce computational costs, existing approaches are designed for single-view LVLMs and fail to consider the unique multi-view characteristics of high-resolution LVLMs with dynamic cropping. Existing methods treat all tokens uniformly, but our analysis reveals that global thumbnails can naturally guide the compression of local crops by providing holistic context for informativeness evaluation. In this paper, we first analyze dynamic cropping strategy, revealing both the complementary nature between thumbnails and crops, and the distinctive characteristics across different crops. Based on our observations, we propose ``Global Compression Commander'' (\textit{i.e.}, \textbf{GlobalCom$^2$}), a novel plug-and-play token compression framework for HR-LVLMs. GlobalCom$^2$ leverages thumbnail as the ``commander'' to guide the compression of local crops, adaptively preserving informative details while eliminating redundancy. Extensive experiments show that GlobalCom$^2$ maintains over \textbf{90\%} performance while compressing \textbf{90\%} visual tokens, reducing FLOPs and peak memory to \textbf{9.1\%} and \textbf{60\%}.

preprint2026arXiv

Neural-network-based Self-triggered Observed Platoon Control for Autonomous Vehicles

This paper investigates autonomous vehicle (AV) platoon control under uncertain dynamics and intermittent communication, which remains a critical challenge in intelligent transportation systems. To address these issues, this paper proposes an adaptive consensus tracking control framework for nonlinear multi-agent systems (MASs). The proposed approach integrates backstepping design, a nonlinear sampled-data observer, radial basis function neural networks, and a self-triggered communication mechanism. The radial basis function neural networks approximate unknown nonlinearities and time-varying disturbances, thereby enhancing system robustness. A distributed observer estimates neighboring states based on limited and intermittent measurements, thereby reducing dependence on continuous communication. Moreover, self-triggered mechanism is developed to determine triggering instants, guaranteeing a strictly positive minimum inter-event time and preventing Zeno behavior. The theoretical analysis proves that all closed-loop signals are uniformly ultimately bounded (UUB), and tracking errors converge to a compact set. Simulation results demonstrate that the proposed approach achieves high robustness, adaptability, and communication efficiency, making it suitable for real-world networked vehicle systems.

preprint2022arXiv

An Efficient Multi-Indicator and Many-Objective Optimization Algorithm based on Two-Archive

Indicator-based algorithms are gaining prominence as traditional multi-objective optimization algorithms based on domination and decomposition struggle to solve many-objective optimization problems. However, previous indicator-based multi-objective optimization algorithms suffer from the following flaws: 1) The environment selection process takes a long time; 2) Additional parameters are usually necessary. As a result, this paper proposed an multi-indicator and multi-objective optimization algorithm based on two-archive (SRA3) that can efficiently select good individuals in environment selection based on indicators performance and uses an adaptive parameter strategy for parental selection without setting additional parameters. Then we normalized the algorithm and compared its performance before and after normalization, finding that normalization improved the algorithm's performance significantly. We also analyzed how normalizing affected the indicator-based algorithm and observed that the normalized $I_{ε+}$ indicator is better at finding extreme solutions and can reduce the influence of each objective's different extent of contribution to the indicator due to its different scope. However, it also has a preference for extreme solutions, which causes the solution set to converge to the extremes. As a result, we give some suggestions for normalization. Then, on the DTLZ and WFG problems, we conducted experiments on 39 problems with 5, 10, and 15 objectives, and the results show that SRA3 has good convergence and diversity while maintaining high efficiency. Finally, we conducted experiments on the DTLZ and WFG problems with 20 and 25 objectives and found that the algorithm proposed in this paper is more competitive than other algorithms as the number of objectives increases.

preprint2022arXiv

Extending the Fisher Information Matrix in Gravitational-wave Data Analysis

The Fisher information matrix (FM) plays an important role in forecasts and inferences in many areas of physics. While giving fast parameter estimation with the Gaussian likelihood approximation in the parameter space, the FM can only give the ellipsoidal posterior contours of parameters and lose the higher-order information beyond Gaussianity. We extend the FM in gravitational-wave (GW) data analysis using the Derivative Approximation for LIkelihoods (DALI), a method to expand the likelihood while keeping it positive definite and normalizable at every order, for more accurate forecasts and inferences. When applied to the two real GW events, GW150914 and GW170817, DALI can reduce the difference between FM approximation and the real posterior by 5 times in the best case. The calculation time of DALI and FM is at the same order of magnitude, while obtaining the real full posterior will take several orders of magnitude longer. Besides more accurate approximations, higher-order correction from DALI provides a fast assessment on the FM analysis and gives suggestions for complex sampling techniques which are computationally intensive. We recommend using the DALI method as an extension to the FM method in GW data analysis to pursue better accuracy while still keeping the speed.

preprint2022arXiv

Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation

Point cloud registration aims at estimating the geometric transformation between two point cloud scans, in which point-wise correspondence estimation is the key to its success. In addition to previous methods that seek correspondences by hand-crafted or learnt geometric features, recent point cloud registration methods have tried to apply RGB-D data to achieve more accurate correspondence. However, it is not trivial to effectively fuse the geometric and visual information from these two distinctive modalities, especially for the registration problem. In this work, we propose a new Geometry-Aware Visual Feature Extractor (GAVE) that employs multi-scale local linear transformation to progressively fuse these two modalities, where the geometric features from the depth data act as the geometry-dependent convolution kernels to transform the visual features from the RGB data. The resultant visual-geometric features are in canonical feature spaces with alleviated visual dissimilarity caused by geometric changes, by which more reliable correspondence can be achieved. The proposed GAVE module can be readily plugged into recent RGB-D point cloud registration framework. Extensive experiments on 3D Match and ScanNet demonstrate that our method outperforms the state-of-the-art point cloud registration methods even without correspondence or pose supervision. The code is available at: https://github.com/514DNA/LLT.

preprint2022arXiv

Simultaneous bounds on the gravitational dipole radiation and varying gravitational constant from compact binary inspirals

Compact binaries are an important class of gravitational-wave (GW) sources that can be detected by current and future GW observatories. They provide a testbed for general relativity (GR) in the highly dynamical strong-field regime. Here, we use GWs from inspiraling binary neutron stars and binary black holes to investigate dipolar gravitational radiation (DGR) and varying gravitational constant predicted by some alternative theories to GR, such as the scalar-tensor gravity. Within the parametrized post-Einsteinian framework, we introduce the parametrization of these two effects simultaneously into compact binaries' inspiral waveform and perform the Fisher-information-matrix analysis to estimate their simultaneous bounds. In general, the space-based GW detectors can give a tighter limit than ground-based ones. The tightest constraints can reach $σ_B<3\times10^{-11}$ for the DGR parameter $B$ and $σ_{\dot{G}}/G < 7\times10^{-9} \, {\rm yr}^{-1} $ for the varying $G$, when the time to coalescence of the GW event is close to the lifetime of space-based detectors. In addition, we analyze the correlation between these two effects and highlight the importance of considering both effects in order to arrive at more realistic results.

preprint2014arXiv

Local Stability of Strongly Magnetized Black Hole Tori

We investigate the local stability of strongly magnetized relativistic tori orbiting Kerr black holes, for the case of a purely toroidal magnetic field topology. Our approach encompasses both solving the full dispersion relation numerically and solving a simplified analytic treatment. Both approaches indicate that such tori are subject to an unstable non-axisymmetric magnetorotational mode, regardless of such background properties as the gas-to-magnetic pressure ratio and specific angular momentum distribution. We demonstrate that our modal analysis matches well with what is seen in global, three-dimensional, general relativistic magnetohydrodynamic simulations.

Ziming Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

An agentic framework for gravitational-wave counterpart association in the multi-messenger era

CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games

Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models

Neural-network-based Self-triggered Observed Platoon Control for Autonomous Vehicles

An Efficient Multi-Indicator and Many-Objective Optimization Algorithm based on Two-Archive

Extending the Fisher Information Matrix in Gravitational-wave Data Analysis

Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation

Simultaneous bounds on the gravitational dipole radiation and varying gravitational constant from compact binary inspirals

Local Stability of Strongly Magnetized Black Hole Tori