Source author record

Xiaobo Liu

Xiaobo Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math-ph math.AG math.DG math.MP Artificial Intelligence Machine Learning math.GT math.QA Robotics Computer Vision hep-th math.CO math.OC Networking and Internet Architecture Neurons and Cognition nlin.SI

Catalog footprint

What is connected

18works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

Traditional workflow-based agents exhibit limited intelligence when addressing real-world problems requiring tool invocation. Tool-integrated reasoning (TIR) agents capable of autonomous reasoning and tool invocation are rapidly emerging as a powerful approach for complex decision-making tasks involving multi-step interactions with external environments. In this work, we introduce MindWatcher, a TIR agent integrating interleaved thinking and multimodal chain-of-thought (CoT) reasoning. MindWatcher can autonomously decide whether and how to invoke diverse tools and coordinate their use, without relying on human prompts or workflows. The interleaved thinking paradigm enables the model to switch between thinking and tool calling at any intermediate stage, while its multimodal CoT capability allows manipulation of images during reasoning to yield more precise search results. We implement automated data auditing and evaluation pipelines, complemented by manually curated high-quality datasets for training, and we construct a benchmark, called MindWatcher-Evaluate Bench (MWE-Bench), to evaluate its performance. MindWatcher is equipped with a comprehensive suite of auxiliary reasoning tools, enabling it to address broad-domain multimodal problems. A large-scale, high-quality local image retrieval database, covering eight categories including cars, animals, and plants, endows model with robust object recognition despite its small size. Finally, we design a more efficient training infrastructure for MindWatcher, enhancing training speed and hardware utilization. Experiments not only demonstrate that MindWatcher matches or exceeds the performance of larger or more recent models through superior tool invocation, but also uncover critical insights for agent training, such as the genetic inheritance phenomenon in agentic RL.

preprint2022arXiv

Dynamic Cooperative Vehicle Platoon Control Considering Longitudinal and Lane-changing Dynamics

This paper presents a distributed cascade Proportional Integral Derivate (DCPID) control algorithm for the connected and automated vehicle (CAV) platoon considering the heterogeneity of CAVs in terms of the inertial lag. Furthermore, a real-time dynamic cooperative lane-changing model for CAVs, which can seamlessly combine the DCPID algorithm and the improved sine function is developed. The DCPID algorithm determines the appropriate longitudinal acceleration and speed of the lane-changing vehicle considering the speed fluctuations of the front vehicle on the target lane (TFV). In the meantime, the sine function plans a reference trajectory which is further updated in real time using the model predictive control (MPC) to avoid potential collisions until lane-changing is completed. Both the local and the asymptotic stability conditions of the DCPID algorithm are mathematically derived, and the sensitivity of the DCPID control parameters under different states is analyzed. Simulation experiments are conducted to assess the performance of the proposed model and the results indicate that the DCPID algorithm can provide robust control for tracking and adjusting the desired spacing and velocity for all 400 scenarios, even in the relatively extreme initial state. Besides, the proposed dynamic cooperative lane-changing model can guarantee an effective and safe lane-changing with different speeds and even in emergency situations (such as the sudden deceleration of the TFV).

preprint2022arXiv

Q-Polynomial expansion for Brezin-Gross-Witten tau-function

In this paper, we prove a conjecture of Alexandrov that the generalized Brezin-Gross-Witten tau-functions are hypergeometric tau functions of BKP hierarchy after re-scaling. In particular, this shows that the original BGW tau-function, which has enumerative geometric interpretations, can be represented as a linear combination of Schur Q-polynomials with simple coefficients.

preprint2022arXiv

Schur Q-Polynomials and Kontsevich-Witten Tau Function

Using matrix model, Mironov and Morozov recently gave a formula which represents Kontsevich-Witten tau-function as a linear expansion of Schur Q-polynomials. In this paper, we will show directly that the Q-polynomial expansion in this formula satisfies the Virasoro constraints, and consequently obtain a proof of this formula without using matrix model. We also give a proof for Alexandrov's conjecture that Kontsevich-Witten tau-function is a hypergeometric tau-function of the BKP hierarchy after re-scaling.

preprint2022arXiv

Traffic Analytics Development Kits (TADK): Enable Real-Time AI Inference in Networking Apps

Sophisticated traffic analytics, such as the encrypted traffic analytics and unknown malware detection, emphasizes the need for advanced methods to analyze the network traffic. Traditional methods of using fixed patterns, signature matching, and rules to detect known patterns in network traffic are being replaced with AI (Artificial Intelligence) driven algorithms. However, the absence of a high-performance AI networking-specific framework makes deploying real-time AI-based processing within networking workloads impossible. In this paper, we describe the design of Traffic Analytics Development Kits (TADK), an industry-standard framework specific for AI-based networking workloads processing. TADK can provide real-time AI-based networking workload processing in networking equipment from the data center out to the edge without the need for specialized hardware (e.g., GPUs, Neural Processing Unit, and so on). We have deployed TADK in commodity WAF and 5G UPF, and the evaluation result shows that TADK can achieve a throughput up to 35.3Gbps per core on traffic feature extraction, 6.5Gbps per core on traffic classification, and can decrease SQLi/XSS detection down to 4.5us per request with higher accuracy than fixed pattern solution.

preprint2021arXiv

Weighted Ensemble-model and Network Analysis: A method to predict fluid intelligence via naturalistic functional connectivity

Objectives: Functional connectivity triggered by naturalistic stimulus (e.g., movies) and machine learning techniques provide a great insight in exploring the brain functions such as fluid intelligence. However, functional connectivity are considered to be multi-layered, while traditional machine learning based on individual models not only are limited in performance, but also fail to extract multi-dimensional and multi-layered information from brain network. Methods: In this study, inspired by multi-layer brain network structure, we propose a new method namely Weighted Ensemble-model and Network Analysis, which combines the machine learning and graph theory for improved fluid intelligence prediction. Firstly, functional connectivity analysis and graphical theory were jointly employed. The functional connectivity and graphical indices computed using the preprocessed fMRI data were then all fed into auto-encoder parallelly for feature extraction to predict the fluid intelligence. In order to improve the performance, tree regression and ridge regression model were automatically stacked and fused with weighted values. Finally, layers of auto-encoder were visualized to better illustrate the connectome patterns, followed by the evaluation of the performance to justify the mechanism of brain functions. Results: Our proposed methods achieved best performance with 3.85 mean absolute deviation, 0.66 correlation coefficient and 0.42 R-squared coefficient, outperformed other state-of-the-art methods. It is also worth noting that, the optimization of the biological pattern extraction was automated though the auto-encoder algorithm. Conclusion: The proposed method not only outperforming the state-of-the-art reports, but also able to effectively capturing the biological patterns from functional connectivity during naturalistic movies state for potential clinical explorations.

preprint2020arXiv

DeepClaw: A Robotic Hardware Benchmarking Platform for Learning Object Manipulation

We present DeepClaw as a reconfigurable benchmark of robotic hardware and task hierarchy for robot learning. The DeepClaw benchmark aims at a mechatronics perspective of the robot learning problem, which features a minimum design of robot cell that can be easily reconfigured to host robot hardware from various vendors, including manipulators, grippers, cameras, desks, and objects, aiming at a streamlined collection of physical manipulation data and evaluation of the learned skills for hardware benchmarking. We provide a detailed design of the robot cell with readily available parts to build the experiment environment that can host a wide range of robotic hardware commonly adopted for robot learning. We also propose a hierarchical pipeline of software integration, including localization, recognition, grasp planning, and motion planning, to streamline learning-based robot control, data collection, and experiment validation towards shareability and reproducibility. We present benchmarking results of the DeepClaw system for a baseline Tic-Tac-Toe task, a bin-clearing task, and a jigsaw puzzle task using three sets of standard robotic hardware. Our results show that tasks defined in DeepClaw can be easily reproduced on three robot cells. Under the same task setup, the differences in robotic hardware used will present a non-negligible impact on the performance metrics of robot learning. All design layouts and codes are hosted on Github for open access.

preprint2020arXiv

Graph Convolutional Subspace Clustering: A Robust Subspace Clustering Framework for Hyperspectral Image

Hyperspectral image (HSI) clustering is a challenging task due to the high complexity of HSI data. Subspace clustering has been proven to be powerful for exploiting the intrinsic relationship between data points. Despite the impressive performance in the HSI clustering, traditional subspace clustering methods often ignore the inherent structural information among data. In this paper, we revisit the subspace clustering with graph convolution and present a novel subspace clustering framework called Graph Convolutional Subspace Clustering (GCSC) for robust HSI clustering. Specifically, the framework recasts the self-expressiveness property of the data into the non-Euclidean domain, which results in a more robust graph embedding dictionary. We show that traditional subspace clustering models are the special forms of our framework with the Euclidean data. Basing on the framework, we further propose two novel subspace clustering models by using the Frobenius norm, namely Efficient GCSC (EGCSC) and Efficient Kernel GCSC (EKGCSC). Both models have a globally optimal closed-form solution, which makes them easier to implement, train, and apply in practice. Extensive experiments on three popular HSI datasets demonstrate that EGCSC and EKGCSC can achieve state-of-the-art clustering performance and dramatically outperforms many existing methods with significant margins.

preprint2020arXiv

Rigid-Soft Interactive Learning for Robust Grasping

Inspired by widely used soft fingers on grasping, we propose a method of rigid-soft interactive learning, aiming at reducing the time of data collection. In this paper, we classify the interaction categories into Rigid-Rigid, Rigid-Soft, Soft-Rigid according to the interaction surface between grippers and target objects. We find experimental evidence that the interaction types between grippers and target objects play an essential role in the learning methods. We use soft, stuffed toys for training, instead of everyday objects, to reduce the integration complexity and computational burden and exploit such rigid-soft interaction by changing the gripper fingers to the soft ones when dealing with rigid, daily-life items such as the Yale-CMU-Berkeley (YCB) objects. With a small data collection of 5K picking attempts in total, our results suggest that such Rigid-Soft and Soft-Rigid interactions are transferable. Moreover, the combination of different grasp types shows better performance on the grasping test. We achieve the best grasping performance at 97.5\% for easy YCB objects and 81.3\% for difficult YCB objects while using a precise grasp with a two-soft-finger gripper to collect training data and power grasp with a four-soft-finger gripper to test.

preprint2016arXiv

Connecting the Kontsevich-Witten and Hodge tau-functions by the $\hat{GL(\infty)}$ operators

In this paper, we present an explicit formula that connects the Kontsevich-Witten tau-function and the Hodge tau-function by differential operators belonging to the $\hat{GL(\infty)}$ group. Indeed, we show that the two tau-functions can be connected using Virasoro operators. This proves a conjecture posted by Alexandrov in [1].

preprint2015arXiv

Topological Recursion Relations on $\bar{\cal M}_{3,2}$

In this paper, we give some new genus-3 universal equations for Gromov-Witten invariants of compact symplectic manifolds. These equations were obtained by studying new relations in the tautological ring of the moduli space of 2-pointed genus-3 stable curves. A byproduct of our search for genus-3 equations is a new genus-2 universal equation for Gromov-Witten invariants.

preprint2014arXiv

Conditions for the vanishing of the genus-2 G-function

In this paper we give some sufficient conditions for the vanishing of the genus-2 G-function, which was introduced by B. Dubrovin, S. Liu and Y. Zhang in [DLZ]. As a corollary we prove their conjecture for the vanishing of the genus-2 G-function for ADE singularities.

preprint2014arXiv

Genus-2 G-function for $P^1$ orbifolds

In this paper we prove that for Gromov-Witten theory of $P^1$ orbifolds of ADE type the genus-2 G-function introduced by B. Dubrovin, S. Liu, and Y. Zhang vanishes. Together with our results in [LW], this completely solves the main conjecture in their paper [DLZ]. In the process, we also found a sufficient condition for the vanishing of the genus-2 G-function which is weaker than the condition given in our previous paper [LW].

preprint2011arXiv

Genus-1 Virasoro conjecture along quantum volume direction

In this paper, we show that the derivative of the genus-1 Virasoro conjecture for Gromov-Witten invariants along the direction of quantum volume element holds for all smooth projective varieties. This result provides new evidence for the Virasoro conjecture.

preprint2010arXiv

New topological recursion relations

Simple boundary expressions for the k-th power of the cotangent line class on the moduli space of stable 1-pointed genus g curves are found for k >= 2g. The method is by virtual localization on the moduli space of maps to the projective line. As a consequence, nontrivial tautological classes in the kernel of the push-forward map associated to the irreducible boundary divisor of the moduli space of stable g+1 curves are constructed. The geometry of genus g+1 curves then provides universal equations in genus g Gromov-Witten theory. As an application, we prove all the Gromov-Witten identities conjectured recently by K. Liu and H. Xu.

preprint2009arXiv

Quantum Teichmüller space and Kashaev algebra

Kashaev algebra associated to a surface is a noncommutative deformation of the algebra of rational functions of Kashaev coordinates. For two arbitrary complex numbers, there is a generalized Kashaev algebra. The relationship between the shear coordinates and Kashaev coordinates induces a natural relationship between the quantum Teichmüller space and the generalized Kashaev algebra.

preprint2007arXiv

Representations of the quantum Teichmuller space, and invariants of surface diffeomorphisms

We investigate the representation theory of the polynomial core of the quantum Teichmuller space of a punctured surface S. This is a purely algebraic object, closely related to the combinatorics of the simplicial complex of ideal cell decompositions of S. Our main result is that irreducible finite-dimensional representations of this polynomial core are classified, up to finitely many choices, by group homomorphisms from the fundamental group of the surface to the isometry group of the hyperbolic 3--space. We exploit this connection between algebra and hyperbolic geometry to exhibit new invariants of diffeomorphisms of S.

preprint1999arXiv

Homogeneity of infinite dimensional isoparametric submanifolds

A subset S of a Riemannian manifold N is called extrinsically homogeneous if S is an orbit of a subgroup of the isometry group of N. Thorbergsson proved the remarkable result that every complete, connected, full, irreducible isoparametric submanifold of a finite dimensional Euclidean space of rank at least 3 is extrinsically homogeneous. This result, combined with results of Palais-Terng and Dadok, finally classified irreducible isoparametric submanifolds of a finite dimensional Euclidean space of rank at least 3. While Thorbergsson's proof used Tits buildings, a simpler proof without using Tits buildings was given by Olmos. The main purpose of this paper is to extend Thorbergsson's result to the infinite dimensional case.

Xiaobo Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

18 published item(s)

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

Dynamic Cooperative Vehicle Platoon Control Considering Longitudinal and Lane-changing Dynamics

Q-Polynomial expansion for Brezin-Gross-Witten tau-function

Schur Q-Polynomials and Kontsevich-Witten Tau Function

Traffic Analytics Development Kits (TADK): Enable Real-Time AI Inference in Networking Apps

Weighted Ensemble-model and Network Analysis: A method to predict fluid intelligence via naturalistic functional connectivity

DeepClaw: A Robotic Hardware Benchmarking Platform for Learning Object Manipulation

Graph Convolutional Subspace Clustering: A Robust Subspace Clustering Framework for Hyperspectral Image

Rigid-Soft Interactive Learning for Robust Grasping

Connecting the Kontsevich-Witten and Hodge tau-functions by the $\hat{GL(\infty)}$ operators

Topological Recursion Relations on $\bar{\cal M}_{3,2}$

Conditions for the vanishing of the genus-2 G-function

Genus-2 G-function for $P^1$ orbifolds

Genus-1 Virasoro conjecture along quantum volume direction

New topological recursion relations

Quantum Teichmüller space and Kashaev algebra

Representations of the quantum Teichmuller space, and invariants of surface diffeomorphisms

Homogeneity of infinite dimensional isoparametric submanifolds