Source author record

Xu Shen

Xu Shen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.NT, Artificial Intelligence, Computer Vision, eess.SY, Systems and Control, Machine Learning, math.AG, Robotics

Catalog footprint

What is connected

12 works

8 topics

4 close collaborators


preprint, 2026, arXiv

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction

Large Language Model-based multi-agent systems (MAS) excel at collaborative problem solving but remain brittle to cascading errors: a single faulty step can propagate across agents and disrupt the trajectory. In this paper, we present MASC, a metacognitive framework that endows MAS with real-time, unsupervised, step-level error detection and self-correction. MASC rethinks detection as history-conditioned anomaly scoring via two complementary designs: (1) Next-Execution Reconstruction, which predicts the embedding of the next step from the query and interaction history to capture causal consistency, and (2) Prototype-Guided Enhancement, which learns a prototype prior over normal-step embeddings and uses it to stabilize reconstruction and anomaly scoring under sparse context (e.g., early steps). When an anomalous step is flagged, MASC triggers a correction agent to revise the acting agent's output before information flows downstream. On the Who&When benchmark, MASC consistently outperforms all baselines, improving step-level error detection by up to 8.47% AUC-ROC. When plugged into diverse MAS frameworks, it delivers consistent end-to-end gains across architectures, confirming that our metacognitive monitoring and targeted correction can mitigate error propagation with minimal overhead.
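The idea of history-conditioned anomaly scoring with a prototype fallback can be illustrated with a minimal sketch. This is not the MASC implementation: the mean-of-history predictor stands in for a learned next-step model, and the cosine-based error, the `warmup` blending rule, the `threshold` value, and all function names are assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def anomaly_score(history, step, prototype, warmup=3):
    """Blend reconstruction error with prototype error; higher means more anomalous.

    ASSUMPTION: the "predicted" next-step embedding is just the mean of the
    history, a placeholder for a learned predictor. With little history the
    score leans on the prototype of normal steps instead.
    """
    proto_err = 1.0 - cosine(prototype, step)
    if not history:
        return proto_err
    predicted = mean_vector(history)
    recon_err = 1.0 - cosine(predicted, step)
    w = min(len(history) / warmup, 1.0)  # trust reconstruction more as history grows
    return w * recon_err + (1.0 - w) * proto_err


def flag_steps(embeddings, prototype, threshold=0.5):
    """Return indices of steps whose score exceeds the (hypothetical) threshold."""
    flagged, history = [], []
    for i, step in enumerate(embeddings):
        if anomaly_score(history, step, prototype) > threshold:
            flagged.append(i)  # a correction agent would revise this step here
        else:
            history.append(step)
    return flagged
```

In this sketch a flagged step is simply kept out of the history. In a full system the corrected output would presumably be embedded and appended instead.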

preprint, 2023, arXiv

ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and Transformer

The problem of multimodal intent and trajectory prediction for human-driven vehicles in parking lots is addressed in this paper. Using models designed with CNN and Transformer networks, we extract temporal-spatial and contextual information from trajectory history and local bird's eye view (BEV) semantic images, and generate predictions about intent distribution and future trajectory sequences. Our methods outperform existing models in accuracy, while allowing an arbitrary number of modes, encoding complex multi-agent scenarios, and adapting to different parking maps. To train and evaluate our method, we present the first public 4K video dataset of human driving in parking lots with accurate annotation, high frame rate, and rich traffic scenarios.

preprint, 2022, arXiv

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization

Cloth-changing person re-identification (CC-ReID) aims to match the same person across different locations over a long duration, e.g., over days, and therefore inevitably faces the challenge of changing clothing. In this paper, we focus on handling the CC-ReID problem under a more challenging setting, i.e., from just a single image, which enables high-efficiency, latency-free pedestrian identification for real-time surveillance applications. Specifically, we introduce gait recognition as an auxiliary task to drive the image ReID model to learn cloth-agnostic representations by leveraging personally unique, cloth-independent gait information; we name this framework GI-ReID. GI-ReID adopts a two-stream architecture that consists of an image ReID-Stream and an auxiliary gait recognition stream (Gait-Stream). The Gait-Stream, which is discarded at inference for computational efficiency, acts as a regulator that encourages the ReID-Stream to capture cloth-invariant biometric motion features during training. To obtain temporally continuous motion cues from a single image, we design a Gait Sequence Prediction (GSP) module for the Gait-Stream to enrich gait information. Finally, high-level semantic consistency between the two streams is enforced for effective knowledge regularization. Experiments on multiple image-based cloth-changing ReID benchmarks, e.g., LTCC, PRCC, Real28, and VC-Clothes, demonstrate that GI-ReID performs favorably against state-of-the-art methods. Code is available at https://github.com/jinx-USTC/GI-ReID.

preprint, 2022, arXiv

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification

Unsupervised person re-identification (U-ReID) with pseudo labeling, based on modern clustering algorithms, has recently reached performance competitive with fully supervised ReID methods. However, such clustering-based schemes become computationally prohibitive for large-scale datasets. How to efficiently leverage endless unlabeled data with limited computing resources for better U-ReID is under-explored. In this paper, we make the first attempt at large-scale U-ReID and propose a "small data for big task" paradigm dubbed Meta Clustering Learning (MCL). MCL pseudo-labels only a subset of the entire unlabeled data via clustering to save computation in the first-phase training. After that, the learned cluster centroids, termed meta-prototypes in MCL, are used as a proxy annotator to softly annotate the remaining unlabeled data for further polishing the model. To alleviate potentially noisy labels in the polishing phase, we enforce two well-designed loss constraints to promote intra-identity consistency and strong inter-identity correlation. On multiple widely used U-ReID benchmarks, our method significantly reduces computational cost while achieving comparable or even better performance than prior works.
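The two-phase idea (cluster a subset, then let the centroids softly label the rest) can be sketched as follows. This is an illustrative toy, not the paper's method: the pseudo-labels are assumed to come from some clustering run on the subset, and the distance-based softmax, the `temperature` parameter, and the function names are all hypothetical choices. The paper's loss constraints are not reproduced.

```python
import math
from collections import defaultdict


def centroids_from_pseudo_labels(features, labels):
    """Phase 1: average the subset's features per pseudo-label to get prototypes."""
    groups = defaultdict(list)
    for feature, label in zip(features, labels):
        groups[label].append(feature)
    return {
        label: [sum(col) / len(members) for col in zip(*members)]
        for label, members in groups.items()
    }


def soft_annotate(feature, prototypes, temperature=0.1):
    """Phase 2: soft label for an unlabeled feature.

    ASSUMPTION: the soft label is a softmax over negative squared Euclidean
    distances to each prototype, scaled by a hypothetical temperature.
    """
    ids = sorted(prototypes)
    logits = [
        -sum((a - b) ** 2 for a, b in zip(feature, prototypes[i])) / temperature
        for i in ids
    ]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return {i: e / total for i, e in zip(ids, exps)}
```

Only the subset passes through clustering. Every remaining sample costs one distance computation per prototype, which is where the computational saving described above would come from under these assumptions.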

preprint, 2020, arXiv

Autonomous Parking of Vehicle Fleet in Tight Environments

The problem of autonomous parking of vehicle fleets is addressed in this paper. We present a system-level modeling and control framework which allows investigating different vehicle parking strategies while taking into account path planning and collision avoidance. The proposed approach decouples the problem into a centralized parking spot allocation and path generation, and a decentralized collision avoidance control. This paper presents the hierarchical framework and algorithmic details. Extensive simulations are used to assess several allocation strategies in terms of total fleet parking time and queue length. In particular, we describe how Braess's paradox can be observed for parking vehicle fleets.

preprint, 2020, arXiv

ParkPredict: Motion and Intent Prediction of Vehicles in Parking Lots

We investigate the problem of predicting driver behavior in parking lots, an environment which is less structured than typical road networks and features complex, interactive maneuvers in a compact space. Using the CARLA simulator, we develop a parking lot environment and collect a dataset of human parking maneuvers. We then study the impact of model complexity and feature information by comparing a multi-modal Long Short-Term Memory (LSTM) prediction model and a Convolutional Neural Network LSTM (CNN-LSTM) to a physics-based Extended Kalman Filter (EKF) baseline. Our results show that 1) intent can be estimated well (roughly 85% top-1 accuracy and nearly 100% top-3 accuracy with the LSTM and CNN-LSTM models); 2) knowledge of the human driver's intended parking spot has a major impact on predicting the parking trajectory; and 3) the semantic representation of the environment improves long-term predictions.

preprint, 2016, arXiv

On the $l$-adic cohomology of some $p$-adically uniformized Shimura varieties

We determine the Galois representations inside the $l$-adic cohomology of some unitary Shimura varieties at split places where they admit uniformization by finite products of Drinfeld upper half spaces. Our main results confirm Langlands-Kottwitz's description of the cohomology of Shimura varieties in new cases.

preprint, 2016, arXiv

Perfectoid Shimura varieties of abelian type

We prove that Shimura varieties of abelian type with infinite level at $p$ are perfectoid. As a corollary, the moduli spaces of polarized K3 surfaces with infinite level at $p$ are also perfectoid.

preprint, 2015, arXiv

$p$-adic families of automorphic forms over some unitary Shimura varieties

We construct some $n$-dimensional eigenvarieties for finite slope overconvergent eigenforms over some unitary Shimura varieties with signature $(1,n-1)\times(0,n)\times\cdots\times(0,n)$ by adapting Andreatta-Iovita-Pilloni's method. We also show that there are some Galois pseudo-characters over our eigenvarieties by studying analytic continuation of finite slope eigenforms over these Shimura varieties.

preprint, 2014, arXiv

Cell decomposition of some unitary group Rapoport-Zink spaces

In this paper we study the $p$-adic analytic geometry of the basic unitary group Rapoport-Zink spaces $\mathcal{M}_K$ with signature $(1,n-1)$. Using the theory of Harder-Narasimhan filtration of finite flat groups developed by Fargues in \cite{F2}, \cite{F3}, and the Bruhat-Tits stratification of the reduced special fiber $\mathcal{M}_{\mathrm{red}}$ defined by Vollaard-Wedhorn in \cite{VW}, we find some relatively compact fundamental domain $\mathcal{D}_K$ in $\mathcal{M}_K$ for the action of $G(\mathbb{Q}_p)\times J_b(\mathbb{Q}_p)$, the product of the associated $p$-adic reductive groups, and prove that $\mathcal{M}_K$ admits a locally finite cell decomposition. By considering the action of regular elliptic elements on these cells, we establish a Lefschetz trace formula for these spaces by applying Mieda's main theorem in \cite{Mi2}.

preprint, 2013, arXiv

On the Hodge-Newton filtration for p-divisible groups with additional structures

We prove that, for a $p$-divisible group with additional structures over a complete valuation ring of rank one $O_K$ with mixed characteristic $(0,p)$, if the Newton polygon and the Hodge polygon of its special fiber possess a nontrivial contact point, which is a break point for the Newton polygon, then it admits a "Hodge-Newton filtration" over $O_K$. The proof is based on the theories of Harder-Narasimhan filtration of finite flat group schemes and admissible filtered isocrystals. We then apply this result to the study of a larger class of Rapoport-Zink spaces and Shimura varieties than those studied previously by Mantovan, and confirm some new cases of Harris's conjecture.

preprint, 2012, arXiv

On the Lefschetz trace formula for Lubin-Tate spaces

We reprove the Lefschetz trace formula for Lubin-Tate spaces, based on the locally finite cell decompositions of these spaces obtained by Fargues, and Mieda's theorem on the Lefschetz trace formula for certain open adic spaces (\cite{Mi1}, Theorem 3.13). This proof is rather different from those of Strauch in \cite{St} (Theorem 3.3.1) and of Mieda in \cite{Mi1} (Example 4.21), and can hopefully be generalized to some other Rapoport-Zink spaces as soon as suitable cell decompositions exist. For example, we proved a Lefschetz trace formula for some unitary Rapoport-Zink spaces in \cite{Sh} using similar ideas.