Researcher profile

Yuan Zhuang

Yuan Zhuang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

SEVO: Semantic-Enhanced Virtual Observation for Robust VLA Manipulation via Active Illumination and Data-Centric Collection

Vision-Language-Action (VLA) and imitation-learning policies trained via community toolchains on low-cost hardware frequently fail when deployed outside the training environment. Existing evaluations, including the original ACT and SmolVLA benchmarks, demonstrate high success rates under controlled, fixed backgrounds, yet community practitioners report near-zero transfer to new environments. We present SEVO (Semantic-Enhanced Virtual Observation), a data-centric approach that improves cross-environment manipulation robustness without modifying the policy architecture. SEVO transforms the raw RGB camera stream through three mechanisms: (1) body-fixed cameras whose combined fields of view cover the full manipulation workspace, (2) active red-spectrum illumination that physically normalizes object appearance, and (3) real-time YOLO segmentation overlay that provides a background-invariant semantic cue. Critically, we show that a diversified data collection protocol (systematically varying lighting, backgrounds, and distractors during teleoperation) is the single most important factor for generalization. We target transparent water bottles, objects that visually blend with their surroundings, and select a simple pick-and-place task to enable hundreds of controlled real-robot trials across two mobile platforms. The full pipeline achieves 95% grasp success with ACT and 83% with SmolVLA in the training environment, transferring to novel environments at 85% and 75%. Without SEVO, the same policies achieve only 75%/70% in training and collapse to 30-35% in novel environments. Our results demonstrate that principled observation design and environmental diversity during data collection, not model scaling, enable low-cost robots to operate reliably in everyday household environments.

preprint2022arXiv

Observability Analysis and Keyframe-Based Filtering for Visual Inertial Odometry with Full Self-Calibration

Camera-IMU (Inertial Measurement Unit) sensor fusion has been extensively studied in recent decades. Numerous observability analysis and fusion schemes for motion estimation with self-calibration have been presented. However, it has been uncertain whether both camera and IMU intrinsic parameters are observable under general motion. To answer this question, by using the Lie derivatives, we first prove that for a rolling shutter (RS) camera-IMU system, all intrinsic and extrinsic parameters, camera time offset, and readout time of the RS camera, are observable with an unknown landmark. To our knowledge, we are the first to present such a proof. Next, to validate this analysis and to solve the drift issue of a structureless filter during standstills, we develop a Keyframe-based Sliding Window Filter (KSWF) for odometry and self-calibration, which works with a monocular RS camera or stereo RS cameras. Though the keyframe concept is widely used in vision-based sensor fusion, to our knowledge, KSWF is the first of its kind to support self-calibration. Our simulation and real data tests have validated that it is possible to fully calibrate the camera-IMU system using observations of opportunistic landmarks under diverse motion. Real data tests confirmed previous allusions that keeping landmarks in the state vector can remedy the drift in standstill, and showed that the keyframe-based scheme is an alternative solution.

preprint2021arXiv

A Versatile Keyframe-Based Structureless Filter for Visual Inertial Odometry

Motion estimation by fusing data from at least a camera and an Inertial Measurement Unit (IMU) enables many applications in robotics. However, among the multitude of Visual Inertial Odometry (VIO) methods, few efficiently estimate device motion with consistent covariance, and calibrate sensor parameters online for handling data from consumer sensors. This paper addresses the gap with a Keyframe-based Structureless Filter (KSF). For efficiency, landmarks are not included in the filter's state vector. For robustness, KSF associates feature observations and manages state variables using the concept of keyframes. For flexibility, KSF supports anytime calibration of IMU systematic errors, as well as extrinsic, intrinsic, and temporal parameters of each camera. Estimator consistency and observability of sensor parameters were analyzed by simulation. Sensitivity to design options, e.g., feature matching method and camera count was studied with the EuRoC benchmark. Sensor parameter estimation was evaluated on raw TUM VI sequences and smartphone data. Moreover, pose estimation accuracy was evaluated on EuRoC and TUM VI sequences versus recent VIO methods. These tests confirm that KSF reliably calibrates sensor parameters when the data contain adequate motion, and consistently estimate motion with accuracy rivaling recent VIO methods. Our implementation runs at 42 Hz with stereo camera images on a consumer laptop.

preprint2020arXiv

Deep Reinforcement Learning (DRL): Another Perspective for Unsupervised Wireless Localization

Location is key to spatialize internet-of-things (IoT) data. However, it is challenging to use low-cost IoT devices for robust unsupervised localization (i.e., localization without training data that have known location labels). Thus, this paper proposes a deep reinforcement learning (DRL) based unsupervised wireless-localization method. The main contributions are as follows. (1) This paper proposes an approach to model a continuous wireless-localization process as a Markov decision process (MDP) and process it within a DRL framework. (2) To alleviate the challenge of obtaining rewards when using unlabeled data (e.g., daily-life crowdsourced data), this paper presents a reward-setting mechanism, which extracts robust landmark data from unlabeled wireless received signal strengths (RSS). (3) To ease requirements for model re-training when using DRL for localization, this paper uses RSS measurements together with agent location to construct DRL inputs. The proposed method was tested by using field testing data from multiple Bluetooth 5 smart ear tags in a pasture. Meanwhile, the experimental verification process reflected the advantages and challenges for using DRL in wireless localization.

preprint2020arXiv

Evolutionary Architecture Search for Graph Neural Networks

Automated machine learning (AutoML) has seen a resurgence in interest with the boom of deep learning over the past decade. In particular, Neural Architecture Search (NAS) has seen significant attention throughout the AutoML research community, and has pushed forward the state-of-the-art in a number of neural models to address grid-like data such as texts and images. However, very litter work has been done about Graph Neural Networks (GNN) learning on unstructured network data. Given the huge number of choices and combinations of components such as aggregator and activation function, determining the suitable GNN structure for a specific problem normally necessitates tremendous expert knowledge and laborious trails. In addition, the slight variation of hyper parameters such as learning rate and dropout rate could dramatically hurt the learning capacity of GNN. In this paper, we propose a novel AutoML framework through the evolution of individual models in a large GNN architecture space involving both neural structures and learning parameters. Instead of optimizing only the model structures with fixed parameter settings as existing work, an alternating evolution process is performed between GNN structures and learning parameters to dynamically find the best fit of each other. To the best of our knowledge, this is the first work to introduce and evaluate evolutionary architecture search for GNN models. Experiments and validations demonstrate that evolutionary NAS is capable of matching existing state-of-the-art reinforcement learning approaches for both the semi-supervised transductive and inductive node representation learning and classification.

preprint2020arXiv

Inertial Sensing Meets Artificial Intelligence: Opportunity or Challenge?

The inertial navigation system (INS) has been widely used to provide self-contained and continuous motion estimation in intelligent transportation systems. Recently, the emergence of chip-level inertial sensors has expanded the relevant applications from positioning, navigation, and mobile mapping to location-based services, unmanned systems, and transportation big data. Meanwhile, benefit from the emergence of big data and the improvement of algorithms and computing power, artificial intelligence (AI) has become a consensus tool that has been successfully applied in various fields. This article reviews the research on using AI technology to enhance inertial sensing from various aspects, including sensor design and selection, calibration and error modeling, navigation and motion-sensing algorithms, multi-sensor information fusion, system evaluation, and practical application. Based on the over 30 representative articles selected from the nearly 300 related publications, this article summarizes the state of the art, advantages, and challenges on each aspect. Finally, it summarizes nine advantages and nine challenges of AI-enhanced inertial sensing and then points out future research directions.

preprint2020arXiv

Towards Robust Crowdsourcing-Based Localization: A Fingerprinting Accuracy Indicator Enhanced Wireless/Magnetic/Inertial Integration Approach

The next-generation internet of things (IoT) systems have an increasingly demand on intelligent localization which can scale with big data without human perception. Thus, traditional localization solutions without accuracy metric will greatly limit vast applications. Crowdsourcing-based localization has been proven to be effective for mass-market location-based IoT applications. This paper proposes an enhanced crowdsourcing-based localization method by integrating inertial, wireless, and magnetic sensors. Both wireless and magnetic fingerprinting accuracy are predicted in real time through the introduction of fingerprinting accuracy indicators (FAI) from three levels (i.e., signal, geometry, and database). The advantages and limitations of these FAI factors and their performances on predicting location errors and outliers are investigated. Furthermore, the FAI-enhanced extended Kalman filter (EKF) is proposed, which improved the dead-reckoning (DR)/WiFi, DR/Magnetic, and DR/WiFi/Magnetic integrated localization accuracy by 30.2 %, 19.4 %, and 29.0 %, and reduced the maximum location errors by 41.2 %, 28.4 %, and 44.2 %, respectively. These outcomes confirm the effectiveness of the FAI-enhanced EKF on improving both accuracy and reliability of multi-sensor integrated localization using crowdsourced data.

preprint2020arXiv

Two-Dimensional Rare Earth -- Gold Intermetallic Compounds on Au(111) by Surface Alloying

Surface alloying is a straightforward route to control and modify the structure and electronic properties of surfaces. Here, We present a systematical study on the structural and electronic properties of three novel rare earth-based intermetallic compounds, namely ReAu2 (Re = Tb, Ho, and Er), on Au(111) via directly depositing rare-earth metals onto the hot Au(111) surface. Scanning tunneling microscopy/spectroscopy measurements reveal the very similar atomic structures and electronic properties, e.g. electronic states, and surface work functions, for all these intermetallic compound systems due to the physical and chemical similarities between these rare earth elements. Further, these electronic properties are periodically modulated by the moiré structures caused by the lattice mismatches between ReAu2 and Au(111). These periodically modulated surfaces could serve as templates for the self-assembly of nanostructures. Besides, these two-dimensional rare earth-based intermetallic compounds provide platforms to investigate the rare earth related catalysis, magnetisms, etc., in the lower dimensions.

preprint2017arXiv

Phase diagram and aggregation dynamics of a monolayer of paramagnetic colloids

We have developed a tunable colloidal system and a corresponding simulation model for studying the phase behavior of particles assembling under the influence of long-range magnetic interactions. A monolayer of paramagnetic particles is subjected to a spatially uniform magnetic field with a static perpendicular component and rapidly rotating in-plane component. The sign and strength of the interactions vary with the tilt angle $θ$ of the rotating magnetic field. For a purely in-plane field, $θ=90^{\circ}$, interactions are attractive and the experimental results agree well with both equilibrium and out-of-equilibrium predictions based on a two-body interaction model. For tilt angles $50^{\circ}\lesssim θ\lesssim 55^{\circ}$, the two-body interaction gives a short-range attractive and long-range repulsive (SALR) interaction, which predicts the formation of equilibrium microphases. In experiments, however, a different type of assembly is observed. Inclusion of three-body (and higher-order) terms in the model does not resolve the discrepancy. We thus further characterize the anomalous behavior by measuring the time-dependent cluster size distribution.

preprint2016arXiv

Recent Advances in the Theory and Simulation of Model Colloidal Microphase Formers

This mini-review synthesizes our understanding of the equilibrium behavior of particle models with short-range attractive and long-range repulsive (SALR) interactions. These models, which can form stable periodic microphases, aim to reproduce the essence of colloidal suspensions with competing interactions. Ordered structures, however, have yet to be obtained in experiments. In order to better understand the hurdles to periodic microphase assembly, marked theoretical and simulation advances have been made over the last few years. Here, we present recent progress in the study of microphases in models with SALR interactions using liquid-state theory and density-functional theory as well as numerical simulations. Combining these various approaches provides a description of periodic microphases, and give insights into the rich phenomenology of the surrounding disordered regime. Three additional ongoing research directions in the thermodynamics of models with SALR interactions are also presented.