Source author record

Jiayao Ma

Jiayao Ma appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.soft physics.app-ph Robotics

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

While Vision-Language-Action (VLA) models show strong generalizability in various tasks, real-world deployment of robotic policy still requires large-scale, high-quality human expert demonstrations. However, data collection via human teleoperation requires continuous operator attention, which is costly, hard to scale. To address this, we propose Genie Centurion (GCENT), a scalable and general data collection paradigm based on human rewind-and-refine guidance, enabling robots' interactive learning in deployment. GCENT starts at an imperfect policy and improves over time. When the robot execution failures occur, GCENT allows robots to revert to a previous state with a rewind mechanism, after which a teleoperator provides corrective demonstrations to refine the policy. This framework supports a one-human-to-many-robots supervision scheme with a Task Sentinel module, which autonomously predicts task success and solicits human intervention when necessary. Empirical results show that GCENT achieves up to 40% higher task success rates than state-of-the-art data collection methods, and reaches comparable performance using less than half the data in long-horizon and precise tasks. We also quantify the data yield-to-effort ratio under multi-robot scenarios, demonstrating GCENT's potential for scalable and cost-efficient robot policy training in real-world environments.

preprint2026arXiv

Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models

Vision-language models (VLMs) raise growing concerns about privacy, copyright, and bias, motivating machine unlearning to remove sensitive knowledge. However, existing methods primarily fine-tune the language decoder, leading to superficial forgetting that fails to erase underlying visual representations and often introduces object hallucination. We propose HFRU, a reinforcement unlearning framework that operates on the vision encoder for deep semantic removal. Our two-stage approach combines alignment disruption with GRPO-based optimization using a composite reward, including an abstraction reward that encourages semantically valid substitutions and mitigates hallucinations. Experiments on object recognition and face identity tasks show that HFRU achieves over 98% forgetting and retention performance, while introducing negligible object hallucination, significantly outperforming prior methods.Our code and implementation details are available at https://github.com/XMUDeepLIT/HFRU.

preprint2020arXiv

Rigid Foldability and Mountain-Valley Crease Assignments of Square-Twist Origami Pattern

Rigid foldability allows an origami pattern to fold about crease lines without twisting or stretching component panels. It enables folding of rigid materials, facilitating the design of foldable structures. Recent study shows that rigid foldability is affected by the mountain-valley crease (M-V) assignment of an origami pattern. In this paper, we investigate the rigid foldability of the square-twist origami pattern with diverse M-V assignments by a kinematic method based on the motion transmission path. Four types of square-twist origami patterns are analyzed, among which two are found rigidly foldable, while the other two are not. The explicit kinematic equations of the rigid cases are derived based on the kinematic equivalence between the rigid origami pattern and the closed-loop network of spherical 4R linkages. We also propose a crease-addition method to convert the rigid foldability of the non-rigid patterns. The motion compatibility conditions of the modified patterns are checked, which verify the rigid foldability of the modified patterns. The kinematic analysis reveals the bifurcation behaviour of the modified patterns. This work not only helps to deepen our understanding on the rigid foldability of origami patterns and its relationship with the M-V assignments, but also provides us an effective way to create more rigidly foldable origami patterns from non rigid ones.