Researcher profile

Jiayao Ma

Jiayao Ma contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

While Vision-Language-Action (VLA) models show strong generalizability in various tasks, real-world deployment of robotic policy still requires large-scale, high-quality human expert demonstrations. However, data collection via human teleoperation requires continuous operator attention, which is costly, hard to scale. To address this, we propose Genie Centurion (GCENT), a scalable and general data collection paradigm based on human rewind-and-refine guidance, enabling robots' interactive learning in deployment. GCENT starts at an imperfect policy and improves over time. When the robot execution failures occur, GCENT allows robots to revert to a previous state with a rewind mechanism, after which a teleoperator provides corrective demonstrations to refine the policy. This framework supports a one-human-to-many-robots supervision scheme with a Task Sentinel module, which autonomously predicts task success and solicits human intervention when necessary. Empirical results show that GCENT achieves up to 40% higher task success rates than state-of-the-art data collection methods, and reaches comparable performance using less than half the data in long-horizon and precise tasks. We also quantify the data yield-to-effort ratio under multi-robot scenarios, demonstrating GCENT's potential for scalable and cost-efficient robot policy training in real-world environments.

preprint2026arXiv

Object Hallucination-Free Reinforcement Unlearning for Vision-Language Models

Vision-language models (VLMs) raise growing concerns about privacy, copyright, and bias, motivating machine unlearning to remove sensitive knowledge. However, existing methods primarily fine-tune the language decoder, leading to superficial forgetting that fails to erase underlying visual representations and often introduces object hallucination. We propose HFRU, a reinforcement unlearning framework that operates on the vision encoder for deep semantic removal. Our two-stage approach combines alignment disruption with GRPO-based optimization using a composite reward, including an abstraction reward that encourages semantically valid substitutions and mitigates hallucinations. Experiments on object recognition and face identity tasks show that HFRU achieves over 98% forgetting and retention performance, while introducing negligible object hallucination, significantly outperforming prior methods.Our code and implementation details are available at https://github.com/XMUDeepLIT/HFRU.

preprint2020arXiv

Rigid Foldability and Mountain-Valley Crease Assignments of Square-Twist Origami Pattern

Rigid foldability allows an origami pattern to fold about crease lines without twisting or stretching component panels. It enables folding of rigid materials, facilitating the design of foldable structures. Recent study shows that rigid foldability is affected by the mountain-valley crease (M-V) assignment of an origami pattern. In this paper, we investigate the rigid foldability of the square-twist origami pattern with diverse M-V assignments by a kinematic method based on the motion transmission path. Four types of square-twist origami patterns are analyzed, among which two are found rigidly foldable, while the other two are not. The explicit kinematic equations of the rigid cases are derived based on the kinematic equivalence between the rigid origami pattern and the closed-loop network of spherical 4R linkages. We also propose a crease-addition method to convert the rigid foldability of the non-rigid patterns. The motion compatibility conditions of the modified patterns are checked, which verify the rigid foldability of the modified patterns. The kinematic analysis reveals the bifurcation behaviour of the modified patterns. This work not only helps to deepen our understanding on the rigid foldability of origami patterns and its relationship with the M-V assignments, but also provides us an effective way to create more rigidly foldable origami patterns from non rigid ones.