Researcher profile

Jingzhi Liu

Jingzhi Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

Vision-Language-Action (VLA) models remain brittle in long-horizon, contact-rich manipulation because success-only imitation provides little supervision for execution drift, while failed rollouts are often discarded. We introduce RePO-VLA, a recovery-driven policy optimization framework that assigns distinct roles to success, recovery, and failure trajectories. RePO-VLA first applies Recovery-Aware Initialization (RAI), slicing recovery segments and resetting history so corrective actions depend on the current adverse state rather than the preceding failure. It then learns a Progress-Aware Semantic Value Function (PAS-VF), aligning spatiotemporal trajectory features with instructions and successful references. The resulting labels salvage useful failure prefixes via reliability decay, while low-value labels mark drift and terminal breakdowns, teaching differences among nominal, failed, and corrective actions. The data engine turns adverse states into planner-generated or human-collected corrective rollouts, teaching recovery to the success manifold. Value-Conditioned Refinement (VCR) trains the policy to prefer high-progress actions. At deployment, a fixed high value ($v=1.0$) biases actions toward the learned success manifold without online failure detectors or heuristic retries. We introduce FRBench, with standardized error injection and recovery-focused evaluation. Across simulated and real-world bimanual tasks, RePO-VLA improves robustness, raising adversarial success from 20% to 75% on average and up to 80% in scaled real-world trials.

preprint2022arXiv

Investigation of variable temperature Mössbauer spectrum of YFe$_{0.5}$Cr$_{0.5}$O$_3$ perovskite

In this paper, we reported the preparation of YFe$_{0.5}$Cr$_{0.5}$O$_3$ by the sol-gel method and studied its structure and Mössbauer spectrum at variable temperatures. X-ray diffraction(XRD) analysis exhibits that the sample has the orthorhombic structure with the Pnma space group, and the energy dispersive spectroscopy (EDS) analysis shows that the sample has Fe/Cr = 1:1, indicating that the sample is Fe half-doped YCrO$_3$. The hyperfine parameters of the Mössbauer spectrum at room temperature confirm that the characteristics of 57Fe in the sample were trivalent hexacoordinated high-spin(s=5/2), and the coexistence of doublet and the sextets at 250K indicate that the sample has superparamagnetic relaxation. The Mössbauer spectrum records the magnetic phase transition in the temperature range of 250K-300K.