Researcher profile

Yilin Wu

Yilin Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Do What You Say: Steering Vision-Language-Action Models via Runtime Reasoning-Action Alignment Verification

Reasoning Vision Language Action (VLA) models improve robotic instruction-following by generating step-by-step textual plans before low-level actions, an approach inspired by Chain-of-Thought (CoT) reasoning in language models. Yet even with a correct textual plan, the generated actions can still miss the intended outcomes in the plan, especially in out-of-distribution (OOD) scenarios. We formalize this phenomenon as a lack of embodied CoT faithfulness, and introduce a training-free, runtime policy steering method for reasoning-action alignment. Given a reasoning VLA's intermediate textual plan, our framework samples multiple candidate action sequences from the same model, predicts their outcomes via simulation, and uses a pre-trained Vision-Language Model (VLM) to select the sequence whose outcome best aligns with the VLA's own textual plan. Only executing action sequences that align with the textual reasoning turns our base VLA's natural action diversity from a source of error into a strength, boosting robustness to semantic and visual OOD perturbations and enabling novel behavior composition without costly re-training. We also contribute a reasoning-annotated extension of LIBERO-100, environment variations tailored for OOD evaluation, and demonstrate up to 15% performance gain over prior work on behavior composition tasks and scales with compute and data diversity. Project Website at: https://yilin-wu98.github.io/steering-reasoning-vla/

preprint2026arXiv

Empowering Small Language Models with Factual Hallucination-Aware Reasoning for Financial Classification

Small language models (SLMs) are increasingly used for financial classification due to their fast inference and local deployability. However, compared with large language models, SLMs are more prone to factual hallucinations in reasoning and exhibit weaker classification performance. This raises a natural question: Can mitigating factual hallucinations improve SLMs' financial classification? To address this, we propose a three-step pipeline named AAAI (Association Identification, Automated Detection, and Adaptive Inference). Experiments on three representative SLMs reveal that: (1) factual hallucinations are positively correlated with misclassifications; (2) encoder-based verifiers effectively detect factual hallucinations; and (3) incorporating feedback on factual errors enables SLMs' adaptive inference that enhances classification performance. We hope this pipeline contributes to trustworthy and effective applications of SLMs in finance.

preprint2022arXiv

Geometrical control of interface patterning underlies active matter invasion

Interaction between active materials and the boundaries of geometrical confinement is key to many emergent phenomena in active systems. For living active matter consisting of animal cells or motile bacteria, the confinement boundary is often a deformable interface, and it has been unclear how activity-induced interface dynamics might lead to morphogenesis and pattern formation. Here we studied the evolution of bacterial active matter confined by a deformable boundary. We discovered that an ordered morphological pattern emerged at the interface characterized by periodically-spaced interfacial protrusions; behind the interfacial protrusions, bacterial swimmers self-organized into multicellular clusters displaying +1/2 nematic defects. Subsequently, a hierarchical sequence of transitions from interfacial protrusions to creeping branches allowed the bacterial active drop to rapidly invade surrounding space with a striking self-similar branch pattern. We found that this interface patterning is controlled by the local curvature of the interface, a phenomenon we denote as collective curvature sensing. Using a continuum active model, we revealed that the collective curvature sensing arises from enhanced active stresses near high-curvature regions, with the active length-scale setting the characteristic distance between the interfacial protrusions. Our findings reveal a protrusion-to-branch transition as a novel mode of active matter invasion and suggest a new strategy to engineer pattern formation of active materials.

preprint2020arXiv

Learning to Manipulate Deformable Objects without Demonstrations

In this paper we tackle the problem of deformable object manipulation through model-free visual reinforcement learning (RL). In order to circumvent the sample inefficiency of RL, we propose two key ideas that accelerate learning. First, we propose an iterative pick-place action space that encodes the conditional relationship between picking and placing on deformable objects. The explicit structural encoding enables faster learning under complex object dynamics. Second, instead of jointly learning both the pick and the place locations, we only explicitly learn the placing policy conditioned on random pick points. Then, by selecting the pick point that has Maximal Value under Placing (MVP), we obtain our picking policy. This provides us with an informed picking policy during testing, while using only random pick points during training. Experimentally, this learning framework obtains an order of magnitude faster learning compared to independent action-spaces on our suite of deformable object manipulation tasks with visual RGB observations. Finally, using domain randomization, we transfer our policies to a real PR2 robot for challenging cloth and rope coverage tasks, and demonstrate significant improvements over standard RL techniques on average coverage.

preprint2020arXiv

Viscoelastic control of spatiotemporal order in bacterial active matter

Active matter consists of units that generate mechanical work by consuming energy. Examples include living systems, such as assemblies of bacteria and biological tissues, biopolymers driven by molecular motors, and suspensions of synthetic self-propelled particles. A central question in the field is to understand and control the self-organization of active assemblies in space and time. Most active systems exhibit either spatial order mediated by interactions that coordinate the spatial structure and the motion of active agents or the temporal synchronization of individual oscillatory dynamics. The simultaneous control of spatial and temporal organization is more challenging and generally requires complex interactions, such as reaction-diffusion hierarchies or genetically engineered cellular circuits. Here, we report a novel and simple means to simultaneously control the spatial and temporal self-organization of bacterial active matter. By confining an active bacterial suspension and manipulating a single macroscopic parameter, namely the viscoelasticity of the suspending fluid, we have found that the bacterial fluid first self-organizes in space into a millimeter-scale rotating vortex; then displays temporal organization as the giant vortex switches its global chirality periodically with tunable frequency, reminiscent of a torsional pendulum - a self-driven one. Combining experiments with an active matter model, we explain this striking behavior in terms of the interplay between active forcing and viscoelastic stress relaxation. Our findings advance the understanding of bacterial behavior in complex fluids, and demonstrate experimentally for the first time that rheological properties can be harnessed to control active matter flows. Coupled with actuation, our tunable self-oscillating bacterial vortex may be used as a "clock" for locomotion of soft robots and microfluidic pumping.