Source author record

Yifan Wei

Yifan Wei appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computation and Language cond-mat.mtrl-sci Robotics

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Beyond Self-Play: Hierarchical Reasoning for Continuous Motion in Closed-Loop Traffic Simulation

Closed-loop traffic simulation requires agents that are both scalable and behaviorally realistic. Recent self-play reinforcement learning approaches demonstrate strong scalability, but their equilibrium strategies fail to capture the socially aware behaviors of real human drivers. We propose a hierarchical architecture that goes beyond self-play by combining high-level multi-agent interaction reasoning with low-level continuous trajectory realization. Specifically, a Stackelberg-style Multi-Agent Reinforcement Learning (MARL) module generates interaction-aware intention commands. These commands condition a low-level continuous motion module, translating the strategic intent into physically consistent, scene-responsive control sequences. To mitigate distribution shift in closed-loop deployment, we introduce a hybrid co-training scheme combining MARL with auxiliary recovery supervision. Experiments on a SUMO-based urban network demonstrate that the proposed framework achieves superior control smoothness and safety compared to self-play and passive imitation baselines, while maintaining competitive traffic efficiency.

preprint2026arXiv

Towards Compositional Generalization of LLMs via Skill Taxonomy Guided Data Synthesis

Large Language Models (LLMs) and agent-based systems often struggle with compositional generalization due to a data bottleneck in which complex skill combinations follow a long-tailed, power-law distribution, limiting both instruction-following performance and generalization in agent-centric tasks. To address this challenge, we propose STEPS, a Skill Taxonomy guided Entropy-based Post-training data Synthesis framework for generating compositionally challenging data. STEPS explicitly targets compositional generalization by uncovering latent relationships among skills and organizing them into an interpretable, hierarchical skill taxonomy using structural information theory. Building on this taxonomy, we formulate data synthesis as a constrained information maximization problem, selecting skill combinations that maximize marginal structural information within the hierarchy while preserving semantic coherence. Experiments on challenging instruction-following benchmarks show that STEPS outperforms existing data synthesis baselines, while also yielding improved compositional generalization in downstream agent-based evaluations.

preprint2020arXiv

Giant Polarization and Abnormal Flexural Deformation in Bent Freestanding Perovskite Oxides

Recent realizations of ultrathin freestanding perovskite oxides offer a unique platform to probe novel properties in two-dimensional oxides. Here, we observed a giant flexoelectric response in freestanding BiFeO3 and SrTiO3 in their bent state arising from strain gradients up to 4x10e7/m, suggesting a promising approach for realizing extremely large polarizations. Additionally, a substantial reversible change in thickness was discovered in bent freestanding BiFeO3, which implies an unusual bending-expansion/shrinkage and thickness-dependence Poisson's ratios in this ferroelectric membrane that has never been seen before in crystalline materials. Our theoretical modeling reveals that this unprecedented flexural deformation within the membrane is attributable to a flexoelectricity-piezoelectricity interplay. The finding unveils intriguing nanoscale electromechanical properties and provides guidance for their practical applications in flexible nanoelectromechanical systems.