Source author record

Fang Hu

Fang Hu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Computer Vision cond-mat.mtrl-sci Robotics

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Knowledge-to-Verification: Exploring RLVR for LLMs in Knowledge-Intensive Domains

Reinforcement learning with verifiable rewards (RLVR) has demonstrated promising potential to enhance the reasoning capabilities of large language models (LLMs) in domains such as mathematics and coding. However, its applications on knowledge-intensive domains have not been effectively explored due to the scarcity of high-quality verifiable data. Furthermore, current RLVR focuses solely on the correctness of final answers, leading to the limitations of flawed reasoning and sparse reward signals. In this work, we propose Knowledge-to-Verification (K2V), a framework that extends RLVR to knowledge-intensive domains through automated verifiable data synthesis, while enabling verification of the LLM's reasoning process. Extensive experiments demonstrate that K2V enhances the reasoning of LLM in knowledge-intensive domains without significantly compromising the model's general capabilities. This study also suggests that integrating automated data synthesis with reasoning verification is a promising direction to enhance model capabilities in these broader domains. Code is available at https://github.com/SeedScientist/K2V.

preprint2020arXiv

Transferable Active Grasping and Real Embodied Dataset

Grasping in cluttered scenes is challenging for robot vision systems, as detection accuracy can be hindered by partial occlusion of objects. We adopt a reinforcement learning (RL) framework and 3D vision architectures to search for feasible viewpoints for grasping by the use of hand-mounted RGB-D cameras. To overcome the disadvantages of photo-realistic environment simulation, we propose a large-scale dataset called Real Embodied Dataset (RED), which includes full-viewpoint real samples on the upper hemisphere with amodal annotation and enables a simulator that has real visual feedback. Based on this dataset, a practical 3-stage transferable active grasping pipeline is developed, that is adaptive to unseen clutter scenes. In our pipeline, we propose a novel mask-guided reward to overcome the sparse reward issue in grasping and ensure category-irrelevant behavior. The grasping pipeline and its possible variants are evaluated with extensive experiments both in simulation and on a real-world UR-5 robotic arm.

preprint2016arXiv

Strongly Modulated Ambipolar Characteristics of Few-layer Black Phosphorus in Oxygen

Two-dimensional black phosphorus has been configured as field-effect transistors, showing an intrinsic symmetric ambipolar transport characteristic. Here, we demonstrate the strongly modulated ambipolar characteristics of few-layer black phosphorus in oxygen. Pure oxygen exposure can dramatically decrease the electron mobility of black phosphorus without degrading the hole transport. The transport characteristics can be nearly recovered upon annealing in Argon. This reveals that oxygen molecules are physisorbed on black phosphorus. In contrast, oxygen exposure upon light illumination exhibits a significant attenuation for both electron and hole transport, originating from the photoactivated oxidation of black phosphorus, which is corroborated by in situ X-ray photoelectron spectroscopy characterization. Our findings clarify the predominant role of oxygen in modulating ambipolar characteristics of black phosphorus, thereby providing deeper insight to the design of black phosphorus based complementary electronics.