Researcher profile

Zhihao Xu

Zhihao Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Think-with-Rubrics: From External Evaluator to Internal Reasoning Guidance

Rubrics have been extensively utilized for evaluating unverifiable, open-ended tasks, with recent research incorporating them into reward systems for reinforcement learning. However, existing frameworks typically treat rubrics only as external evaluator disjointed from the policy's primary reasoning trace. Such design confines rubrics to post-hoc measurement, leaving them unable to actively guide the model's generation process. In this work, we introduce Think-with-Rubrics, a novel paradigm for instruction following tasks. Think-with-Rubrics integrates rubric generation into the reasoning context, transforming the rubric from an independent artifact into an internal guidance of LLM's generation. During training, LLM sequentially generates a rubric followed by a response, while a trained rubric verifier provides joint supervision by evaluating the consistency between the answer and the self-generated / golden rubrics. Experiments across multiple benchmarks demonstrate that Think-with-Rubrics consistently outperforms the Rubric-as-Reward baseline supervised by golden rubrics by an average of 3.87 points. We have also discussed the mechanism by which Think-with-Rubrics enhances model performance. Experimental results demonstrate that supervision from golden rubrics and self-generated rubrics enhances the performance of Think-with-Rubrics by improving the quality of self-generated rubrics and increasing the internal consistency of responses respectively.

preprint2026arXiv

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Enabling Large Language Models (LLMs) to effectively utilize tools in multi-turn interactions is essential for building capable autonomous agents. However, acquiring diverse and realistic multi-turn tool-use data remains a significant challenge. In this work, we propose a novel text-based paradigm. We observe that textual corpora naturally contain rich, multi-step problem-solving experiences, which can serve as an untapped, scalable, and authentic data source for multi-turn tool-use tasks. Based on this insight, we introduce GEM, a data synthesis pipeline that enables the generation and extraction of multi-turn tool-use trajectories from text corpora through a four-stage process: relevance filtering, workflow & tool extraction, trajectory grounding, and complexity refinement. To reduce the computational cost, we further train a specialized Trajectory Synthesizer via supervised fine-tuning. This model distills the complex generation pipeline into an efficient, end-to-end trajectory generator. Experiments demonstrate that our GEM-32B achieve a 16.5% improvement on the BFCL V3 Multi-turn benchmark. Our models partially surpass the performance of models trained on τ - bench (Airline and Retail) in-domain data, highlighting the superior generalization capability derived from our text-based synthesis paradigm. Notably, our Trajectory Synthesizer matches the quality of the full pipeline while significantly reducing inference latency and costs.

preprint2022arXiv

Characterizations for the existence of traces of first-order Sobolev spaces on hyperbolic fillings

In this paper, we study the existence of traces for Sobolev spaces on the hyperbolic filling $X$ of a compact metric space $Z$ equipped with a doubling measure. Given a suitable metric on $X$, we can regard $Z$ as the boundary of $X$. After equipping $X$ with a weighted measure $μ$ via the measure on $Z$ and the Euclidean arc length, we give characterizations for the existence of traces for first-order Sobolev spaces.

preprint2022arXiv

Exact Mobility edges and topological Anderson insulating phase in a slowly varying quasiperiodic model

We uncover the relationship of topology and disorder in a one-dimensional Su-Schrieffer-Heeger chain subjected to a slowly varying quasi-periodic modulation. By numerically calculating the disorder-averaged winding number and analytically studying the localization length of the zero modes, we obtain the topological phase diagram, which implies that the topological Anderson insulator (TAI) can be induced by a slowly varying quasi-periodic modulation. Moreover, unlike the localization properties in the TAI phase caused by random disorder, mobility edges can enter into the TAI region identified by the fractal dimension, the inverse participation ratio, and the spatial distributions of the wave functions, the boundaries of which coincide with our analytical results.

preprint2020arXiv

Dynamical observation of mobility edges in one-dimensional incommensurate optical lattices

We investigate the wave packet dynamics for a one-dimensional incommensurate optical lattice with a special on-site potential which exhibits the mobility edge in a compactly analytic form. We calculate the density propagation, long-time survival probability and mean square displacement of the wave packet in the regime with the mobility edge and compare with the cases in extended, localized and multifractal regimes. Our numerical results indicate that the dynamics in the mobility-edge regime mix both extended and localized features which is quite different from that in the mulitfractal phase. We utilize the Loschmidt echo dynamics by choosing different eigenstates as initial states and sudden changing the parameters of the system to distinguish the phases in the presence of such system.

preprint2020arXiv

Fate of zero modes in a finite Su-Schrieffer-Heeger Model with $\mathcal{PT}$ Symmetry

Due to the boundary coupling in a finite system, the zero modes of a standard Su-Schrieffer-Heeger (SSH) model may deviate from exact-zero energy. A recent experiment has shown that by increasing the system size or altering gain or loss strength of the SSH model with parity-time ($\mathcal{PT}$) symmetry, the real parts of the energies of the edge modes can be recovered to exact-zero value [Song \emph{et al.} Phys. Rev. Lett. \textbf{123}, 165701 (2019)]. To clarify the effects of $\mathcal{PT}$-symmetric potentials on the recovery of the nontrivial zero modes, we study the SSH model with $\mathcal{PT}$-symmetric potentials of different forms in both infinite and finite systems. Our results indicate that the energies of the edge modes in the infinite size case decide whether or not the success of the recovery of the zero modes by tuning the strength of $\mathcal{PT}$-symmetric potential in a finite system. If the energies of the edge modes amount to zero in the thermodynamic limit under an open boundary condition (OBC), the recovery of the zero modes will break down by increasing the gain or loss strength for a finite system. Our results can be easily examined in different experimental platforms and inspire more insightful understanding on nontrivial edge modes in topologically non-Hermitian systems.

preprint2020arXiv

Topological Bose-Mott insulators in one-dimensional non-Hermitian superlattices

We study the topological properties of Bose-Mott insulators in one-dimensional non-Hermitian superlattices, which may serve as effective Hamiltonians for cold atomic optical systems with either two-body loss or one-body loss. We find that in the strongly repulsive limit, the Mott insulator states of the Bose-Hubbard model with a finite two-body loss under integer fillings are topological insulators characterized by a finite charge gap, nonzero integer Chern numbers, and nontrivial edge modes in a low-energy excitation spectrum under an open boundary condition. The two-body loss suppressed by the strong repulsion results in a stable topological Bose-Mott insulator which has shares features similar to the Hermitian case. However, for the non-Hermitian model related to the one-body loss, we find the non-Hermitian topological Mott insulators are unstable with a finite imaginary excitation gap. Finally, we also discuss the stability of the Mott phase in the presence of two-body loss by solving the Lindblad master equation.