Researcher profile

Qixun Zhang

Qixun Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

Precision over Diversity: High-Precision Reward Generalizes to Robust Instruction Following

A central belief in scaling reinforcement learning with verifiable rewards for instruction following (IF) tasks is that, a diverse mixture of verifiable hard and unverifiable soft constraints is essential for generalizing to unseen instructions. In this work, we challenge this prevailing consensus through a systematic empirical investigation. Counter-intuitively, we find that models trained on hard-only constraints consistently outperform those trained on mixed datasets. Extensive experiments reveal that reward precision, rather than constraint diversity, is the primary driver of effective alignment. The LLM judge suffers from a low recall rate in detecting false response, which leads to severe reward hacking, thereby undermining the benefits of diversity. Furthermore, analysis of the attention mechanism reveals that high-precision rewards develop a transferable meta-skill for IF. Motivated by these insights, we propose a simple yet effective data-centric refinement strategy that prioritizes reward precision. Evaluated on five benchmarks, our approach outperforms competitive baselines by 13.4\% in performance while achieving a 58\% reduction in training time, maintaining strong generalization beyond instruction following. Our findings advocate for a paradigm shift: moving away from the indiscriminate pursuit of data diversity toward high-precision rewards.

preprint2022arXiv

Vehicular Connectivity on Complex Trajectories: Roadway-Geometry Aware ISAC Beam-tracking

In this paper, we propose sensing-assisted beamforming designs for vehicles on arbitrarily shaped roads by relying on integrated sensing and communication (ISAC) signalling.Specifically, we aim to address the limitations of conventional ISAC beam-tracking schemes that do not apply to complex road geometries. To improve the tracking accuracy and communication quality of service (QoS) in vehicle to infrastructure (V2I) networks, it is essential to model the complicated roadway geometry. To that end, we impose the curvilinear coordinate system (CCS) in an interacting multiple model extended Kalman filter (IMM-EKF) framework. By doing so, both the position and the motion of the vehicle on a complicated road can be explicitly modeled and precisely tracked attributing to the benefits from the CCS. Furthermore, an optimization problem is formulated to maximize the array gain through dynamically adjusting the array size and thereby controlling the beamwidth, which takes the performance loss caused by beam misalignment into account.Numerical simulations demonstrate that the roadway geometry-aware ISAC beamforming approach outperforms the communication-only based and ISAC kinematic-only based technique in the tracking performance. Moreover, the effectiveness of the dynamic beamwidth design is also verified by our numerical results.