Source author record

Qixun Zhang

Qixun Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Artificial Intelligence, Signal Processing (eess.SP), Machine Learning, Networking and Internet Architecture

Catalog footprint

What is connected

3 works

4 topics

4 close collaborators

preprint · 2026 · arXiv

Precision over Diversity: High-Precision Reward Generalizes to Robust Instruction Following

A central belief in scaling reinforcement learning with verifiable rewards for instruction following (IF) tasks is that a diverse mixture of verifiable hard constraints and unverifiable soft constraints is essential for generalizing to unseen instructions. In this work, we challenge this prevailing consensus through a systematic empirical investigation. Counter-intuitively, we find that models trained on hard-only constraints consistently outperform those trained on mixed datasets. Extensive experiments reveal that reward precision, rather than constraint diversity, is the primary driver of effective alignment. The LLM judge suffers from a low recall rate in detecting false responses, which leads to severe reward hacking and thereby undermines the benefits of diversity. Furthermore, analysis of the attention mechanism reveals that high-precision rewards develop a transferable meta-skill for IF. Motivated by these insights, we propose a simple yet effective data-centric refinement strategy that prioritizes reward precision. Evaluated on five benchmarks, our approach outperforms competitive baselines by 13.4% in performance while achieving a 58% reduction in training time and maintaining strong generalization beyond instruction following. Our findings advocate for a paradigm shift: moving away from the indiscriminate pursuit of data diversity toward high-precision rewards.

preprint · 2022 · arXiv

Vehicular Connectivity on Complex Trajectories: Roadway-Geometry Aware ISAC Beam-tracking

In this paper, we propose sensing-assisted beamforming designs for vehicles on arbitrarily shaped roads by relying on integrated sensing and communication (ISAC) signalling. Specifically, we aim to address the limitations of conventional ISAC beam-tracking schemes, which do not apply to complex road geometries. To improve the tracking accuracy and communication quality of service (QoS) in vehicle-to-infrastructure (V2I) networks, it is essential to model the complicated roadway geometry. To that end, we adopt the curvilinear coordinate system (CCS) within an interacting multiple model extended Kalman filter (IMM-EKF) framework. By doing so, both the position and the motion of the vehicle on a complicated road can be explicitly modeled and precisely tracked, owing to the benefits of the CCS. Furthermore, an optimization problem that takes into account the performance loss caused by beam misalignment is formulated to maximize the array gain by dynamically adjusting the array size and thereby controlling the beamwidth. Numerical simulations demonstrate that the roadway-geometry-aware ISAC beamforming approach outperforms the communication-only and ISAC kinematic-only techniques in tracking performance. Moreover, the effectiveness of the dynamic beamwidth design is also verified by our numerical results.

preprint · 2013 · arXiv

On the Construction of Radio Environment Maps for Cognitive Radio Networks

The Radio Environment Map (REM) provides an effective approach to Dynamic Spectrum Access (DSA) in Cognitive Radio Networks (CRNs). Previous results on REM construction show that there exists a tradeoff between the number of measurements (sensors) and REM accuracy. In this paper, we analyze this tradeoff and determine that the REM error is a decreasing and convex function of the number of measurements (sensors). The concept of geographic entropy is introduced to quantify this relationship, and the influence of sensor deployment on REM accuracy is examined using information theory techniques. The results obtained in this paper are applicable not only to the REM but also to wireless sensor network deployment.