Researcher profile

Guixian Xu

Guixian Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

Reinforcement Learning with Semantic Rewards Enables Low-Resource Language Expansion without Alignment Tax

Extending large language models (LLMs) to low-resource languages often incurs an "alignment tax": improvements in the target language come at the cost of catastrophic forgetting in general capabilities. We argue that this trade-off arises from the rigidity of supervised fine-tuning (SFT), which enforces token-level surface imitation on narrow and biased data distributions. To address this limitation, we propose a semantic-space alignment paradigm powered by Group Relative Policy Optimization (GRPO), where the model is optimized using embedding-level semantic rewards rather than likelihood maximization. This objective encourages meaning preservation through flexible realizations, enabling controlled updates that reduce destructive interference with pretrained knowledge. We evaluate our approach on Tibetan-Chinese machine translation and Tibetan headline generation. Experiments show that our method acquires low-resource capabilities while markedly mitigating alignment tax, preserving general competence more effectively than SFT. Despite producing less rigid surface overlap, semantic RL yields higher semantic quality and preference in open-ended generation, and few-shot transfer results indicate that it learns more transferable and robust representations under limited supervision. Overall, our study demonstrates that reinforcement learning with semantic rewards provides a safer and more reliable pathway for inclusive low-resource language expansion.

preprint2020arXiv

Beam-space Multiplexing: Practice, Theory, and Trends-From 4G TD-LTE, 5G, to 6G and Beyond

In this article, the new term, namely beam-space multiplexing, is proposed for the former multi-layer beamforming for 4G TD-LTE in 3GPP releases. We provide a systematic overview of beam-space multiplexing from engineering and theoretical perspectives. Firstly, we clarify the fundamental theory of beam-space multiplexing. Specifically, we provide a comprehensive comparison with the antenna-space multiplexing in terms of theoretical analysis, channel state information acquisition, and engineering implementation constraints. Then, we summarize the key technologies and 3GPP standardization of beam-space multiplexing in 4G TD-LTE and 5G new radio (NR) in terms of multi-layer beamforming and massive beamforming, respectively. We also provide system-level performance evaluation of beam-space multiplexing schemes and field results from current commercial TD-LTE networks and field trial of 5G. The practical deployments of 4G TD-LTE and 5G cellular networks demonstrate the superiority of beam-space multiplexing within the limitations of implementation complexity and practical deployment scenarios. Finally, the future trends of beam-space multiplexing in 6G and beyond are discussed, including massive beamforming for extremely large-scale MIMO (XL-MIMO), low earth orbit (LEO) satellites communication, data-driven intelligent massive beamforming, and multi-target spatial signal processing, i.e., joint communication and sensing, positioning, etc.