Source author record

Guixian Xu

Guixian Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language eess.SP Machine Learning

Catalog footprint

What is connected

2works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Reinforcement Learning with Semantic Rewards Enables Low-Resource Language Expansion without Alignment Tax

Extending large language models (LLMs) to low-resource languages often incurs an "alignment tax": improvements in the target language come at the cost of catastrophic forgetting in general capabilities. We argue that this trade-off arises from the rigidity of supervised fine-tuning (SFT), which enforces token-level surface imitation on narrow and biased data distributions. To address this limitation, we propose a semantic-space alignment paradigm powered by Group Relative Policy Optimization (GRPO), where the model is optimized using embedding-level semantic rewards rather than likelihood maximization. This objective encourages meaning preservation through flexible realizations, enabling controlled updates that reduce destructive interference with pretrained knowledge. We evaluate our approach on Tibetan-Chinese machine translation and Tibetan headline generation. Experiments show that our method acquires low-resource capabilities while markedly mitigating alignment tax, preserving general competence more effectively than SFT. Despite producing less rigid surface overlap, semantic RL yields higher semantic quality and preference in open-ended generation, and few-shot transfer results indicate that it learns more transferable and robust representations under limited supervision. Overall, our study demonstrates that reinforcement learning with semantic rewards provides a safer and more reliable pathway for inclusive low-resource language expansion.

preprint2020arXiv

Beam-space Multiplexing: Practice, Theory, and Trends-From 4G TD-LTE, 5G, to 6G and Beyond

In this article, the new term, namely beam-space multiplexing, is proposed for the former multi-layer beamforming for 4G TD-LTE in 3GPP releases. We provide a systematic overview of beam-space multiplexing from engineering and theoretical perspectives. Firstly, we clarify the fundamental theory of beam-space multiplexing. Specifically, we provide a comprehensive comparison with the antenna-space multiplexing in terms of theoretical analysis, channel state information acquisition, and engineering implementation constraints. Then, we summarize the key technologies and 3GPP standardization of beam-space multiplexing in 4G TD-LTE and 5G new radio (NR) in terms of multi-layer beamforming and massive beamforming, respectively. We also provide system-level performance evaluation of beam-space multiplexing schemes and field results from current commercial TD-LTE networks and field trial of 5G. The practical deployments of 4G TD-LTE and 5G cellular networks demonstrate the superiority of beam-space multiplexing within the limitations of implementation complexity and practical deployment scenarios. Finally, the future trends of beam-space multiplexing in 6G and beyond are discussed, including massive beamforming for extremely large-scale MIMO (XL-MIMO), low earth orbit (LEO) satellites communication, data-driven intelligent massive beamforming, and multi-target spatial signal processing, i.e., joint communication and sensing, positioning, etc.