Source author record

Xiang Huang

Xiang Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computation and Language Computational Complexity cond-mat.mes-hall cond-mat.mtrl-sci Distributed, Parallel, and Cluster Computing Formal Languages and Automata Theory Graphics physics.app-ph physics.atom-ph physics.comp-ph quant-ph

Catalog footprint

What is connected

7works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Act-Adaptive Margin: Dynamically Calibrating Reward Models for Subjective Ambiguity

Currently, most reinforcement learning tasks focus on domains like mathematics and programming, where verification is relatively straightforward. However, in subjective tasks such as role-playing, alignment techniques struggle to make progress, primarily because subjective reward modeling using the Bradley-Terry model faces significant challenges when dealing with ambiguous preferences. To improve reward modeling in subjective tasks, this paper proposes AAM (\textbf{\underline{A}}ct-\textbf{\underline{A}}daptive \textbf{\underline{M}}argin), which enhances reward modeling by dynamically calibrating preference margins using the model's internal parameter knowledge. We design two versions of AAM that efficiently generate contextually-appropriate preference gaps without additional human annotation. This approach fundamentally improves how reward models handle subjective rewards by better integrating generative understanding with preference scoring. To validate AAM's effectiveness in subjective reward modeling, we conduct evaluations on RewardBench, JudgeBench, and challenging role-playing tasks. Results show that AAM significantly improves subjective reward modeling performance, enhancing Bradley-Terry reward models by 2.95\% in general tasks and 4.85\% in subjective role-playing tasks. Furthermore, reward models trained with AAM can help downstream alignment tasks achieve better results. Our test results show that applying rewards generated by AAM-Augmented RM to preference learning techniques (e.g., GRPO) achieves state-of-the-art results on CharacterEval and Charm. Code and dataset are available at https://github.com/calubkk/AAM.

preprint2022arXiv

Computing Real Numbers with Large-Population Protocols Having a Continuum of Equilibria

Bournez, Fraigniaud, and Koegler defined a number in [0,1] as computable by their Large-Population Protocol (LPP) model, if the proportion of agents in a set of marked states converges to said number over time as the population grows to infinity. The notion, however, restricts the ordinary differential equations (ODEs) associated with an LPP to have only finitely many equilibria. This restriction places an intrinsic limitation on the model. As a result, a number is computable by an LPP if and only if it is algebraic, namely, not a single transcendental number can be computed under this notion. In this paper, we lift the finitary requirement on equilibria. That is, we consider systems with a continuum of equilibria. We show that essentially all numbers in [0,1] that are computable by bounded general-purpose analog computers (GPACs) or chemical reaction networks (CRNs) can also be computed by LPPs under this new definition. This implies a rich series of numbers (e.g., the reciprocal of Euler's constant, $π/4$, Euler's $γ$, Catalan's constant, and Dottie number) are all computable by LPPs. Our proof is constructive: We develop an algorithm that transfers bounded GPACs/CRNs into LPPs. Our algorithm also fixes a gap in Bournez et al.'s construction of LPPs designed to compute any arbitrary algebraic number in [0,1].

preprint2022arXiv

Enhancing thermoelectric properties of isotope graphene nanoribbons via machine learning guided manipulation of disordered antidots and interfaces

Structural manipulation at the nanoscale breaks the intrinsic correlations among different energy carrier transport properties, achieving high thermoelectric performance. However, the coupled multifunctional (phonon and electron) transport in the design of nanomaterials makes the optimization of thermoelectric properties challenging. Machine learning brings convenience to the design of nanostructures with large degree of freedom. Herein, we conducted comprehensive thermoelectric optimization of isotopic armchair graphene nanoribbons (AGNRs) with antidots and interfaces by combining Green's function approach with machine learning algorithms. The optimal AGNR with ZT of 0.894 by manipulating antidots was obtained at the interfaces of the aperiodic isotope superlattices, which is 5.69 times larger than that of the pristine structure. The proposed optimal structure via machine learning provides physical insights that the carbon-13 atoms tend to form a continuous interface barrier perpendicular to the carrier transport direction to suppress the propagation of phonons through isotope AGNRs. The antidot effect is more effective than isotope substitution in improving the thermoelectric properties of AGNRs. The proposed approach coupling energy carrier transport property analysis with machine learning algorithms offers highly efficient guidance on enhancing the thermoelectric properties of low-dimensional nanomaterials, as well as to explore and gain non-intuitive physical insights.

preprint2019arXiv

Coulomb focusing in retrapped ionization with near-circularly polarized laser field

The full three-dimensional photoelectron momentum distributions of argon are measured in intense near-circularly polarized laser fields. We observed that the transverse momentum distribution of ejected electrons by 410-nm near-circularly polarized field is unexpectedly narrowed with increasing laser intensity, which is contrary to the conventional rules predicted by adiabatic theory. By analyzing the momentum-resolved angular momentum distribution measured experimentally and the corresponding trajectories of ejected electrons semiclassically, the narrowing can be attributed to a temporary trapping and thereby focusing of a photoelectron by the atomic potential in a quasibound state. With the near-circularly polarized laser field, the strong Coulomb interaction with the rescattering electrons is avoided, thus the Coulomb focusing in the retrapped process is highlighted. We believe that these findings will facilitate understanding and steering electron dynamics in the Coulomb coupled system.

preprint2016arXiv

Polynomial Space Randomness in Analysis

We study the interaction between polynomial space randomness and a fundamental result of analysis, the Lebesgue differentiation theorem. We generalize Ko's framework for polynomial space computability in $\mathbb{R}^n$ to define \textit{weakly pspace-random} points, a new variant of polynomial space randomness. We show that the Lebesgue differentiation theorem holds for every weakly pspace-random point.

preprint2015arXiv

Massively Parallel Ray Tracing Algorithm Using GPU

Ray tracing is a technique for generating an image by tracing the path of light through pixels in an image plane and simulating the effects of high-quality global illumination at a heavy computational cost. Because of the high computation complexity, it can't reach the requirement of real-time rendering. The emergence of many-core architectures, makes it possible to reduce significantly the running time of ray tracing algorithm by employing the powerful ability of floating point computation. In this paper, a new GPU implementation and optimization of the ray tracing to accelerate the rendering process is presented.

preprint2015arXiv

Three-party quantum private comparison of equality based on genuinely maximally entangled six-qubit states

We propose a new three-party quantum private comparison protocol using genuinely maximally entangled six-qubit states. In our protocol, three participants can determine whether their private information are equal or not without an external third party who helps compute the comparison result. At the same time the participants can preserve the privacy of their inputs, respectively. Our protocol does not need any unitary operations to encode information due to the excellent properties of genuinely maximally entangled six-qubit states. Additionally, the protocol uses one-step quantum transmission and it is congenitally free from Trojan horse attacks. We have also shown that our protocol is secure against outside and participant attacks in this paper.

Xiang Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

Act-Adaptive Margin: Dynamically Calibrating Reward Models for Subjective Ambiguity

Computing Real Numbers with Large-Population Protocols Having a Continuum of Equilibria

Enhancing thermoelectric properties of isotope graphene nanoribbons via machine learning guided manipulation of disordered antidots and interfaces

Coulomb focusing in retrapped ionization with near-circularly polarized laser field

Polynomial Space Randomness in Analysis

Massively Parallel Ray Tracing Algorithm Using GPU

Three-party quantum private comparison of equality based on genuinely maximally entangled six-qubit states