Source author record

Xing Yu

Xing Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.optics Artificial Intelligence Machine Learning Computation and Language

Catalog footprint

What is connected

6works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

On-policy self-distillation, where a student is pulled toward a copy of itself conditioned on privileged context (e.g., a verified solution or feedback), offers a promising direction for advancing reasoning capability without a stronger external teacher. Yet in math reasoning the gains are inconsistent, even when the same approach succeeds elsewhere. A pointwise mutual information analysis traces the failure to the privileged context itself: it inflates the teacher's confidence on tokens already implied by the solution (structural connectives, verifiable claims) and deflates it on deliberation tokens ("Wait", "Let", "Maybe") that drive multi-step search. We propose Anti-Self-Distillation (AntiSD), which ascends a divergence between student and teacher rather than descending it: this reverses the per-token sign and yields a naturally bounded advantage in one step. An entropy-triggered gate disables the term once the teacher entropy collapses, completing a drop-in replacement for default self-distillation. Across five models from 4B to 30B parameters on math reasoning benchmarks, AntiSD reaches the GRPO baseline's accuracy in 2 to 10x fewer training steps and improves final accuracy by up to 11.5 points. AntiSD opens a path to scalable self-improvement, where a language model bootstraps its own reasoning through its training signal.

preprint2026arXiv

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

On-policy self-distillation has emerged as a promising paradigm for post-training language models, in which the model conditions on environment feedback to serve as its own teacher, providing dense token-level rewards without external teacher models or step-level annotations. Despite its empirical success, what this reward actually measures and what kind of credit it assigns remain unclear. Under a posterior-compatibility interpretation of feedback conditioning, standard in the implicit-reward literature, we show that the self-distillation token reward is a Bayesian filtering increment whose trajectory sum is exactly the pointwise mutual information between the response and the feedback given the input. This pMI can be raised by input-specific reasoning or by input-generic shortcuts, so we further decompose the teacher log-probability along the input axis. Based on this analysis, we propose CREDIT (Contrastive REward from DIsTillation), which isolates the input-specific component with a batch-contrastive baseline. At the sequence level, CREDIT is a teacher-side surrogate for a contrastive pMI objective that also penalizes responses remaining likely under unrelated inputs. Across coding, scientific reasoning, and tool-use benchmarks on two model families, CREDIT delivers the strongest aggregate performance at negligible additional compute.

preprint2011arXiv

An ultra-thin waveguide twist constructed using fish-scale metallic wires

This study theoretically and experimentally investigates the transmission properties of a metamaterial slab comprised of two layers of metallic fish-scale structure arrays and a sandwiched dielectric layer. Calculations show that the asymmetric transmission can be tuned by varying the slab thickness, due to evanescent interlayer coupling. The spatial evolution of the local field inside the structure indicates that the slab functions as a perfect polarization transformer at certain frequencies in the manner of a waveguide twist. Measured transmission spectra are in good agreement with calculated results when material dissipation is considered.

preprint2011arXiv

Broadband enhanced transmission through the stacked metallic multi-layers perforated with coaxial annular apertures

This paper theoretically and experimentally presents a first report on broadband enhanced transmission through stacked metallic multi-layers perforated with coaxial annular apertures (CAAs). Different from previous studies on extraordinary transmission that occurs at a single frequency, the enhanced transmission of our system with two or three metallic layers can span a wide frequency range with a bandwidth about 60% of the central frequency. The phenomena arise from the excitation and hybridization of guided resonance modes in CAAs among different layers. Measured transmission spectra are in good agreement with calculations semi-analytically resolved by modal expansion method.

preprint2011arXiv

Metallic helix array as a broadband wave plate

This study proposes that a metallic helix array can operate as a highly-transparent broadband wave plate in propagation directions perpendicular to the axis of helices. The functionality arises from a special property of the helix array, namely that the eigenstates of elliptically right-handed and left-handed polarization are dominated by Bragg scattering and local resonance respectively, and can be modulated separately with nearly fixed difference between their wavevectors in a wide frequency range. The wave plate functionality is theoretically and experimentally demonstrated by the transformation of polarized states in a wide frequency range.

preprint2011arXiv

Subwavelength electromagnetic diode: one-way response of cascading nonlinear meta-atoms

We propose a scheme for realizing subwavelength electromagnetic diode by employing cascading nonlinear meta-atoms. One-way response is demonstrated on a microwave transmission line comprising of three metallic ring resonators acting as meta-atoms and a varactor as the nonlinear medium inclusion. Experiments show that our implementation can operate simultaneously as forward diode and backward diode at different frequencies. A transmission contrast of up to 14.7dB was achieved between forward and backward transmission. Subwavelength size of our diode should be useful for miniaturization of integrated optical nanocircuits.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint