Source author record

Yong Wu

Yong Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

17works

21topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis

In this paper, we present BAR-SQL (Boundary-Aware Reliable NL2SQL), a unified training framework that embeds reliability and boundary awareness directly into the generation process. We introduce a Seed Mutation data synthesis paradigm that constructs a representative enterprise corpus, explicitly encompassing multi-step analytical queries alongside boundary cases including ambiguity and schema limitations. To ensure interpretability, we employ Knowledge-Grounded Reasoning Synthesis, which produces Chain-of-Thought traces explicitly anchored in schema metadata and business rules. The model is trained through a two-stage process: Supervised Fine-Tuning (SFT) followed by Reinforcement Learning via Group Relative Policy Optimization. We design a Task-Conditioned Hybrid Reward mechanism that simultaneously optimizes SQL execution accuracy-leveraging Abstract Syntax Tree analysis and dense result matching-and semantic precision in abstention responses. To evaluate reliability alongside generation accuracy, we construct and release Ent-SQL-Bench, which jointly assesse SQL precision and boundary-aware abstention across ambiguous and unanswerable queries. Experimental results on this benchmark demonstrate that BAR-SQL achieves 91.48% average accuracy, outperforming leading proprietary models, including Claude 4.5 Sonnet and GPT-5, in both SQL generation quality and boundary-aware abstention capability. The source code and benchmark are available anonymously at: https://github.com/TianSongS/BAR-SQL.

preprint2026arXiv

FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems

Recent advances show that large language models (LLMs) can act as autonomous agents capable of generating GPU kernels, but integrating these AI-generated kernels into real-world inference systems remains challenging. FlashInfer-Bench addresses this gap by establishing a standardized, closed-loop framework that connects kernel generation, benchmarking, and deployment. At its core, FlashInfer Trace provides a unified schema describing kernel definitions, workloads, implementations, and evaluations, enabling consistent communication between agents and systems. Built on real serving traces, FlashInfer-Bench includes a curated dataset, a robust correctness- and performance-aware benchmarking framework, a public leaderboard to track LLM agents' GPU programming capabilities, and a dynamic substitution mechanism (apply()) that seamlessly injects the best-performing kernels into production LLM engines such as SGLang and vLLM. Using FlashInfer-Bench, we further evaluate the performance and limitations of LLM agents, compare the trade-offs among different GPU programming languages, and provide insights for future agent design. FlashInfer-Bench thus establishes a practical, reproducible pathway for continuously improving AI-generated kernels and deploying them into large-scale LLM inference.

preprint2026arXiv

IndexTTS 2.5 Technical Report

In prior work, we introduced IndexTTS 2, a zero-shot neural text-to-speech foundation model comprising two core components: a transformer-based Text-to-Semantic (T2S) module and a non-autoregressive Semantic-to-Mel (S2M) module, which together enable faithful emotion replication and establish the first autoregressive duration-controllable generative paradigm. Building upon this, we present IndexTTS 2.5, which significantly enhances multilingual coverage, inference speed, and overall synthesis quality through four key improvements: 1) Semantic Codec Compression: we reduce the semantic codec frame rate from 50 Hz to 25 Hz, halving sequence length and substantially lowering both training and inference costs; 2) Architectural Upgrade: we replace the U-DiT-based backbone of the S2M module with a more efficient Zipformer-based modeling architecture, achieving notable parameter reduction and faster mel-spectrogram generation; 3) Multilingual Extension: We propose three explicit cross-lingual modeling strategies, boundary-aware alignment, token-level concatenation, and instruction-guided generation, establishing practical design principles for zero-shot multilingual emotional TTS that supports Chinese, English, Japanese, and Spanish, and enables robust emotion transfer even without target-language emotional training data; 4) Reinforcement Learning Optimization: we apply GRPO in post-training of the T2S module, improving pronunciation accuracy and natrualness. Experiments show that IndexTTS 2.5 not only supports broader language coverage but also replicates emotional prosody in unseen languages under the same zero-shot setting. IndexTTS 2.5 achieves a 2.28 times improvement in RTF while maintaining comparable WER and speaker similarity to IndexTTS 2.

preprint2026arXiv

Multiple charge transfer driven complex reaction dynamics: covalent bonding meets van der Waals interactions

Ultrafast charge transfer (CT) processes redistribute electronic charge within and between molecular units and play a central role in many physical, chemical, and biological phenomena. However, the microscopic pathways of multiple CT events, including the coupled structural evolution and energy redistribution, are challenging to disentangle experimentally in complex systems. To obtain controlled insight into such dynamics, well-defined properties are required. Here, we investigate the N2Ar dimer, which combines a covalent bond with a weak van der Waals interaction, using site-selective synchrotron photoionization and coincident detection of electrons and ions. Combined with ab initio calculations, this approach enables step-by-step tracking of ultrafast CT and fragmentation dynamics. We find that the dimer's structural evolution triggers a second CT event, opening complex reaction pathways in which electrons are transferred back and forth between Ar and N2, through two nonadiabatic transitions involving conical intersections. These results demonstrate that sequential multiple CT-induced transitions, even in a simple dimer, provide controlled insight into nonadiabatic reaction mechanisms relevant to complex systems.

preprint2025arXiv

Index-ASR Technical Report

Automatic speech recognition (ASR) has witnessed remarkable progress in recent years, largely driven by the emergence of LLM-based ASR paradigm. Despite their strong performance on a variety of open-source benchmarks, existing LLM-based ASR systems still suffer from two critical limitations. First, they are prone to hallucination errors, often generating excessively long and repetitive outputs that are not well grounded in the acoustic input. Second, they provide limited support for flexible and fine-grained contextual customization. To address these challenges, we propose Index-ASR, a large-scale LLM-based ASR system designed to simultaneously enhance robustness and support customizable hotword recognition. The core idea of Index-ASR lies in the integration of LLM and large-scale training data enriched with background noise and contextual information. Experimental results show that our Index-ASR achieves strong performance on both open-source benchmarks and in-house test sets, highlighting its robustness and practicality for real-world ASR applications.

preprint2021arXiv

A class of fractional p()-Kirchhoff type systems

This paper is concerned with an elliptic system of Kirchhoff type, driven by the variable-order fractional $p(x)$-operator. With the help of the direct variational method and Ekeland variational principle, we show the existence of a weak solution. This is our first attempt to study this kind of system, in the case of variable-order fractional variable exponents. Our main theorem extends in several directions previous results.

preprint2021arXiv

Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

As a natural way for human-computer interaction, fixation provides a promising solution for interactive image segmentation. In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction. In particular, we first construct a new PFOS dataset by carefully collecting pixel-level binary annotation data over an existing fixation prediction dataset, such dataset is expected to greatly facilitate the study along the line. Then, considering characteristics of personal fixations, we propose a novel network based on Object Localization and Boundary Preservation (OLBP) to segment the gazed objects. Specifically, the OLBP network utilizes an Object Localization Module (OLM) to analyze personal fixations and locates the gazed objects based on the interpretation. Then, a Boundary Preservation Module (BPM) is designed to introduce additional boundary information to guard the completeness of the gazed objects. Moreover, OLBP is organized in the mixed bottom-up and top-down manner with multiple types of deep supervision. Extensive experiments on the constructed PFOS dataset show the superiority of the proposed OLBP network over 17 state-of-the-art methods, and demonstrate the effectiveness of the proposed OLM and BPM components. The constructed PFOS dataset and the proposed OLBP network are available at https://github.com/MathLee/OLBPNet4PFOS.

preprint2020arXiv

Transport of Finite Size Self-Propelled Particles Confined in a 2D Zigzag Channel with Gaussian Colored Noise

The directional transport of finite size self-propelled Brownian particles confined in a 2D zigzag channel with colored noise is investigated. The noises(noise parallel to x-axis and y-axis), the asymmetry parameter Δk, the ratio f(ratio of the particle radius and the bottleneck half width), the selfpropelled speed v0 have joint effect on the particles. The average velocity of self-propelled particles is significantly different from passive particles. The average velocity exhibits complicated behavior with increasing self-propelled speed v0

preprint2016arXiv

Binarized Neural Networks on the ImageNet Classification Task

We trained Binarized Neural Networks (BNNs) on the high resolution ImageNet ILSVRC-2102 dataset classification task and achieved a good performance. With a moderate size network of 13 layers, we obtained top-5 classification accuracy rate of 84.1 % on validation set through network distillation, much better than previous published results of 73.2% on XNOR network and 69.1% on binarized GoogleNET. We expect networks of better performance can be obtained by following our current strategies. We provide a detailed discussion and preliminary analysis on strategies used in the network training.

preprint2016arXiv

Highly Nonlinear Luminescence Induced by Gold Nanoparticles on Glass Surfaces with Continuous-Wave Laser Illumination

We report on highly nonlinear luminescence being observed from individual spherical gold nanoparticles immobilized on a glass surface and illuminated by continuous-wave (CW) lasers with relatively low power. The nonlinear luminescence shows optical super-resolution beyond the diffraction limit in three dimensions compared to the scatting of the excitation laser light. The luminescence intensity from most nanoparticles is proportional to the 5th--7th power of the excitation laser power and has wide excitation and emission spectra across the visible wavelength range. Strong nonlinear luminescence is only observed near the glass surface. High optical nonlinearity excited by low CW laser power is related to a long-lived dark state of the gold nanoparticles, where the excitation light is strongly absorbed. This phenomenon has potential biological applications in super-resolution and deep tissue imaging.

preprint2016arXiv

Tunable spin-orbit coupling and symmetry-protected edge states in graphene/WS$_2$

We demonstrate clear weak anti-localization (WAL) effect arising from induced Rashba spin-orbit coupling (SOC) in WS$_2$-covered single-layer and bilayer graphene devices. Contrary to the uncovered region of a shared single-layer graphene flake, WAL in WS$_2$-covered graphene occurs over a wide range of carrier densities on both electron and hole sides. At high carrier densities, we estimate the Rashba SOC relaxation rate to be $\sim 0.2 \rm{ps}^{-1}$ and show that it can be tuned by transverse electric fields. In addition to the Rashba SOC, we also predict the existence of a `valley-Zeeman' SOC from first-principles calculations. The interplay between these two SOC's can open a non-topological but interesting gap in graphene; in particular, zigzag boundaries host four sub-gap edge states protected by time-reversal and crystalline symmetries. The graphene/WS$_2$ system provides a possible platform for these novel edge states.

preprint2016arXiv

Who pumps spin current into nonmagnetic-metal (NM) layer in YIG/NM multilayers at ferromagnetic resonance?

Spin pumping in Yttrium-iron-garnet (YIG)/nonmagnetic-metal (NM) layer systems under ferromagnetic resonance (FMR) conditions is a popular method of generating spin current in the NM layer. A good understanding of the spin current source is essential in extracting spin Hall angle of the NM and in potential spintronics applications. It is widely believed that spin current is pumped from precessing YIG magnetization into NM layer. Here, by combining microwave absorption and DC-voltage measurements on YIG/Pt and YIG/NM1/NM2 (NM1=Cu or Al, NM2=Pt or Ta), we unambiguously showed that spin current in NM came from the magnetized NM surface (in contact with YIG) due to the magnetic proximity effect (MPE), rather than the precessing YIG magnetization. This conclusion is reached through our unique detecting method where the FMR microwave absorption of the magnetized NM surface, hardly observed in the conventional FMR experiments, was greatly amplified when the electrical detection circuit was switched on.

preprint2015arXiv

Resonant Scanning with Large Field of View Reduces Photobleaching and Enhances Fluorescence Yield in STED Microscopy

Photobleaching is a major limitation of superresolution Stimulated Depletion Emission (STED) microscopy. Fast scanning has long been considered an effective means to reduce photobleaching in fluorescence microscopy, but a careful quantitative study of this issue is missing. In this paper, we show that the photobleaching rate in STED microscopy is slowed down and fluorescence yield is enhanced by scanning with high linear speed, enabled by the large field of view in our custom-built resonant-scanning STED microscope. The effect of scanning speed on photobleaching and fluorescence yield is more remarkable at higher levels of depletion laser irradiance, and virtually disappears in conventional confocal microscopy. With a depletion irradiance of >0.2 GW$\cdot$cm$^{-2}$ (time average), we were able to extend the fluorescence survival time of the Atto 647N dye by ~80% with an 8-fold wider field of view. We confirm that STED Photobleaching is primarily caused by the depletion light acting upon the excited fluorophores. Experimental data agree with a theoretical model. Our results encourage further increasing linear scanning speed for photobleaching reduction in STED microscopy.

preprint2014arXiv

Can Charge Exchange Explain Anomalous Soft X-ray Emission in the Cygnus Loop?

Recent X-ray studies have shown that supernova shock models are unable to satisfactorily explain X-ray emission in the rim of the Cygnus Loop. In an attempt to account for this anomalously enhanced X-ray flux, we fit the region with a model including theoretical charge exchange (CX) data along with shock and background X-ray models. The model includes the CX collisions of $O^{8+}$, $O{7+}$, $N^{7+}$, $N^{6+}$, $C^{6+}$, and $C^{5+}$ with H with an energy of 1 keV/u (438 km/s). The observations reveal a strong emission feature near 0.7 keV that cannot fully be accounted for by a shock model, nor the current CX data. Inclusion of CX, specifically $O^{7+} + H$, does provide for a statistically significant improvement over a pure shock model.

preprint2013arXiv

Constructing 3-Lie algebras

3-Lie algebras are constructed by Lie algebras, derivations and linear functions, associative commutative algebras, whose involutions and derivations. Then the 3-Lie algebras are obtained from group algebras $F[G]$. An infinite dimensional simple 3-Lie algebra $(A, [,,]_{ω, δ_0})$ and a non-simple 3-Lie algebra $(A, [,,]_{ω_1, δ})$ are constructed by Laurent polynomials $A=F[t, t^{-1}]$ and its involutions $ω$ and $ω_1$ and derivations $δ$ and $δ_0$. At last of the paper, we summarize the methods of constructing $n$-Lie algebras for $n\geq 3$ and provide a problem.

preprint2013arXiv

Rota-Baxter 3-Lie algebras

In this paper we introduce the concepts of a Rota-Baxter operator and a differential operator with weights on an $n$-algebra. We then focus on Rota-Baxter 3-Lie algebras and show that they can be derived from Rota-Baxter Lie algebras and pre-Lie algebras and from Rota-Baxter commutative associative algebras with derivations. We also establish the inheritance property of Rota-Baxter 3-Lie algebras.

preprint2012arXiv

On 3-Lie algebras with abelian ideals and subalgebras

In this paper, we study the maximal dimension $α(L)$ of abelian subalgebras and the maximal dimension $β(L)$ of abelian ideals of m-dimensional 3-Lie algebras $L$ over an algebraically closed field. We show that these dimensions do not coincide if the field is of characteristic zero, even for nilpotent 3-Lie algebras. We then prove that 3-Lie algebras with $β(L) = m-2$ are 2-step solvable (see definition in Section 2). Furthermore, we give a precise description of these 3-Lie algebras with one or two dimensional derived algebras. In addition, we provide a classification of 3-Lie algebras with $α(L)=\dim L-2$. We also obtain the classification of 3-Lie algebras with $α(L)=\dim L-1$ and with their derived algebras of one dimension.

Yong Wu

What is connected

Connect this record

See the researcher in context

Building this map preview

17 published item(s)

Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis

FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems

IndexTTS 2.5 Technical Report

Multiple charge transfer driven complex reaction dynamics: covalent bonding meets van der Waals interactions

Index-ASR Technical Report

A class of fractional p()-Kirchhoff type systems

Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

Transport of Finite Size Self-Propelled Particles Confined in a 2D Zigzag Channel with Gaussian Colored Noise

Binarized Neural Networks on the ImageNet Classification Task

Highly Nonlinear Luminescence Induced by Gold Nanoparticles on Glass Surfaces with Continuous-Wave Laser Illumination

Tunable spin-orbit coupling and symmetry-protected edge states in graphene/WS$_2$

Who pumps spin current into nonmagnetic-metal (NM) layer in YIG/NM multilayers at ferromagnetic resonance?

Resonant Scanning with Large Field of View Reduces Photobleaching and Enhances Fluorescence Yield in STED Microscopy

Can Charge Exchange Explain Anomalous Soft X-ray Emission in the Cygnus Loop?

Constructing 3-Lie algebras

Rota-Baxter 3-Lie algebras

On 3-Lie algebras with abelian ideals and subalgebras