Researcher profile

Yong Wu

Yong Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Boundary-Aware NL2SQL: Integrating Reliability through Hybrid Reward and Data Synthesis

In this paper, we present BAR-SQL (Boundary-Aware Reliable NL2SQL), a unified training framework that embeds reliability and boundary awareness directly into the generation process. We introduce a Seed Mutation data synthesis paradigm that constructs a representative enterprise corpus, explicitly encompassing multi-step analytical queries alongside boundary cases including ambiguity and schema limitations. To ensure interpretability, we employ Knowledge-Grounded Reasoning Synthesis, which produces Chain-of-Thought traces explicitly anchored in schema metadata and business rules. The model is trained through a two-stage process: Supervised Fine-Tuning (SFT) followed by Reinforcement Learning via Group Relative Policy Optimization. We design a Task-Conditioned Hybrid Reward mechanism that simultaneously optimizes SQL execution accuracy-leveraging Abstract Syntax Tree analysis and dense result matching-and semantic precision in abstention responses. To evaluate reliability alongside generation accuracy, we construct and release Ent-SQL-Bench, which jointly assesse SQL precision and boundary-aware abstention across ambiguous and unanswerable queries. Experimental results on this benchmark demonstrate that BAR-SQL achieves 91.48% average accuracy, outperforming leading proprietary models, including Claude 4.5 Sonnet and GPT-5, in both SQL generation quality and boundary-aware abstention capability. The source code and benchmark are available anonymously at: https://github.com/TianSongS/BAR-SQL.

preprint2026arXiv

FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems

Recent advances show that large language models (LLMs) can act as autonomous agents capable of generating GPU kernels, but integrating these AI-generated kernels into real-world inference systems remains challenging. FlashInfer-Bench addresses this gap by establishing a standardized, closed-loop framework that connects kernel generation, benchmarking, and deployment. At its core, FlashInfer Trace provides a unified schema describing kernel definitions, workloads, implementations, and evaluations, enabling consistent communication between agents and systems. Built on real serving traces, FlashInfer-Bench includes a curated dataset, a robust correctness- and performance-aware benchmarking framework, a public leaderboard to track LLM agents' GPU programming capabilities, and a dynamic substitution mechanism (apply()) that seamlessly injects the best-performing kernels into production LLM engines such as SGLang and vLLM. Using FlashInfer-Bench, we further evaluate the performance and limitations of LLM agents, compare the trade-offs among different GPU programming languages, and provide insights for future agent design. FlashInfer-Bench thus establishes a practical, reproducible pathway for continuously improving AI-generated kernels and deploying them into large-scale LLM inference.

preprint2026arXiv

IndexTTS 2.5 Technical Report

In prior work, we introduced IndexTTS 2, a zero-shot neural text-to-speech foundation model comprising two core components: a transformer-based Text-to-Semantic (T2S) module and a non-autoregressive Semantic-to-Mel (S2M) module, which together enable faithful emotion replication and establish the first autoregressive duration-controllable generative paradigm. Building upon this, we present IndexTTS 2.5, which significantly enhances multilingual coverage, inference speed, and overall synthesis quality through four key improvements: 1) Semantic Codec Compression: we reduce the semantic codec frame rate from 50 Hz to 25 Hz, halving sequence length and substantially lowering both training and inference costs; 2) Architectural Upgrade: we replace the U-DiT-based backbone of the S2M module with a more efficient Zipformer-based modeling architecture, achieving notable parameter reduction and faster mel-spectrogram generation; 3) Multilingual Extension: We propose three explicit cross-lingual modeling strategies, boundary-aware alignment, token-level concatenation, and instruction-guided generation, establishing practical design principles for zero-shot multilingual emotional TTS that supports Chinese, English, Japanese, and Spanish, and enables robust emotion transfer even without target-language emotional training data; 4) Reinforcement Learning Optimization: we apply GRPO in post-training of the T2S module, improving pronunciation accuracy and natrualness. Experiments show that IndexTTS 2.5 not only supports broader language coverage but also replicates emotional prosody in unseen languages under the same zero-shot setting. IndexTTS 2.5 achieves a 2.28 times improvement in RTF while maintaining comparable WER and speaker similarity to IndexTTS 2.

preprint2026arXiv

Multiple charge transfer driven complex reaction dynamics: covalent bonding meets van der Waals interactions

Ultrafast charge transfer (CT) processes redistribute electronic charge within and between molecular units and play a central role in many physical, chemical, and biological phenomena. However, the microscopic pathways of multiple CT events, including the coupled structural evolution and energy redistribution, are challenging to disentangle experimentally in complex systems. To obtain controlled insight into such dynamics, well-defined properties are required. Here, we investigate the N2Ar dimer, which combines a covalent bond with a weak van der Waals interaction, using site-selective synchrotron photoionization and coincident detection of electrons and ions. Combined with ab initio calculations, this approach enables step-by-step tracking of ultrafast CT and fragmentation dynamics. We find that the dimer's structural evolution triggers a second CT event, opening complex reaction pathways in which electrons are transferred back and forth between Ar and N2, through two nonadiabatic transitions involving conical intersections. These results demonstrate that sequential multiple CT-induced transitions, even in a simple dimer, provide controlled insight into nonadiabatic reaction mechanisms relevant to complex systems.

preprint2025arXiv

Index-ASR Technical Report

Automatic speech recognition (ASR) has witnessed remarkable progress in recent years, largely driven by the emergence of LLM-based ASR paradigm. Despite their strong performance on a variety of open-source benchmarks, existing LLM-based ASR systems still suffer from two critical limitations. First, they are prone to hallucination errors, often generating excessively long and repetitive outputs that are not well grounded in the acoustic input. Second, they provide limited support for flexible and fine-grained contextual customization. To address these challenges, we propose Index-ASR, a large-scale LLM-based ASR system designed to simultaneously enhance robustness and support customizable hotword recognition. The core idea of Index-ASR lies in the integration of LLM and large-scale training data enriched with background noise and contextual information. Experimental results show that our Index-ASR achieves strong performance on both open-source benchmarks and in-house test sets, highlighting its robustness and practicality for real-world ASR applications.

preprint2021arXiv

A class of fractional p()-Kirchhoff type systems

This paper is concerned with an elliptic system of Kirchhoff type, driven by the variable-order fractional $p(x)$-operator. With the help of the direct variational method and Ekeland variational principle, we show the existence of a weak solution. This is our first attempt to study this kind of system, in the case of variable-order fractional variable exponents. Our main theorem extends in several directions previous results.

preprint2021arXiv

Personal Fixations-Based Object Segmentation with Object Localization and Boundary Preservation

As a natural way for human-computer interaction, fixation provides a promising solution for interactive image segmentation. In this paper, we focus on Personal Fixations-based Object Segmentation (PFOS) to address issues in previous studies, such as the lack of appropriate dataset and the ambiguity in fixations-based interaction. In particular, we first construct a new PFOS dataset by carefully collecting pixel-level binary annotation data over an existing fixation prediction dataset, such dataset is expected to greatly facilitate the study along the line. Then, considering characteristics of personal fixations, we propose a novel network based on Object Localization and Boundary Preservation (OLBP) to segment the gazed objects. Specifically, the OLBP network utilizes an Object Localization Module (OLM) to analyze personal fixations and locates the gazed objects based on the interpretation. Then, a Boundary Preservation Module (BPM) is designed to introduce additional boundary information to guard the completeness of the gazed objects. Moreover, OLBP is organized in the mixed bottom-up and top-down manner with multiple types of deep supervision. Extensive experiments on the constructed PFOS dataset show the superiority of the proposed OLBP network over 17 state-of-the-art methods, and demonstrate the effectiveness of the proposed OLM and BPM components. The constructed PFOS dataset and the proposed OLBP network are available at https://github.com/MathLee/OLBPNet4PFOS.

preprint2020arXiv

Transport of Finite Size Self-Propelled Particles Confined in a 2D Zigzag Channel with Gaussian Colored Noise

The directional transport of finite size self-propelled Brownian particles confined in a 2D zigzag channel with colored noise is investigated. The noises(noise parallel to x-axis and y-axis), the asymmetry parameter Δk, the ratio f(ratio of the particle radius and the bottleneck half width), the selfpropelled speed v0 have joint effect on the particles. The average velocity of self-propelled particles is significantly different from passive particles. The average velocity exhibits complicated behavior with increasing self-propelled speed v0

preprint2013arXiv

Constructing 3-Lie algebras

3-Lie algebras are constructed by Lie algebras, derivations and linear functions, associative commutative algebras, whose involutions and derivations. Then the 3-Lie algebras are obtained from group algebras $F[G]$. An infinite dimensional simple 3-Lie algebra $(A, [,,]_{ω, δ_0})$ and a non-simple 3-Lie algebra $(A, [,,]_{ω_1, δ})$ are constructed by Laurent polynomials $A=F[t, t^{-1}]$ and its involutions $ω$ and $ω_1$ and derivations $δ$ and $δ_0$. At last of the paper, we summarize the methods of constructing $n$-Lie algebras for $n\geq 3$ and provide a problem.

preprint2012arXiv

On 3-Lie algebras with abelian ideals and subalgebras

In this paper, we study the maximal dimension $α(L)$ of abelian subalgebras and the maximal dimension $β(L)$ of abelian ideals of m-dimensional 3-Lie algebras $L$ over an algebraically closed field. We show that these dimensions do not coincide if the field is of characteristic zero, even for nilpotent 3-Lie algebras. We then prove that 3-Lie algebras with $β(L) = m-2$ are 2-step solvable (see definition in Section 2). Furthermore, we give a precise description of these 3-Lie algebras with one or two dimensional derived algebras. In addition, we provide a classification of 3-Lie algebras with $α(L)=\dim L-2$. We also obtain the classification of 3-Lie algebras with $α(L)=\dim L-1$ and with their derived algebras of one dimension.