Source author record

Ying Zhu

Ying Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.ST physics.flu-dyn Statistics Theory alg-geom Artificial Intelligence cond-mat.other dg-ga eess.AS Human-Computer Interaction math.AG math.DG nlin.CD physics.app-ph Sound

Catalog footprint

What is connected

8works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Diffusion Language Models (dLLMs) have emerged as promising alternatives to Auto-Regressive (AR) models. While recent efforts have validated their pre-training potential and accelerated inference speeds, the post-training landscape for dLLMs remains underdeveloped. Existing methods suffer from computational inefficiency and objective mismatches between training and inference, severely limiting performance on complex reasoning tasks such as mathematics. To address this, we introduce DiRL, an efficient post-training framework that tightly integrates FlexAttention-accelerated blockwise training with LMDeploy-optimized inference. This architecture enables a streamlined online model update loop, facilitating efficient two-stage post-training (Supervised Fine-Tuning followed by Reinforcement Learning). Building on this framework, we propose DiPO, the first unbiased Group Relative Policy Optimization (GRPO) implementation tailored for dLLMs. We validate our approach by training DiRL-8B-Instruct on high-quality math data. Our model achieves state-of-the-art math performance among dLLMs and surpasses comparable models in the Qwen2.5 series on several benchmarks.

preprint2022arXiv

SuperVoice: Text-Independent Speaker Verification Using Ultrasound Energy in Human Speech

Voice-activated systems are integrated into a variety of desktop, mobile, and Internet-of-Things (IoT) devices. However, voice spoofing attacks, such as impersonation and replay attacks, in which malicious attackers synthesize the voice of a victim or simply replay it, have brought growing security concerns. Existing speaker verification techniques distinguish individual speakers via the spectrographic features extracted from an audible frequency range of voice commands. However, they often have high error rates and/or long delays. In this paper, we explore a new direction of human voice research by scrutinizing the unique characteristics of human speech at the ultrasound frequency band. Our research indicates that the high-frequency ultrasound components (e.g. speech fricatives) from 20 to 48 kHz can significantly enhance the security and accuracy of speaker verification. We propose a speaker verification system, SUPERVOICE that uses a two-stream DNN architecture with a feature fusion mechanism to generate distinctive speaker models. To test the system, we create a speech dataset with 12 hours of audio (8,950 voice samples) from 127 participants. In addition, we create a second spoofed voice dataset to evaluate its security. In order to balance between controlled recordings and real-world applications, the audio recordings are collected from two quiet rooms by 8 different recording devices, including 7 smartphones and an ultrasound microphone. Our evaluation shows that SUPERVOICE achieves 0.58% equal error rate in the speaker verification task, it only takes 120 ms for testing an incoming utterance, outperforming all existing speaker verification systems. Moreover, within 91 ms processing time, SUPERVOICE achieves 0% equal error rate in detecting replay attacks launched by 5 different loudspeakers.

preprint2022arXiv

Testing wave turbulence theory for Gross-Pitaevskii system

We test the predictions of the theory of weak wave turbulence by performing numerical simulations of the Gross-Pitaevskii equation (GPE) and the associated wave-kinetic equation (WKE). We consider an initial state localized in Fourier space, and we confront the solutions of the WKE obtained numerically with GPE data for both the wave-action spectrum and the probability density functions (PDFs) of the Fourier mode intensities. We find that the temporal evolution of the GPE data is accurately predicted by the WKE, with no adjustable parameters, for about two nonlinear kinetic times. Qualitative agreement between the GPE and the WKE persists also for longer times with some quantitative deviations that may be attributed to the combination of breakdown of the theoretical assumptions underlying the WKE as well as numerical issues. Furthermore, we study how the wave statistics evolves toward Gaussianity in a time scale of the order of the kinetic time.The excellent agreement between direct numerical simulations of the GPE and the WKE provides a new and solid ground to the theory of weak wave turbulence.

preprint2019arXiv

Towards Commercializing Vanadium Dioxide Films: Investigation of the Impact of Different Interface on the Deterioration Process for Largely Extended Service Life

Long term stability is the most pressing issue that impedes commercialization of Vanadium Dioxide (VO2) based functional films, which show a gradual loss of relative phase transition performance, especially in humid conditions when serving as smart windows. Here, we investigated the impact of different interface on the deterioration process of VO2 films and proposed a novel encapsulation structure for largely extended service life. Hydrophobic and stable hafnium dioxide (HfO2) layers have been incorporated with VO2 films for encapsulated surfaces and cross sections. With modified thickness and structure of HfO2 layers, the degradation process of VO2 can be effectively suppressed. The proposed films can retain stable phase transition performances under high relative humidity (90%) and temperature (60 Celsius) over 100 days, which is equal to about 16 years in the real environment. Improving the stability of VO2 materials is a necessary step towards commercializing production of high performance films for long term use.

preprint2014arXiv

High-Dimensional Semiparametric Selection Models: Estimation Theory with an Application to the Retail Gasoline Market

This paper proposes a multi-stage projection-based Lasso procedure for the semiparametric sample selection model in high-dimensional settings under a weak nonparametric restriction on the selection correction. In particular, the number of regressors in the main equation, p, and the number of regressors in the selection equation, d, can grow with and exceed the sample size n. The analysis considers the exact sparsity case and the approximate sparsity case. The main theoretical results are finite-sample bounds from which sufficient scaling conditions on the sample size for estimation consistency and variable-selection consistency are established. Statistical efficiency of the proposed estimators is studied via lower bounds on minimax risks and the result shows that, for a family of models with exactly sparse structure on the coefficient vector in the main equation, one of the proposed estimators attains the smallest estimation error up to the (n,d,p)-scaling among a class of procedures in worst-case scenarios. Inference procedures for the coefficients of the main equation, one based on a pivotal Dantzig selector to construct non-asymptotic confidence sets and one based on a post-selection strategy, are discussed. Other theoretical contributions include establishing the non-asymptotic counterpart of the familiar asymptotic oracle results from previous literature: the estimator of the coefficients in the main equation behaves as if the unknown nonparametric component were known, provided the nonparametric component is sufficiently smooth. Small-sample performance of the high-dimensional multi-stage estimation procedure is evaluated by Monte-Carlo simulations and illustrated with an empirical application to the retail gasoline market in the Greater Saint Louis area.

preprint2013arXiv

Sparse Linear Models and Two-Stage Estimation in High-Dimensional Settings with Possibly Many Endogenous Regressors

This paper explores the validity of the two-stage estimation procedure for sparse linear models in high-dimensional settings with possibly many endogenous regressors. In particular, the number of endogenous regressors in the main equation and the instruments in the first-stage equations can grow with and exceed the sample size n. The analysis concerns the exact sparsity case, i.e., the maximum number of non-zero components in the vectors of parameters in the first-stage equations, k1, and the number of non-zero components in the vector of parameters in the second-stage equation, k2, are allowed to grow with n but slowly compared to n. I consider the high-dimensional version of the two-stage least square estimator where one obtains the fitted regressors from the first-stage regression by a least square estimator with l_1-regularization (the Lasso or Dantzig selector) when the first-stage regression concerns a large number of instruments relative to n, and then construct a similar estimator using these fitted regressors in the second-stage regression. The main theoretical results of this paper are non-asymptotic bounds from which I establish sufficient scaling conditions on the sample size for estimation consistency in l_2-norm and variable-selection consistency. A technical issue regarding the so-called "restricted eigenvalue (RE) condition" for estimation consistency and the "mutual incoherence (MI) condition" for selection consistency arises in the two-stage estimation from allowing the number of regressors in the main equation to exceed n and this paper provides analysis to verify these RE and MI conditions. Depending on the underlying assumptions, the upper bounds on the l_2-error and the sample size required to obtain these consistency results differ by factors involving k1 and/or k2. Simulations are conducted to gain insight on the finite sample performance.

preprint2012arXiv

How to freeze drops with powder

This document accompanies fluid dyanmics video entry V83911 for APS DFD 2012 meeting. In this video, we present experiments on how drop oscillations can be "frozen" using hydrophobic powders.

preprint1995arXiv

A Generalization of the Kodaira Vanishing and Embedding Theorem

We give several generalizations of the Kodaira vanishing and embedding theorems for Kähler manifolds to the case where the relevent line bundle has a small region of negative curvature. To prove the vanishing theorems we adapt techniques of Elworthy-Rosenberg for vanishing theorems in Riemannian geometry. For the embedding theorem, we show that a Kähler manifold with a mostly positive line bundle is Moishezon, since the usual blow up techniques do not work in our situation.