Source author record

Yingying Li

Yingying Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

14works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation

While increasing attention has been paid to co-speech gesture synthesis, most previous works neglect to investigate hand gestures with explicit and essential semantics. In this paper, we study co-speech gesture generation with an emphasis on specific hand gesture activation, which can deliver more instructional information than common body movements. To achieve this, we first build a high-quality dataset of 3D human body movements including a set of semantically explicit hand gestures that are commonly used by live streamers. Then we present a hybrid-modality gesture generation system GestureHYDRA built upon a hybrid-modality diffusion transformer architecture with novelly designed motion-style injective transformer layers, which enables advanced gesture modeling ability and versatile gesture operations. To guarantee these specific hand gestures can be activated, we introduce a cascaded retrieval-augmented generation strategy built upon a semantic gesture repository annotated for each subject and an adaptive audio-gesture synchronization mechanism, which substantially improves semantic gesture activation and production efficiency. Quantitative and qualitative experiments demonstrate that our proposed approach achieves superior performance over all the counterparts. The project page can be found at https://mumuwei.github.io/GestureHYDRA/.

preprint2023arXiv

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing the number of factor proxies to diverge, and hence make the latent factor model estimation robust, flexible, and statistically more accurate. As a bonus, the number of factors is also allowed to grow. At the heart of our method is a penalized reduced rank regression to combine information. To further deal with heavy-tailed data, a computationally attractive penalized robust reduced rank regression method is proposed. We establish faster rates of convergence compared with the benchmark. Extensive simulations and real examples are used to illustrate the advantages.

preprint2022arXiv

First Identification of New X-Ray Spectra of Mo39+, Mo40+, W43+, W44+ and W45+ on EAST

New high-resolution x-ray spectra of Mo39+, Mo40+, W43+, W44+ and W45+ have been carefully confirmed for the first time by use of the x-ray imaging crystal spectrometer (XCS) in Experimental Advanced Superconducting Tokamak (EAST) under various combined auxiliary heating plasmas conditions. Wavelength of these new x-ray spectra is ranged from 3.895 Å to 3.986 Å. When core electron temperature (Te0) reaches 6.0 keV, Mo39+ and Mo40+ lines of 3.9727, 3.9294 and 3.9480 Å can be effectively detected on XCS for EAST; meanwhile, line-integrated brightness of these spectral lines of Mo39+ and Mo40+ is very considerable when electron temperature reaches 12.9 keV. Multi-components spectral lines for W43+, W44+ and W45+ have also been identified when Te0 reaches 6 keV. Parts of spectral lines, such as Zn-1, Cu-2, Cu-4a, Cu-4d and Cu-5 lines of tungsten, are first observed experimentally. When electron temperature reaches 12.9 keV, line-integrated intensity for part of these spectral lines of W43+, W44+ and W45+ are considerable. These experimental results and theoretical predictions from FAC and FLYCHK codes are in good general agreement. These new spectral lines, obtained on XCS for EAST, are vital for deeply uncovering the mechanisms of ion and electron thermal, high-Z impurity and momentum (anomalous) transport to achieve the advanced steady-state operation scenarios for ITER and CFETR.

preprint2022arXiv

Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task

Concurrent perception datasets for autonomous driving are mainly limited to frontal view with sensors mounted on the vehicle. None of them is designed for the overlooked roadside perception tasks. On the other hand, the data captured from roadside cameras have strengths over frontal-view data, which is believed to facilitate a safer and more intelligent autonomous driving system. To accelerate the progress of roadside perception, we present the first high-diversity challenging Roadside Perception 3D dataset- Rope3D from a novel view. The dataset consists of 50k images and over 1.5M 3D objects in various scenes, which are captured under different settings including various cameras with ambiguous mounting positions, camera specifications, viewpoints, and different environmental conditions. We conduct strict 2D-3D joint annotation and comprehensive data analysis, as well as set up a new 3D roadside perception benchmark with metrics and evaluation devkit. Furthermore, we tailor the existing frontal-view monocular 3D object detection approaches and propose to leverage the geometry constraint to solve the inherent ambiguities caused by various sensors, viewpoints. Our dataset is available on https://thudair.baai.ac.cn/rope.

preprint2021arXiv

On the Regret Analysis of Online LQR Control with Predictions

In this paper, we study the dynamic regret of online linear quadratic regulator (LQR) control with time-varying cost functions and disturbances. We consider the case where a finite look-ahead window of cost functions and disturbances is available at each stage. The online control algorithm studied in this paper falls into the category of model predictive control (MPC) with a particular choice of terminal costs to ensure the exponential stability of MPC. It is proved that the regret of such an online algorithm decays exponentially fast with the length of predictions. The impact of inaccurate prediction on disturbances is also investigated in this paper.

preprint2020arXiv

A Reliability-aware Multi-armed Bandit Approach to Learn and Select Users in Demand Response

One challenge in the optimization and control of societal systems is to handle the unknown and uncertain user behavior. This paper focuses on residential demand response (DR) and proposes a closed-loop learning scheme to address these issues. In particular, we consider DR programs where an aggregator calls upon residential users to change their demand so that the total load adjustment is close to a target value. To learn and select the right users, we formulate the DR problem as a combinatorial multi-armed bandit (CMAB) problem with a reliability objective. We propose a learning algorithm: CUCB-Avg (Combinatorial Upper Confidence Bound-Average), which utilizes both upper confidence bounds and sample averages to balance the tradeoff between exploration (learning) and exploitation (selecting). We consider both a fixed time-invariant target and time-varying targets, and show that CUCB-Avg achieves $O(\log T)$ and $O(\sqrt{T \log(T)})$ regrets respectively. Finally, we numerically test our algorithms using synthetic and real data, and demonstrate that our CUCB-Avg performs significantly better than the classic CUCB and also better than Thompson Sampling.

preprint2020arXiv

Online Optimization with Predictions and Switching Costs: Fast Algorithms and the Fundamental Limit

This paper studies an online optimization problem with a finite prediction window of cost functions and additional switching costs on decisions. We propose two gradient-based online algorithms: Receding Horizon Gradient Descent (RHGD), and Receding Horizon Accelerated Gradient (RHAG). Both algorithms only require a finite number of projected gradient evaluations at each stage. We provide upper bounds on the dynamic regrets of the proposed algorithms and show that the regret upper bounds decay exponentially with the length of the prediction window. Moreover, we study the fundamental lower bound on the dynamic regret for a broad class of deterministic online algorithms. The lower bound is close to RHAG's regret upper bound, indicating that our gradient-based RHAG is a near-optimal online algorithm. Finally, we conduct numerical experiments to complement our theoretical analysis.

preprint2016arXiv

Dramatic increase of the onset critical temperature and critical field of elemental Sn in the form of thin nanowires

Sn is a well-known classical superconductor on the border between type I and type II with critical temperature of 3.722K and critical field of 0.031T. We show by means of specific heat and electric magneto-transport data that its critical parameters can be dramatically increased if it is brought in the form of loosely bound bundles of thin nanowires. The specific heat displays a pronounced double phase transition at 3.7K and 5.5K, which we attribute to the inner 'bulk' contribution of the nanowires and to the surface contribution, respectively. The latter is visible only because of the large volume fraction of the surface layer in relation to their bulk inner volume. The upper transition coincides with the onset of the resistive transition, while zero resistance is gradually approached below the lower transition. The large coherence length of 230nm at 0K likely actuates a Josephson coupling between adjacent neighboring nanowires and thus suppresses the effect of 1D phase fluctuations along the nanowires, and stabilizes 3D phase coherence throughout the entire network with zero resistance. A magnetic field of more than 3T is required to restore the normal state, which means that the critical field is enhanced by about two orders of magnitude with respect to Sn in its bulk form.

preprint2015arXiv

Longitudinal Antigenic Sequences and Sites from Intra-Host Evolution (LASSIE) Identifies Immune-Selected HIV Variants

Within-host genetic sequencing from samples collected over time provides a dynamic view of how viruses evade host immunity. Immune-driven mutations might stimulate neutralization breadth by selecting antibodies adapted to cycles of immune escape that generate within-subject epitope diversity. Comprehensive identification of immune-escape mutations is experimentally and computationally challenging. With current technology, many more viral sequences can readily be obtained than can be tested for binding and neutralization, making down-selection necessary. Typically, this is done manually, by picking variants that represent different time-points and branches on a phylogenetic tree. Such strategies are likely to miss many relevant mutations and combinations of mutations, and to be redundant for other mutations. Longitudinal Antigenic Sequences and Sites from Intrahost Evolution (LASSIE) uses transmitted-founder loss to identify virus "hot-spots" under putative immune selection and chooses sequences that represent recurrent mutations in selected sites. LASSIE favors earliest sequences in which mutations arise. With well-characterized longitudinal Env sequences, we confirmed selected sites were concentrated in antibody contacts and selected sequences represented diverse antigenic phenotypes. Practical applications include rapidly identifying immune targets under selective pressure within a subject, selecting minimal sets of reagents for immunological assays that characterize evolving antibody responses, and for immunogens in polyvalent "cocktail" vaccines.

preprint2014arXiv

Super-star networks: Growing optimal scale-free networks via likelihood

Preferential attachment --- by which new nodes attach to existing nodes with probability proportional to the existing nodes' degree --- has become the standard growth model for scale-free networks, where the asymptotic probability of a node having degree $k$ is proportional to $k^{-γ}$. However, the motivation for this model is entirely ad hoc. We use exact likelihood arguments and show that the optimal way to build a scale-free network is to attach most new links to nodes of low degree. Curiously, this leads to a scale-free networks with a single dominant hub: a star-like structure we call a super-star network. Asymptotically, the optimal strategy is to attach each new node to one of the nodes of degree $k$ with probability proportional to $\frac{1}{N+ζ(γ)(k+1)^γ}$ (in a $N$ node network) --- a stronger bias toward high degree nodes than exhibited by standard preferential attachment. Our algorithm generates optimally scale-free networks (the super-star networks) as well as randomly sampling the space of all scale-free networks with a given degree exponent $γ$. We generate viable realisation with finite $N$ for $1\ll γ<2$ as well as $γ>2$. We observe an apparently discontinuous transition at $γ\approx 2$ between so-called super-star networks and more tree-like realisations. Gradually increasing $γ$ further leads to re-emergence of a super-star hub. To quantify these structural features we derive a new analytic expression for the expected degree exponent of a pure preferential attachment process, and introduce alternative measures of network entropy. Our approach is generic and may also be applied to an arbitrary degree distribution.

preprint2013arXiv

Statistical Properties of Microstructure Noise

We study the estimation of moments and joint moments of microstructure noise. Estimators of arbitrary order of (joint) moments are provided, for which we establish consistency as well as central limit theorems. In particular, we provide estimators of auto-covariances and auto-correlations of the noise. Simulation studies demonstrate excellent performance of our estimators even in the presence of jumps and irregular observation times. Empirical studies reveal (moderate) positive auto-correlation of the noise for the stocks tested.

preprint2013arXiv

Volatility Inference in the Presence of Both Endogenous Time and Microstructure Noise

In this article we consider the volatility inference in the presence of both market microstructure noise and endogenous time. Estimators of the integrated volatility in such a setting are proposed, and their asymptotic properties are studied. Our proposed estimator is compared with the existing popular volatility estimators via numerical studies. The results show that our estimator can have substantially better performance when time endogeneity exists.

preprint2012arXiv

On the estimation of integrated covariance matrices of high dimensional diffusion processes

We consider the estimation of integrated covariance (ICV) matrices of high dimensional diffusion processes based on high frequency observations. We start by studying the most commonly used estimator, the realized covariance (RCV) matrix. We show that in the high dimensional case when the dimension $p$ and the observation frequency $n$ grow in the same rate, the limiting spectral distribution (LSD) of RCV depends on the covolatility process not only through the targeting ICV, but also on how the covolatility process varies in time. We establish a Marčenko--Pastur type theorem for weighted sample covariance matrices, based on which we obtain a Marčenko--Pastur type theorem for RCV for a class $\mathcal{C}$ of diffusion processes. The results explicitly demonstrate how the time variability of the covolatility process affects the LSD of RCV. We further propose an alternative estimator, the time-variation adjusted realized covariance (TVARCV) matrix. We show that for processes in class $\mathcal {C}$, the TVARCV possesses the desirable property that its LSD depends solely on that of the targeting ICV through the Marčenko--Pastur equation, and hence, in particular, the TVARCV can be used to recover the empirical spectral distribution of the ICV by using existing algorithms.

preprint2010arXiv

Vast Volatility Matrix Estimation using High Frequency Data for Portfolio Selection

Portfolio allocation with gross-exposure constraint is an effective method to increase the efficiency and stability of selected portfolios among a vast pool of assets, as demonstrated in Fan et al (2008). The required high-dimensional volatility matrix can be estimated by using high frequency financial data. This enables us to better adapt to the local volatilities and local correlations among vast number of assets and to increase significantly the sample size for estimating the volatility matrix. This paper studies the volatility matrix estimation using high-dimensional high-frequency data from the perspective of portfolio selection. Specifically, we propose the use of "pairwise-refresh time" and "all-refresh time" methods proposed by Barndorff-Nielsen et al (2008) for estimation of vast covariance matrix and compare their merits in the portfolio selection. We also establish the concentration inequalities of the estimates, which guarantee desirable properties of the estimated volatility matrix in vast asset allocation with gross exposure constraints. Extensive numerical studies are made via carefully designed simulations. Comparing with the methods based on low frequency daily data, our methods can capture the most recent trend of the time varying volatility and correlation, hence provide more accurate guidance for the portfolio allocation in the next time period. The advantage of using high-frequency data is significant in our simulation and empirical studies, which consist of 50 simulated assets and 30 constituent stocks of Dow Jones Industrial Average index.

Yingying Li

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

GestureHYDRA: Semantic Co-speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

First Identification of New X-Ray Spectra of Mo39+, Mo40+, W43+, W44+ and W45+ on EAST

Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task

On the Regret Analysis of Online LQR Control with Predictions

A Reliability-aware Multi-armed Bandit Approach to Learn and Select Users in Demand Response

Online Optimization with Predictions and Switching Costs: Fast Algorithms and the Fundamental Limit

Dramatic increase of the onset critical temperature and critical field of elemental Sn in the form of thin nanowires

Longitudinal Antigenic Sequences and Sites from Intra-Host Evolution (LASSIE) Identifies Immune-Selected HIV Variants

Super-star networks: Growing optimal scale-free networks via likelihood

Statistical Properties of Microstructure Noise

Volatility Inference in the Presence of Both Endogenous Time and Microstructure Noise

On the estimation of integrated covariance matrices of high dimensional diffusion processes

Vast Volatility Matrix Estimation using High Frequency Data for Portfolio Selection