Source author record

Wei Yao

Wei Yao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision math.AP Machine Learning cond-mat.mtrl-sci eess.AS eess.IV eess.SY math-ph math.MP math.OC nlin.PS Sound Systems and Control

Catalog footprint

What is connected

14works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Novel $αβ$-Approximation Method Based on Numerical Integration for Discretizing Continuous Systems

In this article, we propose a novel discretization method based on numerical integration for discretizing continuous systems, termed the $αβ$-approximation or Scalable Bilinear Transformation (SBT). In contrast to existing methods, the proposed method consists of two factors, i.e., shape factor ($α$) and time factor ($β$). Depending on the discretization technique applied, we identify two primary distortion modes in discrete resonant controllers: frequency warping and resonance damping. We further provide a theoretical explanation for these distortion modes, and demonstrate that the performance of the method is superior to all typical methods. The proposed method is implemented to discretize a quasi-resonant (QR) controller on a control board, achieving 25\% reduction in the root-mean-square error (RMSE) compared to the SOTA method. Finally, the approach is extended to discretizing a resonant controller of a grid-tied inverter. The efficacy of the proposed method is conclusively validated through favorable comparisons among the theory, simulation, and experiments.

preprint2026arXiv

On the Blessing of Pre-training in Weak-to-Strong Generalization

The paradigm of Weak-to-Strong Generalization (W2SG) suggests that a pre-trained strong model can surpass its weak supervisor, yet the decisive role of pre-training remains theoretically and empirically under-explored. In this work, we identify pre-training as the essential prerequisite for the emergence of W2SG. Theoretically, we formalize the W2SG problem within a high-dimensional single-index model framework using spiked Gaussian data, modeling pre-training as a spectral initialization step. Building upon prior impossibility results regarding the failure of learning under random initialization, we prove that W2SG is achievable when pre-training provides a geometric warm start that places the model within an "effective region" characterized by a perturbed strong-convexity geometry. Within this region, we derive a rigorous generalization bound that naturally captures the optimization dynamics: an initial performance improvement followed by a saturation bottleneck dictated by the weak supervisor's bias. Empirically, we first validate all our assumptions and theoretical insights through controlled synthetic simulations. Finally, through a massive-scale evaluation of hundreds of intermediate pre-training checkpoints from large language models, we demonstrate that W2SG is not an innate capability but emerges via a phase transition tightly coupled with the progression of pre-training.

preprint2024arXiv

STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion

The recovery of 3D human mesh from monocular images has significantly been developed in recent years. However, existing models usually ignore spatial and temporal information, which might lead to mesh and image misalignment and temporal discontinuity. For this reason, we propose a novel Spatio-Temporal Alignment Fusion (STAF) model. As a video-based model, it leverages coherence clues from human motion by an attention-based Temporal Coherence Fusion Module (TCFM). As for spatial mesh-alignment evidence, we extract fine-grained local information through predicted mesh projection on the feature maps. Based on the spatial features, we further introduce a multi-stage adjacent Spatial Alignment Fusion Module (SAFM) to enhance the feature representation of the target frame. In addition to the above, we propose an Average Pooling Module (APM) to allow the model to focus on the entire input sequence rather than just the target frame. This method can remarkably improve the smoothness of recovery results from video. Extensive experiments on 3DPW, MPII3D, and H36M demonstrate the superiority of STAF. We achieve a state-of-the-art trade-off between precision and smoothness. Our code and more video results are on the project page https://yw0208.github.io/staf/

preprint2022arXiv

Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context

Registration of unmanned aerial vehicle laser scanning (ULS) and ground light detection and ranging (LiDAR) point clouds in forests is critical to create a detailed representation of a forest structure and an accurate inversion of forest parameters. However, forest occlusion poses challenges for marker-based registration methods, and some marker-free automated registration methods have low efficiency due to the process of object (e.g., tree, crown) segmentation. Therefore, we use a divide-and-conquer strategy and propose an automated and efficient method to register ULS and ground LiDAR point clouds in forests. Registration involves coarse alignment and fine registration, where the coarse alignment of point clouds is divided into vertical and horizontal alignment. The vertical alignment is achieved by ground alignment, which is achieved by the transformation relationship between normal vectors of the ground point cloud and the horizontal plane, and the horizontal alignment is achieved by canopy projection image matching. During image matching, vegetation points are first distinguished by the ground filtering algorithm, and then, vegetation points are projected onto the horizontal plane to obtain two binary images. To match the two images, a matching strategy is used based on canopy shape context features, which are described by a two-point congruent set and canopy overlap. Finally, we implement coarse alignment of ULS and ground LiDAR datasets by combining the results of ground alignment and image matching and finish fine registration. Also, the effectiveness, accuracy, and efficiency of the proposed method are demonstrated by field measurements of forest plots. Experimental results show that the ULS and ground LiDAR data in different plots are registered, of which the horizontal alignment errors are less than 0.02 m, and the average runtime of the proposed method is less than 1 second.

preprint2022arXiv

Information fusion approach for biomass estimation in a plateau mountainous forest using a synergistic system comprising UAS-based digital camera and LiDAR

Forest land plays a vital role in global climate, ecosystems, farming and human living environments. Therefore, forest biomass estimation methods are necessary to monitor changes in the forest structure and function, which are key data in natural resources research. Although accurate forest biomass measurements are important in forest inventory and assessments, high-density measurements that involve airborne light detection and ranging (LiDAR) at a low flight height in large mountainous areas are highly expensive. The objective of this study was to quantify the aboveground biomass (AGB) of a plateau mountainous forest reserve using a system that synergistically combines an unmanned aircraft system (UAS)-based digital aerial camera and LiDAR to leverage their complementary advantages. In this study, we utilized digital aerial photogrammetry (DAP), which has the unique advantages of speed, high spatial resolution, and low cost, to compensate for the deficiency of forestry inventory using UAS-based LiDAR that requires terrain-following flight for high-resolution data acquisition. Combined with the sparse LiDAR points acquired by using a high-altitude and high-speed UAS for terrain extraction, dense normalized DAP point clouds can be obtained to produce an accurate and high-resolution canopy height model (CHM). Based on the CHM and spectral attributes obtained from multispectral images, we estimated and mapped the AGB of the region of interest with considerable cost efficiency. Our study supports the development of predictive models for large-scale wall-to-wall AGB mapping by leveraging the complementarity between DAP and LiDAR measurements. This work also reveals the potential of utilizing a UAS-based digital camera and LiDAR synergistically in a plateau mountainous forest area.

preprint2022arXiv

Joint Learning of Frequency and Spatial Domains for Dense Predictions

Current artificial neural networks mainly conduct the learning process in the spatial domain but neglect the frequency domain learning. However, the learning course performed in the frequency domain can be more efficient than that in the spatial domain. In this paper, we fully explore frequency domain learning and propose a joint learning paradigm of frequency and spatial domains. This paradigm can take full advantage of the preponderances of frequency learning and spatial learning; specifically, frequency and spatial domain learning can effectively capture global and local information, respectively. Exhaustive experiments on two dense prediction tasks, i.e., self-supervised depth estimation and semantic segmentation, demonstrate that the proposed joint learning paradigm can 1) achieve performance competitive to those of state-of-the-art methods in both depth estimation and semantic segmentation tasks, even without pretraining; and 2) significantly reduce the number of parameters compared to other state-of-the-art methods, which provides more chance to develop real-world applications. We hope that the proposed method can encourage more research in cross-domain learning.

preprint2022arXiv

Towards Extremely Fast Bilevel Optimization with Self-governed Convergence Guarantees

Gradient methods have become mainstream techniques for Bi-Level Optimization (BLO) in learning and vision fields. The validity of existing works heavily relies on solving a series of approximation subproblems with extraordinarily high accuracy. Unfortunately, to achieve the approximation accuracy requires executing a large quantity of time-consuming iterations and computational burden is naturally caused. This paper is thus devoted to address this critical computational issue. In particular, we propose a single-level formulation to uniformly understand existing explicit and implicit Gradient-based BLOs (GBLOs). This together with our designed counter-example can clearly illustrate the fundamental numerical and theoretical issues of GBLOs and their naive accelerations. By introducing the dual multipliers as a new variable, we then establish Bilevel Alternating Gradient with Dual Correction (BAGDC), a general framework, which significantly accelerates different categories of existing methods by taking specific settings. A striking feature of our convergence result is that, compared to those original unaccelerated GBLO versions, the fast BAGDC admits a unified non-asymptotic convergence theory towards stationarity. A variety of numerical experiments have also been conducted to demonstrate the superiority of the proposed algorithmic framework.

preprint2021arXiv

Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features

Registration is a fundamental but critical task in point cloud processing, which usually depends on finding element correspondence from two point clouds. However, the finding of reliable correspondence relies on establishing a robust and discriminative description of elements and the correct matching of corresponding elements. In this letter, we develop a coarse-to-fine registration strategy, which utilizes rotation-invariant features and a new weighted graph matching method for iteratively finding correspondence. In the graph matching method, the similarity of nodes and edges in Euclidean and feature space are formulated to construct the optimization function. The proposed strategy is evaluated using two benchmark datasets and compared with several state-of-the-art methods. Regarding the experimental results, our proposed method can achieve a fine registration with rotation errors of less than 0.2 degrees and translation errors of less than 0.1m.

preprint2020arXiv

Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective

Deep learning (DL) has brought about remarkable breakthrough in processing images, video and speech due to its efficacy in extracting highly abstract representation and learning very complex functions. However, there is seldom operating procedure reported on how to make it for real use cases. In this paper, we intend to address this problem by presenting a generalized operating procedure for DL from the perspective of unconstrained optimal design, which is motivated by a simple intension to remove the barrier of using DL, especially for those scientists or engineers who are new but eager to use it. Our proposed procedure contains seven steps, which are project/problem statement, data collection, architecture design, initialization of parameters, defining loss function, computing optimal parameters, and inference, respectively. Following this procedure, we build a multi-stream end-to-end speaker verification system, in which the input speech utterance is processed by multiple parallel streams within different frequency range, so that the acoustic modeling can be more robust resulting from the diversity of features. Trained with VoxCeleb dataset, our experimental results verify the effectiveness of our proposed operating procedure, and also show that our multi-stream framework outperforms single-stream baseline with 20 % relative reduction in minimum decision cost function (minDCF).

preprint2015arXiv

Monolayer charge-neutral graphene on platinum with extremely weak electron-phonon coupling

Epitaxial growth of graphene on transition metal substrates is an important route for obtaining large scale graphene. However, the interaction between graphene and the substrate often leads to multiple orientations, distorted graphene band structure, large doping and strong electron-phonon coupling. Here we report the growth of monolayer graphene with high crystalline quality on Pt(111) substrate by using a very low concentration of an internal carbon source with high annealing temperature. The controlled growth leads to electronically decoupled graphene: it is nearly charge neutral and has extremely weak electron-phonon coupling (coupling strength $λ$ $\approx$ 0.056) as revealed by angle-resolved photoemission spectroscopic measurements. The thermodynamics and kinetics of the carbon diffusion process is investigated by DFT calculation. Such graphene with negligible graphene-substrate interaction provides an important platform for fundamental research as well as device applications when combined with a nondestructive sample transfer technique.

preprint2015arXiv

Uniqueness and nondegeneracy of sign-changing radial solutions to an almost critical elliptic problem

We study sign-changing radial solutions for the following semi-linear elliptic equation \begin{align*} Δu-u+|u|^{p-1}u=0\quad{\rm{in}}\ \mathbb{R}^N,\quad u\in H^1(\mathbb{R}^N), \end{align*} where $1<p<\frac{N+2}{N-2}$, $N\geq3$. It is well-known that this equation has a unique positive radial solution and sign-changing radial solutions with exactly $k$ nodes. In this paper, we show that such sign-changing radial solution is also unique when $p$ is close to $\frac{N+2}{N-2}$. Moreover, those solutions are non-degenerate, i.e., the kernel of the linearized operator is exactly $N$-dimensional.

preprint2014arXiv

On the Ambrosetti-Malchiodi-Ni Conjecture for general submanifolds

We study positive solutions of the following semilinear equation $$\varepsilon^2Δ_{\bar g} u - V(z) u+ u^{p} =0\,\hbox{ on }\,M, $$ where $(M, \bar g )$ is a compact smooth $n$-dimensional Riemannian manifold without boundary or the Euclidean space $\mathbb R^n$, $\varepsilon$ is a small positive parameter, $p>1$ and $V$ is a uniformly positive smooth potential. Given $k=1,\dots,n-1$, and $1 < p < \frac{n+2-k}{n-2-k}$. Assuming that $K$ is a $k$-dimensional smooth, embedded compact submanifold of $M$, which is stationary and non-degenerate with respect to the functional $\int_K V^{\frac{p+1}{p-1}-\frac{n-k}{2}}dvol$, we prove the existence of a sequence $\varepsilon=\varepsilon_j\to 0$ and positive solutions $u_\varepsilon$ that concentrate along $K$. This result proves in particular the validity of a conjecture by Ambrosetti-Malchiodi-Ni, extending a recent result by Wang-Wei-Yang, where the one co-dimensional case has been considered. Furthermore, our approach explores a connection between solutions of the nonlinear Schrödinger equation and $f$-minimal submanifolds in manifolds with density.

preprint2013arXiv

Infinitely many positive solutions of nonlinear Schrödinger equations with non-symmetric potentials

We consider the standing-wave problem for a nonlinear Schrödinger equation, corresponding to the semilinear elliptic problem \begin{equation*} -Δu+V(x)u=|u|^{p-1}u,\ u\in H^1(\mathbb{R}^2), \end{equation*} where $V(x)$ is a uniformly positive potential and $p>1$. Assuming that \begin{equation*} V(x)=V_\infty+\frac{a}{|x|^m}+O\Big(\frac{1}{|x|^{m+σ}}\Big),\ \text{as}\ |x|\rightarrow+\infty, %\tag{$V2$} \end{equation*} for instance if $p>2$, $m>2$ and $σ>1$ we prove the existence of infinitely many positive solutions. If $V(x)$ is radially symmetric, this result was proved in \cite{WY-10}. The proof without symmetries is much more difficult, and for that we develop a new {\em intermediate Lyapunov-Schmidt reduction method}, which is a compromise between the finite and infinite dimensional versions of it.

preprint2010arXiv

Orbital stability of bound states of nonlinear Schrodinger equations with linear and nonlinear optical lattices

We study the orbital stability and instability of single-spike bound states of semiclassical nonlinear Schrodinger (NLS) equations with critical exponent, linear and nonlinear optical lattices (OLs). These equations may model two-dimensional Bose-Einstein condensates in linear and nonlinear OLs. When linear OLs are switched off, we derive the asymptotic expansion formulas and obtain necessary conditions for the orbital stability and instability of single-spike bound states, respectively. When linear OLs are turned on, we consider three different conditions of linear and nonlinear OLs to develop mathematical theorems which are most general on the orbital stability problem.

Wei Yao

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

A Novel $αβ$-Approximation Method Based on Numerical Integration for Discretizing Continuous Systems

On the Blessing of Pre-training in Weak-to-Strong Generalization

STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion

Efficient divide-and-conquer registration of UAV and ground LiDAR point clouds through canopy shape context

Information fusion approach for biomass estimation in a plateau mountainous forest using a synergistic system comprising UAS-based digital camera and LiDAR

Joint Learning of Frequency and Spatial Domains for Dense Predictions

Towards Extremely Fast Bilevel Optimization with Self-governed Convergence Guarantees

Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features

Generalized Operating Procedure for Deep Learning: an Unconstrained Optimal Design Perspective

Monolayer charge-neutral graphene on platinum with extremely weak electron-phonon coupling

Uniqueness and nondegeneracy of sign-changing radial solutions to an almost critical elliptic problem

On the Ambrosetti-Malchiodi-Ni Conjecture for general submanifolds

Infinitely many positive solutions of nonlinear Schrödinger equations with non-symmetric potentials

Orbital stability of bound states of nonlinear Schrodinger equations with linear and nonlinear optical lattices