Source author record

Wentao Yu

Wentao Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning eess.SP physics.optics Artificial Intelligence astro-ph.SR Biological Physics Computation Computation and Language cond-mat.mtrl-sci cond-mat.soft cond-mat.str-el eess.AS Emerging Technologies Information Theory Methodology nlin.AO Quantitative Methods Sound

Catalog footprint

What is connected

12works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Accelerating Bayesian Phylogenetic Inference via Delayed Acceptance Sequential Monte Carlo with Random Forest Surrogates

In Bayesian phylogenetics, our goal is to estimate the posterior distribution over phylogenetic trees. Markov chain Monte Carlo methods are widely used to approximate the phylogenetic posterior distributions. For large-scale sequence data, repeated evaluation of the likelihood function incurs a high computational cost. In this article, we propose a machine-learning algorithm with over 35 topological and branch-length features to predict the changes in the likelihood function caused by tree moves (\eg,~eSPR, stNNI) used in standard MCMC approaches. This algorithm is then used to design a delayed acceptance MCMC kernel, which utilized the predicted surrogate function for preliminary rejection, to accelerate tree space searches. Furthermore, we integrate our proposed MCMC kernel into the sequential Monte Carlo sampler framework. We validate the proposed delayed-acceptance sequential Monte Carlo approach (DA-SMC) on simulation and real data sets. Our delayed acceptance kernel can maintain robust estimation while reduces the number of likelihood evaluations significantly, yielding substantial computational time savings. We develop a Python package that is available at https://github.com/wentYu/DAphyloSMC.

preprint2026arXiv

Analog RF Computing: A New Paradigm for Energy-Efficient Edge AI Over MU-MIMO Systems

Modern edge devices increasingly rely on neural networks for intelligent applications. However, conventional digital computing-based edge inference requires substantial memory and energy consumption. In analog radio frequency (RF) computing, a base station (BS) encodes the weights of the neural networks and broadcasts the RF waveforms to the clients. Each client reuses its passive mixer to multiply the received weight-encoded waveform with a locally generated input-encoded waveform. This enables wireless receivers to perform the matrix-vector multiplications (MVMs) that account for most of the computation burden in edge inference with ultra-low energy consumption. Unlike conventional downlink transmissions which are optimized for communications, analog RF computing requires a computing-centric physical layer that controls both the analog MVM accuracy and the energy consumption for inference. Motivated by this, in this paper, we propose a physical layer design framework for analog RF computing in MU-MIMO wireless systems. We derive tractable models for computing accuracy and energy consumption for inference, formulate a joint BS beamforming and client-side scaling problem subject to computing accuracy, transmit power, and hardware constraints, and develop a low-complexity algorithm to solve the non-convex problem. The proposed design provides client- and layer-specific accuracy control for both uniform- and mixed-precision inference. Simulations under 3GPP specifications show that analog RF computing can significantly reduce client-side energy consumption by nearly two orders of magnitude compared to digital computing, while mixed-precision inference requires even lower energy consumption than uniform-precision inference. Overall, these results establish analog RF computing over wireless networks as a promising paradigm for energy-efficient edge inference.

preprint2026arXiv

Beyond Rigid Alignment: Graph Federated Learning via Dual Manifold Calibration

Graph Federated Learning (GFL) enables collaborative representation learning across distributed subgraphs while preserving privacy. However, heterogeneity remains a critical challenge, as subgraphs across clients typically differ significantly in both semantics and structures. Existing methods address heterogeneity by enforcing the rigid alignment of model parameters or prototypes between clients and the server. However, these alignments implicitly rely on a restrictive global linearity assumption that summarizes local data distributions using a single and globally consistent representation space. This severely compresses the personalized representation space of clients and fails to preserve diverse local graph distributions. To overcome these limitations, we propose Federated Graph Manifold Calibration (FedGMC), a novel paradigm that tackles semantic heterogeneity and structural heterogeneity from a unified manifold perspective. Instead of enforcing rigid alignment, FedGMC introduces a dual manifold calibration mechanism that preserves global commonalities while maximizing the personalized representation space of local clients. Specifically, for semantic heterogeneity, the server constructs a geometrically optimal semantic manifold via equidistant semantic anchors, so as to guide the calibration of local semantic manifolds. For structural heterogeneity, the server constructs a global structural manifold by building global structural templates, so as to guide the calibration of local structural manifolds. Finally, the server dynamically refines both global semantic manifolds and structural manifolds by aggregating local manifolds. Extensive experiments on eleven homophilic and heterophilic graphs demonstrate that FedGMC effectively balances global commonality and local personalization, thereby significantly outperforming state-of-the-art baseline methods.

preprint2026arXiv

Graph Federated Unlearning for Privacy Preservation

Graph federated learning (GFL) facilitates decentralized training on distributed graph data while keeping sensitive user information local, aligning with policies such as GDPR and CCPA that grant users the right to freely join or withdraw from learning systems. However, even decentralized, user information can persist after quitting, potentially propagating to central servers and then redistributing to malicious clients. This privacy leakage during user withdrawal, despite its importance, has received seldom attention in GFL. To fill the gap, we explore the potential of machine unlearning (MU) to thoroughly remove user information. However, classical MU methods are known to degrade overall performance, a problem that is exacerbated in GFL due to local message passing and global model collaboration. To this end, we make two adjustments to mitigate this challenge for GFL. First, we ensure unlearning updates that minimally affect overall performance, steering them in directions orthogonal to the gradients from learning other data. Second, we introduce virtual clients, maintained by the central server, to preserve graph topology and global embeddings without recovering information of removed entities. We conduct comprehensive experiments under a representative user-withdrawal scenario and propose a novel membership inference framework to rigorously evaluate and validate the reliability of our privacy preservation. The experimental results demonstrate the effectiveness of our approach, which also surpasses the performance of seven state-of-the-art baseline methods.

preprint2026arXiv

Sensing for Free: Learn to Localize More Sources than Antennas without Pilots

Integrated sensing and communication (ISAC) represents a key paradigm for future wireless networks. However, existing approaches require waveform modifications, dedicated pilots, or overhead that complicates standards integration. We propose sensing for free - performing multi-source localization without pilots by reusing uplink data symbols, making sensing occur during transmission and directly compatible with 3GPP 5G NR and 6G specifications. With ever-increasing devices in dense 6G networks, this approach is particularly compelling when combined with sparse arrays, which can localize more sources than uniform arrays via an enlarged virtual array. Existing pilot-free multi-source localization algorithms first reconstruct an extended covariance matrix and apply subspace methods, incurring cubic complexity and limited to second-order statistics. Performance degrades under non-Gaussian data symbols and few snapshots, and higher-order statistics remain unexploited. We address these challenges with an attention-only transformer that directly processes raw signal snapshots for grid-less end-to-end direction-of-arrival (DOA) estimation. The model efficiently captures higher-order statistics while being permutation-invariant and adaptive to varying snapshot counts. Our algorithm greatly outperforms state-of-the-art AI-based benchmarks with over 30x reduction in parameters and runtime, and enjoys excellent generalization under practical mismatches. Applied to multi-user MIMO beam training, our algorithm can localize uplink DOAs of multiple users during data transmission. Through angular reciprocity, estimated uplink DOAs prune downlink beam sweeping candidates and improve throughput via sensing-assisted beam management. This work shows how reusing existing data transmission for sensing can enhance both multi-source localization and beam management in 3GPP efforts towards 6G.

preprint2022arXiv

RubCSG at SemEval-2022 Task 5: Ensemble learning for identifying misogynous MEMEs

This work presents an ensemble system based on various uni-modal and bi-modal model architectures developed for the SemEval 2022 Task 5: MAMI-Multimedia Automatic Misogyny Identification. The challenge organizers provide an English meme dataset to develop and train systems for identifying and classifying misogynous memes. More precisely, the competition is separated into two sub-tasks: sub-task A asks for a binary decision as to whether a meme expresses misogyny, while sub-task B is to classify misogynous memes into the potentially overlapping sub-categories of stereotype, shaming, objectification, and violence. For our submission, we implement a new model fusion network and employ an ensemble learning approach for better performance. With this structure, we achieve a 0.755 macroaverage F1-score (11th) in sub-task A and a 0.709 weighted-average F1-score (10th) in sub-task B.

preprint2020arXiv

Intermittent "Turbulence" in a Many-body System

In natural settings, intermittent dynamics are ubiquitous and often arise from a coupling between external driving and spatial heterogeneities. A well-known example is the generation of transient, turbulent puffs of fluid through a pipe with rough walls. Here we show how similar dynamics can emerge in a discrete, crystalline system of particles driven by noise. Polydispersity in particle masses leads to localized vibrational modes that effectuate a transition to a gas-like phase. A minimal model for the evolution of the system's mechanical energies exhibits quasi-cyclic oscillations, and a single, dimensionless number captures the essential features of the intermittent dynamics, analogous to the Reynolds number for pipe flow.

preprint2020arXiv

Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition

For many small- and medium-vocabulary tasks, audio-visual speech recognition can significantly improve the recognition rates compared to audio-only systems. However, there is still an ongoing debate regarding the best combination strategy for multi-modal information, which should allow for the translation of these gains to large-vocabulary recognition. While an integration at the level of state-posterior probabilities, using dynamic stream weighting, is almost universally helpful for small-vocabulary systems, in large-vocabulary speech recognition, the recognition accuracy remains difficult to improve. In the following, we specifically consider the large-vocabulary task of the LRS2 database, and we investigate a broad range of integration strategies, comparing early integration and end-to-end learning with many versions of hybrid recognition and dynamic stream weighting. One aspect, which is shown to provide much benefit here, is the use of dynamic stream reliability indicators, which allow for hybrid architectures to strongly profit from the inclusion of visual information whenever the audio channel is distorted even slightly.

preprint2016arXiv

High-resolution Collinear Chiral Sum Frequency Generation Microscopy by Using Vectorial Beam

In chiral sum frequency generation (C-SFG), the chiral nature of $χ^{(2)}$ requires the three involved electric fields to be pairwise non-parallel, leading to the traditional non-collinear configuration which is a hindrance for achieving diffraction limited resolution while utilizing it as a label-free imaging contrast mechanism . Here we propose a collinear C-SFG (CC-SFG) microscopy modality by using longitudinal z-polarized vectorial field. Label-free chiral imaging with enhanced spatial resolution (~1.4 times improvement in one lateral and the longitudinal directions over the traditional non-collinear scheme) is demonstrated, providing a new path for SFG microscopy with diffraction-limited resolution for mapping chirality.

preprint2015arXiv

Super-resolution deep imaging with hollow Bessel beam STED microscopy

Stimulated emission depletion (STED) microscopy has become a powerful imaging and localized excitation method beating the diffraction barrier for improved lateral spatial resolution in cellular imaging, lithography, etc. Due to specimen-induced aberrations and scattering distortion, it has been a great challenge for STED to maintain consistent lateral resolution deeply inside the specimens. Here we report on a deep imaging STED microscopy by using Gaussian beam for excitation and hollow Bessel beam for depletion (GB-STED). The proposed scheme shows the improved imaging depth up to ~155μm in solid agarose sample, ~115μm in PDMS and ~100μm in phantom of gray matter in brain tissue with consistent super resolution, while the standard STED microscopy shown a significantly reduced lateral resolution at the same imaging depth. The results indicate the excellent imaging penetration capability of GB-STED, making it a promising tool for deep 3D imaging optical nanoscopy and laser fabrication.

preprint2010arXiv

Novel Multifunctional Materials Based on Oxide Thin Films and Artificial Heteroepitaxial Multilayers

Transition metal oxides show fascinating physical properties such as high temperature superconductivity, ferro- and antiferromagnetism, ferroelectricity or even multiferroicity. The enormous progress in oxide thin film technology allows us to integrate these materials with semiconducting, normal conducting, dielectric or non-linear optical oxides in complex oxide heterostructures, providing the basis for novel multi-functional materials and various device applications. Here, we report on the combination of ferromagnetic, semiconducting, metallic, and dielectric materials properties in thin films and artificial heterostructures using laser molecular beam epitaxy. We discuss the fabrication and characterization of oxide-based ferromagnetic tunnel junctions, transition metal-doped semiconductors, intrinsic multiferroics, and artificial ferroelectric/ferromagetic heterostructures - the latter allow for the detailed study of strain effects, forming the basis of spin-mechanics. For characterization we use X-ray diffraction, SQUID magnetometry, magnetotransport measurements, and advanced methods of transmission electron microscopy with the goal to correlate macroscopic physical properties with the microstructure of the thin films and heterostructures.

preprint2009arXiv

The relation between 13CO(2-1) line width in molecular clouds and bolometric luminosity of associated IRAS sources

We search for evidence of a relation between properties of young stellar objects (YSOs) and their parent molecular clouds to understand the initial conditions of high-mass star formation. A sample of 135 sources was selected from the Infrared Astronomical Satellite (IRAS) Point Source Catalog, on the basis of their red color to enhance the possibility of discovering young sources. Using the Kolner Observatorium fur SubMillimeter Astronomie (KOSMA) 3-m telescope, a single-point survey in 13CO(2-1) was carried out for the entire sample, and 14 sources were mapped further. Archival mid-infrared (MIR) data were compared with the 13CO emissions to identify evolutionary stages of the sources. A 13CO observed sample was assembled to investigate the correlation between 13CO line width of the clouds and the luminosity of the associated YSOs. We identified 98 sources suitable for star formation analyses for which relevant parameters were calculated. We detected 18 cores from 14 mapped sources, which were identified with eight pre-UC HII regions and one UC HII region, two high-mass cores earlier than pre-UC HII phase, four possible star forming clusters, and three sourceless cores. By compiling a large (360 sources) 13CO observed sample, a good correlation was found between the 13CO line width of the clouds and the bolometric luminosity of the associated YSOs, which can be fitted as a power law: lg(dV13/km/s)=-0.023+0.135lg(Lbol/Lsolar). Results show that luminous (>10^3Lsolar) YSOs tend to be associated with both more massive and more turbulent (dV13>2km/s) molecular cloud structures.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2605.06260:author:1:wentao-yu

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.02297:author:2:wentao-yu

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.14331:author:1:wentao-yu

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.09506:author:1:wentao-yu

Imported May 20, 2026Synced May 20, 2026

2 works

Chen Gong

Researcher

Chen Gong contributes to research discovery and scholarly infrastructure.

Open to collaborate

2 works

Dorothea Kolossa

Researcher

Dorothea Kolossa contributes to research discovery and scholarly infrastructure.

Open to collaborate

2 works

Kebin Shi

Researcher

Kebin Shi contributes to research discovery and scholarly infrastructure.

Open to collaborate

2 works

Qihuang Gong

Researcher

Qihuang Gong contributes to research discovery and scholarly infrastructure.

Open to collaborate

Wentao Yu

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

Accelerating Bayesian Phylogenetic Inference via Delayed Acceptance Sequential Monte Carlo with Random Forest Surrogates

Analog RF Computing: A New Paradigm for Energy-Efficient Edge AI Over MU-MIMO Systems

Beyond Rigid Alignment: Graph Federated Learning via Dual Manifold Calibration

Graph Federated Unlearning for Privacy Preservation

Sensing for Free: Learn to Localize More Sources than Antennas without Pilots

RubCSG at SemEval-2022 Task 5: Ensemble learning for identifying misogynous MEMEs

Intermittent "Turbulence" in a Many-body System

Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition

High-resolution Collinear Chiral Sum Frequency Generation Microscopy by Using Vectorial Beam

Super-resolution deep imaging with hollow Bessel beam STED microscopy

Novel Multifunctional Materials Based on Oxide Thin Films and Artificial Heteroepitaxial Multilayers

The relation between 13CO(2-1) line width in molecular clouds and bolometric luminosity of associated IRAS sources