Source author record

Wang Lu

Wang Lu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning Artificial Intelligence astro-ph.HE astro-ph.SR physics.med-ph

Catalog footprint

What is connected

7works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

HAROOD: A Benchmark for Out-of-distribution Generalization in Sensor-based Human Activity Recognition

Sensor-based human activity recognition (HAR) mines activity patterns from the time-series sensory data. In realistic scenarios, variations across individuals, devices, environments, and time introduce significant distributional shifts for the same activities. Recent efforts attempt to solve this challenge by applying or adapting existing out-of-distribution (OOD) algorithms, but only in certain distribution shift scenarios (e.g., cross-device or cross-position), lacking comprehensive insights on the effectiveness of these algorithms. For instance, is OOD necessary to HAR? Which OOD algorithm performs the best? In this paper, we fill this gap by proposing HAROOD, a comprehensive benchmark for HAR in OOD settings. We define 4 OOD scenarios: cross-person, cross-position, cross-dataset, and cross-time, and build a testbed covering 6 datasets, 16 comparative methods (implemented with CNN-based and Transformer-based architectures), and two model selection protocols. Then, we conduct extensive experiments and present several findings for future research, e.g., no single method consistently outperforms others, highlighting substantial opportunity for advancement. Our codebase is highly modular and easy to extend for new datasets, algorithms, comparisons, and analysis, with the hope to facilitate the research in OOD-based HAR. Our implementation is released and can be found at https://github.com/AIFrontierLab/HAROOD.

preprint2025arXiv

Think Before You Move: Latent Motion Reasoning for Text-to-Motion Generation

Current state-of-the-art paradigms predominantly treat Text-to-Motion (T2M) generation as a direct translation problem, mapping symbolic language directly to continuous poses. While effective for simple actions, this System 1 approach faces a fundamental theoretical bottleneck we identify as the Semantic-Kinematic Impedance Mismatch: the inherent difficulty of grounding semantically dense, discrete linguistic intent into kinematically dense, high-frequency motion data in a single shot. In this paper, we argue that the solution lies in an architectural shift towards Latent System 2 Reasoning. Drawing inspiration from Hierarchical Motor Control in cognitive science, we propose Latent Motion Reasoning (LMR) that reformulates generation as a two-stage Think-then-Act decision process. Central to LMR is a novel Dual-Granularity Tokenizer that disentangles motion into two distinct manifolds: a compressed, semantically rich Reasoning Latent for planning global topology, and a high-frequency Execution Latent for preserving physical fidelity. By forcing the model to autoregressively reason (plan the coarse trajectory) before it moves (instantiates the frames), we effectively bridge the ineffability gap between language and physics. We demonstrate LMR's versatility by implementing it for two representative baselines: T2M-GPT (discrete) and MotionStreamer (continuous). Extensive experiments show that LMR yields non-trivial improvements in both semantic alignment and physical plausibility, validating that the optimal substrate for motion planning is not natural language, but a learned, motion-aligned concept space. Codes and demos can be found in \hyperlink{https://chenhaoqcdyq.github.io/LMR/}{https://chenhaoqcdyq.github.io/LMR/}

preprint2024arXiv

Towards Optimization and Model Selection for Domain Generalization: A Mixup-guided Solution

The distribution shifts between training and test data typically undermine the performance of models. In recent years, lots of work pays attention to domain generalization (DG) where distribution shifts exist, and target data are unseen. Despite the progress in algorithm design, two foundational factors have long been ignored: 1) the optimization for regularization-based objectives, and 2) the model selection for DG since no knowledge about the target domain can be utilized. In this paper, we propose Mixup guided optimization and selection techniques for DG. For optimization, we utilize an adapted Mixup to generate an out-of-distribution dataset that can guide the preference direction and optimize with Pareto optimization. For model selection, we generate a validation dataset with a closer distance to the target distribution, and thereby it can better represent the target data. We also present some theoretical insights behind our proposals. Comprehensive experiments demonstrate that our model optimization and selection techniques can largely improve the performance of existing domain generalization algorithms and even achieve new state-of-the-art results.

preprint2022arXiv

Generalizing to Unseen Domains: A Survey on Domain Generalization

Machine learning systems generally assume that the training and testing distributions are the same. To this end, a key requirement is to develop models that can generalize to unseen distributions. Domain generalization (DG), i.e., out-of-distribution generalization, has attracted increasing interests in recent years. Domain generalization deals with a challenging setting where one or several different but related domain(s) are given, and the goal is to learn a model that can generalize to an unseen test domain. Great progress has been made in the area of domain generalization for years. This paper presents the first review of recent advances in this area. First, we provide a formal definition of domain generalization and discuss several related fields. We then thoroughly review the theories related to domain generalization and carefully analyze the theory behind generalization. We categorize recent algorithms into three classes: data manipulation, representation learning, and learning strategy, and present several popular algorithms in detail for each category. Third, we introduce the commonly used datasets, applications, and our open-sourced codebase for fair evaluation. Finally, we summarize existing literature and present some potential research topics for the future.

preprint2022arXiv

Personalized Federated Learning with Adaptive Batchnorm for Healthcare

There is a growing interest in applying machine learning techniques to healthcare. Recently, federated learning (FL) is gaining popularity since it allows researchers to train powerful models without compromising data privacy and security. However, the performance of existing FL approaches often deteriorates when encountering non-iid situations where there exist distribution gaps among clients, and few previous efforts focus on personalization in healthcare. In this article, we propose FedAP to tackle domain shifts and then obtain personalized models for local clients. FedAP learns the similarity between clients based on the statistics of the batch normalization layers while preserving the specificity of each client with different local batch normalization. Comprehensive experiments on five healthcare benchmarks demonstrate that FedAP achieves better accuracy compared to state-of-the-art methods (e.g., 10% accuracy improvement for PAMAP2) with faster convergence speed.

preprint2020arXiv

Statistical properties of radio flux densities of solar flares

Short timescale flux variations are closely related to the energy release process of magnetic reconnection during solar flares. Radio light curves at 1, 2, 3.75, 9.4, and 17 GHz of 209 flares observed by the Nobeyama Radio Polarimeter from 2000 to 2010 are analyzed with a running smooth technique. We find that the impulsive component (with a variation timescale shorter than 1 second) of 1 GHz emission of most flares peaks at a few tens of solar flux unit and lasts for about 1 minute and the impulsive component of 2 GHz emission lasts a shorter period and peaks at a lower flux level, while at the three high frequency channels the occurrence frequency of flares increases with the decrease of the flux density up to the noise level of the corresponding background. The gradual components of these emissions, however, have similar duration and peak flux density distributions. We also derive the power spectrum on different timescales and a normalized wavelet analysis is used to confirm features on short timescales. At a time resolution of 0.1 second, more than $\sim$ 60$\%$ of these radio light curves show significant flux variation on 1 second or shorter time scales. This fraction increases with the decrease of frequency and reaches $\sim$ 100$\%$ at 1 GHz, implying that short timescale processes are universal in solar flares. We also study the correlation between the impulsive radio flux densities and soft X-ray fluxes obtained with the GOES satellites and find that more than 65$\%$ of the flares with an impulsive component have their impulsive radio emission reach a peak value ahead of the soft X-ray fluxes and this fraction increases with the radio frequency.

preprint2014arXiv

PET image reconstruction with system matrix containing point spread function derived from single photon incidence response

In positron emission tomography (PET) imaging, statistical iterative reconstruction (IR) techniques appear particularly promising since they can provide accurate system model. The system model matrix which describes the relationship between image space and projection space is important to the image quality. It contains some factors such as geometrical component and blurring component. The blurring component is usually described by point spread function (PSF). A PSF matrix derived from the single photon incidence response function is studied. And then an IR method based on the system matrix containing the PSF is developed. More specifically, the gamma photon incidence on a crystal array is simulated by Monte Carlo (MC) simulation, and then the single photon incidence response functions are calculated. Subsequently, the single photon incidence response functions is used to compute the coincidence blurring factor according to the physical process of PET coincidence detection. Through weighting the ordinary system matrix response by the coincidence blurring factors, the IR system matrix containing PSF is finally established. Using this system matrix, the image is reconstructed by ordered subset expectation maximization (OSEM) algorithm. The experimental results show that the proposed system matrix can obviously improve the image radial resolution, contrast and noise property. Furthermore, the simulated single gamma-ray incidence response function only depends on the crystal configuration, so the method could be extended to any PET scanners with the same detector crystal configuration.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint