Source author record

Wendong Zhang

Wendong Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning physics.ins-det

Catalog footprint

What is connected

3works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Improving Masked Autoencoders by Learning Where to Mask

Masked image modeling is a promising self-supervised learning method for visual data. It is typically built upon image patches with random masks, which largely ignores the variation of information density between them. The question is: Is there a better masking strategy than random sampling and how can we learn it? We empirically study this problem and initially find that introducing object-centric priors in mask sampling can significantly improve the learned representations. Inspired by this observation, we present AutoMAE, a fully differentiable framework that uses Gumbel-Softmax to interlink an adversarially-trained mask generator and a mask-guided image modeling process. In this way, our approach can adaptively find patches with higher information density for different images, and further strike a balance between the information gain obtained from image reconstruction and its practical training difficulty. In our experiments, AutoMAE is shown to provide effective pretraining models on standard self-supervised benchmarks and downstream tasks.

preprint2022arXiv

Continual Predictive Learning from Videos

Predictive learning ideally builds the world model of physical processes in one or more given environments. Typical setups assume that we can collect data from all environments at all times. In practice, however, different prediction tasks may arrive sequentially so that the environments may change persistently throughout the training procedure. Can we develop predictive learning algorithms that can deal with more realistic, non-stationary physical environments? In this paper, we study a new continual learning problem in the context of video prediction, and observe that most existing methods suffer from severe catastrophic forgetting in this setup. To tackle this problem, we propose the continual predictive learning (CPL) approach, which learns a mixture world model via predictive experience replay and performs test-time adaptation with non-parametric task inference. We construct two new benchmarks based on RoboNet and KTH, in which different tasks correspond to different physical robotic environments or human actions. Our approach is shown to effectively mitigate forgetting and remarkably outperform the naïve combinations of previous art in video prediction and continual learning.

preprint2016arXiv

Systematic theoretical analysis of dual-parameters RF readout by a novel LC-type passive sensor

This paper systematically studied the simultaneous measurement of two parameters by a LC-type passive sensor from the theoretical perspective. Based on the lumped circuit model of the typical LC-type passive dual-parameter sensor system, the influencing factors of the signal strength of the sensor as well as the influencing factors of signal crosstalk were both analyzed. It is found that the influencing factors of the RF readout signal strength of the sensor are mainly quality factors (Q factors) of the LC tanks, coupling coefficients, and the resonant frequency interval of the two LC tanks. And the influencing factors of the signal crosstalk are mainly coupling coefficient between the sensor inductance coils and the resonant frequency interval of the two LC tanks. The specific influence behavior of corresponding influencing factors on the signal strength and crosstalk is illustrated by a series of curves from numerical results simulated by using MATLAB software. Additionally, a decoupling scheme for solving the crosstalk problem algorithmically was proposed and a corresponding function was derived out. Overall, the theoretical analysis conducted in this work can provide design guidelines for making the dual-parameter LC-type passive sensor useful in practical applications.