Source author record

Huanyu Wang

Huanyu Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision astro-ph.IM physics.ins-det physics.geo-ph physics.space-ph Artificial Intelligence astro-ph.HE eess.IV hep-ex Machine Learning physics.plasm-ph

Catalog footprint

What is connected

14works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Efficient Long-Context Modeling in Diffusion Language Models via Block Approximate Sparse Attention

Diffusion Language Models (DLMs) enable globally coherent, bidirectional, and controllable text generation, offering advantages over traditional autoregressive LLMs, while scaling to ultra-long sequences remains costly. Many existing block-sparse attention methods select blocks by fixed sampling patterns over the high-resolution attention space, such as tail regions or anti-diagonal stripes. Such prior-driven sampling can miss salient tokens and introduce instability under distribution shifts. In this paper, we propose the Block Approximate Sparse Attention framework (BA-Att) with block-wise pre-downsampled operation, which identifies informative regions within a compact downsampled space, avoiding reliance on brittle positional priors. To analyze its theoretical behavior, we define an oracle post-downsample attention map and formalize the approximation error between pre- and post-downsample schemes. Based on this insight, we introduce a lightweight norm-sorting module and a covariance-compensated correction that approximates full covariance using diagonal QK variances, reducing computational complexity. Extensive experiments show that our operator achieves up to 6.95x acceleration over FlashAttention in attention computation, and maintains near full-attention performance at 50% sparsity across language models, multimodal language models, and video generation models, demonstrating strong efficiency and generalization.

preprint2022arXiv

Compressing Models with Few Samples: Mimicking then Replacing

Few-sample compression aims to compress a big redundant model into a small compact one with only few samples. If we fine-tune models with these limited few samples directly, models will be vulnerable to overfit and learn almost nothing. Hence, previous methods optimize the compressed model layer-by-layer and try to make every layer have the same outputs as the corresponding layer in the teacher model, which is cumbersome. In this paper, we propose a new framework named Mimicking then Replacing (MiR) for few-sample compression, which firstly urges the pruned model to output the same features as the teacher's in the penultimate layer, and then replaces teacher's layers before penultimate with a well-tuned compact one. Unlike previous layer-wise reconstruction methods, our MiR optimizes the entire network holistically, which is not only simple and effective, but also unsupervised and general. MiR outperforms previous methods with large margins. Codes will be available soon.

preprint2022arXiv

D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation

As an important and challenging problem, few-shot image generation aims at generating realistic images through training a GAN model given few samples. A typical solution for few-shot generation is to transfer a well-trained GAN model from a data-rich source domain to the data-deficient target domain. In this paper, we propose a novel self-supervised transfer scheme termed D3T-GAN, addressing the cross-domain GANs transfer in few-shot image generation. Specifically, we design two individual strategies to transfer knowledge between generators and discriminators, respectively. To transfer knowledge between generators, we conduct a data-dependent transformation, which projects and reconstructs the target samples into the source generator space. Then, we perform knowledge transfer from transformed samples to generated samples. To transfer knowledge between discriminators, we design a multi-level discriminant knowledge distillation from the source discriminator to the target discriminator on both the real and fake samples. Extensive experiments show that our method improve the quality of generated images and achieves the state-of-the-art FID scores on commonly used datasets.

preprint2022arXiv

F3A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks

Formulated as a conditional generation problem, face animation aims at synthesizing continuous face images from a single source image driven by a set of conditional face motion. Previous works mainly model the face motion as conditions with 1D or 2D representation (e.g., action units, emotion codes, landmark), which often leads to low-quality results in some complicated scenarios such as continuous generation and largepose transformation. To tackle this problem, the conditions are supposed to meet two requirements, i.e., motion information preserving and geometric continuity. To this end, we propose a novel representation based on a 3D geometric flow, termed facial flow, to represent the natural motion of the human face at any pose. Compared with other previous conditions, the proposed facial flow well controls the continuous changes to the face. After that, in order to utilize the facial flow for face editing, we build a synthesis framework generating continuous images with conditional facial flows. To fully take advantage of the motion information of facial flows, a hierarchical conditional framework is designed to combine the extracted multi-scale appearance features from images and motion features from flows in a hierarchical manner. The framework then decodes multiple fused features back to images progressively. Experimental results demonstrate the effectiveness of our method compared to other state-of-the-art methods.

preprint2021arXiv

Mixup Without Hesitation

Mixup linearly interpolates pairs of examples to form new samples, which is easy to implement and has been shown to be effective in image classification tasks. However, there are two drawbacks in mixup: one is that more training epochs are needed to obtain a well-trained model; the other is that mixup requires tuning a hyper-parameter to gain appropriate capacity but that is a difficult task. In this paper, we find that mixup constantly explores the representation space, and inspired by the exploration-exploitation dilemma in reinforcement learning, we propose mixup Without hesitation (mWh), a concise, effective, and easy-to-use training algorithm. We show that mWh strikes a good balance between exploration and exploitation by gradually replacing mixup with basic data augmentation. It can achieve a strong baseline with less training time than original mixup and without searching for optimal hyper-parameter, i.e., mWh acts as mixup without hesitation. mWh can also transfer to CutMix, and gain consistent improvement on other machine learning and computer vision tasks such as object detection. Our code is open-source and available at https://github.com/yuhao318/mwh

preprint2020arXiv

Ultra Fast Structure-aware Deep Lane Detection

Modern methods mainly regard lane detection as a problem of pixel-wise segmentation, which is struggling to address the problem of challenging scenarios and speed. Inspired by human perception, the recognition of lanes under severe occlusion and extreme lighting conditions is mainly based on contextual and global information. Motivated by this observation, we propose a novel, simple, yet effective formulation aiming at extremely fast speed and challenging scenarios. Specifically, we treat the process of lane detection as a row-based selecting problem using global features. With the help of row-based selecting, our formulation could significantly reduce the computational cost. Using a large receptive field on global features, we could also handle the challenging scenarios. Moreover, based on the formulation, we also propose a structural loss to explicitly model the structure of lanes. Extensive experiments on two lane detection benchmark datasets show that our method could achieve the state-of-the-art performance in terms of both speed and accuracy. A light-weight version could even achieve 300+ frames per second with the same resolution, which is at least 4x faster than previous state-of-the-art methods. Our code will be made publicly available.

preprint2019arXiv

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

As China's first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed as Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progresses, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results.

preprint2019arXiv

The Medium Energy (ME) X-ray telescope onboard the Insight-HXMT astronomy satellite

The Medium Energy X-ray telescope (ME) is one of the three main telescopes on board the Insight Hard X-ray Modulation Telescope (Insight-HXMT) astronomy satellite. ME contains 1728 pixels of Si-PIN detectors sensitive in 5-30 keV with a total geometrical area of 952 cm2. Application Specific Integrated Circuit (ASIC) chips, VA32TA6, is used to achieve low power consumption and low readout noise. The collimators define three kinds of field of views (FOVs) for the telescope, 1°{\times}4°, 4°{\times}4°, and blocked ones. Combination of such FOVs can be used to estimate the in-orbit X-ray and particle background components. The energy resolution of ME is ~3 keV at 17.8 keV (FWHM) and the time resolution is 255 μs. In this paper, we introduce the design and performance of ME.

preprint2016arXiv

The Mechanisms of Electron Acceleration During Multiple X Line Magnetic Reconnection with a Guide Field

The interactions between magnetic islands are considered to play an important role in electron acceleration during magnetic reconnection. In this paper, two-dimensional (2-D) particle-in-cell (PIC) simulations are performed to study electron acceleration during multiple X line reconnection with a guide field. The electrons remain almost magnetized, and we can then analyze the contributions of the parallel electric field, Fermi and betatron mechanisms to electron acceleration during the evolution of magnetic reconnection by comparing with a guide-center theory. The results show that with the proceeding of magnetic reconnection, two magnetic islands are formed in the simulation domain. The electrons are accelerated by both the parallel electric field in the vicinity of the X lines and Fermi mechanism due to the contraction of the two magnetic islands. Then the two magnetic islands begin to merge into one, and in such a process electrons can be accelerated by the parallel electric field and betatron mechanisms. During the betatron acceleration, the electrons are locally accelerated in the regions where the magnetic field is piled up by the high-speed flow from the X line. At last, when the coalescence of the two islands into a big one finishes, electrons can further be accelerated by the Fermi mechanism because of the contraction of the big island. With the increase of the guide field, the contributions of Fermi and betatron mechanisms to electron acceleration become less and less important. When the guide field is sufficiently large, the contributions of Fermi and betatron mechanisms are almost negligible.

preprint2015arXiv

Ground-based verification and data processing of Yutu rover Active Particle-induced X-ray Spectrometer

The Active Particle-induced X-ray Spectrometer (APXS) is one of the payloads on board the Yutu rover of Chang'E-3 mission. In order to assess the instrumental performance of APXS, a ground verification test was done for two unknown samples (basaltic rock, mixed powder sample). In this paper, the details of the experiment configurations and data analysis method are presented. The results show that the elemental abundance of major elements can be well determined by the APXS with relative deviations < 15 wt. % (detection distance = 30 mm, acquisition time = 30 min). The derived detection limit of each major element is inversely proportional to acquisition time and directly proportional to detection distance, suggesting that the appropriate distance should be < 50mm.

preprint2015arXiv

On-orbit calibration of soft X-ray detector on Chang'E-2 satellite

X-ray spectrometer is one of the satellite payloads on Chang'E-2 satellite. The soft X-ray detector is one of the device on X-ray spectrometer which is designed to detect the major rock-forming elements within 0.5-10keV range on lunar surface. In this paper, energy linearity and energy resolution calibration is done using a weak Fe55 source, while temperature and time effect is considered not take big error. The total uncertainty is estimated to be within 5% after correction.

preprint2014arXiv

Physical Design and Monte Carlo Simulations of a Space Radiation Detector onboard the SJ-10 satellite

A radiation gene box (RGB) onboard the SJ-10 satellite is a device carrying mice and drosophila cells to determine the biological effects of space radiation environment. The shielded fluxes of different radioactive sources were calculated and the linear energy transfers of gamma-rays, electrons, protons and alpha-particles in tissue were acquired using A-150 tissue-equivalent plastic. Then, a conceptual model of a space radiation instrument employing three semiconductor sub-detectors for deriving the charged and uncharged radiation environment of the RGB was designed. The energy depositions in the three sub-detectors were classified into fifteen channels (bins) in an algorithm derived from the Monte Carlo method. The physical feasibility of the conceptual instrument was also verified by Monte Carlo simulations.

preprint2012arXiv

Common dependence on earthquake magnitudes for the trapped particles bursts approaching the earthquake

Trapped particles bursts have long been observed to be frequently occurred several hours before earthquakes, especially for strong earthquakes, from several space experiments during past decades. However, the validity of earthquake origin of particles bursts events is still unsolved. In this paper, we firstly reported the frequency distribution and time evolution of particles bursts within different time windows centered around earthquakes for various magnitudes. The results showed nearly the same systematic dependence of particle bursts frequency on earthquake magnitude and characteristic time decay behavior of average number of particles bursts for various magnitudes. These findings should strengthen the validity of earthquake origin of particles bursts and further understanding of particles bursts as possible precursor of earthquake.

preprint2011arXiv

Study of the NWC electrons belt observed on DEMETER Satellite

We analyzed observation data collected by the Instrument for the Detection of Particles (IDP) on board of DEMETER satellite during the period of total seventeen months in 2007 and 2008. In the meantime, the VLF transmitter located at NWC ground station was shutdown for seven months and working for total ten months. Our analysis, for the first time, revealed in details the transient properties of the space electrons induced by the man-made VLF wave emitted by the transmitter at NWC. First, we mapped the electron flux distribution and figured out the special range what the NWC belt covered. Then we investigated the NWC electron spectrograms in a wide range of McIlwain parameter (up to L=3.0). Finally, we obtained the averaged energy spectrum of the NWC electrons within the drift loss-cone, and compared the difference during the observations between daytime and nighttime. Our results proved the fact that the VLF emissions from NWC transmitter created momentary electron enhancement with fluxes up to 3 orders of magnitude. These electrons are distributed in the region of 180 degree in longitude and 1.6 ~ 1.9 of L shell. In addition, the VLF emission induced either enhancement or loss of electrons in higher magnetic shells up to L=3, and the maximum loss was up to 60% of the original value. The energy spectra of these electrons revealed that the enhancement during the NWC daytime are more attenuated than those in the NWC nighttime, and that the shape and the cutoff energy of the spectra are also quite different. We will present the results of our analysis, compare it with previous studies, and discuss the agreement of our results with the theory of wave-particle interaction.

Huanyu Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Efficient Long-Context Modeling in Diffusion Language Models via Block Approximate Sparse Attention

Compressing Models with Few Samples: Mimicking then Replacing

D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation

F3A-GAN: Facial Flow for Face Animation with Generative Adversarial Networks

Mixup Without Hesitation

Ultra Fast Structure-aware Deep Lane Detection

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

The Medium Energy (ME) X-ray telescope onboard the Insight-HXMT astronomy satellite

The Mechanisms of Electron Acceleration During Multiple X Line Magnetic Reconnection with a Guide Field

Ground-based verification and data processing of Yutu rover Active Particle-induced X-ray Spectrometer

On-orbit calibration of soft X-ray detector on Chang'E-2 satellite

Physical Design and Monte Carlo Simulations of a Space Radiation Detector onboard the SJ-10 satellite

Common dependence on earthquake magnitudes for the trapped particles bursts approaching the earthquake

Study of the NWC electrons belt observed on DEMETER Satellite