Researcher profile

Biao Wang

Biao Wang contributes to research discovery and scholarly infrastructure.

Researcher, affiliation not imported, open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging), Verification L1, unclaimed author
23 works
0 followers
16 topics
4 close collaborators

Published work

23 published item(s)

preprint, 2025, arXiv

On averages of completely multiplicative functions over co-prime integer pairs

Recently, Donoso, Le, Moreira and Sun studied the asymptotic behavior of the averages of completely multiplicative functions over the Gaussian integers. They derived Wirsing's theorem for Gaussian integers, answered a question of Frantzikinakis and Host for sums of two squares, and obtained a variant of a theorem of Bergelson and Richter on ergodic averages along the number of prime factors of integers. In this paper, we show the analogues of these results for co-prime integer pairs. Moreover, building on Frantzikinakis and Host's results, we obtain some convergence results for the multilinear averages of multiplicative functions over primitive lattice points.

preprint, 2022, arXiv

Approximate synchronization of coupled multi-valued logical networks

This article deals with the approximate synchronization of two coupled multi-valued logical networks. According to the initial state set from which both systems start, two kinds of approximate synchronization problems, local approximate synchronization and global approximate synchronization, are proposed for the first time. Three new notions, the approximate synchronization state set (ASSS), the maximum approximate synchronization basin (MASB) and the shortest approximate synchronization time (SAST), are introduced and analyzed. Based on ASSS, several necessary and sufficient conditions are obtained for approximate synchronization. MASB, the set of all initial states from which the systems are approximately synchronous, is investigated in combination with the maximum invariant subset. A method for calculating the SAST, which is associated with the transient period, is also presented. By virtue of MASB, a pinning control scheme is investigated to make two coupled systems achieve global approximate synchronization. Furthermore, the related theories are also applied to the complete synchronization problem of $k$-valued ($k\geq2$) logical networks. Finally, four examples are given to illustrate the obtained results.

preprint, 2022, arXiv

Biaxial strain engineering on the superconducting properties of MgB2 thin film

The effect of biaxial strain on the superconducting properties of MgB2 thin films was studied by first-principles calculations. The stability analyses by phonon dispersions show that biaxial strain of up to 7% can be applied to MgB2 without inducing any imaginary frequency. The superconducting property calculations within the framework of Migdal-Eliashberg theory successfully reproduce the two-gap superconductivity of MgB2. The results show that tensile biaxial strain can increase the critical temperature of MgB2, while compressive biaxial strain decreases it. The detailed microscopic mechanism of the biaxial strain effect on the superconducting properties was studied by calculations of electronic structures and phonon dispersions. The increased Tc is a combined result of the increased electron density at the Fermi level and the in-plane boron phonon softening. By means of high-throughput screening of suitable substrates, it is found that most of the substrates would result in tensile strain in the MgB2 film, which is in agreement with many experimental works. The results in this work provide a detailed understanding of the biaxial strain engineering mechanism and demonstrate that biaxial strain engineering can be an effective way of tuning the superconducting properties of MgB2 and other similar materials.

preprint, 2022, arXiv

Bond-Selective Intensity Diffraction Tomography

Recovering molecular information remains a grand challenge in the widely used holographic and computational imaging technologies. To address this challenge, we developed a computational mid-infrared photothermal microscope, termed Bond-selective Intensity Diffraction Tomography (BS-IDT). Based on a low-cost brightfield microscope with an add-on pulsed light source, BS-IDT recovers both infrared spectra and bond-selective 3D refractive index maps from intensity-only measurements. High-fidelity infrared fingerprint spectra extraction is validated. Volumetric chemical imaging of biological cells is demonstrated at a speed of ~20 seconds per volume, with a lateral and axial resolution of ~350 nm and ~1.1 µm, respectively. BS-IDT's application potential is investigated by chemically quantifying lipids stored in cancer cells and volumetric chemical imaging on Caenorhabditis elegans with a large field of view (~100 µm × 100 µm).

preprint, 2022, arXiv

Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes

Real-time semantic segmentation, which aims to achieve high segmentation accuracy at real-time inference speed, has received substantial attention over the past few years. However, many state-of-the-art real-time semantic segmentation methods tend to sacrifice some spatial details or contextual information for fast inference, thus leading to degradation in segmentation quality. In this paper, we propose a novel Deep Multi-branch Aggregation Network (called DMA-Net) based on the encoder-decoder structure to perform real-time semantic segmentation in street scenes. Specifically, we first adopt ResNet-18 as the encoder to efficiently generate various levels of feature maps from different stages of convolutions. Then, we develop a Multi-branch Aggregation Network (MAN) as the decoder to effectively aggregate different levels of feature maps and capture the multi-scale information. In MAN, a lattice enhanced residual block is designed to enhance feature representations of the network by taking advantage of the lattice structure. Meanwhile, a feature transformation block is introduced to explicitly transform the feature map from the neighboring branch before feature aggregation. Moreover, a global context block is used to exploit the global contextual information. These key components are tightly combined and jointly optimized in a unified network. Extensive experimental results on the challenging Cityscapes and CamVid datasets demonstrate that our proposed DMA-Net respectively obtains 77.0% and 73.6% mean Intersection over Union (mIoU) at the inference speed of 46.7 FPS and 119.8 FPS by only using a single NVIDIA GTX 1080Ti GPU. This shows that DMA-Net provides a good tradeoff between segmentation quality and speed for semantic segmentation in street scenes.
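
The mIoU figures quoted in this abstract are class-averaged intersection-over-union scores. The following is a minimal sketch of how that metric is commonly computed from a confusion matrix, assuming integer label maps. The function name `mean_iou` and the toy arrays are illustrative only and are not taken from the DMA-Net code.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean Intersection over Union of two integer label arrays (hypothetical helper)."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predicted classes.
    conf = np.bincount(target * num_classes + pred,
                       minlength=num_classes ** 2).reshape(num_classes, num_classes)
    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection
    valid = union > 0  # ignore classes absent from both prediction and ground truth
    return float((intersection[valid] / union[valid]).mean())

# Toy 2x3 label maps with three classes (made-up data).
target = np.array([[0, 0, 1], [1, 2, 2]])
pred = np.array([[0, 1, 1], [1, 2, 0]])
score = mean_iou(pred, target, num_classes=3)
```

Benchmark protocols may differ in details such as ignored labels, so this sketch should not be read as the exact evaluation used for Cityscapes or CamVid.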

preprint, 2022, arXiv

Dense Learning based Semi-Supervised Object Detection

Semi-supervised object detection (SSOD) aims to facilitate the training and deployment of object detectors with the help of a large amount of unlabeled data. Though various self-training based and consistency-regularization based SSOD methods have been proposed, most of them are anchor-based detectors, ignoring the fact that in many real-world applications anchor-free detectors are in greater demand. In this paper, we intend to bridge this gap and propose a DenSe Learning (DSL) based anchor-free SSOD algorithm. Specifically, we achieve this goal by introducing several novel techniques, including an Adaptive Filtering strategy for assigning multi-level and accurate dense pixel-wise pseudo-labels, an Aggregated Teacher for producing stable and precise pseudo-labels, and an uncertainty-consistency-regularization term among scales and shuffled patches for improving the generalization capability of the detector. Extensive experiments are conducted on MS-COCO and PASCAL-VOC, and the results show that our proposed DSL method records new state-of-the-art SSOD performance, surpassing existing methods by a large margin. Code can be found at https://github.com/chenbinghui1/DSL.

preprint, 2022, arXiv

Dynamics on the number of prime divisors for additive arithmetic semigroups

In 2020, Bergelson and Richter gave a dynamical generalization of the classical Prime Number Theorem, which has been generalized by Loyd in a disjoint form with the Erdős-Kac Theorem. These generalizations reveal the rich ergodic properties of the number of prime divisors of integers. In this article, we show a new generalization of Bergelson and Richter's Theorem in a disjoint form with the distribution of the largest prime factors of integers. Then following Bergelson and Richter's techniques, we will show the analogues of all of these results for the arithmetic semigroups arising from finite fields as well.
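
For orientation, the quantity studied here is Ω(n), the number of prime divisors of n counted with multiplicity. A well-known equivalent form of the Prime Number Theorem states that the Liouville function λ(n) = (-1)^Ω(n) has mean value tending to zero. The sketch below computes Ω(n) with a simple sieve and evaluates that empirical mean. The function names are illustrative, and the code only demonstrates the classical statement, not the dynamical generalizations discussed in the abstract.

```python
def big_omega_sieve(limit):
    """Return a list omega with omega[n] = number of prime factors of n, with multiplicity."""
    omega = [0] * (limit + 1)
    for p in range(2, limit + 1):
        if omega[p] == 0:  # no smaller prime divides p, so p is prime
            power = p
            while power <= limit:
                # Every multiple of p**k contributes one more factor p.
                for multiple in range(power, limit + 1, power):
                    omega[multiple] += 1
                power *= p
    return omega

def liouville_mean(limit):
    """Average of lambda(n) = (-1)**Omega(n) over 1 <= n <= limit."""
    omega = big_omega_sieve(limit)
    return sum((-1) ** omega[n] for n in range(1, limit + 1)) / limit

omega = big_omega_sieve(100)
mean_small = liouville_mean(10)
```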

preprint, 2022, arXiv

High-throughput screening of piezo-photocatalytic materials for hydrogen production

Finding cost-effective and efficient photocatalytic materials able to catalyse the water splitting reaction under visible light is one of the greatest challenges in current environmental material science. Although many photocatalysts are already known in the context of green hydrogen production, strategies to systematically and rationally modify their optoelectronic properties to achieve desired photocatalytic performance are yet to be established. Piezoelectric materials react to mechanical stimuli by adjusting their band gaps and band alignments, thus offering a possible route to precise photocatalyst design. However, piezo-photocatalysts are relatively scarce and have seldom been investigated to date. Here, we present a high-throughput screening of piezo-photocatalytic materials performed over $\sim 1,000$ bulk piezoelectrics that relies on a simple electrostatic model and first-principles calculations. A total of $\sim 10$ previously overlooked binary and tertiary bulk compounds are theoretically identified as highly promising piezo-photocatalysts due to their appropriate optoelectronic properties and superb band alignment tunability driven by uniaxial strain.

preprint, 2022, arXiv

Learning Pixel-Level Distinctions for Video Highlight Detection

The goal of video highlight detection is to select the most attractive segments from a long video to depict the most interesting parts of the video. Existing methods typically focus on modeling the relationships between different video segments in order to learn a model that can assign highlight scores to these segments; however, these approaches do not explicitly consider the contextual dependency within individual segments. To this end, we propose to learn pixel-level distinctions to improve video highlight detection. This pixel-level distinction indicates whether or not each pixel in one video belongs to an interesting section. The advantages of modeling such fine-level distinctions are two-fold. First, it allows us to exploit the temporal and spatial relations of the content in one video, since the distinction of a pixel in one frame is highly dependent on both the content before this frame and the content around this pixel in this frame. Second, learning the pixel-level distinction also gives a good explanation of the video highlight task regarding what content in a highlight segment will be attractive to people. We design an encoder-decoder network to estimate the pixel-level distinction, in which we leverage 3D convolutional neural networks to exploit the temporal context information, and further take advantage of visual saliency to model the spatial distinction. State-of-the-art performance on three public benchmarks clearly validates the effectiveness of our framework for video highlight detection.

preprint, 2022, arXiv

Nonlinear beam self-maintaining effect in graded-index multimode fiber

Multimode fiber systems are desirable for industrial and scientific applications. As an interesting effect in laser beam propagation in a multimode fiber, nonlinear Kerr beam cleanup has attracted considerable research interest due to its spatial beam compression. However, its physical mechanisms, especially the influence of input conditions on its performance, remain unclear. Here, we report a new self-organized regime for multimode beam propagation in a graded-index multimode fiber: when the input laser has a dominant mode in which most of the laser energy is concentrated, the beam profile is maintained in a well-defined structure similar to the input dominant mode in the nonlinear regime, while it evolves to an irregular pattern in the linear regime. The existence and universality of this nonlinear beam self-maintaining effect have been verified by experimental and numerical data. Our results also provide evidence that nonlinear Kerr effects can be the driving mechanism and that nonlinear Kerr beam cleanup is a specific case of this effect. Further research into this spatial beam shaping effect may provide a new perspective for understanding other multimode fiber nonlinearities.

preprint, 2022, arXiv

SP-ViT: Learning 2D Spatial Priors for Vision Transformers

Recently, transformers have shown great potential in image classification and established state-of-the-art results on the ImageNet benchmark. However, compared to CNNs, transformers converge slowly and are prone to overfitting in low-data regimes due to the lack of spatial inductive biases. Such spatial inductive biases can be especially beneficial since the 2D structure of an input image is not well preserved in transformers. In this work, we present Spatial Prior-enhanced Self-Attention (SP-SA), a novel variant of vanilla Self-Attention (SA) tailored for vision transformers. Spatial Priors (SPs) are our proposed family of inductive biases that highlight certain groups of spatial relations. Unlike convolutional inductive biases, which are forced to focus exclusively on hard-coded local regions, our proposed SPs are learned by the model itself and take a variety of spatial relations into account. Specifically, the attention score is calculated with emphasis on certain kinds of spatial relations at each head, and such learned spatial foci can be complementary to each other. Based on SP-SA we propose the SP-ViT family, which consistently outperforms other ViT models with similar GFlops or parameters. Our largest model, SP-ViT-L, achieves a record-breaking 86.3% Top-1 accuracy with a reduction in the number of parameters by almost 50% compared to the previous state-of-the-art model (150M for SP-ViT-L vs 271M for CaiT-M-36) among all ImageNet-1K models trained on 224x224 and fine-tuned on 384x384 resolution without extra data.

preprint, 2022, arXiv

Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition

Transformer-based methods have recently achieved great advancement on 2D image-based vision tasks. For 3D video-based tasks such as action recognition, however, directly applying spatiotemporal transformers to video data brings heavy computation and memory burdens due to the greatly increased number of patches and the quadratic complexity of self-attention computation. How to efficiently and effectively model the 3D self-attention of video data has been a great challenge for transformers. In this paper, we propose a Temporal Patch Shift (TPS) method for efficient 3D self-attention modeling in transformers for video-based action recognition. TPS shifts a subset of patches with a specific mosaic pattern in the temporal dimension, thus converting a vanilla spatial self-attention operation to a spatiotemporal one with little additional cost. As a result, we can compute 3D self-attention using nearly the same computation and memory cost as 2D self-attention. TPS is a plug-and-play module and can be inserted into existing 2D transformer models to enhance spatiotemporal feature learning. The proposed method achieves competitive performance with state-of-the-art methods on Something-something V1 & V2, Diving-48, and Kinetics400 while being much more efficient in computation and memory cost. The source code of TPS can be found at https://github.com/MartinXM/TPS.
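
The idea of shifting patches along the temporal dimension can be sketched in a few lines. The example below assumes patch tokens stored as an array of shape (T, H, W, C) and a small grid of temporal offsets tiled over the spatial patch grid. The function name, the circular shift, and the 2x2 offset pattern are assumptions made for illustration and are not taken from the authors' repository.

```python
import numpy as np

def temporal_patch_shift(tokens, pattern):
    """Shift patches along time according to a tiled offset pattern (illustrative sketch).

    tokens:  array of shape (T, H, W, C), one feature vector per patch.
    pattern: 2D integer array of temporal offsets, tiled over the H x W grid.
             An offset of 0 leaves the patch in its own frame.
    """
    T, H, W, _ = tokens.shape
    ph, pw = pattern.shape
    reps = (-(-H // ph), -(-W // pw))  # ceiling division so the tiling covers the grid
    offsets = np.tile(pattern, reps)[:H, :W]
    shifted = np.empty_like(tokens)
    for dt in np.unique(offsets):
        mask = offsets == dt
        # Position (t, h, w) receives the patch from frame (t - dt) mod T.
        shifted[:, mask] = np.roll(tokens, shift=int(dt), axis=0)[:, mask]
    return shifted

# Toy input: 4 frames, 2x2 patches, 1 channel; each token stores its frame index.
frames = np.arange(4, dtype=float).reshape(4, 1, 1, 1)
tokens = np.broadcast_to(frames, (4, 2, 2, 1)).copy()
pattern = np.array([[0, 1], [-1, 0]])  # hypothetical offset pattern
shifted = temporal_patch_shift(tokens, pattern)
```

After the shift, every frame still holds the same number of tokens, so an unmodified per-frame spatial self-attention layer attends over patches drawn from several frames at unchanged cost, which is the property the abstract describes.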

preprint, 2022, arXiv

Structure-Aware Motion Transfer with Deformable Anchor Model

Given a source image and a driving video depicting the same object type, the motion transfer task aims to generate a video by learning the motion from the driving video while preserving the appearance from the source image. In this paper, we propose a novel structure-aware motion modeling approach, the deformable anchor model (DAM), which can automatically discover the motion structure of arbitrary objects without leveraging their prior structure information. Specifically, inspired by the known deformable part model (DPM), our DAM introduces two types of anchors or keypoints: i) a number of motion anchors that capture both appearance and motion information from the source image and driving video; ii) a latent root anchor, which is linked to the motion anchors to facilitate better learning of the representations of the object structure information. Moreover, DAM can be further extended to a hierarchical version through the introduction of additional latent anchors to model more complicated structures. By regularizing motion anchors with latent anchor(s), DAM enforces the correspondences between them to ensure the structural information is well captured and preserved. Moreover, DAM can be learned effectively in an unsupervised manner. We validate our proposed DAM for motion transfer on different benchmark datasets. Extensive experiments clearly demonstrate that DAM achieves superior performance relative to existing state-of-the-art methods.

preprint, 2022, arXiv

Valley Piezoelectric Mechanism for Interpreting and Optimizing Piezoelectricity in Quantum Materials via Anomalous Hall Effect

Quantum materials have exhibited attractive electro-mechanical responses, but their piezoelectric coefficients are far from satisfactory due to the lack of fundamental mechanisms to benefit from the quantum effects. We discovered the valley piezoelectric mechanism that is absent in traditional piezoelectric theory yet promising to overcome this challenge. A theoretical model was developed to elucidate the valley piezoelectricity as the Valley Hall effect driven by pseudoelectric field, which can be significant in quantum systems with broken time reversal symmetry. Consistent tight-binding and density-functional-theory (DFT) calculations validate the model and unveil the crucial dependence of valley piezoelectricity on valley splitting, hybridization energy, bandgap, and Poisson ratio. Doping, passivation, and external stress are proposed as rational strategies to optimize piezoelectricity, with a more than 130% increase of piezoelectricity demonstrated by DFT simulations. The general valley piezoelectric model bridges the gap between electro-mechanical response and quantum effects, which opens an opportunity to achieve outstanding piezoelectricity in quantum materials via optimizing spin-valley and spin-orbit couplings.

preprint, 2020, arXiv

AIBench: An Agile Domain-specific Benchmarking Methodology and an AI Benchmark Suite

Domain-specific software and hardware co-design is encouraging as it is much easier to achieve efficiency for fewer tasks. Agile domain-specific benchmarking speeds up the process as it provides not only relevant design inputs but also relevant metrics and tools. Unfortunately, modern workloads like Big data, AI, and Internet services dwarf the traditional ones in terms of code size, deployment scale, and execution path, and hence raise serious benchmarking challenges. This paper proposes an agile domain-specific benchmarking methodology. Together with seventeen industry partners, we identify ten important end-to-end application scenarios, among which sixteen representative AI tasks are distilled as the AI component benchmarks. We propose the permutations of essential AI and non-AI component benchmarks as end-to-end benchmarks. An end-to-end benchmark is a distillation of the essential attributes of an industry-scale application. We design and implement a highly extensible, configurable, and flexible benchmark framework, on the basis of which we propose the guideline for building end-to-end benchmarks, and present the first end-to-end Internet service AI benchmark. The preliminary evaluation shows the value of our benchmark suite, AIBench, against MLPerf and TailBench for hardware and software designers, micro-architectural researchers, and code developers. The specifications, source code, testbed, and results are publicly available from the web site http://www.benchcouncil.org/AIBench/index.html.

preprint, 2020, arXiv

Analogues of Alladi's formula

In this note, we mainly show the analogue of one of Alladi's formulas over $\mathbb{Q}$ with respect to the Dirichlet convolutions involving the Möbius function $\mu(n)$, which is related to the natural densities of sets of primes by recent work of Dawsey, Sweeting and Woo, and Kural et al. This would give us several new analogues. In particular, we get that if $(k, \ell)=1$, then $$-\sum_{\begin{smallmatrix}n\geq 2\\ p(n)\equiv \ell (\operatorname{mod} k) \end{smallmatrix}} \frac{\mu(n)}{\varphi(n)} = \frac1{\varphi(k)},$$ where $p(n)$ is the smallest prime divisor of $n$, and $\varphi(n)$ is Euler's totient function. This refines one of Hardy's formulas in 1921. At the end, we give some examples for the $\varphi(n)$ replaced by functions "near $n$", which include the sum-of-divisors function.
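
As a quick consistency check on the displayed formula, two special cases can be read off directly from the statement. They are specializations only, not additional results of the note, and $\varphi$ denotes Euler's totient function. Taking $k=1$ makes the congruence condition vacuous, and taking $k=2$, $\ell=1$ restricts the sum to odd $n$, since the smallest prime divisor of $n$ is odd exactly when $n$ is odd. In both cases the right-hand side equals 1 because $\varphi(1)=\varphi(2)=1$.

```latex
% k = 1: the condition p(n) \equiv \ell \pmod{1} always holds
-\sum_{n\geq 2} \frac{\mu(n)}{\varphi(n)} = \frac{1}{\varphi(1)} = 1
\quad\Longleftrightarrow\quad
\sum_{n\geq 1} \frac{\mu(n)}{\varphi(n)} = 0 ,

% k = 2, \ell = 1: only odd n contribute
-\sum_{\substack{n\geq 3 \\ n \text{ odd}}} \frac{\mu(n)}{\varphi(n)} = \frac{1}{\varphi(2)} = 1 .
```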

preprint, 2020, arXiv

Asymptotic Plateau Problem for Two Contours

For two disjoint rectifiable star-shaped Jordan curves (including round circles) in the asymptotic boundary of hyperbolic 3-space, if the distance (see Definition 1.8) between these two Jordan curves is bounded from above by some constant, then there exists an annulus-type area minimizing (or equivalently least area) minimal surface asymptotic to these two Jordan curves. The main results of this paper are Theorem 1.7 and Theorem 1.11.

preprint, 2020, arXiv

Continual Local Replacement for Few-shot Learning

The goal of few-shot learning is to learn a model that can recognize novel classes based on one or a few training samples. It is challenging mainly for two reasons: (1) it lacks a good feature representation of novel classes; (2) a few labeled samples cannot accurately represent the true data distribution, and thus it is hard to learn a good decision function for classification. In this work, we use a sophisticated network architecture to learn a better feature representation and focus on the second issue. A novel continual local replacement strategy is proposed to address the data deficiency problem. It takes advantage of the content in unlabeled images to continually enhance labeled ones. Specifically, a pseudo labeling method is adopted to constantly select semantically similar images on the fly. Original labeled images will be locally replaced by the selected images for the next epoch of training. In this way, the model can directly learn new semantic information from unlabeled images and the capacity of supervised signals in the embedding space can be significantly enlarged. This allows the model to improve generalization and learn a better decision boundary for classification. Our method is conceptually simple and easy to implement. Extensive experiments demonstrate that it can achieve state-of-the-art results on various few-shot image recognition benchmarks.

preprint, 2020, arXiv

Effects of applied mechanical strain on vacancy clustering in FCC Ni

Irradiation-induced vacancy evolution in face-centered cubic (FCC) Ni under mechanical strains was studied using molecular dynamics simulations. Applied hydrostatic strain led to different stable forms of vacancy clusters, i.e., voids under strain >= +2% and stacking fault tetrahedra (SFTs) under strain <= 0. Direct transitions between SFT and void revealed that increasing strain magnitude facilitated the thermodynamic stability and dynamical evolution. The estimated free energy difference could well validate the dynamical simulation results by accounting for the entropic contribution, which was revealed to play an important role in the thermodynamic stability of vacancy clusters in FCC Ni.

preprint, 2020, arXiv

On the effectiveness of local vortex identification criteria in the compressed representation of wall-bounded turbulence

Compressing complex flows into a tangle of vortex filaments is the basic implication of the classical notion of the vortex representation. Various vortex identification criteria have been proposed to extract the vortex filaments from available velocity fields, which is an essential procedure in the practice of the vortex representation. This work focuses on the effectiveness of those identification criteria in the compressed representation of wall-bounded turbulence. Five local identification criteria regarding the vortex strength and three criteria for the vortex axis are considered. To facilitate the comparisons, this work first non-dimensionalizes the criteria of the vortex strength based on their dimensions and root mean squares, with corresponding equivalent thresholds prescribed. The optimal definition for the vortex vector is discussed by trialling all the possible combinations of the identification criteria for the vortex strength and the vortex axis. The effectiveness of those criteria in the compressed representation is evaluated based on two principles: (1) efficient compression, which implies that the less information required, the better the representation; (2) accurate decompression, which stresses that the original velocity fields can be reconstructed from the vortex representation with high accuracy. In practice, the alignment of the identified vortex axis and vortex isosurface, and the accuracy of the decompressed velocity fields based on those criteria, are quantitatively compared. The alignment degree is described by using the differential geometry method, and the decompressing process is implemented via the two-dimensional field-based linear stochastic estimation. The results of this work provide some reference for the applications of vortex identification criteria in wall-bounded turbulence.

preprint, 2020, arXiv

Vortex-to-velocity reconstruction for wall-bounded turbulence via a data-driven model

Modelling the vortex structures and then translating them into the corresponding velocity fields are two essential aspects of vortex-based modelling in wall-bounded turbulence. This work develops a data-driven method, which allows an effective reconstruction of the velocity field based on a given vortex field. The vortex field is defined as a vector field by combining the swirl strength and the real eigenvector of the velocity gradient tensor. The distinctive properties of the vortex field are investigated, with the relationship between the vortex magnitude and orientation revealed by differential geometry. The vortex-to-velocity reconstruction method incorporates the vortex-vortex and vortex-velocity correlation information and derives the inducing model functions under the framework of the linear stochastic estimation. The fast Fourier transform is employed to improve the computational efficiency of the implementation. The reconstruction accuracy is assessed and compared with the widely-used Biot-Savart law. Results show that the method can effectively recover the turbulent motions in a large scale range, which is very promising for turbulence modelling. The method is also employed to investigate the inducing effects of vortices at different heights, and some revealing results are discussed and linked to the hot research topics in wall-bounded turbulence.
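
The vortex vector described in this abstract can be illustrated at a single point. The sketch below assumes the common definition of swirl strength as the imaginary part of the complex-conjugate eigenvalue pair of the 3x3 velocity gradient tensor, and scales the unit real eigenvector by it. The function name, the tolerance, and the unresolved sign of the eigenvector are choices made for illustration, not the authors' implementation.

```python
import numpy as np

def vortex_vector(grad_u, tol=1e-12):
    """Swirl strength times the unit real eigenvector of a 3x3 velocity gradient tensor.

    Returns the zero vector when all eigenvalues are real (no local swirling motion).
    The sign of the returned axis is arbitrary in this sketch.
    """
    eigvals, eigvecs = np.linalg.eig(np.asarray(grad_u, dtype=float))
    imag = np.abs(eigvals.imag)
    if imag.max() <= tol:
        return np.zeros(3)
    swirl_strength = imag.max()
    real_idx = int(np.argmin(imag))  # index of the single real eigenvalue
    axis = eigvecs[:, real_idx].real
    axis = axis / np.linalg.norm(axis)
    return swirl_strength * axis

# Rigid-like rotation about z with rate 2, plus stretching along z (made-up tensor).
rotation = [[0.0, -2.0, 0.0],
            [2.0, 0.0, 0.0],
            [0.0, 0.0, 0.5]]
v_rot = vortex_vector(rotation)

# Pure strain has only real eigenvalues, so no vortex is identified.
strain = np.diag([1.0, -1.0, 0.0])
v_strain = vortex_vector(strain)
```

In practice the sign of the axis would still need to be fixed by some convention, for example by aligning it with the local vorticity, which is an assumption here rather than something stated in the abstract.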