Researcher profile

Bing Yang

Bing Yang contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
12 works
0 followers
17 topics
4 close collaborators

Published work

12 published items

preprint, 2026, arXiv

MacVQA: Adaptive Memory Allocation and Global Noise Filtering for Continual Visual Question Answering

Visual Question Answering (VQA) requires models to reason over multimodal information, combining visual and textual data. With the development of continual learning, significant progress has been made in retaining knowledge and adapting to new information in the VQA domain. However, current methods often struggle with balancing knowledge retention, adaptation, and robust feature representation. To address these challenges, we propose a novel framework with adaptive memory allocation and global noise filtering called MacVQA for visual question answering. MacVQA fuses visual and question information while filtering noise to ensure robust representations, and employs prototype-based memory allocation to optimize feature quality and memory usage. These designs enable MacVQA to balance knowledge acquisition, retention, and compositional generalization in continual VQA learning. Experiments on ten continual VQA tasks show that MacVQA outperforms existing baselines, achieving 43.38% average accuracy and 2.32% average forgetting on standard tasks, and 42.53% average accuracy and 3.60% average forgetting on novel composition tasks.

preprint, 2026, arXiv

MyGram: Modality-aware Graph Transformer with Global Distribution for Multi-modal Entity Alignment

Multi-modal entity alignment aims to identify equivalent entities between two multi-modal knowledge graphs by integrating multi-modal data, such as images and text, to enrich the semantic representations of entities. However, existing methods may overlook the structural contextual information within each modality, making them vulnerable to interference from shallow features. To address these challenges, we propose MyGram, a modality-aware graph transformer with global distribution for multi-modal entity alignment. Specifically, we develop a modality diffusion learning module to capture deep structural contextual information within modalities and enable fine-grained multi-modal fusion. In addition, we introduce a Gram Loss that acts as a regularization constraint by minimizing the volume of a 4-dimensional parallelotope formed by multi-modal features, thereby achieving global distribution consistency across modalities. We conduct experiments on five public datasets. Results show that MyGram outperforms baseline models, achieving a maximum improvement of 4.8% in Hits@1 on FBDB15K, 9.9% on FBYG15K, and 4.3% on DBP15K.
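
The volume of a parallelotope spanned by a set of vectors equals the square root of the determinant of their Gram matrix, which is presumably what the Gram Loss above builds on. The following is a minimal sketch of that volume computation only, not the authors' loss: the function names and the toy vectors are hypothetical, and the abstract does not say which four modality features are used or how the volume enters training.

```python
import math

def gram_matrix(vectors):
    """Matrix of pairwise inner products G[i][j] = <v_i, v_j>."""
    return [[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors]

def determinant(matrix):
    """Determinant via Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n = len(m)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            return 0.0  # numerically singular: the vectors are linearly dependent
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det  # a row swap flips the sign
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det

def parallelotope_volume(vectors):
    """Volume of the parallelotope spanned by the vectors: sqrt(det(Gram))."""
    return math.sqrt(max(determinant(gram_matrix(vectors)), 0.0))

# Hypothetical toy features: four mutually orthogonal unit vectors span a
# unit-volume parallelotope, while four identical vectors span zero volume.
orthogonal = [[1.0, 0, 0, 0, 0], [0, 1.0, 0, 0, 0], [0, 0, 1.0, 0, 0], [0, 0, 0, 1.0, 0]]
aligned = [[0.5, 0.5, 0.5, 0.5, 0.0]] * 4
```

In this picture, feature vectors that point in similar directions span a small volume, so driving the volume down pulls the four representations toward agreement.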

preprint, 2026, arXiv

Scalable cold-atom quantum simulator of a $3+1$D U$(1)$ lattice gauge theory with dynamical matter

The stated overarching goal of the highly active field of quantum simulation of high-energy physics (HEP) is to achieve the capability to study ab-initio real-time microscopic dynamics of $3+1$D quantum chromodynamics (QCD). However, existing experimental realizations and theoretical proposals for future ones have remained restricted to one or two spatial dimensions. Here, we take a big step towards this goal by proposing a concrete experimentally feasible scalable cold-atom quantum simulator of a U$(1)$ quantum link model of quantum electrodynamics (QED) in three spatial dimensions, employing linear gauge protection to stabilize gauge invariance. Using tree tensor network simulations, we benchmark the performance of this quantum simulator through near- and far-from-equilibrium observables, showing excellent agreement with the ideal gauge theory. Additionally, we introduce a method for analog quantum error mitigation that accounts for unwanted first-order tunneling processes, vastly improving agreement between quantum-simulator and ideal-gauge-theory results. Our findings pave the way towards realistic quantum simulators of $3+1$D lattice gauge theories that can probe regimes well beyond classical simulability.

preprint, 2023, arXiv

High-order adaptive multiresolution wavelet upwind schemes for hyperbolic conservation laws

A system of high-order adaptive multiresolution wavelet collocation upwind schemes is developed for the solution of hyperbolic conservation laws. A couple of asymmetrical wavelet bases with the interpolation property are built to realize the upwind property and address the nonlinearity in the hyperbolic problems. An adaptive algorithm based on multiresolution analysis in wavelet theory is designed to capture moving shock waves and distinguish new localized steep regions. An integration average reconstruction method is proposed based on the Lebesgue differentiation theorem to suppress the Gibbs phenomenon. All these numerical techniques enable the wavelet collocation upwind scheme to provide a general framework for devising satisfactory adaptive wavelet upwind methods with high-order accuracy. Several benchmark tests for 1D hyperbolic problems are carried out to verify the accuracy and efficiency of the present wavelet schemes.

preprint, 2022, arXiv

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization

Direct-path relative transfer function (DP-RTF) refers to the ratio between the direct-path acoustic transfer functions of two microphone channels. Though DP-RTF fully encodes the sound spatial cues and serves as a reliable localization feature, it is often erroneously estimated in the presence of noise and reverberation. This paper proposes to learn DP-RTF with deep neural networks for robust binaural sound source localization. A DP-RTF learning network is designed to regress the binaural sensor signals to a real-valued representation of DP-RTF. It consists of a branched convolutional neural network module to separately extract the inter-channel magnitude and phase patterns, and a convolutional recurrent neural network module for joint feature learning. To better exploit the speech spectra to aid the DP-RTF estimation, a monaural speech enhancement network is used to recover the direct-path spectrograms from the noisy ones. The enhanced spectrograms are stacked onto the noisy spectrograms to act as the input of the DP-RTF learning network. We train one unique DP-RTF learning network using many different binaural arrays to enable the generalization of DP-RTF learning across arrays. This avoids time-consuming training data collection and network retraining for a new array, which is very useful in practical applications. Experimental results on both simulated and real-world data show the effectiveness of the proposed method for direction of arrival (DOA) estimation in noisy and reverberant environments, and a good generalization ability to unseen binaural arrays.
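
The definition at the start of this abstract (a ratio of two direct-path transfer functions) can be illustrated with a small sketch. Everything specific here is an assumption made for illustration: the free-field model with a pure gain and delay per channel, the choice of which channel is the reference, the numeric values, and the stacking of real and imaginary parts as a "real-valued representation". The abstract does not specify any of these.

```python
import cmath
import math

def direct_path_atf(gain, delay_s, freq_hz):
    """Assumed free-field direct-path transfer function: a gain and a pure delay."""
    return gain * cmath.exp(-1j * 2 * math.pi * freq_hz * delay_s)

def dp_rtf(atf_reference, atf_other):
    """Ratio of the two direct-path transfer functions (reference choice is an assumption)."""
    return atf_other / atf_reference

def real_valued(rtf):
    """One possible real-valued encoding: (real part, imaginary part)."""
    return (rtf.real, rtf.imag)

# Hypothetical values: the second channel is slightly quieter and 0.3 ms later.
freq_hz = 500.0
rtf = dp_rtf(
    direct_path_atf(gain=1.0, delay_s=0.0010, freq_hz=freq_hz),
    direct_path_atf(gain=0.8, delay_s=0.0013, freq_hz=freq_hz),
)
level_ratio = abs(rtf)               # inter-channel level difference (as a ratio)
phase_difference = cmath.phase(rtf)  # inter-channel phase difference in radians
```

Under this model the magnitude of the ratio carries the inter-channel level difference and its phase carries the inter-channel time difference, which is why the ratio serves as a localization feature.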

preprint, 2022, arXiv

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

Graph convolutional networks have been widely used for skeleton-based action recognition due to their excellent modeling ability of non-Euclidean data. As the graph convolution is a local operation, it can only utilize the short-range joint dependencies and short-term trajectory but fails to directly model the distant joint relations and long-range temporal information that are vital to distinguishing various actions. To solve this problem, we present a multi-scale spatial graph convolution (MS-GC) module and a multi-scale temporal graph convolution (MT-GC) module to enrich the receptive field of the model in spatial and temporal dimensions. Concretely, the MS-GC and MT-GC modules decompose the corresponding local graph convolution into a set of sub-graph convolutions, forming a hierarchical residual architecture. Without introducing additional parameters, the features will be processed with a series of sub-graph convolutions, and each node could complete multiple spatial and temporal aggregations with its neighborhoods. The final equivalent receptive field is accordingly enlarged, which is capable of capturing both short- and long-range dependencies in spatial and temporal domains. By coupling these two modules as a basic block, we further propose a multi-scale spatial temporal graph convolutional network (MST-GCN), which stacks multiple blocks to learn effective motion representations for action recognition. The proposed MST-GCN achieves remarkable performance on three challenging benchmark datasets, NTU RGB+D, NTU-120 RGB+D and Kinetics-Skeleton, for skeleton-based action recognition.

preprint, 2022, arXiv

Non-neglectable entropy effect on sintering of supported nanoparticles

Sintering refers to particle coalescence by heat, which has been known as a thermal phenomenon involving all aspects of natural science for centuries. It is particularly important in heterogeneous catalysis because normally sintering results in deactivation of the catalysts. In previous studies, the enthalpy contribution was considered to be dominant in sintering and the entropy effect was generally considered neglectable. However, we unambiguously demonstrate in this work, through designed experiments and an improved theoretical framework, that entropy could prevail over the enthalpy contribution to dominate the sintering behavior of supported nanoparticles (NPs). Using in situ Cs-corrected environmental scanning transmission electron microscopy and synchrotron-based ambient pressure X-ray photoelectron spectroscopy, we observe the unprecedented entropy-driven phenomenon that supported NPs reversibly redisperse upon heating and sinter upon cooling in three systems (Pd-CeO2, Cu-TiO2, Ag-TiO2). We quantitatively show that the configurational entropy of highly dispersed ad-atoms is large enough to reverse their sintering tendency at the elevated temperature. This work reshapes the basic understanding of sintering at the nanoscale and opens the door for various de-novo designs of thermodynamically stable nanocatalysts.

preprint, 2022, arXiv

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization

Multiple moving sound source localization in real-world scenarios remains a challenging issue due to interaction between sources, time-varying trajectories, distorted spatial cues, etc. In this work, we propose to use deep learning techniques to learn competing and time-varying direct-path phase differences for localizing multiple moving sound sources. A causal convolutional recurrent neural network is designed to extract the direct-path phase difference sequence from signals of each microphone pair. To avoid the assignment ambiguity and the problem of uncertain output-dimension encountered when simultaneously predicting multiple targets, the learning target is designed in a weighted sum format, which encodes source activity in the weight and direct-path phase differences in the summed value. The learned direct-path phase differences for all microphone pairs can be directly used to construct the spatial spectrum according to the formulation of steered response power (SRP). This deep neural network (DNN) based SRP method is referred to as SRP-DNN. The locations of sources are estimated by iteratively detecting and removing the dominant source from the spatial spectrum, in which way the interaction between sources is reduced. Experimental results on both simulated and real-world data show the superiority of the proposed method in the presence of noise and reverberation.

preprint, 2022, arXiv

Thermalization dynamics of a gauge theory on a quantum simulator

Gauge theories form the foundation of modern physics, with applications ranging from elementary particle physics and early-universe cosmology to condensed matter systems. We perform quantum simulations of the unitary dynamics of a U(1) symmetric gauge field theory and demonstrate emergent irreversible behavior. The highly constrained gauge theory dynamics is encoded in a one-dimensional Bose-Hubbard simulator, which couples fermionic matter fields through dynamical gauge fields. We investigate global quantum quenches and the equilibration to a steady state well approximated by a thermal ensemble. Our work may enable the investigation of elusive phenomena, such as Schwinger pair production and string-breaking, and paves the way for simulating more complex higher-dimensional gauge theories on quantum synthetic matter devices.

preprint, 2021, arXiv

AlCrO protected textured stainless steel surface for high temperature solar selective absorber applications

The diffusion of substrate material into the absorbing layer and oxidation of the metal substrate or cermet metal nanoparticles at high temperatures are known as inevitable problems of solar selective absorbers. In this study, we consider the use of a textured stainless steel (SS) surface coated with a protective AlCr oxide layer as a high temperature solar selective absorber. The textured SS surface was prepared by ion etching techniques and the AlCr oxide protective layer was deposited by RF magnetron sputtering. The absorptivity and emissivity of the as-prepared absorbers were 0.86-0.92 and 0.151-0.168, respectively. In order to evaluate the thermal stability, the absorbers were annealed at 600-800 °C for different durations in ambient atmosphere. Absorbers demonstrated a red shift of the onset of the reflectivity at all annealing temperatures. A high activation energy of 315 kJ/mol was calculated. The service lifetime of the absorbers at 500 °C was estimated to be about 100 years, and at 700 and 800 °C the absorbers were stable for about 50 hours and 1 hour, respectively. A detailed examination of the annealed absorber surface revealed growth of surface Mn3O4 nanocrystals, which resulted in the observed change of the reflectance spectra, while the textured surface morphology had no significant change. The results show that the protective textured surface has much higher thermal stability in air than iron-based cermet absorbers.
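
The lifetime figures quoted in this abstract can be sanity-checked with an Arrhenius extrapolation. This is a rough sketch, not the authors' procedure: it assumes degradation is governed by a single thermally activated process with the stated activation energy, and it anchors the extrapolation at the reported stability of about 1 hour at 800 °C. The function and variable names are hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618     # J/(mol K)
ACTIVATION_ENERGY = 315e3      # J/mol, the value quoted in the abstract
HOURS_PER_YEAR = 24 * 365.25

def celsius_to_kelvin(temp_c):
    return temp_c + 273.15

def extrapolate_lifetime(ref_hours, ref_celsius, target_celsius,
                         activation_energy=ACTIVATION_ENERGY):
    """Scale a reference lifetime to another temperature.

    Assumes lifetime is proportional to exp(Ea / (R T)), i.e. a single
    Arrhenius-type degradation process (an assumption of this sketch).
    """
    t_ref = celsius_to_kelvin(ref_celsius)
    t_target = celsius_to_kelvin(target_celsius)
    exponent = activation_energy / GAS_CONSTANT * (1.0 / t_target - 1.0 / t_ref)
    return ref_hours * math.exp(exponent)

# Anchor: about 1 hour of stability at 800 C, as reported in the abstract.
years_at_500 = extrapolate_lifetime(1.0, 800.0, 500.0) / HOURS_PER_YEAR
hours_at_700 = extrapolate_lifetime(1.0, 800.0, 700.0)
```

Under these assumptions the extrapolation to 500 °C lands close to the quoted figure of about 100 years, and the 700 °C value comes out at a few tens of hours, the same order of magnitude as the reported 50 hours.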

preprint, 2020, arXiv

Cooling and entangling ultracold atoms in optical lattices

Scalable, coherent many-body systems can enable the realization of previously unexplored quantum phases and have the potential to exponentially speed up information processing. Thermal fluctuations are negligible and quantum effects govern the behavior of such systems with extremely low temperature. We report the cooling of a quantum simulator with 10,000 atoms and mass production of high-fidelity entangled pairs. In a two-dimensional plane, we cool Mott insulator samples by immersing them into removable superfluid reservoirs, achieving an entropy per particle of $1.9^{+1.7}_{-0.4} \times 10^{-3} k_{\text{B}}$. The atoms are then rearranged into a two-dimensional lattice free of defects. We further demonstrate a two-qubit gate with a fidelity of 0.993 $\pm$ 0.001 for entangling 1250 atom pairs. Our results offer a setting for exploring low-energy many-body phases and may enable the creation of large-scale entanglement.

preprint, 2020, arXiv

Robustness of gauge-invariant dynamics against defects in ultracold-atom gauge theories

Recent years have seen strong progress in quantum simulation of gauge-theory dynamics using ultracold-atom experiments. A principal challenge in these efforts is the certification of gauge invariance, which has recently been realized in [B. Yang et al., arXiv:2003.08945]. One major but poorly investigated experimental source of gauge-invariance violation is an imperfect preparation of the initial state. Using the time-dependent density-matrix renormalization group, we analyze the robustness of gauge-invariant dynamics against potential preparation defects in the above ultracold-atom implementation of a $\mathrm{U}(1)$ gauge theory. We find defects related to an erroneous initialization of matter fields to be innocuous, as the associated gauge-invariance violation remains strongly localized throughout the time evolution. A defect due to faulty initialization of the gauge field leads to a mild proliferation of the associated violation. Furthermore, we characterize the influence of immobile and mobile defects by monitoring the spread of entanglement entropy. Overall, our results indicate that the aforementioned experimental realization exhibits a high level of fidelity in the gauge invariance of its dynamics at all evolution times. Our work provides strong evidence that ultracold-atom setups can serve as an extremely reliable framework for the quantum simulation of gauge-theory dynamics.