Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
22works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

22 published item(s)

preprint2026arXiv

Are Multimodal Embeddings Truly Beneficial for Recommendation? A Deep Dive into Whole vs. Individual Modalities

Multimodal recommendation has emerged as a mainstream paradigm, typically leveraging text and visual embeddings extracted from pre-trained models such as Sentence-BERT, Vision Transformers, and ResNet. This approach is founded on the intuitive assumption that incorporating multimodal embeddings can enhance recommendation performance. However, despite its popularity, this assumption lacks comprehensive empirical verification. This presents a critical research gap. To address it, we pose the central research question of this paper: Are multimodal embeddings truly beneficial for recommendation? To answer this question, we conduct a large-scale empirical study examining the role of text and visual embeddings in modern multimodal recommendation models, both as a whole and individually. Specifically, we pose two key research questions: (1) Do multimodal embeddings as a whole improve recommendation performance? (2) Is each individual modality - text and image - useful when used alone? To isolate the effect of individual modalities - text or visual - we employ a modality knockout strategy by setting the corresponding embeddings to either constant values or random noise. To ensure the scale and comprehensiveness of our study, we evaluate 14 widely used state-of-the-art multimodal recommendation models. Our findings reveal that: (1) multimodal embeddings generally enhance recommendation performance - particularly when integrated through more sophisticated graph-based fusion models. Surprisingly, commonly adopted baseline models with simple fusion schemes, such as VBPR and BM3, show only limited gains. (2) The text modality alone achieves performance comparable to the full multimodal setting in most cases, whereas the image modality alone does not. These results offer foundational insights and practical guidance for the multimodal recommendation community.

preprint2026arXiv

The Waring Problem of Harmonic Polynomials

This paper investigates the Waring problem of harmonic polynomials. By characterizing the annihilating ideal of a homogeneous harmonic polynomial, i.e., a real binary form that is in the kernel of the Laplacian, we show that its Waring rank equals its degree. Moreover, we show that any linear form can appear in a minimal Waring decomposition of a homogeneous harmonic polynomial, implying that the forbidden locus is empty. We also provide an explicit algorithm for computing the minimal Waring decompositions.

preprint2024arXiv

SUANPAN: Scalable Photonic Linear Vector Machine

Photonic linear operation is a promising approach to handle the extensive vector multiplications in artificial intelligence techniques due to the natural bosonic parallelism and high-speed information transmission of photonics. Although it is believed that maximizing the interaction of the light beams is necessary to fully utilize the parallelism and tremendous efforts have been made in past decades, the achieved dimensionality of vector-matrix multiplication is very limited due to the difficulty of scaling up a tightly interconnected or highly coupled optical system. Additionally, there is still a lack of a universal photonic computing architecture that can be readily merged with existing computing system to meet the computing power demand of AI techniques. Here, we propose a programmable and reconfigurable photonic linear vector machine to perform only the inner product of two vectors, formed by a series of independent basic computing units, while each unit is just one pair of light-emitter and photodetector. Since there is no interaction among light beams inside, extreme scalability could be achieved by simply duplicating the independent basic computing unit while there is no requirement of large-scale analog-to-digital converter and digital-to-analog converter arrays. Our architecture is inspired by the traditional Chinese Suanpan or abacus and thus is denoted as photonic SUANPAN. As a proof of principle, SUANPAN architecture is implemented with an 8*8 vertical cavity surface emission laser array and an 8*8 MoTe2 two-dimensional material photodetector array. We believe that our proposed photonic SUANPAN is capable of serving as a fundamental linear vector machine that can be readily merged with existing electronic digital computing system and is potential to enhance the computing power for future various AI applications.

preprint2022arXiv

Asynchronous Parallel Incremental Block-Coordinate Descent for Decentralized Machine Learning

Machine learning (ML) is a key technique for big-data-driven modelling and analysis of massive Internet of Things (IoT) based intelligent and ubiquitous computing. For fast-increasing applications and data amounts, distributed learning is a promising emerging paradigm since it is often impractical or inefficient to share/aggregate data to a centralized location from distinct ones. This paper studies the problem of training an ML model over decentralized systems, where data are distributed over many user devices and the learning algorithm run on-device, with the aim of relaxing the burden at a central entity/server. Although gossip-based approaches have been used for this purpose in different use cases, they suffer from high communication costs, especially when the number of devices is large. To mitigate this, incremental-based methods are proposed. We first introduce incremental block-coordinate descent (I-BCD) for the decentralized ML, which can reduce communication costs at the expense of running time. To accelerate the convergence speed, an asynchronous parallel incremental BCD (API-BCD) method is proposed, where multiple devices/agents are active in an asynchronous fashion. We derive convergence properties for the proposed methods. Simulation results also show that our API-BCD method outperforms state of the art in terms of running time and communication costs.

preprint2022arXiv

Doping-induced structural transformation in the spin-1/2 triangular-lattice antiferromagnet Na$_{2}$Ba$_{1-x}$Sr$_{x}$Co(PO$_{4}$)$_{2}$

The effects of Sr doping on the structural properties of Na$_{2}$BaCo(PO$_{4}$)$_{2}$, a spin-1/2 triangular-lattice antiferromagnet as a quantum spin liquid candidate, are investigated by complementary x-ray and neutron powder diffraction measurements. It is found that in Na$_{2}$Ba$_{1-x}$Sr$_{x}$Co(PO$_{4}$)$_{2}$ (NBSCPO), the trigonal phase (space group $\mathit{P}$$\bar{3}$$\mathit{m}$1) with a perfect triangular lattice of Co$^{2+}$ ions is structurally stable when the doping level of Sr is below 30% ($\mathit{x}$ $\le$ 0.3), while a pure monoclinic phase (space group $\mathit{P}$2$_{1}$/$\mathit{a}$) with slight rotations of CoO$_{6}$ octahedra and displacements of Ba$^{2+}$/Sr$^{2+}$ ions will be established when the Sr doping level is above 60% ($\mathit{x}$ $\ge$ 0.6). Such a doping-induced structural transformation in NBSCPO is supported by first-principles calculations and Raman spectroscopy. Na$_{2}$SrCo(PO$_{4}$)$_{2}$, a novel spin-1/2 triangular-lattice antiferromagnet with glaserite-type structure, although monoclinically distorted, exhibits no long-range magnetic order down to 2 K and a similar negative Curie-Weiss temperature as Na$_{2}$BaCo(PO$_{4}$)$_{2}$ with a perfect triangular lattice, suggesting the robustness of magnetic exchange interaction against the Ba/Sr substitutions.

preprint2022arXiv

Electrically pumped polarized exciton-polaritons in a halide perovskite microcavity

Exciton polaritons, hybrid quasiparticles with part-light part-matter nature in semiconductor microcavities, are extensively investigated for striking phenomena such as polariton condensation and quantum emulation. These phenomena have recently been discovered in emerging lead halide perovskites at elevated temperatures up to room temperature. For advancing these discoveries into practical applications, one critical requirement is the realization of electrically pumped exciton-polaritons. However, electrically pumped polariton light-emitting devices with perovskites have not yet been achieved experimentally. Here, we devise a new method to combine the device with the microcavity and report the first halide perovskite polariton light-emitting device. Specifically, the device is based on a CsPbBr3 capacitive structure, which can inject the electrons and holes from the same electrode, conducive to the formation of excitons and simultaneously maintaining the high quality of the microcavity. In addition, highly polarization-selective polariton emissions have been demonstrated due to the optical birefringence in the CsPbBr3 microplate. This work paves the way for realizing practical polaritonic devices such as high-speed light-emitting devices for information communications and inversionless electrically pumped lasers based on perovskites.

preprint2022arXiv

Orbital polarization and third-order anomalous Hall effect in WTe2

The anomalous Hall effect (AHE) has been extended into the nonlinear regime, where the Hall voltage shows higher-order response to the applied current. Nevertheless, the microscopic mechanism of the nonlinear AHE remains unclear. Here we report the orbital polarization and its induced third-order AHE in few-layer WTe2 flakes. Through angle-dependent electric measurements, it is found that the third-order AHE is quite consistent with the electric field induced polarization of orbital magnetic moment caused by the Berry connection polarizability tensor, which is further directly detected by polar reflective magnetic circular dichroism spectroscopy. The microscopic mechanisms of third-order AHE are analyzed through the scaling law, that is, the opposite orbital magnetic moments (up or down) deflect to opposite directions driven by electric field induced Berry curvature, forming the intrinsic contribution; driven by the Magnus effect of the self-rotating Bloch electrons, the opposite orbital magnetic moments are scattered towards opposite transverse directions, resulting in the skew scattering.

preprint2022arXiv

Ultra-efficient magnetism modulation in a Weyl ferromagnet by current-assisted domain wall motion

Flexible and efficient manipulation of magnetic configurations can be challenging. In the design of practical devices, achieving a high effective magnetic field with a low working current is under tight demand. Here, we report a unique method for efficient magnetism modulation by direct current injection in magnetic Weyl semimetal Co3Sn2S2. We demonstrate that the modulation process stems from current-assisted domain wall motion. Through two independent methods, we reveal that the spin-transfer torque efficiency of Co3Sn2S2 reaches as high as 2.4-5.6 kOe MA^(-1) cm^2, and the threshold current density for driving the magnetic domain walls is as low as <5.1*10^5 A/cm^2 without an external field, and <1.5*10^5 A/cm^2 with a moderate external field. Our findings manifest a new and powerful approach for sub-micron magnetism manipulation, and also open the door towards a new paradigm of spintronics that combines magnetism, topology, and metallicity for low-energy consumption memory and computing.

preprint2021arXiv

On the Harish-Chandra Homomorphism for Quantum Superalgebras

In this paper, we introduce the Harish-Chandra homomorphism for the quantum superalgebra $\mathrm{U}_q(\mathfrak{g})$ associated with a simple basic Lie superalgebra $\mathfrak{g}$ and give an explicit description of its image. We use it to prove that the center of $\mathrm{U}_q(\mathfrak{g})$ is isomorphic to a subring of the ring $J(\mathfrak{g})$ of exponential super-invariants in the sense of Sergeev and Veselov, establishing a Harish-Chandra type theorem for $\mathrm{U}_q(\mathfrak{g})$. As a byproduct, we obtain a basis of the center of $\mathrm{U}_q(\mathfrak{g})$ with the aid of quasi-R-matrix.

preprint2020arXiv

Decentralized Beamforming Design for Intelligent Reflecting Surface-enhanced Cell-free Networks

Cell-free networks are considered as a promising distributed network architecture to satisfy the increasing number of users and high rate expectations in beyond-5G systems. However, to further enhance network capacity, an increasing number of high-cost base stations (BSs) are required. To address this problem and inspired by the cost-effective intelligent reflecting surface (IRS) technique, we propose a fully decentralized design framework for cooperative beamforming in IRS-aided cell-free networks. We first transform the centralized weighted sum-rate maximization problem into a tractable consensus optimization problem, and then an incremental alternating direction method of multipliers (ADMM) algorithm is proposed to locally update the beamformer. The complexity and convergence of the proposed method are analyzed, and these results show that the performance of the new scheme can asymptotically approach that of the centralized one as the number of iterations increases. Results also show that IRSs can significantly increase the system sum-rate of cell-free networks and the proposed method outperforms existing decentralized methods.

preprint2020arXiv

Deep Reinforcement Learning Based Spectrum Allocation in Integrated Access and Backhaul Networks

We develop a framework based on deep reinforce-ment learning (DRL) to solve the spectrum allocation problem inthe emerging integrated access and backhaul (IAB) architecturewith large scale deployment and dynamic environment. The avail-able spectrum is divided into several orthogonal sub-channels,and the donor base station (DBS) and all IAB nodes have thesame spectrum resource for allocation, where a DBS utilizes thosesub-channels for access links of associated user equipment (UE)as well as for backhaul links of associated IAB nodes, and anIAB node can utilize all for its associated UEs. This is one ofkey features in which 5G differs from traditional settings wherethe backhaul networks were designed independently from theaccess networks. With the goal of maximizing the sum log-rateof all UE groups, we formulate the spectrum allocation probleminto a mix-integer and non-linear programming. However, itis intractable to find an optimal solution especially when theIAB network is large and time-varying. To tackle this problem,we propose to use the latest DRL method by integrating anactor-critic spectrum allocation (ACSA) scheme and deep neuralnetwork (DNN) to achieve real-time spectrum allocation indifferent scenarios. The proposed methods are evaluated throughnumerical simulations and show promising results compared withsome baseline allocation policies.

preprint2020arXiv

Extension of elementary $p$-groups and its application in classification of groups of prime exponent

Let $p$ be a prime number and $\mathbb{Z}_p=\mathbb{Z}/p\mathbb{Z}$. We study finite groups with abelian derived subgroup and exponent $p$ in terms of group extension data and their matrix presentations. We show a one-to-one correspondence between the following two sets: (i) the isoclasses of class 2 groups of exponent $p$ and order $p^{m+n}$ and with derived subgroup $\mathbb{Z}_p^n$, and (ii) the set $\text{Gr}(n,\text{AS}_m(\mathbb{Z}_p))/\text{GL}_m(\mathbb{Z}_p)$ of orbits of $\text{Gr}(n,\text{AS}_m(\mathbb{Z}_p))$ under the congruence action by $\text{GL}_m(\mathbb{Z}_p)$, where $\text{Gr}(n,\text{AS}_m(\mathbb{Z}_p))$ is the set of $n$-dimensional subspaces of anti-symmetric matrices of order $m$ over $\mathbb{Z}_p$. We give a description of the orbit spaces $\text{Gr}(2, \text{AS}_m(\mathbb{Z}_p))/\text{GL}_m(\mathbb{Z}_p)$ for all $m$ and $p$ by applying the theory of pencils of anti-symmetric matrices. Based on this, we show complete sets of representatives of orbits of $\text{Gr}(3,\text{AS}_4(\mathbb{Z}_3))/\text{GL}_4(\mathbb{Z}_3)$, $\text{Gr}(4, \text{AS}_4(\mathbb{Z}_3))/\text{GL}_4(\mathbb{Z}_3)$ and $\text{Gr}(3, \text{AS}_5(\mathbb{Z}_3))/\text{GL}_5(\mathbb{Z}_3)$. As a consequence, we obtain a classification of corresponding class 2 groups of exponent $p$. In particular, we recover the classification of groups with exponent 3 and order $\le 3^8$.

preprint2020arXiv

Fully Decentralized Federated Learning Based Beamforming Design for UAV Communications

To handle the data explosion in the era of internet of things (IoT), it is of interest to investigate the decentralized network, with the aim at relaxing the burden to central server along with keeping data privacy. In this work, we develop a fully decentralized federated learning (FL) framework with an inexact stochastic parallel random walk alternating direction method of multipliers (ISPW-ADMM). Performing more communication efficient and enhanced privacy preservation compared with the current state-of-the-art, the proposed ISPW-ADMM can be partially immune to the impacts from time-varying dynamic network and stochastic data collection, while still in fast convergence. Benefits from the stochastic gradients and biased first-order moment estimation, the proposed framework can be applied to any decentralized FL tasks over time-varying graphs. Thus to further demonstrate the practicability of such framework in providing fast convergence, high communication efficiency, and system robustness, we study the extreme learning machine(ELM)-based FL model for robust beamforming (BF) design in UAV communications, as verified by the numerical simulations.

preprint2020arXiv

Learning Based Hybrid Beamforming Design for Full-Duplex Millimeter Wave Systems

Millimeter Wave (mmWave) communications with full-duplex (FD) have the potential of increasing the spectral efficiency, relative to those with half-duplex. However, the residual self-interference (SI) from FD and high pathloss inherent to mmWave signals may degrade the system performance. Meanwhile, hybrid beamforming (HBF) is an efficient technology to enhance the channel gain and mitigate interference with reasonable complexity. However, conventional HBF approaches for FD mmWave systems are based on optimization processes, which are either too complex or strongly rely on the quality of channel state information (CSI). We propose two learning schemes to design HBF for FD mmWave systems, i.e., extreme learning machine based HBF (ELM-HBF) and convolutional neural networks based HBF (CNN-HBF). Specifically, we first propose an alternating direction method of multipliers (ADMM) based algorithm to achieve SI cancellation beamforming, and then use a majorization-minimization (MM) based algorithm for joint transmitting and receiving HBF optimization. To train the learning networks, we simulate noisy channels as input, and select the hybrid beamformers calculated by proposed algorithms as targets. Results show that both learning based schemes can provide more robust HBF performance and achieve at least 22.1% higher spectral efficiency compared to orthogonal matching pursuit (OMP) algorithms. Besides, the online prediction time of proposed learning based schemes is almost 20 times faster than the OMP scheme. Furthermore, the training time of ELM-HBF is about 600 times faster than that of CNN-HBF with 64 transmitting and receiving antennas.

preprint2020arXiv

Learning Based Hybrid Beamforming for Millimeter Wave Multi-User MIMO Systems

Hybrid beamforming (HBF) design is a crucial stage in millimeter wave (mmWave) multi-user multi-input multi-output (MU-MIMO) systems. However, conventional HBF methods are still with high complexity and strongly rely on the quality of channel state information. We propose an extreme learning machine (ELM) framework to jointly optimize transmitting and receiving beamformers. Specifically, to provide accurate labels for training, we first propose an factional-programming and majorization-minimization based HBF method (FP-MM-HBF). Then, an ELM based HBF (ELM-HBF) framework is proposed to increase the robustness of beamformers. Both FP-MM-HBF and ELM-HBF can provide higher system sum-rate compared with existing methods. Moreover, ELM-HBF cannot only provide robust HBF performance, but also consume very short computation time.

preprint2020arXiv

Odd-even layer-number effect and layer-dependent magnetic phase diagrams in MnBi2Te4

The intrinsic magnetic layered topological insulator MnBi2Te4 with nontrivial topological properties and magnetic order has become a promising system for exploring exotic quantum phenomena such as quantum anomalous Hall effect. However, the layer-dependent magnetism of MnBi2Te4, which is fundamental and crucial for further exploration of quantum phenomena in this system, remains elusive. Here, we use polar reflective magnetic circular dichroism spectroscopy, combined with theoretical calculations, to obtain an in-depth understanding of the layer-dependent magnetic properties in MnBi2Te4. The magnetic behavior of MnBi2Te4 exhibits evident odd-even layer-number effect, i.e. the oscillations of the coercivity of the hysteresis loop (at μ0Hc) and the spin-flop transition (at μ0H1), concerning the Zeeman energy and magnetic anisotropy energy. In the even-number septuple layers, an anomalous magnetic hysteresis loop is observed, which is attributed to the thickness-independent surface-related magnetization. Through the linear-chain model, we can clarify the odd-even effect of the spin-flop field and determine the evolution of magnetic states under the external magnetic field. The mean-field method also allows us to trace the experimentally observed magnetic phase diagrams to the magnetic fields, layer numbers and especially, temperature. Overall, by harnessing the unusual layer-dependent magnetic properties, our work paves the way for further study of quantum properties of MnBi2Te4.

preprint2020arXiv

Pre-resolutions of noncommutative isolated singularities

We introduce the notion of right pre-resolutions (quasi-resolutions) for noncommutative isolated singularities, which is a weaker version of quasi-resolutions introduced by Qin-Wang-Zhang. We prove that right quasi-resolutions for noetherian bounded below and locally finite graded algebra with right injective dimension 2 are always Morita equivalent. When we restrict to noncommutative quadric hypersurfaces, we prove that a noncommutative quadric hypersurface, which is a noncommutative isolated singularity, always admits a right pre-resolution. Besides, we provide a method to verify whether a noncommutative quadric hypersurface is an isolated singularity. An example of noncommutative quadric hypersurfaces with detailed computations of indecomposable maximal Cohen-Macaulay modules and right pre-resolutions is included as well.

preprint2020arXiv

Spin-Valley Locking Effect in Defect States of Monolayer MoS$_2$

Valley pseudospin in two-dimensional (2D) transition-metal dichalcogenides (TMDs) allows optical control of spin-valley polarization and intervalley quantum coherence. Defect states in TMDs give rise to new exciton features and theoretically exhibit spin-valley polarization; however, experimental achievement of this phenomenon remains challenges. Here, we report unambiguous valley pseudospin of defect-bound localized excitons in CVD-grown monolayer MoS2; enhanced valley Zeeman splitting with an effective g-factor of -6.2 is observed. Our results reveal that all five d-orbitals and the increased effective electron mass contribute to the band shift of defect states, demonstrating a new physics of the magnetic responses of defect-bound localized excitons, strikingly different from that of A excitons. Our work paves the way for the manipulation of the spin-valley degrees of freedom through defects toward valleytronic devices.

preprint2019arXiv

Correlating the Electronic Structures of Metallic/Semiconductor MoTe2 Interface to its Atomic Structures

Contact interface properties are important in determining the performances of devices based on atomically thin two-dimensional (2D) materials, especially those with short channels. Understanding the contact interface is therefore quite important to design better devices. Herein, we use scanning transmission electron microscopy, electron energy loss spectroscopy, and first-principles calculations to reveal the electronic structures within the metallic (1T&#39;)-semiconducting (2H) MoTe2 coplanar phase boundary across a wide spectral range and correlate its properties and atomic structure. We find that the 2H-MoTe2 excitonic peaks cross the phase boundary into the 1T&#39; phase within a range of approximately 150 nm. The 1T&#39;-MoTe2 crystal field can penetrate the boundary and extend into the 2H phase by approximately two unit cells. The plasmonic oscillations exhibit strong angle dependence, i.e., a red-shift (approximately 0.3 eV-1.2 eV) occurs within 4 nm at 1T&#39;/2H-MoTe2 boundaries with large tilt angles, but there is no shift at zero-tilted boundaries. These atomic-scale measurements reveal the structure-property relationships of 1T&#39;/2H-MoTe2 boundary, providing useful information for phase boundary engineering and device development based on 2D materials.