Researcher profile

Haijun Zhang

Haijun Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
26works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

26 published item(s)

preprint2026arXiv

Offline Policy Optimization with Posterior Sampling

A fundamental challenge in model-based offline reinforcement learning (RL) lies in the trade-off between generalization and robustness against exploitation errors in out-of-distribution (OOD) regions. While OOD samples may capture valid underlying physical dynamics, they also introduce the risk of model exploitation. Existing methods typically address this risk through excessive pessimistic regularization, which ensures robustness but often sacrifices generalization. To overcome this limitation, we propose Posterior Sampling-based Policy Optimization (PSPO), which formulates dynamics modeling as a Bayesian inference process to derive a posterior that explicitly quantifies model fidelity. Through the integration of posterior sampling and constrained policy optimization, our method leverages dynamics-consistent OOD transitions for generalization while ensuring robustness against model exploitation. Theoretically, we formulate Q-value estimation under posterior sampling as a stochastic approximation problem and establish its convergence. We decompose policy optimization into a sequence of constrained subproblems, demonstrating that solving these subproblems guarantees monotonic improvement until convergence. Experiments on standard benchmarks validate that PSPO achieves superior performance compared to state-of-the-art baselines.

preprint2026arXiv

Report of the 5th PVUW Challenge: Towards More Diverse Modalities in Pixel-Level Understanding

This report summarizes the objectives, datasets, and top-performing methodologies of the 2026 Pixel-level Video Understanding in the Wild (PVUW) Challenge, hosted at CVPR 2026, which evaluates state-of-the-art models under highly unconstrained conditions. To provide a comprehensive assessment, the 2026 edition features three specialized tracks: the MOSE track for tracking objects within densely cluttered and severely occluded scenarios; the MeViS-Text track for localizing targets via motion-focused linguistic expressions; and the newly inaugurated MeViS-Audio track, which pioneers acoustic-driven object segmentation. By introducing previously unreleased challenging data and analyzing the cutting-edge, multimodal solutions submitted by participants, this report highlights the community's latest technical advancements and charts promising future directions for robust video scene comprehension.

preprint2026arXiv

Towards a Theoretical Framework for Robust Node Deployment in Cooperative ISAC Networks

This paper investigates node deployment strategies for robust multi-node cooperative localization in integrated sensing and communication (ISAC) networks.We first analyze how steering vector correlation across different positions affects localization performance and introduce a novel distance-weighted correlation metric to characterize this effect. Building upon this insight, we propose a deployment optimization framework that minimizes the maximum weighted steering vector correlation by optimizing simultaneously node positions and array orientations, thereby enhancing worst-case network robustness. Then, a genetic algorithm (GA) is developed to solve this min-max optimization, yielding optimized node positions and array orientations. Extensive simulations using both multiple signal classification (MUSIC) and neural-network (NN)-based localization validate the effectiveness of the proposed methods, demonstrating significant improvements in robust localization performance.

preprint2025arXiv

Empower Low-Altitude Economy: A Reliability-Aware Dynamic Weighting Allocation for Multi-modal UAV Beam Prediction

The low-altitude economy (LAE) is rapidly expanding driven by urban air mobility, logistics drones, and aerial sensing, while fast and accurate beam prediction in uncrewed aerial vehicles (UAVs) communications is crucial for achieving reliable connectivity. Current research is shifting from single-signal to multi-modal collaborative approaches. However, existing multi-modal methods mostly employ fixed or empirical weights, assuming equal reliability across modalities at any given moment. Indeed, the importance of different modalities fluctuates dramatically with UAV motion scenarios, and static weighting amplifies the negative impact of degraded modalities. Furthermore, modal mismatch and weak alignment further undermine cross-scenario generalization. To this end, we propose a reliability-aware dynamic weighting scheme applied to a semantic-aware multi-modal beam prediction framework, named SaM2B. Specifically, SaM2B leverages lightweight cues such as environmental visual, flight posture, and geospatial data to adaptively allocate contributions across modalities at different time points through reliability-aware dynamic weight updates. Moreover, by utilizing cross-modal contrastive learning, we align the "multi-source representation beam semantics" associated with specific beam information to a shared semantic space, thereby enhancing discriminative power and robustness under modal noise and distribution shifts. Experiments on real-world low-altitude UAV datasets show that SaM2B achieves more satisfactory results than baseline methods.

preprint2025arXiv

No Vision, No Wearables: 5G-based 2D Human Pose Recognition with Integrated Sensing and Communications

With the increasing maturity of contactless human pose recognition (HPR) technology, indoor interactive applications have raised higher demands for natural, controller-free interaction methods. However, current mainstream HPR solutions relying on vision or radio-frequency (RF) (including WiFi, radar) still face various challenges in practical deployment, such as privacy concerns, susceptibility to occlusion, dedicated equipment and functions, and limited sensing resolution and range. 5G-based integrated sensing and communication (ISAC) technology, by merging communication and sensing functions, offers a new approach to address these challenges in contactless HPR. We propose a practical 5G-based ISAC system capable of inferring 2D HPR from uplink sounding reference signals (SRS). Specifically, rich features are extracted from multiple domains and employ an encoder to achieve unified alignment and representation in a latent space. Subsequently, low-dimensional features are fused to output the human pose state. Experimental results demonstrate that in typical indoor environments, our proposed 5G-based ISAC HPR system significantly outperforms current mainstream baseline solutions in HPR performance, providing a solid technical foundation for universal human-computer interaction.

preprint2022arXiv

AI-aided Traffic Control Scheme for M2M Communications in the Internet of Vehicles

Due to the rapid growth of data transmissions in internet of vehicles (IoV), finding schemes that can effectively alleviate access congestion has become an important issue. Recently, many traffic control schemes have been studied. Nevertheless, the dynamics of traffic and the heterogeneous requirements of different IoV applications are not considered in most existing studies, which is significant for the random access resource allocation. In this paper, we consider a hybrid traffic control scheme and use proximal policy optimization (PPO) method to tackle it. Firstly, IoV devices are divided into various classes based on delay characteristics. The target of maximizing the successful transmission of packets with the success rate constraint is established. Then, the optimization objective is transformed into a markov decision process (MDP) model. Finally, the access class barring (ACB) factors are obtained based on the PPO method to maximize the number of successful access devices. The performance of the proposal algorithm in respect of successful events and delay compared to existing schemes is verified by simulations.

preprint2022arXiv

Direct Visualization and Manipulation of Tunable Quantum Well State in Semiconducting Nb2SiTe4

Quantum well states (QWSs) can form at the surface or interfaces of materials with confinement potential. They have broad applications in electronic and optical devices such as high mobility electron transistor, photodetector and quantum well laser. The properties of the QWSs are usually the key factors for the performance of the devices. However, direct visualization and manipulation of such states are in general challenging. In this work, by using angle-resolved photoemission spectroscopy (ARPES) and scanning tunneling microscopy/spectroscopy (STM/STS), we directly probe the QWSs generated on the vacuum interface of a narrow band gap semiconductor Nb2SiTe4. Interestingly, the position and splitting of QWSs could be easily manipulated via potassium (K) dosage onto the sample surface. Our results suggest Nb2SiTe4 to be an intriguing semiconductor system to study and engineer the QWSs, which has great potential in device applications.

preprint2022arXiv

Joint Caching and Transmission in the Mobile Edge Network: A Multi-Agent Learning Approach

Joint caching and transmission optimization problem is challenging due to the deep coupling between decisions. This paper proposes an iterative distributed multi-agent learning approach to jointly optimize caching and transmission. The goal of this approach is to minimize the total transmission delay of all users. In this iterative approach, each iteration includes caching optimization and transmission optimization. A multi-agent reinforcement learning (MARL)-based caching network is developed to cache popular tasks, such as answering which files to evict from the cache and which files to storage. Based on the cached files of the caching network, the transmission network transmits cached files for users by single transmission (ST) or joint transmission (JT) with multi-agent Bayesian learning automaton (MABLA) method. And then users access the edge servers with the minimum transmission delay. The experimental results demonstrate the performance of the proposed multi-agent learning approach.

preprint2022arXiv

Large Exchange Bias Effect and Coverage-Dependent Interfacial Coupling in CrI3/MnBi2Te4 van der Waals Heterostructures

Igniting interface magnetic ordering of magnetic topological insulators by building a van der Waals heterostructure can help to reveal novel quantum states and design functional devices. Here, we observe an interesting exchange bias effect, indicating successful interfacial magnetic coupling, in CrI3/MnBi2Te4 ferromagnetic insulator/antiferromagnetic topological insulator (FMI/AFM-TI) heterostructure devices. The devices originally exhibit a negative exchange bias field, which decays with increasing temperature and is unaffected by the back-gate voltage. When we change the device configuration to be half-covered by CrI3, the exchange bias becomes positive with a very large exchange bias field exceeding 300 mT. Such sensitive manipulation is explained by the competition between the FM and AFM coupling at the interface of CrI3 and MnBi2Te4, pointing to coverage-dependent interfacial magnetic interactions. Our work will facilitate the development of topological and antiferromagnetic devices.

preprint2022arXiv

Multiple topological nodal structure in LaSb2 with large linear magnetoresistance

Unconventional fermions in the immensely studied topological semimetals are the source for rich exotic topological properties. Here, using symmetry analysis and first-principles calculations, we propose the coexistence of multiple topological nodal structure in LaSb2, including topological nodal surfaces, nodal lines and in particular eightfold degenerate nodal points, which have been scarcely observed in a single material. Further, utilizing high resolution angle-resolved photoemission spectroscopy in combination with Shubnikov-de Haas quantum oscillations measurements, we confirm the existence of nodal surfaces and eightfold degenerate nodal points in LaSb2, and extract the π Berry phase proving the non-trivial electronic band structure topology therein. The intriguing multiple topological nodal structure might play a crucial role in giving rise to the large linear magnetoresistance. Our work renews the insights into the exotic topological phenomena in LaSb2 and its analogous.

preprint2022arXiv

Performance Analysis of Fog-Aided D2D Networks with Multicast-Based Opportunistic Content Delivery

In this paper, we develop a comprehensive and tractable analytical framework based on stochastic geometry to evaluate the performance of large-scale fog-aided device-to-device (F-D2D) networks with opportunistic content multicasting. As a part of the analysis, to resolve the contentions of file requests from the cache-incapable conventional user equipments (C-UEs), two simple yet typical candidate file selection schemes for cache-enabled fog user equipments (F-UEs), namely the random file selection (RFS) scheme and the most requested file selection (MRFS) scheme, are considered. Further, to suppress the harmful interference among the concurrent transmissions of F-UEs, a multicast-based opportunistic content delivery strategy is proposed by exploring the idea of opportunistic spectrum access (OSA). Assuming decentralized probabilistic caching, we first derive the activation probability of the F-UEs. Then, by adopting an appropriate approximation, the cache-hit probability, the coverage probability, and thereby the successful content delivery probability (SCDP) of the F-D2D network are evaluated. We also develop an iterative algorithm based on the gradient projection method to obtain a suboptimal caching policy for the maximization of SCDP. Extensive simulation and numerical results are presented to verify our analysis and demonstrate the superior performance of the proposed multicast-based opportunistic content delivery strategy.

preprint2022arXiv

Proximal Policy Optimization-based Transmit Beamforming and Phase-shift Design in an IRS-aided ISAC System for the THz Band

In this paper, an IRS-aided integrated sensing and communications (ISAC) system operating in the terahertz (THz) band is proposed to maximize the system capacity. Transmit beamforming and phase-shift design are transformed into a universal optimization problem with ergodic constraints. Then the joint optimization of transmit beamforming and phase-shift design is achieved by gradient-based, primal-dual proximal policy optimization (PPO) in the multi-user multiple-input single-output (MISO) scenario. Specifically, the actor part generates continuous transmit beamforming and the critic part takes charge of discrete phase shift design. Based on the MISO scenario, we investigate a distributed PPO (DPPO) framework with the concept of multi-threading learning in the multi-user multiple-input multiple-output (MIMO) scenario. Simulation results demonstrate the effectiveness of the primal-dual PPO algorithm and its multi-threading version in terms of transmit beamforming and phase-shift design.

preprint2022arXiv

Reconfigurable Intelligent Surface With Energy Harvesting Assisted Cooperative Ambient Backscatter Communications

The performance of cooperative ambient backscatter communications (CABC) can be enhanced by employing reconfigurable intelligent surface (RIS) to assist backscatter transmitters. Since the RIS power consumption is a non-negligible issue, we consider a RIS assisted CABC system where the RIS with energy harvesting circuit can not only reflect signal but also harvest wireless energy. We study a transmission design problem to minimize the RIS power consumption with the quality of service constraints for both active and backscatter transmissions. The optimization problem is a mixed-integer non-convex programming problem which is NP-hard. To tackle it, an algorithm is proposed by employing the block coordinate descent, semidefinite relaxation and alternating direction method of multipliers techniques. Simulation results demonstrate the effectiveness of the proposed algorithm.

preprint2021arXiv

Customized Slicing for 6G: Enforcing Artificial Intelligence on Resource Management

Next generation wireless networks are expected to support diverse vertical industries and offer countless emerging use cases. To satisfy stringent requirements of diversified services, network slicing is developed, which enables service-oriented resource allocation by tailoring the infrastructure network into multiple logical networks. However, there are still some challenges in cross-domain multi-dimensional resource management for end-to-end (E2E) slices under the dynamic and uncertain environment. Trading off the revenue and cost of resource allocation while guaranteeing service quality is significant to tenants. Therefore, this article introduces a hierarchical resource management framework, utilizing deep reinforcement learning in admission control of resource requests from different tenants and resource adjustment within admitted slices for each tenant. Particularly, we first discuss the challenges in customized resource management of 6G. Second, the motivation and background are presented to explain why artificial intelligence (AI) is applied in resource customization of multi-tenant slicing. Third, E2E resource management is decomposed into two problems, multi-dimensional resource allocation decision based on slice-level feedback and real-time slice adaption aimed at avoiding service quality degradation. Simulation results demonstrate the effectiveness of AI-based customized slicing. Finally, several significant challenges that need to be addressed in practical implementation are investigated.

preprint2021arXiv

Evidence of topological nodal lines and surface states in the centrosymmetric superconductor SnTaS2

The discovery of signatures of topological superconductivity in superconducting bulk materials with topological surface states has attracted intensive research interests recently. Utilizing angle-resolved photoemission spectroscopy and first-principles calculations, here, we demonstrate the existence of topological nodal-line states and drumheadlike surface states in centrosymmetric superconductor SnTaS2, which is a type-II superconductor with a critical transition temperature of about 3 K. The valence bands from Ta 5d orbitals and the conduction bands from Sn 5p orbitals cross each other, forming two nodal lines in the vicinity of the Fermi energy without the inclusion of spin-orbit coupling (SOC), protected by the spatial-inversion symmetry and time-reversal symmetry. The nodal lines are gapped out by SOC. The drumheadlike surface states, the typical characteristics in nodal-line semimetals, are quite visible near the Fermi level. Our findings indicate that SnTaS2 offers a promising platform for exploring the exotic properties of the topological nodal-line fermions and gives a help to study topological superconductivity.

preprint2021arXiv

Experimental evidence on the dissipationless transport of chiral edge state of the high-field Chern insulator in MnBi2Te4 nanodevices

We demonstrate the dissipationless transport of the chiral edge state (CES) in the nanodevices of quantum anomalous Hall insulator candidate MnBi2Te4. The device presents a near-zero longitudinal resistance together with a quantized Hall plateau in excess of 0.97 h/e2 over a range of temperatures from very low up to the Neel temperature of 22 K. Each of four-probe nonlocal measurements gives near-zero resistance and two-probe measurements exhibit a plateau of +1 h/e2, while the results of three-probe nonlocal measurements depend on the magnetic field. This indicates non-dissipation as well as the chirality of the edge state. The CES shows three regimes of temperature dependence, i.e., well-preserved dissipationless transport below 6 K, variable range hopping while increasing the temperature and thermal activation at higher than 22 K. Even at the lowest temperature, a current of over 1.4 μA breaks the dissipationless transport. These form a complete set of evidences of the Chern insulator state in the MnBi2Te4 systems.

preprint2020arXiv

Aggressive Congestion Control Mechanism for Space Systems

How to implement an impeccable space system-of-systems (SoS) internetworking architecture has been a significant issue in system engineering for years. Reliable data transmission is considered one of the most important technologies of space SoS internetworking systems. Due to the high bit error rate (BER), long time delay and asymmetrical channel in the space communication environment, the congestion control mechanism of classic transport control protocols (TCP) shows unsatisfying performances. With the help of existing TCP modifications, this paper contributes an aggressive congestion control mechanism. The proposed mechanism is characterized with a fast start procedure, as well as the feedback information to analyze network traffic and with a link terminating processing mechanism, which can help to reveal the real reason of packet loss, and maintain the size of congestion window at a high level. Simulation results are shown in the end to verify the proposed scheme.

preprint2020arXiv

Compressed DenseNet for Lightweight Character Recognition

Convolutional Recurrent Neural Network (CRNN) is a popular network for recognizing texts in images. Advances like the variant of CRNN, such as Dense Convolutional Network with Connectionist Temporal Classification, has reduced the running time of the network, but exposing the inner computation cost and weight size of the convolutional networks as a bottleneck. Specifically, the DenseNet based models utilize the dense blocks as the core module, but the inner features are combined in the form of concatenation in dense blocks. As such, the number of channels of combined features delivered as the input of the layers close to the output and the relevant computational cost grows rapidly with the dense blocks getting deeper. This will severely bring heavy computational cost and big weight size, which restrict the depth of dense blocks. In this paper, we propose a compressed convolution block called Lightweight Dense Block (LDB). To reduce the computing cost and weight size, we re-define and re-design the way of combining internal features of the dense blocks. LDB is a convolutional block similarly as dense block, but it can reduce the computation cost and weight size to (1/L, 2/L), compared with original ones, where L is the number of layers in blocks. Moreover, LDB can be used to replace the original dense block in any DenseNet based models. Based on the LDBs, we propose a Compressed DenseNet (CDenseNet) for the lightweight character recognition. Extensive experiments demonstrate that CDenseNet can effectively reduce the weight size while delivering the promising recognition results.

preprint2020arXiv

Deep Learning based Radio Resource Management in NOMA Networks: User Association, Subchannel and Power Allocation

With the rapid development of future wireless communication, the combination of NOMA technology and millimeter-wave(mmWave) technology has become a research hotspot. The application of NOMA in mmWave heterogeneous networks can meet the diverse needs of users in different applications and scenarios in future communications. In this paper, we propose a machine learning framework to deal with the user association, subchannel and power allocation problems in such a complex scenario. We focus on maximizing the energy efficiency (EE) of the system under the constraints of quality of service (QoS), interference limitation, and power limitation. Specifically, user association is solved through the Lagrange dual decomposition method, while semi-supervised learning and deep neural network (DNN) are used for the subchannel and power allocation, respectively. In particular, unlabeled samples are introduced to improve approximation and generalization ability for subchannel allocation. The simulation indicates that the proposed scheme can achieve higher EE with lower complexity.

preprint2020arXiv

Dynamical axion state with hidden pseudospin Chern numbers in MnBi$_{2}$Te$_{4}$-based heterostructures

Axion is a hypothetical elementary particle which was initially postulated to solve the charge conjugation-parity problem in particle physics. Interestingly, the axion state has emerged in effective theory of topological insulators and has attracted extensive attention in condensed matter physics. Time-reversal or inversion symmetry constrains the axion field $θ$ to be quantized. When both the time-reversal and inversion symmetries are broken by, say, an antiferromagnetic order, the axion field $θ$ could become unquantized and dynamical along with magnetic fluctuations, which is termed the dynamical axion field. Here, we reveal that a wide class of topological-insulator-based dynamical axion states could be distinguished from the normal-insulator-based ones by a hidden quantity derived from the pseudospin Chern number. Motivated by recent research on MnBi$_{2}$Te$_{4}$-family materials, we further show that such topological-insulator-based dynamical axion states can be hopefully achieved in MnBi$_{2}$Te$_{4}$-based heterostructures, which should greatly facilitate the study of axion electrodynamics in condensed matter physics.

preprint2020arXiv

Energy Efficiency Optimization for NOMA UAV Network with Imperfect CSI

Unmanned aerial vehicles (UAVs) are developing rapidly owing to flexible deployment and access services as air base stations. However, the channel errors of low-altitude communication links formed by mobile deployment of UAVs cannot be ignored. And the energy efficiency of the UAVs communication with imperfect channel state information (CSI) hasnt been well studied yet. Therefore, we focus on system performance optimization in non-orthogonal multiple access (NOMA) UAV network considering imperfect CSI between the UAV and users. A suboptimal resource allocation scheme including user scheduling and power allocation is designed for maximizing energy efficiency. Because of the nonconvexity of optimization function with an probability constraint for imperfect CSI, the original problem is converted into a non-probability problem and then decoupled into two convex subproblems. First, a user scheduling method is applied in the two-side matching of users and subchannels by the difference of convex programming. Then based on user scheduling, the energy efficiency in UAV cells is optimized through a suboptimal power allocation algorithm by successive convex approximation method. The simulation results prove that the proposed algorithm is effective compared with existing resource allocation schemes.

preprint2020arXiv

Energy Efficient User Clustering, Hybrid Precoding and Power Optimization in Terahertz MIMO-NOMA Systems

Terahertz (THz) band communication has been widely studied to meet the future demand for ultra-high capacity. In addition, multi-input multi-output (MIMO) technique and non-orthogonal multiple access (NOMA) technique with multi-antenna also enable the network to carry more users and provide multiplexing gain. In this paper, we study the maximization of energy efficiency (EE) problem in THz-NOMA-MIMO systems for the first time. And the original optimization problem is divided into user clustering, hybrid precoding and power optimization. Based on channel correlation characteristics, a fast convergence scheme for user clustering in THz-NOMA-MIMO system using enhanced K-means machine learning algorithm is proposed. Considering the power consumption and implementation complexity, the hybrid precoding scheme based on the sub-connection structure is adopted. Considering the fronthaul link capacity constraint, we design a distributed alternating direction method of multipliers (ADMM) algorithm for power allocation to maximize the EE of THz-NOMA cache-enabled system with imperfect successive interference cancellation (SIC). The simulation results show that the proposed user clustering scheme can achieve faster convergence and higher EE, the design of the hybrid precoding of the sub-connection structure can achieve lower power consumption and power optimization can achieve a higher EE for the THz cache-enabled network.

preprint2020arXiv

Large dynamical axion field in topological antiferromagnetic insulator Mn$_2$Bi$_2$Te$_5$

The dynamical axion field is a new state of quantum matter where the magnetoelectric response couples strongly to its low-energy magnetic fluctuations. It is fundamentally different from an axion insulator with a static quantized magnetoelectric response. The dynamical axion field exhibits many exotic phenomena such as axionic polariton and axion instability. However, these effects have not been experimentally confirmed due to the lack of proper topological magnetic materials. Here by combining analytic models and first-principles calculations, we predict a series of van der Waal layered Mn$_2$Bi$_2$Te$_5$-related topological antiferromagnetic materials could host the long-sought dynamical axion field with a topological origin. We also show a large dynamical axion field can be achieved in antiferromagnetic insulating states close to the topological phase transition. We further propose the optical and transport experiments to detect such a dynamical axion field. Our results could directly aid and facilitate the search for topological-origin large dynamical axion field in realistic materials.

preprint2020arXiv

Large Magnetoresistance in Topological Insulator Candidate TaSe3

Large unsaturated magnetoresistance (XMR) with magnitude about 1000% is observed in topological insulator candidate TaSe3 from our high field (up to 38 T) measurements. Two oscillation modes, associated with one hole pocket and two electron pockets in the bulk, respectively, are detected from our Shubnikov-de Hass (SdH) measurements, consistent with our first-principles calculations. With the detailed Hall measurements performed, our two-band model analysis exhibits an imperfect density ratio n_h/n_e closing 0.9 at T< 20 K , which suggests that the carrier compensations account for the XMR in TaSe3.

preprint2020arXiv

Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks

Future wireless networks have a substantial potential in terms of supporting a broad range of complex compelling applications both in military and civilian fields, where the users are able to enjoy high-rate, low-latency, low-cost and reliable information services. Achieving this ambitious goal requires new radio techniques for adaptive learning and intelligent decision making because of the complex heterogeneous nature of the network structures and wireless services. Machine learning (ML) algorithms have great success in supporting big data analytics, efficient parameter estimation and interactive decision making. Hence, in this article, we review the thirty-year history of ML by elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning. Furthermore, we investigate their employment in the compelling applications of wireless networks, including heterogeneous networks (HetNets), cognitive radios (CR), Internet of things (IoT), machine to machine networks (M2M), and so on. This article aims for assisting the readers in clarifying the motivation and methodology of the various ML algorithms, so as to invoke them for hitherto unexplored services as well as scenarios of future wireless networks.

preprint2019arXiv

Experimental observation of the gate-controlled reversal of the anomalous Hall effect in the intrinsic magnetic topological insulator MnBi2Te4 device

Here we report the reserved anomalous Hall effect (AHE) in the 5-septuple-layer van der Waals device of the intrinsic magnetic topological insulator MnBi2Te4. By employing the top/bottom gate, a negative AHE loop gradually decreases to zero and changes to a reversed sign. The reversed AHE exhibits distinct coercive fields and temperature dependence from the previous AHE. It reaches the maximum inside the gap of the Dirac cone. The newly-seen reversed AHE is attributed to the competition of the intrinsic Berry curvature and the Dirac-gap enhanced extrinsic skew scattering. Its gate-controlled switching contributes a scheme for the topological spin field-effect transistors.