Researcher profile

Yonghui Li

Yonghui Li contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
39 works
0 followers
12 topics
4 close collaborators

Published work

39 published item(s)

preprint | 2026 | arXiv

Hybrid Centralized Distributed Control for Lifelong MAPF over Wireless Connections

In lifelong multi-agent path finding (MAPF) with many robots, unreliable wireless links and stochastic executions are the norm. Existing approaches typically either rely on centralized planning under idealized communication, or run fully distributed local controllers with fixed communication patterns; they rarely couple communication scheduling with policy learning, and thus struggle when bandwidth is scarce or packets are frequently dropped. We address this joint control-communication problem and propose a hybrid centralized-distributed scheme: a centralized cloud policy sends small residual corrections only when selected, while a lightweight on-board gated recurrent unit (GRU) policy provides a safe default fallback when the wireless connection is unavailable.
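
The fallback idea in this abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the on-board policy outputs one score per candidate move and that the cloud correction is a residual added to those scores; the function name `choose_move` and this score-based interface are hypothetical and are not taken from the paper.

```python
from typing import Optional, Sequence


def choose_move(local_scores: Sequence[float],
                residual: Optional[Sequence[float]] = None) -> int:
    """Pick the index of the best move.

    local_scores: scores from the on-board policy (hypothetical interface).
    residual: cloud correction, or None when no packet arrived in time.
    """
    scores = list(local_scores)
    if residual is not None:
        if len(residual) != len(scores):
            raise ValueError("residual must match the number of moves")
        # Apply the cloud's correction on top of the on-board scores.
        scores = [s + r for s, r in zip(scores, residual)]
    # With no residual, this is exactly the on-board default decision.
    return max(range(len(scores)), key=scores.__getitem__)


# Packet dropped: the robot follows its own policy.
fallback_move = choose_move([0.1, 0.7, 0.2])
# Packet received: the residual shifts the decision.
corrected_move = choose_move([0.1, 0.7, 0.2], [0.0, -0.6, 0.0])
```

The point of the design is that a lost packet never leaves the robot without an action: the residual only adjusts a decision that the on-board policy can already make by itself.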

preprint | 2023 | arXiv

HARQ Optimization for Real-Time Remote Estimation in Wireless Networked Control

This paper analyzes wireless networked control for remote estimation of linear time-invariant dynamical systems under various Hybrid Automatic Repeat Request (HARQ) packet retransmission schemes. In conventional HARQ, packet reliability increases gradually with additional packets; however, each retransmission maximally increases the Age of Information (AoI) and causes severe degradation in estimation mean squared error (MSE) performance. We optimize standard HARQ schemes by allowing partial retransmissions to increase the packet reliability gradually and limit the AoI growth. In incremental redundancy HARQ, we optimize the retransmission time to enable the early arrival of the next status updates. In Chase combining HARQ, since packet length remains fixed, we allow retransmission and new updates in a single time slot using non-orthogonal signaling. Non-orthogonal retransmissions increase packet reliability without delaying the fresh updates. We formulate a bi-objective optimization with the proposed variance-of-the-MSE-based cost function and the standard long-term average MSE cost function to guarantee short-term performance stability. Using the Markov decision process formulation, we find the optimal static and dynamic policies under the proposed HARQ schemes to improve MSE performance further. The simulation results show that the proposed HARQ-based policies are more robust and achieve significantly better and more stable MSE performance than standard HARQ-based policies.

preprint | 2023 | arXiv

Partially Concatenated Calderbank-Shor-Steane Codes Achieving the Quantum Gilbert-Varshamov Bound Asymptotically

In this paper, we utilize a concatenation scheme to construct new families of quantum error correction codes achieving the quantum Gilbert-Varshamov (GV) bound asymptotically. We concatenate alternant codes with any linear code achieving the classical GV bound to construct Calderbank-Shor-Steane (CSS) codes. We show that the concatenated code can achieve the quantum GV bound asymptotically and can approach the Hashing bound for asymmetric Pauli channels. By combining Steane's enlargement construction of CSS codes, we derive a family of enlarged stabilizer codes achieving the quantum GV bound for enlarged CSS codes asymptotically. As applications, we derive two families of fast encodable and decodable CSS codes with parameters $\mathscr{Q}_1=[[N,\Omega(\sqrt{N}),\Omega(\sqrt{N})]]$ and $\mathscr{Q}_2=[[N,\Omega(N/\log N),\Omega(N/\log N)/\Omega(\log N)]]$. We show that $\mathscr{Q}_1$ can be encoded very efficiently by circuits of size $O(N)$ and depth $O(\sqrt{N})$. For an input error syndrome, $\mathscr{Q}_1$ can correct any adversarial error of weight up to half the minimum distance bound in $O(N)$ time. $\mathscr{Q}_1$ can also be decoded in parallel in $O(\sqrt{N})$ time by using $O(\sqrt{N})$ classical processors. For an input error syndrome, we prove that $\mathscr{Q}_2$ can correct a linear number of $X$-errors with high probability and an almost linear number of $Z$-errors in $O(N)$ time. Moreover, $\mathscr{Q}_2$ can be decoded in parallel in $O(\log N)$ time by using $O(N)$ classical processors.
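
For orientation, the standard CSS construction that this abstract builds on can be written as below. This is textbook background and not the paper's concatenated construction; the symbols $C_1$, $C_2$, $k_i$, $d_i$, $R$ and $\delta$ are introduced here only for illustration and do not appear in the abstract. The second line is the GV-type rate commonly quoted for CSS codes, where $R$ is the rate and $\delta$ the relative distance.

```latex
% Textbook CSS construction from two classical linear codes (background only).
C_2^{\perp} \subseteq C_1, \quad C_1 = [n, k_1, d_1], \quad C_2 = [n, k_2, d_2]
\;\Longrightarrow\;
\mathcal{Q} = [[\, n,\; k_1 + k_2 - n,\; d \ge \min(d_1, d_2) \,]]

% GV-type existence bound commonly quoted for CSS codes.
R \;\ge\; 1 - 2 H_2(\delta), \qquad
H_2(x) = -x \log_2 x - (1 - x) \log_2 (1 - x)
```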

preprint | 2022 | arXiv

DRL-based Resource Allocation in Remote State Estimation

Remote state estimation, where sensors send their measurements of distributed dynamic plants to a remote estimator over shared wireless resources, is essential for mission-critical applications of Industry 4.0. Existing algorithms for dynamic radio resource allocation in remote estimation systems assume oversimplified wireless communication models and only work in small-scale settings. In this work, we consider remote estimation systems with practical wireless models over the orthogonal multiple-access and non-orthogonal multiple-access schemes. We derive necessary and sufficient conditions under which remote estimation systems can be stabilized. The conditions are described in terms of the transmission power budget, channel statistics, and plants' parameters. For each multiple-access scheme, we formulate a novel dynamic resource allocation problem as a decision-making problem for achieving the minimum overall long-term average estimation mean-square error. Both the estimation quality and the channel quality states are taken into account for decision making. We systematically investigate the problems under different multiple-access schemes with large discrete, hybrid discrete-and-continuous, and continuous action spaces, respectively. We propose novel action-space compression methods and develop advanced deep reinforcement learning algorithms to solve the problems. Numerical results show that our algorithms solve the resource allocation problems effectively and provide much better scalability than existing methods in the literature.

preprint | 2022 | arXiv

Elevation Angle-Dependent 3D Trajectory Design for Aerial RIS-aided Communication

This paper investigates an aerial reconfigurable intelligent surface (RIS)-aided communication system under the probabilistic line-of-sight (LoS) channel, where an unmanned aerial vehicle (UAV) equipped with an RIS is deployed to assist two ground nodes in their information exchange. An optimization problem with the objective of maximizing the minimum average achievable rate is formulated to jointly design the communication scheduling, the RIS's phase shift, and the three-dimensional (3D) UAV trajectory. To solve such a non-convex problem, we propose an efficient iterative algorithm to obtain its suboptimal solution. Simulation results show that our proposed design significantly outperforms the existing schemes and provides new insights into the elevation angle and distance trade-off for the UAV-borne RIS communication system.

preprint | 2022 | arXiv

Full-Dimensional Rate Enhancement for UAV-Enabled Communications via Intelligent Omni-Surface

This paper investigates the achievable rate maximization problem of a downlink unmanned aerial vehicle (UAV)-enabled communication system aided by an intelligent omni-surface (IOS). Unlike the state-of-the-art reconfigurable intelligent surface (RIS), which only reflects incident signals, the IOS can simultaneously reflect and transmit the signals, thereby providing full-dimensional rate enhancement. We formulate the problem as a joint optimization of the IOS's phase shift and the UAV trajectory. Although the problem is difficult to solve optimally due to its non-convexity, we propose an efficient iterative algorithm to obtain a high-quality suboptimal solution. Simulation results show that IOS-assisted UAV communications achieve a larger improvement in achievable rate than the benchmark schemes.

preprint | 2022 | arXiv

Learning-based Predictive Beamforming for Integrated Sensing and Communication in Vehicular Networks

This paper investigates integrated sensing and communication (ISAC) in vehicle-to-infrastructure (V2I) networks. To realize ISAC, an effective beamforming design is essential; however, it depends heavily on the availability of accurate channel tracking, which requires large training overhead and computational complexity. Motivated by this, we adopt a deep learning (DL) approach to implicitly learn the features of historical channels and directly predict the beamforming matrix to be adopted for the next time slot to maximize the average achievable sum-rate of an ISAC system. The proposed method can bypass the need for an explicit channel tracking process and reduce the signaling overhead significantly. To this end, a general sum-rate maximization problem with Cramer-Rao lower bound (CRLB)-based sensing constraints is first formulated for the considered ISAC system, taking into account the multiple access interference. Then, by exploiting the penalty method, a versatile unsupervised DL-based predictive beamforming design framework is developed to address the formulated design problem. As a realization of the developed framework, a historical channels-based convolutional long short-term memory (LSTM) network (HCL-Net) is devised for predictive beamforming in the ISAC-based V2I network. Specifically, the convolution and LSTM modules are successively adopted in the proposed HCL-Net to exploit the spatial and temporal dependencies of communication channels to further improve the learning performance. Finally, simulation results show that the proposed predictive method not only guarantees the required sensing performance, but also achieves a satisfactory sum-rate that can approach the upper bound obtained by the genie-aided scheme with the perfect instantaneous channel state information available.

preprint | 2022 | arXiv

NOMA Joint Channel Estimation and Signal Detection using Rotational Invariant Codes and GMM-based Clustering

This paper studies joint channel estimation and signal detection for uplink power-domain non-orthogonal multiple access. The proposed technique performs both detection and estimation without the need for pilot symbols by using a clustering technique. We apply rotational-invariant coding to assist signal detection at the receiver without sending pilot symbols. We utilize a Gaussian mixture model (GMM) to automatically cluster the received signals without supervision and optimize decision boundaries to improve the bit error rate (BER) performance. Simulation results show that the proposed scheme, without using any pilot symbols, achieves almost the same BER performance as the conventional maximum likelihood receiver with full channel state information.

preprint | 2022 | arXiv

Predictive Beamforming for Integrated Sensing and Communication in Vehicular Networks: A Deep Learning Approach

The implementation of integrated sensing and communication (ISAC) highly depends on an effective beamforming design exploiting accurate instantaneous channel state information (ICSI). However, channel tracking in ISAC requires a large amount of training overhead and prohibitively large computational complexity. To address this problem, in this paper, we focus on ISAC-assisted vehicular networks and exploit a deep learning approach to implicitly learn the features of historical channels and directly predict the beamforming matrix for the next time slot to maximize the average achievable sum-rate of the system, thus bypassing the need for explicit channel tracking and reducing the system signaling overhead. To this end, a general sum-rate maximization problem with Cramer-Rao lower bound-based sensing constraints is first formulated for the considered ISAC system. Then, a historical channels-based convolutional long short-term memory network is designed for predictive beamforming that can exploit the spatial and temporal dependencies of communication channels to further improve the learning performance. Finally, simulation results show that the proposed method can satisfy the requirement of sensing performance, while its achievable sum-rate can approach the upper bound obtained by a genie-aided scheme with perfect ICSI available.

preprint | 2022 | arXiv

Proximal Policy Optimization-based Transmit Beamforming and Phase-shift Design in an IRS-aided ISAC System for the THz Band

In this paper, an IRS-aided integrated sensing and communications (ISAC) system operating in the terahertz (THz) band is proposed to maximize the system capacity. Transmit beamforming and phase-shift design are transformed into a universal optimization problem with ergodic constraints. Then the joint optimization of transmit beamforming and phase-shift design is achieved by gradient-based, primal-dual proximal policy optimization (PPO) in the multi-user multiple-input single-output (MISO) scenario. Specifically, the actor part generates continuous transmit beamforming and the critic part takes charge of discrete phase shift design. Based on the MISO scenario, we investigate a distributed PPO (DPPO) framework with the concept of multi-threading learning in the multi-user multiple-input multiple-output (MIMO) scenario. Simulation results demonstrate the effectiveness of the primal-dual PPO algorithm and its multi-threading version in terms of transmit beamforming and phase-shift design.

preprint | 2022 | arXiv

Reconfigurable Intelligent Surface-aided $M$-ary FM-DCSK System: a New Design for Noncoherent Chaos-based Communication

In this paper, we propose two reconfigurable intelligent surface-aided $M$-ary frequency-modulated differential chaos shift keying (RIS-$M$-FM-DCSK) schemes. In scheme I, the RIS is regarded as a transmitter at the source to incorporate the $M$-ary phase-shift-keying ($M$-PSK) symbols into the FM chaotic signal and to reflect the resultant $M$-ary FM chaotic signal toward the destination. The information bits of the source are carried by both the positive/negative state of the FM chaotic signal and the $M$-PSK symbols. In scheme II, the RIS is treated as a relay so that both the source and relay can simultaneously transmit their information bits to the destination. The information bits of the source and relay are carried by the positive/negative state of the FM chaotic signal and $M$-PSK symbols generated by the RIS, respectively. The proposed RIS-$M$-FM-DCSK system has the attractive advantage that it does not require channel state information for detection, thus avoiding complex channel estimation. Moreover, we derive the theoretical expressions for bit error rates (BERs) of the proposed RIS-$M$-FM-DCSK system with both scheme I and scheme II over multipath Rayleigh fading channels. Simulation results not only verify the accuracy of the theoretical derivations, but also demonstrate the superiority of the proposed system. The proposed RIS-$M$-FM-DCSK system is a promising low-cost, low-power, and high-reliability alternative for wireless communication networks.

preprint | 2022 | arXiv

Spatio-Temporal-Frequency Graph Attention Convolutional Network for Aircraft Recognition Based on Heterogeneous Radar Network

This paper proposes a knowledge-and-data-driven graph neural network-based collaboration learning model for reliable aircraft recognition in a heterogeneous radar network. The aircraft recognizability analysis shows that: (1) the semantic feature of an aircraft is motion patterns driven by the kinetic characteristics, and (2) the grammatical features contained in the radar cross-section (RCS) signals present spatial-temporal-frequency (STF) diversity decided by both the electromagnetic radiation shape and motion pattern of the aircraft. Then an STF graph attention convolutional network (STFGACN) is developed to distill semantic features from the RCS signals received by the heterogeneous radar network. Extensive experimental results verify that the STFGACN outperforms the baseline methods in terms of detection accuracy, and ablation experiments further show that expanding the information dimension yields considerable benefits for robust performance in the low signal-to-noise ratio region.

preprint | 2022 | arXiv

Trading Payoffs to Enlarged Neighborhoods? A New Evidence from Evolutionary Game Theory

Population diversity is an important aspect of Prisoner's Dilemma Game (PDG) research. However, the studies on dynamic diversity and its associated cost still need further investigation. Based on a framework comprising 2-dimensional spatial evolutionary PDG, this work examines the change in a player's neighborhood by enabling each player to pay for an upgrade of their neighborhood to switch from the von Neumann to Moore neighborhood. The upgrade cost (i.e., the cost of the advanced neighborhood) plays a vital role in cooperation promotion and serves as an entry-level to screen players. The results show that a reasonable price (entry-level) supports the cooperators' survival in an environment with high dilemma strength since it allows the formation of "normal-edge-advantage-core" clusters. On the low entry-level side, the privilege of having a larger neighborhood supports cooperation if it is accessible to all the players. On the high entry-level side, encirclements of advantage defectors appear out of the cooperative clusters. To break the encirclement and enable the expansion of the advantage clusters, the entry-level should be increased to interrupt the advantage defectors. The encirclement can be observed only in the deterministic models. Stochastic simulations are provided as robustness benchmarks.
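
The two neighborhoods named in this abstract can be sketched as follows. The von Neumann neighborhood contains the four orthogonally adjacent cells and the Moore neighborhood adds the four diagonal ones. Periodic (wrap-around) boundaries and the function name `neighbors` are assumptions made for this illustration; the abstract does not state the boundary condition.

```python
from typing import List, Tuple


def neighbors(x: int, y: int, size: int, moore: bool = False) -> List[Tuple[int, int]]:
    """Return the neighbor coordinates of cell (x, y) on a size-by-size lattice.

    moore=False gives the 4-cell von Neumann neighborhood;
    moore=True gives the 8-cell Moore neighborhood (the paid upgrade).
    Periodic boundaries are assumed here, which is a modeling choice.
    """
    offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    if moore:
        offsets += [(-1, -1), (-1, 1), (1, -1), (1, 1)]
    return [((x + dx) % size, (y + dy) % size) for dx, dy in offsets]


basic = neighbors(0, 0, size=5)
upgraded = neighbors(0, 0, size=5, moore=True)
```

A player who pays the upgrade cost would interact with `upgraded` instead of `basic`, so the cost buys four extra interaction partners per round.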

preprint | 2022 | arXiv

Unitary Approximate Message Passing for Matrix Factorization

We consider matrix factorization (MF) with certain constraints, which finds wide applications in various areas. Leveraging variational inference (VI) and unitary approximate message passing (UAMP), we develop a Bayesian approach to MF with an efficient message passing implementation, called UAMPMF. With proper priors imposed on the factor matrices, UAMPMF can be used to solve many problems that can be formulated as MF, such as non-negative matrix factorization, dictionary learning, compressive sensing with matrix uncertainty, robust principal component analysis, and sparse matrix factorization. Extensive numerical examples are provided to show that UAMPMF significantly outperforms state-of-the-art algorithms in terms of recovery accuracy, robustness and computational complexity.

preprint | 2022 | arXiv

Weighted Sum Age of Information Minimization in Wireless Networks with Aerial IRS

In this letter, we analyze a terrestrial wireless communication network assisted by an aerial intelligent reflecting surface (IRS). We consider a packet scheduling problem at the ground base station (BS) aimed at improving the information freshness by selecting packets based on their age of information (AoI). To further improve the communication quality, the trajectory of the unmanned aerial vehicle (UAV) that carries the IRS is optimized with joint active and passive beamforming design. To solve the formulated non-convex problem, we propose an iterative alternating optimization method based on a successive convex approximation (SCA) algorithm. The simulation results show significant performance improvement in terms of weighted sum AoI, and the SCA solution converges quickly with low computational complexity.

preprint | 2021 | arXiv

A Tutorial on Ultra-Reliable and Low-Latency Communications in 6G: Integrating Domain Knowledge into Deep Learning

As one of the key communication scenarios in the 5th and also the 6th generation (6G) of mobile communication networks, ultra-reliable and low-latency communications (URLLC) will be central for the development of various emerging mission-critical applications. State-of-the-art mobile communication systems do not fulfill the end-to-end delay and overall reliability requirements of URLLC. In particular, a holistic framework that takes into account latency, reliability, availability, scalability, and decision making under uncertainty is lacking. Driven by recent breakthroughs in deep neural networks, deep learning algorithms have been considered as promising ways of developing enabling technologies for URLLC in future 6G networks. This tutorial illustrates how domain knowledge (models, analytical tools, and optimization frameworks) of communications and networking can be integrated into different kinds of deep learning algorithms for URLLC. We first provide some background of URLLC and review promising network architectures and deep learning frameworks for 6G. To better illustrate how to improve learning algorithms with domain knowledge, we revisit model-based analytical tools and cross-layer optimization frameworks for URLLC. Following that, we examine the potential of applying supervised/unsupervised deep learning and deep reinforcement learning in URLLC and summarize related open problems. Finally, we provide simulation and experimental results to validate the effectiveness of different learning algorithms and discuss future directions.

preprint | 2021 | arXiv

Exploration of the Doping Effect in the Thiolate-protected Gold Nanoclusters: DFT Simulations of H2S-nanoalloy Complexes

The atomically precise method has become an important technique to adjust the core of thiolate-protected gold nanoclusters to improve physical and chemical properties. But the doping effect on the structural stability has not been systematically summarized. In this work, the H2S-nanoalloy molecules with different doping metal atoms have been investigated to elucidate the impact of the dopant on the structures. According to the DFT simulation results, the zinc group atoms as dopants may be influenced by the surrounding gold atoms, and the binding of the thiolate units is enhanced. The simulated zinc group data, when combined with the gold group and platinum group data, can be summarized from the perspective of the balance between the ligand-core binding and core cohesive energies. Most dopants drive the modeled nanoclusters away from the balance, especially when the metal atom replaces the gold atom in gold-sulfur bindings. But when the cores of the nanoclusters are dominated by gold atoms, the dopants may achieve "saturation" such that the balance in the doped clusters may be corrected. This work provides a simple profile to understand the internal shift of the structure introduced by the atomically precise method.

preprint | 2021 | arXiv

Optimizing Information Freshness for Cooperative IoT Systems with Stochastic Arrivals

This paper considers a cooperative Internet of Things (IoT) system with a source aiming to transmit randomly generated status updates to a designated destination as timely as possible with the help of a relay. We adopt a recently proposed concept, the age of information (AoI), to characterize the timeliness of the status updates. In the considered system, delivering the status updates via the one-hop direct link will have a shorter transmission time at the cost of incurring a higher error probability, while the delivery of status updates through the two-hop relay link could be more reliable at the cost of suffering longer transmission time. Thus, it is important to design the relaying protocol of the considered system for optimizing the information freshness. Considering the limited capabilities of IoT devices, we propose two low-complexity age-oriented relaying (AoR) protocols, i.e., the source-prioritized AoR (SP-AoR) protocol and the relay-prioritized AoR (RP-AoR) protocol, to reduce the AoI of the considered system. By carefully analyzing the evolution of the instantaneous AoI, we derive closed-form expressions of the average AoI for both proposed AoR protocols. We further optimize the generation probability of the status updates at the source in both protocols. Simulation results validate our theoretical analysis, and demonstrate that each of the two proposed protocols outperforms the other under different system parameters. Moreover, the protocol with better performance can achieve near-optimal performance compared with the optimal scheduling policy attained by applying the Markov decision process (MDP) tool.

preprint | 2021 | arXiv

Optimizing Information Freshness in Two-Hop Status Update Systems under a Resource Constraint

In this paper, we investigate the age minimization problem for a two-hop relay system, under a resource constraint on the average number of forwarding operations at the relay. We first design an optimal policy by modelling the considered scheduling problem as a constrained Markov decision process (CMDP) problem. Based on the observed multi-threshold structure of the optimal policy, we then devise a low-complexity double threshold relaying (DTR) policy with only two thresholds, one for the relay's age of information (AoI) and the other for the age gain between destination and relay. We derive approximate closed-form expressions of the average AoI at the destination, and the average number of forwarding operations at the relay for the DTR policy, by modelling the tangled evolution of age at relay and destination as a Markov chain (MC). Numerical results validate the theoretical analysis, and show that the low-complexity DTR policy can achieve near-optimal performance compared with the optimal CMDP-based policy. Moreover, the relay should always consider the threshold for its local age to maintain a low age at the destination. When the resource constraint is relatively tight, it further needs to consider the threshold on the age gain to ensure that only those packets that can decrease the destination's age dramatically will be forwarded.

preprint | 2021 | arXiv

Random Shifting Intelligent Reflecting Surface for OTP Encrypted Data Transmission

In this paper, we propose a novel encrypted data transmission scheme using an intelligent reflecting surface (IRS) to generate secret keys in wireless communication networks. We show that perfectly secure one-time pad (OTP) communications can be established by using a simple random phase shifting of the IRS elements. To maximize the secure transmission rate, we design an optimal time slot allocation algorithm for the IRS secret key generation and the encrypted data transmission phases. Moreover, a theoretical expression of the key generation rate is derived based on a Poisson point process (PPP) for the practical scenario when eavesdroppers' channel state information (CSI) is unavailable. Simulation results show that employing our IRS-based scheme can significantly improve the encrypted data transmission performance for a wide range of wireless channel gains and system parameters.
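
The one-time pad step that this abstract relies on is a bytewise XOR with a key as long as the message. The sketch below shows only that generic step: the key is drawn from Python's `secrets` module as a stand-in, whereas in the paper the key comes from the IRS-based key generation phase, which is not modeled here. The function name `otp_xor` is hypothetical.

```python
import secrets


def otp_xor(data: bytes, key: bytes) -> bytes:
    """XOR data with a key of equal length (encryption and decryption are the same)."""
    if len(key) != len(data):
        raise ValueError("a one-time pad key must be exactly as long as the data")
    return bytes(d ^ k for d, k in zip(data, key))


message = b"status update"
# Stand-in key source (assumption): the paper derives keys from the wireless channel.
key = secrets.token_bytes(len(message))
ciphertext = otp_xor(message, key)
recovered = otp_xor(ciphertext, key)
```

Perfect secrecy holds only if the key is uniformly random, kept secret, and never reused, which is why the achievable key generation rate limits the encrypted data rate.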

preprint | 2021 | arXiv

Secret Key Generation for Intelligent Reflecting Surface Assisted Wireless Communication Networks

We propose and analyze secret key generation using intelligent reflecting surface (IRS) assisted wireless communication networks. To this end, we first formulate the minimum achievable secret key capacity for an IRS acting as a passive beamformer in the presence of multiple eavesdroppers. Next, we develop an optimization framework for the IRS reflecting coefficients based on the secret key capacity lower bound. To derive a tractable and efficient solution, we design and analyze a semidefinite relaxation (SDR) and successive convex approximation (SCA) based algorithm for the proposed optimization. Simulation results show that employing our IRS-based algorithm can significantly improve the secret key generation capacity for a wide range of wireless channel parameters.

preprint | 2020 | arXiv

Age-Oriented Opportunistic Relaying in Cooperative Status Update Systems with Stochastic Arrivals

This paper considers a cooperative status update system with a source aiming to send randomly generated status updates to a designated destination as timely as possible with the help of a relay. We adopt a recently proposed concept, Age of Information (AoI), to characterize the timeliness of the status updates. We propose an age-oriented opportunistic relaying (AoR) protocol to reduce the AoI of the considered system. Specifically, the relay opportunistically replaces the source to retransmit the successfully received status updates that have not been correctly delivered to the destination, but the retransmission of the relay can be preempted by the arrival of a new status update at the source. By carefully analyzing the evolution of AoI, we derive a closed-form expression of the average AoI for the proposed AoR protocol. We further minimize the average AoI by optimizing the generation probability of the status updates at the source. Simulation results validate our theoretical analysis and demonstrate that the average AoI performance of the proposed AoR protocol is superior to that of the non-cooperative system.
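
The AoI evolution mentioned in this abstract can be illustrated with a minimal slotted model: the age at the destination grows by one each slot and drops to the age of the delivered update whenever a delivery succeeds. The function names and the input format (one entry per slot, `None` for no delivery) are assumptions made for this sketch; it does not implement the AoR protocol itself.

```python
from typing import List, Optional


def aoi_trace(deliveries: List[Optional[int]], initial_age: int = 1) -> List[int]:
    """Return the destination's AoI at the end of each slot.

    deliveries[t] is None if nothing was delivered in slot t, otherwise the
    age (in slots since generation) of the update delivered in that slot.
    """
    age = initial_age
    trace = []
    for delivered_age in deliveries:
        # No delivery: information gets one slot staler.
        # Delivery: the destination's age resets to the update's own age.
        age = age + 1 if delivered_age is None else delivered_age
        trace.append(age)
    return trace


def average_aoi(trace: List[int]) -> float:
    """Time-average AoI over the simulated slots."""
    return sum(trace) / len(trace)


trace = aoi_trace([None, None, 1, None, 2])
mean_age = average_aoi(trace)
```

A retransmitted update arrives with a larger age than a fresh one (the `2` in the last slot above), which is the trade-off a relaying protocol has to manage.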

preprint | 2020 | arXiv

Crowd Scene Analysis by Output Encoding

Crowd scene analysis receives growing attention due to its wide applications. Grasping the accurate crowd location (rather than merely crowd count) is important for spatially identifying high-risk regions in congested scenes. In this paper, we propose a Compressed Sensing based Output Encoding (CSOE) scheme, which casts detecting pixel coordinates of small objects into a task of signal regression in encoding signal space. CSOE helps to boost localization performance in circumstances where targets are highly crowded without huge scale variation. In addition, proper receptive field sizes are crucial for crowd analysis due to human size variations. We create Multiple Dilated Convolution Branches (MDCB) that offer a set of different receptive field sizes, to improve localization accuracy when object sizes change drastically in an image. Also, we develop an Adaptive Receptive Field Weighting (ARFW) module, which further deals with the scale variation issue by adaptively emphasizing informative channels that have the proper receptive field size. Experiments demonstrate the effectiveness of the proposed method, which achieves state-of-the-art performance across four mainstream datasets, with especially strong results in highly crowded scenes. More importantly, experiments support our insights that it is crucial to tackle the target size variation issue in crowd analysis, and that casting crowd localization as regression in encoding signal space is quite effective for crowd analysis.

preprint | 2020 | arXiv

Deep Learning for Radio Resource Allocation with Diverse Quality-of-Service Requirements in 5G

To accommodate diverse Quality-of-Service (QoS) requirements in the 5th generation cellular networks, base stations need real-time optimization of radio resources in time-varying network conditions. This brings high computing overheads and long processing delays. In this work, we develop a deep learning framework to approximate the optimal resource allocation policy that minimizes the total power consumption of a base station by optimizing bandwidth and transmit power allocation. We find that a fully-connected neural network (NN) cannot fully guarantee the QoS requirements due to the approximation errors and quantization errors of the numbers of subcarriers. To tackle this problem, we propose a cascaded structure of NNs, where the first NN approximates the optimal bandwidth allocation, and the second NN outputs the transmit power required to satisfy the QoS requirement with given bandwidth allocation. Considering that the distribution of wireless channels and the types of services in the wireless networks are non-stationary, we apply deep transfer learning to update NNs in non-stationary wireless networks. Simulation results validate that the cascaded NNs outperform the fully connected NN in terms of QoS guarantee. In addition, deep transfer learning can reduce the number of training samples required to train the NNs remarkably.

preprint2020arXiv

Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G Networks

In the future 6th generation networks, ultra-reliable and low-latency communications (URLLC) will lay the foundation for emerging mission-critical applications that have stringent requirements on end-to-end delay and reliability. Existing works on URLLC are mainly based on theoretical models and assumptions. The model-based solutions provide useful insights, but cannot be directly implemented in practice. In this article, we first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC, and discuss some open problems of these methods. To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC. The basic idea is to merge theoretical models and real-world data in analyzing the latency and reliability and training deep neural networks (DNNs). Deep transfer learning is adopted in the architecture to fine-tune the pre-trained DNNs in non-stationary networks. Further considering that the computing capacity at each user and each mobile edge computing server is limited, federated learning is applied to improve the learning efficiency. Finally, we provide some experimental and simulation results and discuss some future directions.

preprint2020arXiv

Deep Multi-Task Learning for Cooperative NOMA: System Design and Principles

Envisioned as a promising component of the future wireless Internet-of-Things (IoT) networks, the non-orthogonal multiple access (NOMA) technique can support massive connectivity with a significantly increased spectral efficiency. Cooperative NOMA is able to further improve the communication reliability of users under poor channel conditions. However, the conventional system design suffers from several inherent limitations and is not optimized from the bit error rate (BER) perspective. In this paper, we develop a novel deep cooperative NOMA scheme, drawing upon the recent advances in deep learning (DL). We develop a novel hybrid-cascaded deep neural network (DNN) architecture such that the entire system can be optimized in a holistic manner. On this basis, we construct multiple loss functions to quantify the BER performance and propose a novel multi-task oriented two-stage training method to solve the end-to-end training problem in a self-supervised manner. The learning mechanism of each DNN module is then analyzed based on information theory, offering insights into the proposed DNN architecture and its corresponding training method. We also adapt the proposed scheme to handle the power allocation (PA) mismatch between training and inference and incorporate it with channel coding to combat signal deterioration. Simulation results verify its advantages over orthogonal multiple access (OMA) and the conventional cooperative NOMA scheme in various scenarios.

preprint2020arXiv

Deep Residual Learning-Assisted Channel Estimation in Ambient Backscatter Communications

Channel estimation is a challenging problem for realizing efficient ambient backscatter communication (AmBC) systems. In this letter, channel estimation in AmBC is modeled as a denoising problem and a convolutional neural network-based deep residual learning denoiser (CRLD) is developed to directly recover the channel coefficients from the received noisy pilot signals. To simultaneously exploit the spatial and temporal features of the pilot signals, a novel three-dimensional (3D) denoising block is specifically designed to facilitate denoising in CRLD. In addition, we provide theoretical analysis to characterize the properties of the proposed CRLD. Simulation results demonstrate that the performance of the proposed method approaches the performance of the optimal minimum mean square error (MMSE) estimator with a perfect statistical channel correlation matrix.

preprint2020arXiv

Grant-Free Non-Orthogonal Multiple Access: A Key Enabler for 6G-IoT

The proliferating number of devices with short payloads as well as low power budgets has already driven researchers away from classical grant-based access schemes that are notorious for their large signalling overhead as well as power-consuming retransmissions. Instead, light-weight random access protocols have been re-investigated and their throughput has been improved by orders of magnitude with sophisticated yet still low-complexity transceiver algorithms. In fact, grant-free access has been identified as a key medium access control technique for providing massive connectivity in machine type communications in cellular networks. In this paper, we show that grant-free access combined with non-orthogonal transmission schemes is a promising solution for 6G Internet of Things (IoT). We present novel and promising results for deep learning (DL)-based techniques for joint user detection and decoding. Then, we propose a multi-layered model for GF-NOMA for power-efficient communications. We also discuss resource allocation issues to enable the co-existence of GF-NOMA with other orthogonal or even grant-based schemes. Finally, we conclude with proposed research directions for medium access towards enabling 6G-IoT.

preprint2020arXiv

Minimizing Age of Information via Hybrid NOMA/OMA

This paper considers a wireless network with a base station (BS) conducting timely transmission to two clients in a slotted manner via hybrid non-orthogonal multiple access (NOMA)/orthogonal multiple access (OMA). Specifically, the BS is able to adaptively switch between NOMA and OMA for the downlink transmission to optimize the information freshness of the network, characterized by the Age of Information (AoI). If the BS chooses OMA, it can only serve one client within a time slot and should decide which client to serve; if the BS chooses NOMA, it can serve both clients simultaneously and should decide the power allocated to each client. To minimize the weighted sum of expected AoI of the network, we formulate a Markov Decision Process (MDP) problem and develop an optimal policy for the BS to decide whether to use NOMA or OMA for each downlink transmission based on the instantaneous AoI of both clients. We prove the existence of an optimal stationary and deterministic policy, and perform action elimination to reduce the action space for lower computation complexity. The optimal policy is shown to have a switching-type property with obvious decision switching boundaries. A suboptimal policy with lower computation complexity is also devised, which can achieve near-optimal performance according to our simulation results. The performance of different policies under different system settings is compared and analyzed in numerical results to provide useful insights for practical system designs.
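
As an illustration of the state transition underlying such an MDP, the sketch below advances a two-client AoI vector by one slot. It is not the paper's model: the action names, the per-action delivery probabilities and the convention that AoI resets to 1 on a successful delivery are all assumptions made here for the example.

```python
import random

# Hypothetical per-client delivery probabilities for each action.
# OMA serves one client reliably; NOMA serves both with lower reliability.
SUCCESS_PROB = {
    "oma_client0": (0.9, 0.0),
    "oma_client1": (0.0, 0.9),
    "noma": (0.6, 0.5),
}


def step_aoi(aoi, action, success_prob=SUCCESS_PROB, rng=random):
    """Return the AoI pair after one slot under the chosen action.

    Assumed convention: a delivered update resets that client's AoI to 1,
    otherwise the AoI grows by one slot.
    """
    next_aoi = []
    for age, prob in zip(aoi, success_prob[action]):
        delivered = rng.random() < prob
        next_aoi.append(1 if delivered else age + 1)
    return tuple(next_aoi)


if __name__ == "__main__":
    state = (1, 1)
    for _ in range(5):
        state = step_aoi(state, "noma")
        print(state)
```

A policy in this setting maps the current AoI pair to one of the actions; the switching-type structure described in the abstract would correspond to regions of the AoI plane in which each action is preferred.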

preprint2020arXiv

Minimizing the Age of Information of Cognitive Radio-Based IoT Systems Under A Collision Constraint

This paper considers a cognitive radio-based IoT monitoring system, consisting of an IoT device that aims to update its measurement to a destination using a cognitive radio technique. Specifically, the IoT device, as a secondary user (SIoT), seeks and exploits the spectrum opportunities of the licensed band vacated by its primary user (PU) to deliver status updates without causing visible effects to the licensed operation. In this context, the SIoT should carefully make use of the licensed band and schedule when to transmit to maintain the timeliness of the status update. We adopt a recent metric, Age of Information (AoI), to characterize the timeliness of the status update of the SIoT. We aim to minimize the long-term average AoI of the SIoT while satisfying the collision constraint imposed by the PU by formulating a constrained Markov decision process (CMDP) problem. We first prove the existence of an optimal stationary policy of the CMDP problem. The optimal stationary policy (termed age-optimal policy) is shown to be a randomized simple policy that randomizes between two deterministic policies with a fixed probability. We prove that the two deterministic policies have a threshold structure and further derive the closed-form expression of average AoI and collision probability for the deterministic threshold-structured policy by conducting Markov Chain analysis. The analytical expression offers an efficient way to calculate the threshold and randomization probability to form the age-optimal policy. For comparison, we also consider the throughput maximization policy (termed throughput-optimal policy) and analyze the average AoI performance under the throughput-optimal policy in the considered system. Numerical simulations show the superiority of the derived age-optimal policy over the throughput-optimal policy. We also unveil the impacts of various system parameters on the corresponding optimal policy and the resultant average AoI.
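
One possible reading of a policy that randomizes between two threshold rules is sketched below. It is a simplification for illustration only: the thresholds, the mixing probability and the choice to draw the threshold afresh in every slot are assumptions, and the sketch ignores the spectrum sensing state that the actual system would also depend on.

```python
import random


def should_transmit(aoi, thresholds, mix_prob, rng=random):
    """Decide whether to transmit in the current slot.

    thresholds: (low, high) AoI thresholds of the two deterministic policies
                (hypothetical values supplied by the caller).
    mix_prob:   probability of applying the low-threshold policy.
    """
    low, high = thresholds
    threshold = low if rng.random() < mix_prob else high
    return aoi >= threshold


if __name__ == "__main__":
    for age in range(1, 8):
        print(age, should_transmit(age, thresholds=(3, 5), mix_prob=0.4))
```

With `mix_prob` set to 0 or 1 the rule collapses to a single deterministic threshold policy; intermediate values trade average AoI against how often the device risks colliding with the primary user.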

preprint2020arXiv

Minimum-Latency FEC Design with Delayed Feedback: Mathematical Modeling and Efficient Algorithms

In this paper, we consider the packet-level forward error correction (FEC) code design, without feedback or with delayed feedback, for achieving the minimum end-to-end latency, i.e., the latency between the time when a packet is generated at the source and its in-order delivery to the application layer of the destination. We first show that the minimum-latency FEC design problem can be modeled as a partially observable Markov decision process (POMDP), and hence the optimal code construction can be obtained by solving the corresponding POMDP. However, solving the POMDP optimally is in general difficult unless the size is very small. To this end, we propose an efficient heuristic algorithm, namely the majority vote policy, for obtaining a high quality approximate solution. We also derive the tight lower and upper bounds of the optimal state values of this POMDP, based on which a more sophisticated D-step search algorithm is implemented for obtaining near-optimal solutions. The simulation results show that the proposed code designs via solving the POMDP, either with the majority vote policy or the D-step search algorithm, strictly outperform the existing schemes, in both cases, without or with only delayed feedback.

preprint2020arXiv

Near-Optimal Interference Exploitation 1-Bit Massive MIMO Precoding via Partial Branch-and-Bound

In this paper, we focus on 1-bit precoding for large-scale antenna systems in the downlink based on the concept of constructive interference (CI). By formulating the optimization problem that aims to maximize the CI effect subject to the 1-bit constraint on the transmit signals, we mathematically prove that, when relaxing the 1-bit constraint, the majority of the obtained transmit signals already satisfy the 1-bit constraint. Based on this important observation, we propose a 1-bit precoding method via a partial branch-and-bound (P-BB) approach, where the BB procedure is only performed for the entries that do not comply with the 1-bit constraint. The proposed P-BB enables the use of the BB framework in large-scale antenna scenarios, which was not applicable due to its prohibitive complexity. Numerical results demonstrate a near-optimal error rate performance for the proposed 1-bit precoding algorithm.

preprint2020arXiv

Optimizing Information Freshness in Two-Way Relay Networks

In this paper, we investigate an amplify-and-forward (AF) based two-way cooperative status update system, where two sources aim to exchange status updates with each other as timely as possible with the help of a relay. Specifically, the relay receives the sum signal from the two sources in one time slot, and then amplifies and forwards the received signal to both the sources in the next time slot. We adopt a recently proposed concept, the age of information (AoI), to characterize the timeliness of the status updates. Assuming that the two sources are able to generate status updates at the beginning of each time slot (i.e., generate-at-will model), we derive a closed-form expression of the expected weighted sum AoI of the considered system. We further minimize the expected weighted sum AoI by optimizing the transmission power at each node under the peak power constraints. Simulation results corroborate the correctness of our theoretical analysis.

preprint2020arXiv

Optimizing Information Freshness via Multiuser Scheduling with Adaptive NOMA/OMA

This paper considers a wireless network with a base station (BS) conducting timely status updates to multiple clients via adaptive non-orthogonal multiple access (NOMA)/orthogonal multiple access (OMA). Specifically, the BS is able to adaptively switch between NOMA and OMA for the downlink transmission to optimize the information freshness of the network, characterized by the Age of Information (AoI) metric. If the BS chooses OMA, it can only serve one client within each time slot and should decide which client to serve; if the BS chooses NOMA, it can serve more than one client at the same time and needs to decide the power allocated to the served clients. For the simple two-client case, we formulate a Markov Decision Process (MDP) problem and develop the optimal policy for the BS to decide whether to use NOMA or OMA for each downlink transmission based on the instantaneous AoI of both clients. The optimal policy is shown to have a switching-type property with obvious decision switching boundaries. A near-optimal policy with lower computation complexity is also devised. For the more general multi-client scenario, inspired by the proposed near-optimal policy, we formulate a nonlinear optimization problem to determine the optimal power allocated to each client by maximizing the expected AoI drop of the network in each time slot. We resolve the formulated problem by approximating it as a convex optimization problem. We also derive the upper bound of the gap between the approximate convex problem and the original nonlinear, nonconvex problem. Simulation results validate the effectiveness of the adopted approximation. The performance of the adaptive NOMA/OMA scheme by solving the convex optimization is shown to be close to that of max-weight policy solved by exhaustive search...

preprint2020arXiv

Physical Layer Authentication for Non-coherent Massive SIMO-Based Industrial IoT Communications

Achieving ultra-reliable, low-latency and secure communications is essential for realizing the industrial Internet of Things (IIoT). Non-coherent massive multiple-input multiple-output (MIMO) has recently been proposed as a promising methodology to fulfill ultra-reliable and low-latency requirements. In addition, physical layer authentication (PLA) technology is particularly suitable for IIoT communications thanks to its low-latency attribute. A PLA method for non-coherent massive single-input multiple-output (SIMO) IIoT communication systems is proposed in this paper. Specifically, we first determine the optimal embedding of the authentication information (tag) in the message information. We then optimize the power allocation between message and tag signal to characterize the trade-off between message and tag error performance. Numerical results show that the proposed PLA is more accurate than traditional methods adopting the uniform tag when the communication reliability remains at the same level. The proposed PLA method can be effectively applied to the non-coherent system.

preprint2020arXiv

Physical Layer Authentication for Non-Coherent Massive SIMO-Enabled Industrial IoT Communications

Achieving ultra-reliable, low-latency and secure communications is essential for realizing the industrial Internet of Things (IIoT). Non-coherent massive multiple-input multiple-output (MIMO) is one of the promising techniques to fulfill ultra-reliable and low-latency requirements. In addition, physical layer authentication (PLA) technology is particularly suitable for secure IIoT communications thanks to its low-latency attribute. A PLA method for non-coherent massive single-input multiple-output (SIMO) IIoT communication systems is proposed in this paper. This method realizes PLA by embedding an authentication signal (tag) into a message signal, referred to as "message-based tag embedding". It is different from traditional PLA methods utilizing uniform power tags. We design the optimal tag embedding and optimize the power allocation between the message and tag signals to characterize the trade-off between the message and tag error performance. Numerical results show that the proposed message-based tag embedding PLA method is more accurate than the traditional uniform tag embedding method, which has an unavoidable tag error floor close to 10%.

preprint2020arXiv

Reconfigurable Intelligent Surface (RIS)-Enhanced Two-Way OFDM Communications

In this paper, we focus on reconfigurable intelligent surface (RIS)-enhanced two-way device-to-device (D2D) multi-pair orthogonal-frequency-division-multiplexing (OFDM) communication systems. Specifically, we maximize the minimum bidirectional weighted sum-rate by jointly optimizing the sub-band allocation, the power allocation and the discrete phase shift (PS) design at the RIS. To tackle the main difficulty of the non-convex PS design at the RIS, we first formulate a semi-definite relaxation problem and further devise a low-complexity solution for the PS design by leveraging the projected sub-gradient method. We demonstrate the desirable performance gain for the proposed designs through numerical results.

preprint2020arXiv

Spectrum Intelligent Radio: Technology, Development, and Future Trends

The advent of Industry 4.0 with massive connectivity places significant strains on the current spectrum resources, and challenges the industry and regulators to respond promptly with new disruptive spectrum management strategies. The current radio development, with certain elements of intelligence, is nowhere near showing an agile response to the complex radio environments. Following the line of intelligence, we propose to classify spectrum intelligent radio into three streams: classical signal processing, machine learning (ML), and contextual adaptation. We focus on the ML approach, and propose a new intelligent radio architecture with three hierarchical forms: perception, understanding, and reasoning. The proposed perception method achieves fully blind multi-level spectrum sensing. The understanding method accurately predicts the primary users' coverage across a large area, and the reasoning method performs a near-optimal idle channel selection. Opportunities, challenges, and future visions are also discussed for the realization of a fully intelligent radio.

preprint2020arXiv

Task Offloading for Large-Scale Asynchronous Mobile Edge Computing: An Index Policy Approach

Mobile-edge computing (MEC) offloads computational tasks from wireless devices to network edge, and enables real-time information transmission and computing. Most existing work concerns a small-scale synchronous MEC system. In this paper, we focus on a large-scale asynchronous MEC system with random task arrivals, distinct workloads, and diverse deadlines. We formulate the offloading policy design as a restless multi-armed bandit (RMAB) to maximize the total discounted reward over the time horizon. However, the formulated RMAB is related to a PSPACE-hard sequential decision-making problem, which is intractable. To address this issue, by exploiting the Whittle index (WI) theory, we rigorously establish the WI indexability and derive a scalable closed-form solution. Consequently, in our WI policy, each user only needs to calculate its WI and report it to the BS, and the users with the highest indices are selected for task offloading. Furthermore, when the task completion ratio becomes the focus, the shorter slack time less remaining workload (STLW) priority rule is introduced into the WI policy for performance improvement. When the knowledge of user offloading energy consumption is not available prior to the offloading, we develop Bayesian learning-enabled WI policies, including maximum likelihood estimation, Bayesian learning with conjugate prior, and prior-swapping techniques. Simulation results show that the proposed policies significantly outperform the other existing policies.
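
The selection step of an index policy, in which each user reports an index and the users with the highest indices are chosen, can be sketched as follows. The user identifiers, the index values and the `capacity` parameter are hypothetical; computing the Whittle index itself relies on the closed-form expression derived in the paper and is not reproduced here.

```python
def select_users(reported_indices, capacity):
    """Return the ids of the `capacity` users with the largest reported indices.

    reported_indices: dict mapping a user id to its reported index value.
    Ties keep the dict's insertion order because Python's sort is stable.
    """
    if capacity < 0:
        raise ValueError("capacity must be non-negative")
    ranked = sorted(reported_indices, key=reported_indices.get, reverse=True)
    return ranked[:capacity]


if __name__ == "__main__":
    # Hypothetical index values reported to the base station in one slot.
    reports = {"user_a": 0.2, "user_b": 1.5, "user_c": 0.9, "user_d": 0.1}
    print(select_users(reports, capacity=2))
```

Because each user computes its own index, the base station only needs to sort one number per user in each slot, which is what makes this kind of policy scale to large systems.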