Researcher profile

Nan Yang

Nan Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
32works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

32 published item(s)

preprint2026arXiv

On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length

Large language models (LLMs) have shown promise as interactive agents that solve tasks through extended sequences of environment interactions. While prior work has primarily focused on system-level optimizations or algorithmic improvements, the role of task horizon length in shaping training dynamics remains poorly understood. In this work, we present a systematic empirical study that examines horizon length through controlled task constructions. Specifically, we construct controlled tasks in which agents face identical decision rules and reasoning structures, but differ only in the length of action sequences required for successful completion. Our results reveal that increasing horizon length alone constitutes a training bottleneck, inducing severe training instability driven by exploration difficulties and credit assignment challenges. We demonstrate that horizon reduction is a key principle to address this limitation, stabilizing training and achieving better performance in long-horizon tasks. Moreover, we find that horizon reduction is related to stronger generalization across horizon lengths: models trained under reduced horizons generalize more effectively to longer-horizon variants at inference time, a phenomenon we refer to as horizon generalization.

preprint2024arXiv

Large Search Model: Redefining Search Stack in the Era of LLMs

Modern search engines are built on a stack of different components, including query understanding, retrieval, multi-stage ranking, and question answering, among others. These components are often optimized and deployed independently. In this paper, we introduce a novel conceptual framework called large search model, which redefines the conventional search stack by unifying search tasks with one large language model (LLM). All tasks are formulated as autoregressive text generation problems, allowing for the customization of tasks through the use of natural language prompts. This proposed framework capitalizes on the strong language understanding and reasoning capabilities of LLMs, offering the potential to enhance search result quality while simultaneously simplifying the existing cumbersome search stack. To substantiate the feasibility of this framework, we present a series of proof-of-concept experiments and discuss the potential challenges associated with implementing this approach within real-world search systems.

preprint2023arXiv

Secure Short-Packet Communications via UAV-Enabled Mobile Relaying: Joint Resource Optimization and 3D Trajectory Design

Short-packet communication (SPC) and unmanned aerial vehicles (UAVs) are anticipated to play crucial roles in the development of 5G-and-beyond wireless networks and the Internet of Things (IoT). In this paper, we propose a secure SPC system, where a UAV serves as a mobile decode-and-forward (DF) relay, periodically receiving and relaying small data packets from a remote IoT device to its receiver in two hops with strict latency requirements, in the presence of an eavesdropper. This system requires careful optimization of important design parameters, such as the coding blocklengths of both hops, transmit powers, and the UAV's trajectory. While the overall optimization problem is nonconvex, we tackle it by applying a block successive convex approximation (BSCA) approach to divide the original problem into three subproblems and solve them separately. Then, an overall iterative algorithm is proposed to obtain the final design with guaranteed convergence. Our proposed low-complexity algorithm incorporates robust trajectory design and resource management to optimize the effective average secrecy throughput of the communication system over the course of the UAV-relay's mission. Simulation results demonstrate significant performance improvements compared to various benchmark schemes and provide useful design insights on the coding blocklengths and transmit powers along the trajectory of the UAV.

preprint2022arXiv

Analysis of MC Systems Employing Receivers Covered by Heterogeneous Receptors

This paper investigates the channel impulse response (CIR), i.e., the molecule hitting rate, of a molecular communication (MC) system employing an absorbing receiver (RX) covered by multiple non overlapping receptors. In this system, receptors are heterogeneous, i.e., they may have different sizes and arbitrary locations. Furthermore, we consider two types of transmitter (TX), namely a point TX and a membrane fusion (MF)-based spherical TX. We assume the point TX or the center of the MF-based TX has a fixed distance to the center of the RX. Given this fixed distance, the TX can be at different locations and the CIR of the RX depends on the exact location of the TX. By averaging over all possible TX locations, we analyze the expected molecule hitting rate at the RX as a function of the sizes and locations of the receptors, where we assume molecule degradation may occur during the propagation of the signaling molecules. Notably, our analysis is valid for different numbers, a wide range of sizes, and arbitrary locations of the receptors, and its accuracy is confirmed via particle-based simulations. Exploiting our numerical results, we show that the expected number of absorbed molecules at the RX increases with the number of receptors, when the total area on the RX surface covered by receptors is fixed. Based on the derived analytical expressions, we compare different geometric receptor distributions by examining the expected number of absorbed molecules at the RX. We show that evenly distributed receptors result in a larger number of absorbed molecules than other distributions. We further compare three models that combine different types of TXs and RXs.

preprint2022arXiv

Analysis of Receiver Covered by Heterogeneous Receptors in Molecular Communications

This paper analyzes the channel impulse response of an absorbing receiver (RX) covered by multiple non-overlapping heterogeneous receptors with different sizes and arbitrary locations in a molecular communication system. In this system, a point transmitter (TX) is assumed to be uniformly located on a virtual sphere at a fixed distance from the RX. Considering molecule degradation during the propagation from the TX to the RX, the expected molecule hitting rate at the RX over varying locations of the TX is analyzed as a function of the size and location of each receptor. Notably, this analytical result is applicable for different numbers, sizes, and locations of receptors, and its accuracy is demonstrated via particle-based simulations. Numerical results show that (i) the expected number of absorbed molecules at the RX increases with an increasing number of receptors, when the total area of receptors on the RX surface is fixed, and (ii) evenly distributed receptors lead to the largest expected number of absorbed molecules.

preprint2022arXiv

Confidence Estimation Transformer for Long-term Renewable Energy Forecasting in Reinforcement Learning-based Power Grid Dispatching

The expansion of renewable energy could help realizing the goals of peaking carbon dioxide emissions and carbon neutralization. Some existing grid dispatching methods integrating short-term renewable energy prediction and reinforcement learning (RL) have been proved to alleviate the adverse impact of energy fluctuations risk. However, these methods omit the long-term output prediction, which leads to stability and security problems on the optimal power flow. This paper proposes a confidence estimation Transformer for long-term renewable energy forecasting in reinforcement learning-based power grid dispatching (Conformer-RLpatching). Conformer-RLpatching predicts long-term active output of each renewable energy generator with an enhanced Transformer to boost the performance of hybrid energy grid dispatching. Furthermore, a confidence estimation method is proposed to reduce the prediction error of renewable energy. Meanwhile, a dispatching necessity evaluation mechanism is put forward to decide whether the active output of a generator needs to be adjusted. Experiments carried out on the SG-126 power grid simulator show that Conformer-RLpatching achieves great improvement over the second best algorithm DDPG in security score by 25.8% and achieves a better total reward compared with the golden medal team in the power grid dispatching competition sponsored by State Grid Corporation of China under the same simulation environment. Codes are outsourced in https://github.com/buptlxh/Conformer-RLpatching.

preprint2022arXiv

Dynamic-subarray with Fixed Phase Shifters for Energy-efficient Terahertz Hybrid Beamforming under Partial CSI

Terahertz (THz) communications are regarded as a pillar technology for the 6G systems, by offering multi-ten-GHz bandwidth. To overcome the huge propagation loss while reducing the hardware complexity, THz ultra-massive (UM) MIMO systems with hybrid beamforming are proposed to offer high array gain. Notably, the adjustable-phase-shifters considered in most existing hybrid beamforming studies are power-hungry and difficult to realize in the THz band. Moreover, due to the ultra-massive antennas, full channel-state-information (CSI) is challenging to obtain. To address these practical concerns, in this paper, an energy-efficient dynamic-subarray with fixed-phase-shifters (DS-FPS) architecture is proposed for THz hybrid beamforming. To compensate for the spectral efficiency loss caused by the fixed-phase of FPS, a switch network is inserted to enable dynamic connections. In addition, by considering the partial CSI, we propose a row-successive-decomposition (RSD) algorithm to design the hybrid beamforming matrices for DS-FPS. A row-by-row (RBR) algorithm is further proposed to reduce computational complexity. Extensive simulation results show that, the proposed DS-FPS architecture with the RSD and RBR algorithms achieves much higher energy efficiency than the existing architectures. Moreover, the DS-FPS architecture with partial CSI achieves 97% spectral efficiency of that with full CSI.

preprint2022arXiv

Experimental relativistic zero-knowledge proofs

Protecting secrets is a key challenge in our contemporary information-based era. In common situations, however, revealing secrets appears unavoidable, for instance, when identifying oneself in a bank to retrieve money. In turn, this may have highly undesirable consequences in the unlikely, yet not unrealistic, case where the bank's security gets compromised. This naturally raises the question of whether disclosing secrets is fundamentally necessary for identifying oneself, or more generally for proving a statement to be correct. Developments in computer science provide an elegant solution via the concept of zero-knowledge proofs: a prover can convince a verifier of the validity of a certain statement without facilitating the elaboration of a proof at all. In this work, we report the experimental realisation of such a zero-knowledge protocol involving two separated verifier-prover pairs. Security is enforced via the physical principle of special relativity, and no computational assumption (such as the existence of one-way functions) is required. Our implementation exclusively relies on off-the-shelf equipment and works at both short (60 m) and long distances ($\geqslant$400 m) in about one second. This demonstrates the practical potential of multi-prover zero-knowledge protocols, promising for identification tasks and blockchain applications such as cryptocurrencies or smart contracts.

preprint2022arXiv

Learning Diverse Document Representations with Deep Query Interactions for Dense Retrieval

In this paper, we propose a new dense retrieval model which learns diverse document representations with deep query interactions. Our model encodes each document with a set of generated pseudo-queries to get query-informed, multi-view document representations. It not only enjoys high inference efficiency like the vanilla dual-encoder models, but also enables deep query-document interactions in document encoding and provides multi-faceted representations to better match different queries. Experiments on several benchmarks demonstrate the effectiveness of the proposed method, out-performing strong dual encoder baselines.The code is available at \url{https://github.com/jordane95/dual-cross-encoder

preprint2022arXiv

Molecular Absorption Effect: A Double-edged Sword of Terahertz Communications

Communications in the terahertz band (THz) (0.1--10~THz) have been regarded as a promising technology for future 6G and beyond wireless systems, to overcome the challenges of evergrowing wireless data traffic and crowded spectrum. As the frequency increases from the microwave band to the THz band, new spectrum features pose unprecedented challenges to wireless communication system design. The molecular absorption effect is one of the new THz spectrum properties, which enlarges the path loss and noise at specific frequencies. This brings in a double-edged sword for THz wireless communication systems. On one hand, from the data rate viewpoint, molecular absorption is detrimental, since it mitigates the received signal power and degrades the channel capacity. On the other hand, it is worth noticing that for wireless security and covertness, the molecular absorption effect can be utilized to safeguard THz communications among users. In this paper, the features of the molecular absorption effect and their impact on the THz system design are analyzed under various scenarios, with the ultimate goal of providing guidelines to how better exploit this unique THz phenomenon. Specifically, since the molecular absorption greatly depends on the propagation medium, different communication scenarios consisting of various media are discussed, including terrestrial, air and space, sea surface and nano-scale communications. Furthermore, two novel molecular absorption enlightened secure and covert communication schemes are presented, where the molecular absorption effect is utilized as the key and unique feature to boost security and covertness.

preprint2022arXiv

Spectrum Allocation with Adaptive Sub-band Bandwidth for Terahertz Communication Systems

We study spectrum allocation for terahertz (THz) band communication (THzCom) systems, while considering the frequency and distance-dependent nature of THz channels. Different from existing studies, we explore multi-band-based spectrum allocation with adaptive sub-band bandwidth (ASB) by allowing the spectrum of interest to be divided into sub-bands with unequal bandwidths. Also, we investigate the impact of sub-band assignment on multi-connectivity (MC) enabled THzCom systems, where users associate and communicate with multiple access points simultaneously. We formulate resource allocation problems, with the primary focus on spectrum allocation, to determine sub-band assignment, sub-band bandwidth, and optimal transmit power. Thereafter, we propose reasonable approximations and transformations, and develop iterative algorithms based on the successive convex approximation technique to analytically solve the formulated problems. Aided by numerical results, we show that by enabling and optimizing ASB, significantly higher throughput can be achieved as compared to adopting equal sub-band bandwidth, and this throughput gain is most profound when the power budget constraint is more stringent. We also show that our sub-band assignment strategy in MC-enabled THzCom systems outperforms the state-of-the-art sub-band assignment strategies and the performance gain is most profound when the spectrum with the lowest average molecular absorption coefficient is selected during spectrum allocation.

preprint2022arXiv

Truncated Channel Inversion Power Control to Enable One-Way URLLC with Imperfect Channel Reciprocity

We propose to use channel inversion power control (CIPC) to achieve one-way ultra-reliable and low-latency communications (URLLC), where only the transmission in one direction requires ultra reliability and low latency. Based on channel reciprocity, our proposed CIPC schemes guarantee the power of received signal that is used to decode the information to be a constant value $Q$, by varying the transmit signal and power, which relaxes the assumption of knowing channel state information (CSI) at the user. Thus, the CIPC schemes eliminate the overhead of CSI feedback, reduce communication latency, and explore the benefits of multiple antennas to significantly improve transmission reliability. We derive analytical expressions for the packet loss probability of the proposed CIPC schemes, based on which we determine a closed interval and a convex set for optimizing $Q$ in CIPC with imperfect and perfect channel reciprocity, respectively. Our results show that CIPC is an effective means to achieve one-way URLLC. The tradeoff among reliability, latency, and required resources (e.g., transmit antennas) is further revealed, which provides novel principles for designing one-way URLLC systems.

preprint2021arXiv

Membrane Fusion-Based Transmitter Design for Molecular Communication Systems

This paper proposes a novel imperfect spherical transmitter (TX) model, namely the membrane fusion (MF)-based TX, that adopts MF between a vesicle and the TX membrane to release molecules encapsulated within the vesicle. For the MF-based TX, the molecule release probability and the fraction of molecules released from the TX membrane are derived. Incorporating molecular degradation and a fully-absorbing receiver (RX), the end-to-end molecule hitting probability at the RX is also derived. A simulation framework for the MF-based TX is proposed, where the released point on the TX membrane and the released time of each molecule are determined. Aided by the simulation framework, the derived analytical expressions are validated. Simulation results verify that a low MF probability or low vesicle mobility slows the release of molecules from the TX, extends time required to reach the peak release probability, and reduces the end-to-end molecule hitting probability at the RX.

preprint2020arXiv

Channel Characterization for 1D Molecular Communication with Two Absorbing Receivers

This letter develops a one-dimensional (1D) diffusion-based molecular communication system to analyze channel responses between a single transmitter (TX) and two fully-absorbing receivers (RXs). Incorporating molecular degradation in the environment, rigorous analytical formulas for i) the fraction of molecules absorbed, ii) the corresponding hitting rate, and iii) the asymptotic fraction of absorbed molecules as time approaches infinity at each RX are derived when an impulse of molecules are released at the TX. By using particle-based simulations, the derived analytical expressions are validated. Simulations also present the distance ranges of two RXs that do not impact molecular absorption of each other, and demonstrate that the mutual imfluence of two active RXs reduces with the increase in the degradation rate.

preprint2020arXiv

Characterization of Cooperators in Quorum Sensing with 2D Molecular Signal Analysis

In quorum sensing (QS), bacteria exchange molecular signals to work together. An analytically-tractable model is presented for characterizing QS signal propagation within a population of bacteria and the number of responsive cooperative bacteria (i.e., cooperators) in a two-dimensional (2D) environment. Unlike prior works with a deterministic topology and a simplified molecular propagation channel, this work considers continuous emission, diffusion, degradation, and reception among randomly-distributed bacteria. Using stochastic geometry, the 2D channel response and the corresponding probability of cooperation at a bacterium are derived. Based on this probability, new expressions are derived for the moment generating function and different orders of moments of the number of cooperators. The analytical results agree with the simulation results obtained by a particle-based method. In addition, the Poisson and Gaussian distributions are compared to approximate the distribution of the number of cooperators and the Poisson distribution provides the best overall approximation. The derived channel response can be generally applied to any molecular communication model where single or multiple transmitters continuously release molecules into a 2D environment. The derived statistics of the number of cooperators can be used to predict and control the QS process, e.g., predicting and decreasing the likelihood of biofilm formation.

preprint2020arXiv

Classical route to quantum chaotic motions

We extract the information of a quantum motion and decode it into a certain orbit via a single measurable quantity. Such that a quantum chaotic system can be reconstructed as a chaotic attractor. Two configurations for reconstructing this certain orbit are illustrated, which interpret quantum chaotic motions from the perspectives of probabilistic nature and the uncertainty principle, respectively. We further present a strategy to import classical chaos to a quantum system, revealing a connection between the classical and quantum worlds.

preprint2020arXiv

Coverage Analysis for 3D Terahertz Communication Systems with Blockage and Directional Antennas

The scarcity of spectrum resources in current wireless communication systems has sparked enormous research interest in the terahertz (THz) frequency band. This band is characterized by fundamentally different propagation properties resulting in different interference structures from what we have observed so far at lower frequencies. In this paper, we derive a new expression for the coverage probability of downlink transmission in THz communication systems within a three-dimensional (3D) environment. First, we establish a 3D propagation model which considers the molecular absorption loss, 3D directional antennas at both access points (APs) and user equipments (UEs), interference from nearby APs, and dynamic blockages caused by moving humans. Then, we develop a novel easy-to-use analytical framework based on the dominant interferer analysis to evaluate the coverage probability, the novelty of which lies in the incorporation of the instantaneous interference and the vertical height of THz devices. Our numerical results demonstrate the accuracy of our analysis and reveal that the coverage probability significantly decreases when the transmission distance increases. We also show the increasing blocker density and increasing AP density impose different impacts on the coverage performance when the UE-AP link of interest is in line-of-sight. We further show that the coverage performance improvement brought by increasing the antenna directivity at APs is higher than that brought by increasing the antenna directivity at UEs.

preprint2020arXiv

D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

We propose D3VO as a novel framework for monocular visual odometry that exploits deep networks on three levels -- deep depth, pose and uncertainty estimation. We first propose a novel self-supervised monocular depth estimation network trained on stereo videos without any external supervision. In particular, it aligns the training image pairs into similar lighting condition with predictive brightness transformation parameters. Besides, we model the photometric uncertainties of pixels on the input images, which improves the depth estimation accuracy and provides a learned weighting function for the photometric residuals in direct (feature-less) visual odometry. Evaluation results show that the proposed network outperforms state-of-the-art self-supervised depth estimation networks. D3VO tightly incorporates the predicted depth, pose and uncertainty into a direct visual odometry method to boost both the front-end tracking as well as the back-end non-linear optimization. We evaluate D3VO in terms of monocular visual odometry on both the KITTI odometry benchmark and the EuRoC MAV dataset.The results show that D3VO outperforms state-of-the-art traditional monocular VO methods by a large margin. It also achieves comparable results to state-of-the-art stereo/LiDAR odometry on KITTI and to the state-of-the-art visual-inertial odometry on EuRoC MAV, while using only a single camera.

preprint2020arXiv

Directional Modulation-Enabled Secure Transmission with Intelligent Reflecting Surface

We propose a new secure transmission scheme which uses directional modulation (DM) with artificial noise and is aided by the intelligent reflecting surface (IRS). Specifically, the direct path and IRS-enabled reflect path carry the same confidential signal and thus can be coherently added at the desired position to maximize the total received power, while the received signals at other positions are distorted. We derive a closed-form expression for the secrecy rate achieved by the proposed scheme. Using simulation results, we show that the proposed scheme can achieve two-dimensional secure transmission at a specific position. Also, its performance advantage over the conventional DM scheme becomes more pronounced as the number of reflecting elements at the IRS increases.

preprint2020arXiv

DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation

Scene understanding from images is a challenging problem encountered in autonomous driving. On the object level, while 2D methods have gradually evolved from computing simple bounding boxes to delivering finer grained results like instance segmentations, the 3D family is still dominated by estimating 3D bounding boxes. In this paper, we propose a novel approach to jointly infer the 3D rigid-body poses and shapes of vehicles from a stereo image pair using shape priors. Unlike previous works that geometrically align shapes to point clouds from dense stereo reconstruction, our approach works directly on images by combining a photometric and a silhouette alignment term in the energy function. An adaptive sparse point selection scheme is proposed to efficiently measure the consistency with both terms. In experiments, we show superior performance of our method on 3D pose and shape estimation over the previous geometric approach and demonstrate that our method can also be applied as a refinement step and significantly boost the performances of several state-of-the-art deep learning based 3D object detectors. All related materials and demonstration videos are available at the project page https://vision.in.tum.de/research/vslam/direct-shape.

preprint2020arXiv

MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers

Pre-trained language models (e.g., BERT (Devlin et al., 2018) and its variants) have achieved remarkable success in varieties of NLP tasks. However, these models usually consist of hundreds of millions of parameters which brings challenges for fine-tuning and online serving in real-life applications due to latency and capacity constraints. In this work, we present a simple and effective approach to compress large Transformer (Vaswani et al., 2017) based pre-trained models, termed as deep self-attention distillation. The small model (student) is trained by deeply mimicking the self-attention module, which plays a vital role in Transformer networks, of the large model (teacher). Specifically, we propose distilling the self-attention module of the last Transformer layer of the teacher, which is effective and flexible for the student. Furthermore, we introduce the scaled dot-product between values in the self-attention module as the new deep self-attention knowledge, in addition to the attention distributions (i.e., the scaled dot-product of queries and keys) that have been used in existing works. Moreover, we show that introducing a teacher assistant (Mirzadeh et al., 2019) also helps the distillation of large pre-trained Transformer models. Experimental results demonstrate that our monolingual model outperforms state-of-the-art baselines in different parameter size of student models. In particular, it retains more than 99% accuracy on SQuAD 2.0 and several GLUE benchmark tasks using 50% of the Transformer parameters and computations of the teacher model. We also obtain competitive results in applying deep self-attention distillation to multilingual pre-trained models.

preprint2020arXiv

Multi-Connectivity for Indoor Terahertz Communication with Self and Dynamic Blockage

We derive new expressions for the connection probability and the average ergodic capacity to evaluate the performance achieved by multi-connectivity (MC) in an indoor ultra-wideband terahertz (THz) communication system. In this system, the user is affected by both self-blockage and dynamic human blockers. We first build up a three-dimensional propagation channel in this system to characterize the impact of molecular absorption loss and the shrinking usable bandwidth nature of the ultra-wideband THz channel. We then carry out new performance analysis for two MC strategies: 1) Closest line-of-sight (LOS) access point (AP) MC (C-MC), and 2) Reactive MC (R- MC). With numerical results, we validate our analysis and show the considerable improvement achieved by both MC strategies in the connection probability. We further show that the C-MC and R-MC strategies provide significant and marginal capacity gain relative to the single connectivity strategy, respectively, and increasing the number of the users associated APs imposes completely different affects on the capacity gain achieved by the C-MC and R-MC strategies. Additionally, we clarify that our analysis allows us to determine the optimal density of APs in order to maximize the capacity gain.

preprint2020arXiv

On the Pilot Contamination Attack in Multi-Cell Multiuser Massive MIMO Networks

In this paper, we analyze pilot contamination (PC) attacks on a multi-cell massive multiple-input multiple-output (MIMO) network with correlated pilots. We obtain correlated pilots using a user capacity-achieving pilot sequence design. This design relies on an algorithm which designs correlated pilot sequences based on signal-to-interference-plus-noise ratio (SINR) requirements for all the legitimate users. The pilot design is capable of achieving the SINR requirements for all users even in the presence of PC. However, this design has some intrinsic limitations and vulnerabilities, such as a known pilot sequence and the non-zero cross-correlation among different pilot sequences. We reveal that such vulnerabilities may be exploited by an active attacker to increase PC in the network. Motivated by this, we analyze the correlated pilot design for vulnerabilities that can be exploited by an active attacker. Based on this analysis, we develop an effective active attack strategy in the massive MIMO network with correlated pilot sequences. Our examinations reveal that the user capacity region of the network is significantly reduced in the presence of the active attack. Importantly, the SINR requirements for the worst-affected users may not be satisfied even with an infinite number of antennas at the base station.

preprint2020arXiv

Parameter Estimation in a Noisy 1D Environment via Two Absorbing Receivers

This paper investigates the estimation of different parameters, e.g., propagation distance and flow velocity, by utilizing two fully-absorbing receivers (RXs) in a one-dimensional (1D) environment. The time-varying number of absorbed molecules at each RX and the number of absorbed molecules in a time interval as time approaches infinity are derived. Noisy molecules in this environment, that are released by sources in addition to the transmitter, are also considered. A novel estimation method, namely difference estimation (DE), is proposed to eliminate the effect of noise by using the difference of received signals at the two RXs. For DE, the Cramer-Rao lower bound (CRLB) on the variance of estimation is derived. Independent maximum likelihood estimation is also considered at each RX as a benchmark to show the performance advantage of DE. Aided by particle-based simulation, the derived analytical results are verified. Furthermore, numerical results show that DE attains the CRLB and is less sensitive to the change of noise than independent estimation at each RX.

preprint2020arXiv

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

We propose to pre-train a unified language model for both autoencoding and partially autoregressive language modeling tasks using a novel training procedure, referred to as a pseudo-masked language model (PMLM). Given an input text with masked tokens, we rely on conventional masks to learn inter-relations between corrupted tokens and context via autoencoding, and pseudo masks to learn intra-relations between masked spans via partially autoregressive modeling. With well-designed position embeddings and self-attention masks, the context encodings are reused to avoid redundant computation. Moreover, conventional masks used for autoencoding provide global masking information, so that all the position embeddings are accessible in partially autoregressive language modeling. In addition, the two tasks pre-train a unified language model as a bidirectional encoder and a sequence-to-sequence decoder, respectively. Our experiments show that the unified language models pre-trained using PMLM achieve new state-of-the-art results on a wide range of natural language understanding and generation tasks across several widely used benchmarks.

preprint2019arXiv

Escape, bound and capture geodesics in local static coordinates in Schwarzschild spacetime

The classical geodesics of timelike particles in Schwarzschild spacetime is analyzed according to the particle starting radius $r$, velocity $v$ and angle $α$ against the radial outward direction in the reference system of an local static observer. The region of escape, bound and capture orbits in the parameter space of $(r,~v,~α)$ are solved using the three cases of the effective potential. It is found that generally for radius smaller than $4M$ or velocity larger than $c/\sqrt{2}$ there will be no bound orbits. While for fixed radius larger than $4M$ (or velocity smaller than $c/\sqrt{2}$), as velocity (or radius) increase from zero (or $2M$), the particle is always captured until a critical value $v_{\mathrm{crit1}}$ (or $r_{\mathrm{crit1}}$) when the bound orbit start to appear around $α=π/2$ between a double-napped cone structure. As the velocity (or radius) increases to another critical value $v_{\mathrm{crit2}}$ (or $r_{\mathrm{crit2}}$) then the bound directions and escape directions in the outward cone become escape directions, leaving only the inward cone separating the capture and bound directions. The angle of this cone will increase to its asymptotic value as velocity (or radius) increases to its asymptotic value. The implication of these results in shadow of black holes formed by massive particles, in black hole accretion and in spacecraft navigation is briefly discussed.

preprint2019arXiv

Molecular Information Delivery in Porous Media

Information delivery via molecular signals is abundant in nature and potentially useful for industry sensing. Many propagation channels (e.g., tissue membranes and catalyst beds) contain porous medium materials and the impact this has on communication performance is not well understood. Here, communication through realistic porous channels is investigated for the first time via statistical breakthrough curves. Assuming that the number of arrived molecules can be approximated as a Gaussian random variable and using fully resolved computational fluid dynamics results for the breakthrough curves, the numerical results for the throughput, mutual information, error probability, and information diversity gain are presented. Using these numerical results, the unique characteristics of the porous medium channel are revealed.

preprint2018arXiv

Downlink Power Control in Massive MIMO Networks with Distributed Antenna Arrays

In this paper, we investigate downlink power control in massive multiple-input multiple-output (MIMO) networks with distributed antenna arrays. The base station (BS) in each cell consists of multiple antenna arrays, which are deployed in arbitrary locations within the cell. Due to the spatial separation between antenna arrays, the large-scale propagation effect is different from a user to different antenna arrays in a cell, which makes power control a challenging problem as compared to conventional massive MIMO. We assume that the BS in each cell obtains the channel estimates via uplink pilots. Based on the channel estimates, the BSs perform maximum ratio transmission for the downlink. We then derive a closed-form spectral efficiency (SE) expression, where the channels are subject to correlated fading. Utilizing the derived expression, we propose a max-min power control algorithm to ensure that each user in the network receives a uniform quality of service. Numerical results demonstrate that, for the network considered in this work, optimizing for max-min SE through the max-min power control improves the sum SE of the network as compared to equal power allocation.

preprint2018arXiv

Location-Aware Pilot Allocation in Multi-Cell Multi-User Massive MIMO Networks

We propose a location-aware pilot allocation algorithm for a massive multiple-input multiple-output (MIMO) network with high-mobility users, where the wireless channels are subject to Rician fading. Pilot allocation in massive MIMO is a hard combinatorial problem and depends on the locations of users. As such, it is highly complex to achieve the optimal pilot allocation in real-time for a network with high-mobility users. Against this background, we propose a low-complexity pilot allocation algorithm, which exploits the behavior of line-of-sight (LOS) interference among the users and allocate the same pilot sequence to the users with small LOS interference. Our examination demonstrates that our proposed algorithm significantly outperforms the existing algorithms, even with localization errors. Specifically, for the system considered in this work, our proposed algorithm provides up to 37.26% improvement in sum spectral efficiency (SE) and improves the sum SE of the worst interference-affected users by up to 2.57 bits/sec/Hz, as compared to the existing algorithms.

preprint2016arXiv

Mitigating Pilot Contamination Through Location-Aware Pilot Assignment in Massive MIMO Networks

We propose a novel location-aware pilot assignment scheme to mitigate pilot contamination in massive multiple-input multiple-output (MIMO) networks, where the channels are subjected to Rician fading. Our proposed scheme utilizes the location information of users as the input to conduct pilot assignment in the network. Based on the location information, we first determine the line of sight (LOS) interference between the intended signal and the interfering signal. Our analysis reveals that the LOS interference converges to zero as the number of antennas at the base station (BS) goes to infinity, whereas for finite number of antennas at the BS the LOS interference indeed depends on specific pilot allocation strategies. Following this revelation, we assign pilot sequences to all the users in the massive MIMO network such that the LOS interference is minimized for finite number of antennas at the BS. Our proposed scheme outperforms the random pilot assignment in terms of achieving a much higher uplink sum rate for reasonable values of the Rician K-factor. Moreover, we propose a new performance metric, which measures the strength of the LOS interference and demonstrate that it is a good metric to analyze the performance of pilot assignment schemes. Theoretical analysis is provided and verified through numerical simulations to support the proposed scheme.

preprint2016arXiv

Multi-Cell Multiuser Massive MIMO Networks: User Capacity Analysis and Pilot Design

We propose a novel pilot sequence design to mitigate pilot contamination in multi-cell multiuser massive multiple-input multiple-output networks. Our proposed design generates pilot sequences in the multi-cell network and devises power allocation at base stations (BSs) for downlink transmission. The pilot sequences together with the power allocation ensure that the user capacity of the network is achieved and the pre-defined signal-to-interference-plus-noise ratio (SINR) requirements of all users are met. To realize our design, we first derive new closed-form expressions for the user capacity and the user capacity region. Built upon these expressions, we then develop a new algorithm to obtain the required pilot sequences and power allocation. We further determine the minimum number of antennas required at BSs to achieve certain SINR requirements of all users. Numerical results are presented to corroborate our analysis and to examine the impact of key parameters, such as the pilot sequence length and the total number of users, on the network performance. A pivotal conclusion is reached that our design achieves a larger user capacity region than the existing designs and needs less antennas at the BS to fulfill the pre-defined SINR requirements of all users in the network than the existing designs.

preprint2016arXiv

User Load Analysis and Pilot Sequence Design for Multi-Cell Massive MIMO Networks

We propose a novel algorithm to design user load-achieving pilot sequences that mitigate pilot contamination in multi-cell massive multiple-input multiple-output (MIMO) networks. To this end, we first derive expressions for the user load and the load region of the network considering both small-scale and large-scale propagation effects. We then develop the pilot sequence algorithm for multi-cell massive MIMO networks as per the rules of generalized Welch bound equality design. Notably, we find that our algorithm and the corresponding downlink power allocation ensure that the user load is achieved when the signal-to-interference-plus-noise ratio (SINR) requirements for the users lie within the load region. Furthermore, we demonstrate the performance advantage of our proposed design relative to the existing designs, in terms of a larger load region and a higher maximum permitted SINR. Finally, we show that our proposed design can satisfy the pre-defined SINR requirements for users with a finite number of antennas at the base station (BS), while the existing designs cannot satisfy the same requirements even with an infinite number of antennas at the BS.