Source author record

Wei Yu

Wei Yu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

70works

21topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

Recent event-based image reconstruction methods predominantly rely on Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to process complementary event information. However, these architectures face fundamental limitations: CNNs often fail to capture global feature correlations, whereas ViTs incur quadratic computational complexity (e.g., $O(n^2)$), hindering their application in high-resolution scenarios. To address these bottlenecks, we introduce EmambaIR, an Efficient visual State Space Model designed for image reconstruction using spatially sparse and temporally continuous event streams. Our framework introduces two key components: the cross-modal Top-k Sparse Attention Module (TSAM) and the Gated State-Space Module (GSSM). TSAM efficiently performs pixel-level top-k sparse attention to guide cross-modal interactions, yielding rich yet sparse fusion features. Subsequently, GSSM utilizes a nonlinear gated unit to enhance the temporal representation of vanilla linear-complexity ($O(n)$) SSMs, effectively capturing global contextual dependencies without the typical computational overhead. Extensive experiments on six datasets across three diverse image reconstruction tasks - motion deblurring, deraining, and High Dynamic Range (HDR) enhancement - demonstrate that EmambaIR significantly outperforms state-of-the-art methods while offering substantial reductions in memory consumption and computational cost. The source code and data are publicly available at: https://github.com/YunhangWickert/EmambaIR

preprint2026arXiv

Rationale-Grounded In-Context Learning for Time Series Reasoning with Multimodal Large Language Models

The underperformance of existing multimodal large language models for time series reasoning lies in the absence of rationale priors that connect temporal observations to their downstream outcomes, which leads models to rely on superficial pattern matching rather than principled reasoning. We therefore propose the rationale-grounded in-context learning for time series reasoning, where rationales work as guiding reasoning units rather than post-hoc explanations, and develop the RationaleTS method. Specifically, we firstly induce label-conditioned rationales, composed of reasoning paths from observable evidence to the potential outcomes. Then, we design the hybrid retrieval by balancing temporal patterns and semantic contexts to retrieve correlated rationale priors for the final in-context inference on new samples. We conduct extensive experiments to demonstrate the effectiveness and efficiency of our proposed RationaleTS on three-domain time series reasoning tasks. We will release our code for reproduction.

preprint2022arXiv

Active Sensing for Communications by Learning

This paper proposes a deep learning approach to a class of active sensing problems in wireless communications in which an agent sequentially interacts with an environment over a predetermined number of time frames to gather information in order to perform a sensing or actuation task for maximizing some utility function. In such an active learning setting, the agent needs to design an adaptive sensing strategy sequentially based on the observations made so far. To tackle such a challenging problem in which the dimension of historical observations increases over time, we propose to use a long short-term memory (LSTM) network to exploit the temporal correlations in the sequence of observations and to map each observation to a fixed-size state information vector. We then use a deep neural network (DNN) to map the LSTM state at each time frame to the design of the next measurement step. Finally, we employ another DNN to map the final LSTM state to the desired solution. We investigate the performance of the proposed framework for adaptive channel sensing problems in wireless communications. In particular, we consider the adaptive beamforming problem for mmWave beam alignment and the adaptive reconfigurable intelligent surface sensing problem for reflection alignment. Numerical results demonstrate that the proposed deep active sensing strategy outperforms the existing adaptive or nonadaptive sensing schemes.

preprint2022arXiv

Deep Learning for Channel Sensing and Hybrid Precoding in TDD Massive MIMO OFDM Systems

This paper proposes a deep learning approach to channel sensing and downlink hybrid beamforming for massive multiple-input multiple-output systems operating in the time division duplex mode and employing either single-carrier or multicarrier transmission. The conventional precoding design involves a two-step process of first estimating the high-dimensional channel, then designing the precoders based on such estimate. This two-step process is, however, not necessarily optimal. This paper shows that by using a learning approach to design the analog sensing and the hybrid downlink precoders directly from the received pilots without the intermediate high-dimensional channel estimation, the overall system performance can be significantly improved. Training a neural network to design the analog and digital precoders simultaneously is, however, difficult. Further, such an approach is not generalizable to systems with different number of users. In this paper, we develop a simplified and generalizable approach that learns the uplink sensing matrix and downlink analog precoder using a deep neural network that decomposes on a per-user basis, then designs the digital precoder based on the estimated low-dimensional equivalent channel. Numerical comparisons show that the proposed methodology results in significantly less training overhead and leads to an architecture that generalizes to various system settings.

preprint2022arXiv

Energy Efficient HARQ for Ultrareliability via Novel Outage Probability Bound and Geometric Programming

Hybrid automatic repeat 1 request (HARQ) is a key enabler for ultrareliable communications. This paper optimizes transmit power for the initial transmission and the subsequent retransmissions of HARQ with either incremental redundancy or Chase combining, aiming to minimize the expected energy consumption given the target outage probability and the target latency. The main challenge is due to the fact that the outage probability is a complicated function of the power variables which are nested in successive convolutions. The existing works mostly use a classic upper bound to approximate the outage probability by assuming unbounded transmit power, then convert the original problem to a geometric programming (GP) problem. In contrast, we propose a novel and much tighter upper bound by taking the practical power limit into consideration. The new bound and the resulting new GP method are further extended to a broader group of channel models with various fading, multiple antennas, and multiple receivers. As shown in simulations, the GP method based on the new bound significantly outperforms the existing strategies that either fix transmit power or optimize power by the classic bounding technique.

preprint2022arXiv

Interference Nulling Using Reconfigurable Intelligent Surface

This paper investigates the interference nulling capability of reconfigurable intelligent surface (RIS) in a multiuser environment where multiple single-antenna transceivers communicate simultaneously in a shared spectrum. From a theoretical perspective, we show that when the channels between the RIS and the transceivers have line-of-sight and the direct paths are blocked, it is possible to adjust the phases of the RIS elements to null out all the interference completely and to achieve the maximum $K$ degrees-of-freedom (DoF) in the overall $K$-user interference channel, provided that the number of RIS elements exceeds some finite value that depends on $K$. Algorithmically, for any fixed channel realization we formulate the interference nulling problem as a feasibility problem, and propose an alternating projection algorithm to efficiently solve the resulting nonconvex problem with local convergence guarantee. Numerical results show that the proposed alternating projection algorithm can null all the interference if the number of RIS elements is only slightly larger than a threshold of $2K(K-1)$. For the practical sum-rate maximization objective, this paper proposes to use the zero-forcing solution obtained from alternating projection as an initial point for subsequent Riemannian conjugate gradient optimization and shows that it has a significant performance advantage over random initializations. For the objective of maximizing the minimum rate, this paper proposes a subgradient projection method which is capable of achieving excellent performance at low complexity.

preprint2022arXiv

Joint Design of Hybrid Beamforming and Reflection Coefficients in RIS-aided mmWave MIMO Systems

This paper considers a reconfigurable intelligent surface (RIS)-aided millimeter wave (mmWave) downlink communication system where hybrid analog-digital beamforming is employed at the base station (BS). We formulate a power minimization problem by jointly optimizing hybrid beamforming at the BS and the response matrix at the RIS, under the signal-to-interference-plus-noise ratio (SINR) constraints at all users. The problem is highly challenging to solve due to the non-convex SINR constraints as well as the unit-modulus phase shift constraints for both the RIS reflection coefficients and the analog beamformer. A two-layer penalty-based algorithm is proposed to decouple variables in SINR constraints, and manifold optimization is adopted to handle the non-convex unit-modulus constraints. {We also propose a low-complexity sequential optimization method, which optimizes the RIS reflection coefficients, the analog beamformer, and the digital beamformer sequentially without iteration.} Furthermore, the relationship between the power minimization problem and the max-min fairness (MMF) problem is discussed. Simulation results show that the proposed penalty-based algorithm outperforms the state-of-the-art semidefinite relaxation (SDR)-based algorithm. Results also demonstrate that the RIS plays an important role in the power reduction.

preprint2022arXiv

Learning Based User Scheduling in Reconfigurable Intelligent Surface Assisted Multiuser Downlink

Reconfigurable intelligent surface (RIS) is capable of intelligently manipulating the phases of the incident electromagnetic wave to improve the wireless propagation environment between the base-station (BS) and the users. This paper addresses the joint user scheduling, RIS configuration, and BS beamforming problem in an RIS-assisted downlink network with limited pilot overhead. We show that graph neural networks (GNN) with permutation invariant and equivariant properties can be used to appropriately schedule users and to design RIS configurations to achieve high overall throughput while accounting for fairness among the users. As compared to the conventional methodology of first estimating the channels then optimizing the user schedule, RIS configuration and the beamformers, this paper shows that an optimized user schedule can be obtained directly from a very short set of pilots using a GNN, then the RIS configuration can be optimized using a second GNN, and finally the BS beamformers can be designed based on the overall effective channel. Numerical results show that the proposed approach can utilize the received pilots more efficiently than the conventional channel estimation based approach, and can generalize to systems with an arbitrary number of users.

preprint2022arXiv

Learning Progressive Distributed Compression Strategies from Local Channel State Information

This paper proposes a deep learning framework to design distributed compression strategies in which distributed agents need to compress high-dimensional observations of a source, then send the compressed bits via bandwidth limited links to a fusion center for source reconstruction. Further, we require the compression strategy to be progressive so that it can adapt to the varying link bandwidths between the agents and the fusion center. Moreover, to ensure scalability, we investigate strategies that depend only on the local channel state information (CSI) at each agent. Toward this end, we use a data-driven approach in which the progressive linear combination and uniform quantization strategy at each agent are trained as a function of its local CSI. To deal with the challenges of modeling the quantization operations (which always produce zero gradients in the training of neural networks), we propose a novel approach of exploiting the statistics of the batch training data to set the dynamic ranges of the uniform quantizers. Numerically, we show that the proposed distributed estimation strategy designed with only local CSI can significantly reduce the signaling overhead and can achieve a lower mean-squared error distortion for source reconstruction than state-of-the-art designs that require global CSI at comparable overall communication cost.

preprint2022arXiv

Modular Action Concept Grounding in Semantic Video Prediction

Recent works in video prediction have mainly focused on passive forecasting and low-level action-conditional prediction, which sidesteps the learning of interaction between agents and objects. We introduce the task of semantic action-conditional video prediction, which uses semantic action labels to describe those interactions and can be regarded as an inverse problem of action recognition. The challenge of this new task primarily lies in how to effectively inform the model of semantic action information. Inspired by the idea of Mixture of Experts, we embody each abstract label by a structured combination of various visual concept learners and propose a novel video prediction model, Modular Action Concept Network (MAC). Our method is evaluated on two newly designed synthetic datasets, CLEVR-Building-Blocks and Sapien-Kitchen, and one real-world dataset called Tower-Creation. Extensive experiments demonstrate that MAC can correctly condition on given instructions and generate corresponding future frames without need of bounding boxes. We further show that the trained model can make out-of-distribution generalization, be quickly adapted to new object categories and exploit its learnt features for object detection, showing the progression towards higher-level cognitive abilities. More visualizations can be found at http://www.pair.toronto.edu/mac/.

preprint2022arXiv

Orbital hybridization and electrostatic interaction in a double molecule transistor

Understanding the intermolecular interactions and utilize these interactions to effectively control the transport behavior of single molecule is the key step from single molecule device to molecular circuits1-6. Although many single molecule detection techniques are used to detect the molecular interaction at single-molecule level1,4,5,7,8, probing and tuning the intermolecular interaction all by electrical approaches has not been demonstrated. In this work, we successful assemble a double molecule transistor incorporating two manganese phthalocyanine molecules, on which we probe and tune the interaction in situ by implementing electrical manipulation on molecular orbitals using gate voltage. Orbital levels of the two molecules couple to each other and couple to the universal gate differently. Electrostatic interaction is observed when single electron changing in one molecule alters the transport behavior of the other, providing the information about the dynamic process of electron sequent tunneling through a molecule. Orbital hybridization is found when two orbital levels are put into degeneracy under non-equilibrium condition, making the tunneling electrons no longer localized to a specific molecule but shared by two molecules, offering a new mechanism to control charge transfer between non-covalent molecules. Current work offer a forelook into working principles of functional electrical unit based on single molecules.

preprint2022arXiv

Quasi-periodic oscillations of the X-ray burst from the magnetar SGR J1935+2154 and associated with the fast radio burst FRB 200428

The origin(s) and mechanism(s) of fast radio bursts (FRBs), which are short radio pulses from cosmological distances, have remained a major puzzle since their discovery. We report a strong Quasi-Periodic Oscillation(QPO) of 40 Hz in the X-ray burst from the magnetar SGR J1935+2154 and associated with FRB 200428, significantly detected with the Hard X-ray Modulation Telescope (Insight-HXMT) and also hinted by the Konus-Wind data. QPOs from magnetar bursts have only been rarely detected; our 3.4 sigma (p-value is 2.9e-4) detection of the QPO reported here reveals the strongest QPO signal observed from magnetars (except in some very rare giant flares), making this X-ray burst unique among magnetar bursts. The two X-ray spikes coinciding with the two FRB pulses are also among the peaks of the QPO. Our results suggest that at least some FRBs are related to strong oscillation processes of neutron stars. We also show that we may overestimate the significance of the QPO signal and underestimate the errors of QPO parameters if QPO exists only in a fraction of the time series of a X-ray burst which we use to calculate the Leahy-normalized periodogram.

preprint2022arXiv

Scheduling Versus Contention for Massive Random Access in Massive MIMO Systems

Massive machine-type communications protocols have typically been designed under the assumption that coordination between users requires significant communication overhead and is thus impractical. Recent progress in efficient activity detection and collision-free scheduling, however, indicates that the cost of coordination can be much less than the naive scheme for scheduling. This work considers a scenario in which a massive number of devices with sporadic traffic seek to access a massive multiple-input multiple-output (MIMO) base-station (BS) and explores an approach in which device activity detection is followed by a single common feedback broadcast message, which is used both to schedule the active users to different transmission slots and to assign orthogonal pilots to the users for channel estimation. The proposed coordinated communication scheme is compared to two prevalent contention-based schemes: coded pilot access, which is based on the principle of coded slotted ALOHA, and an approximate message passing scheme for joint user activity detection and channel estimation. Numerical results indicate that scheduled massive access provides significant gains in the number of successful transmissions per slot and in sum rate, due to the reduced interference, at only a small cost of feedback.

preprint2022arXiv

The accretion flow geometry of MAXI J1820+070 through broadband noise research with Insight-HXMT

Here we present a detailed study of the broadband noise in the power density spectra of the black hole X-ray binary MAXI J1820+070 during the hard state of its 2018 outburst, using the Hard X-ray Modulation Telescope (Insight-HXMT) observations. The broadband noise shows two main humps, which might separately correspond to variability from a variable disk and two Comptonization regions. We fitted the two humps with multiple Lorentzian functions and studied the energy-dependent properties of each component up to 100--150 keV and their evolution with spectral changes. The lowest frequency component is considered as the sub-harmonic of QPO component and shows different energy dependence compared with other broadband noise components. We found that although the fractional rms of all the broadband noise components mainly decrease with energy, their rms spectra are different in shape. Above $\sim$ 20--30 keV, the characteristic frequencies of these components increase sharply with energy, meaning that the high-energy component is more variable on short timescales. Our results suggest that the hot inner flow in MAXI J1820+070 is likely to be inhomogeneous. We propose a geometry with a truncated accretion disk, two Comptonization regions.

preprint2022arXiv

The evolution of the corona in MAXI J1535-571 through type-C quasi-periodic oscillations with Insight-HXMT

Type-C quasi-periodic oscillations (QPOs) in black hole X-ray transients can appear when the source is in the low-hard and hard-intermediate states. The spectral-timing evolution of the type-C QPO in MAXI J1535-571 has been recently studied with Insight-HXMT. Here we fit simultaneously the time-averaged energy spectrum, using a relativistic reflection model, and the fractional rms and phase-lag spectra of the type-C QPOs, using a recently developed time-dependent Comptonization model when the source was in the intermediate state. We show, for the first time, that the time-dependent Comptonization model can successfully explain the X-ray data up to 100 keV. We find that in the hard-intermediate state the frequency of the type-C QPO decreases from 2.6 Hz to 2.1 Hz, then increases to 3.3 Hz, and finally increases to ~ 9 Hz. Simultaneously with this, the evolution of corona size and the feedback fraction (the fraction of photons up-scattered in the corona that return to the disc) indicates the change of the morphology of the corona. Comparing with contemporaneous radio observations, this evolution suggests a possible connection between the corona and the jet when the system is in the hard-intermediate state and about to transit into the soft-intermediate state.

preprint2021arXiv

Deep Learning for Distributed Channel Feedback and Multiuser Precoding in FDD Massive MIMO

This paper shows that deep neural network (DNN) can be used for efficient and distributed channel estimation, quantization, feedback, and downlink multiuser precoding for a frequency-division duplex massive multiple-input multiple-output system in which a base station (BS) serves multiple mobile users, but with rate-limited feedback from the users to the BS. A key observation is that the multiuser channel estimation and feedback problem can be thought of as a distributed source coding problem. In contrast to the traditional approach where the channel state information (CSI) is estimated and quantized at each user independently, this paper shows that a joint design of pilots and a new DNN architecture, which maps the received pilots directly into feedback bits at the user side then maps the feedback bits from all the users directly into the precoding matrix at the BS, can significantly improve the overall performance. This paper further proposes robust design strategies with respect to channel parameters and also a generalizable DNN architecture for varying number of users and number of feedback bits. Numerical results show that the DNN-based approach with short pilot sequences and very limited feedback overhead can already approach the performance of conventional linear precoding schemes with full CSI.

preprint2021arXiv

Room temperature ferromagnetism of monolayer chromium telluride with perpendicular magnetic anisotropy

The realization of long-range magnetic ordering in two-dimensional (2D) systems can potentially revolutionize next-generation information technology. Here, we report the successful fabrication of crystalline Cr3Te4 monolayers with room temperature ferromagnetism. Using molecular beam epitaxy, the growth of 2D Cr3Te4 films with monolayer thickness is demonstrated at low substrate temperatures (~100C), compatible with Si CMOS technology. X-ray magnetic circular dichroism measurements reveal a Curie temperature (Tc) of ~344 K for the Cr3Te4 monolayer with an out-of-plane magnetic easy axis, which decreases to ~240 K for the thicker film (~ 7 nm) with an in-plane easy axis. The enhancement of ferromagnetic coupling and the magnetic anisotropy transition is ascribed to interfacial effects, in particular the orbital overlap at the monolayer Cr3Te4/graphite interface, supported by density-functional theory calculations. This work sheds light on the low-temperature scalable growth of 2D nonlayered materials with room temperature ferromagnetism for new magnetic and spintronic devices.

preprint2021arXiv

Spatial Deep Learning for Wireless Scheduling

The optimal scheduling of interfering links in a dense wireless network with full frequency reuse is a challenging task. The traditional method involves first estimating all the interfering channel strengths then optimizing the scheduling based on the model. This model-based method is however resource intensive and computationally hard because channel estimation is expensive in dense networks; furthermore, finding even a locally optimal solution of the resulting optimization problem may be computationally complex. This paper shows that by using a deep learning approach, it is possible to bypass the channel estimation and to schedule links efficiently based solely on the geographic locations of the transmitters and the receivers, due to the fact that in many propagation environments, the wireless channel strength is largely a function of the distance dependent path-loss. This is accomplished by unsupervised training over randomly deployed networks, and by using a novel neural network architecture that computes the geographic spatial convolutions of the interfering or interfered neighboring nodes along with subsequent multiple feedback stages to learn the optimum solution. The resulting neural network gives near-optimal performance for sum-rate maximization and is capable of generalizing to larger deployment areas and to deployments of different link densities. Moreover, to provide fairness, this paper proposes a novel scheduling approach that utilizes the sum-rate optimal scheduling algorithm over judiciously chosen subsets of links for maximizing a proportional fairness objective over the network. The proposed approach shows highly competitive and generalizable network utility maximization results.

preprint2020arXiv

Adaptive Semantic-Visual Tree for Hierarchical Embeddings

Merchandise categories inherently form a semantic hierarchy with different levels of concept abstraction, especially for fine-grained categories. This hierarchy encodes rich correlations among various categories across different levels, which can effectively regularize the semantic space and thus make predictions less ambiguous. However, previous studies of fine-grained image retrieval primarily focus on semantic similarities or visual similarities. In a real application, merely using visual similarity may not satisfy the need of consumers to search merchandise with real-life images, e.g., given a red coat as a query image, we might get a red suit in recall results only based on visual similarity since they are visually similar. But the users actually want a coat rather than suit even the coat is with different color or texture attributes. We introduce this new problem based on photoshopping in real practice. That's why semantic information are integrated to regularize the margins to make "semantic" prior to "visual". To solve this new problem, we propose a hierarchical adaptive semantic-visual tree (ASVT) to depict the architecture of merchandise categories, which evaluates semantic similarities between different semantic levels and visual similarities within the same semantic class simultaneously. The semantic information satisfies the demand of consumers for similar merchandise with the query while the visual information optimizes the correlations within the semantic class. At each level, we set different margins based on the semantic hierarchy and incorporate them as prior information to learn a fine-grained feature embedding. To evaluate our framework, we propose a new dataset named JDProduct, with hierarchical labels collected from actual image queries and official merchandise images on an online shopping application. Extensive experimental results on the public CARS196 and CUB-

preprint2020arXiv

DPCrowd: Privacy-preserving and Communication-efficient Decentralized Statistical Estimation for Real-time Crowd-sourced Data

In Internet of Things (IoT) driven smart-world systems, real-time crowd-sourced databases from multiple distributed servers can be aggregated to extract dynamic statistics from a larger population, thus providing more reliable knowledge for our society. Particularly, multiple distributed servers in a decentralized network can realize real-time collaborative statistical estimation by disseminating statistics from their separate databases. Despite no raw data sharing, the real-time statistics could still expose the data privacy of crowd-sourcing participants. For mitigating the privacy concern, while traditional differential privacy (DP) mechanism can be simply implemented to perturb the statistics in each timestamp and independently for each dimension, this may suffer a great utility loss from the real-time and multi-dimensional crowd-sourced data. Also, the real-time broadcasting would bring significant overheads in the whole network. To tackle the issues, we propose a novel privacy-preserving and communication-efficient decentralized statistical estimation algorithm (DPCrowd), which only requires intermittently sharing the DP protected parameters with one-hop neighbors by exploiting the temporal correlations in real-time crowd-sourced data. Then, with further consideration of spatial correlations, we develop an enhanced algorithm, DPCrowd+, to deal with multi-dimensional infinite crowd-data streams. Extensive experiments on several datasets demonstrate that our proposed schemes DPCrowd and DPCrowd+ can significantly outperform existing schemes in providing accurate and consensus estimation with rigorous privacy protection and great communication efficiency.

preprint2020arXiv

Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference

Edge machine learning can deliver low-latency and private artificial intelligent (AI) services for mobile devices by leveraging computation and storage resources at the network edge. This paper presents an energy-efficient edge processing framework to execute deep learning inference tasks at the edge computing nodes whose wireless connections to mobile devices are prone to channel uncertainties. Aimed at minimizing the sum of computation and transmission power consumption with probabilistic quality-of-service (QoS) constraints, we formulate a joint inference tasking and downlink beamforming problem that is characterized by a group sparse objective function. We provide a statistical learning based robust optimization approach to approximate the highly intractable probabilistic-QoS constraints by nonconvex quadratic constraints, which are further reformulated as matrix inequalities with a rank-one constraint via matrix lifting. We design a reweighted power minimization approach by iteratively reweighted $\ell_1$ minimization with difference-of-convex-functions (DC) regularization and updating weights, where the reweighted approach is adopted for enhancing group sparsity whereas the DC regularization is designed for inducing rank-one solutions. Numerical results demonstrate that the proposed approach outperforms other state-of-the-art approaches.

preprint2020arXiv

Enhanced Channel Estimation in Massive MIMO via Coordinated Pilot Design

Pilot contamination is a limiting factor in multicell massive multiple-input multiple-output (MIMO) systems because it can severely impair channel estimation. Prior works have suggested coordinating pilot design across cells in order to reduce the channel estimation error caused by pilot contamination. In this paper, we propose a method for coordinated pilot design using fractional programming to minimize the weighted mean squared-error (MSE) in channel estimation. In particular, we apply the recently proposed quadratic transform to the MSE expression which allows the effect of pilot contamination to be decoupled. The resulting problem reformulation enables the pilots to be optimized in closed form if they can be designed arbitrarily. When the pilots are restricted to a given set of orthogonal sequences, pilot optimization reduces to an assignment problem which can be solved by weighted bipartite matching. Furthermore, we consider the max-min fairness of data rates with orthogonal pilots and obtain an extension of the proposed method to correlated Rayleigh fading. Finally, simulations demonstrate the advantage of the proposed (orthogonal and nonorthogonal) pilot designs as compared with state-of-the-art methods in combating pilot contamination.

preprint2020arXiv

Information Relaxation and A Duality-Driven Algorithm for Stochastic Dynamic Programs

We use the technique of information relaxation to develop a duality-driven iterative approach to obtaining and improving confidence interval estimates for the true value of finite-horizon stochastic dynamic programming problems. We show that the sequence of dual value estimates yielded from the proposed approach in principle monotonically converges to the true value function in a finite number of dual iterations. Aiming to overcome the curse of dimensionality in various applications, we also introduce a regression-based Monte Carlo algorithm for implementation. The new approach can be used not only to assess the quality of heuristic policies, but also to improve them if we find that their duality gap is large. We obtain the convergence rate of our Monte Carlo method in terms of the amounts of both basis functions and the sampled states. Finally, we demonstrate the effectiveness of our method in an optimal order execution problem with market friction and in an inventory management problem in the presence of lost sale and lead time. Both examples are well known in the literature to be difficult to solve for optimality. The experiments show that our method can significantly improve the heuristics suggested in the literature and obtain new policies with a satisfactory performance guarantee.

preprint2020arXiv

Joint User Identification, Channel Estimation, and Signal Detection for Grant-Free NOMA

For massive machine-type communications, centralized control may incur a prohibitively high overhead. Grant-free non-orthogonal multiple access (NOMA) provides possible solutions, yet poses new challenges for efficient receiver design. In this paper, we develop a joint user identification, channel estimation, and signal detection (JUICESD) algorithm. We divide the whole detection scheme into two modules: slot-wise multi-user detection (SMD) and combined signal and channel estimation (CSCE). SMD is designed to decouple the transmissions of different users by leveraging the approximate message passing (AMP) algorithms, and CSCE is designed to deal with the nonlinear coupling of activity state, channel coefficient and transmit signal of each user separately. To address the problem that the exact calculation of the messages exchanged within CSCE and between the two modules is complicated due to phase ambiguity issues, this paper proposes a rotationally invariant Gaussian mixture (RIGM) model, and develops an efficient JUICESD-RIGM algorithm. JUICESD-RIGM achieves a performance close to JUICESD with a much lower complexity. Capitalizing on the feature of RIGM, we further analyze the performance of JUICESD-RIGM with state evolution techniques. Numerical results demonstrate that the proposed algorithms achieve a significant performance improvement over the existing alternatives, and the derived state evolution method predicts the system performance accurately.

preprint2020arXiv

Massive Access for 5G and Beyond

Massive access, also known as massive connectivity or massive machine-type communication (mMTC), is one of the main use cases of the fifth-generation (5G) and beyond 5G (B5G) wireless networks. A typical application of massive access is the cellular Internet of Things (IoT). Different from conventional human-type communication, massive access aims at realizing efficient and reliable communications for a massive number of IoT devices. Hence, the main characteristics of massive access include low power, massive connectivity, and broad coverage, which require new concepts, theories, and paradigms for the design of next-generation cellular networks. This paper presents a comprehensive survey of aspects of massive access design for B5G wireless networks. Specifically, we provide a detailed review of massive access from the perspectives of theory, protocols, techniques, coverage, energy, and security. Furthermore, several future research directions and challenges are identified.

preprint2020arXiv

Multi-Agent Reinforcement Learning for Adaptive User Association in Dynamic mmWave Networks

Network densification and millimeter-wave technologies are key enablers to fulfill the capacity and data rate requirements of the fifth generation (5G) of mobile networks. In this context, designing low-complexity policies with local observations, yet able to adapt the user association with respect to the global network state and to the network dynamics is a challenge. In fact, the frameworks proposed in literature require continuous access to global network information and to recompute the association when the radio environment changes. With the complexity associated to such an approach, these solutions are not well suited to dense 5G networks. In this paper, we address this issue by designing a scalable and flexible algorithm for user association based on multi-agent reinforcement learning. In this approach, users act as independent agents that, based on their local observations only, learn to autonomously coordinate their actions in order to optimize the network sum-rate. Since there is no direct information exchange among the agents, we also limit the signaling overhead. Simulation results show that the proposed algorithm is able to adapt to (fast) changes of radio environment, thus providing large sum-rate gain in comparison to state-of-the-art solutions.

preprint2020arXiv

Optimal Virtual Network Function Deployment for 5G Network Slicing in a Hybrid Cloud Infrastructure

Network virtualization is a key enabler for 5G systems to support the expected use cases of vertical markets. In this context, we study the joint optimal deployment of Virtual Network Functions (VNFs) and allocation of computational resources in a hybrid cloud infrastructure by taking the requirements of the 5G services and the characteristics of the cloud architecture into consideration. The resulting mixed-integer problem is reformulated as an integer linear problem, which can be solved by using a standard solver. Our results underline the advantages of a hybrid infrastructure over a standard centralized radio access network consisting only of a central cloud, and show that the proposed mechanism to deploy VNF chains leads to high resource utilization efficiency and large gains in terms of the number of supported VNF chains. To deal with the computational complexity of optimizing a large number of clouds and VNF chains, we propose a simple low-complexity heuristic that attempts to find a feasible VNF deployment solution with a limited number of functional splits. Numerical results indicate that the performance of the proposed heuristic is close to the optimal one when the edge clouds are well dimensioned with respect to the computational requirements of the 5G services.

preprint2020arXiv

Optimizing Downlink Resource Allocation in Multiuser MIMO Networks via Fractional Programming and the Hungarian Algorithm

Optimizing the sum-log-utility for the downlink of multi-frequency band, multiuser, multiantenna networks requires joint solutions to the associated beamforming and user scheduling problems through the use of cloud radio access network (CRAN) architecture; optimizing such a network is, however, non-convex and NP-hard. In this paper, we present a novel iterative beamforming and scheduling strategy based on fractional programming and the Hungarian algorithm. The beamforming strategy allows us to iteratively maximize the chosen objective function in a fashion similar to block coordinate ascent. Furthermore, based on the crucial insight that, in the downlink, the interference pattern remains fixed for a given set of beamforming weights, we use the Hungarian algorithm as an efficient approach to optimally schedule users for the given set of beamforming weights. Specifically, this approach allows us to select the best subset of users (amongst the larger set of all available users). Our simulation results show that, in terms of average sum-log-utility, as well as sum-rate, the proposed scheme substantially outperforms both the state-of-the-art multicell weighted minimum mean-squared error (WMMSE) and greedy proportionally fair WMMSE schemes, as well as standard interior-point and sequential quadratic solvers. Importantly, our proposed scheme is also far more computationally efficient than the multicell WMMSE scheme.

preprint2020arXiv

Products-10K: A Large-scale Product Recognition Dataset

With the rapid development of electronic commerce, the way of shopping has experienced a revolutionary evolution. To fully meet customers' massive and diverse online shopping needs with quick response, the retailing AI system needs to automatically recognize products from images and videos at the stock-keeping unit (SKU) level with high accuracy. However, product recognition is still a challenging task, since many of SKU-level products are fine-grained and visually similar by a rough glimpse. Although there are already some products benchmarks available, these datasets are either too small (limited number of products) or noisy-labeled (lack of human labeling). In this paper, we construct a human-labeled product image dataset named "Products-10K", which contains 10,000 fine-grained SKU-level products frequently bought by online customers in JD.com. Based on our new database, we also introduced several useful tips and tricks for fine-grained product recognition. The products-10K dataset is available via https://products-10k.github.io/.

preprint2020arXiv

Semi-Submersible Wind Turbine Hull Shape Design for a Favorable System Response Behavior

Floating offshore wind turbines are a novel technology, which has reached, with the first wind farm in operation, an advanced state of development. The question of how floating wind systems can be optimized to operate smoothly in harsh wind and wave conditions is the subject of the present work. An integrated optimization was conducted, where the hull shape of a semi-submersible, as well as the wind turbine controller were varied with the goal of finding a cost-efficient design, which does not respond to wind and wave excitations, resulting in small structural fatigue and extreme loads. The optimum design was found to have a remarkably low tower-base fatigue load response and small rotor fore-aft amplitudes. Further investigations showed that the reason for the good dynamic behavior is a particularly favorable response to first-order wave loads: The floating wind turbine rotates in pitch-direction about a point close to the rotor hub and the rotor fore-aft motion is almost unaffected by the wave excitation. As a result, the power production and the blade loads are not influenced by the waves. A comparable effect was so far known for Tension Leg Platforms but not for semi-submersible wind turbines. The methodology builds on a low-order simulation model, coupled to a parametric panel code model, a detailed viscous drag model and an individually tuned blade pitch controller. The results are confirmed by the higher-fidelity model FAST. A new indicator to express the optimal behavior through a single design criterion has been developed.

preprint2020arXiv

Signal-Dependent Performance Analysis of Orthogonal Matching Pursuit for Exact Sparse Recovery

Exact recovery of $K$-sparse signals $x \in \mathbb{R}^{n}$ from linear measurements $y=Ax$, where $A\in \mathbb{R}^{m\times n}$ is a sensing matrix, arises from many applications. The orthogonal matching pursuit (OMP) algorithm is widely used for reconstructing $x$. A fundamental question in the performance analysis of OMP is the characterizations of the probability of exact recovery of $x$ for random matrix $A$ and the minimal $m$ to guarantee a target recovery performance. In many practical applications, in addition to sparsity, $x$ also has some additional properties. This paper shows that these properties can be used to refine the answer to the above question. In this paper, we first show that the prior information of the nonzero entries of $x$ can be used to provide an upper bound on $\|x\|_1^2/\|x\|_2^2$. Then, we use this upper bound to develop a lower bound on the probability of exact recovery of $x$ using OMP in $K$ iterations. Furthermore, we develop a lower bound on the number of measurements $m$ to guarantee that the exact recovery probability using $K$ iterations of OMP is no smaller than a given target probability. Finally, we show that when $K=O(\sqrt{\ln n})$, as both $n$ and $K$ go to infinity, for any $0<ζ\leq 1/\sqrtπ$, $m=2K\ln (n/ζ)$ measurements are sufficient to ensure that the probability of exact recovering any $K$-sparse $x$ is no lower than $1-ζ$ with $K$ iterations of OMP. For $K$-sparse $α$-strongly decaying signals and for $K$-sparse $x$ whose nonzero entries independently and identically follow the Gaussian distribution, the number of measurements sufficient for exact recovery with probability no lower than $1-ζ$ reduces further to $m=(\sqrt{K}+4\sqrt{\frac{α+1}{α-1}\ln(n/ζ)})^2$ and asymptotically $m\approx 1.9K\ln (n/ζ)$, respectively.

preprint2020arXiv

Tunable Anisotropic Thermal Transport in Super-Aligned Carbon Nanotube Films

Super-aligned carbon nanotube (CNT) films have intriguing anisotropic thermal transport properties due to the anisotropic nature of individual nanotubes and the important role of nanotube alignment. However, the relationship between the alignment and the anisotropic thermal conductivities was not well understood due to the challenges in both the preparation of high-quality super-aligned CNT film samples and the thermal characterization of such highly anisotropic and porous thin films. Here, super-aligned CNT films with different alignment configurations are designed and their anisotropic thermal conductivities are measured using time-domain thermoreflectance (TDTR) with an elliptical-beam approach. The results suggest that the alignment configuration could tune the cross-plane thermal conductivity k_z from 6.4 to 1.5 W/mK and the in-plane anisotropic ratio from 1.2 to 13.5. This work confirms the important role of CNT alignment in tuning the thermal transport properties of super-aligned CNT films and provides an efficient way to design thermally anisotropic films for thermal management.

preprint2016arXiv

A Stochastic Analysis of Network MIMO Systems

This paper quantifies the benefits and limitations of cooperative communications by providing a statistical analysis of the downlink in network multiple-input multiple-output (MIMO) systems. We consider an idealized model where the multiple-antenna base-stations (BSs) are distributed according to a homogeneous Poisson point process and cooperate by forming disjoint clusters. We assume that perfect channel state information (CSI) is available at the cooperating BSs without any overhead. Multiple single-antenna users are served using zero-forcing beamforming with equal power allocation across the beams. For such a system, we obtain tractable, but accurate, approximations of the signal power and inter-cluster interference power distributions and derive a computationally efficient expression for the achievable per-BS ergodic sum rate using tools from stochastic geometry. This expression allows us to obtain the optimal loading factor, i.e., the ratio between the number of scheduled users and the number of BS antennas, that maximizes the per-BS ergodic sum rate. Further, it allows us to quantify the performance improvement of network MIMO systems as a function of the cooperating cluster size. We show that to perform zero-forcing across the distributed set of BSs within the cluster, the network MIMO system introduces a penalty in received signal power. Along with the inevitable out-of-cluster interference, we show that the per-BS ergodic sum rate of a network MIMO system does not approach that of an isolated cell even at unrealistically large cluster sizes. Nevertheless, network MIMO does provide significant rate improvement as compared to uncoordinated single-cell processing even at relatively modest cluster sizes.

preprint2016arXiv

An uplink-downlink duality for cloud radio access network

Uplink-downlink duality refers to the fact that the Gaussian broadcast channel has the same capacity region as the dual Gaussian multiple-access channel under the same sumpower constraint. This paper investigates a similar duality relationship between the uplink and downlink of a cloud radio access network (C-RAN), where a central processor (CP) cooperatively serves multiple mobile users through multiple remote radio heads (RRHs) connected to the CP with finite-capacity fronthaul links. The uplink of such a C-RAN model corresponds to a multipleaccess relay channel; the downlink corresponds to a broadcast relay channel. This paper considers compression-based relay strategies in both uplink and downlink C-RAN, where the quantization noise levels are functions of the fronthaul link capacities. If the fronthaul capacities are infinite, the conventional uplinkdownlink duality applies. The main result of this paper is that even when the fronthaul capacities are finite, duality continues to hold for the case where independent compression is applied across each RRH in the sense that when the transmission and compression designs are jointly optimized, the achievable rate regions of the uplink and downlink remain identical under the same sum-power and individual fronthaul capacity constraints. As an application of the duality result, the power minimization problem in downlink C-RAN can be efficiently solved based on its uplink counterpart.

preprint2016arXiv

Content-Centric Sparse Multicast Beamforming for Cache-Enabled Cloud RAN

This paper presents a content-centric transmission design in a cloud radio access network (cloud RAN) by incorporating multicasting and caching. Users requesting a same content form a multicast group and are served by a same cluster of base stations (BSs) cooperatively. Each BS has a local cache and it acquires the requested contents either from its local cache or from the central processor (CP) via backhaul links. We investigate the dynamic content-centric BS clustering and multicast beamforming with respect to both channel condition and caching status. We first formulate a mixed-integer nonlinear programming problem of minimizing the weighted sum of backhaul cost and transmit power under the quality-of-service constraint for each multicast group. Theoretical analysis reveals that all the BSs caching a requested content can be included in the BS cluster of this content, regardless of the channel conditions. Then we reformulate an equivalent sparse multicast beamforming (SBF) problem. By adopting smoothed $\ell_0$-norm approximation and other techniques, the SBF problem is transformed into the difference of convex (DC) programs and effectively solved using the convex-concave procedure algorithms. Simulation results demonstrate significant advantage of the proposed content-centric transmission. The effects of three heuristic caching strategies are also evaluated.

preprint2016arXiv

Energy Efficiency of Downlink Transmission Strategies for Cloud Radio Access Networks

This paper studies the energy efficiency of the cloud radio access network (C-RAN), specifically focusing on two fundamental and different downlink transmission strategies, namely the data-sharing strategy and the compression strategy. In the data-sharing strategy, the backhaul links connecting the central processor (CP) and the base-stations (BSs) are used to carry user messages -- each user's messages are sent to multiple BSs; the BSs locally form the beamforming vectors then cooperatively transmit the messages to the user. In the compression strategy, the user messages are precoded centrally at the CP, which forwards a compressed version of the analog beamformed signals to the BSs for cooperative transmission. This paper compares the energy efficiencies of the two strategies by formulating an optimization problem of minimizing the total network power consumption subject to user target rate constraints, where the total network power includes the BS transmission power, BS activation power, and load-dependent backhaul power. To tackle the discrete and nonconvex nature of the optimization problems, we utilize the techniques of reweighted $\ell_1$ minimization and successive convex approximation to devise provably convergent algorithms. Our main finding is that both the optimized data-sharing and compression strategies in C-RAN achieve much higher energy efficiency as compared to the non-optimized coordinated multi-point transmission, but their comparative effectiveness in energy saving depends on the user target rate. At low user target rate, data-sharing consumes less total power than compression, however, as the user target rate increases, the backhaul power consumption for data-sharing increases significantly leading to better energy efficiency of compression at the high user rate regime.

preprint2016arXiv

Generation of intense circularly polarized attosecond light bursts from relativistic laser plasmas

We have investigated the polarization of attosecond light bursts generated by nanobunches of electrons from relativistic few-cycle laser pulse interaction with the surface of overdense plasmas. Particle-in-cell simulation shows that the polarization state of the generated attosecond burst depends on the incident-pulse polarization, duration, carrier envelope phase, as well as the plasma scale length. Through laser and plasma parameter control, without compromise of generation efficiency, a linearly polarized laser pulse with azimuth $θ^i=10^\circ$ can generate an elliptically polarized attosecond burst with azimuth $|θ^r_{\rm atto}|\approx61^\circ$ and ellipticity $σ^r_{\rm atto}\approx0.27$; while an elliptically polarized laser pulse with $σ^i\approx0.36$ can generate an almost circularly polarized attosecond burst with $σ^r_{\rm atto}\approx0.95$. The results propose a new way to a table-top circularly polarized XUV source as a probe with attosecond scale time resolution for many advanced applications.

preprint2016arXiv

Hybrid Digital and Analog Beamforming Design for Large-Scale Antenna Arrays

The potential of using of millimeter wave (mmWave) frequency for future wireless cellular communication systems has motivated the study of large-scale antenna arrays for achieving highly directional beamforming. However, the conventional fully digital beamforming methods which require one radio frequency (RF) chain per antenna element is not viable for large-scale antenna arrays due to the high cost and high power consumption of RF chain components in high frequencies. To address the challenge of this hardware limitation, this paper considers a hybrid beamforming architecture in which the overall beamformer consists of a low-dimensional digital beamformer followed by an RF beamformer implemented using analog phase shifters. Our aim is to show that such an architecture can approach the performance of a fully digital scheme with much fewer number of RF chains. Specifically, this paper establishes that if the number of RF chains is twice the total number of data streams, the hybrid beamforming structure can realize any fully digital beamformer exactly, regardless of the number of antenna elements. For cases with fewer number of RF chains, this paper further considers the hybrid beamforming design problem for both the transmission scenario of a point-to-point multipleinput multiple-output (MIMO) system and a downlink multiuser multiple-input single-output (MU-MISO) system. For each scenario, we propose a heuristic hybrid beamforming design that achieves a performance close to the performance of the fully digital beamforming baseline. Finally, the proposed algorithms are modified for the more practical setting in which only finite resolution phase shifters are available. Numerical simulations show that the proposed schemes are effective even when phase shifters with very low resolution are used.

preprint2016arXiv

On Optimal Fronthaul Compression and Decoding Strategies for Uplink Cloud Radio Access Networks

This paper investigates the compress-and-forward scheme for an uplink cloud radio access network (C-RAN) model, where multi-antenna base-stations (BSs) are connected to a cloud-computing based central processor (CP) via capacity-limited fronthaul links. The BSs compress the received signals with Wyner-Ziv coding and send the representation bits to the CP; the CP performs the decoding of all the users' messages. Under this setup, this paper makes progress toward the optimal structure of the fronthaul compression and CP decoding strategies for the compress-and-forward scheme in C-RAN. On the CP decoding strategy design, this paper shows that under a sum fronthaul capacity constraint, a generalized successive decoding strategy of the quantization and user message codewords that allows arbitrary interleaved order at the CP achieves the same rate region as the optimal joint decoding. Further, it is shown that a practical strategy of successively decoding the quantization codewords first, then the user messages, achieves the same maximum sum rate as joint decoding under individual fronthaul constraints. On the joint optimization of user transmission and BS quantization strategies, this paper shows that if the input distributions are assumed to be Gaussian, then under joint decoding, the optimal quantization scheme for maximizing the achievable rate region is Gaussian. Moreover, Gaussian input and Gaussian quantization with joint decoding achieve to within a constant gap of the capacity region of the Gaussian multiple-input multiple-output (MIMO) uplink C-RAN model. Finally, this paper addresses the computational aspect of optimizing uplink MIMO C-RAN by showing that under fixed Gaussian input, the sum rate maximization problem over the Gaussian quantization noise covariance matrices can be formulated as convex optimization problems, thereby facilitating its efficient solution.

preprint2016arXiv

Role of Interference Alignment in Wireless Cellular Network Optimization

The emergence of interference alignment (IA) as a degrees-of-freedom optimal strategy motivates the need to investigate whether IA can be leveraged to aid conventional network optimization algorithms that are only capable of finding locally optimal solutions. To test the usefulness of IA in this context, this paper proposes a two-stage optimization framework for the downlink of a $G$-cell multi-antenna network with $K$ users/cell. The first stage of the proposed framework focuses on nulling interference from a set of dominant interferers using IA, while the second stage optimizes transmit and receive beamformers to maximize a network-wide utility using the IA solution as the initial condition. Further, this paper establishes a set of new feasibility results for partial IA that can be used to guide the number of dominant interferers to be nulled in the first stage. Through simulations on specific topologies of a cluster of base-stations, it is observed that the impact of IA depends on the choice of the utility function and the presence of out-of-cluster interference. In the absence of out-of-cluster interference, the proposed framework outperforms straightforward optimization when maximizing the minimum rate, while providing marginal gains when maximizing sum-rate. However, the benefit of IA is greatly diminished in the presence of significant out-of-cluster interference.

preprint2015arXiv

Cloud Radio Access Network: Virtualizing Wireless Access for Dense Heterogeneous Systems

Cloud Radio Access Network (C-RAN) refers to the virtualization of base station functionalities by means of cloud computing. This results in a novel cellular architecture in which low-cost wireless access points, known as radio units (RUs) or remote radio heads (RRHs), are centrally managed by a reconfigurable centralized "cloud", or central, unit (CU). C-RAN allows operators to reduce the capital and operating expenses needed to deploy and maintain dense heterogeneous networks. This critical advantage, along with spectral efficiency, statistical multiplexing and load balancing gains, make C-RAN well positioned to be one of the key technologies in the development of 5G systems. In this paper, a succinct overview is presented regarding the state of the art on the research on C-RAN with emphasis on fronthaul compression, baseband processing, medium access control, resource allocation, system-level considerations and standardization efforts.

preprint2015arXiv

Content-Centric Multicast Beamforming in Cache-Enabled Cloud Radio Access Networks

Multicast transmission and wireless caching are effective ways of reducing air and backhaul traffic load in wireless networks. This paper proposes to incorporate these two key ideas for content-centric multicast transmission in a cloud radio access network (RAN) where multiple base stations (BSs) are connected to a central processor (CP) via finite-capacity backhaul links. Each BS has a cache with finite storage size and is equipped with multiple antennas. The BSs cooperatively transmit contents, which are either stored in the local cache or fetched from the CP, to multiple users in the network. Users requesting a same content form a multicast group and are served by a same cluster of BSs cooperatively using multicast beamforming. Assuming fixed cache placement, this paper investigates the joint design of multicast beamforming and content-centric BS clustering by formulating an optimization problem of minimizing the total network cost under the quality-of-service (QoS) constraints for each multicast group. The network cost involves both the transmission power and the backhaul cost. We model the backhaul cost using the mixed $\ell_0/\ell_2$-norm of beamforming vectors. To solve this non-convex problem, we first approximate it using the semidefinite relaxation (SDR) method and concave smooth functions. We then propose a difference of convex functions (DC) programming algorithm to obtain suboptimal solutions and show the connection of three smooth functions. Simulation results validate the advantage of multicasting and show the effects of different cache size and caching policies in cloud RAN.

preprint2015arXiv

Inverse regression for longitudinal data

Sliced inverse regression (Duan and Li [Ann. Statist. 19 (1991) 505-530], Li [J. Amer. Statist. Assoc. 86 (1991) 316-342]) is an appealing dimension reduction method for regression models with multivariate covariates. It has been extended by Ferré and Yao [Statistics 37 (2003) 475-488, Statist. Sinica 15 (2005) 665-683] and Hsing and Ren [Ann. Statist. 37 (2009) 726-755] to functional covariates where the whole trajectories of random functional covariates are completely observed. The focus of this paper is to develop sliced inverse regression for intermittently and sparsely measured longitudinal covariates. We develop asymptotic theory for the new procedure and show, under some regularity conditions, that the estimated directions attain the optimal rate of convergence. Simulation studies and data analysis are also provided to demonstrate the performance of our method.

preprint2015arXiv

On learning optimized reaction diffusion processes for effective image restoration

For several decades, image restoration remains an active research topic in low-level computer vision and hence new approaches are constantly emerging. However, many recently proposed algorithms achieve state-of-the-art performance only at the expense of very high computation time, which clearly limits their practical relevance. In this work, we propose a simple but effective approach with both high computational efficiency and high restoration quality. We extend conventional nonlinear reaction diffusion models by several parametrized linear filters as well as several parametrized influence functions. We propose to train the parameters of the filters and the influence functions through a loss based approach. Experiments show that our trained nonlinear reaction diffusion models largely benefit from the training of the parameters and finally lead to the best reported performance on common test datasets for image restoration. Due to their structural simplicity, our trained models are highly efficient and are also well-suited for parallel computation on GPUs.

preprint2015arXiv

The Improved Job Scheduling Algorithm of Hadoop Platform

This paper discussed some job scheduling algorithms for Hadoop platform, and proposed a jobs scheduling optimization algorithm based on Bayes Classification viewing the shortcoming of those algorithms which are used. The proposed algorithm can be summarized as follows. In the scheduling algorithm based on Bayes Classification, the jobs in job queue will be classified into bad job and good job by Bayes Classification, when JobTracker gets task request, it will select a good job from job queue, and select tasks from good job to allocate JobTracker, then the execution result will feedback to the JobTracker. Therefore the scheduling algorithm based on Bayes Classification influence the job classification via learning the result of feedback with the JobTracker will select the most appropriate job to execute on TaskTracker every time. We need to consider the feature usage of job resource and the influence of TaskTracker resource on task execution, the former of which we call it job feature, for instance, the average usage rate of CPU and average usage rate of memory, the latter node feature, such as the usage rate of CPU and the size of idle physical memory, the two are called feature variables. Results show that it has a significant improvement in execution efficiency and stability of job scheduling.

preprint2014arXiv

Distributed Pricing-Based User Association for Downlink Heterogeneous Cellular Networks

This paper considers the optimization of the user and base-station (BS) association in a wireless downlink heterogeneous cellular network under the proportional fairness criterion. We first consider the case where each BS has a single antenna and transmits at fixed power, and propose a distributed price update strategy for a pricing-based user association scheme, in which the users are assigned to the BS based on the value of a utility function minus a price. The proposed price update algorithm is based on a coordinate descent method for solving the dual of the network utility maximization problem, and it has a rigorous performance guarantee. The main advantage of the proposed algorithm as compared to the existing subgradient method for price update is that the proposed algorithm is independent of parameter choices and can be implemented asynchronously. Further, this paper considers the joint user association and BS power control problem, and proposes an iterative dual coordinate descent and the power optimization algorithm that significantly outperforms existing approaches. Finally, this paper considers the joint user association and BS beamforming problem for the case where the BSs are equipped with multiple antennas and spatially multiplex multiple users. We incorporate dual coordinate descent with the weighted minimum mean-squared error (WMMSE) algorithm, and show that it achieves nearly the same performance as a computationally more complex benchmark algorithm (which applies the WMMSE algorithm on the entire network for BS association), while avoiding excessive BS handover.

preprint2014arXiv

Large-Scale MIMO versus Network MIMO for Multicell Interference Mitigation

This paper compares two important downlink multicell interference mitigation techniques, namely, large-scale (LS) multiple-input multiple-output (MIMO) and network MIMO. We consider a cooperative wireless cellular system operating in time-division duplex (TDD) mode, wherein each cooperating cluster includes $B$ base-stations (BSs), each equipped with multiple antennas and scheduling $K$ single-antenna users. In an LS-MIMO system, each BS employs $BM$ antennas not only to serve its scheduled users, but also to null out interference caused to the other users within the cooperating cluster using zero-forcing (ZF) beamforming. In a network MIMO system, each BS is equipped with only $M$ antennas, but interference cancellation is realized by data and channel state information exchange over the backhaul links and joint downlink transmission using ZF beamforming. Both systems are able to completely eliminate intra-cluster interference and to provide the same number of spatial degrees of freedom per user. Assuming the uplink-downlink channel reciprocity provided by TDD, both systems are subject to identical channel acquisition overhead during the uplink pilot transmission stage. Further, the available sum power at each cluster is fixed and assumed to be equally distributed across the downlink beams in both systems. Building upon the channel distribution functions and using tools from stochastic ordering, this paper shows, however, that from a performance point of view, users experience better quality of service, averaged over small-scale fading, under an LS-MIMO system than a network MIMO system. Numerical simulations for a multicell network reveal that this conclusion also holds true with regularized ZF beamforming scheme. Hence, given the likely lower cost of adding excess number of antennas at each BS, LS-MIMO could be the preferred route toward interference mitigation in cellular networks.

preprint2014arXiv

Optimized Backhaul Compression for Uplink Cloud Radio Access Network

This paper studies the uplink of a cloud radio access network (C-RAN) where the cell sites are connected to a cloud-computing-based central processor (CP) with noiseless backhaul links with finite capacities. We employ a simple compress-and-forward scheme in which the base-stations(BSs) quantize the received signals and send the quantized signals to the CP using either distributed Wyner-Ziv coding or single-user compression. The CP decodes the quantization codewords first, then decodes the user messages as if the remote users and the cloud center form a virtual multiple-access channel (VMAC). This paper formulates the problem of optimizing the quantization noise levels for weighted sum rate maximization under a sum backhaul capacity constraint. We propose an alternating convex optimization approach to find a local optimum solution to the problem efficiently, and more importantly, establish that setting the quantization noise levels to be proportional to the background noise levels is near optimal for sum-rate maximization when the signal-to-quantization-noise ratio (SQNR) is high. In addition, with Wyner-Ziv coding, the approximate quantization noise level is shown to achieve the sum-capacity of the uplink C-RAN model to within a constant gap. With single-user compression, a similar constant-gap result is obtained under a diagonal dominant channel condition. These results lead to an efficient algorithm for allocating the backhaul capacities in C-RAN. The performance of the proposed scheme is evaluated for practical multicell and heterogeneous networks. It is shown that multicell processing with optimized quantization noise levels across the BSs can significantly improve the performance of wireless cellular networks.

preprint2014arXiv

Optimizing User Association and Spectrum Allocation in HetNets: A Utility Perspective

The joint user association and spectrum allocation problem is studied for multi-tier heterogeneous networks (HetNets) in both downlink and uplink in the interference-limited regime. Users are associated with base-stations (BSs) based on the biased downlink received power. Spectrum is either shared or orthogonally partitioned among the tiers. This paper models the placement of BSs in different tiers as spatial point processes and adopts stochastic geometry to derive the theoretical mean proportionally fair utility of the network based on the coverage rate. By formulating and solving the network utility maximization problem, the optimal user association bias factors and spectrum partition ratios are analytically obtained for the multi-tier network. The resulting analysis reveals that the downlink and uplink user associations do not have to be symmetric. For uplink under spectrum sharing, if all tiers have the same target signal-to-interference ratio (SIR), distance-based user association is shown to be optimal under a variety of path loss and power control settings. For both downlink and uplink, under orthogonal spectrum partition, it is shown that the optimal proportion of spectrum allocated to each tier should match the proportion of users associated with that tier. Simulations validate the analytical results. Under typical system parameters, simulation results suggest that spectrum partition performs better for downlink in terms of utility, while spectrum sharing performs better for uplink with power control.

preprint2014arXiv

Sparse Beamforming and User-Centric Clustering for Downlink Cloud Radio Access Network

This paper considers a downlink cloud radio access network (C-RAN) in which all the base-stations (BSs) are connected to a central computing cloud via digital backhaul links with finite capacities. Each user is associated with a user-centric cluster of BSs; the central processor shares the user's data with the BSs in the cluster, which then cooperatively serve the user through joint beamforming. Under this setup, this paper investigates the user scheduling, BS clustering and beamforming design problem from a network utility maximization perspective. Differing from previous works, this paper explicitly considers the per-BS backhaul capacity constraints. We formulate the network utility maximization problem for the downlink C-RAN under two different models depending on whether the BS clustering for each user is dynamic or static over different user scheduling time slots. In the former case, the user-centric BS cluster is dynamically optimized for each scheduled user along with the beamforming vector in each time-frequency slot, while in the latter case the user-centric BS cluster is fixed for each user and we jointly optimize the user scheduling and the beamforming vector to account for the backhaul constraints. In both cases, the nonconvex per-BS backhaul constraints are approximated using the reweighted l1-norm technique. This approximation allows us to reformulate the per-BS backhaul constraints into weighted per-BS power constraints and solve the weighted sum rate maximization problem through a generalized weighted minimum mean square error approach. This paper shows that the proposed dynamic clustering algorithm can achieve significant performance gain over existing naive clustering schemes. This paper also proposes two heuristic static clustering schemes that can already achieve a substantial portion of the gain.

preprint2014arXiv

Visualizing and Comparing Convolutional Neural Networks

Convolutional Neural Networks (CNNs) have achieved comparable error rates to well-trained human on ILSVRC2014 image classification task. To achieve better performance, the complexity of CNNs is continually increasing with deeper and bigger architectures. Though CNNs achieved promising external classification behavior, understanding of their internal work mechanism is still limited. In this work, we attempt to understand the internal work mechanism of CNNs by probing the internal representations in two comprehensive aspects, i.e., visualizing patches in the representation spaces constructed by different layers, and visualizing visual information kept in each layer. We further compare CNNs with different depths and show the advantages brought by deeper architecture.

preprint2013arXiv

Ion heating dynamics in solid buried layer targets irradiated by ultra-short intense laser pulses

We investigate bulk ion heating in solid buried layer targets irradiated by ultra-short laser pulses of relativistic intensities using particle-in-cell simulations. Our study focuses on a CD2-Al-CD2 sandwich target geometry. We find enhanced deuteron ion heating in a layer compressed by the expanding aluminium layer. A pressure gradient created at the Al-CD2 interface pushes this layer of deuteron ions towards the outer regions of the target. During its passage through the target, deuteron ions are constantly injected into this layer. Our simulations suggest that the directed collective outward motion of the layer is converted into thermal motion inside the layer, leading to deuteron temperatures higher than those found in the rest of the target. This enhanced heating can already be observed at laser pulse durations as low as 100 femtoseconds. Thus, detailed experimental surveys at repetition rates of several ten laser shots per minute are in reach at current high-power laser systems, which would allow for probing and optimizing the heating dynamics.

preprint2013arXiv

Learning High-level Image Representation for Image Retrieval via Multi-Task DNN using Clickthrough Data

Image retrieval refers to finding relevant images from an image database for a query, which is considered difficult for the gap between low-level representation of images and high-level representation of queries. Recently further developed Deep Neural Network sheds light on automatically learning high-level image representation from raw pixels. In this paper, we proposed a multi-task DNN learned for image retrieval, which contains two parts, i.e., query-sharing layers for image representation computation and query-specific layers for relevance estimation. The weights of multi-task DNN are learned on clickthrough data by Ring Training. Experimental results on both simulated and real dataset show the effectiveness of the proposed method.

preprint2013arXiv

The state-of-the-art in web-scale semantic information processing for cloud computing

Based on integrated infrastructure of resource sharing and computing in distributed environment, cloud computing involves the provision of dynamically scalable and provides virtualized resources as services over the Internet. These applications also bring a large scale heterogeneous and distributed information which pose a great challenge in terms of the semantic ambiguity. It is critical for application services in cloud computing environment to provide users intelligent service and precise information. Semantic information processing can help users deal with semantic ambiguity and information overload efficiently through appropriate semantic models and semantic information processing technology. The semantic information processing have been successfully employed in many fields such as the knowledge representation, natural language understanding, intelligent web search, etc. The purpose of this report is to give an overview of existing technologies for semantic information processing in cloud computing environment, to propose a research direction for addressing distributed semantic reasoning and parallel semantic computing by exploiting semantic information newly available in cloud computing environment.

preprint2013arXiv

Uplink Multicell Processing with Limited Backhaul via Per-Base-Station Successive Interference Cancellation

This paper studies an uplink multicell joint processing model in which the base-stations are connected to a centralized processing server via rate-limited digital backhaul links. Unlike previous studies where the centralized processor jointly decodes all the source messages from all base-stations, this paper proposes a suboptimal achievability scheme in which the Wyner-Ziv compress-and-forward relaying technique is employed on a per-base-station basis, but successive interference cancellation (SIC) is used at the central processor to mitigate multicell interference. This results in an achievable rate region that is easily computable, in contrast to the joint processing schemes in which the rate regions can only be characterized by exponential number of rate constraints. Under the per-base-station SIC framework, this paper further studies the impact of the limited-capacity backhaul links on the achievable rates and establishes that in order to achieve to within constant number of bits to the maximal SIC rate with infinite-capacity backhaul, the backhaul capacity must scale logarithmically with the signal-to-interference-and-noise ratio (SINR) at each base-station. Finally, this paper studies the optimal backhaul rate allocation problem for an uplink multicell joint processing model with a total backhaul capacity constraint. The analysis reveals that the optimal strategy that maximizes the overall sum rate should also scale with the log of the SINR at each base-station.

preprint2012arXiv

Data Structure Lower Bounds on Random Access to Grammar-Compressed Strings

In this paper we investigate the problem of building a static data structure that represents a string s using space close to its compressed size, and allows fast access to individual characters of s. This type of structures was investigated by the recent paper of Bille et al. Let n be the size of a context-free grammar that derives a unique string s of length L. (Note that L might be exponential in n.) Bille et al. showed a data structure that uses space O(n) and allows to query for the i-th character of s using running time O(log L). Their data structure works on a word RAM with a word size of logL bits. Here we prove that for such data structures, if the space is poly(n), then the query time must be at least (log L)^{1-ε}/log S where S is the space used, for any constant eps>0. As a function of n, our lower bound is Ω(n^{1/2-ε}). Our proof holds in the cell-probe model with a word size of log L bits, so in particular it holds in the word RAM model. We show that no lower bound significantly better than n^{1/2-ε} can be achieved in the cell-probe model, since there is a data structure in the cell-probe model that uses O(n) space and achieves O(\sqrt{n log n}) query time. The "bad" setting of parameters occurs roughly when L=2^{\sqrt{n}}. We also prove a lower bound for the case of not-as-compressible strings, where, say, L=n^{1+ε}. For this case, we prove that if the space is n polylog(n), then the query time must be at least Ω(log n/loglog n). The proof works by reduction to communication complexity, namely to the LSD problem, recently employed by Patrascu and others. We prove lower bounds also for the case of LZ-compression and Burrows-Wheeler (BWT) compression. All of our lower bounds hold even when the strings are over an alphabet of size 2 and hold even for randomized data structures with 2-sided error.

preprint2012arXiv

Electrostatic field acceleration of laser-driven ion bunch by using double layer thin foils

Monoenergetic ion bunch generation and acceleration from double layer thin foil target irradiated by intense linearly polarized (LP) laser pulse is investigated using two-dimensional (2D) particle-in-cell (PIC) simulations. The low-Z ions in the front layer of the target are accelerated by the laser-driven hot electrons and penetrate through the high-Z ion layer to generate a quasi-monoenergetic ion bunch, and this bunch will continue to be accelerated by the quasi-stable electrostatic sheath field which is formed by the immobile high-Z ions and the hot electrons. This mechanism offers possibility to generate monoenergetic ion bunch without ultrahigh-contrast and ultrahigh gradient laser pulses in beam generation experiments, which is confirmed by our simulations.

preprint2012arXiv

Gaussian Z-Interference Channel with a Relay Link: Achievability Region and Asymptotic Sum Capacity

This paper studies a Gaussian Z-interference channel with a rate-limited digital relay link from one receiver to another. Achievable rate regions are derived based on a combination of Han-Kobayashi common-private information splitting technique and several different relay strategies including compress-and-forward and a partial decode-and-forward strategy, in which the interference is partially decoded then binned and forwarded through the digital link for subtraction at the other end. For the Gaussian Z-interference channel with a digital link from the interference-free receiver to the interfered receiver, the capacity region is established in the strong interference regime; an achievable rate region is established in the weak interference regime. In the weak interference regime, the partial decode-and-forward strategy is shown to be asymptotically sum-capacity achieving in the high signal-to-noise ratio and high interference-to-noise ratio limit. In this case, each relay bit asymptotically improves the sum capacity by one bit. For the Gaussian Z-interference channel with a digital link from the interfered receiver to the interference-free receiver, the capacity region is established in the strong interference regime; achievable rate regions are established in the moderately strong and weak interference regimes. In addition, the asymptotically sum capacity is established in the limit of large relay link rate. In this case, the sum capacity improvement due to the digital link is bounded by half a bit when the interference link is weaker than certain threshold, but the sum capacity improvement becomes unbounded as the interference link becomes stronger.

preprint2012arXiv

Incremental Relaying for the Gaussian Interference Channel with a Degraded Broadcasting Relay

This paper studies incremental relay strategies for a two-user Gaussian relay-interference channel with an in-band-reception and out-of-band-transmission relay, where the link between the relay and the two receivers is modelled as a degraded broadcast channel. It is shown that generalized hash-and-forward (GHF) can achieve the capacity region of this channel to within a constant number of bits in a certain weak relay regime, where the transmitter-to-relay link gains are not unboundedly stronger than the interference links between the transmitters and the receivers. The GHF relaying strategy is ideally suited for the broadcasting relay because it can be implemented in an incremental fashion, i.e., the relay message to one receiver is a degraded version of the message to the other receiver. A generalized-degree-of-freedom (GDoF) analysis in the high signal-to-noise ratio (SNR) regime reveals that in the symmetric channel setting, each common relay bit can improve the sum rate roughly by either one bit or two bits asymptotically depending on the operating regime, and the rate gain can be interpreted as coming solely from the improvement of the common message rates, or alternatively in the very weak interference regime as solely coming from the rate improvement of the private messages. Further, this paper studies an asymmetric case in which the relay has only a single single link to one of the destinations. It is shown that with only one relay-destination link, the approximate capacity region can be established for a larger regime of channel parameters. Further, from a GDoF point of view, the sum-capacity gain due to the relay can now be thought as coming from either signal relaying only, or interference forwarding only.

preprint2012arXiv

On the Capacity of the $K$-User Cyclic Gaussian Interference Channel

This paper studies the capacity region of a $K$-user cyclic Gaussian interference channel, where the $k$th user interferes with only the $(k-1)$th user (mod $K$) in the network. Inspired by the work of Etkin, Tse and Wang, who derived a capacity region outer bound for the two-user Gaussian interference channel and proved that a simple Han-Kobayashi power splitting scheme can achieve to within one bit of the capacity region for all values of channel parameters, this paper shows that a similar strategy also achieves the capacity region of the $K$-user cyclic interference channel to within a constant gap in the weak interference regime. Specifically, for the $K$-user cyclic Gaussian interference channel, a compact representation of the Han-Kobayashi achievable rate region using Fourier-Motzkin elimination is first derived, a capacity region outer bound is then established. It is shown that the Etkin-Tse-Wang power splitting strategy gives a constant gap of at most 2 bits in the weak interference regime. For the special 3-user case, this gap can be sharpened to 1 1/2 bits by time-sharing of several different strategies. The capacity result of the $K$-user cyclic Gaussian interference channel in the strong interference regime is also given. Further, based on the capacity results, this paper studies the generalized degrees of freedom (GDoF) of the symmetric cyclic interference channel. It is shown that the GDoF of the symmetric capacity is the same as that of the classic two-user interference channel, no matter how many users are in the network.

preprint2012arXiv

On the Capacity of the K-User Cyclic Gaussian Interference Channel

This paper studies the capacity region of a $K$-user cyclic Gaussian interference channel, where the $k$th user interferes with only the $(k-1)$th user (mod $K$) in the network. Inspired by the work of Etkin, Tse and Wang, which derived a capacity region outer bound for the two-user Gaussian interference channel and proved that a simple Han-Kobayashi power splitting scheme can achieve to within one bit of the capacity region for all values of channel parameters, this paper shows that a similar strategy also achieves the capacity region for the $K$-user cyclic interference channel to within a constant gap in the weak interference regime. Specifically, a compact representation of the Han-Kobayashi achievable rate region using Fourier-Motzkin elimination is first derived, a capacity region outer bound is then established. It is shown that the Etkin-Tse-Wang power splitting strategy gives a constant gap of at most two bits (or one bit per dimension) in the weak interference regime. Finally, the capacity result of the $K$-user cyclic Gaussian interference channel in the strong interference regime is also given.

preprint2012arXiv

Two Birds and One Stone: Gaussian Interference Channel with a Shared Out-of-Band Relay of Limited Rate

The two-user Gaussian interference channel with a shared out-of-band relay is considered. The relay observes a linear combination of the source signals and broadcasts a common message to the two destinations, through a perfect link of fixed limited rate $R_0$ bits per channel use. The out-of-band nature of the relay is reflected by the fact that the common relay message does not interfere with the received signal at the two destinations. A general achievable rate is established, along with upper bounds on the capacity region for the Gaussian case. For $R_0$ values below a certain threshold, which depends on channel parameters, the capacity region of this channel is determined in this paper to within a constant gap of $Δ=1.95$ bits. We identify interference regimes where a two-for-one gain in achievable rates is possible for every bit relayed, up to a constant approximation error. Instrumental to these results is a carefully-designed quantize-and-forward type of relay strategy along with a joint decoding scheme employed at destination ends. Further, we also study successive decoding strategies with optimal decoding order (corresponding to the order at which common, private, and relay messages are decoded), and show that successive decoding also achieves two-for-one gains asymptotically in regimes where a two-for-one gain is achievable by joint decoding; yet, successive decoding produces unbounded loss asymptotically when compared to joint decoding, in general.

preprint2011arXiv

Bit Allocation Law for Multi-Antenna Channel Feedback Quantization: Single-User Case

This paper studies the design and optimization of a limited feedback single-user system with multiple-antenna transmitter and single-antenna receiver. The design problem is cast in form of the minimizing the average transmission power at the base station subject to the user's outage probability constraint. The optimization is over the user's channel quantization codebook and the transmission power control function at the base station. Our approach is based on fixing the outage scenarios in advance and transforming the design problem into a robust system design problem. We start by showing that uniformly quantizing the channel magnitude in dB scale is asymptotically optimal, regardless of the magnitude distribution function. We derive the optimal uniform (in dB) channel magnitude codebook and combine it with a spatially uniform channel direction codebook to arrive at a product channel quantization codebook. We then optimize such a product structure in the asymptotic regime of $B\rightarrow \infty$, where $B$ is the total number of quantization feedback bits. The paper shows that for channels in the real space, the asymptotically optimal number of direction quantization bits should be ${(M{-}1)}/{2}$ times the number of magnitude quantization bits, where $M$ is the number of base station antennas. We also show that the performance of the designed system approaches the performance of the perfect channel state information system as $2^{-\frac{2B}{M+1}}$. For complex channels, the number of magnitude and direction quantization bits are related by a factor of $(M{-}1)$ and the system performance scales as $2^{-\frac{B}{M}}$ as $B\rightarrow\infty$.

preprint2011arXiv

Bit Allocation Laws for Multi-Antenna Channel Feedback Quantization: Multi-User Case

This paper addresses the optimal design of limited-feedback downlink multi-user spatial multiplexing systems. A multiple-antenna base-station is assumed to serve multiple single-antenna users, who quantize and feed back their channel state information (CSI) through a shared rate-limited feedback channel. The optimization problem is cast in the form of minimizing the average transmission power at the base-station subject to users' target signal-to-interference-plus-noise ratios (SINR) and outage probability constraints. The goal is to derive the feedback bit allocations among the users and the corresponding channel magnitude and direction quantization codebooks in a high-resolution quantization regime. Toward this end, this paper develops an optimization framework using approximate analytical closed-form solutions, the accuracy of which is then verified by numerical results. The results show that, for channels in the real space, the number of channel direction quantization bits should be $(M-1)$ times the number of channel magnitude quantization bits, where $M$ is the number of base-station antennas. Moreover, users with higher requested quality-of-service (QoS), i.e. lower target outage probabilities, and higher requested downlink rates, i.e. higher target SINR's, should use larger shares of the feedback rate. It is also shown that, for the target QoS parameters to be feasible, the total feedback bandwidth should scale logarithmically with the geometric mean of the target SINR values and the geometric mean of the inverse target outage probabilities. In particular, the minimum required feedback rate is shown to increase if the users' target parameters deviate from the corresponding geometric means. Finally, the paper shows that, as the total number of feedback bits $B$ increases, the performance of the limited-feedback system approaches the perfect-CSI system as ${2^{-{B}/{M^2}}}$.

preprint2011arXiv

Capacity of the Gaussian Relay Channel with Correlated Noises to Within a Constant Gap

This paper studies the relaying strategies and the approximate capacity of the classic three-node Gaussian relay channel, but where the noises at the relay and at the destination are correlated. It is shown that the capacity of such a relay channel can be achieved to within a constant gap of $\hf \log_2 3 =0.7925$ bits using a modified version of the noisy network coding strategy, where the quantization level at the relay is set in a correlation dependent way. As a corollary, this result establishes that the conventional compress-and-forward scheme also achieves to within a constant gap to the capacity. In contrast, the decode-and-forward and the single-tap amplify-and-forward relaying strategies can have an infinite gap to capacity in the regime where the noises at the relay and at the destination are highly correlated, and the gain of the relay-to-destination link goes to infinity.

preprint2011arXiv

On Noisy Network Coding for a Gaussian Relay Chain Network with Correlated Noises

Noisy network coding, which elegantly combines the conventional compress-and-forward relaying strategy and ideas from network coding, has recently drawn much attention for its simplicity and optimality in achieving to within constant gap of the capacity of the multisource multicast Gaussian network. The constant-gap result, however, applies only to Gaussian relay networks with independent noises. This paper investigates the application of noisy network coding to networks with correlated noises. By focusing on a four-node Gaussian relay chain network with a particular noise correlation structure, it is shown that noisy network coding can no longer achieve to within constant gap to capacity with the choice of Gaussian inputs and Gaussian quantization. The cut-set bound of the relay chain network in this particular case, however, can be achieved to within half a bit by a simple concatenation of a correlation-aware noisy network coding strategy and a decode-and-forward scheme.

preprint2010arXiv

Enhanced surface acceleration of fast electrons by using sub-wavelength grating targets

Surface acceleration of fast electrons in intense laser-plasma interaction is improved by using sub-wavelength grating targets. The fast electron beam emitted along the target surface was enhanced by more than three times relative to that by using planar target. The total number of the fast electrons ejected from the front side of target was also increased by about one time. The method to enhance the surface acceleration of fast electron is effective for various targets with sub-wavelength structured surface, and can be applied widely in the cone-guided fast ignition, energetic ion acceleration, plasma device, and other high energy density physics experiments.

preprint2007arXiv

Capacity of a Class of Modulo-Sum Relay Channels

This paper characterizes the capacity of a class of modulo additive noise relay channels, in which the relay observes a corrupted version of the noise and has a separate channel to the destination. The capacity is shown to be strictly below the cut-set bound in general and achievable using a quantize-and-forward strategy at the relay. This result confirms a conjecture by Ahlswede and Han about the capacity of channels with rate limited state information at the destination for this particular class of channels.

preprint2007arXiv

Grassmannian Beamforming for MIMO Amplify-and-Forward Relaying

In this paper, we derive the optimal transmitter/ receiver beamforming vectors and relay weighting matrix for the multiple-input multiple-output amplify-and-forward relay channel. The analysis is accomplished in two steps. In the first step, the direct link between the transmitter (Tx) and receiver (Rx) is ignored and we show that the transmitter and the relay should map their signals to the strongest right singular vectors of the Tx-relay and relay-Rx channels. Based on the distributions of these vectors for independent identically distributed (i.i.d.) Rayleigh channels, the Grassmannian codebooks are used for quantizing and sending back the channel information to the transmitter and the relay. The simulation results show that even a few number of bits can considerably increase the link reliability in terms of bit error rate. For the second step, the direct link is considered in the problem model and we derive the optimization problem that identifies the optimal Tx beamforming vector. For the i.i.d Rayleigh channels, we show that the solution to this problem is uniformly distributed on the unit sphere and we justify the appropriateness of the Grassmannian codebook (for determining the optimal beamforming vector), both analytically and by simulation. Finally, a modified quantizing scheme is presented which introduces a negligible degradation in the system performance but significantly reduces the required number of feedback bits.

preprint2006arXiv

Bilayer Low-Density Parity-Check Codes for Decode-and-Forward in Relay Channels

This paper describes an efficient implementation of binning for the relay channel using low-density parity-check (LDPC) codes. We devise bilayer LDPC codes to approach the theoretically promised rate of the decode-and-forward relaying strategy by incorporating relay-generated information bits in specially designed bilayer graphical code structures. While conventional LDPC codes are sensitively tuned to operate efficiently at a certain channel parameter, the proposed bilayer LDPC codes are capable of working at two different channel parameters and two different rates: that at the relay and at the destination. To analyze the performance of bilayer LDPC codes, bilayer density evolution is devised as an extension of the standard density evolution algorithm. Based on bilayer density evolution, a design methodology is developed for the bilayer codes in which the degree distribution is iteratively improved using linear programming. Further, in order to approach the theoretical decode-and-forward rate for a wide range of channel parameters, this paper proposes two different forms bilayer codes, the bilayer-expurgated and bilayer-lengthened codes. It is demonstrated that a properly designed bilayer LDPC code can achieve an asymptotic infinite-length threshold within 0.24 dB gap to the Shannon limits of two different channels simultaneously for a wide range of channel parameters. By practical code construction, finite-length bilayer codes are shown to be able to approach within a 0.6 dB gap to the theoretical decode-and-forward rate of the relay channel at a block length of $10^5$ and a bit-error probability (BER) of $10^{-4}$. Finally, it is demonstrated that a generalized version of the proposed bilayer code construction is applicable to relay networks with multiple relays.

Wei Yu

What is connected

Connect this record

See the researcher in context

Building this map preview

70 published item(s)

EmambaIR: Efficient Visual State Space Model for Event-guided Image Reconstruction

Rationale-Grounded In-Context Learning for Time Series Reasoning with Multimodal Large Language Models

Active Sensing for Communications by Learning

Deep Learning for Channel Sensing and Hybrid Precoding in TDD Massive MIMO OFDM Systems

Energy Efficient HARQ for Ultrareliability via Novel Outage Probability Bound and Geometric Programming

Interference Nulling Using Reconfigurable Intelligent Surface

Joint Design of Hybrid Beamforming and Reflection Coefficients in RIS-aided mmWave MIMO Systems

Learning Based User Scheduling in Reconfigurable Intelligent Surface Assisted Multiuser Downlink

Learning Progressive Distributed Compression Strategies from Local Channel State Information

Modular Action Concept Grounding in Semantic Video Prediction

Orbital hybridization and electrostatic interaction in a double molecule transistor

Quasi-periodic oscillations of the X-ray burst from the magnetar SGR J1935+2154 and associated with the fast radio burst FRB 200428

Scheduling Versus Contention for Massive Random Access in Massive MIMO Systems

The accretion flow geometry of MAXI J1820+070 through broadband noise research with Insight-HXMT

The evolution of the corona in MAXI J1535-571 through type-C quasi-periodic oscillations with Insight-HXMT

Deep Learning for Distributed Channel Feedback and Multiuser Precoding in FDD Massive MIMO

Room temperature ferromagnetism of monolayer chromium telluride with perpendicular magnetic anisotropy

Spatial Deep Learning for Wireless Scheduling

Adaptive Semantic-Visual Tree for Hierarchical Embeddings

DPCrowd: Privacy-preserving and Communication-efficient Decentralized Statistical Estimation for Real-time Crowd-sourced Data

Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference

Enhanced Channel Estimation in Massive MIMO via Coordinated Pilot Design

Information Relaxation and A Duality-Driven Algorithm for Stochastic Dynamic Programs

Joint User Identification, Channel Estimation, and Signal Detection for Grant-Free NOMA

Massive Access for 5G and Beyond

Multi-Agent Reinforcement Learning for Adaptive User Association in Dynamic mmWave Networks

Optimal Virtual Network Function Deployment for 5G Network Slicing in a Hybrid Cloud Infrastructure

Optimizing Downlink Resource Allocation in Multiuser MIMO Networks via Fractional Programming and the Hungarian Algorithm

Products-10K: A Large-scale Product Recognition Dataset

Semi-Submersible Wind Turbine Hull Shape Design for a Favorable System Response Behavior

Signal-Dependent Performance Analysis of Orthogonal Matching Pursuit for Exact Sparse Recovery

Tunable Anisotropic Thermal Transport in Super-Aligned Carbon Nanotube Films

A Stochastic Analysis of Network MIMO Systems

An uplink-downlink duality for cloud radio access network

Content-Centric Sparse Multicast Beamforming for Cache-Enabled Cloud RAN

Energy Efficiency of Downlink Transmission Strategies for Cloud Radio Access Networks

Generation of intense circularly polarized attosecond light bursts from relativistic laser plasmas

Hybrid Digital and Analog Beamforming Design for Large-Scale Antenna Arrays

On Optimal Fronthaul Compression and Decoding Strategies for Uplink Cloud Radio Access Networks

Role of Interference Alignment in Wireless Cellular Network Optimization

Cloud Radio Access Network: Virtualizing Wireless Access for Dense Heterogeneous Systems

Content-Centric Multicast Beamforming in Cache-Enabled Cloud Radio Access Networks

Inverse regression for longitudinal data

On learning optimized reaction diffusion processes for effective image restoration

The Improved Job Scheduling Algorithm of Hadoop Platform

Distributed Pricing-Based User Association for Downlink Heterogeneous Cellular Networks

Large-Scale MIMO versus Network MIMO for Multicell Interference Mitigation

Optimized Backhaul Compression for Uplink Cloud Radio Access Network

Optimizing User Association and Spectrum Allocation in HetNets: A Utility Perspective

Sparse Beamforming and User-Centric Clustering for Downlink Cloud Radio Access Network

Visualizing and Comparing Convolutional Neural Networks

Ion heating dynamics in solid buried layer targets irradiated by ultra-short intense laser pulses

Learning High-level Image Representation for Image Retrieval via Multi-Task DNN using Clickthrough Data

The state-of-the-art in web-scale semantic information processing for cloud computing

Uplink Multicell Processing with Limited Backhaul via Per-Base-Station Successive Interference Cancellation

Data Structure Lower Bounds on Random Access to Grammar-Compressed Strings

Electrostatic field acceleration of laser-driven ion bunch by using double layer thin foils

Gaussian Z-Interference Channel with a Relay Link: Achievability Region and Asymptotic Sum Capacity

Incremental Relaying for the Gaussian Interference Channel with a Degraded Broadcasting Relay

On the Capacity of the $K$-User Cyclic Gaussian Interference Channel

On the Capacity of the K-User Cyclic Gaussian Interference Channel

Two Birds and One Stone: Gaussian Interference Channel with a Shared Out-of-Band Relay of Limited Rate

Bit Allocation Law for Multi-Antenna Channel Feedback Quantization: Single-User Case

Bit Allocation Laws for Multi-Antenna Channel Feedback Quantization: Multi-User Case

Capacity of the Gaussian Relay Channel with Correlated Noises to Within a Constant Gap

On Noisy Network Coding for a Gaussian Relay Chain Network with Correlated Noises

Enhanced surface acceleration of fast electrons by using sub-wavelength grating targets

Capacity of a Class of Modulo-Sum Relay Channels

Grassmannian Beamforming for MIMO Amplify-and-Forward Relaying

Bilayer Low-Density Parity-Check Codes for Decode-and-Forward in Relay Channels