Source author record

Wei Xu

Wei Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Catalog footprint

What is connected

134 works

45 topics

4 close collaborators

preprint · 2026 · arXiv

RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking

Offline-to-online reinforcement learning (RL) improves sample efficiency by leveraging pre-collected datasets prior to online interaction. A key challenge, however, is learning an accurate critic in large state-action spaces with limited dataset coverage. To mitigate harmful updates from value overestimation, prior methods impose pessimism by down-weighting out-of-distribution (OOD) actions relative to dataset actions. While effective, this essentially acts as a behavior cloning anchor and can hinder downstream online policy improvement when dataset actions are suboptimal. We propose RankQ, an offline-to-online Q-learning objective that augments temporal-difference learning with a self-supervised multi-term ranking loss to enforce structured action ordering. By learning relative action preferences rather than uniformly penalizing unseen actions, RankQ shapes the Q-function such that action gradients are directed toward higher-quality behaviors. Across sparse reward D4RL benchmarks, RankQ achieves performance competitive with or superior to seven prior methods. In vision-based robot learning, RankQ enables effective offline-to-online fine-tuning of a pretrained vision-language-action (VLA) model in a low-data regime, achieving on average a 42.7% higher simulation success rate than the next best method. In a high-data setting, RankQ improves simulation performance by 13.7% over the next best method and achieves strong sim-to-real transfer, increasing real-world cube stacking success from 43.1% to 84.7% relative to the VLA's initial performance.

preprint · 2024 · arXiv

New research paradigms and agenda of human factors science in the intelligence era

This paper proposes the innovative concept of "human factors science" to characterize engineering psychology, human factors engineering, human-computer interaction, and other similar fields. Although the perspectives in these fields differ, they share a common approach: "human-centered design." In the AI era, the human-machine relationship presents a trans-era evolution to "human-AI teaming." The change has raised challenges for human factors science, compelling us to re-examine current research paradigms and agendas. Based on our previous work, this paper proposes three research paradigms: (1) human-AI joint cognitive systems: this regards an intelligent agent as a cognitive agent with a certain level of cognitive capabilities. A human-AI system can be characterized as a joint cognitive system in which humans and intelligent agents work as teammates for collaboration; (2) human-AI joint cognitive ecosystems: an intelligent ecosystem with multiple human-AI systems can be represented as a human-AI joint cognitive ecosystem. The overall performance of the ecosystem depends on optimal collaboration and design across the multiple human-AI systems; (3) intelligent sociotechnical systems (iSTS): human-AI systems are designed, developed, and deployed in an iSTS environment. The successful design, development, and deployment of a human-AI system within an iSTS environment depends on the synergistic optimization between the subsystems. This paper looks forward to the future research agenda of human factors science from three aspects: human-AI interaction, intelligent human-machine interface, and human-AI teaming. Analyses show that the three new research paradigms will benefit future research in human factors science. We believe the proposed research paradigms and the future research agenda will mutually promote each other, further advancing human factors science in the AI era.

preprint · 2024 · arXiv

Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance

Existing text-to-image editing methods tend to excel either in rigid or non-rigid editing but encounter challenges when combining both, resulting in misaligned outputs with the provided text prompts. In addition, integrating reference images for control remains challenging. To address these issues, we present a versatile image editing framework capable of executing both rigid and non-rigid edits, guided by either textual prompts or reference images. We leverage a dual-path injection scheme to handle diverse editing scenarios and introduce an integrated self-attention mechanism for fusion of appearance and structural information. To mitigate potential visual artifacts, we further employ latent fusion techniques to adjust intermediate latents. Compared to previous work, our approach represents a significant advance in achieving precise and versatile image editing. Comprehensive experiments validate the efficacy of our method, showcasing competitive or superior results in text-based editing and appearance transfer tasks, encompassing both rigid and non-rigid settings.

preprint · 2023 · arXiv

Secure Communication for Spatially Correlated Massive MIMO with Low-Resolution DACs

In this paper, the performance of a secure massive multiple-input multiple-output (MIMO) system adopting low-resolution digital-to-analog converters (DACs) is analyzed over spatially correlated wireless channels. A tight lower bound for the achievable secrecy rate is derived with artificial noise (AN) transmitted in the null space of the user channels. Using the analytical results, the impact of spatial correlation on the secrecy rate is explicitly evaluated in the presence of low-resolution DACs. The analytical observations reveal that using low-resolution DACs can be beneficial to the secrecy performance compared with ideal DACs, when the channels are strongly correlated and optimal power allocation is not employed.

preprint · 2023 · arXiv

Secure Communication for Spatially Correlated RIS-Aided Multiuser Massive MIMO Systems: Analysis and Optimization

This letter investigates the secure communication in a reconfigurable intelligent surface (RIS)-aided multiuser massive multiple-input multiple-output (MIMO) system exploiting artificial noise (AN). We first derive a closed-form expression of the ergodic secrecy rate under spatially correlated MIMO channels. By using this derived result, we further optimize the power fraction of AN in closed form and the RIS phase shifts by developing a gradient-based algorithm, which requires only statistical channel state information (CSI). Our analysis shows that spatial correlation at the RIS provides an additional dimension for optimizing the RIS phase shifts. Numerical simulations validate the analytical results which show the insightful interplay among the system parameters and the degradation of secrecy performance due to high spatial correlation at the RIS.

preprint · 2022 · arXiv

A Deep Finite Difference Emulator for the Fast Simulation of Coupled Viscous Burgers' Equation

This work proposes a deep learning-based emulator for the efficient computation of the coupled viscous Burgers' equation with random initial conditions. In a departure from traditional data-driven deep learning approaches, the proposed emulator does not require a classical numerical solver to collect training data. Instead, it makes direct use of the problem's physics. Specifically, the model emulates a second-order finite difference solver, i.e., the Crank-Nicolson scheme in learning dynamics. A systematic case study is conducted to examine the model's prediction performance, generalization ability, and computational efficiency. The computed results are graphically represented and compared to those of state-of-the-art numerical solvers.
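
For readers unfamiliar with the scheme the emulator is said to mimic, the following is a minimal sketch of a classical Crank-Nicolson time stepper. It is not the paper's emulator and does not handle the coupled, nonlinear Burgers' system: as a simplifying assumption it solves only the linear 1D heat equation u_t = nu * u_xx with zero Dirichlet boundaries. The function name `crank_nicolson_heat` and all parameter names are hypothetical.

```python
import numpy as np

def crank_nicolson_heat(u0, nu, dx, dt, steps):
    """Advance u_t = nu * u_xx with zero Dirichlet boundaries (illustrative only)."""
    u = np.asarray(u0, dtype=float).copy()
    n = u.size - 2                      # number of interior grid points
    r = nu * dt / (2.0 * dx ** 2)
    # Second-difference operator acting on the interior points.
    lap = -2.0 * np.eye(n) + np.eye(n, k=1) + np.eye(n, k=-1)
    lhs = np.eye(n) - r * lap           # implicit half of the average
    rhs = np.eye(n) + r * lap           # explicit half of the average
    for _ in range(steps):
        u[1:-1] = np.linalg.solve(lhs, rhs @ u[1:-1])
    return u

# Usage: decay of a single sine mode, compared with the analytic solution.
x = np.linspace(0.0, 1.0, 51)
u_final = crank_nicolson_heat(np.sin(np.pi * x), nu=0.1, dx=x[1] - x[0], dt=0.01, steps=100)
exact = np.exp(-0.1 * np.pi ** 2 * 1.0) * np.sin(np.pi * x)
```

The scheme averages the spatial operator between the old and new time levels, which is what makes it second-order accurate in time. A dense solve keeps the sketch short; a practical solver would exploit the tridiagonal structure.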

preprint · 2022 · arXiv

An End-to-End Transformer Model for Crowd Localization

Crowd localization, predicting head positions, is a more practical and high-level task than simply counting. Existing methods employ pseudo-bounding boxes or pre-designed localization maps, relying on complex post-processing to obtain the head positions. In this paper, we propose an elegant, end-to-end Crowd Localization Transformer named CLTR that solves the task in the regression-based paradigm. The proposed method views the crowd localization as a direct set prediction problem, taking extracted features and trainable embeddings as input of the transformer-decoder. To reduce the ambiguous points and generate more reasonable matching results, we introduce a KMO-based Hungarian matcher, which adopts the nearby context as the auxiliary matching cost. Extensive experiments conducted on five datasets in various data settings show the effectiveness of our method. In particular, the proposed method achieves the best localization performance on the NWPU-Crowd, UCF-QNRF, and ShanghaiTech Part A datasets.

preprint · 2022 · arXiv

Cooperative Reflection and Synchronization Design for Distributed Multiple-RIS Communications

To reap the promised gain achieved by distributed reconfigurable intelligent surfaces (RISs)-enhanced communications in a wireless network, timing synchronization among these metasurfaces is an essential prerequisite in practice. This paper proposes a unified framework for the joint estimation of the unknown timing offsets and the RIS channel parameters, as well as the design of cooperative reflection and synchronization algorithm for the distributed multiple-RIS communication. Considering that RIS is usually a passive device with limited capability of signal processing, the individual timing offset and channel gains of each hop of the RIS links cannot be directly estimated. To make the estimation tractable, we propose to estimate the cascaded channels and timing offsets jointly by deriving a maximum likelihood estimator. Furthermore, we theoretically characterize the Cramer-Rao lower bound (CRLB) to evaluate the accuracy of this estimator. By using the proposed estimator and the derived CRLBs, an efficient resynchronization algorithm is devised jointly at the RISs and the destination to compensate the multiple timing offsets. Based on the majorization-minimization framework, the proposed algorithm admits semi-closed and closed form solutions for the RIS reflection matrices and the timing offset equalizer, respectively. Simulation results verify that our theoretical analysis well matches the numerical tests and validate the effectiveness of the proposed resynchronization algorithm.

preprint · 2022 · arXiv

Data Augmentation Empowered Neural Precoding for Multiuser MIMO with MMSE Model

Precoding design exploiting deep learning methods has been widely studied for multiuser multiple-input multiple-output (MU-MIMO) systems. However, conventional neural precoding design applies black-box-based neural networks which are less interpretable. In this paper, we propose a deep learning-based precoding method based on an interpretable design of a neural precoding network, namely iPNet. In particular, the iPNet mimics the classic minimum mean-squared error (MMSE) precoding and approximates the matrix inversion in the design of the neural network architecture. Specifically, the proposed iPNet consists of a model-driven component network, responsible for augmenting the input channel state information (CSI), and a data-driven sub-network, responsible for precoding calculation from this augmented CSI. The latter data-driven module is explicitly interpreted as an unsupervised learner of the MMSE precoder. Simulation results show that by exploiting the augmented CSI, the proposed iPNet achieves noticeable performance gain over existing black-box designs and also exhibits enhanced generalizability against CSI mismatches.
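
The classic MMSE precoder that iPNet is described as mimicking can be sketched as follows. This is the textbook regularized zero-forcing form, not the iPNet architecture. The function name `mmse_precoder`, the K x M channel layout (K single-antenna users, M transmit antennas), and the regularization term K * noise_var / total_power are assumptions made for this illustration.

```python
import numpy as np

def mmse_precoder(H, noise_var, total_power):
    """Textbook MMSE (regularized zero-forcing) precoder for a K x M channel H."""
    K = H.shape[0]
    reg = K * noise_var / total_power
    # The matrix inversion below is the step the abstract says the network approximates.
    W = H.conj().T @ np.linalg.inv(H @ H.conj().T + reg * np.eye(K))
    # Rescale so that the total transmit power ||W||_F^2 equals total_power.
    W = W * np.sqrt(total_power / np.linalg.norm(W) ** 2)
    return W

# Usage with a random Rayleigh-fading channel: 3 users, 8 antennas.
rng = np.random.default_rng(0)
H = (rng.standard_normal((3, 8)) + 1j * rng.standard_normal((3, 8))) / np.sqrt(2)
W = mmse_precoder(H, noise_var=0.1, total_power=2.0)
```

The returned `W` has one column per user. As `noise_var` shrinks, the regularizer vanishes and the precoder approaches plain zero-forcing.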

preprint · 2022 · arXiv

Deep CSI Compression for Massive MIMO: A Self-information Model-driven Neural Network

In order to fully exploit the advantages of massive multiple-input multiple-output (mMIMO), it is critical for the transmitter to accurately acquire the channel state information (CSI). Deep learning (DL)-based methods have been proposed for CSI compression and feedback to the transmitter. Although most existing DL-based methods consider the CSI matrix as an image, structural features of the CSI image are rarely exploited in neural network design. As such, we propose a model of self-information that dynamically measures the amount of information contained in each patch of a CSI image from the perspective of structural features. Then, by applying the self-information model, we propose a model-and-data-driven network for CSI compression and feedback, namely IdasNet. The IdasNet includes the design of a module of self-information deletion and selection (IDAS), an encoder of informative feature compression (IFC), and a decoder of informative feature recovery (IFR). In particular, the model-driven module of IDAS pre-compresses the CSI image by removing informative redundancy in terms of the self-information. The encoder of IFC then conducts feature compression to the pre-compressed CSI image and generates a feature codeword which contains two components, i.e., codeword values and position indices of the codeword values. Subsequently, the IFR decoder decouples the codeword values as well as position indices to recover the CSI image. Experimental results verify that the proposed IdasNet noticeably outperforms existing DL-based networks under various compression ratios while it has the number of network parameters reduced by orders-of-magnitude compared with various existing methods.

preprint · 2022 · arXiv

Deployment of long distance multi-moving robots for underground pipe inspection

Blueprint of an in-pipe climbing robot that works with sharp transmissions to study complex line relationships. Standard wheeled/happening pipe climbing robots tend to slide when exploring pipe turns. Instruments help achieve a very distinct delay sequence in which the robot slides and drags as it progresses. The proposed transmission joins the farthest ground plane of the standard two-output transmission. This opens up a substantial time for 3 output transmissions. This instrument takes into account the force exerted on each track within the line relation to specifically alter the robot's track speed, unlocking the key to fine control. Deflection of the robot across pipe networks with different bearings and non-slip pipe bends demonstrate the integrity of the proposed structure.

preprint · 2022 · arXiv

Distributed Neural Precoding for Hybrid mmWave MIMO Communications with Limited Feedback

Hybrid precoding is a cost-efficient technique for millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) communications. This paper proposes a deep learning approach by using a distributed neural network for hybrid analog-and-digital precoding design with limited feedback. The proposed distributed neural precoding network, called DNet, is committed to achieving two objectives. First, the DNet realizes channel state information (CSI) compression with a distributed architecture of neural networks, which enables practical deployment on multiple users. Specifically, this neural network is composed of multiple independent sub-networks with the same structure and parameters, which reduces both the number of training parameters and network complexity. Secondly, DNet learns the calculation of hybrid precoding from reconstructed CSI from limited feedback. Different from existing black-box neural network design, the DNet is specifically designed according to the data form of the matrix calculation of hybrid precoding. Simulation results show that the proposed DNet significantly improves the performance up to nearly 50% compared to traditional limited feedback precoding methods under the tests with various CSI compression ratios.

preprint · 2022 · arXiv

Do You Need the Entropy Reward (in Practice)?

Maximum entropy (MaxEnt) RL maximizes a combination of the original task reward and an entropy reward. It is believed that the regularization imposed by entropy, on both policy improvement and policy evaluation, together contributes to good exploration, training convergence, and robustness of learned policies. This paper takes a closer look at entropy as an intrinsic reward, by conducting various ablation studies on soft actor-critic (SAC), a popular representative of MaxEnt RL. Our findings reveal that in general, entropy rewards should be applied with caution to policy evaluation. On one hand, the entropy reward, like any other intrinsic reward, could obscure the main task reward if it is not properly managed. We identify some failure cases of the entropy reward especially in episodic Markov decision processes (MDPs), where it could cause the policy to be overly optimistic or pessimistic. On the other hand, our large-scale empirical study shows that using entropy regularization alone in policy improvement, leads to comparable or even better performance and robustness than using it in both policy improvement and policy evaluation. Based on these observations, we recommend either normalizing the entropy reward to a zero mean (SACZero), or simply removing it from policy evaluation (SACLite) for better practical results.
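
The distinction the abstract draws between using entropy in policy evaluation and using it only in policy improvement can be made concrete with the critic's one-step target. The following is a minimal sketch, not the authors' code: the function name `td_target` and its parameters are hypothetical. It shows only the standard SAC-style target next to a variant that drops the entropy term, which is the idea the abstract calls SACLite. The zero-mean normalization of SACZero is not shown because the abstract does not specify it.

```python
def td_target(reward, next_q, next_log_prob, gamma=0.99, alpha=0.2,
              done=False, entropy_in_target=True):
    """One-step critic target for a soft actor-critic style learner (illustrative).

    entropy_in_target=True  -> usual soft target: entropy bonus -alpha * log pi
                               is added to the bootstrapped value.
    entropy_in_target=False -> entropy removed from policy evaluation; the policy
                               update may still use entropy regularization.
    """
    bootstrap = next_q
    if entropy_in_target:
        bootstrap = bootstrap - alpha * next_log_prob
    return reward + (0.0 if done else gamma * bootstrap)

# Usage: the same transition scored with and without the entropy reward.
soft = td_target(1.0, 10.0, -2.0, gamma=0.9, alpha=0.5)
lite = td_target(1.0, 10.0, -2.0, gamma=0.9, alpha=0.5, entropy_in_target=False)
```

Because the bonus is added at every non-terminal step, its accumulated size depends on how long an episode lasts. This is one way to read the abstract's remark that the entropy reward can distort value estimates in episodic MDPs.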

preprint · 2022 · arXiv

Efficient and Probabilistic Adaptive Voxel Mapping for Accurate Online LiDAR Odometry

This paper proposes an efficient and probabilistic adaptive voxel mapping method for LiDAR odometry. The map is a collection of voxels; each contains one plane (or edge) feature that enables the probabilistic representation of the environment and accurate registration of a new LiDAR scan. We further analyze the need for coarse-to-fine voxel mapping and then use a novel voxel map organized by a hash table and octrees to build and update the map efficiently. We apply the proposed voxel map to an iterated extended Kalman filter and construct a maximum a posteriori probability problem for pose estimation. Experiments on the open KITTI dataset show the high accuracy and efficiency of our method compared to other state-of-the-art methods. Outdoor experiments on unstructured environments with non-repetitive scanning LiDARs further verify the adaptability of our mapping method to different environments and LiDAR scanning patterns. Our code and dataset are open-sourced on GitHub.

preprint · 2022 · arXiv

Energy Efficient Beamforming Optimization for Integrated Sensing and Communication

This paper investigates the optimization of beamforming design in a system with integrated sensing and communication (ISAC), where the base station (BS) sends signals for simultaneous multiuser communication and radar sensing. We aim at maximizing the energy efficiency (EE) of the multiuser communication while guaranteeing the sensing requirement in terms of individual radar beampattern gains. The problem is a complicated nonconvex fractional program that is challenging to solve. By appropriately reformulating the problem and then applying the techniques of successive convex approximation (SCA) and semidefinite relaxation (SDR), we propose an iterative algorithm to address this problem. In theory, we prove that the introduced relaxation of the SDR is rigorously tight. Numerical results validate the effectiveness of the proposed algorithm.

preprint · 2022 · arXiv

Extracting a Knowledge Base of COVID-19 Events from Social Media

In this paper, we present a manually annotated corpus of 10,000 tweets containing public reports of five COVID-19 events, including positive and negative tests, deaths, denied access to testing, claimed cures and preventions. We designed slot-filling questions for each event type and annotated a total of 31 fine-grained slots, such as the location of events, recent travel, and close contacts. We show that our corpus can support fine-tuning BERT-based classifiers to automatically extract publicly reported events and help track the spread of a new disease. We also demonstrate that, by aggregating events extracted from millions of tweets, we achieve surprisingly high precision when answering complex queries, such as "Which organizations have employees that tested positive in Philadelphia?" We will release our corpus (with user-information removed), automatic extraction models, and the corresponding knowledge base to the research community.

preprint · 2022 · arXiv

FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry

To achieve accurate and robust pose estimation in Simultaneous Localization and Mapping (SLAM) tasks, multi-sensor fusion is proven to be an effective solution and thus provides great potential in robotic applications. This paper proposes FAST-LIVO, a fast LiDAR-Inertial-Visual Odometry system, which builds on two tightly-coupled and direct odometry subsystems: a VIO subsystem and a LIO subsystem. The LIO subsystem registers raw points (instead of feature points on, e.g., edges or planes) of a new scan to an incrementally-built point cloud map. The map points are additionally attached with image patches, which are then used in the VIO subsystem to align a new image by minimizing the direct photometric errors without extracting any visual features (e.g., ORB or FAST corner features). To further improve the VIO robustness and accuracy, a novel outlier rejection method is proposed to reject unstable map points that lie on edges or are occluded in the image view. Experiments on both open data sequences and our customized device data are conducted. The results show our proposed system outperforms other counterparts and can handle challenging environments at reduced computation cost. The system supports both multi-line spinning LiDARs and emerging solid-state LiDARs with completely different scanning patterns, and can run in real-time on both Intel and ARM processors. We open source our code and dataset of this work on GitHub to benefit the robotics community.

preprint · 2022 · arXiv

Focal Inverse Distance Transform Maps for Crowd Localization

In this paper, we focus on the crowd localization task, a crucial topic of crowd analysis. Most regression-based methods utilize convolutional neural networks (CNNs) to regress a density map, which cannot accurately locate instances in extremely dense scenes, for two crucial reasons: 1) the density map consists of a series of blurry Gaussian blobs, and 2) severe overlaps exist in the dense regions of the density map. To tackle this issue, we propose a novel Focal Inverse Distance Transform (FIDT) map for the crowd localization task. Compared with the density maps, the FIDT maps accurately describe the persons' locations without overlapping in dense regions. Based on the FIDT maps, a Local-Maxima-Detection-Strategy (LMDS) is derived to effectively extract the center point for each individual. Furthermore, we introduce an Independent SSIM (I-SSIM) loss to make the model tend to learn the local structural information, better recognizing local maxima. Extensive experiments demonstrate that the proposed method reports state-of-the-art localization performance on six crowd datasets and one vehicle dataset. Additionally, we find that the proposed method shows superior robustness on the negative and extremely dense scenes, which further verifies the effectiveness of the FIDT maps. The code and model will be available at https://github.com/dk-liang/FIDTM.
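
To illustrate how an inverse-distance-style map keeps heads separated where Gaussian blobs would overlap, here is a minimal sketch. The abstract does not give the formula, so the functional form 1 / (P^(alpha * P + beta) + C), with P the distance to the nearest annotated head, and the constants alpha = 0.02, beta = 0.75, C = 1 are assumptions that should be checked against the paper and its repository. The function name `fidt_map` is hypothetical, and the brute-force distance computation is chosen for clarity, not speed.

```python
import numpy as np

def fidt_map(points, shape, alpha=0.02, beta=0.75, c=1.0):
    """Build an FIDT-style map from head annotations given as (row, col) pairs.

    The formula and default constants are assumptions for illustration.
    """
    rows, cols = np.mgrid[0:shape[0], 0:shape[1]]
    pts = np.asarray(points, dtype=float)
    # Distance from every pixel to every head, then keep the nearest one.
    dist = np.sqrt((rows[..., None] - pts[:, 0]) ** 2 +
                   (cols[..., None] - pts[:, 1]) ** 2)
    nearest = dist.min(axis=-1)
    return 1.0 / (nearest ** (alpha * nearest + beta) + c)

# Usage: two heads on a 10 x 10 grid; each head is an isolated peak of value 1.
fidt = fidt_map([(2, 3), (7, 7)], (10, 10))
```

Each annotated position produces a peak of exactly 1 that decays quickly with distance, so nearby heads remain distinct local maxima, which is what a local-maxima detection step needs.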

preprint · 2022 · arXiv

Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

Standard model-free reinforcement learning algorithms optimize a policy that generates the action to be taken in the current time step in order to maximize expected future return. While flexible, it faces difficulties arising from the inefficient exploration due to its single step nature. In this work, we present Generative Planning method (GPM), which can generate actions not only for the current step, but also for a number of future steps (thus termed as generative planning). This brings several benefits to GPM. Firstly, since GPM is trained by maximizing value, the plans generated from it can be regarded as intentional action sequences for reaching high value regions. GPM can therefore leverage its generated multi-step plans for temporally coordinated exploration towards high value regions, which is potentially more effective than a sequence of actions generated by perturbing each action at single step level, whose consistent movement decays exponentially with the number of exploration steps. Secondly, starting from a crude initial plan generator, GPM can refine it to be adaptive to the task, which, in return, benefits future explorations. This is potentially more effective than commonly used action-repeat strategy, which is non-adaptive in its form of plans. Additionally, since the multi-step plan can be interpreted as the intent of the agent from now to a span of time period into the future, it offers a more informative and intuitive signal for interpretation. Experiments are conducted on several benchmark environments and the results demonstrated its effectiveness compared with several baseline methods.

preprint · 2022 · arXiv

Hierarchical Reinforcement Learning By Discovering Intrinsic Options

We propose a hierarchical reinforcement learning method, HIDIO, that can learn task-agnostic options in a self-supervised manner while jointly learning to utilize them to solve sparse-reward tasks. Unlike current hierarchical RL approaches that tend to formulate goal-reaching low-level tasks or pre-define ad hoc lower-level policies, HIDIO encourages lower-level option learning that is independent of the task at hand, requiring few assumptions or little knowledge about the task structure. These options are learned through an intrinsic entropy minimization objective conditioned on the option sub-trajectories. The learned options are diverse and task-agnostic. In experiments on sparse-reward robotic manipulation and navigation tasks, HIDIO achieves higher success rates with greater sample efficiency than regular RL baselines and two state-of-the-art hierarchical RL methods.

preprint · 2022 · arXiv

HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in Radiotherapy

Accurate segmentation of Anatomical brain Barriers to Cancer spread (ABCs) plays an important role for automatic delineation of Clinical Target Volume (CTV) of brain tumors in radiotherapy. Despite that variants of U-Net are state-of-the-art segmentation models, they have limited performance when dealing with ABCs structures with various shapes and sizes, especially thin structures (e.g., the falx cerebri) that span only few slices. To deal with this problem, we propose a High and Multi-Resolution Network (HMRNet) that consists of a multi-scale feature learning branch and a high-resolution branch, which can maintain the high-resolution contextual information and extract more robust representations of anatomical structures with various scales. We further design a Bidirectional Feature Calibration (BFC) block to enable the two branches to generate spatial attention maps for mutual feature calibration. Considering the different sizes and positions of ABCs structures, our network was applied after a rough localization of each structure to obtain fine segmentation results. Experiments on the MICCAI 2020 ABCs challenge dataset showed that: 1) Our proposed two-stage segmentation strategy largely outperformed methods segmenting all the structures in just one stage; 2) The proposed HMRNet with two branches can maintain high-resolution representations and is effective to improve the performance on thin structures; 3) The proposed BFC block outperformed existing attention methods using monodirectional feature calibration. Our method won the second place of ABCs 2020 challenge and has a potential for more accurate and reasonable delineation of CTV of brain tumors.

preprint · 2022 · arXiv

Intelligent MIMO Detection Using Meta Learning

In a K-best detector for multiple-input multiple-output (MIMO) systems, the value of K needs to be sufficiently large to achieve near-maximum-likelihood (ML) performance. By treating K as a variable that can be adjusted according to a fitting function of some learnable coefficients, an intelligent MIMO detection network based on deep neural networks (DNNs) is proposed to reduce the complexity of the detection algorithm with little performance degradation. In particular, the proposed intelligent detection algorithm uses meta learning to learn the coefficients of the fitting function for K to circumvent the problem of learning K directly. The idea of network fusion is used to combine the learning results of the meta learning component networks. Simulation results show that the proposed scheme achieves near-ML detection performance while its complexity is close to that of linear detectors. It also trains quickly.

preprint · 2022 · arXiv

Learning to Optimize Resource Assignment for Task Offloading in Mobile Edge Computing

In this paper, we consider a multiuser mobile edge computing (MEC) system, where a mixed-integer offloading strategy is used to assist the resource assignment for task offloading. Although the conventional branch and bound (BnB) approach can be applied to solve this problem, a huge burden of computational complexity arises which limits the application of BnB. To address this issue, we propose an intelligent BnB (IBnB) approach which applies deep learning (DL) to learn the pruning strategy of the BnB approach. By using this learning scheme, the structure of the BnB approach ensures near-optimal performance and meanwhile DL-based pruning strategy significantly reduces the complexity. Numerical results verify that the proposed IBnB approach achieves optimal performance with complexity reduced by over 80%.
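
The role a pruning strategy plays inside branch and bound can be sketched with a toy offloading problem. This is not the paper's formulation or its learned model. The problem (each task runs locally or on the edge, under an edge capacity limit), the function name `branch_and_bound`, and the pluggable `should_prune` hook are all hypothetical. The default hook is the classical bound test, and a learned classifier could in principle be substituted for it.

```python
def branch_and_bound(costs, loads, capacity, should_prune=None):
    """Toy offloading: costs[i] = (local_cost, edge_cost), loads[i] = edge load.

    Returns (best_cost, best_plan, visited_nodes); plan[i] is 1 if task i is offloaded.
    """
    n = len(costs)
    # suffix_min[i]: optimistic cost of tasks i..n-1 when capacity is ignored.
    suffix_min = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        suffix_min[i] = suffix_min[i + 1] + min(costs[i])
    if should_prune is None:
        # Classical rule: drop a node whose lower bound cannot beat the incumbent.
        should_prune = lambda bound, incumbent: bound >= incumbent
    best = {"cost": float("inf"), "plan": None, "visited": 0}

    def visit(i, cost, load, plan):
        best["visited"] += 1
        if i == n:                                  # complete assignment
            if cost < best["cost"]:
                best["cost"], best["plan"] = cost, plan[:]
            return
        if should_prune(cost + suffix_min[i], best["cost"]):
            return
        for choice in (0, 1):                       # 0 = local, 1 = offload
            new_load = load + (loads[i] if choice else 0)
            if new_load <= capacity:
                plan.append(choice)
                visit(i + 1, cost + costs[i][choice], new_load, plan)
                plan.pop()

    visit(0, 0.0, 0, [])
    return best["cost"], best["plan"], best["visited"]

# Usage: three tasks, edge capacity for at most two of them.
cost, plan, visited = branch_and_bound([(5, 1), (4, 2), (3, 1)], [2, 2, 2], capacity=4)
```

The bound-based default never discards the optimum. A more aggressive hook visits fewer nodes but can lose optimality, which is presumably the trade-off behind the abstract's "near-optimal" wording.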

preprint · 2022 · arXiv

Nuclear phase retrieval spectroscopy using resonant x-ray scattering

Light-matter interaction is exploited in spectroscopic techniques to access information about molecular, atomic or nuclear constituents of the sample of interest. While scattered light carries both amplitude and phase information of the electromagnetic field, most of the time the latter is lost in intensity measurements. However, often the phase information is paramount to reconstruct the desired information of the target, as it is well known from coherent x-ray imaging. Here we introduce a new phase retrieval algorithm which allows us to reconstruct the field phase information from two-dimensional time- and energy-resolved spectra. We apply this method to the particular case of x-ray scattering off Mössbauer nuclei at a synchrotron radiation source. Knowledge of the phase allows also for an excellent reconstruction of the energy spectra from experimental data, which could not be achieved with this resolution otherwise. Our approach provides an efficient novel data analysis tool which will benefit x-ray quantum optics and Mössbauer spectroscopy with synchrotron radiation alike.

preprint · 2022 · arXiv

PNM: Pixel Null Model for General Image Segmentation

A major challenge in image segmentation is classifying object boundaries. Recent efforts propose to refine the segmentation result with boundary masks. However, models are still prone to misclassifying boundary pixels even when they correctly capture the object contours. In such cases, even a perfect boundary map is unhelpful for segmentation refinement. In this paper, we argue that assigning proper prior weights to error-prone pixels such as object boundaries can significantly improve the segmentation quality. Specifically, we present the pixel null model (PNM), a prior model that weights each pixel according to its probability of being correctly classified by a random segmenter. Empirical analysis shows that PNM captures the misclassification distribution of different state-of-the-art (SOTA) segmenters. Extensive experiments on semantic, instance, and panoptic segmentation tasks over three datasets (Cityscapes, ADE20K, MS COCO) confirm that PNM consistently improves the segmentation quality of most SOTA methods (including the vision transformers) and outperforms boundary-based methods by a large margin. We also observe that the widely-used mean IoU (mIoU) metric is insensitive to boundaries of different sharpness. As a byproduct, we propose a new metric, PNM IoU, which perceives the boundary sharpness and better reflects the model segmentation performance in error-prone regions.

preprint2022arXiv

Pre-train or Annotate? Domain Adaptation with a Constrained Budget

Recent work has demonstrated that pre-training in-domain language models can boost performance when adapting to a new domain. However, the costs associated with pre-training raise an important question: given a fixed budget, what steps should an NLP practitioner take to maximize performance? In this paper, we view domain adaptation with a constrained budget as a consumer choice problem, where the goal is to select an optimal combination of data annotation and pre-training. We measure annotation costs of three procedural text datasets, along with the pre-training costs of several in-domain language models. The utility of different combinations of pre-training and data annotation are evaluated under varying budget constraints to assess which combination strategy works best. We find that for small budgets, spending all funds on annotation leads to the best performance; once the budget becomes large enough, however, a combination of data annotation and in-domain pre-training yields better performance. Our experiments suggest task-specific data annotation should be part of an economical strategy when adapting an NLP model to a new domain.

preprint2022arXiv

Pressure-induced mixed states caused by spin-elastic interactions during first-order spin phase transition in spin crossover compounds

Recently, the possibility of exploiting the phenomenon of spin transition (ST) has been intensively investigated; it is therefore particularly important to study the behavior of ST under various stimuli. Here, the shape and content of the intermediate phase of ST in Hoffmann-like compounds [Fe(Fpz)2M(CN)4] (M = Pt, Pd) under external stimuli are studied. For this purpose, magnetic and Raman spectroscopy measurements were carried out. In pressure-induced spin transition (PIST), a mixture of high-spin and low-spin states appears, while in temperature-induced spin transition (TIST), a homogeneous state occurs. The first-order ST induced by pressure is hysteretic but not abrupt, whereas the temperature-induced spin transition at ambient pressure is both hysteretic and abrupt. To investigate this difference, we use a thermodynamic model that considers elastic interactions, showing that the slope of the hysteresis loop is related to the appearance of internal pressure, which in turn is related to the difference in sample compressibility between the high-spin and low-spin states.

preprint2022arXiv

RIS-Assisted Quasi-Static Broad Coverage for Wideband mmWave Massive MIMO Systems

Reconfigurable intelligent surfaces (RISs) can establish favorable wireless environments to combat the severe attenuation and blockages in millimeter-wave (mmWave) bands. However, to achieve the optimal enhancement of performance, the instantaneous channel state information (CSI) needs to be estimated at the cost of a large overhead that scales with the number of RIS elements and the number of users. In this paper, we design a quasi-static broad coverage at the RIS with the reduced overhead based on the statistical CSI. We propose a design framework to synthesize the power pattern reflected by the RIS that meets the customized requirements of broad coverage. For the communication of broadcast channels, we generalize the broad coverage of the single transmit stream to the scenario of multiple streams. Moreover, we employ the quasi-static broad coverage for a multiuser orthogonal frequency division multiplexing access (OFDMA) system, and derive the analytical expression of the downlink rate, which is proved to increase logarithmically with the power gain reflected by the RIS. By taking into account the overhead of channel estimation, the proposed quasi-static broad coverage even outperforms the design method that optimizes the RIS phases using the instantaneous CSI. Numerical simulations are conducted to verify these observations.

preprint2022arXiv

Testing gravitational redshift based on microwave frequency links onboard China Space Station

In 2022 the China Space Station (CSS) will be equipped with atomic clocks and optical clocks with stabilities of $2 \times 10^{-16}$ and $8 \times 10^{-18}$, respectively, which provides an excellent opportunity to test gravitational redshift (GR) with higher accuracy than previous results. Based on high-precision frequency links between the CSS and a ground station, we formulated a model and carried out simulation experiments to test GR. Simulation results suggest that this method could test the GR at the accuracy level of $(0.27 \pm 2.15) \times 10^{-7}$, more than two orders of magnitude better than the result of the experiment with a hydrogen clock on board a flying rocket more than 40 years ago.
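As a rough illustration of the size of the effect being tested, the first-order gravitational redshift between two clocks is approximately the potential difference divided by $c^2$. The sketch below evaluates it for a point-mass Earth. The 400 km altitude, the constants and the function name are assumptions made for illustration and are not taken from the paper, and the velocity-dependent (second-order Doppler) term is deliberately ignored.

```python
# Illustrative only: first-order gravitational redshift for a point-mass Earth.
# All constants and the 400 km altitude are assumptions, not values from the paper.
GM_EARTH = 3.986004418e14   # m^3/s^2, Earth's gravitational parameter
R_EARTH = 6.371e6           # m, mean Earth radius (assumed ground clock radius)
C_LIGHT = 299_792_458.0     # m/s, speed of light


def gravitational_redshift(altitude_m: float) -> float:
    """Fractional frequency shift of a clock at altitude_m relative to the ground."""
    # Newtonian potential difference between the raised clock and the ground clock.
    delta_u = GM_EARTH * (1.0 / R_EARTH - 1.0 / (R_EARTH + altitude_m))
    return delta_u / C_LIGHT**2


shift = gravitational_redshift(400e3)  # assumed orbital altitude of about 400 km
print(f"fractional frequency shift: {shift:.3e}")
```

Under these assumptions the shift is of order $10^{-11}$, which is why clock stabilities at the $10^{-16}$ level or better leave room for a stringent test.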

preprint2022arXiv

TransCrowd: weakly-supervised crowd counting with transformers

The mainstream crowd counting methods usually utilize the convolution neural network (CNN) to regress a density map, requiring point-level annotations. However, annotating each person with a point is an expensive and laborious process. During the testing phase, the point-level annotations are not considered to evaluate the counting accuracy, which means the point-level annotations are redundant. Hence, it is desirable to develop weakly-supervised counting methods that just rely on count-level annotations, a more economical way of labeling. Current weakly-supervised counting methods adopt the CNN to regress a total count of the crowd by an image-to-count paradigm. However, having limited receptive fields for context modeling is an intrinsic limitation of these weakly-supervised CNN-based methods. These methods thus cannot achieve satisfactory performance, with limited applications in the real world. The transformer is a popular sequence-to-sequence prediction model in natural language processing (NLP), which contains a global receptive field. In this paper, we propose TransCrowd, which reformulates the weakly-supervised crowd counting problem from the perspective of sequence-to-count based on transformers. We observe that the proposed TransCrowd can effectively extract the semantic crowd information by using the self-attention mechanism of transformer. To the best of our knowledge, this is the first work to adopt a pure transformer for crowd counting research. Experiments on five benchmark datasets demonstrate that the proposed TransCrowd achieves superior performance compared with all the weakly-supervised CNN-based counting methods and gains highly competitive counting performance compared with some popular fully-supervised counting methods.

preprint2022arXiv

Worst-case Design for RIS-aided Over-the-air Computation with Imperfect CSI

Over-the-air computation (AirComp) enables fast wireless data aggregation at the receiver through concurrent transmission by sensors in the application of Internet-of-Things (IoT). To further improve the performance of AirComp under unfavorable propagation channel conditions, we consider the problem of computation distortion minimization in a reconfigurable intelligent surface (RIS)-aided AirComp system. In particular, we take into account an additive bounded uncertainty of the channel state information (CSI) and the total power constraint, and jointly optimize the transceiver (Tx-Rx) and the RIS phase design from the perspective of worst-case robustness by minimizing the mean squared error (MSE) of the computation. To solve this intractable nonconvex problem, we develop an efficient alternating algorithm where both solutions to the robust sub-problem and to the joint design of Tx-Rx and RIS are obtained in closed forms. Simulation results demonstrate the effectiveness of the proposed method.

preprint2021arXiv

Analysis and Optimization for RIS-Aided Multi-Pair Communications Relying on Statistical CSI

In this paper, we investigate a reconfigurable intelligent surface (RIS) aided multi-pair communication system, in which multi-pair users exchange information via an RIS. We derive an approximate expression of the achievable rate by assuming that statistical channel state information (CSI) is available. A genetic algorithm (GA) to solve the rate maximization problem is proposed as well. In particular, we consider implementations of RISs with continuous phase shifts (CPSs) and discrete phase shifts (DPSs). Simulation results verify the correctness of the obtained results and show that the proposed GA method has almost the same performance as the globally optimal solution. In addition, numerical results show that three quantization bits can achieve a large portion of the sum achievable rate for the CPSs setup.
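The difference between continuous and discrete phase shifts can be sketched with a uniform quantizer. This is a minimal sketch assuming uniformly spaced phase levels on $[0, 2\pi)$; the function names are hypothetical and the paper's genetic algorithm is not reproduced here.

```python
import math


def quantize_phase(theta: float, bits: int) -> float:
    """Map a continuous phase (radians) to the nearest of 2**bits uniform levels."""
    levels = 2 ** bits
    step = 2.0 * math.pi / levels
    index = round(theta / step) % levels  # wrap so a phase near 2*pi maps to level 0
    return index * step


def phase_error(theta: float, bits: int) -> float:
    """Quantization error wrapped into [-pi, pi)."""
    diff = theta - quantize_phase(theta, bits)
    return (diff + math.pi) % (2.0 * math.pi) - math.pi


# Worst-case error over a grid of phases for a 3-bit shifter (8 levels).
worst = max(abs(phase_error(k * 2.0 * math.pi / 1000, 3)) for k in range(1000))
print(worst <= math.pi / 8 + 1e-9)
```

With $b$ bits the wrapped error never exceeds half a step, $\pi / 2^b$, so each added bit halves the worst-case phase mismatch.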

preprint2021arXiv

Avoiding dynamic small obstacles with onboard sensing and computating on aerial robots

In practical applications, autonomous quadrotors are still facing significant challenges, such as the detection and avoidance of very small and even dynamic obstacles (e.g., tree branches, power lines). In this paper, we propose a compact, integrated, and fully autonomous quadrotor system, which can fly safely in cluttered environments while avoiding dynamic small obstacles. Our quadrotor platform is equipped with a forward-looking three-dimensional (3D) light detection and ranging (lidar) sensor to perceive the environment and an onboard embedded computer to perform all the estimation, mapping, and planning tasks. Specifically, the computer estimates the current pose of the UAV, maintains a local map (time-accumulated point clouds KD-Trees), and computes a safe trajectory using kinodynamic A* search to the goal point. The whole perception and planning system can run onboard at 50Hz with careful optimization. Various indoor and outdoor experiments show that the system can avoid dynamic small obstacles (down to 20mm diameter bar) while flying at 2m/s in cluttered environments. Our codes and hardware design are open-sourced on Github.

preprint2021arXiv

Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-assisted Mobile Edge Computing

In this paper, we consider a platform of flying mobile edge computing (F-MEC), where unmanned aerial vehicles (UAVs) serve as equipment providing computation resources and enable task offloading from user equipment (UE). We aim to minimize the energy consumption of all the UEs by optimizing the user association, resource allocation and the trajectory of the UAVs. To this end, we first propose a Convex optimizAtion based Trajectory control algorithm (CAT), which solves the problem iteratively using the block coordinate descent (BCD) method. Then, to make real-time decisions while taking into account the dynamics of the environment (i.e., a UAV may take off from different locations), we propose a deep Reinforcement leArning based Trajectory control algorithm (RAT). In RAT, we apply Prioritized Experience Replay (PER) to improve the convergence of the training procedure. Unlike the convex optimization based algorithm, which may be sensitive to the initial points and requires iterations, RAT can be adapted to any take-off points of the UAVs and can obtain the solution more rapidly than CAT once the training process has been completed. Simulation results show that the proposed CAT and RAT achieve similar performance and both outperform traditional algorithms.

preprint2021arXiv

ikd-Tree: An Incremental K-D Tree for Robotic Applications

This paper proposes an efficient data structure, ikd-Tree, for dynamic space partition. The ikd-Tree incrementally updates a k-d tree with new coming points only, leading to much lower computation time than existing static k-d trees. Besides point-wise operations, the ikd-Tree supports several features such as box-wise operations and down-sampling that are practically useful in robotic applications. In parallel to the incremental operations (i.e., insert, re-insert, and delete), ikd-Tree actively monitors the tree structure and partially re-balances the tree, which enables efficient nearest point search in later stages. The ikd-Tree is carefully engineered and supports multi-thread parallel computing to maximize the overall efficiency. We validate the ikd-Tree in both theory and practical experiments. On theory level, a complete time complexity analysis is presented to prove the high efficiency. On experiment level, the ikd-Tree is tested on both randomized datasets and real-world LiDAR point data in LiDAR-inertial odometry and mapping application. In all tests, ikd-Tree consumes only 4% of the running time in a static k-d tree.
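The idea of updating a k-d tree with new points only, rather than rebuilding it, can be sketched as follows. This is a plain textbook k-d tree with point-wise insertion and nearest-point search, not the authors' ikd-Tree: it omits re-balancing, deletion, box-wise operations, down-sampling and multi-threading, and all names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

Point = Tuple[float, ...]


@dataclass
class Node:
    point: Point
    axis: int                      # splitting dimension at this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def insert(root: Optional[Node], point: Point, depth: int = 0) -> Node:
    """Insert one point without rebuilding the tree; returns the root."""
    if root is None:
        return Node(point, depth % len(point))
    if point[root.axis] < root.point[root.axis]:
        root.left = insert(root.left, point, depth + 1)
    else:
        root.right = insert(root.right, point, depth + 1)
    return root


def sq_dist(a: Point, b: Point) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(root: Optional[Node], query: Point,
            best: Optional[Point] = None) -> Optional[Point]:
    """Return the stored point closest to query (None for an empty tree)."""
    if root is None:
        return best
    if best is None or sq_dist(query, root.point) < sq_dist(query, best):
        best = root.point
    diff = query[root.axis] - root.point[root.axis]
    near, far = (root.left, root.right) if diff < 0 else (root.right, root.left)
    best = nearest(near, query, best)
    # Visit the far side only if the splitting plane is closer than the best match.
    if diff * diff < sq_dist(query, best):
        best = nearest(far, query, best)
    return best


points = [(2.0, 3.0), (5.0, 4.0), (9.0, 6.0), (4.0, 7.0), (8.0, 1.0), (7.0, 2.0)]
tree = None
for p in points:                   # points arrive one at a time
    tree = insert(tree, p)
print(nearest(tree, (9.0, 2.0)))
```

Without re-balancing, a stream of sorted points would degrade this tree into a list, which is the problem the partial re-balancing described in the abstract is meant to address.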

preprint2021arXiv

MetaView: Few-shot Active Object Recognition

In robot sensing scenarios, instead of passively utilizing human captured views, an agent should be able to actively choose informative viewpoints of a 3D object as discriminative evidence to boost the recognition accuracy. This task is referred to as active object recognition. Recent works on this task rely on a massive amount of training examples to train an optimal view selection policy. But in realistic robot sensing scenarios, the large-scale training data may not exist and whether the intelligent view selection policy can be still learned from few object samples remains unclear. In this paper, we study this new problem which is extremely challenging but very meaningful in robot sensing -- Few-shot Active Object Recognition, i.e., to learn view selection policies from few object samples, which has not been considered and addressed before. We solve the proposed problem by adopting the framework of meta learning and name our method "MetaView". Extensive experiments on both category-level and instance-level classification tasks demonstrate that the proposed method can efficiently resolve issues that are hard for state-of-the-art active object recognition methods to handle, and outperform several baselines by large margins.

preprint2021arXiv

R2LIVE: A Robust, Real-time, LiDAR-Inertial-Visual tightly-coupled state Estimator and mapping

In this letter, we propose a robust, real-time tightly-coupled multi-sensor fusion framework, which fuses measurement from LiDAR, inertial sensor, and visual camera to achieve robust and accurate state estimation. Our proposed framework is composed of two parts: the filter-based odometry and factor graph optimization. To guarantee real-time performance, we estimate the state within the framework of error-state iterated Kalman-filter, and further improve the overall precision with our factor graph optimization. Taking advantage of measurement from all individual sensors, our algorithm is robust enough to various visual failure, LiDAR-degenerated scenarios, and is able to run in real-time on an on-board computation platform, as shown by extensive experiments conducted in indoor, outdoor, and mixed environment of different scale. Moreover, the results show that our proposed framework can improve the accuracy of state-of-the-art LiDAR-inertial or visual-inertial odometry. To share our findings and to make contributions to the community, we open source our codes on our Github.

preprint2021arXiv

The solution space structure of planted constraint satisfaction problems with growing domains

Planting a solution into the random RB model, which is a prototype of random constraint satisfaction problem (CSP) with growing domains, can generate very hard satisfiable CSP benchmarks. We study the solution space structure of the planted RB model. As the constraint density grows, we find that this model goes through four phase transitions. In the replica symmetric phase, what we call the independent phase transition occurs, after which the planted cluster (the cluster containing the planted solution) is separated from the giant cluster. Then the solutions other than those in the planted cluster go through the same clustering phase transition and the same satisfiability phase transition as the random RB model. The planted cluster goes through the isolated phase transition, after which the planted cluster contains only one solution. This phase diagram provides strong evidence that this model can generate very hard satisfiable CSP benchmarks. For over-constrained instances (where the constraint density is very large), we find that the configuration space has only a single energy valley, which makes the instances tractable. Experiments using Belief Propagation confirm the locations of the clustering, satisfiability (by configurations outside the planted cluster), and isolated phase transition points.

preprint2020arXiv

A Bayes Factor Approach with Informative Prior for Rare Genetic Variant Analysis from Next Generation Sequencing Data

The discovery of rare genetic variants through Next Generation Sequencing is a very challenging issue in the field of human genetics. We propose a novel region-based statistical approach based on a Bayes Factor (BF) to assess evidence of association between a set of rare variants (RVs) located on the same genomic region and a disease outcome in the context of case-control design. Marginal likelihoods are computed under the null and alternative hypotheses assuming a binomial distribution for the RV count in the region and a beta or mixture of Dirac and beta prior distribution for the probability of RV. We derive the theoretical null distribution of the BF under our prior setting and show that a Bayesian control of the False Discovery Rate (BFDR) can be obtained for genome-wide inference. Informative priors are introduced using prior evidence of association from a Kolmogorov-Smirnov test statistic. We use our simulation program, sim1000G, to generate RV data similar to the 1,000 genomes sequencing project. Our simulation studies showed that the new BF statistic outperforms standard methods (SKAT, SKAT-O, Burden test) in case-control studies with moderate sample sizes and is equivalent to them under large sample size scenarios. Our real data application to a lung cancer case-control study found enrichment for RVs in known and novel cancer genes. It also suggests that using the BF with informative prior improves the overall gene discovery compared to the BF with non-informative prior.
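To make the marginal-likelihood comparison concrete, the sketch below computes a generic two-group Bayes factor with binomial counts and a Beta(a, b) prior. It is a textbook construction under stated assumptions (one shared probability under the null, independent probabilities for cases and controls under the alternative) and is not necessarily the statistic defined in the paper. The mixture of Dirac and beta prior, the informative priors and the BFDR control are not covered, and all function names are hypothetical.

```python
from math import exp, lgamma


def log_beta(a: float, b: float) -> float:
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_choose(n: int, k: int) -> float:
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def log_marginal(k: int, n: int, a: float, b: float) -> float:
    """Log beta-binomial marginal likelihood of k events in n trials."""
    return log_choose(n, k) + log_beta(k + a, n - k + b) - log_beta(a, b)


def bayes_factor(k_case: int, n_case: int, k_ctrl: int, n_ctrl: int,
                 a: float = 1.0, b: float = 1.0) -> float:
    """BF of H1 (separate probabilities) against H0 (one shared probability)."""
    # H1: cases and controls each have their own probability with a Beta(a, b) prior.
    log_m1 = log_marginal(k_case, n_case, a, b) + log_marginal(k_ctrl, n_ctrl, a, b)
    # H0: both groups share a single probability with a Beta(a, b) prior.
    log_m0 = (log_choose(n_case, k_case) + log_choose(n_ctrl, k_ctrl)
              + log_beta(k_case + k_ctrl + a,
                         n_case + n_ctrl - k_case - k_ctrl + b)
              - log_beta(a, b))
    return exp(log_m1 - log_m0)


print(bayes_factor(50, 1000, 50, 1000))   # equal counts: evidence favours H0
print(bayes_factor(100, 1000, 20, 1000))  # very different counts: favours H1
```

Working in log space with `lgamma` avoids overflow in the binomial coefficients for the sample sizes typical of case-control data.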

preprint2020arXiv

An averaging principle for fractional stochastic differential equations with Lévy noise

This paper is devoted to the study of an averaging principle for fractional stochastic differential equations in $\mathbb{R}^n$ with Lévy motion, using an integral transform method. We obtain a time-averaged equation under suitable assumptions. Furthermore, we show that the solutions of the averaged equation approach the solutions of the original equation. Our results in this paper provide a better understanding of effective approximation of fractional dynamical systems with non-Gaussian Lévy noise.

preprint2020arXiv

Analog Versus Hybrid Precoding for Multiuser Massive MIMO with Quantized CSI Feedback

In this letter, we study the performance of a downlink multiuser massive multiple-input multiple-output (MIMO) system with sub-connected structure over limited feedback channels. Tight rate approximations are theoretically analyzed for the system with pure analog precoding and hybrid precoding. The effect of quantized analog and digital precoding is characterized in the derived expressions. Furthermore, it is revealed that the pure analog precoding outperforms the hybrid precoding using maximal-ratio transmission (MRT) or zero forcing (ZF) under certain conditions, and we theoretically characterize the conditions in closed form with respect to signal-to-noise ratio (SNR), the number of users and the number of feedback bits. Numerical results verify the derived conclusions on both Rayleigh channels and mmWave channels.

preprint2020arXiv

AnciNet: An Efficient Deep Learning Approach for Feedback Compression of Estimated CSI in Massive MIMO Systems

Accurate channel state information (CSI) feedback plays a vital role in improving the performance gain of massive multiple-input multiple-output (m-MIMO) systems, where the dilemma is excessive CSI overhead versus limited feedback bandwidth. By considering the noisy CSI due to imperfect channel estimation, we propose a novel deep neural network architecture, namely AnciNet, to conduct the CSI feedback with limited bandwidth. AnciNet extracts noise-free features from the noisy CSI samples to achieve effective CSI compression for the feedback. Experimental results verify that the proposed AnciNet approach outperforms the existing techniques under various conditions.

preprint2020arXiv

Asymptotic Results for Heavy-tailed Lévy Processes and their Exponential Functionals

In this paper we first provide several conditional limit theorems for Lévy processes with negative drift and regularly varying tail. Then we apply them to study the asymptotic behavior of expectations of some exponential functionals of heavy-tailed Lévy processes. As the key point, we observe that the asymptotics mainly depends on the sample paths with early arrival large jump. Both the polynomial decay rate and the exact expression of the limit coefficients are given. As an application, we give an exact description for the extinction speed of continuous-state branching processes in heavy-tailed Lévy random environment with stable branching mechanism.

preprint2020arXiv

Attacking Optical Character Recognition (OCR) Systems with Adversarial Watermarks

Optical character recognition (OCR) is widely applied in real applications, serving as a key preprocessing tool. The adoption of deep neural networks (DNNs) in OCR results in vulnerability to adversarial examples, which are crafted to mislead the output of the threat model. Unlike vanilla colorful images, images of printed text usually have clear backgrounds. However, adversarial examples generated by most existing adversarial attacks are unnatural and pollute the background severely. To address this issue, we propose a watermark attack method that produces natural distortion disguised as watermarks and evades detection by the human eye. Experimental results show that watermark attacks can yield a set of natural adversarial examples attached with watermarks and attain attack performance similar to the state-of-the-art methods in different attack scenarios.

preprint2020arXiv

Chimbuko: A Workflow-Level Scalable Performance Trace Analysis Tool

Because of the limits input/output systems currently impose on high-performance computing systems, a new generation of workflows that include online data reduction and analysis is emerging. Diagnosing their performance requires sophisticated performance analysis capabilities due to the complexity of execution patterns and underlying hardware, and no tool could handle the voluminous performance trace data needed to detect potential problems. This work introduces Chimbuko, a performance analysis framework that provides real-time, distributed, in situ anomaly detection. Data volumes are reduced for human-level processing without losing necessary details. Chimbuko supports online performance monitoring via a visualization module that presents the overall workflow anomaly distribution, call stacks, and timelines. Chimbuko also supports the capture and reduction of performance provenance. To the best of our knowledge, Chimbuko is the first online, distributed, and scalable workflow-level performance trace analysis framework, and we demonstrate the tool's usefulness on Oak Ridge National Laboratory's Summit system.

preprint2020arXiv

Determining geopotential difference via relativistic precise point positioning time comparison: A case study using simulated observations

According to general relativity theory (GRT), the geopotential difference (GD) can be determined by comparing the change in time difference between precise clocks using the precise point positioning (PPP) time transfer technique, referred to as the relativistic PPP time comparison approach. We focused on high-precision time comparison between two precise clocks for determining the GD using the relativistic PPP time transfer, and conducted simulation experiments to validate the approach. In the experiments, we consider three cases to evaluate the performance of the approach using clocks with different stabilities, namely, the frequency stabilities of the clocks equipped at three selected ground stations are respectively (Case 1), (Case 2), and (Case 3) at time period. Conclusions are drawn from the experimental results. First, high-precision clocks can significantly improve the accuracy for PPP time transfer, but the improvement is limited by measurement noises. Compared to Case 1, the long-term stabilities of OPMT-BRUX as well as PTBB-BRUX are improved in Cases 2 and 3. The frequency stabilities of Cases 1-3 are approximately $4.28 \times 10^{-16}$, $4.00 \times 10^{-17}$, and $3.22 \times 10^{-17}$ at 10-day averaging time for OPMT-BRUX, respectively, and for PTBB-BRUX, these values are approximately $3.73 \times 10^{-16}$, $8.17 \times 10^{-17}$, and $4.64 \times 10^{-17}$. Second, the geopotential difference between any two stations can be determined at the decimeter level, with its accuracy being consistent with the stabilities of the time links in Cases 1-3. In Case 3, the determined geopotential differences between OPMT and BRUX deviate from the EIGEN-6C4 model values by -0.64 m$^2$/s$^2$ with an uncertainty of 1.11 m$^2$/s$^2$, whereas the deviation error between PTBB and BRUX is 0.76 m$^2$/s$^2$ with an uncertainty of 1.79 m$^2$/s$^2$. The approach proposed in this study can also be applied to testing GRT.

preprint2020arXiv

Discourse Level Factors for Sentence Deletion in Text Simplification

This paper presents a data-driven study focusing on analyzing and predicting sentence deletion -- a prevalent but understudied phenomenon in document simplification -- on a large English text simplification corpus. We inspect various document and discourse factors associated with sentence deletion, using a new manually annotated sentence alignment corpus we collected. We reveal that professional editors utilize different strategies to meet readability standards of elementary and middle schools. To predict whether a sentence will be deleted during simplification to a certain level, we harness automatically aligned data to train a classification model. Evaluated on our manually annotated data, our best models reached F1 scores of 65.2 and 59.7 for this task at the levels of elementary and middle school, respectively. We find that discourse level factors contribute to the challenging task of predicting sentence deletion for simplification.

preprint2020arXiv

Distributed IRS with Statistical Passive Beamforming for MISO Communications

Intelligent reflecting surface (IRS) has recently been identified as a prominent technology with the ability of enhancing wireless communication by dynamically manipulating the propagation environment. This paper investigates a multiple-input single-output (MISO) system deploying distributed IRSs. For practical considerations, we propose an efficient design of passive reflecting beamforming for the IRSs to exploit statistical channel state information (CSI) and analyze the achievable rate of the network taking into account the impact of CSI estimation error. The ergodic achievable rate is derived in a closed form, which provides insightful system design guidelines. Numerical results confirm the accuracy of the derived results and unveil the performance superiority of the proposed distributed IRS deployment over the conventional centralized deployment.

preprint2020arXiv

Energy-Efficient Wireless Communications with Distributed Reconfigurable Intelligent Surfaces

This paper investigates the problem of resource allocation for a wireless communication network with distributed reconfigurable intelligent surfaces (RISs). In this network, multiple RISs are spatially distributed to serve wireless users and the energy efficiency of the network is maximized by dynamically controlling the on-off status of each RIS as well as optimizing the reflection coefficients matrix of the RISs. This problem is posed as a joint optimization problem of transmit beamforming and RIS control, whose goal is to maximize the energy efficiency under minimum rate constraints of the users. To solve this problem, two iterative algorithms are proposed for the single-user case and multi-user case. For the single-user case, the phase optimization problem is solved by using a successive convex approximation method, which admits a closed-form solution at each step. Moreover, the optimal RIS on-off status is obtained by using the dual method. For the multi-user case, a low-complexity greedy searching method is proposed to solve the RIS on-off optimization problem. Simulation results show that the proposed scheme achieves up to 33% and 68% gains in terms of the energy efficiency in both single-user and multi-user cases compared to the conventional RIS scheme and amplify-and-forward relay scheme, respectively.

preprint2020arXiv

Feature Statistics Guided Efficient Filter Pruning

Building compact convolutional neural networks (CNNs) with reliable performance is a critical but challenging task, especially when deploying them in real-world applications. As a common approach to reduce the size of CNNs, pruning methods delete part of the CNN filters according to some metrics such as the $\ell_1$-norm. However, previous methods hardly leverage the information variance in a single feature map and the similarity characteristics among feature maps. In this paper, we propose a novel filter pruning method, which incorporates two kinds of feature map selections: diversity-aware selection (DFS) and similarity-aware selection (SFS). DFS aims to discover features with low information diversity while SFS removes features that have high similarities with others. We conduct extensive empirical experiments with various CNN architectures on publicly available datasets. The experimental results demonstrate that our model obtains up to 91.6% parameter decrease and 83.7% FLOPs reduction with almost no accuracy loss.
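The norm-based baseline mentioned in the abstract can be sketched in a few lines: rank filters by the sum of absolute weights and drop the smallest ones. This illustrates only that common criterion, not the proposed DFS or SFS selections. The nested-list weight layout, the function names and the pruning ratio are assumptions made for the example.

```python
def l1_norm(weights) -> float:
    """Sum of absolute values over an arbitrarily nested list of numbers."""
    if isinstance(weights, (int, float)):
        return abs(weights)
    return sum(l1_norm(w) for w in weights)


def filters_to_keep(filters, prune_ratio: float):
    """Indices of the filters kept after dropping the smallest-norm fraction."""
    n_prune = int(len(filters) * prune_ratio)
    # Sort filter indices from smallest to largest norm.
    ranked = sorted(range(len(filters)), key=lambda i: l1_norm(filters[i]))
    return sorted(ranked[n_prune:])


# Four toy 2x2 filters (hypothetical weights).
filters = [
    [[0.1, -0.1], [0.0, 0.2]],
    [[1.0, -2.0], [0.5, 0.5]],
    [[0.01, 0.0], [0.0, 0.0]],
    [[-0.7, 0.3], [0.2, 0.1]],
]
print(filters_to_keep(filters, prune_ratio=0.5))
```

A criterion like this looks only at the weights, whereas the selections described in the abstract are driven by statistics of the feature maps.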

preprint2020arXiv

Generalizing Natural Language Analysis through Span-relation Representations

Natural language processing covers a wide variety of tasks predicting syntax, semantics, and information content, and usually each type of output is generated with specially designed architectures. In this paper, we provide the simple insight that a great variety of tasks can be represented in a single unified format consisting of labeling spans and relations between spans, thus a single task-independent model can be used across different tasks. We perform extensive experiments to test this insight on 10 disparate tasks spanning dependency parsing (syntax), semantic role labeling (semantics), relation extraction (information content), aspect based sentiment analysis (sentiment), and many others, achieving performance comparable to state-of-the-art specialized models. We further demonstrate benefits of multi-task learning, and also show that the proposed method makes it easy to analyze differences and similarities in how the model handles different tasks. Finally, we convert these datasets into a unified format to build a benchmark, which provides a holistic testbed for evaluating future models for generalized natural language analysis.

preprint2020arXiv

Hybrid Transceiver Optimization for Multi-Hop Communications

Multi-hop communication with the aid of large-scale antenna arrays will play a vital role in future emergence communication systems. In this paper, we investigate amplify-and-forward based and multiple-input multiple-output assisted multi-hop communication, in which all nodes employ hybrid transceivers. Moreover, channel errors are taken into account in our hybrid transceiver design. Based on the matrix-monotonic optimization framework, the optimal structures of the robust hybrid transceivers are derived. By utilizing these optimal structures, the optimizations of analog transceivers and digital transceivers can be separated without loss of optimality. This fact greatly simplifies the joint optimization of analog and digital transceivers. Since the optimization of analog transceivers under unit-modulus constraints is non-convex, a projection type algorithm is proposed for analog transceiver optimization to overcome this difficulty. Based on the derived analog transceivers, the optimal digital transceivers can then be derived using matrix-monotonic optimization. Numerical results demonstrate the performance advantages of the proposed hybrid transceiver designs over other existing solutions.

preprint2020arXiv

Implicit Generative Modeling for Efficient Exploration

Efficient exploration remains a challenging problem in reinforcement learning, especially for those tasks where rewards from environments are sparse. A commonly used approach for exploring such environments is to introduce some "intrinsic" reward. In this work, we focus on model uncertainty estimation as an intrinsic reward for efficient exploration. In particular, we introduce an implicit generative modeling approach to estimate a Bayesian uncertainty of the agent's belief of the environment dynamics. Each random draw from our generative model is a neural network that instantiates the dynamic function, hence multiple draws would approximate the posterior, and the variance in the future prediction based on this posterior is used as an intrinsic reward for exploration. We design a training algorithm for our generative model based on the amortized Stein Variational Gradient Descent. In experiments, we compare our implementation with state-of-the-art intrinsic reward-based exploration approaches, including two recent approaches based on an ensemble of dynamic models. In challenging exploration tasks, our implicit generative model consistently outperforms competing approaches regarding data efficiency in exploration.
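
The core idea in this abstract, using the variance of future predictions across sampled dynamics models as an intrinsic reward, can be sketched as follows. This is a minimal illustration under stated assumptions: the predictions are hard-coded stand-ins for draws from a generative model, the function name `intrinsic_reward` is hypothetical, and the amortized Stein Variational Gradient Descent training described in the abstract is not reproduced.

```python
from statistics import pvariance


def intrinsic_reward(predictions):
    """Average per-dimension variance across sampled dynamics models.

    predictions: list of predicted next-state vectors, one per sampled model.
    High disagreement between models yields a high exploration bonus.
    """
    dims = len(predictions[0])
    per_dim = [pvariance([p[d] for p in predictions]) for d in range(dims)]
    return sum(per_dim) / dims


# Three hypothetical model draws predicting a 2-D next state.
agree = [[1.0, 2.0], [1.0, 2.0], [1.0, 2.0]]
disagree = [[0.0, 2.0], [1.0, 2.0], [2.0, 2.0]]

print(intrinsic_reward(agree))     # models agree, so no bonus
print(intrinsic_reward(disagree))  # models disagree, so a positive bonus
```

States where the sampled models disagree receive a larger bonus, which steers the agent toward transitions the models have not yet learned to predict.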

preprint2020arXiv

Interactive Visual Study of Multiple Attributes Learning Model of X-Ray Scattering Images

Existing interactive visualization tools for deep learning are mostly applied to the training, debugging, and refinement of neural network models working on natural images. However, visual analytics tools are lacking for the specific application of x-ray image classification with multiple structural attributes. In this paper, we present an interactive system for domain scientists to visually study the multiple attributes learning models applied to x-ray scattering images. It allows domain scientists to interactively explore this important type of scientific image in embedded spaces that are defined on the model prediction output, the actual labels, and the discovered feature space of neural networks. Users can flexibly select instance images and their clusters, and compare them regarding the specified visual representation of attributes. The exploration is guided by the manifestation of model performance related to mutual relationships among attributes, which often affect the learning accuracy and effectiveness. The system thus helps domain scientists improve the training dataset and model, find questionable attribute labels, and identify outlier images or spurious data clusters. Case studies and scientists' feedback demonstrate its functionalities and usefulness.

preprint2020arXiv

Interpreting Galaxy Deblender GAN from the Discriminator's Perspective

Generative adversarial networks (GANs) are well known for their unsupervised learning capabilities. A recent success in the field of astronomy is deblending two overlapping galaxy images via a branched GAN model. However, it remains a significant challenge to comprehend how the network works, which is particularly difficult for non-expert users. This research focuses on behaviors of one of the network's major components, the Discriminator, which plays a vital role but is often overlooked. Specifically, we enhance the Layer-wise Relevance Propagation (LRP) scheme to generate a heatmap-based visualization. We call this technique Polarized-LRP, and it consists of two parts, i.e., positive contribution heatmaps for ground truth images and negative contribution heatmaps for generated images. Using the Galaxy Zoo dataset, we demonstrate that our method clearly reveals attention areas of the Discriminator when differentiating generated galaxy images from ground truth images. To connect the Discriminator's impact on the Generator, we visualize the gradual changes of the Generator across the training process. An interesting result we have achieved there is the detection of a problematic data augmentation procedure that would otherwise have remained hidden. We find that our proposed method serves as a useful visual analytical tool for a deeper understanding of GAN models.

preprint2020arXiv

Joint Transmit Power and Placement Optimization for URLLC-enabled UAV Relay Systems

This letter considers an unmanned aerial vehicle (UAV)-enabled relay communication system for delivering latency-critical messages with ultra-high reliability, where the relay operates in amplify-and-forward (AF) mode. We aim to jointly optimize the UAV location and power to minimize decoding error probability while guaranteeing the latency constraints. Both the free-space channel model and three-dimensional (3-D) channel model are considered. For the first model, we propose a low-complexity iterative algorithm to solve the problem, while a globally optimal solution is derived for the case when the signal-to-noise ratio (SNR) is extremely high. For the second model, we also propose a low-complexity iterative algorithm to solve the problem. Simulation results confirm the performance advantages of our proposed algorithms.

preprint2020arXiv

Multi-cell Edge Coverage Enhancement Using Mobile UAV-Relay

Unmanned aerial vehicle (UAV)-assisted communication is a promising technology in future wireless communication networks. UAVs can not only help offload data traffic from ground base stations (GBSs), but also improve the quality of service of cell-edge users (CEUs). In this paper, we consider the enhancement of cell-edge communications through a mobile relay, i.e., UAV, in multi-cell networks. During each transmission period, GBSs first send data to the UAV, and then the UAV forwards its received data to CEUs according to a certain association strategy. In order to maximize the sum rate of all CEUs, we jointly optimize the UAV mobility management, including trajectory, velocity, and acceleration, and association strategy of CEUs to the UAV, subject to minimum rate requirements of CEUs, mobility constraints of the UAV and causal buffer constraints in practice. To address the mixed-integer nonconvex problem, we transform it into two convex subproblems by applying tight bounds and relaxations. An iterative algorithm is proposed to solve the two subproblems in an alternating manner. Numerical results show that the proposed algorithm achieves higher rates of CEUs as compared with existing benchmark schemes.

preprint2020arXiv

Multi-hop Reading Comprehension across Documents with Path-based Graph Convolutional Network

Multi-hop reading comprehension across multiple documents has attracted much attention recently. In this paper, we propose a novel approach to tackle this multi-hop reading comprehension problem. Inspired by human reasoning processing, we construct a path-based reasoning graph from supporting documents. This graph combines the ideas of the graph-based and path-based approaches, so it is better suited for multi-hop reasoning. Meanwhile, we propose Gated-RGCN to accumulate evidence on the path-based reasoning graph, which contains a new question-aware gating mechanism to regulate the usefulness of information propagating across documents and add question information during reasoning. We evaluate our approach on the WikiHop dataset, and our approach achieves state-of-the-art accuracy against previously published approaches. In particular, our ensemble model surpasses human performance by 4.2%.

preprint2020arXiv

Multicell MIMO Communications Relying on Intelligent Reflecting Surface

Intelligent reflecting surfaces (IRSs) constitute a disruptive wireless communication technique capable of creating a controllable propagation environment. In this paper, we propose to invoke an IRS at the cell boundary of multiple cells to assist the downlink transmission to cell-edge users, whilst mitigating the inter-cell interference, which is a crucial issue in multicell communication systems. We aim for maximizing the weighted sum rate (WSR) of all users through jointly optimizing the active precoding matrices at the base stations (BSs) and the phase shifts at the IRS subject to each BS's power constraint and unit modulus constraint. Both the BSs and the users are equipped with multiple antennas, which enhances the spectral efficiency by exploiting the spatial multiplexing gain. Due to the non-convexity of the problem, we first reformulate it into an equivalent one, which is solved by using the block coordinate descent (BCD) algorithm, where the precoding matrices and phase shifts are alternately optimized. The optimal precoding matrices can be obtained in closed form, when fixing the phase shifts. A pair of efficient algorithms are proposed for solving the phase shift optimization problem, namely the Majorization-Minimization (MM) Algorithm and the Complex Circle Manifold (CCM) Method. Both algorithms are guaranteed to converge to at least locally optimal solutions. We also extend the proposed algorithms to the more general multiple-IRS and network MIMO scenarios. Finally, our simulation results confirm the advantages of introducing IRSs in enhancing the cell-edge user performance.

preprint2020arXiv

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal. In the natural world, intelligent organisms learn from internal drives, bypassing the need for external signals, which is beneficial for a wide range of tasks. Motivated by this observation, we propose to formulate an intrinsic objective as the mutual information between the goal states and the controllable states. This objective encourages the agent to take control of its environment. Subsequently, we derive a surrogate objective of the proposed reward function, which can be optimized efficiently. Lastly, we evaluate the developed framework in different robotic manipulation and navigation tasks and demonstrate the efficacy of our approach. A video showing experimental results is available at https://youtu.be/CT4CKMWBYz0

preprint2020arXiv

Numerical Analysis of History-dependent Variational-hemivariational Inequalities

In this paper, numerical analysis is carried out for a class of history-dependent variational-hemivariational inequalities arising in contact problems. Three different numerical treatments for temporal discretization are proposed to approximate the continuous model. Fixed-point iteration algorithms are employed to implement the implicit scheme and the convergence is proved with a convergence rate independent of the time step-size and mesh grid-size. A special temporal discretization is introduced for the history-dependent operator, leading to numerical schemes for which the unique solvability and error bounds for the temporally discrete systems can be proved without any restriction on the time step-size. As for spatial approximation, the finite element method is applied and an optimal order error estimate for the linear element solutions is provided under appropriate regularity assumptions. Numerical examples are presented to illustrate the theoretical results.

preprint2020arXiv

Octopus: Privacy-Preserving Collaborative Evaluation of Loan Stacking

With the rise of online lenders, the loan stacking problem has become a significant issue in the financial industry. One of the key steps in the fight against it is the querying of the loan history of a borrower from peer lenders. This is especially important in markets without a trusted credit bureau. To protect participants' privacy and business interests, we want to hide borrower identities and lenders' data from the loan originator, while simultaneously verifying that the borrower authorizes the query. In this paper, we propose Octopus, a distributed system to execute the query while meeting all the above security requirements. Theoretically, Octopus is sound. Practically, it integrates multiple optimizations to reduce communication and computation overhead. Evaluation shows that Octopus can run on 800 geographically distributed servers and can perform a query within about 0.5 seconds on average.

preprint2020arXiv

On Uplink Performance of Multiuser Massive MIMO Relay Network With Limited RF Chains

This paper considers a multiuser massive multiple-input multiple-output uplink with the help of an analog amplify-and-forward relay. The base station is equipped with a large array of $N_d$ antennas but is supported by a far smaller number of radio-frequency chains. By first deriving new results for a cascaded phase-aligned two-hop channel, we obtain a tight bound for the ergodic rate in closed form for both perfect and quantized channel phase information. The rate is characterized as a function of a scaled equivalent signal-to-noise ratio of the two-hop channel. It implies that the source and relay powers can be respectively scaled down as $1/N_d^a$ and $1/N_d^{1-a}~ (0\!\leq\!a\!\leq\!1)$ for an asymptotically unchanged sum rate. Then, for rate maximization, the power allocation problem is solved with closed-form solutions. Simulation results verify the observations from our derived results.

preprint2020arXiv

PrivPy: Enabling Scalable and General Privacy-Preserving Machine Learning

We introduce PrivPy, a practical privacy-preserving collaborative computation framework, especially optimized for machine learning tasks. PrivPy provides an easy-to-use and highly compatible Python programming front-end which supports high-level array operations and different secure computation engines to allow for security assumptions and performance trade-offs. With PrivPy, programmers can write modern machine learning algorithms conveniently and efficiently in Python. We also design and implement a new efficient computation engine, with which people can use competing cloud providers to efficiently perform general arithmetic over real numbers. We demonstrate the usability and scalability of PrivPy using common machine learning models (e.g. logistic regression and convolutional neural networks) and real-world datasets (including a 5000-by-1-million matrix).

preprint2020arXiv

Spectral and Energy Efficiency of IRS-Assisted MISO Communication with Hardware Impairments

In this letter, we analyze the spectral and energy efficiency of an intelligent reflecting surface (IRS)-assisted multiple-input single-output (MISO) downlink system with hardware impairments. An extended error vector magnitude (EEVM) model is utilized to characterize the impact of radio-frequency (RF) impairments at the access point (AP) and phase noise is considered for the imperfect IRS. We show that the spectral efficiency is limited due to the hardware impairments even when the numbers of AP antennas and IRS elements grow infinitely large, which is in contrast with the conventional case with ideal hardware. Moreover, the performance degradation at high SNR is shown to be mainly affected by the AP hardware impairments rather than the phase noise of IRS. We further obtain the optimal transmit power in closed form for energy efficiency maximization. Simulation results are provided to verify these results.

preprint2019arXiv

Optimal Multi-View Video Transmission in Multiuser Wireless Networks by Exploiting Natural and View Synthesis-Enabled Multicast Opportunities

Multi-view videos (MVVs) provide immersive viewing experience, at the cost of traffic load increase for wireless networks. In this paper, we would like to optimize MVV transmission in a multiuser wireless network by exploiting both natural multicast opportunities and view synthesis-enabled multicast opportunities. Specifically, we first establish a mathematical model to specify view synthesis at the server and each user, and characterize its impact on multicast opportunities. This model is highly nontrivial and fundamentally enables the optimization of view synthesis-based multicast opportunities. For given video quality requirements of all users, we consider the optimization of view selection, transmission time and power allocation to minimize the average weighted sum energy consumption for view transmission and synthesis. In addition, under the energy consumption constraints at the server and each user respectively, we consider the optimization of view selection, transmission time and power allocation and video quality selection to maximize the total utility. These two optimization problems are challenging mixed discrete-continuous optimization problems. For the first problem, we propose an algorithm to obtain an optimal solution with reduced computational complexity by exploiting optimality properties. For each problem, to reduce computational complexity, we also propose a low-complexity algorithm to obtain a suboptimal solution, using Difference of Convex (DC) programming. Finally, numerical results show the advantage of the proposed solutions over existing ones, and demonstrate the importance of the optimization of view synthesis-enabled multicast opportunities in MVV transmission.

preprint2019arXiv

Optimal Multi-View Video Transmission in OFDMA Systems

In this letter, we study the transmission of a multi-view video (MVV) to multiple users in an Orthogonal Frequency Division Multiple Access (OFDMA) system. To maximally improve transmission efficiency, we exploit both natural multicast opportunities and view synthesis-enabled multicast opportunities. First, we establish a communication model for transmission of an MVV to multiple users in an OFDMA system. Then, we formulate the minimization problem of the average weighted sum energy consumption for view transmission and synthesis with respect to view selection and transmission power and subcarrier allocation. The optimization problem is a challenging mixed discrete-continuous optimization problem with huge numbers of variables and constraints. A low-complexity algorithm is proposed to obtain a suboptimal solution. Finally, numerical results further demonstrate the value of view synthesis-enabled multicast opportunities for MVV transmission in OFDMA systems.

preprint2019arXiv

Secrecy Rate Maximization for Intelligent Reflecting Surface Assisted Multi-Antenna Communications

We investigate transmission optimization for intelligent reflecting surface (IRS) assisted multi-antenna systems from the physical-layer security perspective. The design goal is to maximize the system secrecy rate subject to the source transmit power constraint and the unit modulus constraints imposed on phase shifts at the IRS. To solve this complicated non-convex problem, we develop an efficient alternating algorithm where the solutions to the transmit covariance of the source and the phase shift matrix of the IRS are achieved in closed form and semi-closed form, respectively. The convergence of the proposed algorithm is guaranteed theoretically. Simulation results validate the performance advantage of the proposed optimized design.

preprint2016arXiv

A Data-Driven Approach for Mapping Multivariate Data to Color

A wide variety of color schemes have been devised for mapping scalar data to color. Some use the data value to index a color scale. Others assign colors to different, usually blended disjoint materials, to handle areas where materials overlap. A number of methods can map low-dimensional data to color; however, these methods do not scale to higher dimensional data. Likewise, schemes that take a more artistic approach through color mixing and the like also face limits when it comes to the number of variables they can encode. We address the challenge of mapping multivariate data to color and avoid these limitations at the same time. Our approach is a data-driven method, which first gauges the similarity of the attributes and then arranges them according to the periphery of a convex 2D color space, such as HSL. The color of a multivariate data sample is then obtained via generalized barycentric coordinate (GBC) interpolation.

preprint2016arXiv

ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

We propose a novel attention based deep learning architecture for the visual question answering (VQA) task. Given an image and an image related natural language question, VQA generates the natural language answer for the question. Generating the correct answers requires the model's attention to focus on the regions corresponding to the question, because different questions inquire about the attributes of different image regions. We introduce an attention based configurable convolutional neural network (ABC-CNN) to learn such question-guided attention. ABC-CNN determines an attention map for an image-question pair by convolving the image feature map with configurable convolutional kernels derived from the question's semantics. We evaluate the ABC-CNN architecture on three benchmark VQA datasets: Toronto COCO-QA, DAQUAR, and VQA dataset. The ABC-CNN model achieves significant improvements over state-of-the-art methods on these datasets. The question-guided attention generated by ABC-CNN is also shown to reflect the regions that are highly relevant to the questions.

preprint2016arXiv

An Optimization Framework For Online Ride-sharing Markets

Taxi services and product delivery services are instrumental for our modern society. Thanks to the emergence of the sharing economy, ride-sharing services such as Uber, Didi, Lyft and Google's Waze Rider are becoming more ubiquitous and are growing into an integral part of our everyday lives. However, the efficiency of these services is severely limited by the sub-optimal and imbalanced matching between supply and demand. We need a generalized framework and corresponding efficient algorithms to address the efficient matching, and hence optimize the performance of these markets. Existing studies for taxi and delivery services are only applicable in scenarios of the one-sided market. In contrast, this work investigates a highly generalized model for the taxi and delivery services in the market economy (abbreviated as "taxi and delivery market") that can be widely used in two-sided markets. Further, we present efficient online and offline algorithms for different applications. We verify our algorithm with theoretical analysis and trace-driven simulations under realistic settings.

preprint2016arXiv

Asymptotic results for exponential functionals of Levy processes

In this work we give a complete description of the asymptotic behaviors of exponential functionals of Lévy processes and divide them into five different types according to their convergence rates. Not only are their exact convergence speeds proved, but the accurate limit constants are also given. As an application, we study the survival probabilities of continuous-state branching processes in random environment defined in He et al. (2016). Like the discrete case and branching diffusion in random environment, we classify them into five different types according to their extinction speeds.

preprint2016arXiv

Attention to Scale: Scale-aware Semantic Image Segmentation

Incorporating multi-scale features in fully convolutional neural networks (FCNs) has been a key element to achieving state-of-the-art performance on semantic image segmentation. One common way to extract multi-scale features is to feed multiple resized input images to a shared deep network and then merge the resulting features for pixelwise classification. In this work, we propose an attention mechanism that learns to softly weight the multi-scale features at each pixel location. We adapt a state-of-the-art semantic image segmentation model, which we jointly train with multi-scale input images and the attention model. The proposed attention model not only outperforms average- and max-pooling, but allows us to diagnostically visualize the importance of features at different positions and scales. Moreover, we show that adding extra supervision to the output at each scale is essential to achieving excellent performance when merging multi-scale features. We demonstrate the effectiveness of our model with extensive experiments on three challenging datasets, including PASCAL-Person-Part, PASCAL VOC 2012 and a subset of MS-COCO 2014.
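
The soft weighting of multi-scale features at each pixel described in this abstract can be sketched with a per-pixel softmax over scales. This is a simplified illustration, not the paper's network: the function names are hypothetical, features are plain nested lists with one scalar score per pixel, and the attention scores are supplied by hand rather than predicted by a learned attention model.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def merge_scales(features, attention_scores):
    """Merge per-scale feature maps with per-pixel soft attention.

    features[s][i]: feature value at scale s and pixel i.
    attention_scores[s][i]: unnormalized attention score at scale s, pixel i.
    """
    num_scales = len(features)
    num_pixels = len(features[0])
    merged = []
    for i in range(num_pixels):
        weights = softmax([attention_scores[s][i] for s in range(num_scales)])
        merged.append(sum(weights[s] * features[s][i] for s in range(num_scales)))
    return merged


features = [[1.0, 4.0], [3.0, 0.0]]   # two scales, two pixels
uniform = [[0.0, 0.0], [0.0, 0.0]]    # equal scores at every pixel
peaked = [[50.0, 0.0], [0.0, 50.0]]   # pixel 0 favors scale 0, pixel 1 favors scale 1

print(merge_scales(features, uniform))
print(merge_scales(features, peaked))
```

With equal scores the merge reduces to average pooling over scales, and with strongly peaked scores it approaches selecting a single scale per pixel, so both fixed pooling schemes appear as special cases of the learned weighting.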

preprint2016arXiv

Automatically Building Face Datasets of New Domains from Weakly Labeled Data with Pretrained Models

Training data are critical in face recognition systems. However, labeling large-scale face data for a particular domain is very tedious. In this paper, we propose a method to automatically and incrementally construct datasets from massive weakly labeled data of the target domain, which are readily available on the Internet, with the help of a pretrained face model. More specifically, given a large-scale weakly labeled dataset in which each face image is associated with a label, i.e. the name of an identity, we create a graph for each identity with edges linking matched faces verified by the existing model under a tight threshold. Then we use the maximal subgraph as the cleaned data for that identity. With the cleaned dataset, we update the existing face model and use the new model to filter the original dataset to get a larger cleaned dataset. We collect a large weakly labeled dataset containing 530,560 Asian face images of 7,962 identities from the Internet, which will be published for the study of face recognition. By running the filtering process, we obtain a cleaned dataset (99.7+% purity) of size 223,767 (recall 70.9%). On our testing dataset of Asian faces, the model trained on the cleaned dataset achieves a recognition rate of 93.1%, which clearly outperforms the model trained on the public dataset CASIA, whose recognition rate is 85.9%.
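
The graph-based cleaning step described in this abstract can be sketched as follows. This is an illustration under stated assumptions: the abstract's "maximal subgraph" is interpreted here as the largest connected component, the similarity matrix stands in for scores from a pretrained face model, and the function name and threshold value are hypothetical.

```python
from collections import deque


def largest_component(similarity, threshold):
    """Return indices of the largest connected group of matched images.

    similarity[i][j]: match score between images i and j of one identity.
    An edge links i and j when the score reaches the threshold.
    """
    n = len(similarity)
    neighbors = {
        i: [j for j in range(n) if j != i and similarity[i][j] >= threshold]
        for i in range(n)
    }
    seen = set()
    best = []
    for start in range(n):
        if start in seen:
            continue
        seen.add(start)
        queue = deque([start])
        component = []
        while queue:  # breadth-first search over matched faces
            node = queue.popleft()
            component.append(node)
            for nxt in neighbors[node]:
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append(nxt)
        if len(component) > len(best):
            best = component
    return sorted(best)


# Four images labeled with one name; image 3 is a mislabeled outlier.
scores = [
    [1.0, 0.9, 0.8, 0.1],
    [0.9, 1.0, 0.2, 0.1],
    [0.8, 0.2, 1.0, 0.3],
    [0.1, 0.1, 0.3, 1.0],
]
print(largest_component(scores, threshold=0.7))
```

Images 1 and 2 do not match each other directly but are both linked through image 0, so all three are kept while the outlier is dropped.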

preprint2016arXiv

CFO: Conditional Focused Neural Question Answering with Large-scale Knowledge Bases

How can we enable computers to automatically answer questions like "Who created the character Harry Potter"? Carefully built knowledge bases provide rich sources of facts. However, it remains a challenge to answer factoid questions raised in natural language due to numerous expressions of one question. In particular, we focus on the most common questions --- ones that can be answered with a single fact in the knowledge base. We propose CFO, a Conditional Focused neural-network-based approach to answering factoid questions with knowledge bases. Our approach first zooms in on a question to find more probable candidate subject mentions, and infers the final answers with a unified conditional probabilistic framework. Powered by deep recurrent neural networks and neural embeddings, our proposed CFO achieves an accuracy of 75.7% on a dataset of 108k questions - the largest public one to date. It outperforms the current state of the art by an absolute margin of 11.8%.

preprint2016arXiv

CNN-RNN: A Unified Framework for Multi-label Image Classification

While deep convolutional neural networks (CNNs) have shown great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image. Traditional approaches to multi-label image classification learn independent classifiers for each category and employ ranking or thresholding on the classification results. These techniques, although working well, fail to explicitly exploit the label dependencies in an image. In this paper, we utilize recurrent neural networks (RNNs) to address this problem. Combined with CNNs, the proposed CNN-RNN framework learns a joint image-label embedding to characterize the semantic label dependency as well as the image-label relevance, and it can be trained end-to-end from scratch to integrate both kinds of information in a unified framework. Experimental results on public benchmark datasets demonstrate that the proposed architecture achieves better performance than the state-of-the-art multi-label classification model.

preprint2016arXiv

Continuous-state branching processes in Levy random environments

A general continuous-state branching process in random environment (CBRE-process) is defined as the strong solution of a stochastic integral equation. The environment is determined by a Lévy process with no jump less than $-1$. We give characterizations of the quenched and annealed transition semigroups of the process in terms of a backward stochastic integral equation driven by another Lévy process determined by the environment. The process hits zero with strictly positive probability if and only if its branching mechanism satisfies Grey's condition. In that case, a characterization of the extinction probability is given using a random differential equation with singular terminal condition. The strong Feller property of the CBRE-process is established by a coupling method. We also prove a necessary and sufficient condition for the ergodicity of the subcritical CBRE-process with immigration.

preprint2016arXiv

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

While question answering (QA) with neural networks, i.e. neural QA, has achieved promising results in recent years, the lack of a large-scale real-world QA dataset is still a challenge for developing and evaluating neural QA systems. To alleviate this problem, we propose a large-scale human-annotated real-world QA dataset, WebQA, with more than 42k questions and 556k evidences. As existing neural QA methods treat QA either as a sequence generation or a classification/ranking problem, they face challenges of expensive softmax computation, unseen answer handling, or a separate candidate answer generation component. In this work, we cast neural QA as a sequence labeling problem and propose an end-to-end sequence labeling model, which overcomes all the above challenges. Experimental results on WebQA show that our model outperforms the baselines significantly with an F1 score of 74.69% with word-based input, and the performance drops only 3.72 F1 points with more challenging character-based input.
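
Casting QA as sequence labeling means the answer is read directly off per-token tags on the evidence text, with no answer vocabulary and no separate candidate generator. The sketch below shows only that decoding step. It assumes a B/I/O tagging scheme and hand-written tags, both of which are illustrative choices here rather than details confirmed by the abstract, and the function name is hypothetical.

```python
def extract_answers(tokens, tags):
    """Collect answer phrases from per-token tags.

    tags use an assumed scheme: "B" begins an answer, "I" continues it,
    and "O" marks tokens outside any answer.
    """
    answers = []
    current = []
    for token, tag in zip(tokens, tags):
        if tag == "B":
            if current:
                answers.append(" ".join(current))
            current = [token]
        elif tag == "I" and current:
            current.append(token)
        else:
            if current:
                answers.append(" ".join(current))
            current = []
    if current:
        answers.append(" ".join(current))
    return answers


evidence = "Harry Potter was created by J. K. Rowling".split()
tags = ["O", "O", "O", "O", "O", "B", "I", "I"]
print(extract_answers(evidence, tags))
```

Because the answer is a span of the evidence itself, an answer never seen during training can still be produced, which is one of the challenges the abstract says this formulation avoids.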

preprint2016arXiv

Debugging OpenStack Problems Using a State Graph Approach

It is hard to operate and debug systems like OpenStack that integrate many independently developed modules with multiple levels of abstractions. A major challenge is to navigate through the complex dependencies and relationships of the states in different modules or subsystems, to ensure the correctness and consistency of these states. We present a system that captures the runtime states and events from the entire OpenStack-Ceph stack, and automatically organizes these data into a graph that we call system operation state graph (SOSG). With SOSG, we can use intuitive graph traversal techniques to solve problems like reasoning about the state of a virtual machine. Also, using graph-based anomaly detection, we can automatically discover hidden problems in OpenStack. We have a scalable implementation of SOSG, and evaluate the approach on a 125-node production OpenStack cluster, finding a number of interesting problems.

preprint2016arXiv

Deep Joint Face Hallucination and Recognition

Deep models have achieved impressive performance for face hallucination tasks. However, we observe that directly feeding the hallucinated facial images into recognition models can even degrade the recognition performance despite the much better visualization quality. In this paper, we address this problem by jointly learning a deep model for two tasks, i.e. face hallucination and recognition. In particular, we design an end-to-end deep convolution network with a hallucination sub-network cascaded with a recognition sub-network. The recognition sub-network is responsible for producing discriminative feature representations using the hallucinated images generated by the hallucination sub-network as inputs. During training, we feed LR facial images into the network and optimize the parameters by minimizing two loss items, i.e. 1) face hallucination loss measured by the pixel-wise difference between the ground truth HR images and network-generated images; and 2) verification loss which is measured by the classification error and intra-class distance. We extensively evaluate our method on LFW and YTF datasets. The experimental results show that our method can achieve a recognition accuracy of 97.95% on the 4x down-sampled LFW testing set, outperforming the 96.35% accuracy of a conventional face recognition model. On the more challenging YTF dataset, we achieve a recognition accuracy of 90.65%, a margin over the 89.45% recognition accuracy obtained by a conventional face recognition model on the 4x down-sampled version.

preprint2016arXiv

Deep Recurrent Models with Fast-Forward Connections for Neural Machine Translation

Neural machine translation (NMT) aims at solving machine translation (MT) problems using neural networks and has exhibited promising results in recent years. However, most of the existing NMT models are shallow and there is still a performance gap between a single NMT model and the best conventional MT system. In this work, we introduce a new type of linear connections, named fast-forward connections, based on deep Long Short-Term Memory (LSTM) networks, and an interleaved bi-directional architecture for stacking the LSTM layers. Fast-forward connections play an essential role in propagating the gradients and building a deep topology of depth 16. On the WMT'14 English-to-French task, we achieve BLEU=37.7 with a single attention model, which outperforms the corresponding single shallow model by 6.2 BLEU points. This is the first time that a single NMT model achieves state-of-the-art performance and outperforms the best conventional model by 0.7 BLEU points. We can still achieve BLEU=36.3 even without using an attention mechanism. After special handling of unknown words and model ensembling, we obtain the best score reported to date on this task with BLEU=40.4. Our models are also validated on the more difficult WMT'14 English-to-German task.

preprint2016arXiv

Multiband Effects and the Bose-Hubbard Model in One-Dimensional Lattices

We study phase diagrams of one-dimensional bosons with contact interactions in the presence of a lattice. We use the worm algorithm in continuous space and focus on the incommensurate superfluid Mott-insulator transition. Our results are compared to those from the one-band Bose-Hubbard model. When Wannier states are used to determine the Bose-Hubbard model parameters, the comparison unveils an apparent breakdown of the one-band description for strong interactions, even for the Mott-insulating state with an average of one particle per site ($n=1$) in deep lattices. We introduce an inverse confined scattering analysis to obtain the ratio $U/J$, with which the Bose-Hubbard model provides correct results for strong interactions, deep lattices, and $n=1$.

preprint2016arXiv

Multipair Massive MIMO Relaying with Pilot-Data Transmission Overlay

We propose a pilot-data transmission overlay scheme for multipair massive multiple-input multiple-output (MIMO) relaying systems employing either half- or full-duplex (HD or FD) communications at the relay station (RS). In the proposed scheme, pilots are transmitted in partial overlap with data to decrease the channel estimation overhead. The RS can detect the source data with minimal destination pilot interference by exploiting the asymptotic orthogonality of massive MIMO channels. Then pilot-data interference can be effectively suppressed with assistance of the detected source data in the destination channel estimation. Due to the transmission overlay, the effective data period is extended, hence improving system throughput. Both theoretical and simulation results confirm that the proposed pilot-data overlay scheme outperforms the conventional separate pilot-data design in the limited coherence time interval scenario. Moreover, asymptotic analyses at high and low SNR regions demonstrate the superiority of the proposed scheme regardless of the coherence interval length. Because of simultaneous transmission, the proper allocation of source data transmission and relay data forwarding power can further improve the system performance. Hence a power allocation problem is formulated and a successive convex approximation approach is proposed to solve the non-convex optimization problem with the FD pilot-data transmission overlay.

preprint2016arXiv

Nonparametric Estimation for Jump-Diffusion CIR Model

We study the nonparametric estimation for the intensity of Poisson random measure in jump-diffusion CIR model based on the low frequency observations. This is given in terms of the minimization of norms on a nonempty, closed and convex subset of some special Hilbert space. We establish the measurability of the estimator and derive its consistency and asymptotic risk bound.

preprint2016arXiv

Secure Massive MIMO Systems with Limited RF Chains

In future practical deployments of massive multi-input multi-output (MIMO) systems, the number of radio frequency (RF) chains at the base stations (BSs) may be much smaller than the number of BS antennas to reduce the overall expenditure. In this paper, we propose a novel design framework for joint data and artificial noise (AN) precoding in a multiuser massive MIMO system with a limited number of RF chains, which improves the wireless security performance. With imperfect channel state information (CSI), we analytically derive an achievable lower bound on the ergodic secrecy rate of any mobile terminal (MT), for both analog and hybrid precoding schemes. The closed-form lower bound is used to determine the optimal power splitting between data and AN that maximizes the secrecy rate through a simple one-dimensional search. Analytical and numerical results together reveal that the proposed hybrid precoder, although it suffers from a reduced secrecy rate compared with the theoretical full-dimensional precoder, is free of the high computational complexity of large-scale matrix inversion and null-space calculations, and largely reduces the hardware cost.

preprint2016arXiv

Semi-Supervised Learning for Neural Machine Translation

While end-to-end neural machine translation (NMT) has made remarkable progress recently, NMT systems only rely on parallel corpora for parameter estimation. Since parallel corpora are usually limited in quantity, quality, and coverage, especially for low-resource languages, it is appealing to exploit monolingual corpora to improve NMT. We propose a semi-supervised approach for training NMT models on the concatenation of labeled (parallel corpora) and unlabeled (monolingual corpora) data. The central idea is to reconstruct the monolingual corpora using an autoencoder, in which the source-to-target and target-to-source translation models serve as the encoder and decoder, respectively. Our approach can not only exploit the monolingual corpora of the target language, but also of the source language. Experiments on the Chinese-English dataset show that our approach achieves significant improvements over state-of-the-art SMT and NMT systems.

preprint2016arXiv

Study of Magnetic Hysteresis Effects in a Storage Ring Using Precision Tune Measurement

With advances in accelerator science and technology in recent decades, the accelerator community has focused on the development of next-generation light sources, for example the diffraction-limited storage rings (DLSRs), which require precision control of the electron beam energy and betatron tunes. This work is aimed at understanding magnet hysteresis effects on the electron beam energy and lattice focusing in circular accelerators, and developing new methods to gain better control of these effects. In this paper, we report our recent experimental study of the magnetic hysteresis effects and their impacts on the Duke storage ring lattice using the transverse feedback based precision tune measurement system. The major magnet hysteresis effects associated with magnet normalization and lattice ramping are carefully studied to determine an effective procedure for lattice preparation while maintaining a high degree of reproducibility of lattice focusing. The local hysteresis effects are also studied by measuring the betatron tune shifts resulting from adjusting the setting of a quadrupole. A new technique has been developed to precisely recover the focusing strength of the quadrupole by returning it to a proper setting to overcome the local hysteresis effect.

preprint2016arXiv

Universal scaling of density and momentum distributions in Lieb-Liniger gases

We present an exact numerical study of the scaling of density and momentum distribution functions of harmonically trapped one-dimensional bosons with repulsive contact interactions at zero and finite temperatures. We use path integral quantum Monte Carlo with worm updates in our calculations at finite interaction strengths, and the Bose-Fermi mapping in the Tonks-Girardeau regime. We discuss the homogeneous case and, within the local density approximation, use it to motivate the scaling in the presence of a harmonic trap. For the momentum distribution function, we pay special attention to the high momentum tails and their $k^{-4}$ asymptotic behavior.

preprint2016arXiv

Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks

We present an approach that exploits hierarchical Recurrent Neural Networks (RNNs) to tackle the video captioning problem, i.e., generating one or multiple sentences to describe a realistic video. Our hierarchical framework contains a sentence generator and a paragraph generator. The sentence generator produces one simple short sentence that describes a specific short video interval. It exploits both temporal- and spatial-attention mechanisms to selectively focus on visual elements during generation. The paragraph generator captures the inter-sentence dependency by taking as input the sentential embedding produced by the sentence generator, combining it with the paragraph history, and outputting the new initial state for the sentence generator. We evaluate our approach on two large-scale benchmark datasets: YouTubeClips and TACoS-MultiLevel. The experiments demonstrate that our approach significantly outperforms the current state-of-the-art methods with BLEU@4 scores 0.499 and 0.305 respectively.

preprint2015arXiv

Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering

In this paper, we present the mQA model, which is able to answer questions about the content of an image. The answer can be a sentence, a phrase or a single word. Our model contains four components: a Long Short-Term Memory (LSTM) to extract the question representation, a Convolutional Neural Network (CNN) to extract the visual representation, an LSTM for storing the linguistic context in an answer, and a fusing component to combine the information from the first three components and generate the answer. We construct a Freestyle Multilingual Image Question Answering (FM-IQA) dataset to train and evaluate our mQA model. It contains over 150,000 images and 310,000 freestyle Chinese question-answer pairs and their English translations. The quality of the generated answers of our mQA model on this dataset is evaluated by human judges through a Turing Test. Specifically, we mix the answers provided by humans and our model. The human judges need to distinguish our model from the human. They will also provide a score (i.e. 0, 1, 2, the larger the better) indicating the quality of the answer. We propose strategies to monitor the quality of this evaluation process. The experiments show that in 64.7% of cases, the human judges cannot distinguish our model from humans. The average score is 1.454 (1.918 for human). The details of this work, including the FM-IQA dataset, can be found on the project page: http://idl.baidu.com/FM-IQA.html

preprint2015arXiv

Bidirectional LSTM-CRF Models for Sequence Tagging

In this paper, we propose a variety of Long Short-Term Memory (LSTM) based models for sequence tagging. These models include LSTM networks, bidirectional LSTM (BI-LSTM) networks, LSTM with a Conditional Random Field (CRF) layer (LSTM-CRF) and bidirectional LSTM with a CRF layer (BI-LSTM-CRF). Our work is the first to apply a bidirectional LSTM CRF (denoted as BI-LSTM-CRF) model to NLP benchmark sequence tagging data sets. We show that the BI-LSTM-CRF model can efficiently use both past and future input features thanks to a bidirectional LSTM component. It can also use sentence level tag information thanks to a CRF layer. The BI-LSTM-CRF model can produce state of the art (or close to) accuracy on POS, chunking and NER data sets. In addition, it is robust and has less dependence on word embedding as compared to previous observations.
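To make the role of the CRF layer concrete, here is a minimal sketch of Viterbi decoding, the standard way to pick the best tag sequence once per-token scores and tag-transition scores are available. It is an illustration under assumptions rather than the authors' implementation: `emissions` stands in for the per-token scores a BI-LSTM would produce, and `transitions` for the sentence-level tag information a CRF layer would learn.

```python
import numpy as np

def viterbi_decode(emissions, transitions):
    """Return the highest-scoring tag sequence as a list of tag indices.

    emissions:   (T, K) array of per-token tag scores (assumed BI-LSTM outputs).
    transitions: (K, K) array, transitions[i, j] = score of moving from tag i to tag j.
    """
    T, K = emissions.shape
    score = emissions[0].copy()
    backptr = np.zeros((T, K), dtype=int)
    for t in range(1, T):
        # cand[i, j]: best score ending in tag i at step t-1, then moving to tag j.
        cand = score[:, None] + transitions
        backptr[t] = cand.argmax(axis=0)
        score = cand.max(axis=0) + emissions[t]
    # Follow the back-pointers from the best final tag.
    best = [int(score.argmax())]
    for t in range(T - 1, 0, -1):
        best.append(int(backptr[t, best[-1]]))
    return best[::-1]
```

With all-zero transitions the decoder reduces to a per-token argmax; a strongly negative transition score can override locally attractive tags, which is the sense in which a CRF layer uses sentence-level tag information.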

preprint2015arXiv

Concept for a Future Super Proton-Proton Collider

Following the discovery of the Higgs boson at LHC, new large colliders are being studied by the international high-energy community to explore Higgs physics in detail and new physics beyond the Standard Model. In China, a two-stage circular collider project CEPC-SPPC is proposed, with the first stage CEPC (Circular Electron Positron Collider, a so-called Higgs factory) focused on Higgs physics, and the second stage SPPC (Super Proton-Proton Collider) focused on new physics beyond the Standard Model. This paper discusses this second stage.

preprint2015arXiv

Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. Image captions are generated by sampling from this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of our model is validated on four benchmark datasets (IAPR TC-12, Flickr 8K, Flickr 30K and MS COCO). Our model outperforms the state-of-the-art methods. In addition, we apply the m-RNN model to retrieval tasks for retrieving images or sentences, and achieve significant performance improvement over the state-of-the-art methods which directly optimize the ranking objective function for retrieval. The project page of this work is: www.stat.ucla.edu/~junhua.mao/m-RNN.html .

preprint2015arXiv

Entropy relations and the application of black holes with cosmological constant and Gauss-Bonnet term

Based on the entropy relations, we derive thermodynamic bounds for the entropy and area of the horizons of the Schwarzschild-dS black hole, including the event horizon, Cauchy horizon and negative horizon (i.e. the horizon with negative value), which are all geometrical bounds made up of the cosmological radius. Considering the first derivatives of the entropy relations together, we get the first law of thermodynamics for all horizons. We also obtain the Smarr relation of the horizons by using a scaling argument. For the thermodynamics of all horizons, the cosmological constant is treated as a thermodynamical variable. In particular, the thermodynamics of the negative horizon is well defined on the $r<0$ side of spacetime. This formula seems to work well for three-horizon black holes. We also generalize the discussion to the thermodynamics of the event horizon and Cauchy horizon of Gauss-Bonnet charged flat black holes, where the Gauss-Bonnet coupling constant is also considered as a thermodynamical variable. These give further clues on the crucial role that the entropy relations of multi-horizons play in black hole thermodynamics and in understanding the entropy at the microscopic level.

preprint2015arXiv

First commissioning of the HLS-II storage ring

To meet the increasing requirements of synchrotron radiation users, the upgrade project to enhance the performance of Hefei Light Source (HLS), named HLS-II, was launched in 2010, and in 2014 the first commissioning of HLS-II was successfully completed. After the commissioning, the main design goals for the HLS-II storage ring have been achieved, with natural emittance of the electron beam lower than 40 nm-rad at 800 MeV, five insertion devices installed in straight sections and root mean square (rms) jitter of the closed orbit smaller than 4 μm, placing HLS-II at a higher level among the same class of machines in the world. This paper reports on the results of the commissioning of the HLS-II storage ring, which includes linear optics correction, compensation of insertion device effects and closed orbit feedback.

preprint2015arXiv

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images

In this paper, we address the task of learning novel visual concepts, and their interactions with other concepts, from a few images with sentence descriptions. Using linguistic context and visual features, our method is able to efficiently hypothesize the semantic meaning of new words and add them to its word dictionary so that they can be used to describe images which contain these novel concepts. Our method has an image captioning module based on m-RNN with several improvements. In particular, we propose a transposed weight sharing scheme, which not only improves performance on image captioning, but also makes the model more suitable for the novel concept learning task. We propose methods to prevent overfitting the new concepts. In addition, three novel concept datasets are constructed for this new task. In the experiments, we show that our method effectively learns novel visual concepts from a few examples without disturbing the previously learned concepts. The project page is http://www.stat.ucla.edu/~junhua.mao/projects/child_learning.html

preprint2015arXiv

Performance of a Free Space Optical Relay-Assisted Hybrid RF/FSO System in Generalized M-Distributed Channels

This paper investigates the average symbol error rate (ASER) performance of a dual-hop hybrid relaying system relying on both radio frequency (RF) and free space optical (FSO) links. Specifically, the RF link is used for supporting mobile communication, while the FSO link is adopted as the backhaul of the cellular infrastructure. Considering non-line-of-sight (NLoS) RF transmissions and a generalized atmospheric turbulence (AT) channel, the associated statistical features constituted of both the exact and the asymptotic moment generating functions (MGF) are derived in closed form. They are then used for calculating the ASER of M-ary phase shift keying (PSK), differentially encoded non-coherent PSK (DPSK) and non-coherent frequency-shift keying (FSK). A range of additional asymptotic expressions are also derived for all the modulation schemes under high signal-to-noise ratios (SNR). It is observed from the asymptotic analysis that the ASERs of all the modulation schemes are dominated by the average SNR of the RF link in the hybrid relaying system using a fixed relay gain, while in the relaying system using a dynamic channel dependent relay gain, the ASERs of all the modulation schemes depend both on the average SNR and on the AT condition of the FSO path. We also find that the fixed-gain relaying strategy achieves twice the diversity order of the channel-dependent relaying strategy albeit at the cost of requiring a high power amplifier (PA) dynamic range at the relay node. Furthermore, by comparing the asymptotic ASERs, we calculate the SNR differences between the different modulation schemes in both the fixed-gain and the channel-dependent relaying system. Finally, simulation results are presented for confirming the accuracy of our expressions and observations.

preprint2015arXiv

Solution space structure of random constraint satisfaction problems with growing domains

In this paper we study the solution space structure of model RB, a standard prototype of Constraint Satisfaction Problems (CSPs) with growing domains. Using the rigorous first and second moment methods, we show that in the solvable phase close to the satisfiability transition, solutions are clustered into an exponential number of well-separated clusters, with each cluster containing a sub-exponential number of solutions. As a consequence, the system has a clustering (dynamical) transition but no condensation transition. This picture of the phase diagram is different from other classic random CSPs with fixed domain size, such as random K-Satisfiability (K-SAT) and graph coloring problems, where a condensation transition exists and is distinct from the satisfiability transition. Our result verifies the non-rigorous results obtained using the cavity method from spin glass theory, and sheds light on the structures of solution spaces of problems with a large number of states.

preprint2015arXiv

Thermodynamic relations for entropy and temperature of multi-horizons black holes

We present some entropy and temperature relations of multi-horizons, even including the "virtual" horizon. These relations are related to the product, division and sum of the entropy and temperature of multi-horizons. We obtain additional thermodynamic relations for both static and rotating black holes in three and four dimensional (A)dS spacetime. In particular, a new dimensionless, charge-independent, $T_+S_+=T_-S_-$-like relation is presented. This relation does not depend on the mass, electric charge, angular momentum or cosmological constant, as it is always a constant. These relations lead us to some interesting thermodynamic bounds on entropy and temperature, including the Penrose inequality, which is the first geometrical inequality of black holes. Besides, based on these new relations, one can obtain the first law of thermodynamics and the Smarr relation for all horizons of a black hole.

preprint2015arXiv

Totally Distributed Energy-Efficient Transmission in MIMO Interference Channels

In this paper, we consider the problem of maximizing the energy efficiency (EE) for multi-input multi-output (MIMO) interference channels, subject to the per-link power constraint. To avoid extensive information exchange among all links, the optimization problem is formulated as a noncooperative game, where each link maximizes its own EE. We show that this game always admits a Nash equilibrium (NE) and the sufficient condition for the uniqueness of the NE is derived for the case of arbitrary channel matrices, which can be checked in practice. To reach the NE of this game, we develop a totally distributed EE algorithm, in which each link updates its own transmit covariance matrix in a completely distributed and asynchronous way: Some players may update their solutions more frequently than others or even use the outdated interference information. The sufficient conditions that guarantee the global convergence of the proposed algorithm to the NE of the game have been given as well. We also study the impact of the circuit power consumption on the sum-EE performance of the proposed algorithm in the case when the links are separated sufficiently far away. Moreover, the tradeoff between the sum-EE and the sum-spectral efficiency (SE) is investigated with the proposed algorithm under two special cases: 1) low transmit power constraint regime; 2) high transmit power constraint regime. Finally, extensive simulations are conducted to evaluate the impact of various system parameters on the system performance.

preprint2014arXiv

$P-V$ criticality of AdS black hole in the Einstein-Maxwell-power-Yang-Mills gravity

We study the $P-V$ critical behavior of N-dimensional AdS black holes in Einstein-Maxwell-power-Yang-Mills gravity. Our results show the existence of Van der Waals-like small-large black hole phase transitions when taking some special values of the charges of the Maxwell and Yang-Mills (YM) fields. Calculating further the critical exponents of the black holes at the critical point, we find that they are the same as those in the Van der Waals liquid-gas system.

preprint2014arXiv

A Note on Entropy Relations of Black Hole Horizons

We focus on the entropy relations of black holes in three, four and higher dimensions. These entropy relations include entropy product, "part" entropy product and entropy sum. We also discuss their differences and similarities, in order to make a further study on understanding the origin of black hole entropy at the microscopic level.

preprint2014arXiv

Critical phenomena of static charged AdS black holes in conformal gravity

The extended thermodynamics of static charged AdS black holes in conformal gravity is analyzed. The $P-V$ criticality of these black holes has some unusual features. There exists a single critical point with critical temperature $T_c$ and critical pressure $P_c$. At fixed $T>T_c$ (or at fixed $P>P_c$), there are two zeroth order phase transition points but no first order phase transition points. The system favors large pressure states at constant $T$, or high temperature states at constant $P$.

preprint2014arXiv

Exact black hole formation in three dimensions

We consider three dimensional Einstein gravity non-minimally coupled to a real scalar field with a self-interacting scalar potential and present the exact black hole formation in three dimensions. Firstly we obtain an exact time-dependent spherically symmetric solution describing the gravitational collapse to a scalar black hole at infinite time, i.e. in the static limit. The solution can only be asymptotically AdS because of the No-Go theorem in three dimensions, which results from the existence of a smooth black hole horizon. Then we analyze its geometric properties and the properties of its time evolution. We also get the exact time-dependent solution in the minimal coupling model after taking a conformal transformation.

preprint2014arXiv

Explain Images with Multimodal Recurrent Neural Networks

In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel sentence descriptions to explain the content of images. It directly models the probability distribution of generating a word given previous words and the image. Image descriptions are generated by sampling from this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of our model is validated on three benchmark datasets (IAPR TC-12, Flickr 8K, and Flickr 30K). Our model outperforms the state-of-the-art generative method. In addition, the m-RNN model can be applied to retrieval tasks for retrieving images or sentences, and achieves significant performance improvement over the state-of-the-art methods which directly optimize the ranking objective function for retrieval.

preprint2014arXiv

Extended phase space thermodynamics for third order Lovelock black holes in diverse dimensions

Treating the cosmological constant as thermodynamic pressure and its conjugate as thermodynamic volume, we investigate the critical behavior of the third order Lovelock black holes in diverse dimensions. For black hole horizons with different normalized sectional curvature $k=0,\pm1$, the corresponding critical behaviors differ drastically. For $k=0$, there is no critical point in the extended thermodynamic phase space. For $k=-1$, there is a single critical point in any dimension $d\geq 7$, and for $k=+1$, there is a single critical point in $7$ dimension and two critical points in $8,9,10,11$ dimensions. We studied the corresponding phase structures in all possible cases.
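For orientation, the critical points referred to in this abstract are conventionally located as inflection points of the isotherms in the $P$-$v$ plane. The relations below are a sketch under the usual extended-phase-space convention in geometric units; the symbol $v$ for the specific volume and the exact normalization of $P$ are assumptions here and are not taken from the abstract.

$$
P = -\frac{\Lambda}{8\pi}, \qquad \left(\frac{\partial P}{\partial v}\right)_{T} = 0, \qquad \left(\frac{\partial^{2} P}{\partial v^{2}}\right)_{T} = 0 .
$$

Counting the physically admissible solutions (positive $P$, $v$ and $T$) of the last two conditions for a given equation of state $P(v,T)$ is what yields statements such as "no critical point", "a single critical point" or "two critical points".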

preprint2014arXiv

Gauss-Bonnet coupling constant as a free thermodynamical variable and the associated criticality

The thermodynamic phase space of Gauss-Bonnet (GB) AdS black holes is extended, taking the inverse of the GB coupling constant as a new thermodynamic pressure $P_{\mathrm{GB}}$. We studied the critical behavior associated with $P_{\mathrm{GB}}$ in the extended thermodynamic phase space at fixed cosmological constant and electric charge. The result shows that when the black holes are neutral, the associated critical points can only exist in five dimensional GB-AdS black holes with spherical topology, and the corresponding critical exponents are identical to those for Van der Waals system. For charged GB-AdS black holes, it is shown that there can be only one critical point in five dimensions (for black holes with either spherical or hyperbolic topologies), which also requires the electric charge to be bounded within some appropriate range; while in $d>5$ dimensions, there can be up to two different critical points at the same electric charge, and the phase transition can occur only at temperatures which are not in between the two critical values.

preprint2014arXiv

Low-Complexity Hybrid Precoding in Massive Multiuser MIMO Systems

Massive multiple-input multiple-output (MIMO) is envisioned to offer considerable capacity improvement, but at the cost of high complexity of the hardware. In this paper, we propose a low-complexity hybrid precoding scheme to approach the performance of the traditional baseband zero-forcing (ZF) precoding (referred to as full-complexity ZF), which is considered a virtually optimal linear precoding scheme in massive MIMO systems. The proposed hybrid precoding scheme, named phased-ZF (PZF), essentially applies phase-only control at the RF domain and then performs a low-dimensional baseband ZF precoding based on the effective channel seen from baseband. Heavily quantized RF phase control up to $2$ bits of precision is also considered and shown to incur very limited degradation. The proposed scheme is simulated in both ideal Rayleigh fading channels and sparsely scattered millimeter wave (mmWave) channels, both achieving highly desirable performance.
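The two-stage structure described in this abstract can be sketched in a few lines of NumPy: a constant-modulus, phase-only RF precoder followed by zero-forcing on the low-dimensional effective channel. Several details are assumptions made for illustration and are not stated in the abstract: the RF phases are taken from the conjugate transpose of the channel, there is one RF chain per single-antenna user, and the total transmit power is normalized to one. The function name `phased_zf` is hypothetical.

```python
import numpy as np

def phased_zf(H):
    """Sketch of a phase-only RF stage followed by low-dimensional baseband ZF.

    H: (K, Nt) complex downlink channel for K single-antenna users and Nt
       base-station antennas, with K RF chains assumed.
    Returns (F_rf, W_bb): the (Nt, K) RF precoder and the (K, K) baseband precoder.
    """
    K, Nt = H.shape
    # RF stage: every entry has modulus 1/sqrt(Nt); only the phase is controlled.
    # Using the phases of H^H is an assumption made for this illustration.
    F_rf = np.exp(1j * np.angle(H.conj().T)) / np.sqrt(Nt)
    # Effective channel seen from baseband, of size K x K.
    H_eq = H @ F_rf
    # Zero-forcing on the effective channel (right pseudo-inverse).
    W_bb = H_eq.conj().T @ np.linalg.inv(H_eq @ H_eq.conj().T)
    # Normalize so that the overall precoder has unit Frobenius norm (assumed constraint).
    W_bb = W_bb / np.linalg.norm(F_rf @ W_bb, "fro")
    return F_rf, W_bb
```

The only matrix inversion is K x K rather than involving the full antenna dimension, which is where the complexity saving comes from. Quantizing the RF phases, as the abstract mentions, would amount to rounding `np.angle(...)` to a small set of levels before exponentiation.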

preprint2014arXiv

New class of rotating perfect fluid black holes in three dimensional gravity

We obtain a new class of rotating black holes for Einstein theory with perfect fluid source in (2+1) dimensions. We conclude that these black hole solutions only depend on variable angular velocity $m(r)$. Some examples of these black holes are given explicitly. In particular, the unknown static black hole in this special background is obtained. In addition, the general properties including the horizon structure, energy conditions and equation of state, mass and angular momentum are explained in detail.

preprint2014arXiv

Percolative superconductivity in La2CuO4.06 by lattice granularity patterns with scanning micro X-ray absorption near edge structure

The simplest cuprate superconductor La2CuO4+y with mobile oxygen interstitials exhibits a clear phase separation, but only recently has a bulk multiscale structural phase separation been observed by using scanning micro X-ray diffraction. Here we get further information on the structural phase separation, using local probe X-ray absorption near edge structure. The spatial distribution of superconducting units is a key parameter controlling percolative superconductivity in complex matter with dispersed superconducting units. These oxides form super-molecular architectures made of superconducting atomic monolayers intercalated by spacers. Oxygen interstitials enter into the rocksalt La2O2+y spacer layers, forming oxygen-interstitial-rich puddles and poor puddles. Their spatial distribution has been determined by using scanning La L3-edge micro X-ray absorption near edge structure. Percolating networks of oxygen rich puddles are observed in different micrometer size portions of the crystals. Moreover, the complex surface resistivity shows two jumps associated with the onset of intra-puddle and inter-puddle percolative superconductivity. The similarity of oxygen doped La2CuO4+y with the well established phase separation in iron selenide superconductors is also discussed.

preprint2014arXiv

The "universal property" of Horizon Entropy Sum of Black Holes in Four Dimensional Asymptotical (anti-)de-Sitter Spacetime Background

We present a new "universal property" of entropy, that is, the "entropy sum" relation of black holes in a four dimensional asymptotically (anti-)de-Sitter background. The sum depends only on the cosmological constant, with the necessary effect of the un-physical "virtual" horizon included, in spacetimes where only the cosmological constant, mass of the black hole, rotation parameter and Maxwell field exist. When there are additional matter fields in the spacetime, one finds that the "entropy sum" also depends on the strength of these extra matter fields. For both cases, we conclude that the "entropy sum" does not depend on the conserved charges $M$, $Q$ and $J$, while it does depend on the properties of the background spacetime. We mainly test the "entropy sum" relation for static and stationary black holes and some black holes with extra matter sources (scalar hair and higher curvature) in the asymptotically (anti-)de-Sitter spacetime background. Besides, we point out a newly found counterexample to the mass independence of the "entropy product" relation in the spacetime with extra scalar hair, while the "entropy sum" relation still holds. These results are indeed suggestive of some underlying microscopic mechanism. Moreover, the dependence of the "entropy sum" of all horizons on the cosmological constant and extra matter fields seems to reveal that the "entropy sum" is more general, as it is only related to the background field. For the case of asymptotically flat spacetime without any matter source, we give a note on the Kerr black hole case in the appendix. One finds that only a mass dependence of the "entropy sum" appears. It makes us believe that, considering the dependence of the "entropy sum", the mass background field may be regarded as the next order of the cosmological constant background field and extra matter field.

preprint2014arXiv

The Entropy Relations of Black Holes with Multihorizons in Higher Dimensions

We study the entropy relations of multi-horizon black holes in higher dimensional (A)dS spacetime with maximal symmetries, including Einstein-Maxwell gravity and $f(R)$(-Maxwell) gravity. These additional equalities in thermodynamics are expected to be useful for understanding the origin of black hole entropy at the microscopic level. Revisiting the entropy product introduced by Cvetic et al., we find that in our case it behaves unexpectedly. It is shown that the electric charge $Q$ plays an important role in this entropy product. The entropy product of charged black holes depends only on the electric charge $Q$ and is mass independent. When $Q$ vanishes in the solution, the product becomes mass dependent, even when the effect of the unphysical ``virtual'' horizons is included. In this sense, the ``universal relation'' of this entropy product is destroyed. We then introduce another kind of ``universal'' entropy relation, which depends only on the cosmological constant $Λ$ and the background topology $k$, and which does not depend on the conserved charge $Q$, nor even on the mass $M$.

preprint2014arXiv

The Entropy Sum of (A)dS Black Holes in Four and Higher Dimensions

We present the "entropy sum" relation of (A)dS charged black holes in higher dimensional Einstein-Maxwell gravity, $f(R)$ gravity, Gauss-Bonnet gravity and gauged supergravity. For their "entropy sum" with the necessary effect of the unphysical "virtual" horizon included, we reach the general result that the dependence on the cosmological constant and on the Gauss-Bonnet coupling constant holds in both four and six dimensions, while the "entropy sum" always vanishes in odd dimensions. Furthermore, the "entropy sum" of all horizons is related to the geometry of the horizons in four and six dimensions. In these four explicit cases, one also finds that the conserved charges $M$ (the mass), $Q$ (the charge from the Maxwell field or supergravity) and the parameter $a$ (the angular momentum) play no role in the "entropy sum" relations.

preprint2014arXiv

Thermodynamics of rotating black holes with scalar hair in three dimensions

Introducing a new form of scalar potential $V(ϕ)$, we derive a proper form of the rotating black hole solution in three-dimensional Einstein gravity with a nonminimally coupled scalar field and find that the first law of thermodynamics of this new rotating hairy black hole can be protected, where the scalar field parameter $B$ is constrained to relate to the black hole size. We also disclose the Hawking-Page phase transition between this rotating hairy black hole and pure thermal radiation. Moreover, we study phase transitions between this rotating hairy black hole and the rotating BTZ black hole. Matching the temperature and angular momentum, we find that the rotating BTZ black hole always has the smaller free energy and is therefore the thermodynamically preferred phase. Additionally, we evaluate the thermodynamics of the rotating black hole with minimally coupled scalar hair in three dimensions, which shows that the thermodynamical behavior of this rotating hairy black hole is very similar to that of the rotating black hole with nonminimally coupled scalar hair.

preprint2014arXiv

Three dimensional rotating hairy black holes, asymptotics and thermodynamics

A rotating hairy black hole solution is found in gravity minimally coupled to a self-interacting real scalar field in three spacetime dimensions. Then we discuss analytically the horizon structure and find an analogue of the famous Kerr bound in (2+1) dimensions because of the existence of black hole horizons. We present the asymptotic symmetries and find the same symmetry group (i.e. the conformal group) and central charge as in pure gravity. Based on the asymptotic behavior, the mass and angular momentum are presented by the Regge-Teitelboim approach. Other thermodynamic quantities are also obtained and the first law of black hole thermodynamics and Smarr relation are checked. In addition, we also investigate the local thermodynamic stability and find the existence of Hawking-Page phase transition in the rotating hairy black hole.

preprint2013arXiv

A waveguide overloaded cavity kicker for the HLS II longitudinal feedback system

In the upgrade project of Hefei Light Source (HLS II), a new digital longitudinal bunch-by-bunch feedback system will be developed to suppress the coupled bunch instabilities in the storage ring effectively. We design a new waveguide overloaded cavity longitudinal feedback kicker as the feedback actuator. The beam pipe of the kicker has a racetrack shape so as to avoid a transition part to the octagonal vacuum chamber. The central frequency and the bandwidth of the kicker have been simulated and optimized with the HFSS code to achieve the design goals. A higher shunt impedance can be obtained by using a nose cone, which reduces the feedback power requirement. Before the kicker cavity was installed in the storage ring, a variety of measurements were carried out to check its performance. All these simulation and measurement results are presented.

preprint2013arXiv

Charged black hole with a scalar hair in (2+1) dimensions

We obtain and analyze an exact solution to Einstein-Maxwell-scalar theory in $(2+1)$ dimensions, in which the scalar field couples to gravity in a non-minimal way, and it also couples to itself with the self-interacting potential solely determined by the metric ansatz. A negative cosmological constant naturally emerges as a constant term in the scalar potential. The metric is static and circularly symmetric, and contains a curvature singularity at the origin. The conditions for the metric to contain 0, 1, 2 horizons are identified, and the effects of the scalar and electric charges on the size of the black hole radius are discussed. Under proper choices of parameters, the metric degenerates into some previously known solutions in $(2+1)$-dimensional gravity.

preprint2013arXiv

Novel rotating hairy black hole in (2+1)-dimensions

We present a novel rotating hairy black hole metric in $(2+1)$ dimensions, which is an exact solution to the field equations of the Einstein-scalar-AdS theory with a non-minimal coupling. The scalar potential is determined by the metric ansatz and consistency of the field equations and cannot be prescribed arbitrarily. In the simplified, critical case, the scalar potential contains two independent constant parameters, which are respectively related to the mass and angular momentum of the black hole in a particular way. As long as the angular momentum does not vanish, the metric can have zero, one or two horizons. The case with no horizon is physically uninteresting because of the curvature singularity lying at the origin. We identify the necessary conditions for at least one horizon to be present in the solution, which impose a bound on the mass-angular momentum ratio. For some particular choices of parameters our solution degenerates into some previously known black hole solutions.

preprint2013arXiv

Parameter Estimation in Two-type Continuous-state Branching Processes with Immigration

We study the estimation of two-type continuous-state branching processes with immigration (CBI-processes). The ergodicity of the processes is proved. We also establish the strong consistency and central limit theorems of the conditional least squares estimators and the weighted conditional least squares estimators of the drift and diffusion coefficients based on low frequency observations.

preprint2012arXiv

Accelerating BTZ spacetime

An exact solution of $(2+1)$-dimensional Einstein gravity with cosmological constant is studied. The corresponding spacetime is interpreted as an accelerating BTZ spacetime. The proper acceleration, horizon structure, temperature and entropy are presented in detail. The metric being studied is very similar to the one studied by Astorino in arXiv:1101.2616, but the range of parameters is different which results in significant changes in the causal structures.

preprint2012arXiv

Asymmetric non-Gaussian effects in a tumor growth model with immunization

The dynamical evolution of a tumor growth model, under immune surveillance and subject to asymmetric non-Gaussian $α$-stable Lévy noise, is explored. The lifetime of a tumor staying in the range between the tumor-free state and the stable tumor state, and the likelihood of noise-inducing tumor extinction, are characterized by the mean exit time (also called mean residence time) and the escape probability, respectively. For various initial densities of tumor cells, the mean exit time and the escape probability are computed with different noise parameters. It is observed that unlike the Gaussian noise or symmetric non-Gaussian noise, the asymmetric non-Gaussian noise plays a constructive role in the tumor evolution in this simple model. By adjusting the noise parameters, the mean exit time can be shortened and the escape probability can be increased, simultaneously. This suggests that a tumor may be mitigated with higher probability in a shorter time, under certain external environmental stimuli.

preprint2012arXiv

Hamiltonian description of singular Lagrangian systems with spontaneously broken time translation symmetry

Shapere and Wilczek recently found some singular Lagrangian systems which spontaneously break time translation symmetry. The common feature of their models is that the energy functions are multivalued in terms of the canonical phase space variables and the symmetry breaking ground states are all located at the branching point singularities. By enlarging the phase space and making use of Dirac's theory of constrained Hamiltonian systems, we present the Hamiltonian description of some of the models discussed by Shapere and Wilczek and find that both the multivaluedness and the branching point singularities can be avoided, while the spontaneous breaking of time translation becomes more transparent. It is also shown that the breaking of time translation is always accompanied by the breaking of time reversal.

preprint2012arXiv

Landau meets Newton: time translation symmetry breaking in classical mechanics

Every classical Newtonian mechanical system can be equipped with a nonstandard Hamiltonian structure, in which the Hamiltonian is the square of the canonical Hamiltonian up to a constant shift, and the Poisson bracket is nonlinear. In such a formalism, time translation symmetry can be spontaneously broken, provided the potential function becomes negative. A nice analogy between time translation symmetry breaking and the Landau theory of second order phase transitions is established, together with several example cases illustrating time translation breaking ground states. In particular, the $Λ$CDM model of FRW cosmology is reformulated as the time translation symmetry breaking ground states.

preprint2012arXiv

Lévy Noise-Induced Stochastic Resonance in a Bistable System

The stochastic resonance phenomenon induced by non-Gaussian Lévy noise in a second-order bistable system is investigated. The signal-to-noise ratio for different parameters is computed by an efficient numerical scheme. The influences of the noise intensity, the stability index of the Lévy noise and the amplitude of the external signal on the occurrence of stochastic resonance are characterized. The results imply that a high signal amplitude not only enhances the output power spectrum of the system but also promotes stochastic resonance, and that a proper adjustment of the Lévy noise intensity in a certain range enlarges the peak value of the output power spectrum, which is significant for stochastic resonance. Moreover, with the optimal damping parameter, lowering the stability index leads to larger fluctuations of the Lévy noise and further reduces the chance of stochastic resonance.

preprint2012arXiv

MIMO Relaying Broadcast Channels with Linear Precoding and Quantized Channel State Information Feedback

Multi-antenna relaying has emerged as a promising technology to enhance the system performance in cellular networks. However, when precoding techniques are utilized to obtain multi-antenna gains, the system generally requires channel state information (CSI) at the transmitters. We consider a linear precoding scheme in a MIMO relaying broadcast channel with quantized CSI feedback from both two-hop links. With this scheme, each remote user feeds back its quantized CSI to the relay, and the relay sends back the quantized precoding information to the base station (BS). An upper bound on the rate loss due to quantized channel knowledge is first characterized. Then, in order to maintain the rate loss within a predetermined gap for growing SNRs, a strategy of scaling quantization quality of both two-hop links is proposed. It is revealed that the numbers of feedback bits of both links should scale linearly with the transmit power at the relay, while only the bit number of feedback from the relay to the BS needs to grow with the increasing transmit power at the BS. Numerical results are provided to verify the proposed strategy for feedback quality control.

preprint2012arXiv

Non-Gaussian dynamics of a tumor growth system with immunization

This paper is devoted to exploring the effects of non-Gaussian fluctuations on dynamical evolution of a tumor growth model with immunization, subject to non-Gaussian α-stable type Lévy noise. The corresponding deterministic model has two meaningful states which represent the state of tumor extinction and the state of stable tumor, respectively. To characterize the lifetime for different initial densities of tumor cells staying in the domain between these two states and the likelihood of crossing this domain, the mean exit time and the escape probability are quantified by numerically solving differential integral equations with appropriate exterior boundary conditions. The relationships between the dynamical properties and the noise parameters are examined. It is found that in the different stages of tumor, the noise parameters have different influence on the lifetime and the likelihood inducing tumor extinction. These results are relevant for determining efficient therapeutic regimes to induce the extinction of tumor cells.

preprint2012arXiv

Novel accelerating Einstein vacua and smooth inhomogeneous Riemannian manifolds

A novel class of Einstein vacua is presented, which possess a non-vanishing cosmological constant and an accelerating horizon with the topology of an $S^{D-3}$ fibration over $S^{1}$. After Euclideanization, the solution describes a conformally distorted $S^{D-1}$ fibration over $S^1$, which is smooth, compact and inhomogeneous, and can be regarded as an analogue of Don Page's gravitational instanton.

preprint2012arXiv

Water-oil drainage dynamics in oil-wet random microfluidic porous media analogs

Displacement experiments carried out in microfluidic porous media analogs show that reduced surface tension leads to a more stable displacement, opposite to the process in Hele-Shaw cells where surface tension stabilizes the displacement of a more viscous fluid by a less viscous fluid. In addition, the geometry of the porous media is observed to play an important role. Three random microfluidic porous media analogs were made to study water-oil drainage dynamics, featuring a pattern of randomly connected channels with a uniform width, a pattern with a Gaussian channel width distribution, and a pattern with large isolated pores. The microfluidic chips, fabricated using polydimethylsiloxane with glass covers, have the internal surface treated with trichlorosilane to achieve a uniform oil-wet condition. The aqueous phase displaces the oil phase, with a viscosity ratio of about 1:40 and a density ratio of 1:0.85. Videos 1-3 show water flooding processes. It is observed that both channel size distribution (Video 2) and heterogeneity in pore size (Video 3) lead to stronger fingers and reduced displacement efficiency. Video 4 shows that menisci in small channels retreat as the water front moves into a nearby large cavity, due to the disparity in the capillary force and contact angle hysteresis. Videos 5 and 6, both taken at 100X magnification in Chip 2, show the stabilizing effect of reduced interfacial tension.

preprint2011arXiv

Accelerating vacua in Gauss-Bonnet gravity

Accelerating vacua with maximally symmetric, but not necessarily spherical, sections for Einstein and Gauss-Bonnet gravities in generic dimensions are obtained. The acceleration parameter has the effect of shifting the cosmological constants in Einstein gravity, whereas in Gauss-Bonnet gravity the effective cosmological constants remain the same in the presence of acceleration as in the case without acceleration.

preprint2011arXiv

Einstein frame and Jordan frame revisited: are they mathematically equivalent?

This paper has been withdrawn by the author due to a crucial sign error in equation (5).

preprint2011arXiv

Towards Optimal One Pass Large Scale Learning with Averaged Stochastic Gradient Descent

For large scale learning problems, it is desirable to obtain the optimal model parameters by going through the data in only one pass. Polyak and Juditsky (1992) showed that asymptotically the test performance of the simple average of the parameters obtained by stochastic gradient descent (SGD) is as good as that of the parameters which minimize the empirical cost. However, to our knowledge, despite its optimal asymptotic convergence rate, averaged SGD (ASGD) has received little attention in recent research on large scale learning. One possible reason is that it may take a prohibitively large number of training samples for ASGD to reach its asymptotic region for most real problems. In this paper, we present a finite sample analysis for the method of Polyak and Juditsky (1992). Our analysis shows that it indeed usually takes a huge number of samples for ASGD to reach its asymptotic region when the learning rate is improperly chosen. More importantly, based on our analysis, we propose a simple way to properly set the learning rate so that it takes a reasonable amount of data for ASGD to reach its asymptotic region. We compare ASGD using our proposed learning rate with other well known algorithms for training large scale linear classifiers. The experiments clearly show the superiority of ASGD.
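The averaging idea described in this abstract can be sketched in a few lines. The example below is a minimal illustration of one-pass SGD with a running average of the iterates on a synthetic least-squares problem. The function name `averaged_sgd`, the squared loss, the synthetic data and the step-size schedule `lr0 / (1 + t) ** decay` are all assumptions chosen for this sketch; in particular, the schedule is a generic decaying one and not the learning rate proposed in the paper.

```python
import numpy as np


def averaged_sgd(X, y, lr0=0.05, decay=0.5):
    """Single pass of SGD on squared loss; returns (last iterate, averaged iterate)."""
    n, d = X.shape
    w = np.zeros(d)        # current SGD iterate
    w_bar = np.zeros(d)    # simple average of all iterates seen so far
    for t in range(n):
        lr = lr0 / (1.0 + t) ** decay          # hypothetical decaying step size
        grad = (X[t] @ w - y[t]) * X[t]        # gradient of 0.5 * (x.w - y)^2
        w = w - lr * grad
        w_bar += (w - w_bar) / (t + 1)         # incremental update of the mean
    return w, w_bar


# Synthetic linear regression data (illustrative only).
rng = np.random.default_rng(0)
w_true = np.array([1.0, -2.0, 0.5])
X = rng.normal(size=(5000, 3))
y = X @ w_true + 0.1 * rng.normal(size=5000)

w_last, w_avg = averaged_sgd(X, y)
print("last iterate:    ", w_last)
print("averaged iterate:", w_avg)
```

The running mean is updated incrementally so that no history of iterates has to be stored, which keeps the method a true one-pass procedure. How quickly the averaged iterate becomes competitive depends strongly on the step-size schedule, which is the issue the abstract is concerned with.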

preprint2010arXiv

Associated production of a neutral top-Higgs with a heavy-quark pair in the γγ collisions at ILC

We have studied the associated production processes of a neutral top-Higgs in the topcolor assisted technicolor model with a pair of heavy quarks in γγ collisions at the International Linear Collider (ILC). We find that the cross section for $t\bar{t}h_t$ in γγ collisions is at the level of a few fb at a c.m. energy of $\sqrt{s}=1000$ GeV, which is consistent with the results for the cross section of $t\bar{t}H$ in the standard model and the cross section of $t\bar{t}h$ in the minimal supersymmetric standard model and the little Higgs models. Since hundreds to thousands of $h_t$ per year can be produced at the ILC, the process $γγ\to t\bar{t}h_t$ is of real interest for testing the standard model and searching for signs of technicolor.

preprint2010arXiv

Five-dimensional vacuum Einstein spacetimes in C-metric like coordinates

A 5-dimensional Einstein spacetime with (non)vanishing cosmological constant is analyzed in detail. The metric is in close analogy with the 4-dimensional massless uncharged C-metric in many aspects. The coordinate system, horizons and causal structures, and relations to standard de Sitter, anti-de Sitter and Minkowski vacua are investigated. After a boost and Kaluza-Klein reduction, we get an exact solution of 4-dimensional Einstein-Maxwell-Liouville theory which reduces to a solution of Einstein-Liouville theory in the limit of zero boost velocity and to that of Einstein-Maxwell-dilaton theory in the case of zero cosmological constant.

preprint2008arXiv

Applying Bayesian Neural Networks to Event Reconstruction in Reactor Neutrino Experiments

In this paper, a toy detector has been designed to simulate central detectors in reactor neutrino experiments. The electron samples from the Monte-Carlo simulation of the toy detector have been reconstructed by the method of Bayesian neural networks (BNN) and by the standard algorithm, a maximum likelihood method (MLD), respectively. The result of the event reconstruction using BNN has been compared with the one using MLD. Compared to MLD, the uncertainties of the electron vertex are not improved, but the energy resolutions are significantly improved using BNN. The improvement is more pronounced for high energy electrons than for low energy ones.

134 published item(s)

RankQ: Offline-to-Online Reinforcement Learning via Self-Supervised Action Ranking

New research paradigms and agenda of human factors science in the intelligence era

Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance

Secure Communication for Spatially Correlated Massive MIMO with Low-Resolution DACs

Secure Communication for Spatially Correlated RIS-Aided Multiuser Massive MIMO Systems: Analysis and Optimization

A Deep Finite Difference Emulator for the Fast Simulation of Coupled Viscous Burgers' Equation

An End-to-End Transformer Model for Crowd Localization

Cooperative Reflection and Synchronization Design for Distributed Multiple-RIS Communications

Data Augmentation Empowered Neural Precoding for Multiuser MIMO with MMSE Model

Deep CSI Compression for Massive MIMO: A Self-information Model-driven Neural Network

Deployment of long distance multi-moving robots for underground pipe inspection

Distributed Neural Precoding for Hybrid mmWave MIMO Communications with Limited Feedback

Do You Need the Entropy Reward (in Practice)?

Efficient and Probabilistic Adaptive Voxel Mapping for Accurate Online LiDAR Odometry

Energy Efficient Beamforming Optimization for Integrated Sensing and Communication

Extracting a Knowledge Base of COVID-19 Events from Social Media

FAST-LIVO: Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry

Focal Inverse Distance Transform Maps for Crowd Localization

Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

Hierarchical Reinforcement Learning By Discovering Intrinsic Options

HMRNet: High and Multi-Resolution Network with Bidirectional Feature Calibration for Brain Structure Segmentation in Radiotherapy

Intelligent MIMO Detection Using Meta Learning

Learning to Optimize Resource Assignment for Task Offloading in Mobile Edge Computing

Nuclear phase retrieval spectroscopy using resonant x-ray scattering

PNM: Pixel Null Model for General Image Segmentation

Pre-train or Annotate? Domain Adaptation with a Constrained Budget

Pressure-induced mixed states caused by spin-elastic interactions during first-order spin phase transition in spin crossover compounds

RIS-Assisted Quasi-Static Broad Coverage for Wideband mmWave Massive MIMO Systems

Testing gravitational redshift based on microwave frequency links onboard China Space Station

TransCrowd: weakly-supervised crowd counting with transformers

Worst-case Design for RIS-aided Over-the-air Computation with Imperfect CSI

Analysis and Optimization for RIS-Aided Multi-Pair Communications Relying on Statistical CSI

Avoiding dynamic small obstacles with onboard sensing and computating on aerial robots

Deep Reinforcement Learning Based Dynamic Trajectory Control for UAV-assisted Mobile Edge Computing

ikd-Tree: An Incremental K-D Tree for Robotic Applications

MetaView: Few-shot Active Object Recognition

R2LIVE: A Robust, Real-time, LiDAR-Inertial-Visual tightly-coupled state Estimator and mapping

The solution space structure of planted constraint satisfaction problems with growing domains

A Bayes Factor Approach with Informative Prior for Rare Genetic Variant Analysis from Next Generation Sequencing Data

An averaging principle for fractional stochastic differential equations with Lévy noise

Analog Versus Hybrid Precoding for Multiuser Massive MIMO with Quantized CSI Feedback

AnciNet: An Efficient Deep Learning Approach for Feedback Compression of Estimated CSI in Massive MIMO Systems

Asymptotic Results for Heavy-tailed Lévy Processes and their Exponential Functionals

Attacking Optical Character Recognition (OCR) Systems with Adversarial Watermarks

Chimbuko: A Workflow-Level Scalable Performance Trace Analysis Tool

Determining geopotential difference via relativistic precise point positioning time comparison: A case study using simulated observations

Discourse Level Factors for Sentence Deletion in Text Simplification

Distributed IRS with Statistical Passive Beamforming for MISO Communications

Energy-Efficient Wireless Communications with Distributed Reconfigurable Intelligent Surfaces

Feature Statistics Guided Efficient Filter Pruning

Generalizing Natural Language Analysis through Span-relation Representations

Hybrid Transceiver Optimization for Multi-Hop Communications

Implicit Generative Modeling for Efficient Exploration

Interactive Visual Study of Multiple Attributes Learning Model of X-Ray Scattering Images

Interpreting Galaxy Deblender GAN from the Discriminator's Perspective

Joint Transmit Power and Placement Optimization for URLLC-enabled UAV Relay Systems

Multi-cell Edge Coverage Enhancement Using Mobile UAV-Relay

Multi-hop Reading Comprehension across Documents with Path-based Graph Convolutional Network

Multicell MIMO Communications Relying on Intelligent Reflecting Surface

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

Numerical Analysis of History-dependent Variational-hemivariational Inequalities

Octopus: Privacy-Preserving Collaborative Evaluation of Loan Stacking

On Uplink Performance of Multiuser Massive MIMO Relay Network With Limited RF Chains

PrivPy: Enabling Scalable and General Privacy-Preserving Machine Learning

Spectral and Energy Efficiency of IRS-Assisted MISO Communication with Hardware Impairments

Optimal Multi-View Video Transmission in Multiuser Wireless Networks by Exploiting Natural and View Synthesis-Enabled Multicast Opportunities

Optimal Multi-View Video Transmission in OFDMA Systems

Secrecy Rate Maximization for Intelligent Reflecting Surface Assisted Multi-Antenna Communications

A Data-Driven Approach for Mapping Multivariate Data to Color

ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering

An Optimization Framework For Online Ride-sharing Markets

Asymptotic results for exponential functionals of Levy processes

Attention to Scale: Scale-aware Semantic Image Segmentation

Automatically Building Face Datasets of New Domains from Weakly Labeled Data with Pretrained Models