Source author record

Rui Wang

Rui Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Information Theory math.IT Computer Vision Artificial Intelligence Computation and Language cond-mat.mes-hall cond-mat.mtrl-sci astro-ph.SR eess.SP Graphics Biomolecules eess.AS Networking and Internet Architecture cond-mat.str-el Cryptography and Security physics.space-ph Robotics Sound nucl-th cond-mat.soft cond-mat.supr-con eess.SY Neural and Evolutionary Computing physics.optics Systems and Control Distributed, Parallel, and Cluster Computing Genomics Methodology physics.chem-ph Populations and Evolution Quantitative Methods cond-mat.stat-mech eess.IV hep-ph math.OC math.SG nucl-ex quant-ph Applications astro-ph.GA cond-mat.dis-nn Databases hep-ex Information Retrieval math.FA Performance physics.acc-ph physics.atm-clus astro-ph.CO Computational Geometry Data Structures and Algorithms gr-qc hep-lat hep-th Human-Computer Interaction Logic in Computer Science math.AG math.AT math.DG math.NA math.SP math.ST Multimedia nlin.AO Numerical Analysis physics.app-ph physics.comp-ph physics.ins-det Statistics Theory

Catalog footprint

What is connected

273works

70topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Federated Cross-Client Subgraph Pattern Detection

Subgraph pattern detection aims to uncover complex interaction structures in graphs. However, state-of-the-art graph neural network (GNN)-based solutions assume centralized access to the entire graph. When graphs are instead distributed across multiple parties, client-local GNN computations diverge from those of a centralized model, resulting in a representation-equivalence gap. We formalize this as a structural observability problem, where subgraph patterns crossing partition boundaries become locally unidentifiable. To bridge this gap, we propose a per-step, layer-wise embedding exchange framework in which clients synchronize intermediate node representations at each layer of the forward pass, without exposing raw features or labels. Under an extended-subgraph assumption and shared model parameters across clients, this framework recovers the same node representations as a centralized GNN over the full graph. Experiments on synthetic directed multigraphs with cycles, bicliques, and scatter-gather patterns show that embedding exchange and federated parameter aggregation are complementary rather than interchangeable: their combination recovers most of the representation gap, provided exchanged embeddings are fresh per-step rather than stale per-epoch.

preprint2026arXiv

Large-Small Model Collaboration for Farmland Semantic Change Detection

Farmland Semantic Change Detection (SCD) is essential for cultivated land protection, yet existing benchmarks and models remain insufficient for fine-grained farmland conversion monitoring. Current datasets often lack dedicated "from-to" annotations, while visual change detection models are easily disturbed by phenology-induced pseudo-changes caused by crop rotation, seasonal variation, and illumination differences. To address these challenges, we construct HZNU-FCD, a large-scale fine-grained farmland SCD benchmark with a unified five-class farmland-to-non-farmland annotation protocol. It contains 4,588 bitemporal image pairs with pixel-level labels for practical farmland protection. Based on this benchmark, we propose a large-small collaborative SCD framework that integrates a task-driven small visual model with a frozen large vision-language model. The small model, Fine-grained Difference-aware Mamba (FD-Mamba), learns dense change representations for boundary preservation and small-region localization. The large-model pathway, Cross-modal Logical Arbitration (CMLA), introduces CLIP-based textual priors for prompt-guided semantic arbitration and pseudo-change suppression. To enable effective collaboration, we design a hard-region co-training strategy that supervises the CMLA semantic score map only on low-confidence pixels. Experiments show that our method achieves 97.63% F1, 96.32% IoU, and 96.35% SCD_IoU_mean on HZNU-FCD with only 6.65M trainable parameters. Compared with the multimodal ChangeCLIP-ViT, which leverages vision-language information for change detection, our method improves F1 by 10.19 percentage points on HZNU-FCD. It also achieves 91.43% F1 and 84.21% IoU on LEVIR-CD, and 93.85% F1 and 88.41% IoU on WHU-CD, demonstrating strong robustness and generalization. The code is available at https://github.com/Lovelymili/FD-Mamba.

preprint2026arXiv

PrepBench: How Far Are We from Natural-Language-Driven Data Preparation?

Data preparation is a central and time-consuming stage in data analysis workflows. Traditionally, commercial tools have relied on graphical user interfaces (GUIs) to simplify data preparation, allowing users to define transformations through visual operators and workflows. Recent advances in large language models (LLMs) raise the possibility of a paradigm shift toward natural language (NL)-driven data preparation, in which users can specify preparation intents in NL directly. However, it remains unclear how far current LLM-based agents are from this paradigm shift in practice. Existing code generation benchmarks do not capture key characteristics of data preparation, including ambiguous user intents, imperfect real-world data, and the need to translate code into interpretable workflows for validation. To bridge this gap, we present PrepBench, a benchmark designed to evaluate NL-driven data preparation along three core capabilities: interactive disambiguation, prep-code generation, and code-to-workflow translation. We crawl data from the Preppin' Data Challenges, and then extend it into a systematically designed benchmark. The benchmark covers diverse domains, and each task involves 3 to 18 data preparation steps. Nearly half of the tasks require over 100 lines of Python code, and the longest solutions approach 300 lines. Our evaluation shows that, despite recent progress, realizing this paradigm shift remains challenging for state-of-the-art LLMs. PrepBench provides a principled benchmark for measuring this gap and helps identify key challenges toward realizing NL-driven data preparation.

preprint2026arXiv

SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

Frontier scientific reasoning is rapidly emerging as a key foundation for advancing AI agents in automated scientific discovery. Deep research agents offer a promising approach to this challenge. These models develop robust problem-solving capabilities through post-training on information-seeking tasks, which are typically curated via knowledge graph construction or iterative web browsing. However, these strategies face inherent limitations in frontier science, where domain-specific knowledge is scattered across sparse and heterogeneous academic sources, and problem solving requires sophisticated computation and reasoning far beyond factual recall. To bridge this gap, we introduce SciResearcher, a fully automated agentic framework for frontier-science data construction. SciResearcher synthesizes diverse conceptual and computational tasks grounded in academic evidence, while eliciting information acquisition, tool-integrated reasoning, and long-horizon capabilities. Leveraging the curated data for supervised fine-tuning and agentic reinforcement learning, we develop SciResearcher-8B, an agent foundation model that achieves 19.46% on the HLE-Bio/Chem-Gold benchmark, establishing a new state of the art at its parameter scale and surpassing several larger proprietary agents. It further achieves 13-15% absolute gains on SuperGPQA-Hard-Biology and TRQA-Literature benchmarks. Overall, SciResearcher introduces a new paradigm for automated data construction for frontier scientific reasoning and offers a scalable path toward future scientific agents.

preprint2026arXiv

TouchMap-OR: Multi-View 3D Mapping of Hand-Surface Contacts

Hand-surface interactions between clinicians, patients, and medical equipment play a central role in pathogen transmission during medical procedures. However, these interactions remain largely unobserved, as current infection-prevention practices rely on manual observation and cannot reconstruct detailed contact histories. In this work we formulate the problem of identity-resolved hand-surface interaction reconstruction in operating rooms and introduce TouchMap-OR, a multi-view RGB-D vision system that models clinicians, articulated hand geometry, and the semantic structure of the clinical environment to infer when and where contacts occur. The system reconstructs globally consistent multi-person 3D skeleton tracks across cameras while estimating articulated MANO hand meshes from RGB observations aligned to depth data. Multi-view hand reconstructions are fused and associated with tracked clinicians to obtain consistent left and right hand trajectories. A semantic 3D model of the operating room is built from multi-view segmentation and depth fusion, enabling reconstructed hand trajectories to be mapped to specific surfaces, including medical equipment, movable objects, and patient body sites. Temporal hand-surface proximity is used to infer contact episodes describing which clinician touched which surface and when. We evaluate TouchMap-OR on recordings from three real anesthesia inductions with manually annotated contact events. TouchMap-OR achieves 0.75 binary contact F1, outperforming tracking-based baselines while maintaining comparable multi-person tracking accuracy and achieving 0.96 identity attribution accuracy.

preprint2026arXiv

Towards Backdoor-Based Ownership Verification for Vision-Language-Action Models

Vision-Language-Action models (VLAs) support generalist robotic control by enabling end-to-end decision policies directly from multi-modal inputs. As trained VLAs are increasingly shared and adapted, protecting model ownership becomes essential for secure deployment and responsible open-source usage. In this paper, we present GuardVLA, the first backdoor-based ownership verification framework specifically designed for VLAs. GuardVLA embeds a stealthy and harmless backdoor watermark into the protected model during training by injecting secret messages into embodied visual data. For post-release verification, we propose a swap-and-detect mechanism, in which the trigger projector and an external classifier head are used to activate and detect the embedded backdoor based on prediction probabilities. Extensive experiments across multiple datasets, model architectures, and adaptation settings demonstrate that GuardVLA enables reliable ownership verification while preserving benign task performance. Further results show that the embedded watermark remains detectable under post-release model adaptation.

preprint2025arXiv

Decoupling Constraint from Two Direction in Evolutionary Constrained Multi-objective Optimization

Real-world Constrained Multi-objective Optimization Problems (CMOPs) often contain multiple constraints, and understanding and utilizing the coupling between these constraints is crucial for solving CMOPs. However, existing Constrained Multi-objective Evolutionary Algorithms (CMOEAs) typically ignore these couplings and treat all constraints as a single aggregate, which lacks interpretability regarding the specific geometric roles of constraints. To address this limitation, we first analyze how different constraints interact and show that the final Constrained Pareto Front (CPF) depends not only on the Pareto fronts of individual constraints but also on the boundaries of infeasible regions. This insight implies that CMOPs with different coupling types must be solved from different search directions. Accordingly, we propose a novel algorithm named Decoupling Constraint from Two Directions (DCF2D). This method periodically detects constraint couplings and spawns an auxiliary population for each relevant constraint with an appropriate search direction. Extensive experiments on seven challenging CMOP benchmark suites and on a collection of real-world CMOPs demonstrate that DCF2D outperforms five state-of-the-art CMOEAs, including existing decoupling-based methods.

preprint2024arXiv

Observation and manipulation of quantum interference in a superconducting Kerr parametric oscillator

Quantum tunneling is the phenomenon that makes superconducting circuits "quantum". Recently, there has been a renewed interest in using quantum tunneling in phase space of a Kerr parametric oscillator as a resource for quantum information processing. Here, we report a direct observation of quantum interference induced by such tunneling in a planar superconducting circuit through Wigner tomography. We experimentally elucidate all essential properties of this quantum interference, such as mapping from Fock states to cat states, a temporal oscillation due to the pump detuning, as well as its characteristic Rabi oscillations and Ramsey fringes. Finally, we perform gate operations as manipulations of the observed quantum interference. Our findings lay the groundwork for further studies on quantum properties of superconducting Kerr parametric oscillators and their use in quantum information technologies.

preprint2024arXiv

Phase Transition Study meets Machine Learning

In recent years, machine learning (ML) techniques have emerged as powerful tools for studying many-body complex systems, and encompassing phase transitions in various domains of physics. This mini review provides a concise yet comprehensive examination of the advancements achieved in applying ML to investigate phase transitions, with a primary focus on those involved in nuclear matter studies.

preprint2024arXiv

Structured Learning in Time-dependent Cox Models

Cox models with time-dependent coefficients and covariates are widely used in survival analysis. In high-dimensional settings, sparse regularization techniques are employed for variable selection, but existing methods for time-dependent Cox models lack flexibility in enforcing specific sparsity patterns (i.e., covariate structures). We propose a flexible framework for variable selection in time-dependent Cox models, accommodating complex selection rules. Our method can adapt to arbitrary grouping structures, including interaction selection, temporal, spatial, tree, and directed acyclic graph structures. It achieves accurate estimation with low false alarm rates. We develop the sox package, implementing a network flow algorithm for efficiently solving models with complex covariate structures. sox offers a user-friendly interface for specifying grouping structures and delivers fast computation. Through examples, including a case study on identifying predictors of time to all-cause death in atrial fibrillation patients, we demonstrate the practical application of our method with specific selection rules.

preprint2023arXiv

A Theory of Human-Like Few-Shot Learning

We aim to bridge the gap between our common-sense few-sample human learning and large-data machine learning. We derive a theory of human-like few-shot learning from von-Neuman-Landauer's principle. modelling human learning is difficult as how people learn varies from one to another. Under commonly accepted definitions, we prove that all human or animal few-shot learning, and major models including Free Energy Principle and Bayesian Program Learning that model such learning, approximate our theory, under Church-Turing thesis. We find that deep generative model like variational autoencoder (VAE) can be used to approximate our theory and perform significantly better than baseline models including deep neural networks, for image recognition, low resource language processing, and character recognition.

preprint2023arXiv

An Indoor Environment Sensing and Localization System via mmWave Phased Array

An indoor layout sensing and localization system in 60GHz millimeter wave (mmWave) band, named mmReality, is elaborated in this paper. The mmReality system consists of one transmitter and one mobile receiver, each with a phased array and a single radio frequency (RF) chain. To reconstruct the room layout, the pilot signal is delivered from the transmitter to the receiver via different pairs of transmission and receiving beams, so that the signals at all antenna elements can be resolved. Then, the spatial smoothing and two-dimensional multiple signal classification (MUSIC) algorithm is applied to detect the angle-of-arrival (AoAs) and angle-of-departure (AoDs) of the rays from the transmitter to the receiver. Moreover, the technique of multi-carrier ranging is adopted to measure the distance of each propagation path. Synthesizing the above geometrical parameters, the location of receiver relative to the transmitter can be pinpointed, both line-of-sight (LoS) and non-line-of-sight (NLoS) paths can also be determined. Therefore, the room layout can be reconstructed by moving the receiver and repeating the above measurement in different locations of the room. At the end, we show that the reconstructed room layout can be utilized to locate a mobile device according to its AoA spectrum, even with single access point.

preprint2023arXiv

Boosting Large Language Model for Speech Synthesis: An Empirical Study

Large language models (LLMs) have made significant advancements in natural language processing and are concurrently extending the language ability to other modalities, such as speech and vision. Nevertheless, most of the previous work focuses on prompting LLMs with perception abilities like auditory comprehension, and the effective approach for augmenting LLMs with speech synthesis capabilities remains ambiguous. In this paper, we conduct a comprehensive empirical exploration of boosting LLMs with the ability to generate speech, by combining pre-trained LLM LLaMA/OPT and text-to-speech synthesis model VALL-E. We compare three integration methods between LLMs and speech synthesis models, including directly fine-tuned LLMs, superposed layers of LLMs and VALL-E, and coupled LLMs and VALL-E using LLMs as a powerful text encoder. Experimental results show that, using LoRA method to fine-tune LLMs directly to boost the speech synthesis capability does not work well, and superposed LLMs and VALL-E can improve the quality of generated speech both in speaker similarity and word error rate (WER). Among these three methods, coupled methods leveraging LLMs as the text encoder can achieve the best performance, making it outperform original speech synthesis models with a consistently better speaker similarity and a significant (10.9%) WER reduction.

preprint2023arXiv

On the nature of overcharging and charge inversion in electrical double layers

Understanding overcharging and charge inversion is one of the long-standing challenges in soft matter and biophysics. To study these phenomena, we employ the modified Gaussian renormalized fluctuation theory, which allows for the self-consistent accounting of spatially varying ionic strength, as well as the spatial variations in dielectric permittivity and excluded volume effects. The underlying dependence of overcharging on the electrostatic coupling is elucidated by varying surface charge, counterion valency, and dielectric contrast. Consistent with simulations, three characteristic regimes corresponding to weak, moderate, and strong coupling are identified. Important features like the inversion of zeta potential, crowding and ionic layering at the surface are successfully captured. For weak coupling, there is no overcharging. In the moderate coupling regime, overcharging increases with surface charge. Finally, in the strong coupling regime, ionic crowding and saturation in overcharging are observed. Our theory predicts non-monotonic dependence of charge inversion on multivalent salt concentration as well as the addition of monovalent salt, in quantitative agreement with experiments.

preprint2023arXiv

PACO: Parts and Attributes of Common Objects

Object models are gradually progressing from predicting just category labels to providing detailed descriptions of object instances. This motivates the need for large datasets which go beyond traditional object masks and provide richer annotations such as part masks and attributes. Hence, we introduce PACO: Parts and Attributes of Common Objects. It spans 75 object categories, 456 object-part categories and 55 attributes across image (LVIS) and video (Ego4D) datasets. We provide 641K part masks annotated across 260K object boxes, with roughly half of them exhaustively annotated with attributes as well. We design evaluation metrics and provide benchmark results for three tasks on the dataset: part mask segmentation, object and part attribute prediction and zero-shot instance detection. Dataset, models, and code are open-sourced at https://github.com/facebookresearch/paco.

preprint2023arXiv

Universal Multimodal Representation for Language Understanding

Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals to general NLP tasks. For each sentence, we first retrieve a flexible number of images either from a light topic-image lookup table extracted over the existing sentence-image pairs or a shared cross-modal embedding space that is pre-trained on out-of-shelf text-image pairs. Then, the text and images are encoded by a Transformer encoder and convolutional neural network, respectively. The two sequences of representations are further fused by an attention layer for the interaction of the two modalities. In this study, the retrieval process is controllable and flexible. The universal visual representation overcomes the lack of large-scale bilingual sentence-image pairs. Our method can be easily applied to text-only tasks without manually annotated multimodal parallel corpora. We apply the proposed method to a wide range of natural language generation and understanding tasks, including neural machine translation, natural language inference, and semantic similarity. Experimental results show that our method is generally effective for different tasks and languages. Analysis indicates that the visual signals enrich textual representations of content words, provide fine-grained grounding information about the relationship between concepts and events, and potentially conduce to disambiguation.

preprint2023arXiv

Workload Failure Prediction for Data Centers

Failed workloads that consumed significant computational resources in time and space affect the efficiency of data centers significantly and thus limit the amount of scientific work that can be achieved. While the computational power has increased significantly over the years, detection and prediction of workload failures have lagged far behind and will become increasingly critical as the system scale and complexity further increase. In this study, we analyze workload traces collected from a production cluster and train machine learning models on a large amount of data sets to predict workload failures. Our prediction models consist of a queue-time model that estimates the probability of workload failures before execution and a runtime model that predicts failures at runtime. Evaluation results show that the queue-time model and runtime model can predict workload failures with a maximum precision score of 90.61% and 97.75%, respectively. By integrating the runtime model with the job scheduler, it helps reduce CPU time, and memory usage by up to 16.7% and 14.53%, respectively.

preprint2022arXiv

A new class of bilayer kagome lattice compounds with Dirac nodal lines and pressure-induced superconductivity

Kagome lattice composed of transition-metal ions provides a great opportunity to explore the intertwining between geometry, electronic orders and band topology. The discovery of multiple competing orders that connect intimately with the underlying topological band structure in nonmagnetic kagome metals $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) further pushes this topic to the quantum frontier. Here we report the discovery and characterization of a new class of vanadium-based compounds with kagome bilayers, namely $A$V$_6$Sb$_6$ ($A$ = K, Rb, Cs) and V$_6$Sb$_4$, which, together with $A$V$_3$Sb$_5$, compose a series of kagome compounds with a generic chemical formula ($A_{m-1}$Sb$_{2m}$)(V$_3$Sb)$_n$ (m = 1, 2; n = 1, 2). Theoretical calculations combined with angle-resolved photoemission measurements reveal that these compounds feature Dirac nodal lines in close vicinity to the Fermi level. Pressure-induced superconductivity in $A$V$_6$Sb$_6$ further suggests promising emergent phenomena in these materials. The establishment of a new family of layered kagome materials paves the way for designer of fascinating kagome systems with diverse topological nontrivialities and collective ground states.

preprint2022arXiv

A novel method for data augmentation: Nine Dot Moving Least Square (ND-MLS)

Data augmentation greatly increases the amount of data obtained based on labeled data to save on expenses and labor for data collection and labeling. We present a new approach for data augmentation called nine-dot MLS (ND-MLS). This approach is proposed based on the idea of image defor-mation. Images are deformed based on control points, which are calculated by ND-MLS. The method can generate over 2000 images for one exist-ing dataset in a short time. To verify this data augmentation method, extensive tests were performed covering 3 main tasks of computer vision, namely, classification, detection and segmentation. The results show that 1) in classification, 10 images per category were used for training, and VGGNet can obtain 92% top-1 acc on the MNIST dataset of handwritten digits by ND-MLS. In the Omniglot dataset, the few-shot accuracy usu-ally decreases with the increase in character categories. However, the ND-MLS method has stable performance and obtains 96.5 top-1 acc in Res-Net on 100 different handwritten character classification tasks; 2) in segmentation, under the premise of only ten original images, DeepLab obtains 93.5%, 85%, and 73.3% m_IOU(10) on the bottle, horse, and grass test datasets, respectively, while the cat test dataset obtains 86.7% m_IOU(10) with the SegNet model; 3) with only 10 original images from each category in object detection, YOLO v4 obtains 100% and 97.2% bottle and horse detection, respectively, while the cat dataset obtains 93.6% with YOLO v3. In summary, ND-MLS can perform well on classification, object detec-tion, and semantic segmentation tasks by using only a few data.

preprint2022arXiv

A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation

It is difficult for non-autoregressive translation (NAT) models to capture the multi-modal distribution of target translations due to their conditional independence assumption, which is known as the "multi-modality problem", including the lexical multi-modality and the syntactic multi-modality. While the first one has been well studied, the syntactic multi-modality brings severe challenge to the standard cross entropy (XE) loss in NAT and is under studied. In this paper, we conduct a systematic study on the syntactic multi-modality problem. Specifically, we decompose it into short- and long-range syntactic multi-modalities and evaluate several recent NAT algorithms with advanced loss functions on both carefully designed synthesized datasets and real datasets. We find that the Connectionist Temporal Classification (CTC) loss and the Order-Agnostic Cross Entropy (OAXE) loss can better handle short- and long-range syntactic multi-modalities respectively. Furthermore, we take the best of both and design a new loss function to better handle the complicated syntactic multi-modality in real-world datasets. To facilitate practical usage, we provide a guide to use different loss functions for different kinds of syntactic multi-modality.

preprint2022arXiv

Accelerating Edge Intelligence via Integrated Sensing and Communication

Realizing edge intelligence consists of sensing, communication, training, and inference stages. Conventionally, the sensing and communication stages are executed sequentially, which results in excessive amount of dataset generation and uploading time. This paper proposes to accelerate edge intelligence via integrated sensing and communication (ISAC). As such, the sensing and communication stages are merged so as to make the best use of the wireless signals for the dual purpose of dataset generation and uploading. However, ISAC also introduces additional interference between sensing and communication functionalities. To address this challenge, this paper proposes a classification error minimization formulation to design the ISAC beamforming and time allocation. The globally optimal solution is derived via the rank-1 guaranteed semidefinite relaxation, and performance analysis is performed to quantify the ISAC gain over that of conventional edge intelligence. Simulation results are provided to verify the effectiveness of the proposed ISAC-assisted edge intelligence system. Interestingly, we find that ISAC is always beneficial, when the duration of generating a sample is more than the duration of uploading a sample. Otherwise, the ISAC gain can vanish or even be negative. Nevertheless, we still derive a sufficient condition, under which a positive ISAC gain is feasible.

preprint2022arXiv

Accelerating Federated Edge Learning via Topology Optimization

Federated edge learning (FEEL) is envisioned as a promising paradigm to achieve privacy-preserving distributed learning. However, it consumes excessive learning time due to the existence of straggler devices. In this paper, a novel topology-optimized federated edge learning (TOFEL) scheme is proposed to tackle the heterogeneity issue in federated learning and to improve the communication-and-computation efficiency. Specifically, a problem of jointly optimizing the aggregation topology and computing speed is formulated to minimize the weighted summation of energy consumption and latency. To solve the mixed-integer nonlinear problem, we propose a novel solution method of penalty-based successive convex approximation, which converges to a stationary point of the primal problem under mild conditions. To facilitate real-time decision making, an imitation-learning based method is developed, where deep neural networks (DNNs) are trained offline to mimic the penalty-based method, and the trained imitation DNNs are deployed at the edge devices for online inference. Thereby, an efficient imitate-learning based approach is seamlessly integrated into the TOFEL framework. Simulation results demonstrate that the proposed TOFEL scheme accelerates the federated learning process, and achieves a higher energy efficiency. Moreover, we apply the scheme to 3D object detection with multi-vehicle point cloud datasets in the CARLA simulator. The results confirm the superior learning performance of the TOFEL scheme over conventional designs with the same resource and deadline constraints.

preprint2022arXiv

An approximate randomization test for high-dimensional two-sample Behrens-Fisher problem under arbitrary covariances

This paper is concerned with the problem of comparing the population means of two groups of independent observations. An approximate randomization test procedure based on the test statistic of Chen and Qin (2010) is proposed. The asymptotic behavior of the test statistic as well as the randomized statistic is studied under weak conditions. In our theoretical framework, observations are not assumed to be identically distributed even within groups. No condition on the eigenstructure of the covariance matrices is imposed. And the sample sizes of the two groups are allowed to be unbalanced. Under general conditions, all possible asymptotic distributions of the test statistic are obtained. We derive the asymptotic level and local power of the approximate randomization test procedure. Our theoretical results show that the proposed test procedure can adapt to all possible asymptotic distributions of the test statistic and always has correct test level asymptotically. Also, the proposed test procedure has good power behavior. Our numerical experiments show that the proposed test procedure has favorable performance compared with several alternative test procedures.

preprint2022arXiv

An Interactive Image-based Modeling System

This paper propose a interactive 3D modeling method and corresponding system based on single or multiple uncalibrated images. The main feature of this method is that, according to the modeling habits of ordinary people, the 3D model of the target is reconstructed from coarse to fine images. On the basis of determining the approximate shape, the user adds or modify projection constraints and spatial constraints, and apply topology modification, gradually realize camera calibration, refine rough model, and finally complete the reconstruction of objects with arbitrary geometry and topology. During the interactive process, the geometric parameters and camera projection matrix are solved in real time, and the reconstruction results are displayed in a 3D window.

preprint2022arXiv

Analysis of the distribution, rotation and scale characteristics of solar wind switchbacks: comparison between the first and second encounters of Parker Solar Probe

The S-shaped magnetic structure in the solar wind formed by the twisting of magnetic field lines is called a switchback, whose main characteristics are the reversal of the magnetic field and the significant increase in the solar wind radial velocity. We identify 242 switchbacks during the first two encounters of Parker Solar Probe (PSP). Statistics methods are applied to analyze the distribution and the rotation angle and direction of the magnetic field rotation of the switchbacks. The diameter of switchbacks is estimated with a minimum variance analysis (MVA) method based on the assumption of a cylindrical magnetic tube. We also make a comparison between switchbacks from inside and the boundary of coronal holes. The main conclusions are as follows: (1) the rotation angles of switchbacks observed during the first encounter seem larger than those of the switchbacks observed during the second encounter in general; (2) the tangential component of the velocity inside the switchbacks tends to be more positive (westward) than in the ambient solar wind; (3) switchbacks are more likely to rotate clockwise than anticlockwise, and the number of switchbacks with clockwise rotation is 1.48 and 2.65 times of those with anticlockwise rotation during the first and second encounters, respectively; (4) the diameter of switchbacks is about 10^5 km on average and across five orders of magnitude (10^3 -- 10^7 km).

preprint2022arXiv

Approximately Equivariant Networks for Imperfectly Symmetric Dynamics

Incorporating symmetry as an inductive bias into neural network architecture has led to improvements in generalization, data efficiency, and physical consistency in dynamics modeling. Methods such as CNNs or equivariant neural networks use weight tying to enforce symmetries such as shift invariance or rotational equivariance. However, despite the fact that physical laws obey many symmetries, real-world dynamical data rarely conforms to strict mathematical symmetry either due to noisy or incomplete data or to symmetry breaking features in the underlying dynamical system. We explore approximately equivariant networks which are biased towards preserving symmetry but are not strictly constrained to do so. By relaxing equivariance constraints, we find that our models can outperform both baselines with no symmetry bias and baselines with overly strict symmetry in both simulated turbulence domains and real-world multi-stream jet flow.

preprint2022arXiv

Beamforming Design and Performance Evaluation for Reconfigurable Intelligent Surface Assisted Wireless Communication Systems With Non-Ideal Hardware

Reconfigurable intelligent surface (RIS) can effectively control the wavefront of the impinging signals and has emerged as a cost-effective promising solution to improve the spectrum and energy efficiency of wireless systems. Most existing researches on RIS assume that the hardware operations are perfect. However, both physical transceiver and RIS suffer from inevitable hardware impairments in practice, which can lead to severe system performance degradation and increase the complexity of beamforming optimization. Consequently, the existing researches on RIS, including channel estimation, beamforming optimization, spectrum and energy efficiency analysis, etc., cannot directly apply to the case of hardware impairments. In this paper, by taking hardware impairments into consideration, we conduct the joint transmit and reflect beamforming optimization, and reevaluate the system performance. First, we characterize the closed-form estimators of direct and cascaded channels in both single-user and multi-user cases, and analyze the impact of hardware impairments on channel estimation accuracy. Then, the optimal transmit beamforming solution is derived, and a gradient descent method-based algorithm is also proposed to optimize the reflect beamforming. Moreover, we analyze the three types of asymptotic channel capacities with respect to the transmit power, the antenna number, and the reflecting element number. Finally, in terms of the system energy consumption, we analyze the power scaling law and the energy efficiency. Our experimental results also reveal an encouraging phenomenon that the RIS-assisted wireless system with massive reflecting elements can achieve both high spectrum and energy efficiency without the need for massive antennas and without allocating too many resources to optimize the reflect beamforming.

preprint2022arXiv

BEVT: BERT Pretraining of Video Transformers

This paper studies the BERT pretraining of video transformers. It is a straightforward but worth-studying extension given the recent success from BERT pretraining of image transformers. We introduce BEVT which decouples video representation learning into spatial representation learning and temporal dynamics learning. In particular, BEVT first performs masked image modeling on image data, and then conducts masked image modeling jointly with masked video modeling on video data. This design is motivated by two observations: 1) transformers learned on image datasets provide decent spatial priors that can ease the learning of video transformers, which are often times computationally-intensive if trained from scratch; 2) discriminative clues, i.e., spatial and temporal information, needed to make correct predictions vary among different videos due to large intra-class and inter-class variations. We conduct extensive experiments on three challenging video benchmarks where BEVT achieves very promising results. On Kinetics 400, for which recognition mostly relies on discriminative spatial representations, BEVT achieves comparable results to strong supervised baselines. On Something-Something-V2 and Diving 48, which contain videos relying on temporal dynamics, BEVT outperforms by clear margins all alternative baselines and achieves state-of-the-art performance with a 71.4\% and 87.2\% Top-1 accuracy respectively. Code will be made available at \url{https://github.com/xyzforever/BEVT}.

preprint2022arXiv

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

Back-translation is a critical component of Unsupervised Neural Machine Translation (UNMT), which generates pseudo parallel data from target monolingual data. A UNMT model is trained on the pseudo parallel data with translated source, and translates natural source sentences in inference. The source discrepancy between training and inference hinders the translation performance of UNMT models. By carefully designing experiments, we identify two representative characteristics of the data gap in source: (1) style gap (i.e., translated vs. natural text style) that leads to poor generalization capability; (2) content gap that induces the model to produce hallucination content biased towards the target language. To narrow the data gap, we propose an online self-training approach, which simultaneously uses the pseudo parallel data {natural source, translated target} to mimic the inference scenario. Experimental results on several widely-used language pairs show that our approach outperforms two strong baselines (XLM and MASS) by remedying the style and content gaps.

preprint2022arXiv

Buildup of the Magnetic Flux Ropes in Homologous Solar Eruptions

Homologous coronal mass ejections (CMEs) are an interesting phenomenon, and it is possible to investigate the formation of CMEs by comparing multi-CMEs under a homologous physical condition. AR 11283 had been present on the solar surface for several days when a bipole emerged on 2011 September 4. Its positive polarity collided with the pre-existing negative polarity belonging to a different bipole, producing recurrent solar activities along the polarity inversion line (PIL) between the colliding polarities, namely the so-called collisional PIL (cPIL). Our results show that a large amount of energy and helicity were built up in the form of magnetic flux ropes (MFRs), with recurrent release and accumulation processes. These MFRs were built up along the cPIL. A flux deficit method is adopted and shows that magnetic cancellation happens along the cPIL due to the collisional shearing scenario proposed by Chintzoglou et al. The total amount of canceled flux was $\sim$0.7$\times$10$^{21}$ Mx with an uncertainty of $\sim$13.2$\%$ within the confidence region of the 30$^\circ$ sun-center distance. The canceled flux amounts to 24$\%$ of the total unsigned flux of the bipolar magnetic region. The results show that the magnetic fields beside the cPIL are very sheared, and the average shear angle is above 70$^\circ$ after the collision. The fast expansion of the twist kernels of the MFRs and the continuous eruptive activities are both driven by the collisional shearing process. These results are important for better understanding the buildup process of the MFRs associated with homologous solar eruptions.

preprint2022arXiv

CCP: Correlated Clustering and Projection for Dimensionality Reduction

Most dimensionality reduction methods employ frequency domain representations obtained from matrix diagonalization and may not be efficient for large datasets with relatively high intrinsic dimensions. To address this challenge, Correlated Clustering and Projection (CCP) offers a novel data domain strategy that does not need to solve any matrix. CCP partitions high-dimensional features into correlated clusters and then projects correlated features in each cluster into a one-dimensional representation based on sample correlations. Residue-Similarity (R-S) scores and indexes, the shape of data in Riemannian manifolds, and algebraic topology-based persistent Laplacian are introduced for visualization and analysis. Proposed methods are validated with benchmark datasets associated with various machine learning algorithms.

preprint2022arXiv

Chern-Simons superconductors and their instabilities

Two-dimensional quantum antiferromagnets host rich physics, including long-range ordering, high-$T_c$ superconductivity, quantum spin liquid behavior, topological ordering, a variety of other exotic phases, and quantum criticalities. Frustrating perturbations in antiferromagnets may give rise to strong quantum fluctuations, challenging the theoretical understanding of the many-body ground state. Here we develop a method to describe the quantum antiferromagnets using fermionic degrees of freedom. The method is based on a formally exact mapping between spin exchange models and theories describing fermionic matter with the emergent $U(1)$ Chern-Simons gauge field. For the planar Néel state, this mapping self-consistently generates the Chern-Simons superconductor mean-field ground state of introduced spinless fermions. We systematically compare the Chern-Simons superconductor state with the planar Néel state at the level of collective modes as well as order parameters. We reveal qualitative and quantitative correspondences between these two states. We demonstrate that such a construction using the fractionalized excitations and Chern-Simons gauge field can not only describe the Néel order but can also be applied to study quantum spin liquids. Furthermore, we show that the confinement-deconfinement transitions from the Néel order to quantum spin liquids are signaled and characterized by the instabilities of Chern-Simons superconductors, driven by strong frustration. The results suggest observing and classifying the descendants of antiferromagnets, including other ordered states and unconventional superconductors, as well as emergent quantum spin liquids.

preprint2022arXiv

Collaborative Representation for SPD Matrices with Application to Image-Set Classification

Collaborative representation-based classification (CRC) has demonstrated remarkable progress in the past few years because of its closed-form analytical solutions. However, the existing CRC methods are incapable of processing the nonlinear variational information directly. Recent advances illustrate that how to effectively model these nonlinear variational information and learn invariant representations is an open challenge in the community of computer vision and pattern recognition To this end, we try to design a new algorithm to handle this problem. Firstly, the second-order statistic, i.e., covariance matrix is applied to model the original image sets. Due to the space formed by a set of nonsingular covariance matrices is a well-known Symmetric Positive Definite (SPD) manifold, generalising the Euclidean collaborative representation to the SPD manifold is not an easy task. Then, we devise two strategies to cope with this issue. One attempts to embed the SPD manifold-valued data representations into an associated tangent space via the matrix logarithm map. Another is to embed them into a Reproducing Kernel Hilbert Space (RKHS) by utilizing the Riemannian kernel function. After these two treatments, CRC is applicable to the SPD manifold-valued features. The evaluations on four banchmarking datasets justify its effectiveness.

preprint2022arXiv

Cross-domain Contrastive Learning for Unsupervised Domain Adaptation

Unsupervised domain adaptation (UDA) aims to transfer knowledge learned from a fully-labeled source domain to a different unlabeled target domain. Most existing UDA methods learn domain-invariant feature representations by minimizing feature distances across domains. In this work, we build upon contrastive self-supervised learning to align features so as to reduce the domain discrepancy between training and testing sets. Exploring the same set of categories shared by both domains, we introduce a simple yet effective framework CDCL, for domain alignment. In particular, given an anchor image from one domain, we minimize its distances to cross-domain samples from the same class relative to those from different categories. Since target labels are unavailable, we use a clustering-based approach with carefully initialized centers to produce pseudo labels. In addition, we demonstrate that CDCL is a general framework and can be adapted to the data-free setting, where the source data are unavailable during training, with minimal modification. We conduct experiments on two widely used domain adaptation benchmarks, i.e., Office-31 and VisDA-2017, for image classification tasks, and demonstrate that CDCL achieves state-of-the-art performance on both datasets.

preprint2022arXiv

Data Augmentation vs. Equivariant Networks: A Theory of Generalization on Dynamics Forecasting

Exploiting symmetry in dynamical systems is a powerful way to improve the generalization of deep learning. The model learns to be invariant to transformation and hence is more robust to distribution shift. Data augmentation and equivariant networks are two major approaches to injecting symmetry into learning. However, their exact role in improving generalization is not well understood. In this work, we derive the generalization bounds for data augmentation and equivariant networks, characterizing their effect on learning in a unified framework. Unlike most prior theories for the i.i.d. setting, we focus on non-stationary dynamics forecasting with complex temporal dependencies.

preprint2022arXiv

Deep Reinforcement Learning for Online Routing of Unmanned Aerial Vehicles with Wireless Power Transfer

The unmanned aerial vehicle (UAV) plays an vital role in various applications such as delivery, military mission, disaster rescue, communication, etc., due to its flexibility and versatility. This paper proposes a deep reinforcement learning method to solve the UAV online routing problem with wireless power transfer, which can charge the UAV remotely without wires, thus extending the capability of the battery-limited UAV. Our study considers the power consumption of the UAV and the wireless charging process. Unlike the previous works, we solve the problem by a designed deep neural network. The model is trained using a deep reinforcement learning method offline, and is used to optimize the UAV routing problem online. On small and large scale instances, the proposed model runs from four times to 500 times faster than Google OR-tools, the state-of-the-art combinatorial optimization solver, with identical solution quality. It also outperforms different types of heuristic and local search methods in terms of both run-time and optimality. In addition, once the model is trained, it can scale to new generated problem instances with arbitrary topology that are not seen during training. The proposed method is practically applicable when the problem scale is large and the response time is crucial.

preprint2022arXiv

Determining the number of factors in a large-dimensional generalised factor model

This paper proposes new estimators of the number of factors for a generalised factor model with more relaxed assumptions than the strict factor model. Under the framework of large cross-sections $N$ and large time dimensions $T$, we first derive the bias-corrected estimator $\hat σ^2_*$ of the noise variance in a generalised factor model by random matrix theory. Then we construct three information criteria based on $\hat σ^2_*$, further propose the consistent estimators of the number of factors. Finally, simulations and real data analysis illustrate that our proposed estimations are more accurate and avoid the overestimation in some existing works.

preprint2022arXiv

DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification

Image set-based visual classification methods have achieved remarkable performance, via characterising the image set in terms of a non-singular covariance matrix on a symmetric positive definite (SPD) manifold. To adapt to complicated visual scenarios better, several Riemannian networks (RiemNets) for SPD matrix nonlinear processing have recently been studied. However, it is pertinent to ask, whether greater accuracy gains can be achieved by simply increasing the depth of RiemNets. The answer appears to be negative, as deeper RiemNets tend to lose generalization ability. To explore a possible solution to this issue, we propose a new architecture for SPD matrix learning. Specifically, to enrich the deep representations, we adopt SPDNet [1] as the backbone, with a stacked Riemannian autoencoder (SRAE) built on the tail. The associated reconstruction error term can make the embedding functions of both SRAE and of each RAE an approximate identity mapping, which helps to prevent the degradation of statistical information. We then insert several residual-like blocks with shortcut connections to augment the representational capacity of SRAE, and to simplify the training of a deeper network. The experimental evidence demonstrates that our DreamNet can achieve improved accuracy with increased depth of the network.

preprint2022arXiv

Edge Federated Learning Via Unit-Modulus Over-The-Air Computation

Edge federated learning (FL) is an emerging paradigm that trains a global parametric model from distributed datasets based on wireless communications. This paper proposes a unit-modulus over-the-air computation (UMAirComp) framework to facilitate efficient edge federated learning, which simultaneously uploads local model parameters and updates global model parameters via analog beamforming. The proposed framework avoids sophisticated baseband signal processing, leading to low communication delays and implementation costs. Training loss bounds of UMAirComp FL systems are derived and two low-complexity large-scale optimization algorithms, termed penalty alternating minimization (PAM) and accelerated gradient projection (AGP), are proposed to minimize the nonconvex nonsmooth loss bound. Simulation results show that the proposed UMAirComp framework with PAM algorithm achieves a smaller mean square error of model parameters' estimation, training loss, and test error compared with other benchmark schemes. Moreover, the proposed UMAirComp framework with AGP algorithm achieves satisfactory performance while reduces the computational complexity by orders of magnitude compared with existing optimization algorithms. Finally, we demonstrate the implementation of UMAirComp in a vehicle-to-everything autonomous driving simulation platform. It is found that autonomous driving tasks are more sensitive to model parameter errors than other tasks since the neural networks for autonomous driving contain sparser model parameters.

preprint2022arXiv

Effect of Homomorphic Encryption on the Performance of Training Federated Learning Generative Adversarial Networks

A Generative Adversarial Network (GAN) is a deep-learning generative model in the field of Machine Learning (ML) that involves training two Neural Networks (NN) using a sizable data set. In certain fields, such as medicine, the training data may be hospital patient records that are stored across different hospitals. The classic centralized approach would involve sending the data to a centralized server where the model would be trained. However, that would involve breaching the privacy and confidentiality of the patients and their data, which would be unacceptable. Therefore, Federated Learning (FL), an ML technique that trains ML models in a distributed setting without data ever leaving the host device, would be a better alternative to the centralized option. In this ML technique, only parameters and certain metadata would be communicated. In spite of that, there still exist attacks that can infer user data using the parameters and metadata. A fully privacy-preserving solution involves homomorphically encrypting (HE) the data communicated. This paper will focus on the performance loss of training an FL-GAN with three different types of Homomorphic Encryption: Partial Homomorphic Encryption (PHE), Somewhat Homomorphic Encryption (SHE), and Fully Homomorphic Encryption (FHE). We will also test the performance loss of Multi-Party Computations (MPC), as it has homomorphic properties. The performances will be compared to the performance of training an FL-GAN without encryption as well. Our experiments show that the more complex the encryption method is, the longer it takes, with the extra time taken for HE is quite significant in comparison to the base case of FL.

preprint2022arXiv

Efficient and High-quality Prehensile Rearrangement in Cluttered and Confined Spaces

Prehensile object rearrangement in cluttered and confined spaces has broad applications but is also challenging. For instance, rearranging products in a grocery shelf means that the robot cannot directly access all objects and has limited free space. This is harder than tabletop rearrangement where objects are easily accessible with top-down grasps, which simplifies robot-object interactions. This work focuses on problems where such interactions are critical for completing tasks. It proposes a new efficient and complete solver under general constraints for monotone instances, which can be solved by moving each object at most once. The monotone solver reasons about robot-object constraints and uses them to effectively prune the search space. The new monotone solver is integrated with a global planner to solve non-monotone instances with high-quality solutions fast. Furthermore, this work contributes an effective pre-processing tool to significantly speed up online motion planning queries for rearrangement in confined spaces. Experiments further demonstrate that the proposed monotone solver, equipped with the pre-processing tool, results in 57.3% faster computation and 3 times higher success rate than state-of-the-art methods. Similarly, the resulting global planner is computationally more efficient and has a higher success rate, while producing high-quality solutions for non-monotone instances (i.e., only 1.3 additional actions are needed on average). Videos of demonstrating solutions on a real robotic system and codes can be found at https://github.com/Rui1223/uniform_object_rearrangement.

preprint2022arXiv

Efficient conditioned face animation using frontally-viewed embedding

As the quality of few shot facial animation from landmarks increases, new applications become possible, such as ultra low bandwidth video chat compression with a high degree of realism. However, there are some important challenges to tackle in order to improve the experience in real world conditions. In particular, the current approaches fail to represent profile views without distortions, while running in a low compute regime. We focus on this key problem by introducing a multi-frames embedding dubbed Frontalizer to improve profile views rendering. In addition to this core improvement, we explore the learning of a latent code conditioning generations along with landmarks to better convey facial expressions. Our dense models achieves 22% of improvement in perceptual quality and 73% reduction of landmark error over the first order model baseline on a subset of DFDC videos containing head movements. Declined with mobile architectures, our models outperform the previous state-of-the-art (improving perceptual quality by more than 16% and reducing landmark error by more than 47% on two datasets) while running on real time on iPhone 8 with very low bandwidth requirements.

preprint2022arXiv

EfficientTDNN: Efficient Architecture Search for Speaker Recognition

Convolutional neural networks (CNNs), such as the time-delay neural network (TDNN), have shown their remarkable capability in learning speaker embedding. However, they meanwhile bring a huge computational cost in storage size, processing, and memory. Discovering the specialized CNN that meets a specific constraint requires a substantial effort of human experts. Compared with hand-designed approaches, neural architecture search (NAS) appears as a practical technique in automating the manual architecture design process and has attracted increasing interest in spoken language processing tasks such as speaker recognition. In this paper, we propose EfficientTDNN, an efficient architecture search framework consisting of a TDNN-based supernet and a TDNN-NAS algorithm. The proposed supernet introduces temporal convolution of different ranges of the receptive field and feature aggregation of various resolutions from different layers to TDNN. On top of it, the TDNN-NAS algorithm quickly searches for the desired TDNN architecture via weight-sharing subnets, which surprisingly reduces computation while handling the vast number of devices with various resources requirements. Experimental results on the VoxCeleb dataset show the proposed EfficientTDNN enables approximate $10^{13}$ architectures concerning depth, kernel, and width. Considering different computation constraints, it achieves a 2.20% equal error rate (EER) with 204M multiply-accumulate operations (MACs), 1.41% EER with 571M MACs as well as 0.94% EER with 1.45G MACs. Comprehensive investigations suggest that the trained supernet generalizes subnets not sampled during training and obtains a favorable trade-off between accuracy and efficiency.

preprint2022arXiv

Evaluation of Four Black-box Adversarial Attacks and Some Query-efficient Improvement Analysis

With the fast development of machine learning technologies, deep learning models have been deployed in almost every aspect of everyday life. However, the privacy and security of these models are threatened by adversarial attacks. Among which black-box attack is closer to reality, where limited knowledge can be acquired from the model. In this paper, we provided basic background knowledge about adversarial attack and analyzed four black-box attack algorithms: Bandits, NES, Square Attack and ZOsignSGD comprehensively. We also explored the newly proposed Square Attack method with respect to square size, hoping to improve its query efficiency.

preprint2022arXiv

Exact solutions of Kondo problems in higher-order fermions

The conformal field theory (CFT) approach to Kondo problems, originally developed by Affleck and Ludwig (AL), has greatly advanced the fundamental knowledge of Kondo physics. The CFT approach to Kondo impurities is based on a necessary approximation, i.e., the linearization of the low-lying excitations in a narrow energy window about the Fermi surface. This treatment works well in normal metal baths, but encounters fundamental difficulties in systems with Fermi points and high-order dispersion relations. Prominent examples of such systems are the recently-proposed topological semimetals with emergent higher-order fermions. Here, we develop a new CFT technique that yields exact solutions to the Kondo problems in higher-order fermion systems. Our approach does not require any linearization of the low-lying excitations, and more importantly, it rigorously bosonizes the entire energy spectrum of the higher-order fermions. Therefore, it provides a more solid theoretical base for evaluating the thermodynamic quantities at finite temperatures. Our work significantly broadens the scope of CFT techniques and brings about unprecedented applications beyond the reach of conventional methods.

preprint2022arXiv

Fano Interference in a Single-Molecule Junction

Trends of miniaturized devices and quantum interference electronics lead to the long desire of Fano interference in single-molecule junctions, here, which is successfully demonstrated using the 2,7-di(4-pyridyl)-9,9'-spirobifluorene molecule with a long backbone group and a short side group. Experimentally, the two electrically coupled groups are found to contribute to two blurred degenerate points in the differential conductance mapping. This forms a characteristic non-centrosymmetric double-crossing feature, with distinct temperature response for each crossing. Theoretically, we describe the practical in-junction electron transmission using a new two-tunnelling-channel coupling model and obtain a working formula with a Fano term and a Breit-Wigner term. The formula is shown to provide a good fit for all the mapping data and their temperature dependence in three dimensions, identifying the Fano component. Our work thus forms a complete set of evidence of the Fano interference in a single-molecule junction induced by two-tunnelling-channel coupling transport. Density functional theory calculations are used to corroborate this new physics.

preprint2022arXiv

FLVoogd: Robust And Privacy Preserving Federated Learning

In this work, we propose FLVoogd, an updated federated learning method in which servers and clients collaboratively eliminate Byzantine attacks while preserving privacy. In particular, servers use automatic Density-based Spatial Clustering of Applications with Noise (DBSCAN) combined with S2PC to cluster the benign majority without acquiring sensitive personal information. Meanwhile, clients build dual models and perform test-based distance controlling to adjust their local models toward the global one to achieve personalizing. Our framework is automatic and adaptive that servers/clients don't need to tune the parameters during the training. In addition, our framework leverages Secure Multi-party Computation (SMPC) operations, including multiplications, additions, and comparison, where costly operations, like division and square root, are not required. Evaluations are carried out on some conventional datasets from the image classification field. The result shows that FLVoogd can effectively reject malicious uploads in most scenarios; meanwhile, it avoids data leakage from the server-side.

preprint2022arXiv

Grassmannian Discriminant Maps (GDM) for Manifold Dimensionality Reduction with Application to Image Set Classification

In image set classification, a considerable progress has been made by representing original image sets on Grassmann manifolds. In order to extend the advantages of the Euclidean based dimensionality reduction methods to the Grassmann Manifold, several methods have been suggested recently which jointly perform dimensionality reduction and metric learning on Grassmann manifold to improve performance. Nevertheless, when applied to complex datasets, the learned features do not exhibit enough discriminatory power. To overcome this problem, we propose a new method named Grassmannian Discriminant Maps (GDM) for manifold dimensionality reduction problems. The core of the method is a new discriminant function for metric learning and dimensionality reduction. For comparison and better understanding, we also study a simple variations to GDM. The key difference between them is the discriminant function. We experiment on data sets corresponding to three tasks: face recognition, object categorization, and hand gesture recognition to evaluate the proposed method and its simple extensions. Compared with the state of the art, the results achieved show the effectiveness of the proposed algorithm.

preprint2022arXiv

High quality entanglement distribution through telecommunication fiber using near-infrared non-degenerate photon pairs

For practical quantum communications, the efficiency of the entire system (source, quantum channel and detectors) must be taken into account. In many urban environments, the quantum channel in the form of telecommunication optical fiber (confirming to ITU G.652D standards) are available, but the detectors in this range typically have low efficiency. We investigate the possibility that for campus-type communications, entangled photons prepared in the Near-Infrared Range (NIR) can be transmitted successfully while preserving polarization entanglement. We demonstrate the distribution of degenerate and non-degenerate entangled photon pairs of wavelength around 810 nm through standard telecommunication fiber. This technique benefits from the high efficiency of the NIR single photon detectors and the mature design of setups around 810 nm.. In this work, we obtain high quality entanglement (visibility is 94.8\% based on the raw data) after an overall distance of 12 km, corresponding to about -36 dB of fiber induced loss.

preprint2022arXiv

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

Multi-objective orienteering problems (MO-OPs) are classical multi-objective routing problems and have received a lot of attention in the past decades. This study seeks to solve MO-OPs through a problem-decomposition framework, that is, a MO-OP is decomposed into a multi-objective knapsack problem (MOKP) and a travelling salesman problem (TSP). The MOKP and TSP are then solved by a multi-objective evolutionary algorithm (MOEA) and a deep reinforcement learning (DRL) method, respectively. While the MOEA module is for selecting cities, the DRL module is for planning a Hamiltonian path for these cities. An iterative use of these two modules drives the population towards the Pareto front of MO-OPs. The effectiveness of the proposed method is compared against NSGA-II and NSGA-III on various types of MO-OP instances. Experimental results show that our method exhibits the best performance on almost all the test instances, and has shown strong generalization ability.

preprint2022arXiv

Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review

Hyperspectral unmixing has been an important technique that estimates a set of endmembers and their corresponding abundances from a hyperspectral image (HSI). Nonnegative matrix factorization (NMF) plays an increasingly significant role in solving this problem. In this article, we present a comprehensive survey of the NMF-based methods proposed for hyperspectral unmixing. Taking the NMF model as a baseline, we show how to improve NMF by utilizing the main properties of HSIs (e.g., spectral, spatial, and structural information). We categorize three important development directions including constrained NMF, structured NMF, and generalized NMF. Furthermore, several experiments are conducted to illustrate the effectiveness of associated algorithms. Finally, we conclude the article with possible future directions with the purposes of providing guidelines and inspiration to promote the development of hyperspectral unmixing.

preprint2022arXiv

Improving Deep Image Matting via Local Smoothness Assumption

Natural image matting is a fundamental and challenging computer vision task. Conventionally, the problem is formulated as an underconstrained problem. Since the problem is ill-posed, further assumptions on the data distribution are required to make the problem well-posed. For classical matting methods, a commonly adopted assumption is the local smoothness assumption on foreground and background colors. However, the use of such assumptions was not systematically considered for deep learning based matting methods. In this work, we consider two local smoothness assumptions which can help improving deep image matting models. Based on the local smoothness assumptions, we propose three techniques, i.e., training set refinement, color augmentation and backpropagating refinement, which can improve the performance of the deep image matting model significantly. We conduct experiments to examine the effectiveness of the proposed algorithm. The experimental results show that the proposed method has favorable performance compared with existing matting methods.

preprint2022arXiv

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Self-supervised speech representation learning has shown promising results in various speech processing tasks. However, the pre-trained models, e.g., HuBERT, are storage-intensive Transformers, limiting their scope of applications under low-resource settings. To this end, we propose LightHuBERT, a once-for-all Transformer compression framework, to find the desired architectures automatically by pruning structured parameters. More precisely, we create a Transformer-based supernet that is nested with thousands of weight-sharing subnets and design a two-stage distillation strategy to leverage the contextualized latent representations from HuBERT. Experiments on automatic speech recognition (ASR) and the SUPERB benchmark show the proposed LightHuBERT enables over $10^9$ architectures concerning the embedding dimension, attention dimension, head number, feed-forward network ratio, and network depth. LightHuBERT outperforms the original HuBERT on ASR and five SUPERB tasks with the HuBERT size, achieves comparable performance to the teacher model in most tasks with a reduction of 29% parameters, and obtains a $3.5\times$ compression ratio in three SUPERB tasks, e.g., automatic speaker verification, keyword spotting, and intent classification, with a slight accuracy loss. The code and pre-trained models are available at https://github.com/mechanicalsea/lighthubert.

preprint2022arXiv

M Subdwarf Research III. Spectroscopic Diagnostics for Breaking Parameter Degeneracy

To understand the parameter degeneracy of M subdwarf spectra at low resolution, we assemble a large number of spectral features in the wavelength range of 0.6-2.5 μm with band strength quantified by narrowband indices. Based on the index trends of BT-Settl model sequences, we illustrate how the main atmospheric parameters (Teff, log g, [M/H], and [alpha/Fe]) affect each spectral feature differently. Furthermore, we propose a four-step process to determine the four parameters sequentially, which extends the basic idea proposed by Jao et al. Each step contains several spectral features that break the degeneracy effect when determining a specific stellar parameter. Finally, the feasibility of each spectroscopic diagnostic with different spectral qualities is investigated. The result is resolution-independent down to R~200.

preprint2022arXiv

MINERVAS: Massive INterior EnviRonments VirtuAl Synthesis

With the rapid development of data-driven techniques, data has played an essential role in various computer vision tasks. Many realistic and synthetic datasets have been proposed to address different problems. However, there are lots of unresolved challenges: (1) the creation of dataset is usually a tedious process with manual annotations, (2) most datasets are only designed for a single specific task, (3) the modification or randomization of the 3D scene is difficult, and (4) the release of commercial 3D data may encounter copyright issue. This paper presents MINERVAS, a Massive INterior EnviRonments VirtuAl Synthesis system, to facilitate the 3D scene modification and the 2D image synthesis for various vision tasks. In particular, we design a programmable pipeline with Domain-Specific Language, allowing users to (1) select scenes from the commercial indoor scene database, (2) synthesize scenes for different tasks with customized rules, and (3) render various imagery data, such as visual color, geometric structures, semantic label. Our system eases the difficulty of customizing massive numbers of scenes for different tasks and relieves users from manipulating fine-grained scene configurations by providing user-controllable randomness using multi-level samplers. Most importantly, it empowers users to access commercial scene databases with millions of indoor scenes and protects the copyright of core data assets, e.g., 3D CAD models. We demonstrate the validity and flexibility of our system by using our synthesized data to improve the performance on different kinds of computer vision tasks.

preprint2022arXiv

Model Pruning Based on Quantified Similarity of Feature Maps

Convolutional Neural Networks (CNNs) has been applied in numerous Internet of Things (IoT) devices for multifarious downstream tasks. However, with the increasing amount of data on edge devices, CNNs can hardly complete some tasks in time with limited computing and storage resources. Recently, filter pruning has been regarded as an effective technique to compress and accelerate CNNs, but existing methods rarely prune CNNs from the perspective of compressing high-dimensional tensors. In this paper, we propose a novel theory to find redundant information in three-dimensional tensors, namely Quantified Similarity between Feature Maps (QSFM), and utilize this theory to guide the filter pruning procedure. We perform QSFM on datasets (CIFAR-10, CIFAR-100 and ILSVRC-12) and edge devices, demonstrate that the proposed method can find the redundant information in the neural networks effectively with comparable compression and tolerable drop of accuracy. Without any fine-tuning operation, QSFM can compress ResNet-56 on CIFAR-10 significantly (48.7% FLOPs and 57.9% parameters are reduced) with only a loss of 0.54% in the top-1 accuracy. For the practical application of edge devices, QSFM can accelerate MobileNet-V2 inference speed by 1.53 times with only a loss of 1.23% in the ILSVRC-12 top-1 accuracy.

preprint2022arXiv

Morphological Anti-Aliasing Method for Boundary Slope Prediction

Image pixel aliasing caused by insufficient sampling is a long-standing problem in the field of computer graphics. It has always been the goal of researchers to seek anti-aliasing algorithms with high speed and good effect. Due to the deficiencies in local detection and reconstruction of sloping line boundaries, a morphological anti-aliasing method for boundary slope prediction is proposed. This method uses the information of the local line boundary slope to predict and test the end positions of the line boundary in the global scope, thereby reconstructing The boundary information more consistent with the actual boundary is obtained, and a more accurate linear boundary shape is obtained with only a small increase in the amount of calculation. Compared with the previous morphological anti-aliasing algorithm, the proposed method is based on the global morphological boundary. , can reconstruct the straight line boundary more accurately, and apply it to the anti-aliasing calculation, which can further improve the color transition of the straight line boundary, make the inclined straight line boundary have higher continuity, and obtain a better anti-aliasing effect.

preprint2022arXiv

MULTI-FLGANs: Multi-Distributed Adversarial Networks for Non-IID distribution

Federated learning is an emerging concept in the domain of distributed machine learning. This concept has enabled GANs to benefit from the rich distributed training data while preserving privacy. However, in a non-iid setting, current federated GAN architectures are unstable, struggling to learn the distinct features and vulnerable to mode collapse. In this paper, we propose a novel architecture MULTI-FLGAN to solve the problem of low-quality images, mode collapse and instability for non-iid datasets. Our results show that MULTI-FLGAN is four times as stable and performant (i.e. high inception score) on average over 20 clients compared to baseline FLGAN.

preprint2022arXiv

Multi-objective Pointer Network for Combinatorial Optimization

Multi-objective combinatorial optimization problems (MOCOPs), one type of complex optimization problems, widely exist in various real applications. Although meta-heuristics have been successfully applied to address MOCOPs, the calculation time is often much longer. Recently, a number of deep reinforcement learning (DRL) methods have been proposed to generate approximate optimal solutions to the combinatorial optimization problems. However, the existing studies on DRL have seldom focused on MOCOPs. This study proposes a single-model deep reinforcement learning framework, called multi-objective Pointer Network (MOPN), where the input structure of PN is effectively improved so that the single PN is capable of solving MOCOPs. In addition, two training strategies, based on representative model and transfer learning, respectively, are proposed to further enhance the performance of MOPN in different application scenarios. Moreover, compared to classical meta-heuristics, MOPN only consumes much less time on forward propagation to obtain the Pareto front. Meanwhile, MOPN is insensitive to problem scale, meaning that a trained MOPN is able to address MOCOPs with different scales. To verify the performance of MOPN, extensive experiments are conducted on three multi-objective traveling salesman problems, in comparison with one state-of-the-art model DRL-MOA and three classical multi-objective meta-heuristics. Experimental results demonstrate that the proposed model outperforms all the comparative methods with only 20\% to 40\% training time of DRL-MOA.

preprint2022arXiv

Multi-View Self-Attention Based Transformer for Speaker Recognition

Initially developed for natural language processing (NLP), Transformer model is now widely used for speech processing tasks such as speaker recognition, due to its powerful sequence modeling capabilities. However, conventional self-attention mechanisms are originally designed for modeling textual sequence without considering the characteristics of speech and speaker modeling. Besides, different Transformer variants for speaker recognition have not been well studied. In this work, we propose a novel multi-view self-attention mechanism and present an empirical study of different Transformer variants with or without the proposed attention mechanism for speaker recognition. Specifically, to balance the capabilities of capturing global dependencies and modeling the locality, we propose a multi-view self-attention mechanism for speaker Transformer, in which different attention heads can attend to different ranges of the receptive field. Furthermore, we introduce and compare five Transformer variants with different network architectures, embedding locations, and pooling methods to learn speaker embeddings. Experimental results on the VoxCeleb1 and VoxCeleb2 datasets show that the proposed multi-view self-attention mechanism achieves improvement in the performance of speaker recognition, and the proposed speaker Transformer network attains excellent results compared with state-of-the-art models.

preprint2022arXiv

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

Due to the complex attention mechanisms and model design, most existing vision Transformers (ViTs) can not perform as efficiently as convolutional neural networks (CNNs) in realistic industrial deployment scenarios, e.g. TensorRT and CoreML. This poses a distinct challenge: Can a visual neural network be designed to infer as fast as CNNs and perform as powerful as ViTs? Recent works have tried to design CNN-Transformer hybrid architectures to address this issue, yet the overall performance of these works is far away from satisfactory. To end these, we propose a next generation vision Transformer for efficient deployment in realistic industrial scenarios, namely Next-ViT, which dominates both CNNs and ViTs from the perspective of latency/accuracy trade-off. In this work, the Next Convolution Block (NCB) and Next Transformer Block (NTB) are respectively developed to capture local and global information with deployment-friendly mechanisms. Then, Next Hybrid Strategy (NHS) is designed to stack NCB and NTB in an efficient hybrid paradigm, which boosts performance in various downstream tasks. Extensive experiments show that Next-ViT significantly outperforms existing CNNs, ViTs and CNN-Transformer hybrid architectures with respect to the latency/accuracy trade-off across various vision tasks. On TensorRT, Next-ViT surpasses ResNet by 5.5 mAP (from 40.4 to 45.9) on COCO detection and 7.7% mIoU (from 38.8% to 46.5%) on ADE20K segmentation under similar latency. Meanwhile, it achieves comparable performance with CSWin, while the inference speed is accelerated by 3.6x. On CoreML, Next-ViT surpasses EfficientFormer by 4.6 mAP (from 42.6 to 47.2) on COCO detection and 3.5% mIoU (from 45.1% to 48.6%) on ADE20K segmentation under similar latency. Our code and models are made public at: https://github.com/bytedance/Next-ViT

preprint2022arXiv

Normal and Visibility Estimation of Human Face from a Single Image

Recent work on the intrinsic image of humans starts to consider the visibility of incident illumination and encodes the light transfer function by spherical harmonics. In this paper, we show that such a light transfer function can be further decomposed into visibility and cosine terms related to surface normal. Such decomposition allows us to recover the surface normal in addition to visibility. We propose a deep learning-based approach with a reconstruction loss for training on real-world images. Results show that compared with previous works, the reconstruction of human face from our method better reveals the surface normal and shading details especially around regions where visibility effect is strong.

preprint2022arXiv

On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR

End-to-end automatic speech recognition (ASR) has achieved promising results. However, most existing end-to-end ASR methods neglect the use of specific language characteristics. For Mandarin Chinese ASR tasks, there exist mutual promotion relationship between Pinyin and Character where Chinese characters can be romanized by Pinyin. Based on the above intuition, we first investigate types of end-to-end encoder-decoder based models in the single-input dual-output (SIDO) multi-task framework, after which a novel asynchronous decoding with fuzzy Pinyin sampling method is proposed according to the one-to-one correspondence characteristics between Pinyin and Character. Furthermore, we proposed a two-stage training strategy to make training more stable and converge faster. The results on the test sets of AISHELL-1 dataset show that the proposed enhanced dual-decoder model without a language model is improved by a big margin compared to strong baseline models.

preprint2022arXiv

Online Object Model Reconstruction and Reuse for Lifelong Improvement of Robot Manipulation

This work proposes a robotic pipeline for picking and constrained placement of objects without geometric shape priors. Compared to recent efforts developed for similar tasks, where every object was assumed to be novel, the proposed system recognizes previously manipulated objects and performs online model reconstruction and reuse. Over a lifelong manipulation process, the system keeps learning features of objects it has interacted with and updates their reconstructed models. Whenever an instance of a previously manipulated object reappears, the system aims to first recognize it and then register its previously reconstructed model given the current observation. This step greatly reduces object shape uncertainty allowing the system to even reason for parts of objects, which are currently not observable. This also results in better manipulation efficiency as it reduces the need for active perception of the target object during manipulation. To get a reusable reconstructed model, the proposed pipeline adopts: i) TSDF for object representation, and ii) a variant of the standard particle filter algorithm for pose estimation and tracking of the partial object model. Furthermore, an effective way to construct and maintain a dataset of manipulated objects is presented. A sequence of real-world manipulation experiments is performed. They show how future manipulation tasks become more effective and efficient by reusing reconstructed models of previously manipulated objects, which were generated during their prior manipulation, instead of treating objects as novel every time.

preprint2022arXiv

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

Recently, Synthetic data-based Instance Segmentation has become an exceedingly favorable optimization paradigm since it leverages simulation rendering and physics to generate high-quality image-annotation pairs. In this paper, we propose a Parallel Pre-trained Transformers (PPT) framework to accomplish the synthetic data-based Instance Segmentation task. Specifically, we leverage the off-the-shelf pre-trained vision Transformers to alleviate the gap between natural and synthetic data, which helps to provide good generalization in the downstream synthetic data scene with few samples. Swin-B-based CBNet V2, SwinL-based CBNet V2 and Swin-L-based Uniformer are employed for parallel feature learning, and the results of these three models are fused by pixel-level Non-maximum Suppression (NMS) algorithm to obtain more robust results. The experimental results reveal that PPT ranks first in the CVPR2022 AVA Accessibility Vision and Autonomy Challenge, with a 65.155% mAP.

preprint2022arXiv

Parameter Choices for Sparse Regularization with the $\ell_1$ Norm

We consider a regularization problem whose objective function consists of a convex fidelity term and a regularization term determined by the $\ell_1$ norm composed with a linear transform. Empirical results show that the regularization with the $\ell_1$ norm can promote sparsity of a regularized solution. It is the goal of this paper to understand theoretically the effect of the regularization parameter on the sparsity of the regularized solutions. We establish a characterization of the sparsity under the transform matrix of the solution. When the fidelity term has a special structure and the transform matrix coincides with a identity matrix, the resulting characterization can be taken as a regularization parameter choice strategy with which the regularization problem has a solution having a sparsity of a certain level. We study choices of the regularization parameter so that the regularization term alleviates the ill-posedness and promote sparsity of the resulting regularized solution. Numerical experiments demonstrate that choices of the regularization parameters can balance the sparsity of the solutions of the regularization problem and its approximation to the minimizer of the fidelity function.

preprint2022arXiv

Passive Motion Detection via mmWave Communication System

In this paper, an integrated passive sensing and communication system working in 60 GHz band is elaborated, and the sensing performance is investigated in an application of hand gesture recognition. Specifically, in this integrated system, there are two radio frequency (RF) chains at the receiver and one at the transmitter. Each RF chain is connected with one phased array for analog beamforming. To facilitate simultaneous sensing and communication, the transmitter delivers one stream of information-bearing signals via two beam lobes, one is aligned with the main signal propagation path and the other is directed to the sensing target. Signals from the two lobes are received by the two RF chains at the receiver, respectively. By cross ambiguity coherent processing, the time-Doppler spectrograms of hand gestures can be obtained. Relying on the passive sensing system, a dataset of received signals, where three types of hand gestures are sensed, is collected by using Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) paths as the reference channel respectively. Then a neural network is trained by the dataset for motion detection. It is shown that the classification accuracy rate is high as long as sufficient sensing time is assured. Finally, an empirical model characterizing the relation between the classification accuracy and sensing duration is derived analytically.

preprint2022arXiv

Persistent Homology for Effective Non-Prehensile Manipulation

This work explores the use of topological tools for achieving effective non-prehensile manipulation in cluttered, constrained workspaces. In particular, it proposes the use of persistent homology as a guiding principle in identifying the appropriate non-prehensile actions, such as pushing, to clean a cluttered space with a robotic arm so as to allow the retrieval of a target object. Persistent homology enables the automatic identification of connected components of blocking objects in the space without the need for manual input or tuning of parameters. The proposed algorithm uses this information to push groups of cylindrical objects together and aims to minimize the number of pushing actions needed to reach to the target. Simulated experiments in a physics engine using a model of the Baxter robot show that the proposed topology-driven solution is achieving significantly higher success rate in solving such constrained problems relatively to state-of-the-art alternatives from the literature. It manages to keep the number of pushing actions low, is computationally efficient and the resulting decisions and motion appear natural for effectively solving such tasks.

preprint2022arXiv

Persistent Path Laplacian

Path homology proposed by S.-T.Yau and his co-workers provides a new mathematical model for directed graphs and networks. Persistent path homology (PPH) extends the path homology with filtration to deal with asymmetry structures. However, PPH is constrained to purely topological persistence and cannot track the homotopic shape evolution of data during filtration. To overcome the limitation of PPH, persistent path Laplacian (PPL) is introduced to capture the shape evolution of data. PPL's harmonic spectra fully recover PPH's topological persistence and its non-harmonic spectra reveal the homotopic shape evolution of data during filtration.

preprint2022arXiv

PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

We present PriFit, a semi-supervised approach for label-efficient learning of 3D point cloud segmentation networks. PriFit combines geometric primitive fitting with point-based representation learning. Its key idea is to learn point representations whose clustering reveals shape regions that can be approximated well by basic geometric primitives, such as cuboids and ellipsoids. The learned point representations can then be re-used in existing network architectures for 3D point cloud segmentation, and improves their performance in the few-shot setting. According to our experiments on the widely used ShapeNet and PartNet benchmarks, PriFit outperforms several state-of-the-art methods in this setting, suggesting that decomposability into primitives is a useful prior for learning representations predictive of semantic parts. We present a number of ablative experiments varying the choice of geometric primitives and downstream tasks to demonstrate the effectiveness of the method.

preprint2022arXiv

Q-balls Formation and the Production of Gravitational Waves With Non-minimal Gravitational Coupling

We propose to introduce non-minimal couplings of Affleck-Dine (AD) field to gravity by adding the coupling of AD field to the Ricci scalar curvature. As the Jordan frame supergravity always predict $|Φ|^2 {\cal R}/6$ type coupling for scalars with canonical kinetic terms, we propose a way to realize the required $c_0|Φ|^2 {\cal R}$-type couplings with generic $c_0$ for canonical complex scalar fields after SUSY breaking. The impacts of such non-minimal gravitational couplings for AD field is shown, especially on the Q-balls formation and the associated gravitational wave (GW) productions. New form of scalar potential for AD field in the Einstein frame is obtained. By numerical simulations, we find that, with non-minimal gravitational coupling to AD field, Q-balls can successfully form even with the choice of non-negative $K$ parameter for $ξ>0$. The associated GW productions as well as their dependences on the $ξ$ parameter are also discussed.

preprint2022arXiv

R2: A Distributed Remote Function Execution Mechanism With Built-in Metadata

Named data networking (NDN) constructs a network by names, providing a flexible and decentralized way to manage resources within the edge computing continuum. This paper aims to solve the question, "Given a function with its parameters and metadata, how to select the executor in a distributed manner and obtain the result in NDN?" To answer it, we design R2 that involves the following stages. First, we design a name structure including data, function names, and other function parameters. Second, we develop a 2-phase mechanism, where in the first phase, the function request from a client-first reaches the data source and retrieves the metadata, then the best node is selected while the metadata is responding to the client. In the second phase, the chosen node directly retrieves the data, executes the function, and provides the result to the client. Furthermore, we propose a stop condition to intelligently reduce the processing time of the first phase and provide a simple proof and range analysis. Simulations confirm that R2 outperforms the current solutions in terms of resource allocation, especially when the data volume and the function complexity are high. In the experiments, when the data size is 100 KiB and the function complexity is $\mathcal{O}(n^2)$, the speedup ratio is 4.61. To further evaluate R2, we also implement a general intermediate data processing logic named ``Bolt'' implemented on an app-level in ndnSIM. We believe that R2 shall help the researchers and developers to verify their ideas smoothly.

preprint2022arXiv

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

Graph-Convolution-based methods have been successfully applied to representation learning on homophily graphs where nodes with the same label or similar attributes tend to connect with one another. Due to the homophily assumption of Graph Convolutional Networks (GCNs) that these methods use, they are not suitable for heterophily graphs where nodes with different labels or dissimilar attributes tend to be adjacent. Several methods have attempted to address this heterophily problem, but they do not change the fundamental aggregation mechanism of GCNs because they rely on summation operators to aggregate information from neighboring nodes, which is implicitly subject to the homophily assumption. Here, we introduce a novel aggregation mechanism and develop a RAndom Walk Aggregation-based Graph Neural Network (called RAW-GNN) method. The proposed approach integrates the random walk strategy with graph neural networks. The new method utilizes breadth-first random walk search to capture homophily information and depth-first search to collect heterophily information. It replaces the conventional neighborhoods with path-based neighborhoods and introduces a new path-based aggregator based on Recurrent Neural Networks. These designs make RAW-GNN suitable for both homophily and heterophily graphs. Extensive experimental results showed that the new method achieved state-of-the-art performance on a variety of homophily and heterophily graphs.

preprint2022arXiv

Real-time Monte Carlo Denoising with Weight Sharing Kernel Prediction Network

Real-time Monte Carlo denoising aims at removing severe noise under low samples per pixel (spp) in a strict time budget. Recently, kernel-prediction methods use a neural network to predict each pixel's filtering kernel and have shown a great potential to remove Monte Carlo noise. However, the heavy computation overhead blocks these methods from real-time applications. This paper expands the kernel-prediction method and proposes a novel approach to denoise very low spp (e.g., 1-spp) Monte Carlo path traced images at real-time frame rates. Instead of using the neural network to directly predict the kernel map, i.e., the complete weights of each per-pixel filtering kernel, we predict an encoding of the kernel map, followed by a high-efficiency decoder with unfolding operations for a high-quality reconstruction of the filtering kernels. The kernel map encoding yields a compact single-channel representation of the kernel map, which can significantly reduce the kernel-prediction network's throughput. In addition, we adopt a scalable kernel fusion module to improve denoising quality. The proposed approach preserves kernel prediction methods' denoising quality while roughly halving its denoising time for 1-spp noisy inputs. In addition, compared with the recent neural bilateral grid-based real-time denoiser, our approach benefits from the high parallelism of kernel-based reconstruction and produces better denoising results at equal time.

preprint2022arXiv

Real-time Rendering and Editing of Scattering Effects for Translucent Objects

The photorealistic rendering of the transparent effect of translucent objects is a hot research topic in recent years. A real-time photorealistic rendering and material dynamic editing method for the diffuse scattering effect of translucent objects is proposed based on the bidirectional surface scattering reflectance function's (BSSRDF) Dipole approximation. The diffuse scattering material function in the Dipo le approximation is decomposed into the product form of the shape-related function and the translucent material-related function through principal component analysis; using this decomposition representation, under the real-time photorealistic rendering framework of pre-radiative transmission and the scattering transmission to realize real-time editing of translucent object materials under various light sources. In addition, a method for quadratic wavelet compression of precomputed radiative transfer data in the spatial domain is also proposed. Using the correlation of surface points in the spatial distribution position, on the premise of ensuring the rendering quality, the data is greatly compressed and the rendering is efficiently improved. The experimental results show that the method in this paper can generate a highly realistic translucent effect and ensure the real-time rendering speed.

preprint2022arXiv

Reinforcement Learning Based Robust Policy Design for Relay and Power Optimization in DF Relaying Networks

In this paper, we study the outage minimization problem in a decode-and-forward cooperative network with relay uncertainty. To reduce the outage probability and improve the quality of service, existing researches usually rely on the assumption of both exact instantaneous channel state information (CSI) and environmental uncertainty. However, it is difficult to obtain perfect instantaneous CSI immediately under practical situations where channel states change rapidly, and the uncertainty in communication environments may not be observed, which makes traditional methods not applicable. Therefore, we turn to reinforcement learning (RL) methods for solutions, which do not need any prior knowledge of underlying channel or assumptions of environmental uncertainty. RL method is to learn from the interaction with communication environment, optimize its action policy, and then propose relay selection and power allocation schemes. We first analyse the robustness of RL action policy by giving the lower bound of the worst-case performance, when RL methods are applied to communication scenarios with environment uncertainty. Then, we propose a robust algorithm for outage probability minimization based on RL. Simulation results reveal that compared with traditional RL methods, our approach has better generalization ability and can improve the worst-case performance by about 6% when evaluated in unseen environments.

preprint2022arXiv

Relationship between Successive Flares in the Same Active Region and Space-Weather HMI Active Region Patch (SHARP) Parameters

A solar active region (AR) may produce multiple notable flares during its passage across the solar disk. We investigate successive flares from flare-eruptive active regions, and explore their relationship with solar magnetic parameters. We examine six ARs in this study, each with at least one major flare above X1.0. The Space-Weather HMI Active Region Patch (SHARP) is employed in this study to parameterize the ARs. We aim to identify the most flare-related SHARP parameters and lay foundation for future practical flare forecasts. We first evaluate the correlation coefficients between the SHARP parameters and the successive flare production. Then we adopt a Natural Gradient Boost (NGBoost) method to analyze the relationship between the SHARP parameters and the successive flare bursts. Based on the correlation analysis and the importance distribution returned from NGBoost, we select 8 most flare-related SHARP parameters. Finally, we discuss the physical meanings of the 8 selected parameters and their relationship with flare production.

preprint2022arXiv

Robins-Monro Augmented Lagrangian Method for Stochastic Convex Optimization

In this paper, we propose a Robbins-Monro augmented Lagrangian method (RMALM) to solve a class of constrained stochastic convex optimization, which can be regarded as a hybrid of the Robbins-Monro type stochastic approximation method and the augmented Lagrangian method of convex optimizations. Under mild conditions, we show that the proposed algorithm exhibits a linear convergence rate. Moreover, instead of verifying a computationally intractable stopping criteria, we show that the RMALM with the increasing subproblem iteration number has a global complexity $\mathcal{O}(1/\varepsilon^{1+q})$ for the $\varepsilon$-solution (i.e., $\mathbb{E}\left(\|x^k-x^*\|^2\right) < \varepsilon$), where $q$ is any positive number. Numerical results on synthetic and real data demonstrate that the proposed algorithm outperforms the existing algorithms.

preprint2022arXiv

Rule-based Procedural Tree Modeling Approach

In some entertainment and virtual reality applications, it is necessary to model and draw the real world realistically, so as to improve the fidelity of natural scenes and make users have a better sense of immersion. However, due to the morphological structure of trees The complexity and variety present many challenges for photorealistic modeling and rendering of trees. This paper reviews the progress achieved in photorealistic modeling and rendering of tree branches, leaves, and bark over the past few decades. The main achievement is mainly a rule-based procedural tree modeling method.

preprint2022arXiv

S-type stars discovered in Medium-Resolution Spectra of LAMOST DR9

In this paper, we report on 606 S-type stars identified from Data Release 9 of the LAMOST medium-resolution spectroscopic (MRS) survey, and 539 of them are reported for the first time. The discovery of these stars is a three-step process, i.e., selecting with the ZrO band indices greater than 0.25, excluding non-S-type stars with the iterative Support Vector Machine method, and finally retaining stars with absolute bolometric magnitude larger than -7.1. The 606 stars are consistent with the distribution of known S-type stars in the color-magnitude diagram. We estimated the C/Os using the [C/Fe] and [O/Fe] provided by APOGEE and the MARCS model for S-type stars, respectively, and the results of the two methods show that C/Os of all stars are larger than 0.5. Both the locations on the color-magnitude diagram and C/Os further verify the nature of our S-type sample. Investigating the effect of TiO and atmospheric parameters on ZrO with the sample, we found that log g has a more significant impact on ZrO than Teff and [Fe/H], and both TiO and log g may negatively correlate with ZrO. According to the criterion of Tian et al. (2020), a total of 238 binary candidates were found by the zero-point-calibrated radial velocities from the officially released catalog of LAMOST MRS and the catalog of Zhang et al. (2021). A catalog of these 606 S-type stars is available from the following link https://doi.org/10.12149/101097.

preprint2022arXiv

Secure Rate-Splitting for MIMO Broadcast Channel with Imperfect CSIT and a Jammer

In this paper, we investigate the secure rate-splitting for the two-user multiple-input multiple-output (MIMO) broadcast channel with imperfect channel state information at the transmitter (CSIT) and a multiple-antenna jammer, where each receiver has an equal number of antennas and the jammer has perfect channel state information (CSI). Specifically, we design a secure rate-splitting multiple-access strategy, where the security of split private and common messages is ensured by precoder design with joint nulling and aligning the leakage information, regarding different antenna configurations. Moreover, we show that the sum-secure degrees-of-freedom (SDoF) achieved by secure rate-splitting is optimal and outperforms that by conventional zero-forcing. Therefore, we reveal the sum-SDoF of the two-user MIMO broadcast channel with imperfect CSIT and a jammer, and validate the superiority of rate-splitting for the security purpose in this scenario with emphasis of MIMO.

preprint2022arXiv

Self Punishment and Reward Backfill for Deep Q-Learning

Reinforcement learning agents learn by encouraging behaviours which maximize their total reward, usually provided by the environment. In many environments, however, the reward is provided after a series of actions rather than each single action, leading the agent to experience ambiguity in terms of whether those actions are effective, an issue known as the credit assignment problem. In this paper, we propose two strategies inspired by behavioural psychology to enable the agent to intrinsically estimate more informative reward values for actions with no reward. The first strategy, called self-punishment (SP), discourages the agent from making mistakes that lead to undesirable terminal states. The second strategy, called the rewards backfill (RB), backpropagates the rewards between two rewarded actions. We prove that, under certain assumptions and regardless of the reinforcement learning algorithm used, these two strategies maintain the order of policies in the space of all possible policies in terms of their total reward, and, by extension, maintain the optimal policy. Hence, our proposed strategies integrate with any reinforcement learning algorithm that learns a value or action-value function through experience. We incorporated these two strategies into three popular deep reinforcement learning approaches and evaluated the results on thirty Atari games. After parameter tuning, our results indicate that the proposed strategies improve the tested methods in over 65 percent of tested games by up to over 25 times performance improvement.

preprint2022arXiv

SenseCare: A Research Platform for Medical Image Informatics and Interactive 3D Visualization

Clinical research on smart health has an increasing demand for intelligent and clinic-oriented medical image computing algorithms and platforms that support various applications. To this end, we have developed SenseCare research platform, which is designed to facilitate translational research on intelligent diagnosis and treatment planning in various clinical scenarios. To enable clinical research with Artificial Intelligence (AI), SenseCare provides a range of AI toolkits for different tasks, including image segmentation, registration, lesion and landmark detection from various image modalities ranging from radiology to pathology. In addition, SenseCare is clinic-oriented and supports a wide range of clinical applications such as diagnosis and surgical planning for lung cancer, pelvic tumor, coronary artery disease, etc. SenseCare provides several appealing functions and features such as advanced 3D visualization, concurrent and efficient web-based access, fast data synchronization and high data security, multi-center deployment, support for collaborative research, etc. In this report, we present an overview of SenseCare as an efficient platform providing comprehensive toolkits and high extensibility for intelligent image analysis and clinical research in different application scenarios. We also summarize the research outcome through the collaboration with multiple hospitals.

preprint2022arXiv

Sparse Sampling and Completion for Light Transport in VPL-based Rendering

The many-light formulation provides a general framework for rendering various illumination effects using hundreds of thousands of virtual point lights (VPLs). To efficiently gather the contributions of the VPLs, lightcuts and its extensions cluster the VPLs, which implicitly approximates the lighting matrix with some representative blocks similar to vector quantization. In this paper, we propose a new approximation method based on the previous lightcut method and a low-rank matrix factorization model. As many researchers pointed out, the lighting matrix is low rank, which implies that it can be completed from a small set of known entries. We first generate a conservative global light cut with bounded error and partition the lighting matrix into slices by the coordinate and normal of the surface points using the method of lightslice. Then we perform two passes of randomly sampling on each matrix slice. In the first pass, uniformly distributed random entries are sampled to coarsen the global light cut, further clustering the similar light for the spatially localized surface points of the slices. In the second pass, more entries are sampled according to the possibility distribution function estimated from the first sampling result. Then each matrix slice is factorized into a product of two smaller low-rank matrices constrained by the sampled entries, which delivers a completion of the lighting matrix. The factorized form provides an additional speedup for adding up the matrix columns which is more GPU friendly. Compared with the previous lightcut based methods, we approximate the lighting matrix with some signal specialized bases via factorization. The experimental results shows that we can achieve significant acceleration than the state of the art many-light methods.

preprint2022arXiv

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Motivated by the success of T5 (Text-To-Text Transfer Transformer) in pre-trained natural language processing models, we propose a unified-modal SpeechT5 framework that explores the encoder-decoder pre-training for self-supervised speech/text representation learning. The SpeechT5 framework consists of a shared encoder-decoder network and six modal-specific (speech/text) pre/post-nets. After preprocessing the input speech/text through the pre-nets, the shared encoder-decoder network models the sequence-to-sequence transformation, and then the post-nets generate the output in the speech/text modality based on the output of the decoder. Leveraging large-scale unlabeled speech and text data, we pre-train SpeechT5 to learn a unified-modal representation, hoping to improve the modeling capability for both speech and text. To align the textual and speech information into this unified semantic space, we propose a cross-modal vector quantization approach that randomly mixes up speech/text states with latent units as the interface between encoder and decoder. Extensive evaluations show the superiority of the proposed SpeechT5 framework on a wide variety of spoken language processing tasks, including automatic speech recognition, speech synthesis, speech translation, voice conversion, speech enhancement, and speaker identification. We release our code and model at https://github.com/microsoft/SpeechT5.

preprint2022arXiv

Spinodal Enhancement of Light Nuclei Yield Ratio in Relativistic Heavy Ion Collisions

Using a relativistic transport model to describe the evolution of the quantum chromodynamic matter produced in Au+Au collisions at $\sqrt{s_{NN}}=3-200$ GeV, we study the effect of a first-order phase transition in the equation of state of this matter on the yield ratio $N_tN_p/ N_d^2$ ($tp/d^2$) of produced proton ($p$), deuteron ($d$), and triton ($t$). We find that the large density inhomogeneities generated by the spinodal instability during the first-order phase transition can survive the fast expansion of the subsequent hadronic matter and lead to an enhanced $tp/d^2$ in central collisions at $\sqrt{s_{NN}}=3-5$ GeV as seen in the experiments by the STAR Collaboration and the E864 Collaboration. However, this enhancement subsides with increasing collision centrality, and the resulting almost flat centrality dependence of $tp/d^2$ at $\sqrt{s_{NN}}=3$ GeV can also be used as a signal for the first-order phase transition.

preprint2022arXiv

Supervised ML Solution for Band Assignment in Dual-Band Systems with Omnidirectional and Directional Antennas

Many wireless networks, including 5G NR (New Radio) and future beyond 5G cellular systems, are expected to operate on multiple frequency bands. This paper considers the band assignment (BA) problem in dual-band systems, where the basestation (BS) chooses one of the two available frequency bands (centimeter-wave and millimeter-wave bands) to communicate with the user equipment (UE). While the millimeter-wave band might offer higher data rate, there is a significant probability of outage during which the communication should be carried on the (more reliable) centimeter-wave band. With mobility, the BA can be perceived as a sequential problem, where the BS uses previously observed information to predict the best band for a future time step. We formulate the BA as a binary classification problem and propose supervised Machine Learning (ML) solutions. We study the problem when both the BS and the UE use (i) omnidirectional antennas and (ii) both use directional antennas. In the omnidirectional case, we derive analytical benchmark solutions based on the Gaussian Process (GP) assumption for the inter-band shadow fading. In the directional case, where the labeling is shown to be complex, we propose an efficient labeling approach based on the Viterbi Algorithm (VA). We compare the performances for two channel models: (i) a stochastic channel and (ii) a ray-tracing based channel.

preprint2022arXiv

Tailoring Quadrupole Topological Insulators with Periodic Driving and Disorder

The quadrupole topological insulator (QTI) has attracted intense studies as a prototype of symmetry-protected higher-order topological phases of matter with a quantized quadrupole moment. The realization of QTIs has been reported in various static settings with periodic structures. Here, we theoretically investigate topological phase transitions and establish the QTI phase in a periodically driven system with disorder. In the clean limit, the Floquet QTI phase emerges from a topologically trivial band structure driven by elliptically polarized irradiation. More strikingly, starting from a pure and static system with trivial topology, we unveil an intriguing QTI phase which necessitates the simultaneous presence of disorder and periodic driving. Furthermore, we reveal that particle-hole symmetry is sufficient to protect the QTI. Our work not only establishes a new strategy to design QTIs but also enriches the symmetry-protected mechanism of higher-order topology.

preprint2022arXiv

The generality of uncooperative and cooperative effects in elementary hydrogen-bonded systems

The cooperative effect plays a significant role in understanding the intermolecular donor-acceptor interactions of hydrogen bonds (H-bonds, D-H...A). Herein, using the benchmark method of high-precision ab initio, the well-known cooperative effect is reproduced in elementary H-bonded systems with different D and A atoms. That is, with the decreasing of intermolecular distance, the D-H bond length first increases and then decreases, while the H...A bond length decreases. On the contrary, when D and A are the same, as the intermolecular distance decreases, the D-H bond length decreases without increasing, which is referred to as the uncooperative effect. Further analyses conclude that compared to cooperative H-bonded systems, uncooperative systems at their respective equilibrium position have a larger core-valence bifurcation (CVB) index (>0.022) and lower binding energies (<0.25 eV), showing a clear linear inverse relationship related to H-bond strength. Therefore, the intermolecular non-H-bonding interactions are predicted to reflect the uncooperative characteristics, which is confirmed by high-precision ab initio calculations. These findings provide a direction for the comprehensive understanding of H-bonds.

preprint2022arXiv

TOSE: A Fast Capacity Estimation Algorithm Based on Spike Approximations

Capacity is one of the most important performance metrics for wireless communication networks. It describes the maximum rate at which the information can be transmitted of a wireless communication system. To support the growing demand for wireless traffic, wireless networks are becoming more dense and complicated, leading to a higher difficulty to derive the capacity. Unfortunately, most existing methods for the capacity calculation take a polynomial time complexity. This will become unaffordable for future ultra-dense networks, where both the number of base stations (BSs) and the number of users are extremely large. In this paper, we propose a fast algorithm TOSE to estimate the capacity for ultra-dense wireless networks. Based on the spiked model of random matrix theory (RMT), our algorithm can avoid the exact eigenvalue derivations of large dimensional matrices, which are complicated and inevitable in conventional capacity calculation methods. Instead, fast eigenvalue estimations can be realized based on the spike approximations in our TOSE algorithm. Our simulation results show that TOSE is an accurate and fast capacity approximation algorithm. Its estimation error is below 5%, and it runs in linear time, which is much lower than the polynomial time complexity of existing methods. In addition, TOSE has superior generality, since it is independent of the distributions of BSs and users, and the shape of network areas.

preprint2022arXiv

Transport Model Comparison Studies of Intermediate-Energy Heavy-Ion Collisions

Transport models are the main method to obtain physics information from low to relativistic-energy heavy-ion collisions. The Transport Model Evaluation Project (TMEP) has been pursued to test the robustness of transport model predictions in reaching consistent conclusions from the same type of physical model. Calculations under controlled conditions of physical input and set-up were performed with various participating codes. These included both calculations of nuclear matter in a box with periodic boundary conditions, and more realistic calculations of heavy-ion collisions. In this intermediate review, we summarize and discuss the present status of the project. We also provide condensed descriptions of the 26 participating codes, which contributed to some part of the project. These include the major codes in use today. We review the main results of the studies completed so far. They show, that in box calculations the differences between the codes can be well understood and a convergence of the results can be reached. These studies also highlight the systematic differences between the two families of transport codes, known as BUU and QMD type codes. However, when the codes were compared in full heavy-ion collisions using different physical models, as recently for pion production, they still yielded substantially different results. This calls for further comparisons of heavy-ion collisions with controlled models and of box comparisons of important ingredients, like momentum-dependent fields, which are currently underway. We often indicate improved strategies in performing transport simulations and thus provide guidance to code developers. Results of transport simulations of heavy-ion collisions from a given code will have more significance if the code can be validated against benchmark calculations such as the ones summarized in this review.

preprint2022arXiv

TRT-ViT: TensorRT-oriented Vision Transformer

We revisit the existing excellent Transformers from the perspective of practical application. Most of them are not even as efficient as the basic ResNets series and deviate from the realistic deployment scenario. It may be due to the current criterion to measure computation efficiency, such as FLOPs or parameters is one-sided, sub-optimal, and hardware-insensitive. Thus, this paper directly treats the TensorRT latency on the specific hardware as an efficiency metric, which provides more comprehensive feedback involving computational capacity, memory cost, and bandwidth. Based on a series of controlled experiments, this work derives four practical guidelines for TensorRT-oriented and deployment-friendly network design, e.g., early CNN and late Transformer at stage-level, early Transformer and late CNN at block-level. Accordingly, a family of TensortRT-oriented Transformers is presented, abbreviated as TRT-ViT. Extensive experiments demonstrate that TRT-ViT significantly outperforms existing ConvNets and vision Transformers with respect to the latency/accuracy trade-off across diverse visual tasks, e.g., image classification, object detection and semantic segmentation. For example, at 82.7% ImageNet-1k top-1 accuracy, TRT-ViT is 2.7$\times$ faster than CSWin and 2.0$\times$ faster than Twins. On the MS-COCO object detection task, TRT-ViT achieves comparable performance with Twins, while the inference speed is increased by 2.8$\times$.

preprint2022arXiv

Turning Channel Noise into an Accelerator for Over-the-Air Principal Component Analysis

Recently years, the attempts on distilling mobile data into useful knowledge has been led to the deployment of machine learning algorithms at the network edge. Principal component analysis (PCA) is a classic technique for extracting the linear structure of a dataset, which is useful for feature extraction and data compression. In this work, we propose the deployment of distributed PCA over a multi-access channel based on the algorithm of stochastic gradient descent to learn the dominant feature space of a distributed dataset at multiple devices. Over-the-air aggregation is adopted to reduce the multi-access latency, giving the name over-the-air PCA. The novelty of this design lies in exploiting channel noise to accelerate the descent in the region around each saddle point encountered by gradient descent, thereby increasing the convergence speed of over-the-air PCA. The idea is materialized by proposing a power-control scheme which detects the type of descent region and controlling the level of channel noise accordingly. The scheme is proved to achieve a faster convergence rate than in the case without power control.

preprint2022arXiv

Uniform Object Rearrangement: From Complete Monotone Primitives to Efficient Non-Monotone Informed Search

Object rearrangement is a widely-applicable and challenging task for robots. Geometric constraints must be carefully examined to avoid collisions and combinatorial issues arise as the number of objects increases. This work studies the algorithmic structure of rearranging uniform objects, where robot-object collisions do not occur but object-object collisions have to be avoided. The objective is minimizing the number of object transfers under the assumption that the robot can manipulate one object at a time. An efficiently computable decomposition of the configuration space is used to create a "region graph", which classifies all continuous paths of equivalent collision possibilities. Based on this compact but rich representation, a complete dynamic programming primitive DFSDP performs a recursive depth first search to solve monotone problems quickly, i.e., those instances that do not require objects to be moved first to an intermediate buffer. DFSDP is extended to solve single-buffer, non-monotone instances, given a choice of an object and a buffer. This work utilizes these primitives as local planners in an informed search framework for more general, non-monotone instances. The search utilizes partial solutions from the primitives to identify the most promising choice of objects and buffers. Experiments demonstrate that the proposed solution returns near-optimal paths with higher success rate, even for challenging non-monotone instances, than other leading alternatives.

preprint2022arXiv

Unique quantum impurity states driven by a vortex in topological superconductors

The interplay of magnetic impurity and vortex in a topological superconductor is of fundamental interest with major implications for implementing quantum computation. There are multiple degrees of freedom interacting with the impurity in the system, including the Majorana zero mode (MZM), the Caroli de Gennes Matricon (CdGM) states, and the electron bulk states that form Cooper pairs, which makes the impurity pinned vortex state elusive to date in topological superconductors. Here, we present an accurate solution of the problem, based on a generalized mapping scheme and the density-matrix renormalization group (DMRG) method. We identify in-gap states that are distinct from the established Yu-Shiba-Rusinov (YSR) states. The newly found in-gap physics is driven by three prominent mechanisms: (i) the coupling of impurity and bulk states leading to competition between Kondo screening and Cooper pairing, resulting in a singlet-doublet quantum phase transition, (ii) the coupling of impurity and CdGM states introducing an effective Zeeman splitting to the doublet state, and (iii) the coupling of impurity and MZM turning the singlet-doublet transition into a crossover. These mechanisms cooperatively produce a unique spin-resolved local density of states. Despite the MZM, a robust nearly zero-energy peak is generated, with comparable height but opposite spin polarization as that of the MZM. These results signify novel emergent physics and offer insights to elucidate intriguing experimental phenomena.

preprint2022arXiv

Unveiling the dynamics of nucleosynthesis in relativistic heavy-ion collisions

Like nucleosynthesis during the early universe, light nuclei are also produced in relativistic heavy-ion collisions. Although the deuteron ($d$) yields in these collisions can be well described by the statistical hadronization model (SHM), which assumes that particle yields are fixed at a common chemical freezeout near the phase boundary between the quark-gluon plasma and the hadron gas, the recently measured triton ($^3\text{H}$) yields in Au+Au collisions at $\sqrt{s_{NN}}=7.7-200$ GeV are overestimated systematically by this model. Here, we develop a comprehensive kinetic approach to study the effects of hadronic re-scatterings, such as $πNN\leftrightarrowπd$ and $πNNN\leftrightarrowπ^3\text{H}~(^3\text{He})$, on $d$, $^3\text{H}$, and $^3\text{He}$ production in these collisions. We find that these reactions have little effects on the deuteron yield but reduce the $^3\text{H}$ and $^3\text{He}$ yields by about a factor of 1.8 from their initial values given by the SHM. This finding helps resolve the overestimation of triton production in the SHM and provides the evidence for hadronic re-scattering effects on nucleosynthesis in relativistic heavy-ion collisions.

preprint2022arXiv

Using Autoencoders on Differentially Private Federated Learning GANs

Machine learning has been applied to almost all fields of computer science over the past decades. The introduction of GANs allowed for new possibilities in fields of medical research and text prediction. However, these new fields work with ever more privacy-sensitive data. In order to maintain user privacy, a combination of federated learning, differential privacy and GANs can be used to work with private data without giving away a users' privacy. Recently, two implementations of such combinations have been published: DP-Fed-Avg GAN and GS-WGAN. This paper compares their performance and introduces an alternative version of DP-Fed-Avg GAN that makes use of denoising techniques to combat the loss in accuracy that generally occurs when applying differential privacy and federated learning to GANs. We also compare the novel adaptation of denoised DP-Fed-Avg GAN to the state-of-the-art implementations in this field.

preprint2022arXiv

Variational Hierarchical Directed Bounding Box Construction for Solid Mesh Models

Object oriented bounding box tree (OBB-Tree for short) has many applications in collision detection, real-time rendering, etc. It has a wide range of applications. The construction of the hierarchical directed bounding box of the solid mesh model is studied, and a new optimization solution method is proposed. But this part of the external space volume that does not belong to the solid mesh model is used as the error, and an error calculation method based on hardware acceleration is given. Secondly, the hierarchical bounding box construction problem is transformed into a variational approximation problem, and the optimal hierarchical directed bounding box is obtained by solving the global error minimum. In the optimization calculation, we propose that combining Lloyd clustering iteration in the same layer and MultiGrid-like reciprocating iteration between layers. Compared with previous results, this method can generate aired original solid mesh models are more tightly packed with hierarchical directed bounding box approximation. In the practical application of collision detection, the results constructed using this method can reduce the computational time of collision detection and improve detection efficiency.

preprint2022arXiv

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

Transformer-based models have achieved top performance on major video recognition benchmarks. Benefiting from the self-attention mechanism, these models show stronger ability of modeling long-range dependencies compared to CNN-based models. However, significant computation overheads, resulted from the quadratic complexity of self-attention on top of a tremendous number of tokens, limit the use of existing video transformers in applications with limited resources like mobile devices. In this paper, we extend Mobile-Former to Video Mobile-Former, which decouples the video architecture into a lightweight 3D-CNNs for local context modeling and a Transformer modules for global interaction modeling in a parallel fashion. To avoid significant computational cost incurred by computing self-attention between the large number of local patches in videos, we propose to use very few global tokens (e.g., 6) for a whole video in Transformers to exchange information with 3D-CNNs with a cross-attention mechanism. Through efficient global spatial-temporal modeling, Video Mobile-Former significantly improves the video recognition performance of alternative lightweight baselines, and outperforms other efficient CNN-based models at the low FLOP regime from 500M to 6G total FLOPs on various video recognition tasks. It is worth noting that Video Mobile-Former is the first Transformer-based video model which constrains the computational budget within 1G FLOPs.

preprint2022arXiv

VU-BERT: A Unified framework for Visual Dialog

The visual dialog task attempts to train an agent to answer multi-turn questions given an image, which requires the deep understanding of interactions between the image and dialog history. Existing researches tend to employ the modality-specific modules to model the interactions, which might be troublesome to use. To fill in this gap, we propose a unified framework for image-text joint embedding, named VU-BERT, and apply patch projection to obtain vision embedding firstly in visual dialog tasks to simplify the model. The model is trained over two tasks: masked language modeling and next utterance retrieval. These tasks help in learning visual concepts, utterances dependence, and the relationships between these two modalities. Finally, our VU-BERT achieves competitive performance (0.7287 NDCG scores) on VisDial v1.0 Datasets.

preprint2022arXiv

Weyl nodes with higher-order topology in an optically driven nodal-line semimetal

Creating and manipulating topological states is a key goal of condensed matter physics. Periodic driving offers a powerful method to manipulate electronic states, and even to create topological states in solids. Here, we investigate the tunable Floquet states in a periodically driven higher-order nodal line semimetal with both spatial inversion and time-reversal symmetries. We found that the Floquet Weyl semimetal states, which support both one-dimensional hinge Fermi arc and two-dimensional surface Fermi arc states, can be induced in the higher-order nodal-line semimetal by shining circularly polarized light. Moreover, we show that the location of Weyl nodes and the curvature of surface Fermi arcs can be tuned by adjusting the propagation direction and incident angle of light.

preprint2022arXiv

WikiDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity Types

Multimodal Entity Linking (MEL) which aims at linking mentions with multimodal contexts to the referent entities from a knowledge base (e.g., Wikipedia), is an essential task for many multimodal applications. Although much attention has been paid to MEL, the shortcomings of existing MEL datasets including limited contextual topics and entity types, simplified mention ambiguity, and restricted availability, have caused great obstacles to the research and application of MEL. In this paper, we present WikiDiverse, a high-quality human-annotated MEL dataset with diversified contextual topics and entity types from Wikinews, which uses Wikipedia as the corresponding knowledge base. A well-tailored annotation procedure is adopted to ensure the quality of the dataset. Based on WikiDiverse, a sequence of well-designed MEL models with intra-modality and inter-modality attentions are implemented, which utilize the visual information of images more adequately than existing MEL models do. Extensive experimental analyses are conducted to investigate the contributions of different modalities in terms of MEL, facilitating the future research on this task. The dataset and baseline models are available at https://github.com/wangxw5/wikiDiverse.

preprint2021arXiv

Bayesian User Localization and Tracking for Reconfigurable Intelligent Surface Aided MIMO Systems

In this paper, we study the user localization and tracking problem in the reconfigurable intelligent surface (RIS) aided multiple-input multiple-output (MIMO) system, where a multi-antenna base station (BS) and multiple RISs are deployed to assist the localization and tracking of a multi-antenna user. By establishing a probability transition model for user mobility, we develop a message-passing algorithm, termed the Bayesian user localization and tracking (BULT) algorithm, to estimate and track the user position and the angle-of-arrival (AoAs) at the user in an online fashion. We also derive Bayesian Cramér Rao bound (BCRB) to characterize the fundamental performance limit of the considered tracking problem. To improve the tracking performance, we optimize the beamforming design at the BS and the RISs to minimize the derived BCRB. Simulation results show that our BULT algorithm can perform close to the derived BCRB, and significantly outperforms the counterpart algorithms without exploiting the temporal correlation of the user location.

preprint2021arXiv

Blockchain-Based Federated Learning in Mobile Edge Networks with Application in Internet of Vehicles

The rapid increase of the data scale in Internet of Vehicles (IoV) system paradigm, hews out new possibilities in boosting the service quality for the emerging applications through data sharing. Nevertheless, privacy concerns are major bottlenecks for data providers to share private data in traditional IoV networks. To this end, federated learning (FL) as an emerging learning paradigm, where data providers only send local model updates trained on their local raw data rather than upload any raw data, has been recently proposed to build a privacy-preserving data sharing models. Unfortunately, by analyzing on the differences of uploaded local model updates from data providers, private information can still be divulged, and performance of the system cannot be guaranteed when partial federated nodes executes malicious behavior. Additionally, traditional cloud-based FL poses challenges to the communication overhead with the rapid increase of terminal equipment in IoV system. All these issues inspire us to propose an autonomous blockchain empowered privacy-preserving FL framework in this paper, where the mobile edge computing (MEC) technology was naturally integrated in IoV system.

preprint2021arXiv

Cache Placement Optimization in Mobile Edge Computing Networks with Unaware Environment -- An Extended Multi-armed Bandit Approach

Caching high-frequency reuse contents at the edge servers in the mobile edge computing (MEC) network omits the part of backhaul transmission and further releases the pressure of data traffic. However, how to efficiently decide the caching contents for edge servers is still an open problem, which refers to the cache capacity of edge servers, the popularity of each content, and the wireless channel quality during transmission. In this paper, we discuss the influence of unknown user density and popularity of content on the cache placement solution at the edge server. Specifically, towards the implementation of the cache placement solution in the practical network, there are two problems needing to be solved. First, the estimation of unknown users' preference needs a huge amount of records of users' previous requests. Second, the overlapping serving regions among edge servers cause the wrong estimation of users' preference, which hinders the individual decision of caching placement. To address the first issue, we propose a learning-based solution to adaptively optimize the cache placement policy. We develop the extended multi-armed bandit (Extended MAB), which combines the generalized global bandit (GGB) and Standard Multi-armed bandit (MAB). For the second problem, a multi-agent Extended MAB-based solution is presented to avoid the mis-estimation of parameters and achieve the decentralized cache placement policy. The proposed solution determines the primary time slot and secondary time slot for each edge server. The proposed strategies are proven to achieve the bounded regret according to the mathematical analysis. Extensive simulations verify the optimality of the proposed strategies when comparing with baselines.

preprint2021arXiv

Color Contrast Enhanced Rendering for Optical See-through Head-mounted Displays

Most commercially available optical see-through head-mounted displays (OST-HMDs) utilize optical combiners to simultaneously visualize the physical background and virtual objects. The displayed images perceived by users are a blend of rendered pixels and background colors. Enabling high fidelity color perception in mixed reality (MR) scenarios using OST-HMDs is an important but challenging task. We propose a real-time rendering scheme to enhance the color contrast between virtual objects and the surrounding background for OST-HMDs. Inspired by the discovery of color perception in psychophysics, we first formulate the color contrast enhancement as a constrained optimization problem. We then design an end-to-end algorithm to search the optimal complementary shift in both chromaticity and luminance of the displayed color. This aims at enhancing the contrast between virtual objects and the real background as well as keeping the consistency with the original color. We assess the performance of our approach using a simulated OST-HMD environment and an off-the-shelf OST-HMD. Experimental results from objective evaluations and subjective user studies demonstrate that the proposed approach makes rendered virtual objects more distinguishable from the surrounding background, thereby bringing a better visual experience.

preprint2021arXiv

Deep reinforcement learning for universal quantum state preparation via dynamic pulse control

Accurate and efficient preparation of quantum state is a core issue in building a quantum computer. In this paper, we investigate how to prepare a certain single- or two-qubit target state from arbitrary initial states in semiconductor double quantum dots with the aid of deep reinforcement learning. Our method is based on the training of the network over numerous preparing tasks. The results show that once the network is well trained, it works for any initial states in the continuous Hilbert space. Thus repeated training for new preparation tasks is avoided. Our scheme outperforms the traditional optimization approaches based on gradient with both the higher designing efficiency and the preparation quality in discrete control space. Moreover, we find that the control trajectories designed by our scheme are robust against static and dynamic fluctuations, such as charge and nuclear noises.

preprint2021arXiv

Electric and magnetic fields tuned spin-polarized topological phases in two-dimensional ferromagnetic MnBi$_4$Te$_7$

Applying electric or magnetic fields is widely used to not only create and manipulate topological states but also facilitate their observations in experiments. In this work, we show by first-principles calculations and topological analysis that the time-reversal (TR) symmetry-broken quantum spin Hall (QSH) state emerges in a two-dimensional ferromagnetic MnBi$_4$Te$_7$ monolayer. This TR-symmetry broken QSH phase possesses a highly tunable nontrivial band gap under an external electric field (or tuning interlayer distance). Furthermore, based on the Wannier-function-based tight-binding approach, we reveal that a topological phase transition from the TR-symmetry broken QSH phase to the quantum anomalous Hall (QAH) phase occurs with the increase of magnetic fields. Besides, we also find that a reverse electric fields can facilitate the realization of QAH phase. Our work not only uncovers the ferromagnetic topological properties the MnBi$_4$Te$_7$ monolayer tuned by electric and magnetic fields, but also can stimulate further applications to spintronics and topological devices.

preprint2021arXiv

Floquet Valley-Polarized Quantum Anomalous Hall State in Nonmagnetic Heterobilayers

The valley-polarized quantum anomalous Hall (VQAH) state, which forwards a strategy for combining valleytronics and spintronics with nontrivial topology, attracts intensive interest in condensed-matter physics. So far, the explored VQAH states have still been limited to magnetic systems. Here, using the low-energy effective model and Floquet theorem, we propose a different mechanism to realize the Floquet VQAH state in nonmagnetic heterobilayers under light irradiation. We then realize this proposal via first-principles calculations in transition metal dichalcogenide heterobilayers, which initially possess the time-reversal invariant valley quantum spin Hall (VQSH) state. By irradiating circularly polarized light, the time-reversal invariant VQSH state can evolve into the VQAH state, behaving as an optically switchable topological spin-valley filter. These findings not only offer a rational scheme to realize the VQAH state without magnetic orders, but also pave a fascinating path for designing topological spintronic and valleytronic devices.

preprint2021arXiv

Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network

Cooperative communication is an effective approach to improve spectrum utilization. In order to reduce outage probability of communication system, most studies propose various schemes for relay selection and power allocation, which are based on the assumption of channel state information (CSI). However, it is difficult to get an accurate CSI in practice. In this paper, we study the outage probability minimizing problem subjected to a total transmission power constraint in a two-hop cooperative relay network. We use reinforcement learning (RL) methods to learn strategies for relay selection and power allocation, which do not need any prior knowledge of CSI but simply rely on the interaction with communication environment. It is noted that conventional RL methods, including most deep reinforcement learning (DRL) methods, cannot perform well when the search space is too large. Therefore, we first propose a DRL framework with an outage-based reward function, which is then used as a baseline. Then, we further propose a hierarchical reinforcement learning (HRL) framework and training algorithm. A key difference from other RL-based methods in existing literatures is that, our proposed HRL approach decomposes relay selection and power allocation into two hierarchical optimization objectives, which are trained in different levels. With the simplification of search space, the HRL approach can solve the problem of sparse reward, while the conventional RL method fails. Simulation results reveal that compared with traditional DRL method, the HRL training algorithm can reach convergence 30 training iterations earlier and reduce the outage probability by 5% in two-hop relay network with the same outage threshold.

preprint2021arXiv

Joint Location and Communication Study for Intelligent Reflecting Surface Aided Wireless Communication System

Intelligent reflecting surface (IRS) is a novel burgeoning concept, which possesses advantages in enhancing wireless communication and user localization, while maintaining low hardware cost and energy consumption. Herein, we establish an IRS-aided mmWave-MIMO based joint localization and communication system (IMM-JLCS), and probe into its performance evaluation and optimization design. Specifically, first, we provide the signal, channel and estimation error models, and contrive the working process of the IMM-JLCS in detail. Then, by configuring appropriate IRS phase shifts, we derive the closed-form expressions of the Cramer-Rao Lower Bound (CRLB) of the position/orientation estimation errors and the effective achievable data rate (EADR), with respect to the time allocation ratio of the beam alignment and localization stage (BALS). Subsequently, we investigate the trade-off between the two performance metrics, for which we propose a joint optimization algorithm. Finally, we carry out simulations and comparisons to view the trade-off and validate the effectiveness of the proposed algorithm, in the presence of distinct levels of estimation uncertainty and user mobility. Our results demonstrate that the proposed algorithm can find the joint optimal solution for the position/orientation estimation accuracy and EADR, with its optimization performance being robust to slight localization or channel estimation errors and user mobility.

preprint2021arXiv

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Text to speech (TTS) has been broadly used to synthesize natural and intelligible speech in different scenarios. Deploying TTS in various end devices such as mobile phones or embedded devices requires extremely small memory usage and inference latency. While non-autoregressive TTS models such as FastSpeech have achieved significantly faster inference speed than autoregressive models, their model size and inference latency are still large for the deployment in resource constrained devices. In this paper, we propose LightSpeech, which leverages neural architecture search~(NAS) to automatically design more lightweight and efficient models based on FastSpeech. We first profile the components of current FastSpeech model and carefully design a novel search space containing various lightweight and potentially effective architectures. Then NAS is utilized to automatically discover well performing architectures within the search space. Experiments show that the model discovered by our method achieves 15x model compression ratio and 6.5x inference speedup on CPU with on par voice quality. Audio demos are provided at https://speechresearch.github.io/lightspeech.

preprint2021arXiv

Magnetic field induced Valley-Polarized Quantum Anomalous Hall Effects in Ferromagnetic van der Waals Heterostructures

The valley-polarized quantum anomalous Hall effect (VQAHE) attracts intensive interest since it uniquely combines valleytronics and spintronics with nontrivial band topology. Here, based on first-principles calculations and Wannier-function-based tight-binding (WFTB) model, we reveal that valley-based Hall effects and especially the VQAHE induced by external magnetic fields can occur in two-dimensional (2D) ferromagnetic van der Waals heterostructures (vdWHs). The results show that considerable valley-splitting derived from the Zeeman exchange energy drives these vdWHs generating the valley anomalous Hall effect and then the VQAHE. The chiral edge states and quantized Hall conductance are utilized to confirm the presence of VQAHE. Besides, it is also found that external electric fields (or tuning interlayer distances) can facilitate the realization of VQAHE, and thus we present a phase diagram in a broad parameter regime of magnetic fields and electric fields (or interlayer distances). Our work not only offers a class of ferromagnetic vdWHs to realize various valley-based Hall phases, but also can guide advancements for designing topological devices with spin-valley filtering effects based on the VQAHE.

preprint2021arXiv

Methodology-centered review of molecular modeling, simulation, and prediction of SARS-CoV-2

The deadly coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has gone out of control globally. Despite much effort by scientists, medical experts, and society in general, the slow progress on drug discovery and antibody therapeutic development, the unknown possible side effects of the existing vaccines, and the high transmission rate of the SARS-CoV-2, remind us of the sad reality that our current understanding of the transmission, infectivity, and evolution of SARS-CoV-2 is unfortunately very limited. The major limitation is the lack of mechanistic understanding of viral-host cell interactions, the viral regulation, protein-protein interactions, including antibody-antigen binding, protein-drug binding, host immune response, etc. This limitation will likely haunt the scientific community for a long time and have a devastating consequence in combating COVID-19 and other pathogens. Notably, compared to the long-cycle, highly cost, and safety-demanding molecular-level experiments, the theoretical and computational studies are economical, speedy, and easy to perform. There exists a tsunami of the literature on molecular modeling, simulation, and prediction of SARS-CoV-2 that has become impossible to fully be covered in a review. To provide the reader a quick update about the status of molecular modeling, simulation, and prediction of SARS-CoV-2, we present a comprehensive and systematic methodology-centered narrative in the nick of time. Aspects such as molecular modeling, Monte Carlo (MC) methods, structural bioinformatics, machine learning, deep learning, and mathematical approaches are included in this review. This review will be beneficial to researchers who are looking for ways to contribute to SARS-CoV-2 studies and those who are assessing the current status in the field.

preprint2021arXiv

Mimicing the Kane-Mele type spin orbit interaction by spin-flexual phonon coupling in graphene devices

On the efforts of enhancing the spin orbit interaction (SOI) of graphene for seeking the dissipationless quantum spin Hall devices, unique Kane-Mele type SOI and high mobility samples are desired. However, common external decoration often introduces extrinsic Rashba-type SOI and simultaneous impurity scattering. Here we show, by the EDTA-Dy molecule decorating, the Kane-Mele type SOI is mimicked with even improved carrier mobility. It is evidenced by the suppressed weak localization at equal carrier densities and simultaneous Elliot-Yafet spin relaxation. The extracted spin scattering time is monotonically dependent on the carrier elastic scattering time, where the Elliot-Yafet plot gives the interaction strength of 3.3 meV. Improved quantum Hall plateaus can be even seen after the external operation. This is attributed to the spin-flexural phonon coupling induced by the enhanced graphene ripples, as revealed by the in-plane magnetotransport measurement.

preprint2021arXiv

On Secure Degrees of Freedom of the MIMO Interference Channel with Local Output Feedback

This paper studies the problem of sum-secure degrees of freedom (SDoF) of the (M,M,N,N) multiple-input multiple-output (MIMO) interference channel with local output feedback, so as to build an information-theoretic foundation and provide practical transmission schemes for 6G-enabled vehicles-to-vehicles (V2V). For this problem, we propose two novel transmission schemes, i.e., the interference decoding scheme and the interference alignment scheme, and thus establish a sum-SDoF lower bound. In particular, to optimize the phase duration, we analyze the security and decoding constraints and formulate a linear-fractional optimization problem. Furthermore, we show that the derived sum-SDoF lower bound is the sum-SDoF for M <= N/2, N=M, and 2N <= M antenna configurations, and reveal that for a fixed N, the optimal M to maximize the sum-SDoF is not less than 2N. Through simulations, we examine the secure sum-rate performance of proposed transmission schemes, and reveal that using local output feedback can lead to a higher secure sum-rate than that by using delayed channel state information at the transmitter.

preprint2021arXiv

Passive Beamforming Design and Channel Estimation for IRS Communication System with Few-Bit ADCs

Utilizing intelligent reflecting surface (IRS) was proven to be efficient in improving the energy efficiency for wireless networks. In this paper, we investigate the passive beamforming and channel estimation for IRS assisted wireless communications with low-resolution analog-to-digital converters (ADCs) at the receiver. We derive the approximate achievable rate by using the Bussgang theorem. Based on the derived analytical achievable rate expression, we maximize the achievable rate by using semidefinite programming (SDP), branch-and-bound (BB), and gradient-based approaches. A maximum likelihood (ML) estimator is then proposed for channel estimation by considering the $\mathrm{1}$-bit quantization ADC. Numerical result shows that the proposed beamforming design and channel estimation method significantly outperforms the existing methods.

preprint2021arXiv

Photoinduced Quantum Anomalous Hall States in the Topological Anderson Insulator

The realization of the quantum anomalous Hall (QAH) effect without magnetic doping attracts intensive interest since magnetically doped topological insulators usually possess inhomogeneity of ferromagnetic order. Here, we propose a different strategy to realize intriguing QAH states arising from the interplay of light and non-magnetic disorder in two-dimensional topologically trivial systems. By combining the Born approximation and Floquet theory, we show that a time-reversal invariant disorder-induced topological insulator, known as the topological Anderson insulator (TAI), would evolve into a time-reversal broken TAI and then into a QAH insulator by shining circularly polarized light. We utilize spin and charge Hall conductivities, which can be measured in experiments directly, to distinguish these three different topological phases. This work not only offers an exciting opportunity to realize the high-temperature QAH effect without magnetic orders, but also is important for applications of topological states to spintronics.

preprint2021arXiv

Prediction and mitigation of mutation threats to COVID-19 vaccines and antibody therapies

Antibody therapeutics and vaccines are among our last resort to end the raging COVID-19 pandemic. They, however, are prone to over 5,000 mutations on the spike (S) protein uncovered by a Mutation Tracker based on over 200,000 genome isolates. It is imperative to understand how mutations would impact vaccines and antibodies in the development. In this work, we study the mechanism, frequency, and ratio of mutations on the S protein. Additionally, we use 56 antibody structures and analyze their 2D and 3D characteristics. Moreover, we predict the mutation-induced binding free energy (BFE) changes for the complexes of S protein and antibodies or ACE2. By integrating genetics, biophysics, deep learning, and algebraic topology, we reveal that most of 462 mutations on the receptor-binding domain (RBD) will weaken the binding of S protein and antibodies and disrupt the efficacy and reliability of antibody therapies and vaccines. A list of 31 vaccine escape mutants is identified, while many other disruptive mutations are detailed as well. We also unveil that about 65\% existing RBD mutations, including those variants recently found in the United Kingdom (UK) and South Africa, are binding-strengthen mutations, resulting in more infectious COVID-19 variants. We discover the disparity between the extreme values of RBD mutation-induced BFE strengthening and weakening of the bindings with antibodies and ACE2, suggesting that SARS-CoV-2 is at an advanced stage of evolution for human infection, while the human immune system is able to produce optimized antibodies. This discovery implies the vulnerability of current vaccines and antibody drugs to new mutations. Our predictions were validated by comparison with more than 1,400 deep mutations on the S protein RBD. Our results show the urgent need to develop new mutation-resistant vaccines and antibodies and to prepare for seasonal vaccinations.

preprint2021arXiv

QoS-Driven Resource Optimization for Intelligent Fog Radio Access Network: A Dynamic Power Allocation Perspective

The fog radio access network (Fog-RAN) has been considered a promising wireless access architecture to help shorten the communication delay and relieve the large data delivery burden over the backhaul links. However, limited by conventional inflexible communication design, Fog-RAN cannot be used in some complex communication scenarios. In this study, we focus on investigating a more intelligent Fog-RAN to assist the communication in a high-speed railway environment. Due to the train's continuously moving, the communication should be designed intelligently to adapt to channel variation. Specifically, we dynamically optimize the power allocation in the remote radio heads (RRHs) to minimize the total network power cost considering multiple quality-of-service (QoS) requirements and channel variation. The impact of caching on the power allocation is considered. The dynamic power optimization is analyzed to obtain a closed-form solution in certain cases. The inherent tradeoff among the total network cost, delay and delivery content size is further discussed. To evaluate the performance of the proposed dynamic power allocation, we present an invariant power allocation counterpart as a performance comparison benchmark. The result of our simulation reveals that dynamic power allocation can significantly outperform the invariant power allocation scheme, especially with a random caching strategy or limited caching resources at the RRHs.

preprint2021arXiv

Reconfigurable Intelligent Surface Assisted Edge Machine Learning

The ever-growing popularity and rapid improving of artificial intelligence (AI) have raised rethinking on the evolution of wireless networks. Mobile edge computing (MEC) provides a natural platform for AI applications since it provides rich computation resources to train AI models, as well as low-latency access to the data generated by mobile and Internet of Things devices. In this paper, we present an infrastructure to perform machine learning tasks at an MEC server with the assistance of a reconfigurable intelligent surface (RIS). In contrast to conventional communication systems where the principal criteria are to maximize the throughput, we aim at optimizing the learning performance. Specifically, we minimize the maximum learning error of all users by jointly optimizing the beamforming vectors of the base station and the phase-shift matrix of the RIS. An alternating optimization-based framework is proposed to optimize the two terms iteratively, where closed-form expressions of the beamforming vectors are derived, and an alternating direction method of multipliers (ADMM)-based algorithm is designed together with an error level searching framework to effectively solve the nonconvex optimization problem of the phase-shift matrix. Simulation results demonstrate significant gains of deploying an RIS and validate the advantages of our proposed algorithms over various benchmarks.

preprint2021arXiv

Remote Sensing Image Super-resolution and Object Detection: Benchmark and State of the Art

For the past two decades, there have been significant efforts to develop methods for object detection in Remote Sensing (RS) images. In most cases, the datasets for small object detection in remote sensing images are inadequate. Many researchers used scene classification datasets for object detection, which has its limitations; for example, the large-sized objects outnumber the small objects in object categories. Thus, they lack diversity; this further affects the detection performance of small object detectors in RS images. This paper reviews current datasets and object detection methods (deep learning-based) for remote sensing images. We also propose a large-scale, publicly available benchmark Remote Sensing Super-resolution Object Detection (RSSOD) dataset. The RSSOD dataset consists of 1,759 hand-annotated images with 22,091 instances of very high resolution (VHR) images with a spatial resolution of ~0.05 m. There are five classes with varying frequencies of labels per class. The image patches are extracted from satellite images, including real image distortions such as tangential scale distortion and skew distortion. We also propose a novel Multi-class Cyclic super-resolution Generative adversarial network with Residual feature aggregation (MCGR) and auxiliary YOLOv5 detector to benchmark image super-resolution-based object detection and compare with the existing state-of-the-art methods based on image super-resolution (SR). The proposed MCGR achieved state-of-the-art performance for image SR with an improvement of 1.2dB PSNR compared to the current state-of-the-art NLSN method. MCGR achieved best object detection mAPs of 0.758, 0.881, 0.841, and 0.983, respectively, for five-class, four-class, two-class, and single classes, respectively surpassing the performance of the state-of-the-art object detectors YOLOv5, EfficientDet, Faster RCNN, SSD, and RetinaNet.

preprint2021arXiv

SG-Net: Syntax Guided Transformer for Language Representation

Understanding human language is one of the key themes of artificial intelligence. For language representation, the capacity of effectively modeling the linguistic knowledge from the detail-riddled and lengthy texts and getting rid of the noises is essential to improve its performance. Traditional attentive models attend to all words without explicit constraint, which results in inaccurate concentration on some dispensable words. In this work, we propose using syntax to guide the text modeling by incorporating explicit syntactic constraints into attention mechanisms for better linguistically motivated word representations. In detail, for self-attention network (SAN) sponsored Transformer-based encoder, we introduce syntactic dependency of interest (SDOI) design into the SAN to form an SDOI-SAN with syntax-guided self-attention. Syntax-guided network (SG-Net) is then composed of this extra SDOI-SAN and the SAN from the original Transformer encoder through a dual contextual architecture for better linguistics inspired representation. The proposed SG-Net is applied to typical Transformer encoders. Extensive experiments on popular benchmark tasks, including machine reading comprehension, natural language inference, and neural machine translation show the effectiveness of the proposed SG-Net design.

preprint2021arXiv

Text Compression-aided Transformer Encoding

Text encoding is one of the most important steps in Natural Language Processing (NLP). It has been done well by the self-attention mechanism in the current state-of-the-art Transformer encoder, which has brought about significant improvements in the performance of many NLP tasks. Though the Transformer encoder may effectively capture general information in its resulting representations, the backbone information, meaning the gist of the input text, is not specifically focused on. In this paper, we propose explicit and implicit text compression approaches to enhance the Transformer encoding and evaluate models using this approach on several typical downstream tasks that rely on the encoding heavily. Our explicit text compression approaches use dedicated models to compress text, while our implicit text compression approach simply adds an additional module to the main model to handle text compression. We propose three ways of integration, namely backbone source-side fusion, target-side fusion, and both-side fusion, to integrate the backbone information into Transformer-based models for various downstream tasks. Our evaluation on benchmark datasets shows that the proposed explicit and implicit text compression approaches improve results in comparison to strong baselines. We therefore conclude, when comparing the encodings to the baseline models, text compression helps the encoders to learn better language representations.

preprint2021arXiv

Unsupervised heart abnormality detection based on phonocardiogram analysis with Beta Variational Auto-Encoders

Heart Sound (also known as phonocardiogram (PCG)) analysis is a popular way that detects cardiovascular diseases (CVDs). Most PCG analysis uses supervised way, which demands both normal and abnormal samples. This paper proposes a method of unsupervised PCG analysis that uses beta variational auto-encoder ($β-\text{VAE}$) to model the normal PCG signals. The best performed model reaches an AUC (Area Under Curve) value of 0.91 in ROC (Receiver Operating Characteristic) test for PCG signals collected from the same source. Unlike majority of $β-\text{VAE}$s that are used as generative models, the best-performed $β-\text{VAE}$ has a $β$ value smaller than 1. Further experiments then find that the introduction of a light weighted KL divergence between distribution of latent space and normal distribution improves the performance of anomaly PCG detection based on anomaly scores resulted by reconstruction loss. The fact suggests that anomaly score based on reconstruction loss may be better than anomaly scores based on latent vectors of samples

preprint2020arXiv

Accurate Word Representations with Universal Visual Guidance

Word representation is a fundamental component in neural language understanding models. Recently, pre-trained language models (PrLMs) offer a new performant method of contextualized word representations by leveraging the sequence-level context for modeling. Although the PrLMs generally give more accurate contextualized word representations than non-contextualized models do, they are still subject to a sequence of text contexts without diverse hints for word representation from multimodality. This paper thus proposes a visual representation method to explicitly enhance conventional word embedding with multiple-aspect senses from visual guidance. In detail, we build a small-scale word-image dictionary from a multimodal seed dataset where each word corresponds to diverse related images. The texts and paired images are encoded in parallel, followed by an attention layer to integrate the multimodal representations. We show that the method substantially improves the accuracy of disambiguation. Experiments on 12 natural language understanding and machine translation tasks further verify the effectiveness and the generalization capability of the proposed approach.

preprint2020arXiv

Achievable DoF Regions of Three-User MIMO Broadcast Channel with Delayed CSIT

For the two-user multiple-input multiple-output (MIMO) broadcast channel with delayed channel state information at the transmitter (CSIT) and arbitrary antenna configurations, all the degrees-of-freedom (DoF) regions are obtained. However, for the three-user MIMO broadcast channel with delayed CSIT and arbitrary antenna configurations, the DoF region of order-2 messages is still unclear and only a partial achievable DoF region of order-1 messages is obtained, where the order-2 messages and order-1 messages are desired by two receivers and one receiver, respectively. In this paper, for the three-user MIMO broadcast channel with delayed CSIT and arbitrary antenna configurations, we first design transmission schemes for order-2 messages and order-1 messages. Next, we propose to analyze the achievable DoF region of transmission scheme by transformation approach. In particular, we transform the decoding condition of transmission scheme w.r.t. phase duration into the achievable DoF region w.r.t. achievable DoF, through achievable DoF tuple expression connecting phase duration and achievable DoF. As a result, the DoF region of order-2 messages is characterized and an achievable DoF region of order-1 messages is completely expressed. Besides, for order-1 messages, we derive the sufficient condition, under which the proposed achievable DoF region is the DoF region.

preprint2020arXiv

Angle Aware User Cooperation for Secure Massive MIMO in Rician Fading Channel

Massive multiple-input multiple-output communications can achieve high-level security by concentrating radio frequency signals towards the legitimate users. However, this system is vulnerable in a Rician fading environment if the eavesdropper positions itself such that its channel is highly "similar" to the channel of a legitimate user. To address this problem, this paper proposes an angle aware user cooperation (AAUC) scheme, which avoids direct transmission to the attacked user and relies on other users for cooperative relaying. The proposed scheme only requires the eavesdropper's angle information, and adopts an angular secrecy model to represent the average secrecy rate of the attacked system. With this angular model, the AAUC problem turns out to be nonconvex, and a successive convex optimization algorithm, which converges to a Karush-Kuhn-Tucker solution, is proposed. Furthermore, a closed-form solution and a Bregman first-order method are derived for the cases of large-scale antennas and large-scale users, respectively. Extension to the intelligent reflecting surfaces based scheme is also discussed. Simulation results demonstrate the effectiveness of the proposed successive convex optimization based AAUC scheme, and also validate the low-complexity nature of the proposed large-scale optimization algorithms.

preprint2020arXiv

Channel Estimation and Power Scaling Law of Large Reflecting Surface with Non-Ideal Hardware

Large reflecting surface (LRS) has emerged as a new solution to improve the energy and spectrum efficiency of wireless communication system. Most existing studies were conducted with an assumption of ideal hardware, and the impact of hardware impairments receives little attention. However, the non-negligible hardware impairments should be taken into consideration when we evaluate the system performance. In this paper, we consider an LRS assisted communication system with hardware impairments, and focus on the channel estimation study and the power scaling law analysis. First, with linear minimum mean square error estimation, we theoretically characterize the relationship between channel estimation performance and impairment level, number of reflecting elements, and pilot power. After that, we analyze the power scaling law and reveal that if the base station (BS) has perfect channel state information, the transmit power of user can be made inversely proportional to the number of BS antennas and the square of the number of reflecting elements with no reduction in performance; If the BS has imperfectly estimated channel state information, to achieve the same performance, the transmit power of user can be made inversely proportional to the square-root of the number of BS antennas and the square of the number of reflecting elements.

preprint2020arXiv

Characterizing SARS-CoV-2 mutations in the United States

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been mutating since it was first sequenced in early January 2020. The genetic variants have developed into a few distinct clusters with different properties. Since the United States (US) has the highest number of viral infected patients globally, it is essential to understand the US SARS-CoV-2. Using genotyping, sequence-alignment, time-evolution, $k$-means clustering, protein-folding stability, algebraic topology, and network theory, we reveal that the US SARS-CoV-2 has four substrains and five top US SARS-CoV-2 mutations were first detected in China (2 cases), Singapore (2 cases), and the United Kingdom (1 case). The next three top US SARS-CoV-2 mutations were first detected in the US. These eight top mutations belong to two disconnected groups. The first group consisting of 5 concurrent mutations is prevailing, while the other group with three concurrent mutations gradually fades out. Our analysis suggests that female immune systems are more active than those of males in responding to SARS-CoV-2 infections. We identify that one of the top mutations, 27964C$>$T-(S24L) on ORF8, has an unusually strong gender dependence. Based on the analysis of all mutations on the spike protein, we further uncover that three of four US SASR-CoV-2 substrains become more infectious. Our study calls for effective viral control and containing strategies in the US.

preprint2020arXiv

Constraining the in-medium nucleon-nucleon cross section from the width of nuclear giant dipole resonance

We develop a new lattice Hamiltonian method for solving the Boltzmann-Uehling-Uhlenbeck (BUU) equation. Adopting the stochastic approach to treat the collision term and using the GPU parallel computing to carry out the calculations allows for a rather high accuracy in evaluating the collision term, especially its Pauli blocking, leading thus to a new level of precision in solving the BUU equation. Applying this lattice BUU method to study the width of giant dipole resonance (GDR) in nuclei, where the accurate treatment of the collision term is crucial, we find that the obtained GDR width of $^{208}{\rm Pb}$ shows a strong dependence on the in-medium nucleon-nucleon cross section $σ_{\rm NN}^*$. A very large medium reduction of $σ_{\rm NN}^*$ is needed to reproduce the measured value of the GDR width of $^{208}{\rm Pb}$ at the Research Center for Nuclear Physics in Osaka, Japan.

preprint2020arXiv

Cooperative Multi-Point Vehicular Positioning Using Millimeter-Wave Surface Reflection (Extended version)

Multi-point vehicular positioning is one essential operation for autonomous vehicles. However, the state-of-the-art positioning technologies, relying on reflected signals from a target (i.e., RADAR and LIDAR), cannot work without line-of-sight. Besides, it takes significant time for environment scanning and object recognition with potential detection inaccuracy, especially in complex urban situations. Some recent fatal accidents involving autonomous vehicles further expose such limitations. In this paper, we aim at overcoming these limitations by proposing a novel relative positioning approach, called Cooperative Multi-point Positioning (COMPOP). The COMPOP establishes cooperation between a target vehicle (TV) and a sensing vehicle (SV) if a LoS path exists, where a TV explicitly lets an SV to know the TV's existence by transmitting positioning waveforms. This cooperation makes it possible to remove the time-consuming scanning and target recognizing processes, facilitating real-time positioning. One prerequisite for the cooperation is a clock synchronization between a pair of TV and SV. To this end, we use a phase-differential-of-arrival based approach to remove the TV-SV clock difference from the received signal. With clock difference correction, the TV's position can be obtained via peak detection over a 3D power spectrum constructed by a Fourier transform (FT) based algorithm. The COMPOP also incorporates nearby vehicles, without knowing their locations, into the above cooperation for the case without a LoS path. The effectiveness of the COMPOP is verified by several simulations concerning practical channel parameters.

preprint2020arXiv

Correlators in the supereigenvalue model in the Ramond sector

We investigate the supereigenvalue model in the Ramond sector. We prove that its partition function can be obtained by acting on elementary functions with exponents of the given operators. The Virasoro constraints for this supereigenvalue model are presented. The remarkable property of these bosonic constraint operators is that they obey the Witt algebra and null 3-algebra. The compact expression of correlators can be derived from these Virasoro constraints.

preprint2020arXiv

D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

We propose D3VO as a novel framework for monocular visual odometry that exploits deep networks on three levels -- deep depth, pose and uncertainty estimation. We first propose a novel self-supervised monocular depth estimation network trained on stereo videos without any external supervision. In particular, it aligns the training image pairs into similar lighting condition with predictive brightness transformation parameters. Besides, we model the photometric uncertainties of pixels on the input images, which improves the depth estimation accuracy and provides a learned weighting function for the photometric residuals in direct (feature-less) visual odometry. Evaluation results show that the proposed network outperforms state-of-the-art self-supervised depth estimation networks. D3VO tightly incorporates the predicted depth, pose and uncertainty into a direct visual odometry method to boost both the front-end tracking as well as the back-end non-linear optimization. We evaluate D3VO in terms of monocular visual odometry on both the KITTI odometry benchmark and the EuRoC MAV dataset.The results show that D3VO outperforms state-of-the-art traditional monocular VO methods by a large margin. It also achieves comparable results to state-of-the-art stereo/LiDAR odometry on KITTI and to the state-of-the-art visual-inertial odometry on EuRoC MAV, while using only a single camera.

preprint2020arXiv

Decoding asymptomatic COVID-19 infection and transmission

Coronavirus disease 2019 (COVID-19) is a continuously devastating public health and the world economy. One of the major challenges in controlling the COVID-19 outbreak is its asymptomatic infection and transmission, which are elusive and defenseless in most situations. The pathogenicity and virulence of asymptomatic COVID-19 remain mysterious. Based on the genotyping of 20656 Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) genome isolates, we reveal that asymptomatic infection is linked to SARS-CoV-2 11083G>T mutation, i.e., leucine (L) to phenylalanine (F) substitution at the residue 37 (L37F) of nonstructure protein 6 (NSP6). By analyzing the distribution of 11083G>T in various countries, we unveil that 11083G>T may correlate with the hypotoxicity of SARS-CoV-2. Moreover, we show a global decaying tendency of the 11083G>T mutation ratio indicating that 11083G>T hinders SARS-CoV-2 transmission capacity. Sequence alignment found both NSP6 and residue 37 neighborhoods are relatively conservative over a few coronaviral species, indicating their importance in regulating host cell autophagy to undermine innate cellular defense against viral infection. Using machine learning and topological data analysis, we demonstrate that mutation L37F has made NSP6 energetically less stable. The rigidity and flexibility index and several network models suggest that mutation L37F may have compromised the NSP6 function, leading to a relatively weak SARS-CoV subtype. This assessment is a good agreement with our genotyping of SARS-CoV-2 evolution and transmission across various countries and regions over the past few months.

preprint2020arXiv

Decoding SARS-CoV-2 transmission, evolution and ramification on COVID-19 diagnosis, vaccine, and medicine

Tremendous effort has been given to the development of diagnostic tests, preventive vaccines, and therapeutic medicines for coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Much of this development has been based on the reference genome collected on January 5, 2020. Based on the genotyping of 6156 genome samples collected up to April 24, 2020, we report that SARS-CoV-2 has had 4459 alarmingly mutations which can be clustered into five subtypes. We introduce mutation ratio and mutation $h$-index to characterize the protein conservativeness and unveil that SARS-CoV-2 envelope protein, main protease, and endoribonuclease protein are relatively conservative, while SARS-CoV-2 nucleocapsid protein, spike protein, and papain-like protease are relatively non-conservative. In particular, the nucleocapsid protein has more than half its genes changed in the past few months, signaling devastating impacts on the ongoing development of COVID-19 diagnosis, vaccines, and drugs.

preprint2020arXiv

Deep Audio-Visual Learning: A Survey

Audio-visual learning, aimed at exploiting the relationship between audio and visual modalities, has drawn considerable attention since deep learning started to be used successfully. Researchers tend to leverage these two modalities either to improve the performance of previously considered single-modality tasks or to address new challenging problems. In this paper, we provide a comprehensive survey of recent audio-visual learning development. We divide the current audio-visual learning tasks into four different subfields: audio-visual separation and localization, audio-visual correspondence learning, audio-visual generation, and audio-visual representation learning. State-of-the-art methods as well as the remaining challenges of each subfield are further discussed. Finally, we summarize the commonly used datasets and performance metrics.

preprint2020arXiv

Deep Manifold Prior

We present a prior for manifold structured data, such as surfaces of 3D shapes, where deep neural networks are adopted to reconstruct a target shape using gradient descent starting from a random initialization. We show that surfaces generated this way are smooth, with limiting behavior characterized by Gaussian processes, and we mathematically derive such properties for fully-connected as well as convolutional networks. We demonstrate our method in a variety of manifold reconstruction applications, such as point cloud denoising and interpolation, achieving considerably better results against competitive baselines while requiring no training data. We also show that when training data is available, our method allows developing alternate parametrizations of surfaces under the framework of AtlasNet, leading to a compact network architecture and better reconstruction results on standard image to shape reconstruction benchmarks.

preprint2020arXiv

Deep Reinforcement Learning for Multi-objective Optimization

This study proposes an end-to-end framework for solving multi-objective optimization problems (MOPs) using Deep Reinforcement Learning (DRL), that we call DRL-MOA. The idea of decomposition is adopted to decompose the MOP into a set of scalar optimization subproblems. Then each subproblem is modelled as a neural network. Model parameters of all the subproblems are optimized collaboratively according to a neighborhood-based parameter-transfer strategy and the DRL training algorithm. Pareto optimal solutions can be directly obtained through the trained neural network models. In specific, the multi-objective travelling salesman problem (MOTSP) is solved in this work using the DRL-MOA method by modelling the subproblem as a Pointer Network. Extensive experiments have been conducted to study the DRL-MOA and various benchmark methods are compared with it. It is found that, once the trained model is available, it can scale to newly encountered problems with no need of re-training the model. The solutions can be directly obtained by a simple forward calculation of the neural network; thereby, no iteration is required and the MOP can be always solved in a reasonable time. The proposed method provides a new way of solving the MOP by means of DRL. It has shown a set of new characteristics, e.g., strong generalization ability and fast solving speed in comparison with the existing methods for multi-objective optimizations. Experimental results show the effectiveness and competitiveness of the proposed method in terms of model performance and running time.

preprint2020arXiv

DeepNetQoE: Self-adaptive QoE Optimization Framework of Deep Networks

Future advances in deep learning and its impact on the development of artificial intelligence (AI) in all fields depends heavily on data size and computational power. Sacrificing massive computing resources in exchange for better precision rates of the network model is recognized by many researchers. This leads to huge computing consumption and satisfactory results are not always expected when computing resources are limited. Therefore, it is necessary to find a balance between resources and model performance to achieve satisfactory results. This article proposes a self-adaptive quality of experience (QoE) framework, DeepNetQoE, to guide the training of deep networks. A self-adaptive QoE model is set up that relates the model's accuracy with the computing resources required for training which will allow the experience value of the model to improve. To maximize the experience value when computer resources are limited, a resource allocation model and solutions need to be established. In addition, we carry out experiments based on four network models to analyze the experience values with respect to the crowd counting example. Experimental results show that the proposed DeepNetQoE is capable of adaptively obtaining a high experience value according to user needs and therefore guiding users to determine the computational resources allocated to the network models.

preprint2020arXiv

DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization

For relocalization in large-scale point clouds, we propose the first approach that unifies global place recognition and local 6DoF pose refinement. To this end, we design a Siamese network that jointly learns 3D local feature detection and description directly from raw 3D points. It integrates FlexConv and Squeeze-and-Excitation (SE) to assure that the learned local descriptor captures multi-level geometric information and channel-wise relations. For detecting 3D keypoints we predict the discriminativeness of the local descriptors in an unsupervised manner. We generate the global descriptor by directly aggregating the learned local descriptors with an effective attention mechanism. In this way, local and global 3D descriptors are inferred in one single forward pass. Experiments on various benchmarks demonstrate that our method achieves competitive results for both global point cloud retrieval and local point cloud registration in comparison to state-of-the-art approaches. To validate the generalizability and robustness of our 3D keypoints, we demonstrate that our method also performs favorably without fine-tuning on the registration of point clouds that were generated by a visual SLAM system. Code and related materials are available at https://vision.in.tum.de/research/vslam/dh3d.

preprint2020arXiv

DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation

Scene understanding from images is a challenging problem encountered in autonomous driving. On the object level, while 2D methods have gradually evolved from computing simple bounding boxes to delivering finer grained results like instance segmentations, the 3D family is still dominated by estimating 3D bounding boxes. In this paper, we propose a novel approach to jointly infer the 3D rigid-body poses and shapes of vehicles from a stereo image pair using shape priors. Unlike previous works that geometrically align shapes to point clouds from dense stereo reconstruction, our approach works directly on images by combining a photometric and a silhouette alignment term in the energy function. An adaptive sparse point selection scheme is proposed to efficiently measure the consistency with both terms. In experiments, we show superior performance of our method on 3D pose and shape estimation over the previous geometric approach and demonstrate that our method can also be applied as a refinement step and significantly boost the performances of several state-of-the-art deep learning based 3D object detectors. All related materials and demonstration videos are available at the project page https://vision.in.tum.de/research/vslam/direct-shape.

preprint2020arXiv

Double ramification cycles with orbifold targets

In this paper, we consider double ramification cycles with orbifold targets. An explicit formula for double ramification cycles with orbifold targets, which is parallel to and generalizes the one known for the smooth case, is provided. Some applications for orbifold Gromov--Witten theory are also included.

preprint2020arXiv

Energy Efficiency Analysis of Intelligent Reflecting Surface System with Hardware Impairments

Recently, as explosive growth of mobile data traffic, the performance of wireless communication systems requires to be enhanced significantly in future. Intelligent reflecting surface (IRS) can be used as a promising way to improve the energy efficiency of wireless communications with less complexity and hardware cost. Most existing studies consider the cases with ideal hardware, however, both physical transceiver and IRS suffer from non-negligible hardware impairments which may greatly degrade the system performance. In this paper, by taking hardware impairments into consideration, we focus on the energy efficiency analysis of IRS system. Our first contribution is to derive the optimal receive combining and transmit beamforming vectors. After that, we characterize the asymptotic channel capacity. With the derived asymptotic channel capacity and the power consumption model, the analytical upper and lower bounds of energy efficiency are provided. Our results show that an IRS system can achieve both high spectral efficiency and high energy efficiency with moderate number of antennas. This observation is encouraging for that there is no need to cost a lot on expensive high-quality antennas, which corresponds to the requirements of new communication paradigms.

preprint2020arXiv

Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions

Creating open-ended algorithms, which generate their own never-ending stream of novel and appropriately challenging learning opportunities, could help to automate and accelerate progress in machine learning. A recent step in this direction is the Paired Open-Ended Trailblazer (POET), an algorithm that generates and solves its own challenges, and allows solutions to goal-switch between challenges to avoid local optima. However, the original POET was unable to demonstrate its full creative potential because of limitations of the algorithm itself and because of external issues including a limited problem space and lack of a universal progress measure. Importantly, both limitations pose impediments not only for POET, but for the pursuit of open-endedness in general. Here we introduce and empirically validate two new innovations to the original algorithm, as well as two external innovations designed to help elucidate its full potential. Together, these four advances enable the most open-ended algorithmic demonstration to date. The algorithmic innovations are (1) a domain-general measure of how meaningfully novel new challenges are, enabling the system to potentially create and solve interesting challenges endlessly, and (2) an efficient heuristic for determining when agents should goal-switch from one problem to another (helping open-ended search better scale). Outside the algorithm itself, to enable a more definitive demonstration of open-endedness, we introduce (3) a novel, more flexible way to encode environmental challenges, and (4) a generic measure of the extent to which a system continues to exhibit open-ended innovation. Enhanced POET produces a diverse range of sophisticated behaviors that solve a wide range of environmental challenges, many of which cannot be solved through other means.

preprint2020arXiv

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still learning off-policy. We highlight the benefits of this approach in terms of value function transfer, learning within redundant action spaces, and learning off-policy from state observations generated by sub-optimal or completely random policies. Code and videos are available at http://sites.google.com/view/qss-paper.

preprint2020arXiv

Explicit Reordering for Neural Machine Translation

In Transformer-based neural machine translation (NMT), the positional encoding mechanism helps the self-attention networks to learn the source representation with order dependency, which makes the Transformer-based NMT achieve state-of-the-art results for various translation tasks. However, Transformer-based NMT only adds representations of positions sequentially to word vectors in the input sentence and does not explicitly consider reordering information in this sentence. In this paper, we first empirically investigate the relationship between source reordering information and translation performance. The empirical findings show that the source input with the target order learned from the bilingual parallel dataset can substantially improve translation performance. Thus, we propose a novel reordering method to explicitly model this reordering information for the Transformer-based NMT. The empirical results on the WMT14 English-to-German, WAT ASPEC Japanese-to-English, and WMT17 Chinese-to-English translation tasks show the effectiveness of the proposed approach.

preprint2020arXiv

Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods

Recent advances in machine learning are consistently enabled by increasing amounts of computation. Reinforcement learning (RL) and population-based methods in particular pose unique challenges for efficiency and flexibility to the underlying distributed computing frameworks. These challenges include frequent interaction with simulations, the need for dynamic scaling, and the need for a user interface with low adoption cost and consistency across different backends. In this paper we address these challenges while still retaining development efficiency and flexibility for both research and practical applications by introducing Fiber, a scalable distributed computing framework for RL and population-based methods. Fiber aims to significantly expand the accessibility of large-scale parallel computation to users of otherwise complicated RL and population-based approaches without the need to for specialized computational expertise.

preprint2020arXiv

First-Order Preconditioning via Hypergradient Descent

Standard gradient descent methods are susceptible to a range of issues that can impede training, such as high correlations and different scaling in parameter space.These difficulties can be addressed by second-order approaches that apply a pre-conditioning matrix to the gradient to improve convergence. Unfortunately, such algorithms typically struggle to scale to high-dimensional problems, in part because the calculation of specific preconditioners such as the inverse Hessian or Fisher information matrix is highly expensive. We introduce first-order preconditioning (FOP), a fast, scalable approach that generalizes previous work on hypergradient descent (Almeida et al., 1998; Maclaurin et al., 2015; Baydin et al.,2017) to learn a preconditioning matrix that only makes use of first-order information. Experiments show that FOP is able to improve the performance of standard deep learning optimizers on visual classification and reinforcement learning tasks with minimal computational overhead. We also investigate the properties of the learned preconditioning matrices and perform a preliminary theoretical analysis of the algorithm.

preprint2020arXiv

Graph-to-Sequence Neural Machine Translation

Neural machine translation (NMT) usually works in a seq2seq learning way by viewing either source or target sentence as a linear sequence of words, which can be regarded as a special case of graph, taking words in the sequence as nodes and relationships between words as edges. In the light of the current NMT models more or less capture graph information among the sequence in a latent way, we present a graph-to-sequence model facilitating explicit graph information capturing. In detail, we propose a graph-based SAN-based NMT model called Graph-Transformer by capturing information of subgraphs of different orders in every layers. Subgraphs are put into different groups according to their orders, and every group of subgraphs respectively reflect different levels of dependency between words. For fusing subgraph representations, we empirically explore three methods which weight different groups of subgraphs of different orders. Results of experiments on WMT14 English-German and IWSLT14 German-English show that our method can effectively boost the Transformer with an improvement of 1.1 BLEU points on WMT14 English-German dataset and 1.0 BLEU points on IWSLT14 German-English dataset.

preprint2020arXiv

Grasp State Assessment of Deformable Objects Using Visual-Tactile Fusion Perception

Humans can quickly determine the force required to grasp a deformable object to prevent its sliding or excessive deformation through vision and touch, which is still a challenging task for robots. To address this issue, we propose a novel 3D convolution-based visual-tactile fusion deep neural network (C3D-VTFN) to evaluate the grasp state of various deformable objects in this paper. Specifically, we divide the grasp states of deformable objects into three categories of sliding, appropriate and excessive. Also, a dataset for training and testing the proposed network is built by extensive grasping and lifting experiments with different widths and forces on 16 various deformable objects with a robotic arm equipped with a wrist camera and a tactile sensor. As a result, a classification accuracy as high as 99.97% is achieved. Furthermore, some delicate grasp experiments based on the proposed network are implemented in this paper. The experimental results demonstrate that the C3D-VTFN is accurate and efficient enough for grasp state assessment, which can be widely applied to automatic force control, adaptive grasping, and other visual-tactile spatiotemporal sequence learning problems.

preprint2020arXiv

Hardware-irrelevant parallel processing system

Parallel processing technology has been a primary tool for achieving high-speed, high-accuracy, and broadband processing for many years across modern information systems and data processing such as optical and radar, synthetic aperture radar imaging, digital beam forming, and digital filtering systems. However, hardware deviations in a parallel processing system (PPS) severely degrade system performance and pose an urgent challenge. We propose a hardware-irrelevant PPS of which the performance is unaffected by hardware deviations. In this system, an embedded convolutional recurrent autoencoder (CRAE), which learns inherent system patterns as well as acquires and removes adverse effects brought by hardware deviations, is adopted. We implement a hardware-irrelevant PPS into a parallel photonic sampling system to accomplish a high-performance analog-to-digital conversion for microwave signals with high frequency and broad bandwidth. Under one system state, a category of signals with two different mismatch degrees is utilized to train the CRAE, which can then compensate for mismatches in various categories of signals with multiple mismatch degrees under random system states. Our approach is extensively applicable to achieving hardware-irrelevant PPSs which are either discrete or integrated in photonic, electric, and other fields.

preprint2020arXiv

Host immune response driving SARS-CoV-2 evolution

The transmission and evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are of paramount importance to the controlling and combating of coronavirus disease 2019 (COVID-19) pandemic. Currently, near 15,000 SARS-CoV-2 single mutations have been recorded, having a great ramification to the development of diagnostics, vaccines, antibody therapies, and drugs. However, little is known about SARS-CoV-2 evolutionary characteristics and general trend. In this work, we present a comprehensive genotyping analysis of existing SARS-CoV-2 mutations. We reveal that host immune response via APOBEC and ADAR gene editing gives rise to near 65\% of recorded mutations. Additionally, we show that children under age five and the elderly may be at high risk from COVID-19 because of their overreacting to the viral infection. Moreover, we uncover that populations of Oceania and Africa react significantly more intensively to SARS-CoV-2 infection than those of Europe and Asia, which may explain why African Americans were shown to be at increased risk of dying from COVID-19, in addition to their high risk of getting sick from COVID-19 caused by systemic health and social inequities. Finally, our study indicates that for two viral genome sequences of the same origin, their evolution order may be determined from the ratio of mutation type C$>$T over T$>$C.

preprint2020arXiv

Intrinsic quantum anomalous Hall phase induced by proximity in germanene/Cr$_2$Ge$_2$Te$_6$ van der Waals heterostructure

A van der Waals heterostructure combined with intrinsic magnetism and topological orders have recently paved attractive avenues to realize quantum anomalous Hall effects. In this work, using first-principles calculations and effective model analysis, we propose that the robust quantum anomalous Hall states with sizable band gaps emerge in the van der Waals heterostructure of germanene/Cr$_2$Ge$_2$Te$_6$. This heterostructure possesses high thermodynamic stability, thus facilitating its experimental fabrication. Furthermore, we uncover that the proximity effect enhances the coupling between the germanene and Cr$_2$Ge$_2$Te$_6$ layers, inducing the nontrivial band gaps in a wide range from 29 meV to 72 meV. The chiral edge states inside the band gap, leading to Hall conductance quantized to $-e^2/h$, are clearly visible. This findings provide an ideal candidate to detect the quantum anomalous Hall states and realize further applications to nontrivial quantum transport at a high temperature.

preprint2020arXiv

Joint Optimization of File Placement and Delivery in Cache-Assisted Wireless Networks with Limited Lifetime and Cache Space

In this paper, the scheduling of downlink file transmission in one cell with the assistance of cache nodes with finite cache space is studied. Specifically, requesting users arrive randomly and the base station (BS) reactively multicasts files to the requesting users and selected cache nodes. The latter can offload the traffic in their coverage areas from the BS. We consider the joint optimization of the abovementioned file placement and delivery within a finite lifetime subject to the cache space constraint. Within the lifetime, the allocation of multicast power and symbol number for each file transmission at the BS is formulated as a dynamic programming problem with a random stage number. Note that there are no existing solutions to this problem. We develop an asymptotically optimal solution framework by transforming the original problem to an equivalent finite-horizon Markov decision process (MDP) with a fixed stage number. A novel approximation approach is then proposed to address the curse of dimensionality, where the analytical expressions of approximate value functions are provided. We also derive analytical bounds on the exact value function and approximation error. The approximate value functions depend on some system statistics, e.g., requesting users' distribution. One reinforcement learning algorithm is proposed for the scenario where these statistics are unknown.

preprint2020arXiv

Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation

Unsupervised neural machine translation (UNMT) has recently achieved remarkable results for several language pairs. However, it can only translate between a single language pair and cannot produce translation results for multiple language pairs at the same time. That is, research on multilingual UNMT has been limited. In this paper, we empirically introduce a simple method to translate between thirteen languages using a single encoder and a single decoder, making use of multilingual data to improve UNMT for all language pairs. On the basis of the empirical findings, we propose two knowledge distillation methods to further enhance multilingual UNMT performance. Our experiments on a dataset with English translated to and from twelve other languages (including three language families and six language branches) show remarkable results, surpassing strong unsupervised individual baselines while achieving promising performance between non-English language pairs in zero-shot translation scenarios and alleviating poor performance in low-resource language pairs.

preprint2020arXiv

Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions

The problems of shape classification and part segmentation from 3D point clouds have garnered increasing attention in the last few years. Both of these problems, however, suffer from relatively small training sets, creating the need for statistically efficient methods to learn 3D shape representations. In this paper, we investigate the use of Approximate Convex Decompositions (ACD) as a self-supervisory signal for label-efficient learning of point cloud representations. We show that using ACD to approximate ground truth segmentation provides excellent self-supervision for learning 3D point cloud representations that are highly effective on downstream tasks. We report improvements over the state-of-the-art for unsupervised representation learning on the ModelNet40 shape classification dataset and significant gains in few-shot part segmentation on the ShapeNetPart dataset.Code available at https://github.com/matheusgadelha/PointCloudLearningACD

preprint2020arXiv

Learning Centric Power Allocation for Edge Intelligence

While machine-type communication (MTC) devices generate massive data, they often cannot process this data due to limited energy and computation power. To this end, edge intelligence has been proposed, which collects distributed data and performs machine learning at the edge. However, this paradigm needs to maximize the learning performance instead of the communication throughput, for which the celebrated water-filling and max-min fairness algorithms become inefficient since they allocate resources merely according to the quality of wireless channels. This paper proposes a learning centric power allocation (LCPA) method, which allocates radio resources based on an empirical classification error model. To get insights into LCPA, an asymptotic optimal solution is derived. The solution shows that the transmit powers are inversely proportional to the channel gain, and scale exponentially with the learning parameters. Experimental results show that the proposed LCPA algorithm significantly outperforms other power allocation algorithms.

preprint2020arXiv

Learning Generative Models of Shape Handles

We present a generative model to synthesize 3D shapes as sets of handles -- lightweight proxies that approximate the original 3D shape -- for applications in interactive editing, shape parsing, and building compact 3D representations. Our model can generate handle sets with varying cardinality and different types of handles (Figure 1). Key to our approach is a deep architecture that predicts both the parameters and existence of shape handles, and a novel similarity measure that can easily accommodate different types of handles, such as cuboids or sphere-meshes. We leverage the recent advances in semantic 3D annotation as well as automatic shape summarizing techniques to supervise our approach. We show that the resulting shape representations are intuitive and achieve superior quality than previous state-of-the-art. Finally, we demonstrate how our method can be used in applications such as interactive shape editing, completion, and interpolation, leveraging the latent space learned by our model to guide these tasks. Project page: http://mgadelha.me/shapehandles.

preprint2020arXiv

Long-term continuous assessment of SRAM PUF and source of random numbers

The qualities of Physical Unclonable Functions (PUFs) suffer from several noticeable degradations due to silicon aging. In this paper, we investigate the long-term effects of silicon aging on PUFs derived from the start-up behavior of Static Random Access Memories (SRAM). Previous research on SRAM aging is based on transistor-level simulation or accelerated aging test at high temperature and voltage to observe aging effects within a short period of time. In contrast, we have run a long-term continuous power-up test on 16 Arduino Leonardo boards under nominal conditions for two years. In total, we collected around 175 million measurements for reliability, uniqueness and randomness evaluations. Analysis shows that the number of bits that flip with respect to the reference increased by 19.3% while min-entropy of SRAM PUF noise improves by 19.3% on average after two years of aging. The impact of aging on reliability is smaller under nominal conditions than was previously assessed by the accelerated aging test. The test we conduct in this work more closely resembles the conditions of a device in the field, and therefore we more accurately evaluate how silicon aging affects SRAM PUFs.

preprint2020arXiv

M Subdwarf Research. II. Atmospheric Parameters and Kinematics

Applying the revised M subdwarf classification criteria discussed in Paper I to LAMOST DR7, combining the M subdwarf sample from Savcheva et al, a new M subdwarf sample was constructed for further study. The atmospheric parameters for each object were derived fitting with the PHOENIX grid, combining with Gaia DR2, the relationship between the gravity and metallicity were explored according to the locus both in the color-absolute magnitude diagram and the reduced proper motion diagram. Objects that have both the largest gravity and the lowest metallicity are located away from the main-sequence cloud and may be considered as the intrinsic M subdwarfs, which can be classified as luminosity class VI. Another group of objects whose spectra show typical M subdwarf characters have lower gravity and relatively moderate metal deficiency and occupy part of the ordinary M dwarf region in both diagrams. The Galactic U , V , W space velocity components and their dispersion show that the local Galactic halo population sampled in the solar neighborhood is represented by objects of high gravity and an inconspicuous bimodal metallicity distribution, with a fraction of prograde orbits. The other M subdwarfs seem to partly belong to the thick disk component with a significant fraction of thin disk moderately metal-poor objects intricately mixed with them. However, the selection effects, especially the favored anti-center direction of investigation in the LAMOST sub-sample, but also contamination by multiplicity and parameter coupling could play important roles and need to be further investigated.

preprint2020arXiv

Machine Intelligence at the Edge with Learning Centric Power Allocation

While machine-type communication (MTC) devices generate considerable amounts of data, they often cannot process the data due to limited energy and computational power. To empower MTC with intelligence, edge machine learning has been proposed. However, power allocation in this paradigm requires maximizing the learning performance instead of the communication throughput, for which the celebrated water-filling and max-min fairness algorithms become inefficient. To this end, this paper proposes learning centric power allocation (LCPA), which provides a new perspective on radio resource allocation in learning driven scenarios. By employing 1) an empirical classification error model that is supported by learning theory and 2) an uncertainty sampling method that accounts for different distributions at users, LCPA is formulated as a nonconvex nonsmooth optimization problem, and is solved using a majorization minimization (MM) framework. To get deeper insights into LCPA, asymptotic analysis shows that the transmit powers are inversely proportional to the channel gains, and scale exponentially with the learning parameters. This is in contrast to traditional power allocations where quality of wireless channels is the only consideration. Last but not least, a large-scale optimization algorithm termed mirror-prox LCPA is further proposed to enable LCPA in large-scale settings. Extensive numerical results demonstrate that the proposed LCPA algorithms outperform traditional power allocation algorithms, and the large-scale optimization algorithm reduces the computation time by orders of magnitude compared with MM-based LCPA but still achieves competing learning performance.

preprint2020arXiv

Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond

Machine reading comprehension (MRC) aims to teach machines to read and comprehend human languages, which is a long-standing goal of natural language processing (NLP). With the burst of deep neural networks and the evolution of contextualized language models (CLMs), the research of MRC has experienced two significant breakthroughs. MRC and CLM, as a phenomenon, have a great impact on the NLP community. In this survey, we provide a comprehensive and comparative review on MRC covering overall research topics about 1) the origin and development of MRC and CLM, with a particular focus on the role of CLMs; 2) the impact of MRC and CLM to the NLP community; 3) the definition, datasets, and evaluation of MRC; 4) general MRC architecture and technical methods in the view of two-stage Encoder-Decoder solving architecture from the insights of the cognitive process of humans; 5) previous highlights, emerging topics, and our empirical analysis, among which we especially focus on what works in different periods of MRC researches. We propose a full-view categorization and new taxonomies on these topics. The primary views we have arrived at are that 1) MRC boosts the progress from language processing to understanding; 2) the rapid improvement of MRC systems greatly benefits from the development of CLMs; 3) the theme of MRC is gradually moving from shallow text matching to cognitive reasoning.

preprint2020arXiv

Many-body Resonance in a Correlated Topological Kagome Antiferromagnet

We use scanning tunneling microscopy/spectroscopy (STM/S) to elucidate the atomically resolved electronic structure in strongly correlated topological kagome magnet Mn$_3$Sn. In stark contrast to its broad single-particle electronic structure, we observe a pronounced resonance with a Fano line shape at the Fermi level resembling the many-body Kondo resonance. We find that this resonance does not arise from the step edges or atomic impurities, but the intrinsic kagome lattice. Moreover, the resonance is robust against the perturbation of a vector magnetic field, but broadens substantially with increasing temperature, signaling strongly interacting physics. We show that this resonance can be understood as the result of geometrical frustration and strong correlation based on the kagome lattice Hubbard model. Our results point to the emergent many-body resonance behavior in a topological kagome magnet.

preprint2020arXiv

Modeling Future Cost for Neural Machine Translation

Existing neural machine translation (NMT) systems utilize sequence-to-sequence neural networks to generate target translation word by word, and then make the generated word at each time-step and the counterpart in the references as consistent as possible. However, the trained translation model tends to focus on ensuring the accuracy of the generated target word at the current time-step and does not consider its future cost which means the expected cost of generating the subsequent target translation (i.e., the next target word). To respond to this issue, we propose a simple and effective method to model the future cost of each target word for NMT systems. In detail, a time-dependent future cost is estimated based on the current generated target word and its contextual information to boost the training of the NMT model. Furthermore, the learned future context representation at the current time-step is used to help the generation of the next target word in the decoding. Experimental results on three widely-used translation datasets, including the WMT14 German-to-English, WMT14 English-to-French, and WMT17 Chinese-to-English, show that the proposed approach achieves significant improvements over strong Transformer-based NMT baseline.

preprint2020arXiv

Multi-Domain Dialogue Acts and Response Co-Generation

Generating fluent and informative responses is of critical importance for task-oriented dialogue systems. Existing pipeline approaches generally predict multiple dialogue acts first and use them to assist response generation. There are at least two shortcomings with such approaches. First, the inherent structures of multi-domain dialogue acts are neglected. Second, the semantic associations between acts and responses are not taken into account for response generation. To address these issues, we propose a neural co-generation model that generates dialogue acts and responses concurrently. Unlike those pipeline approaches, our act generation module preserves the semantic structures of multi-domain dialogue acts and our response generation module dynamically attends to different acts as needed. We train the two modules jointly using an uncertainty loss to adjust their task weights adaptively. Extensive experiments are conducted on the large-scale MultiWOZ dataset and the results show that our model achieves very favorable improvement over several state-of-the-art models in both automatic and human evaluations.

preprint2020arXiv

Mutations on COVID-19 diagnostic targets

Effective, sensitive, and reliable diagnostic reagents are of paramount importance for combating the ongoing coronavirus disease 2019 (COVID-19) pandemic at a time there is no preventive vaccine nor specific drug available for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). It would be an absolute tragedy if currently used diagnostic reagents are undermined in any manner. Based on the genotyping of 7818 SARS-CoV-2 genome samples collected up to May 1, 2020, we reveal that essentially all of the current COVID-19 diagnostic targets have had mutations. We further show that SARS-CoV-2 has the most devastating mutations on the targets of various nucleocapsid (N) gene primers and probes, which have been unfortunately used by countries around the world to diagnose COVID-19. Our findings explain what has seriously gone wrong with a specific diagnostic reagent made in China. To understand whether SARS-CoV-2 genes have mutated unevenly, we have computed the mutation ratio and mutation $h$-index of all SARS-CoV genes, indicating that the N gene is the most non-conservative gene in the SARS-CoV-2 genome. Our findings enable researchers to target the most conservative SARS-CoV-2 genes and proteins for the design and development of COVID-19 diagnostic reagents, preventive vaccines, and therapeutic medicines.

preprint2020arXiv

Mutations strengthened SARS-CoV-2 infectivity

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infectivity is a major concern in coronavirus disease 2019 (COVID-19) prevention and economic reopening. However, rigorous determination of SARS-COV-2 infectivity is essentially impossible owing to its continuous evolution with over 13752 single nucleotide polymorphisms (SNP) variants in six different subtypes. We develop an advanced machine learning algorithm based on the algebraic topology to quantitatively evaluate the binding affinity changes of SARS-CoV-2 spike glycoprotein (S protein) and host angiotensin-converting enzyme 2 (ACE2) receptor following the mutations. Based on mutation-induced binding affinity changes, we reveal that five out of six SARS-CoV-2 subtypes have become either moderately or slightly more infectious, while one subtype has weakened its infectivity. We find that SARS-CoV-2 is slightly more infectious than SARS-CoV according to computed S protein-ACE2 binding affinity changes. Based on a systematic evaluation of all possible 3686 future mutations on the S protein receptor-binding domain (RBD), we show that most likely future mutations will make SARS-CoV-2 more infectious. Combining sequence alignment, probability analysis, and binding affinity calculation, we predict that a few residues on the receptor-binding motif (RBM), i.e., 452, 489, 500, 501, and 505, have very high chances to mutate into significantly more infectious COVID-19 strains.

preprint2020arXiv

Neural Topic Modeling with Bidirectional Adversarial Training

Recent years have witnessed a surge of interests of using neural topic models for automatic topic extraction from text, since they avoid the complicated mathematical derivations for model inference as in traditional topic models such as Latent Dirichlet Allocation (LDA). However, these models either typically assume improper prior (e.g. Gaussian or Logistic Normal) over latent topic space or could not infer topic distribution for a given document. To address these limitations, we propose a neural topic modeling approach, called Bidirectional Adversarial Topic (BAT) model, which represents the first attempt of applying bidirectional adversarial training for neural topic modeling. The proposed BAT builds a two-way projection between the document-topic distribution and the document-word distribution. It uses a generator to capture the semantic patterns from texts and an encoder for topic inference. Furthermore, to incorporate word relatedness information, the Bidirectional Adversarial Topic model with Gaussian (Gaussian-BAT) is extended from BAT. To verify the effectiveness of BAT and Gaussian-BAT, three benchmark corpora are used in our experiments. The experimental results show that BAT and Gaussian-BAT obtain more coherent topics, outperforming several competitive baselines. Moreover, when performing text clustering based on the extracted topics, our models outperform all the baselines, with more significant improvements achieved by Gaussian-BAT where an increase of near 6\% is observed in accuracy.

preprint2020arXiv

Orbifold Gromov--Witten theory of weighted blowups

Consider a compact symplectic sub-orbifold groupoid $\sf S$ of a compact symplectic orbifold groupoid $(\mathsf X,ω)$. Let $\mathsf X_{\mathfrak a}$ be the weight-$\mathfrak a$ blowup of $\sf X$ along $\sf S$, and $\mathsf D_{\mathfrak a}=\mathsf{PN}_{\mathfrak a}$ be the exceptional divisor, where $\sf N$ is the normal bundle of $\sf S$ in $\sf X$. In this paper we show that the absolute orbifold Gromov--Witten theory of $\mathsf X_{\mathfrak a}$ can be effectively and uniquely reconstructed from the absolute orbifold Gromov--Witten theories of $\sf X$, $\sf S$ and $\mathsf D_{\mathfrak a}$, the natural restriction homomorphism $H^*_{\text{CR}}({\sf X})\rightarrow H^*_{\text{CR}}({\sf S})$ and the first Chern class of the tautological line bundle over $\mathsf D_{\mathfrak a}$. To achieve this we first prove similar results for the relative orbifold Gromov--Witten theories of $(\mathsf X_{\mathfrak a}|\mathsf D_{\mathfrak a})$ and $(\mathsf N_{\mathfrak a}|\mathsf D_{\mathfrak a})$. As applications of these results, we prove an orbifold version of a conjecture of Maulik--Pandharipande on the Gromov--Witten theory of blowups along complete intersections, a conjecture on the Gromov--Witten theory of root constructions and a conjecture on Leray--Hirsch result for orbifold Gromov--Witten theory of Tseng--You.

preprint2020arXiv

Propensity Score Weighting for Covariate Adjustment in Randomized Clinical Trials

Chance imbalance in baseline characteristics is common in randomized clinical trials. Regression adjustment such as the analysis of covariance (ANCOVA) is often used to account for imbalance and increase precision of the treatment effect estimate. An objective alternative is through inverse probability weighting (IPW) of the propensity scores. Although IPW and ANCOVA are asymptotically equivalent, the former may demonstrate inferior performance in finite samples. In this article, we point out that IPW is a special case of the general class of balancing weights, and advocate to use overlap weighting (OW) for covariate adjustment. The OW method has a unique advantage of completely removing chance imbalance when the propensity score is estimated by logistic regression. We show that the OW estimator attains the same semiparametric variance lower bound as the most efficient ANCOVA estimator and the IPW estimator for a continuous outcome, and derive closed-form variance estimators for OW when estimating additive and ratio estimands. Through extensive simulations, we demonstrate OW consistently outperforms IPW in finite samples and improves the efficiency over ANCOVA and augmented IPW when the degree of treatment effect heterogeneity is moderate or when the outcome model is incorrectly specified. We apply the proposed OW estimator to the Best Apnea Interventions for Research (BestAIR) randomized trial to evaluate the effect of continuous positive airway pressure on patient health outcomes. All the discussed propensity score weighting methods are implemented in the R package PSweight.

preprint2020arXiv

Reconfigurable Intelligent Surface Aided Wireless Localization

The advantages of millimeter-wave and large antenna arrays technologies for accurate wireless localization received extensive attentions recently. However, how to further improve the accuracy of wireless localization, even in the case of obstructed line-of-sight, is largely undiscovered. In this paper, the reconfigurable intelligent surface (RIS) is introduced into the system to make the positioning more accurate. First, we establish the three-dimensional RIS-assisted wireless localization channel model. After that, we derive the Fisher information matrix and the Cramer-Rao lower bound for evaluating the estimation of absolute mobile station position. Finally, we propose an alternative optimization method and a gradient decent method to optimize the reflect beamforming, which aims to minimize the Cramer-Rao lower bound to obtain a more accurate estimation. Our results show that the proposed methods significantly improve the accuracy of positioning, and decimeter-level or even centimeter-level positioning can be achieved by utilizing the RIS with a large number of reflecting elements.

preprint2020arXiv

Regularized Context Gates on Transformer for Machine Translation

Context gates are effective to control the contributions from the source and target contexts in the recurrent neural network (RNN) based neural machine translation (NMT). However, it is challenging to extend them into the advanced Transformer architecture, which is more complicated than RNN. This paper first provides a method to identify source and target contexts and then introduce a gate mechanism to control the source and target contributions in Transformer. In addition, to further reduce the bias problem in the gate mechanism, this paper proposes a regularization method to guide the learning of the gates with supervision automatically generated using pointwise mutual information. Extensive experiments on 4 translation datasets demonstrate that the proposed model obtains an averaged gain of 1.0 BLEU score over a strong Transformer baseline.

preprint2020arXiv

Relational Graph Attention Network for Aspect-based Sentiment Analysis

Aspect-based sentiment analysis aims to determine the sentiment polarity towards a specific aspect in online reviews. Most recent efforts adopt attention-based neural network models to implicitly connect aspects with opinion words. However, due to the complexity of language and the existence of multiple aspects in a single sentence, these models often confuse the connections. In this paper, we address this problem by means of effective encoding of syntax information. Firstly, we define a unified aspect-oriented dependency tree structure rooted at a target aspect by reshaping and pruning an ordinary dependency parse tree. Then, we propose a relational graph attention network (R-GAT) to encode the new tree structure for sentiment prediction. Extensive experiments are conducted on the SemEval 2014 and Twitter datasets, and the experimental results confirm that the connections between aspects and opinion words can be better established with our approach, and the performance of the graph attention network (GAT) is significantly improved as a consequence.

preprint2020arXiv

Repositioning of 8565 existing drugs for COVID-19

The coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has infected near 5 million people and led to over 0.3 million deaths. Currently, there is no specific anti-SARS-CoV-2 medication. New drug discovery typically takes more than ten years. Drug repositioning becomes one of the most feasible approaches for combating COVID-19. This work curates the largest available experimental dataset for SARS-CoV-2 or SARS-CoV main protease inhibitors. Based on this dataset, we develop validated machine learning models with relatively low root mean square error to screen 1553 FDA-approved drugs as well as other 7012 investigational or off-market drugs in DrugBank. We found that many existing drugs might be potentially potent to SARS-CoV-2. The druggability of many potent SARS-CoV-2 main protease inhibitors is analyzed. This work offers a foundation for further experimental studies of COVID-19 drug repositioning.

preprint2020arXiv

Representer Theorems in Banach Spaces: Minimum Norm Interpolation, Regularized Learning and Semi-Discrete Inverse Problems

Constructing or learning a function from a finite number of sampled data points (measurements) is a fundamental problem in science and engineering. This is often formulated as a minimum norm interpolation problem, regularized learning problem or, in general, a semi-discrete inverse problem, in certain functional spaces. The choice of an appropriate space is crucial for solutions of these problems. Motivated by sparse representations of the reconstructed functions such as compressed sensing and sparse learning, much of the recent research interest has been directed to considering these problems in certain Banach spaces in order to obtain their sparse solutions, which is a feasible approach to overcome challenges coming from the big data nature of most practical applications. It is the goal of this paper to provide a systematic study of the representer theorems for these problems in Banach spaces. There are a few existing results for these problems in a Banach space, with all of them regarding implicit representer theorems. We aim at obtaining explicit representer theorems based on which convenient solution methods will then be developed. For the minimum norm interpolation, the explicit representer theorems enable us to express the infimum in terms of the norm of the linear combination of the interpolation functionals. For the purpose of developing efficient computational algorithms, we establish the fixed-point equation formulation of solutions of these problems. We reveal that unlike in a Hilbert space, in general, solutions of these problems in a Banach space may not be able to be reduced to truly finite dimensional problems (with certain infinite dimensional components hidden). We demonstrate how this obstacle can be removed, reducing the original problem to a truly finite dimensional one, in the special case when the Banach space is $\ell_1(\mathbb{N})$.

preprint2020arXiv

Review of COVID-19 Antibody Therapies

Under the global health emergency caused by coronavirus disease 2019 (COVID-19), efficient and specific therapies are urgently needed. Compared with traditional small-molecular drugs, antibody therapies are relatively easy to develop and as specific as vaccines in targeting severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and thus attract much attention in the past few months. This work reviews seven existing antibodies for SARS-CoV-2 spike (S) protein with three-dimensional (3D) structures deposited in the Protein Data Bank. Five antibody structures associated with SARS-CoV are evaluated for their potential in neutralizing SARS-CoV-2. The interactions of these antibodies with the S protein receptor-binding domain (RBD) are compared with those of angiotensin-converting enzyme 2 (ACE2) and RBD complexes. Due to the orders of magnitude in the discrepancies of experimental binding affinities, we introduce topological data analysis (TDA), a variety of network models, and deep learning to analyze the binding strength and therapeutic potential of the aforementioned fourteen antibody-antigen complexes. The current COVID-19 antibody clinical trials, which are not limited to the S protein target, are also reviewed.

preprint2020arXiv

Revisiting Simple Domain Adaptation Methods in Unsupervised Neural Machine Translation

Domain adaptation has been well-studied in supervised neural machine translation (SNMT). However, it has not been well-studied for unsupervised neural machine translation (UNMT), although UNMT has recently achieved remarkable results in several domain-specific language pairs. Besides the inconsistent domains between training data and test data for SNMT, there sometimes exists an inconsistent domain between two monolingual training data for UNMT. In this work, we empirically show different scenarios for unsupervised neural machine translation. Based on these scenarios, we revisit the effect of the existing domain adaptation methods including batch weighting and fine tuning methods in UNMT. Finally, we propose modified methods to improve the performances of domain-specific UNMT systems.

preprint2020arXiv

Safe and Effective Picking Paths in Clutter given Discrete Distributions of Object Poses

Picking an item in the presence of other objects can be challenging as it involves occlusions and partial views. Given object models, one approach is to perform object pose estimation and use the most likely candidate pose per object to pick the target without collisions. This approach, however, ignores the uncertainty of the perception process both regarding the target's and the surrounding objects' poses. This work proposes first a perception process for 6D pose estimation, which returns a discrete distribution of object poses in a scene. Then, an open-loop planning pipeline is proposed to return safe and effective solutions for moving a robotic arm to pick, which (a) minimizes the probability of collision with the obstructing objects; and (b) maximizes the probability of reaching the target item. The planning framework models the challenge as a stochastic variant of the Minimum Constraint Removal (MCR) problem. The effectiveness of the methodology is verified given both simulated and real data in different scenarios. The experiments demonstrate the importance of considering the uncertainty of the perception process in terms of safe execution. The results also show that the methodology is more effective than conservative MCR approaches, which avoid all possible object poses regardless of the reported uncertainty.

preprint2020arXiv

Scheduling for Mobile Edge Computing with Random User Arrivals: An Approximate MDP and Reinforcement Learning Approach

In this paper, we investigate the scheduling design of a mobile edge computing (MEC) system, where active mobile devices with computation tasks randomly appear in a cell. Every task can be computed at either the mobile device or the MEC server. We jointly optimize the task offloading decision, uplink transmission device selection and power allocation by formulating the problem as an infinite-horizon Markov decision process (MDP). Compared with most of the existing literature, this is the first attempt to address the transmission and computation optimization with the random device arrivals in an infinite time horizon to our best knowledge. Due to the uncertainty in the device number and location, the conventional approximate MDP approaches addressing the curse of dimensionality cannot be applied. An alternative and suitable low-complexity solution framework is proposed in this work. We first introduce a baseline scheduling policy, whose value function can be derived analytically with the statistics of random mobile device arrivals. Then, one-step policy iteration is adopted to obtain a sub-optimal scheduling policy whose performance can be bounded analytically. The complexity of deriving the sub-optimal policy is reduced dramatically compared with conventional solutions of MDP by eliminating the complicated value iteration. To address a more general scenario where the statistics of random mobile device arrivals are unknown, a novel and efficient algorithm integrating reinforcement learning and stochastic gradient descent (SGD) is proposed to improve the system performance in an online manner. Simulation results show that the gain of the sub-optimal policy over various benchmarks is significant.

preprint2020arXiv

SPCANet: Stellar Parameters and Chemical Abundances Network for LAMOST-II Medium Resolution Survey

The fundamental stellar atmospheric parameters T_eff and log g and 13 chemical abundances are derived for medium-resolution spectroscopy from LAMOST Medium-Resolution Survey (MRS) data sets with a deep-learning method. The neural networks we designed, named as SPCANet, precisely map LAMOST MRS spectra to stellar parameters and chemical abundances. The stellar labels derived by SPCANet are with precisions of 119 K for T_eff and 0.17 dex for log g. The abundance precision of 11 elements including [C/H], [N/H], [O/H], [Mg/H], [Al/H], [Si/H], [S/H], [Ca/H], [Ti/H], [Cr/H], [Fe/H], and [Ni/H] are 0.06~0.12 dex, while of [Cu/H] is 0.19 dex. These precisions can be reached even for spectra with signal-to-noise as low as 10. The results of SPCANet are consistent with those from other surveys such as APOGEE, GALAH and RAVE, and are also validated with the previous literature values including clusters and field stars. The catalog of the estimated parameters is available at \url{http://paperdata.china-vo.org/LAMOST/MRS_parameters_elements.csv}.

preprint2020arXiv

Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage

Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models. So motivated, we propose a novel framework based on BioBERT (Bidirectional Encoder Representations from Transformers forBiomedical TextMining). Specifically, (i) we introduce Label Embeddings for Self-Attention in each layer of BERT, which we call LESA-BERT, and (ii) by distilling LESA-BERT to smaller variants, we aim to reduce overfitting and model size when working on small datasets. As an application, our framework is utilized to build a model for patient portal message triage that classifies the urgency of a message into three categories: non-urgent, medium and urgent. Experiments demonstrate that our approach can outperform several strong baseline classifiers by a significant margin of 4.3% in terms of macro F1 score. The code for this project is publicly available at \url{https://github.com/shijing001/text_classifiers}.

preprint2020arXiv

Syntax-aware Data Augmentation for Neural Machine Translation

Data augmentation is an effective performance enhancement in neural machine translation (NMT) by generating additional bilingual data. In this paper, we propose a novel data augmentation enhancement strategy for neural machine translation. Different from existing data augmentation methods which simply choose words with the same probability across different sentences for modification, we set sentence-specific probability for word selection by considering their roles in sentence. We use dependency parse tree of input sentence as an effective clue to determine selecting probability for every words in each sentence. Our proposed method is evaluated on WMT14 English-to-German dataset and IWSLT14 German-to-English dataset. The result of extensive experiments show our proposed syntax-aware data augmentation method may effectively boost existing sentence-independent methods for significant translation performance improvement.

preprint2020arXiv

Towards Physics-informed Deep Learning for Turbulent Flow Prediction

While deep learning has shown tremendous success in a wide range of domains, it remains a grand challenge to incorporate physical principles in a systematic manner to the design, training, and inference of such models. In this paper, we aim to predict turbulent flow by learning its highly nonlinear dynamics from spatiotemporal velocity fields of large-scale fluid flow simulations of relevance to turbulence modeling and climate modeling. We adopt a hybrid approach by marrying two well-established turbulent flow simulation techniques with deep learning. Specifically, we introduce trainable spectral filters in a coupled model of Reynolds-averaged Navier-Stokes (RANS) and Large Eddy Simulation (LES), followed by a specialized U-net for prediction. Our approach, which we call turbulent-Flow Net (TF-Net), is grounded in a principled physics model, yet offers the flexibility of learned representations. We compare our model, TF-Net, with state-of-the-art baselines and observe significant reductions in error for predictions 60 frames ahead. Most importantly, our method predicts physical fields that obey desirable physical characteristics, such as conservation of mass, whilst faithfully emulating the turbulent kinetic energy field and spectrum, which are critical for accurate prediction of turbulent flows.

preprint2020arXiv

UMAP-assisted $K$-means clustering of large-scale SARS-CoV-2 mutation datasets

Coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has a worldwide devastating effect. The understanding of evolution and transmission of SARS-CoV-2 is of paramount importance for the COVID-19 control, combating, and prevention. Due to the rapid growth of both the number of SARS-CoV-2 genome sequences and the number of unique mutations, the phylogenetic analysis of SARS-CoV-2 genome isolates faces an emergent large-data challenge. We introduce a dimension-reduced $k$-means clustering strategy to tackle this challenge. We examine the performance and effectiveness of three dimension-reduction algorithms: principal component analysis (PCA), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP). By using four benchmark datasets, we found that UMAP is the best-suited technique due to its stable, reliable, and efficient performance, its ability to improve clustering accuracy, especially for large Jaccard distanced-based datasets, and its superior clustering visualization. The UMAP-assisted $k$-means clustering enables us to shed light on increasingly large datasets from SARS-CoV-2 genome isolates.

preprint2020arXiv

Unveiling the molecular mechanism of SARS-CoV-2 main protease inhibition from 92 crystal structures

Currently, there is no effective antiviral drugs nor vaccine for coronavirus disease 2019 (COVID-19) caused by acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Due to its high conservativeness and low similarity with human genes, SARS-CoV-2 main protease (M$^{\text{pro}}$) is one of the most favorable drug targets. However, the current understanding of the molecular mechanism of M$^{\text{pro}}$ inhibition is limited by the lack of reliable binding affinity ranking and prediction of existing structures of M$^{\text{pro}}$-inhibitor complexes. This work integrates mathematics and deep learning (MathDL) to provide a reliable ranking of the binding affinities of 92 SARS-CoV-2 M$^{\text{pro}}$ inhibitor structures. We reveal that Gly143 residue in M$^{\text{pro}}$ is the most attractive site to form hydrogen bonds, followed by Cys145, Glu166, and His163. We also identify 45 targeted covalent bonding inhibitors. Validation on the PDBbind v2016 core set benchmark shows the MathDL has achieved the top performance with Pearson's correlation coefficient ($R_p$) being 0.858. Most importantly, MathDL is validated on a carefully curated SARS-CoV-2 inhibitor dataset with the averaged $R_p$ as high as 0.751, which endows the reliability of the present binding affinity prediction. The present binding affinity ranking, interaction analysis, and fragment decomposition offer a foundation for future drug discovery efforts.

preprint2020arXiv

What leads to generalization of object proposals?

Object proposal generation is often the first step in many detection models. It is lucrative to train a good proposal model, that generalizes to unseen classes. This could help scaling detection models to larger number of classes with fewer annotations. Motivated by this, we study how a detection model trained on a small set of source classes can provide proposals that generalize to unseen classes. We systematically study the properties of the dataset - visual diversity and label space granularity - required for good generalization. We show the trade-off between using fine-grained labels and coarse labels. We introduce the idea of prototypical classes: a set of sufficient and necessary classes required to train a detection model to obtain generalized proposals in a more data-efficient way. On the Open Images V4 dataset, we show that only 25% of the classes can be selected to form such a prototypical set. The resulting proposals from a model trained with these classes is only 4.3% worse than using all the classes, in terms of average recall (AR). We also demonstrate that Faster R-CNN model leads to better generalization of proposals compared to a single-stage network like RetinaNet.

preprint2019arXiv

End-to-end Quantum Secured Inter-Domain 5G Service Orchestration Over Dynamically Switched Flex-Grid Optical Networks Enabled by a q-ROADM

Dynamic and flexible optical networking enabled by NFV and SDN are the key technology enablers for supporting the dynamicity and bandwidth requirements of emerging 5G network services. To achieve the objective of 5G, Network Services (NSes) must be often deployed transparently over multiple administrative and technological domains. Such case often presents security risks since a typical NS may comprise a chain of network functions, each executed in different remote locations, and tampering within the network infrastructure may compromise their communication. To avoid such threats, QKD has been identified and proposed as a future-proof method immune to any algorithmic cryptanalysis based on quantum-physics mechanisms. The maturity of QKD has enabled the R&D of quantum networks coexisting with optical networks using telecom equipment. This makes the QKD a suitable candidate for the security of distributed and virtualised network services. In this paper, for the first time, we propose a dynamic quantum-secured optical network for supporting network services that are dynamically created by chaining VNF over multiple network domains. This work includes a new quantum-ROADM, extensions to SDN-enabled optical control plane, and extensions to NFV orchestration to achieve quantum-aware, on-demand chaining of VNFs. The experimental results verify the capability of routing quantum and classical data channels both individually and dynamically over shared fibre links. Moreover, quantum secured chaining of VNFs in 5G networks is experimentally demonstrated via interconnecting four autonomous 5G islands simultaneously through the q-ROADM with eight optical channels using the 5GUK Exchange orchestration platform. The experimental scenarios and results confirm the benefit of the proposed data plane architecture and control/management plane framework.

preprint2019arXiv

Experimental observation of the gate-controlled reversal of the anomalous Hall effect in the intrinsic magnetic topological insulator MnBi2Te4 device

Here we report the reserved anomalous Hall effect (AHE) in the 5-septuple-layer van der Waals device of the intrinsic magnetic topological insulator MnBi2Te4. By employing the top/bottom gate, a negative AHE loop gradually decreases to zero and changes to a reversed sign. The reversed AHE exhibits distinct coercive fields and temperature dependence from the previous AHE. It reaches the maximum inside the gap of the Dirac cone. The newly-seen reversed AHE is attributed to the competition of the intrinsic Berry curvature and the Dirac-gap enhanced extrinsic skew scattering. Its gate-controlled switching contributes a scheme for the topological spin field-effect transistors.

preprint2016arXiv

3D Shape Induction from 2D Views of Multiple Objects

In this paper we investigate the problem of inducing a distribution over three-dimensional structures given two-dimensional views of multiple objects taken from unknown viewpoints. Our approach called "projective generative adversarial networks" (PrGANs) trains a deep generative model of 3D shapes whose projections match the distributions of the input 2D views. The addition of a projection module allows us to infer the underlying 3D shape distribution without using any 3D, viewpoint information, or annotation during the learning phase. We show that our approach produces 3D shapes of comparable quality to GANs trained on 3D data for a number of shape categories including chairs, airplanes, and cars. Experiments also show that the disentangled representation of 2D shapes into geometry and viewpoint leads to a good generative model of 2D shapes. The key advantage is that our model allows us to predict 3D, viewpoint, and generate novel views from an input image in a completely unsupervised manner.

preprint2016arXiv

Accounting for interactions and complex inter-subject dependency in estimating treatment effect in cluster randomized trials with missing outcomes

Semi-parametric methods are often used for the estimation of intervention effects on correlated outcomes in cluster-randomized trials (CRTs). When outcomes are missing at random (MAR), Inverse Probability Weighted (IPW) methods incorporating baseline covariates can be used to deal with informative missingness. Also, augmented generalized estimating equations (AUG) correct for imbalance in baseline covariates but need to be extended for MAR outcomes. However, in the presence of interactions between treatment and baseline covariates, neither method alone produces consistent estimates for the marginal treatment effect if the model for interaction is not correctly specified. We propose an AUG-IPW estimator that weights by the inverse of the probability of being a complete case and allows different outcome models in each intervention arm. This estimator is doubly robust (DR), it gives correct estimates whether the missing data process or the outcome model is correctly specified. We consider the problem of covariate interference which arises when the outcome of an individual may depend on covariates of other individuals. When interfering covariates are not modeled, the DR property prevents bias as long as covariate interference is not present simultaneously for the outcome and the missingness. An R package is developed implementing the proposed method. An extensive simulation study and an application to a CRT of HIV risk reduction-intervention in South Africa illustrate the method.

preprint2016arXiv

Beamforming for Simultaneous Wireless Information and Power Transfer in Two-Way Relay Channels

This paper studies simultaneous wireless information and power transfer (SWIPT) systems in two-way relaying (TWR) channels. Here, two source nodes receive information and energy simultaneously via power splitting (PS) from the signals sent by a multi-antenna relay node. Our objective is to maximize the weighted sum of the harvested energy at two source nodes subject to quality of service (QoS) constraints and the relay power constraints. Three well-known and practical two-way relay strategies are considered, i.e., amplify-and-forward (AF), bit level XOR based decode-and-forward (DF-XOR) and symbol level superposition coding based DF (DF-SUP). For each relaying strategy, we formulate the joint energy transmit beamforming and PS ratios optimization as a nonconvex quadratically constrained problem. To find a closed-form solution of the formulated problem, we decouple the primal problem into two subproblems. In the first problem, we intend to optimize beamforming vector for a given PS ratio. In the second subproblem, we optimize the PS ratio with a given beamforming vector. It is worth noting that although the corresponding subproblem are nonconvex, the optimal solution of each subproblem can still be found by using certain techniques. We provide numerical results that demonstrate the advantage of adapting the different relaying strategies and weighted factors to harvest energy in two-way relaying channel.

preprint2016arXiv

Canonical connection on contact manifolds

We introduce a canonical affine connection on the contact manifold $(Q,ξ)$, which is associated to each contact triad $(Q,λ,J)$ where $λ$ is a contact form and $J:ξ\to ξ$ is an endomorphism with $J^2 = -id$ compatible to $dλ$. We call it the \emph{contact triad connection} of $(Q,λ,J)$ and prove its existence and uniqueness. The connection is canonical in that the pull-back connection $ϕ^*\nabla$ of a triad connection $\nabla$ becomes the triad connection of the pull-back triad $(Q, ϕ^*λ, ϕ^*J)$ for any diffeomorphism $ϕ:Q \to Q$ satisfying $ϕ^*λ= λ$ (sometimes called a strict contact diffeomorphism). It also preserves both the triad metric $$ g_{(λ,J)} = dλ(\cdot, J\cdot) + λ\otimes λ$$ and $J$ regarded as an endomorphism on $TQ = \mathbb R\{X_λ\}\oplus ξ$, and is characterized by its torsion properties and the requirement that the contact form $λ$ be holomorphic in the $CR$-sense. In particular, the connection restricts to a Hermitian connection $\nabla^π$ on the Hermitian vector bundle $(ξ,J,g_ξ)$ with $g_ξ= dλ(\cdot, J\cdot)|_ξ$, which we call the \emph{contact Hermitian connection} of $(ξ,J,g_ξ)$. These connections greatly simplify tensorial calculations in the sequels \cite{oh-wang1}, \cite{oh-wang2} performed in the authors' analytic study of the map $w$, called contact instantons, which satisfy the nonlinear elliptic system of equations $\overline{\partial}^πw = 0, \, d(w^*λ\circ j) = 0$ in the contact triad $(Q,λ,J)$.

preprint2016arXiv

Dynamic Virtual Machine Management via Approximate Markov Decision Process

Efficient virtual machine (VM) management can dramatically reduce energy consumption in data centers. Existing VM management algorithms fall into two categories based on whether the VMs' resource demands are assumed to be static or dynamic. The former category fails to maximize the resource utilization as they cannot adapt to the dynamic nature of VMs' resource demands. Most approaches in the latter category are heuristical and lack theoretical performance guarantees. In this work, we formulate dynamic VM management as a large-scale Markov Decision Process (MDP) problem and derive an optimal solution. Our analysis of real-world data traces supports our choice of the modeling approach. However, solving the large-scale MDP problem suffers from the curse of dimensionality. Therefore, we further exploit the special structure of the problem and propose an approximate MDP-based dynamic VM management method, called MadVM. We prove the convergence of MadVM and analyze the bound of its approximation error. Moreover, MadVM can be implemented in a distributed system, which should suit the needs of real data centers. Extensive simulations based on two real-world workload traces show that MadVM achieves significant performance gains over two existing baseline approaches in power consumption, resource shortage and the number of VM migrations. Specifically, the more intensely the resource demands fluctuate, the more MadVM outperforms.

preprint2016arXiv

Ferromagnetism and superconductivity with possible $p+ip$ pairing symmetry in partially hydrogenated graphene

By means of first-principles calculations, we predict two new types of partially hydrogenated graphene systems: C$_{6}$H$_{1}$ and C$_{6}$H$_{5}$, which are shown to be ferromagnetic (FM) semimetal and FM narrow-gap semiconductor, respectively. When properly doped, the Fermi surface of the two systems consists of an electron pocket or six hole patches in the first Brillouin zone with completely spin-polarized charge carries. If superconductivity exists in these systems, the stable pairing symmetries are shown to be $p+ip$ for electron doped case, and anisotropic $p+ip$ for hole doped case. The predicted systems may provide fascinating platforms for studying the novel properties of ferromagnetism and triplet-pairing superconductivity as well as two-dimensional spintronics.

preprint2016arXiv

Global explicit particle-in-cell simulations of the nonstationary bow shock and magnetosphere

We carry out two-dimensional global particle-in-cell simulations of the interaction between the solar wind and a dipole field to study the formation of the bow shock and magnetosphere. A self-reforming bow shock ahead of a dipole field is presented by using relatively high temporal-spatial resolutions. We find that (1) the bow shock and the magnetosphere are formed and reach a quasi-stable state after several ion cyclotron periods, and (2) under the Bz southward solar wind condition the bow shock undergoes a self-reformation for low \b{eta}i and high MA. Simultaneously, a magnetic reconnection in the magnetotail is found. For high \b{eta}i and low MA, the shock becomes quasi-stationary, and the magnetotail reconnection disappears. In addition, (3) the magnetopause deflects the magnetosheath plasmas. The sheath particles injected at the quasi-perpendicular region of the bow shock can be convected to downstream of an oblique shock region. A fraction of these sheath particles can leak out from the magnetosheath at the wings of the bow shock. Hence, the downstream situation is more complicated than that for a planar shock produced in local simulations.

preprint2016arXiv

Interference-Constrained Pricing for D2D Networks

The concept of device-to-device (D2D) communications underlaying cellular networks opens up potential benefits for improving system performance but also brings new challenges such as interference management. In this paper, we propose a pricing framework for interference management from the D2D users to the cellular system, where the base station (BS) protects itself (or its serving cellular users) by pricing the crosstier interference caused from the D2D users. A Stackelberg game is formulated to model the interactions between the BS and D2D users. Specifically, the BS sets prices to a maximize its revenue (or any desired utility) subject to an interference temperature constraint. For given prices, the D2D users competitively adapt their power allocation strategies for individual utility maximization. We first analyze the competition among the D2D users by noncooperative game theory and an iterative based distributed power allocation algorithm is proposed. Then, depending on how much network information the BS knows, we develop two optimal algorithms, one for uniform pricing with limited network information and the other for differentiated pricing with global network information. The uniform pricing algorithm can be implemented by a fully distributed manner and requires minimum information exchange between the BS and D2D users, and the differentiated pricing algorithm is partially distributed and requires no iteration between the BS and D2D users. Then a suboptimal differentiated pricing scheme is proposed to reduce complexity and it can be implemented in a fully distributed fashion. Extensive simulations are conducted to verify the proposed framework and algorithms.

preprint2016arXiv

Kondo-induced hybrid topological insulator in two-dimensional electron system with a quadratic band crossing point

We investigate the Kondo effect in the two-dimensional electron system with a non-trivial quadratic energy band crossing point. We show that the Kondo effect can induce a new hybrid topological insulator phase which is a coexistence state of the quantum anomalous Hall effect and the TRSbroken quantum spin Hall effect. This hybrid topological insulator exhibits not only a quantized charge Hall current but also a net spin current, which are localized at the edge boundaries. This peculiar topological state arises due to the interplay of two marginally relevant operators, i.e., the Kondo-coupling between the electrons and the local magnetic moment and the electron-electron interaction in the two-dimensional system.

preprint2016arXiv

Mobility-Aware Caching for Content-Centric Wireless Networks: Modeling and Methodology

As mobile services are shifting from "connection-centric" communications to "content-centric" communications, content-centric wireless networking emerges as a promising paradigm to evolve the current network architecture. Caching popular content at the wireless edge, including base stations (BSs) and user terminals (UTs), provides an effective approach to alleviate the heavy burden on backhaul links, as well as lowering delays and deployment costs. In contrast to wired networks, a unique characteristic of content-centric wireless networks (CCWNs) is the mobility of mobile users. While it has rarely been considered by existing works in caching design, user mobility contains various helpful side information that can be exploited to improve caching efficiency at both BSs and UTs. In this paper, we present a general framework on mobility-aware caching in CCWNs. Key properties of user mobility patterns that are useful for content caching will be firstly identified, and then different design methodologies for mobility-aware caching will be proposed. Moreover, two design examples will be provided to illustrate the proposed framework in details, and interesting future research directions will be identified.

preprint2016arXiv

Optimal QoS-Aware Channel Assignment in D2D Communications with Partial CSI

In this paper, we propose effective channel assignment algorithms for network utility maximization in a cellular network with underlaying device-to-device (D2D) communications. A major innovation is the consideration of partial channel state information (CSI), i.e., the base station (BS) is assumed to be able to acquire `partial' instantaneous CSI of the cellular and D2D links, as well as, the interference links. In contrast to existing works, multiple D2D links are allowed to share the same channel, and the quality of service (QoS) requirements for both the cellular and D2D links are enforced. We first develop an optimal channel assignment algorithm based on dynamic programming (DP), which enjoys a much lower complexity compared to exhaustive search and will serve as a performance benchmark. To further reduce complexity, we propose a cluster-based sub-optimal channel assignment algorithm. New closed-form expressions for the expected weighted sum-rate and the successful transmission probabilities are also derived. Simulation results verify the effectiveness of the proposed algorithms. Moreover, by comparing different partial CSI scenarios, we observe that the CSI of the D2D communication links and the interference links from the D2D transmitters to the BS significantly affects the network performance, while the CSI of the interference links from the BS to the D2D receivers only has a negligible impact.

preprint2016arXiv

QoS-Aware Joint Mode Selection and Channel Assignment for D2D Communications

Underlaying device-to-device (D2D) communications to a cellular network is considered as a key technique to improve spectral efficiency in 5G networks. For such D2D systems, mode selection and resource allocation have been widely utilized for managing interference. However, previous works allowed at most one D2D link to access the same channel, while mode selection and resource allocation are typically separately designed. In this paper, we jointly optimize the mode selection and channel assignment in a cellular network with underlaying D2D communications, where multiple D2D links may share the same channel. Meanwhile, the QoS requirements for both cellular and D2D links are guaranteed, in terms of Signal-to-Interference-plus-Noise Ratio (SINR). We first propose an optimal dynamic programming (DP) algorithm, which provides a much lower computation complexity compared to exhaustive search and serves as the performance bench mark. A bipartite graph based greedy algorithm is then proposed to achieve a polynomial time complexity. Simulation results will demonstrate the advantage of allowing each channel to be accessed by multiple D2D links in dense D2D networks, as well as, the effectiveness of the proposed algorithms.

preprint2016arXiv

Quantum anomalies in superconducting Weyl metals

We theoretically study the quantum anomalies in the superconducting Weyl metals based on the topological field theory. It is demonstrated that the Fermi arc and the surface Andreev bound state, characteristic of the superconducting Weyl metals, are the manifestations of two underlying phenomenon, namely the chiral anomaly and the parity-like anomaly, respectively. The first anomaly is inherited from the Berry curvature around the original Weyl points, while the second is the result of the superconductivity. We show that, all the fascinating topological behavior of the superconducting Weyl metals, either intranode FFLO or the internode BCS pairing state, can be satisfactorily described and predicted by our topological field theory.

preprint2016arXiv

Solar Energetic Particle Event Associated with the 2012 July 23 Extreme Solar Storm

We study the solar energetic particle (SEP) event associated with the 2012 July 23 extreme solar storm, for which STEREO and the spacecraft at L1 provide multi-point remote sensing and in situ observations. The extreme solar storm, with a superfast shock and extremely enhanced ejecta magnetic fields observed near 1 AU at STEREO A, was caused by the combination of successive coronal mass ejections (CMEs). Meanwhile, energetic particles were observed by STEREO and near-Earth spacecraft such as ACE and SOHO, suggestive of a wide longitudinal spread of the particles at 1 AU. Combining the SEP observations with in situ plasma and magnetic field measurements we investigate the longitudinal distribution of the SEP event in connection with the associated shock and CMEs. Our results underscore the complex magnetic configuration of the inner heliosphere formed by solar eruptions. The examinations of particle intensities, proton anisotropy distributions, element abundance ratios, magnetic connectivity and spectra also give important clues for the particle acceleration, transport and distribution.

preprint2016arXiv

Spinon Majorana fermions

A realization of Majorana fermions is proposed in the frustrated magnets via the topological proximity effect. Specifically, we consider a theoretical model, where a topological insulator is coupled to a frustrated magnetic material through the spin exchange interaction. Using the renormalization group-based self-consistent mean-field approach, and calculating the self-energy correction due to the topological insulator, we find that the spin texture and the spin-momentum locking of the Dirac cone will be inherited by the spinons in the nearby frustrated magnets. This leads to a particular topological state of matter that supports the Majorana excitations. Unlike the conventional realization in SC systems, these Majorana fermions are the combination of spinons and anti-spinons, rather than electrons and holes. They can participate in the transport of spinons, leading to nontrivial properties of the spin transport.

preprint2016arXiv

Sympathetic Solar Filament Eruptions on 2015 March 15

The 2015 March 15 coronal mass ejection as one of the two that together drove the largest geomagnetic storm of solar cycle 24 so far was associated with sympathetic filament eruptions. We investigate the relations between the different filaments involved in the eruption. A surge-like small-scale filament motion is confirmed as the trigger that initiated the erupting filament with multi-wavelength observations and using a forced magnetic field extrapolation method. When the erupting filament moved to an open magnetic field region, it experienced an obvious acceleration process and was accompanied by a C-class flare and the rise of another larger filament that eventually failed to erupt. We measure the decay index of the background magnetic field, which presents a critical height of 118 Mm. Combining with a potential field source surface extrapolation method, we analyze the distributions of the large-scale magnetic field, which indicates that the open magnetic field region may provide a favorable condition for F2 rapid acceleration and have some relation with the largest solar storm. The comparison between the successful and failed filament eruptions suggests that the confining magnetic field plays an important role in the preconditions for an eruption.

preprint2016arXiv

Time-resolved Measurement of Quadrupole Wakefields in Corrugated Structures

Corrugated structures have recently been widely used for manipulating electron beam longitudinal phase space and for producing THz radiation. Here we report on time-resolved measurements of the quadrupole wakefields in planar corrugated structures. It is shown that while the time- dependent quadrupole wakefield produced by a planar corrugated structure causes significant growth in beam transverse emittance, it can be effectively canceled with a second corrugated structure with orthogonal orientation. The strengths of the time-dependent quadrupole wakefields for various corrugated structure gaps are also measured and found to be in good agreement with theories. Our work should forward the applications of corrugated structures in many accelerator based scientific facilities.

preprint2016arXiv

Universal anyons at the irradiated surface of topological insulator

Anyons have recently received great attention due to their promising application in topological quantum computation. The best validated system that enjoys the anyonic excitations are the Laughlin states. The quasi-particles in Laughlin states are neither fermions nor bosons but possess the discrete statistical angle ? = ?=m, with m being an integer. Here we report a possible realization of the universal Abelian anyons, whose statistical angle can be tuned continuously by external parameters and can take any arbitrary values interpolating ? = 0 and ? = ?. The proposed setup is the surface state of a three dimensional topological insulator driven by an amplitude-modulated circularly-polarized light. It is found that the external field leads to a particular Floquet phase, which is a two-spatial-dimensional analogy of the Weyl semimetal phase in the Floquet first Brillouin zone. The chiral anomaly of this phase results in a U(1) Chern-Simons gauge theory with a tunable Floquet Chern number. Owing to this underlying gauge field theory, the irradiated surface of topological insulator constitutes a promising platform for the observation of the universal anyons.

preprint2015arXiv

Charge transfer of edge states in zigzag silicene nanoribbons with Stone-Wales defects

Stone-Wales (SW) defects are favorably existed in graphenelike materials with honeycomb lattice structure and potentially employed to change the electronic properties in band engineering. In this paper, we investigate structural and electronic properties of SW defects in bulk silicene and its nanoribbons as a function of their concentration using the methods of periodic boundary conditions with first-principles calculations. We first calculate the formation energy, structural properties, and electronic band structures of SW defects in bulk silicene, with dependence on the concentration of SW defects. Our results show a good agreement with available values from the previous first-principles calculations. The energetics, structural aspects, and electronic properties of SW defects with dependence on defect concentration and location in edge-hydrogenated zigzag silicene nanoribbons are obtained. For all calculated concentrations, the SW defects prefer to locate at the edge due to the lower formation energy. The SW defects at the center of silicene nanoribbons slightly influence on the electronic properties, whereas the SW defects at the edge of silicene nanoribbons split the degenerate edge states and induce a sizable gap, which depends on the concentration of defects. It is worth to find that the SW defects produce a perturbation repulsive potential, which leads the decomposed charge of edge states at the side with defect to transfer to the other side without defect.

preprint2015arXiv

DC-Informative Joint Color-Frequency Modulation for Visible Light Communications

In this paper, we consider the problem of constellation design for a visible light communication (VLC) system using red/green/blue light-emitting diodes (RGB LED), and propose a method termed DC-informative joint color-frequency modulation (DCI-JCFM). This method jointly utilizes available diversity resources including different optical wavelengths, multiple baseband subcarriers, and adaptive DC-bias. Constellation is designed in a high dimensional space, where the compact sphere packing advantage over lower dimensional counterparts is utilized. Taking into account multiple practical illumination constraints, a non-convex optimization problem is formulated, seeking the least error rate with a fixed spectral efficiency. The proposed scheme is compared with a decoupled scheme, where constellation is designed separately for each LED. Notable gains for DCI-JCFM are observed through simulations where balanced, unbalanced and very unbalanced color illuminations are considered.

preprint2015arXiv

Demonstration of nonlinear-energy-spread compensation in relativistic electron bunches with corrugated structures

High quality electron beams with flat distributions in both energy and current are critical for many accelerator-based scientific facilities such as free-electron lasers and MeV ultrafast electron diffraction and microscopes. In this Letter we report on using corrugated structures to compensate for the beam nonlinear energy chirp imprinted by the curvature of the radio-frequency field, leading to a significant reduction in beam energy spread. By using a pair of corrugated structures with orthogonal orientations, we show that the quadrupole wake fields which otherwise increase beam emittance can be effectively canceled. This work also extends the applications of corrugated structures to the low beam charge (a few pC) and low beam energy (a few MeV) regime and may have a strong impact in many accelerator-based facilities.

preprint2015arXiv

Hybrid Millimeter-Wave Systems: A Novel Paradigm for HetNets

Heterogeneous Networks (HetNets) are known to enhance the bandwidth efficiency and throughput of wireless networks by more effectively utilizing the network resources. However, the higher density of users and access points in HetNets introduces significant inter-user interference that needs to be mitigated through complex and sophisticated interference cancellation schemes. Moreover, due to significant channel attenuation and presence of hardware impairments, e.g., phase noise and amplifier nonlinearities, the vast bandwidth in the millimeter-wave band has not been fully utilized to date. In order to enable the development of multi-Gigabit per second wireless networks, we introduce a novel millimeter-wave HetNet paradigm, termed hybrid HetNet, which exploits the vast bandwidth and propagation characteristics in the 60 GHz and 70-80 GHz bands to reduce the impact of interference in HetNets. Simulation results are presented to illustrate the performance advantage of hybrid HetNets with respect to traditional networks. Next, two specific transceiver structures that enable hand-offs from the 60 GHz band, i.e., the V-band to the 70-80 GHz band, i.e., the E-band, and vice versa are proposed. Finally, the practical and regulatory challenges for establishing a hybrid HetNet are outlined.

preprint2015arXiv

Impact of pickup ions on the shock front nonstationarity and energy dissipation of the heliospheric termination shock: Two-dimensional full particle simulations and comparison with Voyager 2 observations

The transition between the supersonic solar wind and the subsonic heliosheath, the termination shock (TS), was observed by Voyager 2 (V2) on 2007 August 31-September 1 at a distance of 84 AU from the Sun. The data reveal multiple crossings of a complex, quasi-perpendicular supercritical shock. These experimental data are the starting point for a more sophisticated analysis that includes computer modeling of a shock in the presence of pickup ions (PUIs). here, we present two-dimensional (2-D) particle-in-cell (PIC) simulations of the TS including PUIs self-consistently. We also report the ion velocity distribution across the TS using the Faraday cup data from V2. A relatively complete plasma and magnetic field data set from V2 gives us the opportunity to do a full comparison between the experimental data and PIC simulation results. Our results show that: (1) The nonstationarity of the shock front is mainly caused by the ripples along the shock front and these ripples from even if the percentage of PUIs is high. (2) PUIs play a key role in the energy dissipation of the TS, and most of the incident ion dynamic energy is transferred to the thermal energy of PUIs instead of solar wind ions (SWIs). (3) The simulated composite heliosheath ion velocity distribution function is a superposition of a cold core formed by transmitted SWIs, the shoulders contributed by the hot reflected SWIs and directly transmitted PUIs, and the wings of the distribution dominated by the very hot reflected PUIs. (4) The V2 Faraday cups observed the cool core of the distribution, so they saw only a tip of the iceberg. For the evolution of the cool core distribution function across the TS, the computed results agree reasonably well with the V2experimental results.

preprint2015arXiv

Inhomogeneous screening near a dielectric interface

Screening is one of the most important concepts in the study of charged systems. Near a dielectric interface, the ion distribution in a salt solution can be highly nonuniform. Here, we develop a theory that self-consistently treats the inhomogeneous screening effects. At higher concentrations when the bulk Debye screening length is comparable to the Bjerrum length, the double layer structure and interfacial properties are significantly affected by the inhomogeneous screening. In particular, the depletion zone is considerably wider than that predicted by the bulk screening approximation or the WKB approximation. For asymmetric salts, the inhomogeneous screening leads to enhanced charge separation and surface potential.

preprint2015arXiv

Lateral Heterojunction Sb2Te3/Bi2Te3 and its topological transport

A lateral heterojunction of topological insulator Sb2Te3/Bi2Te3 was successfully synthesized using a two-step solvothermal method. The two crystalline components were separated well by a sharp lattice-matched interface when the optimized procedure was used. Inspecting the heterojunction using high-resolution transmission electron microscopy showed that epitaxial growth occurred along the horizontal plane. A rectification curve was observed at low temperatures. Quantum correction from the weak antilocalization reveals the transport of the topological surface state. There was, therefore, a staggered-gap lateral heterojunction with a small junction voltage, which is appealing for a platform for spin filters and one-dimensional topological interface states.

preprint2015arXiv

Magnetic dipolar interaction between correlated triplets created by singlet fission in tetracene crystals

Singlet fission (SF) can potentially break the Shockley-Queisser efficiency limit in single-junction solar cells by splitting one photo-excited singlet exciton (S1) into two triplets (2T1) in organic semiconductors. A dark multi-exciton (ME) state has been proposed as the intermediate connecting S1 to 2T1. However, the exact nature of this ME state, especially how the doubly-excited triplets interact, remains elusive. Here, we report a quantitative study on the magnetic dipolar interaction between SF-induced correlated triplets in tetracene crystals by monitoring quantum beats relevant to the ME sublevels at room temperature. The resonances of ME sublevels approached by tuning an external magnetic field are observed to be avoided, which agrees well with the theoretical predictions considering a magnetic dipolar interaction of ~ 0.008 GHz. Our work paves a way to quantify the magnetic dipolar interaction in organic materials and marks an important step towards understanding the underlying physics of the ME state.

preprint2015arXiv

Magnetic Field Restructuring Associated with Two Successive Solar Eruptions

We examine two successive flare eruptions (X5.4 and X1.3) on 2012 March 7 in the NOAA active region 11429 and investigate the magnetic field reconfiguration associated with the two eruptions. Using an advanced non-linear force-free field (NLFFF) extrapolation method based on the SDO/HMI vector magnetograms, we obtain a stepwise decrease in the magnetic free energy during the eruptions, which is roughly 20%-30% of the energy of the pre-flare phase. We also calculate the magnetic helicity, and suggest that the changes of the sign of the helicity injection rate might be associated with the eruptions. Through the investigation of the magnetic field evolution, we find that the appearance of the "implosion" phenomenon has a strong relationship with the occurrence of the first X-class flare. Meanwhile, the magnetic field changes of the successive eruptions with implosion and without implosion were well observed.

preprint2015arXiv

On Sun-to-Earth Propagation of Coronal Mass Ejections: 2. Slow Events and Comparison with Others

As a follow-up study on Sun-to-Earth propagation of fast coronal mass ejections (CMEs), we examine the Sun-to-Earth characteristics of slow CMEs combining heliospheric imaging and in situ observations. Three events of particular interest, the 2010 June 16, 2011 March 25 and 2012 September 25 CMEs, are selected for this study. We compare slow CMEs with fast and intermediate-speed events, and obtain key results complementing the attempt of \citet{liu13} to create a general picture of CME Sun-to-Earth propagation: (1) the Sun-to-Earth propagation of a typical slow CME can be approximately described by two phases, a gradual acceleration out to about 20-30 solar radii, followed by a nearly invariant speed around the average solar wind level, (2) comparison between different types of CMEs indicates that faster CMEs tend to accelerate and decelerate more rapidly and have shorter cessation distances for the acceleration and deceleration, (3) both intermediate-speed and slow CMEs would have a speed comparable to the average solar wind level before reaching 1 AU, (4) slow CMEs have a high potential to interact with other solar wind structures in the Sun-Earth space due to their slow motion, providing critical ingredients to enhance space weather, and (5) the slow CMEs studied here lack strong magnetic fields at the Earth but tend to preserve a flux-rope structure with axis generally perpendicular to the radial direction from the Sun. We also suggest a "best" strategy for the application of a triangulation concept in determining CME Sun-to-Earth kinematics, which helps to clarify confusions about CME geometry assumptions in the triangulation and to improve CME analysis and observations.

preprint2015arXiv

On the theoretical description of weakly charged surfaces

It is widely accepted that the Poisson-Boltzmann (PB) theory provides a valid description for charged surfaces in the so-called weak coupling limit. Here, we show that the image charge repulsion creates a depletion boundary layer that cannot be captured by a regular perturbation approach. The correct weak-coupling theory must include the self-energy of the ion due to the image charge interaction. The image force qualitatively alters the double layer structure and properties, and gives rise to many non-PB effects, such as nonmonotonic dependence of the surface energy on concentration and charge inversion. In the presence of dielectric discontinuity, there is no limiting condition for which the PB theory is valid.

preprint2015arXiv

Operator Reproducing Kernel Hilbert Spaces

Motivated by the need of processing functional-valued data, or more general, operatorvalued data, we introduce the notion of the operator reproducing kernel Hilbert space (ORKHS). This space admits a unique operator reproducing kernel which reproduces a family of continuous linear operators on the space. The theory of ORKHSs and the associated operator reproducing kernels are established. A special class of ORKHSs, known as the perfect ORKHSs, are studied, which reproduce the family of the standard point-evaluation operators and at the same time another different family of continuous linear operators. The perfect ORKHSs are characterized in terms of features, especially for those with respect to integral operators. In particular, several specific examples of the perfect ORKHSs are presented. We apply the theory of ORKHSs to sampling and regularized learning, where operator-valued data are considered. Specifically, a general complete reconstruction formula from linear operators values is established in the framework of ORKHSs. The average sampling and the reconstruction of vector-valued functions are considered in specific ORKHSs. We also investigate in the ORKHSs setting the regularized learning schemes, which learn a target element from operator-valued data. The desired representer theorems of the learning problems are established to demonstrate the key roles played by the ORKHSs and the operator reproducing kernels in machine learning from operator-valued data. We finally point out that the continuity of linear operators, used to obtain the operator-valued data, on an ORKHS is necessary for the stability of the numerical reconstruction algorithm using the resulting data.

preprint2015arXiv

Phase transition in finite density and temperature lattice QCD

We investigate the behavior of the chiral condensate in lattice QCD at finite temperature and finite chemical potential. The study was done using two flavors of light quarks and with a series of $β$ and $ma$ at the lattice size $24\times12^{2}\times6$. The calculation was done in the Taylar expansion formalism. We are able to calculate the first and second order derivatives of $\langle\barψψ\rangle$ in both isoscalar and isovector channels. With the first derivatives being small, we find that the second derivatives are sizable close to the phase transition and the magnitude of $\barψψ$ decreases under the influence of finite chemical potential in both channels.

preprint2015arXiv

Plasma and Magnetic Field Characteristics of Solar Coronal Mass Ejections in Relation to Geomagnetic Storm Intensity and Variability

The largest geomagnetic storms of solar cycle 24 so far occurred on 2015 March 17 and June 22 with $D_{\rm st}$ minima of $-223$ and $-195$ nT, respectively. Both of the geomagnetic storms show a multi-step development. We examine the plasma and magnetic field characteristics of the driving coronal mass ejections (CMEs) in connection with the development of the geomagnetic storms. A particular effort is to reconstruct the in situ structure using a Grad-Shafranov technique and compare the reconstruction results with solar observations, which gives a larger spatial perspective of the source conditions than one-dimensional in situ measurements. Key results are obtained concerning how the plasma and magnetic field characteristics of CMEs control the geomagnetic storm intensity and variability: (1) a sheath-ejecta-ejecta mechanism and a sheath-sheath-ejecta scenario are proposed for the multi-step development of the 2015 March 17 and June 22 geomagnetic storms, respectively; (2) two contrasting cases of how the CME flux-rope characteristics generate intense geomagnetic storms are found, which indicates that a southward flux-rope orientation is not a necessity for a strong geomagnetic storm; and (3) the unexpected 2015 March 17 intense geomagnetic storm resulted from the interaction between two successive CMEs plus the compression by a high-speed stream from behind, which is essentially the "perfect storm" scenario proposed by \citet[][i.e., a combination of circumstances results in an event of unusual magnitude]{liu14a}, so the "perfect storm" scenario may not be as rare as the phrase implies.

preprint2015arXiv

Positioning the neutron drip line and the r-process paths in the nuclear landscape

Exploring nucleon drip lines and astrophysical rapid neutron capture process (r-process) paths in the nuclear landscape is extremely challenging in nuclear physics and astrophysics. While various models predict similar proton drip line, their predictions for neutron drip line and the r-process paths involving heavy neutron-rich nuclei exhibit a significant variation which hampers our accurate understanding of the r-process nucleosynthesis mechanism. Using microscopic density functional theory with a representative set of non-relativistic and relativistic interactions, we demonstrate for the first time that this variation is mainly due to the uncertainty of nuclear matter symmetry energy $E_{\rm{sym}}(ρ_{\rm{sc}})$ at the subsaturation cross density $ρ_{\rm{sc}}=0.11/0.16\timesρ_0$ ($ρ_0$ is saturation density), which reflects the symmetry energy of heavy nuclei. Using the recent accurate constraint on $E_{\rm{sym}}(ρ_{\rm{sc}})$ from the binding energy difference of heavy isotope pairs, we obtain quite precise predictions for the location of the neutron drip line, the r-process paths and the number of bound nuclei in the nuclear landscape. Our results have important implications on extrapolating the properties of unknown neutron-rich rare isotopes from the data on known nuclei.

preprint2015arXiv

Single-Step Tunable Group Delay Phaser for Real-Time Spectrum Sniffing

This paper presents a single-step tunable group delay phaser for spectrum sniffing. This device may be seen as a "time filter", where frequencies are suppressed by time separation rather than by spectral attenuation. Compared to its multiple-step counterpart, this phaser features higher processing resolution, greater simplicity, lower loss and better channel equalization, due to the smaller and channel-independent group delay swing. A three-channel example is provided for illustration.

preprint2015arXiv

The role of active region coronal magnetic field in determining coronal mass ejection propagation direction

We study the role of the coronal magnetic field configuration of an active region in determining the propagation direction of a coronal mass ejection (CME). The CME occurred in the active region 11944 (S09W01) near the disk center on 2014 January 7 and was associated with an X1.2 flare. A new CME reconstruction procedure based on a polarimetric technique is adopted, which shows that the CME changed its propagation direction by around 28$^\circ$ in latitude within 2.5 R$_\odot$ and 43$^\circ$ in longitude within 6.5 R$_\odot$ with respect to the CME source region. This significant non-radial motion is consistent with the finding of M$\ddot{o}$stl et al. (2015). We use nonlinear force-free field (NLFFF) and potential field source surface (PFSS) extrapolation methods to determine the configurations of the coronal magnetic field. We also calculate the magnetic energy density distributions at different heights based on the extrapolations. Our results show that the active region coronal magnetic field has a strong influence on the CME propagation direction. This is consistent with the "channelling" by the active region coronal magnetic field itself, rather than deflection by nearby structures. These results indicate that the active region coronal magnetic field configuration has to be taken into account in order to determine CME propagation direction correctly.

preprint2014arXiv

Audio Splicing Detection and Localization Using Environmental Signature

Audio splicing is one of the most common manipulation techniques in the area of audio forensics. In this paper, the magnitudes of acoustic channel impulse response and ambient noise are proposed as the environmental signature. Specifically, the spliced audio segments are detected according to the magnitude correlation between the query frames and reference frames via a statically optimal threshold. The detection accuracy is further refined by comparing the adjacent frames. The effectiveness of the proposed method is tested on two data sets. One is generated from TIMIT database, and the other one is made in four acoustic environments using a commercial grade microphones. Experimental results show that the proposed method not only detects the presence of spliced frames, but also localizes the forgery segments with near perfect accuracy. Comparison results illustrate that the identification accuracy of the proposed scheme is higher than the previous schemes. In addition, experimental results also show that the proposed scheme is robust to MP3 compression attack, which is also superior to the previous works.

preprint2014arXiv

Channel Estimation and Optimal Training Design for Correlated MIMO Two-Way Relay Systems in Colored Environment

In this paper, while considering the impact of antenna correlation and the interference from neighboring users, we analyze channel estimation and training sequence design for multi-input multi-output (MIMO) two-way relay (TWR) systems. To this end, we propose to decompose the bidirectional transmission links into two phases, i.e., the multiple access (MAC) phase and the broadcasting (BC) phase. By considering the Kronecker-structured channel model, we derive the optimal linear minimum mean-square-error (LMMSE) channel estimators. The corresponding training designs for the MAC and BC phases are then formulated and solved to improve channel estimation accuracy. For the general scenario of training sequence design for both phases, two iterative training design algorithms are proposed that are verified to produce training sequences that result in near optimal channel estimation performance. Furthermore, for specific practical scenarios, where the covariance matrices of the channel or disturbances are of particular structures, the optimal training sequence design guidelines are derived. In order to reduce training overhead, the minimum required training length for channel estimation in both the MAC and BC phases are also derived. Comprehensive simulations are carried out to demonstrate the effectiveness of the proposed training designs.

preprint2014arXiv

Coefficient of performance under maximum $χ$ criterion in a two-level atomic system as a refrigerator

A two-level atomic system as a working substance is used to set up a refrigerator consisting of two quantum adiabatic and two isochoric processes (two constant-frequency processes $ω_a$ and $ω_b$ with $ω_a<ω_b$), during which the two-level system is in contact with two heat reservoirs at temperatures $T_h$ and $T_c (<T_h)$. Considering finite-time operation of two isochoric processes, we derive analytical expressions for cooling rate $R$ and coefficient of performance (COP) $\varepsilon$. The COP at maximum $χ(= \varepsilon R)$ figure of merit is numerically determined, and it is proved to be in nice agreement with the so-called Curzon and Ahlborn COP $\varepsilon_{CA}=\sqrt{1+\varepsilon_C}-1$, where $\varepsilon_C=T_c/(T_h-T_c)$ is the Carnot COP. In the high-temperature limit, the COP at maximum $χ$ figure of merit, $\varepsilon^*$, can be expressed analytically by $\varepsilon^* = \varepsilon_+ \equiv (\sqrt{9+8\varepsilon_C}-3)/2$, which was derived previously as the upper bound of optimal COP for the low-dissipation or minimally nonlinear irreversible refrigerators. Within context of irreversible thermodynamics, we prove that the value of $\varepsilon_{+}$ is also the upper bound of COP at maximum $χ$ figure of merit when we regard our model as a linear irreversible refrigerator.

preprint2014arXiv

Constellation Design for Multi-color Visible Light Communications

In this paper, we propose a novel high dimensional constellation design scheme for visible light communication (VLC) systems employing red/green/blue light emitting diodes (RGB LEDs). It is in fact a generalized color shift keying (CSK) scheme which does not suffer efficiency loss due to a constrained sum intensity for all constellation symbols. Crucial lighting requirements are included as optimization constraints. To control non-linear distortion, the optical peak-to-average-power ratio (PAPR) of LEDs is individually constrained. Fixing the average optical power, our scheme is able to achieve much lower bit-error rate (BER) than conventional schems especially when illumination color is more "unbalanced". When cross-talks exist among the multiple optical channels, we apply a singular value decomposition (SVD)-based pre-equalizer and redesign the constellations, and such scheme is shown to outperform post-equalized schemes based on zero-forcing or linear minimum-mean-squared-error (LMMSE) principles. To further reduce system BER, a binary switching algorithm (BSA) is employed the first time for labeling high dimensional constellation. We thus obtains the optimal bits-to-symbols mapping.

preprint2014arXiv

Continuous Self Energy of Ions at the Dielectric Interface

We present a simple, unified theory for the self-energy of an ion near a dielectric interface. Our theory accounts for both the short-range (solvation) and long-range (image force) electrostatic forces, charge polarization induced by these forces, and the cavity energy. In contrast to previous models, our self energy is continuous across the interface and thus applicable to both the water and air (oil) sides of the interface. With no fitting parameters, we predict the specific ion effect on the interfacial affinity of halogen anions at the water/air interface, and the strong adsorption of hydrophobic ions at the water/oil interface, in agreement with experiments and atomistic simulations.

preprint2014arXiv

Entanglement, subsystem particle numbers and topology in free fermion systems

We study the relationship between bipartite entanglement, subsystem particle number and topology in a half-filled free fermion system. It is proposed that the spin-projected particle numbers can distinguish the quantum spin Hall state from other states, and can be used to establish a new topological index for the system. Furthermore, we apply the new topological invariant to a disordered system and show that a topological phase transition occurs when the disorder strength is increased beyond a critical value. It is also shown that the subsystem particle number fluctuation displays behavior very similar to that of the entanglement entropy. This provides a lower-bound estimation for the entanglement entropy, which can be utilized to obtain an estimate of the entanglement entropy experimentally.

preprint2014arXiv

From DC-Biased to DC-Informative Optical OFDM

We propose a novel modulation scheme for intensity modulation and direct detection (IM/DD) based optical communication system employing orthogonal frequency division multiplexing (OFDM). This method utilizes the DC-bias, which typically is discarded at the receiver-end, to carry information to achieve higher power efficiency. By formulating and solving a convex optimization problem, a constellation in high dimensional space is designed offline for the input of the transmitter-side inverse fast Fourier transform (IFFT) block. We point out that one can choose partial or full DC power for information transmission. Under the condition that the spectrum efficiency is fixed and attainable, this method bears notable power gain over traditional DC-biased optical OFDM (DCO-OFDM).

preprint2014arXiv

Joint Spatial Division and Multiplexing for mm-Wave Channels

Massive MIMO systems are well-suited for mm-Wave communications, as large arrays can be built with reasonable form factors, and the high array gains enable reasonable coverage even for outdoor communications. One of the main obstacles for using such systems in frequency-division duplex mode, namely the high overhead for the feedback of channel state information (CSI) to the transmitter, can be mitigated by the recently proposed JSDM (Joint Spatial Division and Multiplexing) algorithm. In this paper we analyze the performance of this algorithm in some realistic propagation channels that take into account the partial overlap of the angular spectra from different users, as well as the sparsity of mm-Wave channels. We formulate the problem of user grouping for two different objectives, namely maximizing spatial multiplexing, and maximizing total received power, in a graph-theoretic framework. As the resulting problems are numerically difficult, we proposed (sub optimum) greedy algorithms as efficient solution methods. Numerical examples show that the different algorithms may be superior in different settings.We furthermore develop a new, "degenerate" version of JSDM that only requires average CSI at the transmitter, and thus greatly reduces the computational burden. Evaluations in propagation channels obtained from ray tracing results, as well as in measured outdoor channels show that this low-complexity version performs surprisingly well in mm-Wave channels.

preprint2014arXiv

MIMO Multiway Relaying with Pairwise Data Exchange: A Degrees of Freedom Perspective

In this paper, we study achievable degrees of freedom (DoF) of a multiple-input multiple-output (MIMO) multiway relay channel (mRC) where $K$ users, each equipped with $M$ antennas, exchange messages in a pairwise manner via a common $N$-antenna relay node. % A novel and systematic way of joint beamforming design at the users and at the relay is proposed to align signals for efficient implementation of physical-layer network coding (PNC). It is shown that, when the user number $K=3$, the proposed beamforming design can achieve the DoF capacity of the considered mRC for any $(M,N)$ setups. % For the scenarios with $K>3$, we show that the proposed signaling scheme can be improved by disabling a portion of relay antennas so as to align signals more efficiently. Our analysis reveals that the obtained achievable DoF is always piecewise linear, and is bounded either by the number of user antennas $M$ or by the number of relay antennas $N$. Further, we show that the DoF capacity can be achieved for $\frac{M}{N} \in \left(0,\frac{K-1}{K(K-2)} \right]$ and $\frac{M}{N} \in \left[\frac{1}{K(K-1)}+\frac{1}{2},\infty \right)$, which provides a broader range of the DoF capacity than the existing results. Asymptotic DoF as $K\rightarrow \infty$ is also derived based on the proposed signaling scheme.

preprint2014arXiv

Polarization-dependent exciton dynamics in tetracene single crystals

We conduct polarization-dependent ultrafast spectroscopy to study the dynamics of singlet fission in tetracene single crystals. The spectrotemporal species for singlet and triplet excitons in transient absorption spectra are found to be strongly dependent on probe polarization. By carefully analyzing the polarization dependence, the signals contributed by different transitions related to singlet excitons have been disentangled, which is further applied to construct the correlation between dynamics of singlet and triplet excitons. The anisotropy of exciton dynamics provides an alternative approach to tackle the long-standing challenge in understanding the mechanism of singlet fission in organic semiconductors.

preprint2014arXiv

Sun-to-Earth Characteristics of Two Coronal Mass Ejections Interacting near 1 AU: Formation of a Complex Ejecta and Generation of a Two-Step Geomagnetic Storm

On 2012 September 30 - October 1 the Earth underwent a two-step geomagnetic storm. We examine the Sun-to-Earth characteristics of the coronal mass ejections (CMEs) responsible for the geomagnetic storm with combined heliospheric imaging and in situ observations. The first CME, which occurred on 2012 September 25, is a slow event and shows an acceleration followed by a nearly invariant speed in the whole Sun-Earth space. The second event, launched from the Sun on 2012 September 27, exhibits a quick acceleration, then a rapid deceleration and finally a nearly constant speed, a typical Sun-to-Earth propagation profile for fast CMEs \citep{liu13}. These two CMEs interacted near 1 AU as predicted by the heliospheric imaging observations and formed a complex ejecta observed at Wind, with a shock inside that enhanced the pre-existing southward magnetic field. Reconstruction of the complex ejecta with the in situ data indicates an overall left-handed flux rope-like configuration, with an embedded concave-outward shock front, a maximum magnetic field strength deviating from the flux rope axis and convex-outward field lines ahead of the shock. While the reconstruction results are consistent with the picture of CME-CME interactions, a magnetic cloud-like structure without clear signs of CME interactions \citep{lugaz14} is anticipated when the merging process is finished.

preprint2014arXiv

Superlinear density dependence of singlet fission rate in tetracene films

We experimentally show that the rate of singlet fission in tetracene films has a superlinear dependence on the density of photo-excited singlet excitons with ultrafast transient absorption spectroscopy. The spectrotemporal features of singlet and triplet dynamics can be disentangled from experimental data with the algorithm of singular value decomposition. The correlation between their temporal dynamics indicates a nonlinear density dependence of fission rate, which leads to a conjecture of coherent singlet fission process arising from superradiant excitons in crystalline tetracene. This hypothesis might be able to resolve some long-standing controversies.

preprint2014arXiv

Theory of Polymer Chains in Poor Solvent: Single-Chain Structure, Solution Thermodynamics and Theta Point

Using the language of the Flory chi parameter, we develop a theory that unifies the treatment of the single-chain structure and the solution thermodynamics of polymers in poor solvents. The structure of a globule and its melting thermodynamics is examined using the self-consistent filed theory. Our results show that the chain conformation involves three states prior to the globule-to-coil transition: the fully-collapsed globule, the swollen globule and the molten globule, which are distinguished by the core density and the interfacial thickness. By examining the chain-length dependence of the melting of the swollen globule, we find universal scaling behavior in the chain properties near the Theta point. The information of density profile and free energy of the globule is used in the dilute solution thermodynamics to study the phase equilibrium of polymer solution. Our results show different scaling behavior of the solubility of polymers in the dilute solution compared to the F-H theory, both in the chi dependence and the chain-length dependence. From the perspectives of single chain structure and solution thermodynamics, our results verifies the consistency of the Theta point defined by different criteria in the limit of infinite chain length: the disappearance of the second viral coefficient, the abrupt change in chain size and the critical point in the phase diagram of the polymer solution. Our results show the value of chi at the Theta point is 0.5 (for the case of equal monomer and solvent volume), which coincides with the value predicted from the F-H theory.

preprint2013arXiv

3-D nonlinear force-free field reconstruction of solar active region 11158 by direct boundary integral equation

A 3-D coronal magnetic field is reconstructed for NOAA 11158 on Feb 14, 2011. A GPU-accelerated direct boundary integral equation (DBIE) method is implemented. This is about 1000 times faster than the original DBIE used on solar NLFFF modeling. Using the SDO/HMI vector magnetogram as the bottom boundary condition, the reconstructed magnetic field lines are compared with the projected EUV loop structures from different views three-dimensionally by SDO/AIA and STEREO A/B spacecraft simultaneously for the first time. They show very good agreement so that the topological configurations of the magnetic fields can be analyzed, thus its role in the flare process of the active region can be better understood. A quantitative comparison with some stereoscopically reconstructed coronal loops shows that the present averaged misalignment angles are at the same order as the state-of-the-art results obtained with reconstructed coronal loops as prescribed conditions and better than other NLFFF methods. It is found that the observed coronal loop structures can be grouped into bundles of closed and open loops with some central bright coronal loops around the polarity inversion line (PIL). The reconstructed highly-shearing magnetic field lines agree very well with the low-lying S-shaped filament channel along PIL. They are in a pivot position to all other surrounding coronal structures, and a group of electric current lines co-aligned with the central bright EUV loops overlying the filament channel is also obtained. This central lower-lying magnetic field loop system must have played a key role in powering the flare. It should be noted that while a strand-like coronal feature along PIL may be related to the filament, one cannot simply attribute all the coronal bright features along PIL to manifestation of the filament without any stereoscopically information. It shows that DBIE is rigorous and effective.

preprint2013arXiv

Channel Estimation, Carrier Recovery, and Data Detection in the Presence of Phase Noise in OFDM Relay Systems

Due to its time-varying nature, oscillator phase noise can significantly degrade the performance of channel estimation, carrier recovery, and data detection blocks in high-speed wireless communication systems. In this paper, we analyze joint channel, \emph{carrier frequency offset (CFO)}, and phase noise estimation plus data detection in \emph{orthogonal frequency division multiplexing (OFDM)} relay systems. To achieve this goal, a detailed transmission framework involving both training and data symbols is presented. In the data transmission phase, a comb-type OFDM symbol consisting of both pilots and data symbols is proposed to track phase noise over an OFDM frame. Next, a novel algorithm that applies the training symbols to jointly estimate the channel responses, CFO, and phase noise based on the maximum a posteriori criterion is proposed. Additionally, a new \emph{hybrid Cramér-Rao lower bound} for evaluating the performance of channel estimation and carrier recovery algorithms in OFDM relay networks is derived. Finally, an iterative receiver for joint phase noise estimation and data detection at the destination node is derived. Extensive simulations demonstrate that the application of the proposed estimation and receiver blocks significantly improves the performance of OFDM relay networks in the presence of phase noise.

preprint2013arXiv

Counterexample-Preserving Reduction for Symbolic Model Checking

The cost of LTL model checking is highly sensitive to the length of the formula under verification. We observe that, under some specific conditions, the input LTL formula can be reduced to an easier-to-handle one before model checking. In our reduction, these two formulae need not to be logically equivalent, but they share the same counterexample set w.r.t the model. In the case that the model is symbolically represented, the condition enabling such reduction can be detected with a lightweight effort (e.g., with SAT-solving). In this paper, we tentatively name such technique "Counterexample-Preserving Reduction" (CePRe for short), and finally the proposed technquie is experimentally evaluated by adapting NuSMV.

preprint2013arXiv

Effect of Temperature and Charged Particle Fluence on the Resistivity of Polycrystalline CVD Diamond Sensors

The resistivity of polycrystalline chemical vapor deposition diamond sensors is studied in samples exposed to fluences relevant to the environment of the High Luminosity Large Hadron Collider. We measure the leakage current for a range of bias voltages on samples irradiated with 800 MeV protons up to 1.6\times 10^{16} p/cm^2. The proton beam at LANSCE, Los Alamos National Laboratory, was applied to irradiate the samples. The devices' resistivity is extracted for temperatures in the -10^\circC to +20^\circC range.

preprint2013arXiv

Floquet Weyl Semimetal Induced by Off-Resonant Light

We propose that a Floquet Weyl semimetal state can be induced in three-dimensional topological insulators, either nonmagnetic or magnetic, by the application of off-resonant light. The virtual photon processes play a critical role in renormalizing the Dirac mass and so resulting in a topological semimetal with vanishing gap at Weyl points. The present mechanism via off-resonant light is quite different from that via on-resonant light, the latter being recently suggested to give rise to a Floquet topological state in ordinary band insulators.

preprint2013arXiv

Production and spectroscopy of hadrons containing a b quark at ATLAS

We present studies of the production and spectroscopy of some members of the B-hadron family. We reconstruct B ground states in their hadronic decay modes with a J/ψ or Υ in the final state. These studies are based on the 2011 7 TeV dataset collected by the ATLAS detector.

preprint2013arXiv

RITA: An Index-Tuning Advisor for Replicated Databases

Given a replicated database, a divergent design tunes the indexes in each replica differently in order to specialize it for a specific subset of the workload. This specialization brings significant performance gains compared to the common practice of having the same indexes in all replicas, but requires the development of new tuning tools for database administrators. In this paper we introduce RITA (Replication-aware Index Tuning Advisor), a novel divergent-tuning advisor that offers several essential features not found in existing tools: it generates robust divergent designs that allow the system to adapt gracefully to replica failures; it computes designs that spread the load evenly among specialized replicas, both during normal operation and when replicas fail; it monitors the workload online in order to detect changes that require a recomputation of the divergent design; and, it offers suggestions to elastically reconfigure the system (by adding/removing replicas or adding/dropping indexes) to respond to workload changes. The key technical innovation behind RITA is showing that the problem of selecting an optimal design can be formulated as a Binary Integer Program (BIP). The BIP has a relatively small number of variables, which makes it feasible to solve it efficiently using any off-the-shelf linear-optimization software. Experimental results demonstrate that RITA computes better divergent designs compared to existing tools, offers more features, and has fast execution times.

preprint2013arXiv

Secure Beamforming for MIMO Two-Way Communications with an Untrusted Relay

This paper studies the secure beamforming design in a multiple-antenna three-node system where two source nodes exchange messages with the help of an untrusted relay node. The relay acts as both an essential signal forwarder and a potential eavesdropper. Both two-phase and three-phase two-way relay strategies are considered. Our goal is to jointly optimize the source and relay beamformers for maximizing the secrecy sum rate of the two-way communications. We first derive the optimal relay beamformer structures. Then, iterative algorithms are proposed to find source and relay beamformers jointly based on alternating optimization. Furthermore, we conduct asymptotic analysis on the maximum secrecy sum-rate. Our analysis shows that when all transmit powers approach infinity, the two-phase two-way relay scheme achieves the maximum secrecy sum rate if the source beamformers are designed such that the received signals at the relay align in the same direction. This reveals an important advantage of signal alignment technique in against eavesdropping. It is also shown that if the source powers approach zero the three-phase scheme performs the best while the two-phase scheme is even worse than direct transmission. Simulation results have verified the efficiency of the secure beamforming algorithms as well as the analytical findings.

preprint2013arXiv

Third harmonic generation in graphene and few-layer graphite films

We observe optical third harmonic generation from graphene and few-layer graphite flakes produced by exfoliation. The emission scales with the cube of the intensity of the incident near-infrared femtosecond pulses and has a wavelength that is one-third of the incident wavelength, both consistent with third harmonic generation. We extract an effective third-order susceptibility for graphene that is on the order of 1e-16 square meter per square volt, which is comparable to that for materials that are resonantly excited, but larger than for materials that are transparent at the fundamental and third harmonic wavelengths. By measuring a set of flakes with different numbers of atomic layers, we find that the emission scales with the square of the number of atomic layers, which suggests that the susceptibility of graphene is independent of layer number, at least for a few layers.

preprint2012arXiv

An Empirical Study of MAUC in Multi-class Problems with Uncertain Cost Matrices

Cost-sensitive learning relies on the availability of a known and fixed cost matrix. However, in some scenarios, the cost matrix is uncertain during training, and re-train a classifier after the cost matrix is specified would not be an option. For binary classification, this issue can be successfully addressed by methods maximizing the Area Under the ROC Curve (AUC) metric. Since the AUC can measure performance of base classifiers independent of cost during training, and a larger AUC is more likely to lead to a smaller total cost in testing using the threshold moving method. As an extension of AUC to multi-class problems, MAUC has attracted lots of attentions and been widely used. Although MAUC also measures performance of base classifiers independent of cost, it is unclear whether a larger MAUC of classifiers is more likely to lead to a smaller total cost. In fact, it is also unclear what kinds of post-processing methods should be used in multi-class problems to convert base classifiers into discrete classifiers such that the total cost is as small as possible. In the paper, we empirically explore the relationship between MAUC and the total cost of classifiers by applying two categories of post-processing methods. Our results suggest that a larger MAUC is also beneficial. Interestingly, simple calibration methods that convert the output matrix into posterior probabilities perform better than existing sophisticated post re-optimization methods.

preprint2012arXiv

Approximate Principal Direction Trees

We introduce a new spatial data structure for high dimensional data called the \emph{approximate principal direction tree} (APD tree) that adapts to the intrinsic dimension of the data. Our algorithm ensures vector-quantization accuracy similar to that of computationally-expensive PCA trees with similar time-complexity to that of lower-accuracy RP trees. APD trees use a small number of power-method iterations to find splitting planes for recursively partitioning the data. As such they provide a natural trade-off between the running-time and accuracy achieved by RP and PCA trees. Our theoretical results establish a) strong performance guarantees regardless of the convergence rate of the power-method and b) that $O(\log d)$ iterations suffice to establish the guarantee of PCA trees when the intrinsic dimension is $d$. We demonstrate this trade-off and the efficacy of our data structure on both the CPU and GPU.

preprint2012arXiv

Exciton diffusion in semiconducting single-wall carbon nanotubes studied by transient absorption microscopy

Spatiotemporal dynamics of excitons in isolated semiconducting single-walled carbon nanotubes are studied using transient absorption microscopy. Differential reflection and transmission of an 810-nm probe pulse after excitation by a 750-nm pump pulse are measured. We observe a bi-exponentially decaying signal with a fast time constant of 0.66 ps and a slower time constant of 2.8 ps. Both constants are independent of the pump fluence. By spatially and temporally resolving the differential reflection, we are able to observe a diffusion of excitons, and measure a diffusion coefficient of 200 cm2/s at room temperature and 300 cm2/s at lower temperatures of 10 K and 150 K.

preprint2012arXiv

Linear Precoding Designs for Amplify-and-Forward Multiuser Two-Way Relay Systems

Two-way relaying can improve spectral efficiency in two-user cooperative communications. It also has great potential in multiuser systems. A major problem of designing a multiuser two-way relay system (MU-TWRS) is transceiver or precoding design to suppress co-channel interference. This paper aims to study linear precoding designs for a cellular MU-TWRS where a multi-antenna base station (BS) conducts bi-directional communications with multiple mobile stations (MSs) via a multi-antenna relay station (RS) with amplify-and-forward relay strategy. The design goal is to optimize uplink performance, including total mean-square error (Total-MSE) and sum rate, while maintaining individual signal-to-interference-plus-noise ratio (SINR) requirement for downlink signals. We show that the BS precoding design with the RS precoder fixed can be converted to a standard second order cone programming (SOCP) and the optimal solution is obtained efficiently. The RS precoding design with the BS precoder fixed, on the other hand, is non-convex and we present an iterative algorithm to find a local optimal solution. Then, the joint BS-RS precoding is obtained by solving the BS precoding and the RS precoding alternately. Comprehensive simulation is conducted to demonstrate the effectiveness of the proposed precoding designs.

preprint2012arXiv

Minimax Classifier for Uncertain Costs

Many studies on the cost-sensitive learning assumed that a unique cost matrix is known for a problem. However, this assumption may not hold for many real-world problems. For example, a classifier might need to be applied in several circumstances, each of which associates with a different cost matrix. Or, different human experts have different opinions about the costs for a given problem. Motivated by these facts, this study aims to seek the minimax classifier over multiple cost matrices. In summary, we theoretically proved that, no matter how many cost matrices are involved, the minimax problem can be tackled by solving a number of standard cost-sensitive problems and sub-problems that involve only two cost matrices. As a result, a general framework for achieving minimax classifier over multiple cost matrices is suggested and justified by preliminary empirical studies.

preprint2012arXiv

Non-Hierarchical Clock Synchronization for Wireless Sensor Networks

Time synchronization is important for a variety of applications in wireless sensor networks including scheduling communication resources, coordinating sensor wake/sleep cycles, and aligning signals for distributed transmission/reception. This paper describes a non-hierarchical approach to time synchronization in wireless sensor networks that has low overhead and can be implemented at the physical and/or MAC layers. Unlike most of the prior approaches, the approach described in this paper allows all nodes to use exactly the same distributed algorithm and does not require local averaging of measurements from other nodes. Analytical results show that the non-hierarchical approach can provide monotonic expected convergence of both drifts and offsets under broad conditions on the network topology and local clock update stepsize. Numerical results are also presented verifying the analysis under two particular network topologies.

preprint2012arXiv

Optimal Linear Transceiver Designs for Cognitive Two-Way Relay Networks

This paper studies a cooperative cognitive radio network where two primary users (PUs) exchange information with the help of a secondary user (SU) that is equipped with multiple antennas and in return, the SU superimposes its own messages along with the primary transmission. The fundamental problem in the considered network is the design of transmission strategies at the secondary node. It involves three basic elements: first, how to split the power for relaying the primary signals and for transmitting the secondary signals; second, what two-way relay strategy should be used to assist the bidirectional communication between the two PUs; third, how to jointly design the primary and secondary transmit precoders. This work aims to address this problem by proposing a transmission framework of maximizing the achievable rate of the SU while maintaining the rate requirements of the two PUs. Three well-known and practical two-way relay strategies are considered: amplify-and-forward (AF), bit level XOR based decode-and-forward (DF-XOR) and symbol level superposition coding based DF (DF-SUP). For each relay strategy, although the design problem is non-convex, we find the optimal solution by using certain transformation techniques and optimization tools such as semidefinite programming (SDP) and second-order cone programming (SOCP). Closed-form solutions are also obtained under certain conditions. Simulation results show that when the rate requirements of the two PUs are symmetric, by using the DF-XOR strategy and applying the proposed optimal precoding, the SU requires the least power for relaying and thus reserves the most power to transmit its own signal. In the asymmetric scenario, on the other hand, the DF-SUP strategy with the corresponding optimal precoding is the best.

preprint2012arXiv

Optimal Sampling Points in Reproducing Kernel Hilbert Spaces

The recent developments of basis pursuit and compressed sensing seek to extract information from as few samples as possible. In such applications, since the number of samples is restricted, one should deploy the sampling points wisely. We are motivated to study the optimal distribution of finite sampling points. Formulation under the framework of optimal reconstruction yields a minimization problem. In the discrete case, we estimate the distance between the optimal subspace resulting from a general Karhunen-Loeve transform and the kernel space to obtain another algorithm that is computationally favorable. Numerical experiments are then presented to illustrate the performance of the algorithms for the searching of optimal sampling points.

preprint2012arXiv

Reversed Drifting Quasi-periodic Pulsating Structure in an X1.3 Solar Flare on 2005 July 30

Based on the analysis of the microwave observations at frequency of 2.60 -- 3.80 GHz in a solar X1.3 flare event observed at Solar Broadband RadioSpectrometer in Huairou (SBRS/Huairou) on 2005 July 30, an interesting reversed drifting quasi-periodic pulsating structure (R-DPS) is confirmed. The R-DPS is mainly composed of two drifting pulsating components: one is a relatively slow very short-period pulsation (VSP) with period of about 130 -- 170 ms, the other is a relatively fast VSP with period of about 70 -- 80 ms. The R-DPS has a weak left-handed circular polarization. Based on the synthetic investigations of Reuven Ramaty High Energy Solar Spectroscopic Imaging (RHESSI) hard X-ray, Geostationary Operational Environmental Satellite (GOES) soft X-ray observation, and magnetic field extrapolation, we suggest the R-DPS possibly reflects flaring dynamic processes of the emission source regions.

preprint2012arXiv

Ultrafast and spatially resolved studies of charge carriers in atomically-thin molybdenum disulfide

Atomically-thin molybdenum disulfide is emerging as a new nanomaterial with potential applications in the fields of electronic and photonics. Charge carrier dynamics plays an essential role in determining its electronic and optical properties. We report spatially and temporally resolved pump-probe studies of charge carriers in atomically-thin molybdenum disulfide samples fabricated by mechanical exfoliation. Carriers are injected by interband absorption of a 390-nm pump pulse and detected by measuring differential reflection of a time-delayed and spatially-scanned probe pulse that is tuned to an exciton transition. Several parameters on charge carrier dynamics are deduced, including carrier lifetime, diffusion coefficient, diffusion length, and mobility.

preprint2011arXiv

A New and Efficient Algorithm-Based Fault Tolerance Scheme for A Million Way Parallelism

Fault tolerance overhead of high performance computing (HPC) applications is becoming critical to the efficient utilization of HPC systems at large scale. HPC applications typically tolerate fail-stop failures by checkpointing. Another promising method is in the algorithm level, called algorithmic recovery. These two methods can achieve high efficiency when the system scale is not very large, but will both lose their effectiveness when systems approach the scale of Exaflops, where the number of processors including in system is expected to achieve one million. This paper develops a new and efficient algorithm-based fault tolerance scheme for HPC applications. When failure occurs during the execution, we do not stop to wait for the recovery of corrupted data, but replace them with the corresponding redundant data and continue the execution. A background accelerated recovery method is also proposed to rebuild redundancy to tolerate multiple times of failures during the execution. To demonstrate the feasibility of our new scheme, we have incorporated it to the High Performance Linpack. Theoretical analysis demonstrates that our new fault tolerance scheme can still be effective even when the system scale achieves the Exaflops. Experiment using SiCortex SC5832 verifies the feasibility of the scheme, and indicates that the advantage of our scheme can be observable even in a small scale.

preprint2011arXiv

A Survey on Delay-Aware Resource Control for Wireless Systems --- Large Deviation Theory, Stochastic Lyapunov Drift and Distributed Stochastic Learning

In this tutorial paper, a comprehensive survey is given on several major systematic approaches in dealing with delay-aware control problems, namely the equivalent rate constraint approach, the Lyapunov stability drift approach and the approximate Markov Decision Process (MDP) approach using stochastic learning. These approaches essentially embrace most of the existing literature regarding delay-aware resource control in wireless systems. They have their relative pros and cons in terms of performance, complexity and implementation issues. For each of the approaches, the problem setup, the general solution and the design methodology are discussed. Applications of these approaches to delay-aware resource allocation are illustrated with examples in single-hop wireless networks. Furthermore, recent results regarding delay-aware multi-hop routing designs in general multi-hop networks are elaborated. Finally, the delay performance of the various approaches are compared through simulations using an example of the uplink OFDMA systems.

preprint2011arXiv

Ab initio study of the thermodynamic properties of rare-earthmagnesium intermetallics MgRE (RE=Y, Dy, Pr, Tb)

We have performed an ab initio study of the thermodynamical properties of rare-earth-magnesium intermetallic compounds MgRE (RE=Y, Dy, Pr, Tb) with CsCl-type B2-type structures. The calculations have been carried out the density functional theory and density functional perturbation theory in combination with the quasiharmonic approximation. The phonon-dispersion curves and phonon total and partial density of states have been investigated. Our results show that the contribution of RE atoms is dominant in phonon frequency, and this character agrees with the previous discussion by using atomistic simulations. The temperature dependence of various quantities such as the thermal expansions, bulk modulus, and the heat capacity are obtained. The electronic contributions to the specific heat are discussed, and found to be important for the calculated MgRE intermetallics.

preprint2011arXiv

Edge dislocation core structures in FCC metals determined from ab initio calculations combined with the improved Peierls-Nabarro equation

We have employed the improved Peierls-Nabarro (P-N) equation to study the properties of 1/2<110> edge dislocation in {111} plane in FCC metals Al, Cu, Ir, Pd, and Pt. The generalized-stacking-fault energy (GSFE) surface entering the equation is calculated by using first-principles density functional theory (DFT). The accuracy of the method has been tested by calculating values for various stacking fault energies which favorably compare with the previous theoretical and experimental results. The core structures, including the core widths both of the edge and screw components, dissociation behavior for edge dislocations have been investigated. The dissociated distance between two partials for Al in our calculation agrees well with the values obtained from the numeric simulation with DFT and molecular dynamics simulation, as well as experiment. Our calculations show that it is preferred to create partial dislocation in Cu, and to be easily observed full dislocation in Al, Ir, Pd, and especially Pt.

preprint2011arXiv

Feature Selection for MAUC-Oriented Classification Systems

Feature selection is an important pre-processing step for many pattern classification tasks. Traditionally, feature selection methods are designed to obtain a feature subset that can lead to high classification accuracy. However, classification accuracy has recently been shown to be an inappropriate performance metric of classification systems in many cases. Instead, the Area Under the receiver operating characteristic Curve (AUC) and its multi-class extension, MAUC, have been proved to be better alternatives. Hence, the target of classification system design is gradually shifting from seeking a system with the maximum classification accuracy to obtaining a system with the maximum AUC/MAUC. Previous investigations have shown that traditional feature selection methods need to be modified to cope with this new objective. These methods most often are restricted to binary classification problems only. In this study, a filter feature selection method, namely MAUC Decomposition based Feature Selection (MDFS), is proposed for multi-class classification problems. To the best of our knowledge, MDFS is the first method specifically designed to select features for building classification systems with maximum MAUC. Extensive empirical results demonstrate the advantage of MDFS over several compared feature selection methods.

preprint2011arXiv

First-principles calculations of phonon and thermodynamic properties of AlRE (RE= Y, Gd, Pr, Yb) intermetallic compounds

The phonon and thermodynamic properties of rare-earth-aluminum intermetallics AlRE (RE=Y, Gd, Pr, Yb) with B2-type structure are investigated by performing density functional theory and density functional perturbation theory within the quasiharmonic approximation. The phonon spectra and phonon density of states, including the phonon partial density of states and total density of states, have been discussed. Our results demonstrate that the density of states is mostly composed of Al states at the high frequency. The temperature dependence of various quantities such as the thermal expansions, the heat capacities at constant volume and constant pressure, the isothermal bulk modulus, and the entropy are obtained. The electronic contribution to the specific heat is discussed, and the presented results show that the thermal electronic excitation affecting the thermal properties is inessential.

preprint2011arXiv

First-principles calculations on finite temperature elastic properties of B2-AlRE (RE=Y, Tb, Pr, Nd, Dy) intermetallics

We have investigated the finite temperature elastic properties of AlRE (RE=Y, Tb, Pr, Nd, Dy) with B2-type structures from first principles. The phonon free energy and thermal expansion is obtained from the quasiharmonic approach based on density-functional perturbation theory. The static volume-dependent elastic constants are obtained from energy-strain functions by using the first-principles total-energy method. The comparison between our predicted results and the ultrasonic experimental data for a benchmark material Al provides excellent agreements. At T = 0K, our calculated values of lattice equilibrium volume and elastic moduli of our calculated AlRE (RE=Y, Tb, Pr, Nd, Dy) intermetallics agree well with the previous theoretical results. The temperature dependent elastic constants exhibit a normal behavior with temperature, i.e., decrease and approach linearity at higher temperature and zero slope around zero temperature. Furthermore, the anisotropy ratio and sound velocities as a function of temperature has also been discussed.

preprint2011arXiv

Joint Source and Relay Precoding Designs for MIMO Two-Way Relaying Based on MSE Criterion

Properly designed precoders can significantly improve the spectral efficiency of multiple-input multiple-output (MIMO) relay systems. In this paper, we investigate joint source and relay precoding design based on the mean-square-error (MSE) criterion in MIMO two-way relay systems, where two multi-antenna source nodes exchange information via a multi-antenna amplify-and-forward relay node. This problem is non-convex and its optimal solution remains unsolved. Aiming to find an efficient way to solve the problem, we first decouple the primal problem into three tractable sub-problems, and then propose an iterative precoding design algorithm based on alternating optimization. The solution to each sub-problem is optimal and unique, thus the convergence of the iterative algorithm is guaranteed. Secondly, we propose a structured precoding design to lower the computational complexity. The proposed precoding structure is able to parallelize the channels in the multiple access (MAC) phase and broadcast (BC) phase. It thus reduces the precoding design to a simple power allocation problem. Lastly, for the special case where only a single data stream is transmitted from each source node, we present a source-antenna-selection (SAS) based precoding design algorithm. This algorithm selects only one antenna for transmission from each source and thus requires lower signalling overhead. Comprehensive simulation is conducted to evaluate the effectiveness of all the proposed precoding designs.

preprint2011arXiv

Opportunistic Buffered Decode-Wait-and-Forward (OBDWF) Protocol for Mobile Wireless Relay Networks

In this paper, we propose an opportunistic buffered decode-wait-and-forward (OBDWF) protocol to exploit both relay buffering and relay mobility to enhance the system throughput and the end-to-end packet delay under bursty arrivals. We consider a point-to-point communication link assisted by K mobile relays. We illustrate that the OBDWF protocol could achieve a better throughput and delay performance compared with existing baseline systems such as the conventional dynamic decode-and-forward (DDF) and amplified-and-forward (AF) protocol. In addition to simulation performance, we also derived closed-form asymptotic throughput and delay expressions of the OBDWF protocol. Specifically, the proposed OBDWF protocol achieves an asymptotic throughput O(logK) with O(1) total transmit power in the relay network. This is a significant gain compared with the best known performance in conventional protocols (O(logK) throughput with O(K) total transmit power). With bursty arrivals, we show that both the stability region and average delay of the proposed OBDWF protocol can achieve order-wise performance gain O(K) compared with conventional DDF protocol.

preprint2011arXiv

Optical pump-probe studies of carrier dynamics in few-layer MoS2

Femtosecond optical pump-probe technique is used to study charge carrier dynamics in few-layer MoS2 samples fabricated by mechanical exfoliation. An ultrafast pump pulse excites carriers and differential reflection of a probe pulse tuned to an excitonic resonance is detected. We find that the spectrum of the differential reflection signal is similar to the derivative of the resonance. The decay time of the signal is in the range of 150 to 400 ps, increasing with the probe wavelength. With a fixed probe wavelength, the signal magnitude is proportional to the pump fluence, while the decay time is independent of it.

preprint2011arXiv

The temperature-dependent elastic properties of B2-MgRE intermetallic compounds from first principles

Using the density functional theory (DFT) formulated within the framework of the plane-wave basis projector augmented wave (PAW) method, the temperature-dependent elastic properties of MgRE (RE=Y, Dy, Pr, Sc, Tb) intermetallics with B2-type structure are presented from first-principles. Our calculations are based on the fact that the elastic moduli as a function of temperature mainly results from thermal expansion. The comparison between the predicted results and the available experimental data for a benchmark material NiAl provides good agreements. At $T=0K$, our calculated values of lattice parameter and elastic moduli for MgRE intermetallics show excellent agreement with previous theoretical results and experimental data. While temperature increases, we find that the elastic constants decrease and approach linearity at higher temperature and zero slope around zero temperature.

preprint2011arXiv

The third-order elastic moduli and pressure derivatives for AlRE (RE=Y, Pr, Nd, Tb, Dy, Ce) intermetallics with B2-structure: A first-principles study

The third-order elastic moduli and pressure derivatives of the second-order elastic constants of novel B2-type AlRE (RE=Y, Pr, Nd, Tb, Dy, Ce) intermetallics are presented from first-principles calculations. The elastic moduli are obtained from the coefficients of the polynomials from the nonlinear least-squares fitting of the energy-strain functions. The calculated second-order elastic constants of AlRE intermetallics are consistent with the previous calculations. To judge that our computational accuracy is reasonable, the calculated third-order constants of Al are compared with the available experimental data and other theoretical results and found very good agreement. In comparison with the theory of the linear elasticity, the third-order effects are very important with the finite strains are lager than approximately 3.5%. Finally, the pressure derivative has been discussed.

preprint2010arXiv

Blind Spectrum Sensing by Information Theoretic Criteria for Cognitive Radios

Spectrum sensing is a fundamental and critical issue for opportunistic spectrum access in cognitive radio networks. Among the many spectrum sensing methods, the information theoretic criteria (ITC) based method is a promising blind method which can reliably detect the primary users while requiring little prior information. In this paper, we provide an intensive treatment on the ITC sensing method. To this end, we first introduce a new over-determined channel model constructed by applying multiple antennas or over sampling at the secondary user in order to make the ITC applicable. Then, a simplified ITC sensing algorithm is introduced, which needs to compute and compare only two decision values. Compared with the original ITC sensing algorithm, the simplified algorithm significantly reduces the computational complexity without losing any performance. Applying the recent advances in random matrix theory, we then derive closed-form expressions to tightly approximate both the probability of false alarm and probability of detection. Based on the insight derived from the analytical study, we further present a generalized ITC sensing algorithm which can provide flexible tradeoff between the probability of detection and probability of false alarm. Finally, comprehensive simulations are carried out to evaluate the performance of the proposed ITC sensing algorithms. Results show that they considerably outperform other blind spectrum sensing methods in certain cases.

preprint2010arXiv

Decentralized Fair Scheduling in Two-Hop Relay-Assisted Cognitive OFDMA Systems

In this paper, we consider a two-hop relay-assisted cognitive downlink OFDMA system (named as secondary system) dynamically accessing a spectrum licensed to a primary network, thereby improving the efficiency of spectrum usage. A cluster-based relay-assisted architecture is proposed for the secondary system, where relay stations are employed for minimizing the interference to the users in the primary network and achieving fairness for cell-edge users. Based on this architecture, an asymptotically optimal solution is derived for jointly controlling data rates, transmission power, and subchannel allocation to optimize the average weighted sum goodput where the proportional fair scheduling (PFS) is included as a special case. This solution supports decentralized implementation, requires small communication overhead, and is robust against imperfect channel state information at the transmitter (CSIT) and sensing measurement. The proposed solution achieves significant throughput gains and better user-fairness compared with the existing designs. Finally, we derived a simple and asymptotically optimal scheduling solution as well as the associated closed-form performance under the proportional fair scheduling for a large number of users. The system throughput is shown to be $\mathcal{O}\left(N(1-q_p)(1-q_p^N)\ln\ln K_c\right)$, where $K_c$ is the number of users in one cluster, $N$ is the number of subchannels and $q_p$ is the active probability of primary users.

preprint2010arXiv

Negative refraction at deep-ultraviolet frequency in monocrystalline graphite

Negative refraction is such a prominent electromagnetic phenomenon that most researchers believe it can only occur in artificially engineered metamaterials. In this article, we report negative refraction for all incident angles for the first time in a naturally existing material. Using ellipsometry measurement of the equifrequency contour in the deep-ultraviolet frequency region (typically 254 nm), obvious negative refraction was demonstrated in monocrystalline graphite for incident angles ranging from 20o to 70o. This negative refraction is attributed to extremely strong anisotropy in the crystal structure of graphite, which gives the crystal indefinite permeability. This result not only explores a new route to identifying natural negative-index materials, but it also holds promise for the development of an ultraviolet hyperlens, which may lead to a breakthrough in nanolithography, the most critical technology necessary for the next generation of electronics.

preprint2010arXiv

Nonparametric inference procedure for percentiles of the random effects distribution in meta-analysis

To investigate whether treating cancer patients with erythropoiesis-stimulating agents (ESAs) would increase the mortality risk, Bennett et al. [Journal of the American Medical Association 299 (2008) 914--924] conducted a meta-analysis with the data from 52 phase III trials comparing ESAs with placebo or standard of care. With a standard parametric random effects modeling approach, the study concluded that ESA administration was significantly associated with increased average mortality risk. In this article we present a simple nonparametric inference procedure for the distribution of the random effects. We re-analyzed the ESA mortality data with the new method. Our results about the center of the random effects distribution were markedly different from those reported by Bennett et al. Moreover, our procedure, which estimates the distribution of the random effects, as opposed to just a simple population average, suggests that the ESA may be beneficial to mortality for approximately a quarter of the study populations. This new meta-analysis technique can be implemented with study-level summary statistics. In contrast to existing methods for parametric random effects models, the validity of our proposal does not require the number of studies involved to be large. From the results of an extensive numerical study, we find that the new procedure performs well even with moderate individual study sample sizes.

preprint2010arXiv

Queue-Aware Distributive Resource Control for Delay-Sensitive Two-Hop MIMO Cooperative Systems

In this paper, we consider a queue-aware distributive resource control algorithm for two-hop MIMO cooperative systems. We shall illustrate that relay buffering is an effective way to reduce the intrinsic half-duplex penalty in cooperative systems. The complex interactions of the queues at the source node and the relays are modeled as an average-cost infinite horizon Markov Decision Process (MDP). The traditional approach solving this MDP problem involves centralized control with huge complexity. To obtain a distributive and low complexity solution, we introduce a linear structure which approximates the value function of the associated Bellman equation by the sum of per-node value functions. We derive a distributive two-stage two-winner auction-based control policy which is a function of the local CSI and local QSI only. Furthermore, to estimate the best fit approximation parameter, we propose a distributive online stochastic learning algorithm using stochastic approximation theory. Finally, we establish technical conditions for almost-sure convergence and show that under heavy traffic, the proposed low complexity distributive control is global optimal.

Rui Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

273 published item(s)

Federated Cross-Client Subgraph Pattern Detection

Large-Small Model Collaboration for Farmland Semantic Change Detection

PrepBench: How Far Are We from Natural-Language-Driven Data Preparation?

SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

TouchMap-OR: Multi-View 3D Mapping of Hand-Surface Contacts

Towards Backdoor-Based Ownership Verification for Vision-Language-Action Models

Decoupling Constraint from Two Direction in Evolutionary Constrained Multi-objective Optimization

Observation and manipulation of quantum interference in a superconducting Kerr parametric oscillator

Phase Transition Study meets Machine Learning

Structured Learning in Time-dependent Cox Models

A Theory of Human-Like Few-Shot Learning

An Indoor Environment Sensing and Localization System via mmWave Phased Array

Boosting Large Language Model for Speech Synthesis: An Empirical Study

On the nature of overcharging and charge inversion in electrical double layers

PACO: Parts and Attributes of Common Objects

Universal Multimodal Representation for Language Understanding

Workload Failure Prediction for Data Centers

A new class of bilayer kagome lattice compounds with Dirac nodal lines and pressure-induced superconductivity

A novel method for data augmentation: Nine Dot Moving Least Square (ND-MLS)

A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation

Accelerating Edge Intelligence via Integrated Sensing and Communication

Accelerating Federated Edge Learning via Topology Optimization

An approximate randomization test for high-dimensional two-sample Behrens-Fisher problem under arbitrary covariances

An Interactive Image-based Modeling System

Analysis of the distribution, rotation and scale characteristics of solar wind switchbacks: comparison between the first and second encounters of Parker Solar Probe

Approximately Equivariant Networks for Imperfectly Symmetric Dynamics

Beamforming Design and Performance Evaluation for Reconfigurable Intelligent Surface Assisted Wireless Communication Systems With Non-Ideal Hardware

BEVT: BERT Pretraining of Video Transformers

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

Buildup of the Magnetic Flux Ropes in Homologous Solar Eruptions

CCP: Correlated Clustering and Projection for Dimensionality Reduction

Chern-Simons superconductors and their instabilities

Collaborative Representation for SPD Matrices with Application to Image-Set Classification

Cross-domain Contrastive Learning for Unsupervised Domain Adaptation

Data Augmentation vs. Equivariant Networks: A Theory of Generalization on Dynamics Forecasting

Deep Reinforcement Learning for Online Routing of Unmanned Aerial Vehicles with Wireless Power Transfer

Determining the number of factors in a large-dimensional generalised factor model

DreamNet: A Deep Riemannian Network based on SPD Manifold Learning for Visual Classification

Edge Federated Learning Via Unit-Modulus Over-The-Air Computation

Effect of Homomorphic Encryption on the Performance of Training Federated Learning Generative Adversarial Networks

Efficient and High-quality Prehensile Rearrangement in Cluttered and Confined Spaces

Efficient conditioned face animation using frontally-viewed embedding

EfficientTDNN: Efficient Architecture Search for Speaker Recognition

Evaluation of Four Black-box Adversarial Attacks and Some Query-efficient Improvement Analysis

Exact solutions of Kondo problems in higher-order fermions

Fano Interference in a Single-Molecule Junction

FLVoogd: Robust And Privacy Preserving Federated Learning

Grassmannian Discriminant Maps (GDM) for Manifold Dimensionality Reduction with Application to Image Set Classification

High quality entanglement distribution through telecommunication fiber using near-infrared non-degenerate photon pairs

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review

Improving Deep Image Matting via Local Smoothness Assumption

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

M Subdwarf Research III. Spectroscopic Diagnostics for Breaking Parameter Degeneracy

MINERVAS: Massive INterior EnviRonments VirtuAl Synthesis

Model Pruning Based on Quantified Similarity of Feature Maps

Morphological Anti-Aliasing Method for Boundary Slope Prediction

MULTI-FLGANs: Multi-Distributed Adversarial Networks for Non-IID distribution

Multi-objective Pointer Network for Combinatorial Optimization

Multi-View Self-Attention Based Transformer for Speaker Recognition

Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios

Normal and Visibility Estimation of Human Face from a Single Image

On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR

Online Object Model Reconstruction and Reuse for Lifelong Improvement of Robot Manipulation

Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation

Parameter Choices for Sparse Regularization with the $\ell_1$ Norm

Passive Motion Detection via mmWave Communication System

Persistent Homology for Effective Non-Prehensile Manipulation

Persistent Path Laplacian

PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

Q-balls Formation and the Production of Gravitational Waves With Non-minimal Gravitational Coupling

R2: A Distributed Remote Function Execution Mechanism With Built-in Metadata

RAW-GNN: RAndom Walk Aggregation based Graph Neural Network

Real-time Monte Carlo Denoising with Weight Sharing Kernel Prediction Network