Source author record

Xu Chen

Xu Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

117works

36topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CurEvo: Curriculum-Guided Self-Evolution for Video Understanding

Recent advances in self-evolution video understanding frameworks have demonstrated the potential of autonomous learning without human annotations. However, existing methods often suffer from weakly controlled optimization and uncontrolled difficulty progression, as they lack structured guidance throughout the iterative learning process. To address these limitations, we propose CurEvo, a curriculum-guided self-evolution framework that introduces curriculum learning into self-evolution to achieve more structured and progressive model improvement. CurEvo dynamically regulates task difficulty, refines evaluation criteria, and balances data diversity according to model competence, forming a curriculum-guided feedback loop that aligns learning complexity with model capability. Built upon this principle, we develop a multi-dimensional adaptive QA framework that jointly evolves question generation and answer evaluation across perception, recognition, and understanding dimensions, ensuring coherent and measurable curriculum progression. Through this integration, CurEvo transforms weakly controlled self-evolution into a more structured learning process for autonomous video understanding. Across seven backbones, CurEvo consistently improves both benchmark accuracy and evaluator-based semantic score on four VideoQA benchmarks, validating the effectiveness of curriculum-guided self-evolution for video understanding.

preprint2023arXiv

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Federated learning (FL) is a promising paradigm that enables collaboratively learning a shared model across massive clients while keeping the training data locally. However, for many existing FL systems, clients need to frequently exchange model parameters of large data size with the remote cloud server directly via wide-area networks (WAN), leading to significant communication overhead and long transmission time. To mitigate the communication bottleneck, we resort to the hierarchical federated learning paradigm of HiFL, which reaps the benefits of mobile edge computing and combines synchronous client-edge model aggregation and asynchronous edge-cloud model aggregation together to greatly reduce the traffic volumes of WAN transmissions. Specifically, we first analyze the convergence bound of HiFL theoretically and identify the key controllable factors for model performance improvement. We then advocate an enhanced design of HiFlash by innovatively integrating deep reinforcement learning based adaptive staleness control and heterogeneity-aware client-edge association strategy to boost the system efficiency and mitigate the staleness effect without compromising model accuracy. Extensive experiments corroborate the superior performance of HiFlash in model accuracy, communication reduction, and system efficiency.

preprint2023arXiv

Offline Imitation Learning with Variational Counterfactual Reasoning

In offline imitation learning (IL), an agent aims to learn an optimal expert behavior policy without additional online environment interactions. However, in many real-world scenarios, such as robotics manipulation, the offline dataset is collected from suboptimal behaviors without rewards. Due to the scarce expert data, the agents usually suffer from simply memorizing poor trajectories and are vulnerable to variations in the environments, lacking the capability of generalizing to new environments. To automatically generate high-quality expert data and improve the generalization ability of the agent, we propose a framework named \underline{O}ffline \underline{I}mitation \underline{L}earning with \underline{C}ounterfactual data \underline{A}ugmentation (OILCA) by doing counterfactual inference. In particular, we leverage identifiable variational autoencoder to generate \textit{counterfactual} samples for expert data augmentation. We theoretically analyze the influence of the generated expert data and the improvement of generalization. Moreover, we conduct extensive experiments to demonstrate that our approach significantly outperforms various baselines on both \textsc{DeepMind Control Suite} benchmark for in-distribution performance and \textsc{CausalWorld} benchmark for out-of-distribution generalization. Our code is available at \url{https://github.com/ZexuSun/OILCA-NeurIPS23}.

preprint2023arXiv

Professional Network Matters: Connections Empower Person-Job Fit

Online recruitment platforms typically employ Person-Job Fit models in the core service that automatically match suitable job seekers with appropriate job positions. While existing works leverage historical or contextual information, they often disregard a crucial aspect: job seekers' social relationships in professional networks. This paper emphasizes the importance of incorporating professional networks into the Person-Job Fit model. Our innovative approach consists of two stages: (1) defining a Workplace Heterogeneous Information Network (WHIN) to capture heterogeneous knowledge, including professional connections and pre-training representations of various entities using a heterogeneous graph neural network; (2) designing a Contextual Social Attention Graph Neural Network (CSAGNN) that supplements users' missing information with professional connections' contextual information. We introduce a job-specific attention mechanism in CSAGNN to handle noisy professional networks, leveraging pre-trained entity representations from WHIN. We demonstrate the effectiveness of our approach through experimental evaluations conducted across three real-world recruitment datasets from LinkedIn, showing superior performance compared to baseline models.

preprint2023arXiv

Real-Time High-Resolution Pedestrian Detection in Crowded Scenes via Parallel Edge Offloading

To identify dense and small-size pedestrians in surveillance systems, high-resolution cameras are widely deployed, where high-resolution images are captured and delivered to off-the-shelf pedestrian detection models. However, given the highly computation-intensive workload brought by the high resolution, the resource-constrained cameras fail to afford accurate inference in real time. To address that, we propose Hode, an offloaded video analytic framework that utilizes multiple edge nodes in proximity to expedite pedestrian detection with high-resolution inputs. Specifically, Hode can intelligently split high-resolution images into respective regions and then offload them to distributed edge nodes to perform pedestrian detection in parallel. A spatio-temporal flow filtering method is designed to enable context-aware region partitioning, as well as a DRL-based scheduling algorithm to allow accuracy-aware load balance among heterogeneous edge nodes. Extensive evaluation results using realistic prototypes show that Hode can achieve up to 2.01% speedup with very mild accuracy loss.

preprint2023arXiv

Superconductivity in an Orbital-reoriented SnAs Square Lattice: a Case Study of Li0.6Sn2As2 and NaSnAs

Searching for functional square lattices in layered superconductor systems offers an explicit clue to modify the electron behavior and find exotic properties. The trigonal SnAs3 structural units in SnAs-based systems are relatively conformable to distortion, which provides the possibility to achieve structurally topological transformation and higher superconducting transition temperatures. In the present work, the functional As square lattice was realized and activated in Li0.6Sn2As2 and NaSnAs through a topotactic structural transformation of trigonal SnAs3 to square SnAs4 under pressure, resulting in a record-high Tc among all synthesized SnAs-based compounds. Meanwhile, the conductive channel transfers from the out-of-plane pz orbital to the in-plane px+py orbitals, facilitating electron hopping within the square 2D lattice and boosting the superconductivity. The reorientation of p-orbital following a directed local structure transformation provides an effective strategy to modify layered superconductors.

preprint2022arXiv

3D Dense Face Alignment with Fused Features by Aggregating CNNs and GCNs

In this paper, we propose a novel multi-level aggregation network to regress the coordinates of the vertices of a 3D face from a single 2D image in an end-to-end manner. This is achieved by seamlessly combining standard convolutional neural networks (CNNs) with Graph Convolution Networks (GCNs). By iteratively and hierarchically fusing the features across different layers and stages of the CNNs and GCNs, our approach can provide a dense face alignment and 3D face reconstruction simultaneously for the benefit of direct feature learning of 3D face mesh. Experiments on several challenging datasets demonstrate that our method outperforms state-of-the-art approaches on both 2D and 3D face alignment tasks.

preprint2022arXiv

Analog MIMO Communication for One-shot Distributed Principal Component Analysis

A fundamental algorithm for data analytics at the edge of wireless networks is distributed principal component analysis (DPCA), which finds the most important information embedded in a distributed high-dimensional dataset by distributed computation of a reduced-dimension data subspace, called principal components (PCs). In this paper, to support one-shot DPCA in wireless systems, we propose a framework of analog MIMO transmission featuring the uncoded analog transmission of local PCs for estimating the global PCs. To cope with channel distortion and noise, two maximum-likelihood (global) PC estimators are presented corresponding to the cases with and without receive channel state information (CSI). The first design, termed coherent PC estimator, is derived by solving a Procrustes problem and reveals the form of regularized channel inversion where the regulation attempts to alleviate the effects of both receiver noise and data noise. The second one, termed blind PC estimator, is designed based on the subspace channel-rotation-invariance property and computes a centroid of received local PCs on a Grassmann manifold. Using the manifold-perturbation theory, tight bounds on the mean square subspace distance (MSSD) of both estimators are derived for performance evaluation. The results reveal simple scaling laws of MSSD concerning device population, data and channel signal-to-noise ratios (SNRs), and array sizes. More importantly, both estimators are found to have identical scaling laws, suggesting the dispensability of CSI to accelerate DPCA. Simulation results validate the derived results and demonstrate the promising latency performance of the proposed analog MIMO

preprint2022arXiv

Cluster extent inference revisited: quantification and localization of brain activity

Cluster inference based on spatial extent thresholding is the most popular analysis method for finding activated brain areas in neuroimaging. However, the method has several well-known issues. While powerful for finding brain regions with some activation, the method as currently defined does not allow any further quantification or localization of signal. In this paper we repair this gap. We show that cluster-extent inference can be used (1.) to infer the presence of signal in anatomical regions of interest and (2.) to quantify the percentage of active voxels in any cluster or region of interest. These additional inferences come for free, i.e. they do not require any further adjustment of the alpha-level of tests, while retaining full familywise error control. We achieve this extension of the possibilities of cluster inference by an embedding of the method into a closed testing procedure, and solving the graph-theoretic k-separator problem that results from this embedding. The new method can be used in combination with random field theory or permutations. We demonstrate the usefulness of the method in a large-scale application to neuroimaging data from the Neurovault database.

preprint2022arXiv

Collaboration in Participant-Centric Federated Learning: A Game-Theoretical Perspective

Federated learning (FL) is a promising distributed framework for collaborative artificial intelligence model training while protecting user privacy. A bootstrapping component that has attracted significant research attention is the design of incentive mechanism to stimulate user collaboration in FL. The majority of works adopt a broker-centric approach to help the central operator to attract participants and further obtain a well-trained model. Few works consider forging participant-centric collaboration among participants to pursue an FL model for their common interests, which induces dramatic differences in incentive mechanism design from the broker-centric FL. To coordinate the selfish and heterogeneous participants, we propose a novel analytic framework for incentivizing effective and efficient collaborations for participant-centric FL. Specifically, we respectively propose two novel game models for contribution-oblivious FL (COFL) and contribution-aware FL (CAFL), where the latter one implements a minimum contribution threshold mechanism. We further analyze the uniqueness and existence for Nash equilibrium of both COFL and CAFL games and design efficient algorithms to achieve equilibrium solutions. Extensive performance evaluations show that there exists free-riding phenomenon in COFL, which can be greatly alleviated through the adoption of CAFL model with the optimized minimum threshold.

preprint2022arXiv

Debiased Recommendation with Neural Stratification

Debiased recommender models have recently attracted increasing attention from the academic and industry communities. Existing models are mostly based on the technique of inverse propensity score (IPS). However, in the recommendation domain, IPS can be hard to estimate given the sparse and noisy nature of the observed user-item exposure data. To alleviate this problem, in this paper, we assume that the user preference can be dominated by a small amount of latent factors, and propose to cluster the users for computing more accurate IPS via increasing the exposure densities. Basically, such method is similar with the spirit of stratification models in applied statistics. However, unlike previous heuristic stratification strategy, we learn the cluster criterion by presenting the users with low ranking embeddings, which are future shared with the user representations in the recommender model. At last, we find that our model has strong connections with the previous two types of debiased recommender models. We conduct extensive experiments based on real-world datasets to demonstrate the effectiveness of the proposed method.

preprint2022arXiv

Debiased Recommendation with User Feature Balancing

Debiased recommendation has recently attracted increasing attention from both industry and academic communities. Traditional models mostly rely on the inverse propensity score (IPS), which can be hard to estimate and may suffer from the high variance issue. To alleviate these problems, in this paper, we propose a novel debiased recommendation framework based on user feature balancing. The general idea is to introduce a projection function to adjust user feature distributions, such that the ideal unbiased learning objective can be upper bounded by a solvable objective purely based on the offline dataset. In the upper bound, the projected user distributions are expected to be equal given different items. From the causal inference perspective, this requirement aims to remove the causal relation from the user to the item, which enables us to achieve unbiased recommendation, bypassing the computation of IPS. In order to efficiently balance the user distributions upon each item pair, we propose three strategies, including clipping, sampling and adversarial learning to improve the training process. For more robust optimization, we deploy an explicit model to capture the potential latent confounders in recommendation systems. To the best of our knowledge, this paper is the first work on debiased recommendation based on confounder balancing. In the experiments, we compare our framework with many state-of-the-art methods based on synthetic, semi-synthetic and real-world datasets. Extensive experiments demonstrate that our model is effective in promoting the recommendation performance.

preprint2022arXiv

Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

Monocular image-based 3D perception has become an active research area in recent years owing to its applications in autonomous driving. Approaches to monocular 3D perception including detection and tracking, however, often yield inferior performance when compared to LiDAR-based techniques. Through systematic analysis, we identified that per-object depth estimation accuracy is a major factor bounding the performance. Motivated by this observation, we propose a multi-level fusion method that combines different representations (RGB and pseudo-LiDAR) and temporal information across multiple frames for objects (tracklets) to enhance per-object depth estimation. Our proposed fusion method achieves the state-of-the-art performance of per-object depth estimation on the Waymo Open Dataset, the KITTI detection dataset, and the KITTI MOT dataset. We further demonstrate that by simply replacing estimated depth with fusion-enhanced depth, we can achieve significant improvements in monocular 3D perception tasks, including detection and tracking.

preprint2022arXiv

Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping

With the wide penetration of smart robots in multifarious fields, Simultaneous Localization and Mapping (SLAM) technique in robotics has attracted growing attention in the community. Yet collaborating SLAM over multiple robots still remains challenging due to performance contradiction between the intensive graphics computation of SLAM and the limited computing capability of robots. While traditional solutions resort to the powerful cloud servers acting as an external computation provider, we show by real-world measurements that the significant communication overhead in data offloading prevents its practicability to real deployment. To tackle these challenges, this paper promotes the emerging edge computing paradigm into multi-robot SLAM and proposes RecSLAM, a multi-robot laser SLAM system that focuses on accelerating map construction process under the robot-edge-cloud architecture. In contrast to conventional multi-robot SLAM that generates graphic maps on robots and completely merges them on the cloud, RecSLAM develops a hierarchical map fusion technique that directs robots' raw data to edge servers for real-time fusion and then sends to the cloud for global merging. To optimize the overall pipeline, an efficient multi-robot SLAM collaborative processing framework is introduced to adaptively optimize robot-to-edge offloading tailored to heterogeneous edge resource conditions, meanwhile ensuring the workload balancing among the edge servers. Extensive evaluations show RecSLAM can achieve up to 39% processing latency reduction over the state-of-the-art. Besides, a proof-of-concept prototype is developed and deployed in real scenes to demonstrate its effectiveness.

preprint2022arXiv

Enabling Long-Term Cooperation in Cross-Silo Federated Learning: A Repeated Game Perspective

Cross-silo federated learning (FL) is a distributed learning approach where clients of the same interest train a global model cooperatively while keeping their local data private. The success of a cross-silo FL process requires active participation of many clients. Clients in cross-silo FL aim to optimize their long-term benefits by selfishly choosing their participation levels. While there has been some work on incentivizing clients to join FL, the analysis of clients' long-term selfish participation behaviors in cross-silo FL remains largely unexplored. In this paper, we analyze the selfish participation behaviors of heterogeneous clients in cross-silo FL. Specifically, we model clients' long-term selfish participation behaviors as an infinitely repeated game. For the stage game SPFL, we derive the unique Nash equilibrium (NE), and propose a distributed algorithm for each client to calculate its equilibrium participation strategy. We show that at the NE, clients fall into at most three categories: (i) free riders, (ii) a unique partial contributor (if exists), and (iii) contributors. For the long-term interactions among clients, we derive a cooperative strategy for clients which minimizes the number of free riders while increasing the amount of local data for model training. We show that enforced by a punishment strategy, such a cooperative strategy is a subgame perfect Nash equilibrium (SPNE) of the infinitely repeated game, under which some clients who are free riders at the NE of the stage game choose to be (partial) contributors. We further propose an algorithm to calculate the optimal SPNE which minimizes the number of free riders while maximizing the amount of local data for model training. Simulation results show that our derived optimal SPNE can effectively reduce the number of free riders by up to 99.3% and increase the amount of local data for model training by up to 82.3%.

preprint2022arXiv

Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale Extraction

As an essential operation of legal retrieval, legal case matching plays a central role in intelligent legal systems. This task has a high demand on the explainability of matching results because of its critical impacts on downstream applications -- the matched legal cases may provide supportive evidence for the judgments of target cases and thus influence the fairness and justice of legal decisions. Focusing on this challenging task, we propose a novel and explainable method, namely \textit{IOT-Match}, with the help of computational optimal transport, which formulates the legal case matching problem as an inverse optimal transport (IOT) problem. Different from most existing methods, which merely focus on the sentence-level semantic similarity between legal cases, our IOT-Match learns to extract rationales from paired legal cases based on both semantics and legal characteristics of their sentences. The extracted rationales are further applied to generate faithful explanations and conduct matching. Moreover, the proposed IOT-Match is robust to the alignment label insufficiency issue commonly in practical legal case matching tasks, which is suitable for both supervised and semi-supervised learning paradigms. To demonstrate the superiority of our IOT-Match method and construct a benchmark of explainable legal case matching task, we not only extend the well-known Challenge of AI in Law (CAIL) dataset but also build a new Explainable Legal cAse Matching (ELAM) dataset, which contains lots of legal cases with detailed and explainable annotations. Experiments on these two datasets show that our IOT-Match outperforms state-of-the-art methods consistently on matching prediction, rationale extraction, and explanation generation.

preprint2022arXiv

Exploration of the origin of 2020 X-ray outburst in OJ 287

Research into OJ 287 has been ongoing for many years. In 2020 April-June, this source underwent the second highest X-ray outburst (second only to the 2016-2017 outburst) and the mechanism of this outburst is still under debate. In this paper, we discuss two scenarios to explore the origin of the outburst: an after-effect of a black hole-disc impact and a tidal disruption event (TDE). We present the weak correlations of the spectral index versus X-ray flux and the hardness ratio (HR) versus the soft X-ray flux during the outburst, and these features are different from the case in the quiescent state. The correlations are compared with those of the 2016-2017 outburst with the highest X-ray flux in monitoring history. Analysis of the outbursts in 2016-2017 and 2020 shows that the expected time of the X-ray outburst, based on the theory of the after-effect of the black hole-disc impact and the estimation of available data, is inconsistent with historical observations. The soft X-ray spectra, the barely temporal evolution of colour, and the evolution of the HR mean that the 2020 outburst shares similar features with the 2016-2017 outburst, which was considered as a possible candidate for a TDE. Additionally, we find that the predictions of full TDEs ($t^{-5/3}$) and partial TDEs ($t^{-9/4}$) for the soft X-ray decay light curve are well fitted. Our analysis suggests that the 2020 outburst in OJ 287 is probably related to the TDE candidate.

preprint2022arXiv

FastRE: Towards Fast Relation Extraction with Convolutional Encoder and Improved Cascade Binary Tagging Framework

Recent work for extracting relations from texts has achieved excellent performance. However, most existing methods pay less attention to the efficiency, making it still challenging to quickly extract relations from massive or streaming text data in realistic scenarios. The main efficiency bottleneck is that these methods use a Transformer-based pre-trained language model for encoding, which heavily affects the training speed and inference speed. To address this issue, we propose a fast relation extraction model (FastRE) based on convolutional encoder and improved cascade binary tagging framework. Compared to previous work, FastRE employs several innovations to improve efficiency while also keeping promising performance. Concretely, FastRE adopts a novel convolutional encoder architecture combined with dilated convolution, gated unit and residual connection, which significantly reduces the computation cost of training and inference, while maintaining the satisfactory performance. Moreover, to improve the cascade binary tagging framework, FastRE first introduces a type-relation mapping mechanism to accelerate tagging efficiency and alleviate relation redundancy, and then utilizes a position-dependent adaptive thresholding strategy to obtain higher tagging accuracy and better model generalization. Experimental results demonstrate that FastRE is well balanced between efficiency and performance, and achieves 3-10x training speed, 7-15x inference speed faster, and 1/100 parameters compared to the state-of-the-art models, while the performance is still competitive.

preprint2022arXiv

gDNA: Towards Generative Detailed Neural Avatars

To make 3D human avatars widely available, we must be able to generate a variety of 3D virtual humans with varied identities and shapes in arbitrary poses. This task is challenging due to the diversity of clothed body shapes, their complex articulations, and the resulting rich, yet stochastic geometric detail in clothing. Hence, current methods to represent 3D people do not provide a full generative model of people in clothing. In this paper, we propose a novel method that learns to generate detailed 3D shapes of people in a variety of garments with corresponding skinning weights. Specifically, we devise a multi-subject forward skinning module that is learned from only a few posed, un-rigged scans per subject. To capture the stochastic nature of high-frequency details in garments, we leverage an adversarial loss formulation that encourages the model to capture the underlying statistics. We provide empirical evidence that this leads to realistic generation of local details such as wrinkles. We show that our model is able to generate natural human avatars wearing diverse and detailed clothing. Furthermore, we show that our method can be used on the task of fitting human models to raw scans, outperforming the previous state-of-the-art.

preprint2022arXiv

Generalizable Information Theoretic Causal Representation

It is evidence that representation learning can improve model's performance over multiple downstream tasks in many real-world scenarios, such as image classification and recommender systems. Existing learning approaches rely on establishing the correlation (or its proxy) between features and the downstream task (labels), which typically results in a representation containing cause, effect and spurious correlated variables of the label. Its generalizability may deteriorate because of the unstability of the non-causal parts. In this paper, we propose to learn causal representation from observational data by regularizing the learning procedure with mutual information measures according to our hypothetical causal graph. The optimization involves a counterfactual loss, based on which we deduce a theoretical guarantee that the causality-inspired learning is with reduced sample complexity and better generalization ability. Extensive experiments show that the models trained on causal representations learned by our approach is robust under adversarial attacks and distribution shift.

preprint2022arXiv

GraphAD: A Graph Neural Network for Entity-Wise Multivariate Time-Series Anomaly Detection

In recent years, the emergence and development of third-party platforms have greatly facilitated the growth of the Online to Offline (O2O) business. However, the large amount of transaction data raises new challenges for retailers, especially anomaly detection in operating conditions. Thus, platforms begin to develop intelligent business assistants with embedded anomaly detection methods to reduce the management burden on retailers. Traditional time-series anomaly detection methods capture underlying patterns from the perspectives of time and attributes, ignoring the difference between retailers in this scenario. Besides, similar transaction patterns extracted by the platforms can also provide guidance to individual retailers and enrich their available information without privacy issues. In this paper, we pose an entity-wise multivariate time-series anomaly detection problem that considers the time-series of each unique entity. To address this challenge, we propose GraphAD, a novel multivariate time-series anomaly detection model based on the graph neural network. GraphAD decomposes the Key Performance Indicator (KPI) into stable and volatility components and extracts their patterns in terms of attributes, entities and temporal perspectives via graph neural networks. We also construct a real-world entity-wise multivariate time-series dataset from the business data of Ele.me. The experimental results on this dataset show that GraphAD significantly outperforms existing anomaly detection methods.

preprint2022arXiv

Gumble Softmax For User Behavior Modeling

Recently, sequential recommendation systems are important in solving the information overload in many online services. Current methods in sequential recommendation focus on learning a fixed number of representations for each user at any time, with a single representation or multi representations for the user. However, when a user is exploring items on an e-commerce recommendation system, the number of this user's hobbies may change overtime (e.g. increase/reduce one more interest), affected by the user's evolving self needs. Moreover, different users may have various number of interests. In this paper, we argue that it is meaningful to explore a personalized dynamic number of user interests, and learn a dynamic group of user interest representations accordingly. We propose a sequential model with dynamic number of representations for recommendation systems (RDRSR). Specifically, RDRSR is composed of a dynamic interest discriminator (DID) module and a dynamic interest allocator (DIA) module. The DID module explores the number of a user's interests by learning the overall sequential characteristics with bi-directional self-attention and Gumbel-Softmax. The DIA module make the historical clicked items into a group of item groups and constructs user's dynamic interest representation. Additionally, experiments on the real-world datasets demonstrates our model's effectiveness.

preprint2022arXiv

Knowledge-Guided Learning for Transceiver Design in Over-the-Air Federated Learning

In this paper, we consider communication-efficient over-the-air federated learning (FL), where multiple edge devices with non-independent and identically distributed datasets perform multiple local iterations in each communication round and then concurrently transmit their updated gradients to an edge server over the same radio channel for global model aggregation using over-the-air computation (AirComp). We derive the upper bound of the time-average norm of the gradients to characterize the convergence of AirComp-assisted FL, which reveals the impact of the model aggregation errors accumulated over all communication rounds on convergence. Based on the convergence analysis, we formulate an optimization problem to minimize the upper bound to enhance the learning performance, followed by proposing an alternating optimization algorithm to facilitate the optimal transceiver design for AirComp-assisted FL. As the alternating optimization algorithm suffers from high computation complexity, we further develop a knowledge-guided learning algorithm that exploits the structure of the analytic expression of the optimal transmit power to achieve computation-efficient transceiver design. Simulation results demonstrate that the proposed knowledge-guided learning algorithm achieves a comparable performance as the alternating optimization algorithm, but with a much lower computation complexity. Moreover, both proposed algorithms outperform the baseline methods in terms of convergence speed and test accuracy.

preprint2022arXiv

Learning to Identify Top Elo Ratings: A Dueling Bandits Approach

The Elo rating system is widely adopted to evaluate the skills of (chess) game and sports players. Recently it has been also integrated into machine learning algorithms in evaluating the performance of computerised AI agents. However, an accurate estimation of the Elo rating (for the top players) often requires many rounds of competitions, which can be expensive to carry out. In this paper, to improve the sample efficiency of the Elo evaluation (for top players), we propose an efficient online match scheduling algorithm. Specifically, we identify and match the top players through a dueling bandits framework and tailor the bandit algorithm to the gradient-based update of Elo. We show that it reduces the per-step memory and time complexity to constant, compared to the traditional likelihood maximization approaches requiring $O(t)$ time. Our algorithm has a regret guarantee of $\tilde{O}(\sqrt{T})$, sublinear in the number of competition rounds and has been extended to the multidimensional Elo ratings for handling intransitive games. We empirically demonstrate that our method achieves superior convergence speed and time efficiency on a variety of gaming tasks.

preprint2022arXiv

Measuring "Why" in Recommender Systems: a Comprehensive Survey on the Evaluation of Explainable Recommendation

Explainable recommendation has shown its great advantages for improving recommendation persuasiveness, user satisfaction, system transparency, among others. A fundamental problem of explainable recommendation is how to evaluate the explanations. In the past few years, various evaluation strategies have been proposed. However, they are scattered in different papers, and there lacks a systematic and detailed comparison between them. To bridge this gap, in this paper, we comprehensively review the previous work, and provide different taxonomies for them according to the evaluation perspectives and evaluation methods. Beyond summarizing the previous work, we also analyze the (dis)advantages of existing evaluation methods and provide a series of guidelines on how to select them. The contents of this survey are based on more than 100 papers from top-tier conferences like IJCAI, AAAI, TheWebConf, Recsys, UMAP, and IUI, and their complete summarization are presented at https://shimo.im/sheets/VKrpYTcwVH6KXgdy/MODOC/. With this survey, we finally aim to provide a clear and comprehensive review on the evaluation of explainable recommendation.

preprint2022arXiv

Multi-Agent Reinforcement Learning for Markov Routing Games: A New Modeling Paradigm For Dynamic Traffic Assignment

This paper aims to develop a paradigm that models the learning behavior of intelligent agents (including but not limited to autonomous vehicles, connected and automated vehicles, or human-driven vehicles with intelligent navigation systems where human drivers follow the navigation instructions completely) with a utility-optimizing goal and the system's equilibrating processes in a routing game among atomic selfish agents. Such a paradigm can assist policymakers in devising optimal operational and planning countermeasures under both normal and abnormal circumstances. To this end, we develop a Markov routing game (MRG) in which each agent learns and updates her own en-route path choice policy while interacting with others in transportation networks. To efficiently solve MRG, we formulate it as multi-agent reinforcement learning (MARL) and devise a mean field multi-agent deep Q learning (MF-MA-DQL) approach that captures the competition among agents. The linkage between the classical DUE paradigm and our proposed Markov routing game (MRG) is discussed. We show that the routing behavior of intelligent agents is shown to converge to the classical notion of predictive dynamic user equilibrium (DUE) when traffic environments are simulated using dynamic loading models (DNL). In other words, the MRG depicts DUEs assuming perfect information and deterministic environments propagated by DNL models. Four examples are solved to illustrate the algorithm efficiency and consistency between DUE and the MRG equilibrium, on a simple network without and with spillback, the Ortuzar Willumsen (OW) Network, and a real-world network near Columbia University's campus in Manhattan of New York City.

preprint2022arXiv

Neural Message Passing for Visual Relationship Detection

Visual relationship detection aims to detect the interactions between objects in an image; however, this task suffers from combinatorial explosion due to the variety of objects and interactions. Since the interactions associated with the same object are dependent, we explore the dependency of interactions to reduce the search space. We explicitly model objects and interactions by an interaction graph and then propose a message-passing-style algorithm to propagate the contextual information. We thus call the proposed method neural message passing (NMP). We further integrate language priors and spatial cues to rule out unrealistic interactions and capture spatial interactions. Experimental results on two benchmark datasets demonstrate the superiority of our proposed method. Our code is available at https://github.com/PhyllisH/NMP.

preprint2022arXiv

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence

We present a novel method to learn Personalized Implicit Neural Avatars (PINA) from a short RGB-D sequence. This allows non-expert users to create a detailed and personalized virtual copy of themselves, which can be animated with realistic clothing deformations. PINA does not require complete scans, nor does it require a prior learned from large datasets of clothed humans. Learning a complete avatar in this setting is challenging, since only few depth observations are available, which are noisy and incomplete (i.e. only partial visibility of the body per frame). We propose a method to learn the shape and non-rigid deformations via a pose-conditioned implicit surface and a deformation field, defined in canonical space. This allows us to fuse all partial observations into a single consistent canonical representation. Fusion is formulated as a global optimization problem over the pose, shape and skinning parameters. The method can learn neural avatars from real noisy RGB-D sequences for a diverse set of people and clothing styles and these avatars can be animated given unseen motion sequences.

preprint2022arXiv

RecBole 2.0: Towards a More Up-to-Date Recommendation Library

In order to support the study of recent advances in recommender systems, this paper presents an extended recommendation library consisting of eight packages for up-to-date topics and architectures. First of all, from a data perspective, we consider three important topics related to data issues (i.e., sparsity, bias and distribution shift), and develop five packages accordingly: meta-learning, data augmentation, debiasing, fairness and cross-domain recommendation. Furthermore, from a model perspective, we develop two benchmarking packages for Transformer-based and graph neural network (GNN)-based models, respectively. All the packages (consisting of 65 new models) are developed based on a popular recommendation framework RecBole, ensuring that both the implementation and interface are unified. For each package, we provide complete implementations from data loading, experimental setup, evaluation and algorithm implementation. This library provides a valuable resource to facilitate the up-to-date research in recommender systems. The project is released at the link: https://github.com/RUCAIBox/RecBole2.0.

preprint2022arXiv

Robust PCA Unrolling Network for Super-resolution Vessel Extraction in X-ray Coronary Angiography

Although robust PCA has been increasingly adopted to extract vessels from X-ray coronary angiography (XCA) images, challenging problems such as inefficient vessel-sparsity modelling, noisy and dynamic background artefacts, and high computational cost still remain unsolved. Therefore, we propose a novel robust PCA unrolling network with sparse feature selection for super-resolution XCA vessel imaging. Being embedded within a patch-wise spatiotemporal super-resolution framework that is built upon a pooling layer and a convolutional long short-term memory network, the proposed network can not only gradually prune complex vessel-like artefacts and noisy backgrounds in XCA during network training but also iteratively learn and select the high-level spatiotemporal semantic information of moving contrast agents flowing in the XCA-imaged vessels. The experimental results show that the proposed method significantly outperforms state-of-the-art methods, especially in the imaging of the vessel network and its distal vessels, by restoring the intensity and geometry profiles of heterogeneous vessels against complex and dynamic backgrounds.

preprint2022arXiv

Sequential Recommendation with User Evolving Preference Decomposition

Modeling user sequential behaviors has recently attracted increasing attention in the recommendation domain. Existing methods mostly assume coherent preference in the same sequence. However, user personalities are volatile and easily changed, and there can be multiple mixed preferences underlying user behaviors. To solve this problem, in this paper, we propose a novel sequential recommender model via decomposing and modeling user independent preferences. To achieve this goal, we highlight three practical challenges considering the inconsistent, evolving and uneven nature of the user behavior, which are seldom noticed by the previous work. For overcoming these challenges in a unified framework, we introduce a reinforcement learning module to simulate the evolution of user preference. More specifically, the action aims to allocate each item into a sub-sequence or create a new one according to how the previous items are decomposed as well as the time interval between successive behaviors. The reward is associated with the final loss of the learning objective, aiming to generate sub-sequences which can better fit the training data. We conduct extensive experiments based on six real-world datasets across different domains. Compared with the state-of-the-art methods, empirical studies manifest that our model can on average improve the performance by about 8.21%, 10.08%, 10.32%, and 9.82% on the metrics of Precision, Recall, NDCG and MRR, respectively.

preprint2022arXiv

Three-Dimensional Spectrum Occupancy Measurement using UAV: Performance Analysis and Algorithm Design

Spectrum sharing, as an approach to significantly improve spectrum efficiency in the era of 6th generation mobile networks (6G), has attracted extensive attention. Radio Environment Map (REM) based low-complexity spectrum sharing is widely studied where the spectrum occupancy measurement (SOM) is vital to construct REM. The SOM in three-dimensional (3D) space is becoming increasingly essential to support the spectrum sharing with space-air-ground integrated network being a great momentum of 6G. In this paper, we analyze the performance of 3D SOM to further study the tradeoff between accuracy and efficiency in 3D SOM. We discover that the error of 3D SOM is related with the area of the boundary surfaces of licensed networks, the number of discretized cubes, and the length of the edge of 3D space. Moreover, we design a fast and accurate 3D SOM algorithm that utilizes unmanned aerial vehicle (UAV) to measure the spectrum occupancy considering the path planning of UAV, which improves the measurement efficiency by requiring less measurement time and flight time of the UAV for satisfactory performance. The theoretical results obtained in this paper reveal the essential dependencies that describe the 3D SOM methodology, and the proposed algorithm is beneficial to improve the efficiency of 3D SOM. It is noted that the theoretical results and algorithm in this paper may provide a guideline for more areas such as spectrum monitoring, spectrum measurement, network measurement, planning, etc.

preprint2022arXiv

TIPS: Transaction Inclusion Protocol with Signaling in DAG-based Blockchain

Directed Acyclic Graph (DAG) is a popular approach to achieve scalability of blockchain networks. Due to its high efficiency in data communication and great scalability, DAG has been widely adopted in many applications such as Internet of Things (IoT) and Decentralized Finance (DeFi). DAG-based blockchain, nevertheless, faces the key challenge of transaction inclusion collision due to the high concurrency and the network delay. Particularly, the transaction inclusion collision in DAG-based blockchain leads to the revenue and throughput dilemmas, which would greatly degrade the system performance. In this paper, we propose "TIPS", the Transaction Inclusion Protocol with Signaling, which broadcasts a signal indicating the transactions in the block. We show that with the prompt broadcast of a signal, TIPS substantially reduces the transaction collision and thus resolves these dilemmas. Moreover, we show that TIPS can defend against both the denial-of-service and the delay-of-service attacks. We also conduct intensive experiments to demonstrate the superior performance of the proposed protocol.

preprint2022arXiv

Towards Equivalent Transformation of User Preferences in Cross Domain Recommendation

Cross domain recommendation (CDR) is one popular research topic in recommender systems. This paper focuses on a popular scenario for CDR where different domains share the same set of users but no overlapping items. The majority of recent methods have explored the shared-user representation to transfer knowledge across domains. However, the idea of shared-user representation resorts to learn the overlapped features of user preferences and suppresses the domain-specific features. Other works try to capture the domain-specific features by an MLP mapping but require heuristic human knowledge of choosing samples to train the mapping. In this paper, we attempt to learn both features of user preferences in a more principled way. We assume that each user's preferences in one domain can be expressed by the other one, and these preferences can be mutually converted to each other with the so-called equivalent transformation. Based on this assumption, we propose an equivalent transformation learner (ETL) which models the joint distribution of user behaviors across domains. The equivalent transformation in ETL relaxes the idea of shared-user representation and allows the learned preferences in different domains to preserve the domain-specific features as well as the overlapped features. Extensive experiments on three public benchmarks demonstrate the effectiveness of ETL compared with recent state-of-the-art methods. Codes and data are available online:~\url{https://github.com/xuChenSJTU/ETL-master}

preprint2022arXiv

User behavior understanding in real world settings

How to extract meaningful information in user historical behavior plays a crucial role in recommendation. User behavior sequence often contains multiple conceptually distinct items that belong to different item groups and the number of the item groups is changing over time. It is necessary to learn a dynamic group of representations according the item groups in a user historical behavior. However, current works only learns a predefined and fixed number representations which includes single representation methods and multi representations methods from the user context that could lead to suboptimal recommendation quality. In this paper we propose a model that can automatically and adaptively generates a dynamic group of representations from the user behavior accordingly. To be specific, AutoRep is composed of an informative representation construct (IRC) module and a dynamic representations construct (DRC) module. The IRC module learns the overall sequential characteristics of user behavior with a bi-directional architecture transformer. The DRC module dynamically allocate the item in the user behavior into different item groups and form a dynamic group of representations in a differentiable method. Such design improves the model recommendation performance. We evaluate the proposed model on five benchmark datasets. The results show that AutoRep outperforms representative baselines. Further ablation study has been conducted to deepen our understandings of AutoRep, including the proposed module IRC and DRC.

preprint2022arXiv

Working memory inspired hierarchical video decomposition with transformative representations

Video decomposition is very important to extract moving foreground objects from complex backgrounds in computer vision, machine learning, and medical imaging, e.g., extracting moving contrast-filled vessels from the complex and noisy backgrounds of X-ray coronary angiography (XCA). However, the challenges caused by dynamic backgrounds, overlapping heterogeneous environments and complex noises still exist in video decomposition. To solve these problems, this study is the first to introduce a flexible visual working memory model in video decomposition tasks to provide interpretable and high-performance hierarchical deep architecture, integrating the transformative representations between sensory and control layers from the perspective of visual and cognitive neuroscience. Specifically, robust PCA unrolling networks acting as a structure-regularized sensor layer decompose XCA into sparse/low-rank structured representations to separate moving contrast-filled vessels from noisy and complex backgrounds. Then, patch recurrent convolutional LSTM networks with a backprojection module embody unstructured random representations of the control layer in working memory, recurrently projecting spatiotemporally decomposed nonlocal patches into orthogonal subspaces for heterogeneous vessel retrieval and interference suppression. This video decomposition deep architecture effectively restores the heterogeneous profiles of intensity and the geometries of moving objects against the complex background interferences. Experiments show that the proposed method significantly outperforms state-of-the-art methods in accurate moving contrast-filled vessel extraction with excellent flexibility and computational efficiency.

preprint2022arXiv

Zero Trust Architecture for 6G Security

The upcoming sixth generation (6G) network is envisioned to be more open and heterogeneous than earlier generations. This challenges conventional security architectures, which typically rely on the construction of a security perimeter at network boundaries. In this article, we propose a software-defined zero trust architecture (ZTA) for 6G networks, which is promising for establishing an elastic and scalable security regime. This architecture achieves secure access control through adaptive collaborations among the involved control domains, and can effectively prevent malicious access behaviors such as distributed denial of service (DDoS) attacks, malware spread, and zero-day exploits. We also introduce key design aspects of this architecture and show the simulation results of a case study, which shows the effectiveness and robustness of ZTA for 6G. Furthermore, we discuss open issues to further promote this new architecture.

preprint2021arXiv

Constraining Evolution of Magnetic Field Strength in Dissipation Region of Two BL Lac Objects

With the assumption that the optical variability timescale is dominated by the cooling time of the synchrotron process for BL Lac objects, we estimate time dependent magnetic field strength of the emission region for two BL Lac objects. The average magnetic field strengths are consistent with those estimated from core shift measurement and spectral energy distribution modelling. Variation of magnetic field strength in dissipation region is discovered. Variability of flux and magnetic field strength show no clear correlation, which indicates the variation of magnetic field is not the dominant reason of variability origin. The evolution of magnetic field strength can provide another approach to constrain the energy dissipation mechanism in jet.

preprint2021arXiv

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.

preprint2021arXiv

Discovery of two families of VSb-based compounds with V-kagome lattice

We report the structure and physical properties of two newly-discovered compounds AV8Sb12 and AV6Sb6 (A = Cs, Rb), which have C2 (space group: Cmmm) and C3 (space group: R-3m) symmetry, respectively. The basic V-kagome unit is present in both compounds, but stacking differently. A V2Sb2 layer is sandwiched between two V3Sb5 layers in AV8Sb12, altering the V-kagome lattice and lowering the symmetry of kagome layer from hexagonal to orthorhombic. In AV6Sb6, the building block is a more complex slab made up of two half-V3Sb5 layers that are intercalated by Cs cations along the c-axis. Transport property measurements demonstrate that both compounds are nonmagnetic metals, with carrier concentrations at around 1021cm-3. No superconductivity has been observed in CsV8Sb12 above 0.3 K under in-situ pressure up to 46 GPa. In contrast to CsV3Sb5, theoretical calculations and angle-resolved photoemission spectroscopy (ARPES) reveal a quasi-two-dimensional electronic structure in CsV8Sb12 with C2 symmetry and no van Hove singularities near the Fermi level. Our findings will stimulate more research into V-based kagome quantum materials.

preprint2021arXiv

Discrete Knowledge Graph Embedding based on Discrete Optimization

This paper proposes a discrete knowledge graph (KG) embedding (DKGE) method, which projects KG entities and relations into the Hamming space based on a computationally tractable discrete optimization algorithm, to solve the formidable storage and computation cost challenges in traditional continuous graph embedding methods. The convergence of DKGE can be guaranteed theoretically. Extensive experiments demonstrate that DKGE achieves superior accuracy than classical hashing functions that map the effective continuous embeddings into discrete codes. Besides, DKGE reaches comparable accuracy with much lower computational complexity and storage compared to many continuous graph embedding methods.

preprint2021arXiv

EC-SAGINs: Edge Computing-enhanced Space-Air-Ground Integrated Networks for Internet of Vehicles

Edge computing-enhanced Internet of Vehicles (EC-IoV) enables ubiquitous data processing and content sharing among vehicles and terrestrial edge computing (TEC) infrastructures (e.g., 5G base stations and roadside units) with little or no human intervention, plays a key role in the intelligent transportation systems. However, EC-IoV is heavily dependent on the connections and interactions between vehicles and TEC infrastructures, thus will break down in some remote areas where TEC infrastructures are unavailable (e.g., desert, isolated islands and disaster-stricken areas). Driven by the ubiquitous connections and global-area coverage, space-air-ground integrated networks (SAGINs) efficiently support seamless coverage and efficient resource management, represent the next frontier for edge computing. In light of this, we first review the state-of-the-art edge computing research for SAGINs in this article. After discussing several existing orbital and aerial edge computing architectures, we propose a framework of edge computing-enabled space-air-ground integrated networks (EC-SAGINs) to support various IoV services for the vehicles in remote areas. The main objective of the framework is to minimize the task completion time and satellite resource usage. To this end, a pre-classification scheme is presented to reduce the size of action space, and a deep imitation learning (DIL) driven offloading and caching algorithm is proposed to achieve real-time decision making. Simulation results show the effectiveness of our proposed scheme. At last, we also discuss some technology challenges and future directions.

preprint2021arXiv

Generate Natural Language Explanations for Recommendation

Providing personalized explanations for recommendations can help users to understand the underlying insight of the recommendation results, which is helpful to the effectiveness, transparency, persuasiveness and trustworthiness of recommender systems. Current explainable recommendation models mostly generate textual explanations based on pre-defined sentence templates. However, the expressiveness power of template-based explanation sentences are limited to the pre-defined expressions, and manually defining the expressions require significant human efforts. Motivated by this problem, we propose to generate free-text natural language explanations for personalized recommendation. In particular, we propose a hierarchical sequence-to-sequence model (HSS) for personalized explanation generation. Different from conventional sentence generation in NLP research, a great challenge of explanation generation in e-commerce recommendation is that not all sentences in user reviews are of explanation purpose. To solve the problem, we further propose an auto-denoising mechanism based on topical item feature words for sentence generation. Experiments on various e-commerce product domains show that our approach can not only improve the recommendation accuracy, but also the explanation quality in terms of the offline measures and feature words coverage. This research is one of the initial steps to grant intelligent agents with the ability to explain itself based on natural language sentences.

preprint2021arXiv

Generating Multi-scale Maps from Remote Sensing Images via Series Generative Adversarial Networks

Considering the success of generative adversarial networks (GANs) for image-to-image translation, researchers have attempted to translate remote sensing images (RSIs) to maps (rs2map) through GAN for cartography. However, these studies involved limited scales, which hinders multi-scale map creation. By extending their method, multi-scale RSIs can be trivially translated to multi-scale maps (multi-scale rs2map translation) through scale-wise rs2map models trained for certain scales (parallel strategy). However, this strategy has two theoretical limitations. First, inconsistency between various spatial resolutions of multi-scale RSIs and object generalization on multi-scale maps (RS-m inconsistency) increasingly complicate the extraction of geographical information from RSIs for rs2map models with decreasing scale. Second, as rs2map translation is cross-domain, generators incur high computation costs to transform the RSI pixel distribution to that on maps. Thus, we designed a series strategy of generators for multi-scale rs2map translation to address these limitations. In this strategy, high-resolution RSIs are inputted to an rs2map model to output large-scale maps, which are translated to multi-scale maps through series multi-scale map translation models. The series strategy avoids RS-m inconsistency as inputs are high-resolution large-scale RSIs, and reduces the distribution gap in multi-scale map generation through similar pixel distributions among multi-scale maps. Our experimental results showed better quality multi-scale map generation with the series strategy, as shown by average increases of 11.69%, 53.78%, 55.42%, and 72.34% in the structural similarity index, edge structural similarity index, intersection over union (road), and intersection over union (water) for data from Mexico City and Tokyo at zoom level 17-13.

preprint2021arXiv

Joint Radar and Communication: A Survey

Joint radar and communication (JRC) technology has become important for civil and military applications for decades. This paper introduces the concepts, characteristics and advantages of JRC technology, presenting the typical applications that have benefited from JRC technology currently and in the future. This paper explores the state-of-the-art of JRC in the levels of coexistence, cooperation, co-design and collaboration. Compared to previous surveys, this paper reviews the entire trends that drive the development of radar sensing and wireless communication using JRC. Specifically, we explore an open research issue on radar and communication operating with mutual benefits based on collaboration, which represents the fourth stage of JRC evolution. This paper provides useful perspectives for future researches of JRC technology.

preprint2021arXiv

Learning Post-Hoc Causal Explanations for Recommendation

State-of-the-art recommender systems have the ability to generate high-quality recommendations, but usually cannot provide intuitive explanations to humans due to the usage of black-box prediction models. The lack of transparency has highlighted the critical importance of improving the explainability of recommender systems. In this paper, we propose to extract causal rules from the user interaction history as post-hoc explanations for the black-box sequential recommendation mechanisms, whilst maintain the predictive accuracy of the recommendation model. Our approach firstly achieves counterfactual examples with the aid of a perturbation model, and then extracts personalized causal relationships for the recommendation model through a causal rule mining algorithm. Experiments are conducted on several state-of-the-art sequential recommendation models and real-world datasets to verify the performance of our model on generating causal explanations. Meanwhile, We evaluate the discovered causal explanations in terms of quality and fidelity, which show that compared with conventional association rules, causal rules can provide personalized and more effective explanations for the behavior of black-box recommendation models.

preprint2021arXiv

Visually-aware Recommendation with Aesthetic Features

Visual information plays a critical role in human decision-making process. While recent developments on visually-aware recommender systems have taken the product image into account, none of them has considered the aesthetic aspect. We argue that the aesthetic factor is very important in modeling and predicting users' preferences, especially for some fashion-related domains like clothing and jewelry. This work addresses the need of modeling aesthetic information in visually-aware recommender systems. Technically speaking, we make three key contributions in leveraging deep aesthetic features: (1) To describe the aesthetics of products, we introduce the aesthetic features extracted from product images by a deep aesthetic network. We incorporate these features into recommender system to model users' preferences in the aesthetic aspect. (2) Since in clothing recommendation, time is very important for users to make decision, we design a new tensor decomposition model for implicit feedback data. The aesthetic features are then injected to the basic tensor model to capture the temporal dynamics of aesthetic preferences (e.g., seasonal patterns). (3) We also use the aesthetic features to optimize the learning strategy on implicit feedback data. We enrich the pairwise training samples by considering the similarity among items in the visual space and graph space; the key idea is that a user may likely have similar perception on similar items. We perform extensive experiments on several real-world datasets and demonstrate the usefulness of aesthetic features and the effectiveness of our proposed methods.

preprint2020arXiv

Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

The freshness of status information is of great importance for time-critical Internet of Things (IoT) applications. A metric measuring status freshness is the age-of-information (AoI), which captures the time elapsed from the status being generated at the source node (e.g., a sensor) to the latest status update.However, in intelligent IoT applications such as video surveillance, the status information is revealed after some computation intensive and time-consuming data processing operations, which would affect the status freshness. In this paper, we propose a novel metric, age-of-processing (AoP), to quantify such status freshness, which captures the time elapsed of the newest received processed status data since it is generated. Compared with AoI, AoP further takes the data processing time into account. Since an IoT device has limited computation and energy resource, the device can choose to offload the data processing to the nearby edge server under constrained status sampling frequency.We aim to minimize the average AoP in a long-term process by jointly optimizing the status sampling frequency and processing offloading policy. We formulate this online problem as an infinite-horizon constrained Markov decision process (CMDP) with average reward criterion. We then transform the CMDP problem into an unconstrained Markov decision process (MDP) by leveraging a Lagrangian method, and propose a Lagrangian transformation framework for the original CMDP problem. Furthermore, we integrate the framework with perturbation based refinement for achieving the optimal policy of the CMDP problem. Extensive numerical evaluations show that the proposed algorithm outperforms the benchmarks, with an average AoP reduction up to 30%.

preprint2020arXiv

An Edge Computing-based Photo Crowdsourcing Framework for Real-time 3D Reconstruction

Image-based three-dimensional (3D) reconstruction utilizes a set of photos to build 3D model and can be widely used in many emerging applications such as augmented reality (AR) and disaster recovery. Most of existing 3D reconstruction methods require a mobile user to walk around the target area and reconstruct objectives with a hand-held camera, which is inefficient and time-consuming. To meet the requirements of delay intensive and resource hungry applications in 5G, we propose an edge computing-based photo crowdsourcing (EC-PCS) framework in this paper. The main objective is to collect a set of representative photos from ubiquitous mobile and Internet of Things (IoT) devices at the network edge for real-time 3D model reconstruction, with network resource and monetary cost considerations. Specifically, we first propose a photo pricing mechanism by jointly considering their freshness, resolution and data size. Then, we design a novel photo selection scheme to dynamically select a set of photos with the required target coverage and the minimum monetary cost. We prove the NP-hardness of such problem, and develop an efficient greedy-based approximation algorithm to obtain a near-optimal solution. Moreover, an optimal network resource allocation scheme is presented, in order to minimize the maximum uploading delay of the selected photos to the edge server. Finally, a 3D reconstruction algorithm and a 3D model caching scheme are performed by the edge server in real time. Extensive experimental results based on real-world datasets demonstrate the superior performance of our EC-PCS system over the existing mechanisms.

preprint2020arXiv

AxeChain: A Secure and Decentralized blockchain for solving Easily-Verifiable problems

While Proof-of-Work (PoW) is the most widely used consensus mechanism for blockchain, it received harsh criticism due to its massive waste of energy for meaningless hash calculation. Some studies have introduced Proof-of-Stake to address this issue. However, such protocols widen the gap between rich and poor and in the worst case lead to an oligopoly, where the rich control the entire network. Other studies have attempted to translate the energy consumption of PoW into useful work, but they have many limitations, such as narrow application scope, serious security issues and impractical incentive model. In this paper, we introduce AxeChain, which can use the computing power of blockchain to solve practical problems raised by users without greatly compromising decentralization or security. AxeChain achieves this by coupling hard problem solving with PoW mining. We model the security of AxeChain and derive a balance curve between power utilization and system security. That is, under the reasonable assumption that the attack power does not exceed 1/3 of the total power, 1/2 of total power can be safely used to solve practical problems. We also design a novel incentive model based on the amount of work involved in problem solving, balancing the interests of both the users and miners. Moreover, our experimental results show that AxeChain provides strong security guarantees, no matter what kind of problem is submitted.

preprint2020arXiv

Category Level Object Pose Estimation via Neural Analysis-by-Synthesis

Many object pose estimation algorithms rely on the analysis-by-synthesis framework which requires explicit representations of individual object instances. In this paper we combine a gradient-based fitting procedure with a parametric neural image synthesis module that is capable of implicitly representing the appearance, shape and pose of entire object categories, thus rendering the need for explicit CAD models per object instance unnecessary. The image synthesis network is designed to efficiently span the pose configuration space so that model capacity can be used to capture the shape and local appearance (i.e., texture) variations jointly. At inference time the synthesized images are compared to the target via an appearance based loss and the error signal is backpropagated through the network to the input parameters. Keeping the network parameters fixed, this allows for iterative optimization of the object pose, shape and appearance in a joint manner and we experimentally show that the method can recover orientation of objects with high accuracy from 2D images alone. When provided with depth measurements, to overcome scale ambiguities, the method can accurately recover the full 6DOF pose successfully.

preprint2020arXiv

Collaborative Adversarial Learning for RelationalLearning on Multiple Bipartite Graphs

Relational learning aims to make relation inference by exploiting the correlations among different types of entities. Exploring relational learning on multiple bipartite graphs has been receiving attention because of its popular applications such as recommendations. How to make efficient relation inference with few observed links is the main problem on multiple bipartite graphs. Most existing approaches attempt to solve the sparsity problem via learning shared representations to integrate knowledge from multi-source data for shared entities. However, they merely model the correlations from one aspect (e.g. distribution, representation), and cannot impose sufficient constraints on different relations of the shared entities. One effective way of modeling the multi-domain data is to learn the joint distribution of the shared entities across domains.In this paper, we propose Collaborative Adversarial Learning (CAL) that explicitly models the joint distribution of the shared entities across multiple bipartite graphs. The objective of CAL is formulated from a variational lower bound that maximizes the joint log-likelihoods of the observations. In particular, CAL consists of distribution-level and feature-level alignments for knowledge from multiple bipartite graphs. The two-level alignment acts as two different constraints on different relations of the shared entities and facilitates better knowledge transfer for relational learning on multiple bipartite graphs. Extensive experiments on two real-world datasets have shown that the proposed model outperforms the existing methods.

preprint2020arXiv

Convergence of Edge Computing and Deep Learning: A Comprehensive Survey

Ubiquitous sensors and smart devices from factories and communities are generating massive amounts of data, and ever-increasing computing power is driving the core of computation and services from the cloud to the edge of the network. As an important enabler broadly changing people's lives, from face recognition to ambitious smart factories and cities, developments of artificial intelligence (especially deep learning, DL) based applications and services are thriving. However, due to efficiency and latency issues, the current cloud computing service architecture hinders the vision of "providing artificial intelligence for every person and every organization at everywhere". Thus, unleashing DL services using resources at the network edge near the data sources has emerged as a desirable solution. Therefore, edge intelligence, aiming to facilitate the deployment of DL services by edge computing, has received significant attention. In addition, DL, as the representative technique of artificial intelligence, can be integrated into edge computing frameworks to build intelligent edge for dynamic, adaptive edge maintenance and management. With regard to mutually beneficial edge intelligence and intelligent edge, this paper introduces and discusses: 1) the application scenarios of both; 2) the practical implementation methods and enabling technologies, namely DL training and inference in the customized edge computing framework; 3) challenges and future trends of more pervasive and fine-grained intelligence. We believe that by consolidating information scattered across the communication, networking, and DL areas, this survey can help readers to understand the connections between enabling technologies while promoting further discussions on the fusion of edge intelligence and intelligent edge, i.e., Edge DL.

preprint2020arXiv

Curvature induced polarization and spectral index behavior for PKS 1502+106

A comprehensive study of multifrequency correlations can shed light on the nature of variation for blazars. In this work, we collect the long-term radio, optical and $γ$-ray light curves of PKS 1502+106. After performing the localized cross-correlation function analysis, we find that correlations between radio and $γ$-ray or $V$ band are beyond the $3σ$ significance level. The lag of the $γ$-ray relative to 15 GHz is $-60^{+5}_{-10}$ days, translating to a distance $3.18^{+0.50}_{-0.27}$ parsec (pc) between them. Within uncertainties, the locations of the $γ$-ray and optical emitting regions are roughly the same, and are away from the jet base within $1.2$ pc. The derived magnetic field in optical and $γ$-ray emitting regions is about $0.36$ G. The logarithm of $γ$-ray flux is significantly linearly correlated with that of $V$ band fluxes, which can be explained by the synchrotron self-Compton (SSC) process, the external Compton (EC) processes, or the combination of them. We find a significant linear correlation in the plot of $\log\prod$ (polarization degree) versus $\log νF_ν$ at $V$ band, and use the empirical relation $Π\sim \sin^n θ'$ ($θ'$ is the observing angle in the comoving frame blob) to explain it. The behaviors of color index (generally redder when brighter at the active state) and $γ$-ray spectral index (softer when brighter) could be well explained by the twisted jet model. These findings suggest that the curvature effect (mainly due to the change of the viewing angle) is dominant in the variation phenomena of fluxes, spectral indices, and polarization degrees for PKS 1502+106.

preprint2020arXiv

Decoupled Variational Embedding for Signed Directed Networks

Node representation learning for signed directed networks has received considerable attention in many real-world applications such as link sign prediction, node classification and node recommendation. The challenge lies in how to adequately encode the complex topological information of the networks. Recent studies mainly focus on preserving the first-order network topology which indicates the closeness relationships of nodes. However, these methods generally fail to capture the high-order topology which indicates the local structures of nodes and serves as an essential characteristic of the network topology. In addition, for the first-order topology, the additional value of non-existent links is largely ignored. In this paper, we propose to learn more representative node embeddings by simultaneously capturing the first-order and high-order topology in signed directed networks. In particular, we reformulate the representation learning problem on signed directed networks from a variational auto-encoding perspective and further develop a decoupled variational embedding (DVE) method. DVE leverages a specially designed auto-encoder structure to capture both the first-order and high-order topology of signed directed networks, and thus learns more representative node embedding. Extensive experiments are conducted on three widely used real-world datasets. Comprehensive results on both link sign prediction and node recommendation task demonstrate the effectiveness of DVE. Qualitative results and analysis are also given to provide a better understanding of DVE.

preprint2020arXiv

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

Online social networks (OSNs) are emerging as the most popular mainstream platform for content cascade diffusion. In order to provide satisfactory quality of experience (QoE) for users in OSNs, much research dedicates to proactive content placement by using the propagation pattern, user's personal profiles and social relationships in open social network scenarios (e.g., Twitter and Weibo). In this paper, we take a new direction of popularity-aware content placement in a closed social network (e.g., WeChat Moment) where user's privacy is highly enhanced. We propose a novel data-driven holistic deep learning framework, namely DeepCP, for joint diffusion-aware cascade prediction and autonomous content placement without utilizing users' personal and social information. We first devise a time-window LSTM model for content popularity prediction and cascade geo-distribution estimation. Accordingly, we further propose a novel autonomous content placement mechanism CP-GAN which adopts the generative adversarial network (GAN) for agile placement decision making to reduce the content access latency and enhance users' QoE. We conduct extensive experiments using cascade diffusion traces in WeChat Moment (WM). Evaluation results corroborate that the proposed DeepCP framework can predict the content popularity with a high accuracy, generate efficient placement decision in a real-time manner, and achieve significant content access latency reduction over existing schemes.

preprint2020arXiv

Dual Graph Embedding for Object-Tag LinkPrediction on the Knowledge Graph

Knowledge graphs (KGs) composed of users, objects, and tags are widely used in web applications ranging from E-commerce, social media sites to news portals. This paper concentrates on an attractive application which aims to predict the object-tag links in the KG for better tag recommendation and object explanation. When predicting the object-tag links, both the first-order and high-order proximities between entities in the KG propagate essential similarity information for better prediction. Most existing methods focus on preserving the first-order proximity between entities in the KG. However, they cannot capture the high-order proximities in an explicit way, and the adopted margin-based criterion cannot measure the first-order proximity on the global structure accurately. In this paper, we propose a novel approach named Dual Graph Embedding (DGE) that models both the first-order and high-order proximities in the KG via an auto-encoding architecture to facilitate better object-tag relation inference. Here the dual graphs contain an object graph and a tag graph that explicitly depict the high-order object-object and tag-tag proximities in the KG. The dual graph encoder in DGE then encodes these high-order proximities in the dual graphs into entity embeddings. The decoder formulates a skip-gram objective that maximizes the first-order proximity between observed object-tag pairs over the global proximity structure. With the supervision of the decoder, the embeddings derived by the encoder will be refined to capture both the first-order and high-order proximities in the KG for better link prediction. Extensive experiments on three real-world datasets demonstrate that DGE outperforms the state-of-the-art methods.

preprint2020arXiv

Explainable Recommendation: A Survey and New Perspectives

Explainable recommendation attempts to develop models that generate not only high-quality recommendations but also intuitive explanations. The explanations may either be post-hoc or directly come from an explainable model (also called interpretable or transparent model in some contexts). Explainable recommendation tries to address the problem of why: by providing explanations to users or system designers, it helps humans to understand why certain items are recommended by the algorithm, where the human can either be users or system designers. Explainable recommendation helps to improve the transparency, persuasiveness, effectiveness, trustworthiness, and satisfaction of recommendation systems. It also facilitates system designers for better system debugging. In recent years, a large number of explainable recommendation approaches -- especially model-based methods -- have been proposed and applied in real-world systems. In this survey, we provide a comprehensive review for the explainable recommendation research. We first highlight the position of explainable recommendation in recommender system research by categorizing recommendation problems into the 5W, i.e., what, when, who, where, and why. We then conduct a comprehensive survey of explainable recommendation on three perspectives: 1) We provide a chronological research timeline of explainable recommendation. 2) We provide a two-dimensional taxonomy to classify existing explainable recommendation research. 3) We summarize how explainable recommendation applies to different recommendation tasks. We also devote a chapter to discuss the explanation perspectives in broader IR and AI/ML research. We end the survey by discussing potential future directions to promote the explainable recommendation research area and beyond.

preprint2020arXiv

HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning

Federated Learning (FL) has been proposed as an appealing approach to handle data privacy issue of mobile devices compared to conventional machine learning at the remote cloud with raw user data uploading. By leveraging edge servers as intermediaries to perform partial model aggregation in proximity and relieve core network transmission overhead, it enables great potentials in low-latency and energy-efficient FL. Hence we introduce a novel Hierarchical Federated Edge Learning (HFEL) framework in which model aggregation is partially migrated to edge servers from the cloud. We further formulate a joint computation and communication resource allocation and edge association problem for device users under HFEL framework to achieve global cost minimization. To solve the problem, we propose an efficient resource scheduling algorithm in the HFEL framework. It can be decomposed into two subproblems: \emph{resource allocation} given a scheduled set of devices for each edge server and \emph{edge association} of device users across all the edge servers. With the optimal policy of the convex resource allocation subproblem for a set of devices under a single edge server, an efficient edge association strategy can be achieved through iterative global cost reduction adjustment process, which is shown to converge to a stable system point. Extensive performance evaluations demonstrate that our HFEL framework outperforms the proposed benchmarks in global cost saving and achieves better training performance compared to conventional federated learning.

preprint2020arXiv

HierTrain: Fast Hierarchical Edge AI Learning with Hybrid Parallelism in Mobile-Edge-Cloud Computing

Nowadays, deep neural networks (DNNs) are the core enablers for many emerging edge AI applications. Conventional approaches to training DNNs are generally implemented at central servers or cloud centers for centralized learning, which is typically time-consuming and resource-demanding due to the transmission of a large amount of data samples from the device to the remote cloud. To overcome these disadvantages, we consider accelerating the learning process of DNNs on the Mobile-Edge-Cloud Computing (MECC) paradigm. In this paper, we propose HierTrain, a hierarchical edge AI learning framework, which efficiently deploys the DNN training task over the hierarchical MECC architecture. We develop a novel \textit{hybrid parallelism} method, which is the key to HierTrain, to adaptively assign the DNN model layers and the data samples across the three levels of edge device, edge server and cloud center. We then formulate the problem of scheduling the DNN training tasks at both layer-granularity and sample-granularity. Solving this optimization problem enables us to achieve the minimum training time. We further implement a hardware prototype consisting of an edge device, an edge server and a cloud server, and conduct extensive experiments on it. Experimental results demonstrate that HierTrain can achieve up to 6.9x speedup compared to the cloud-based hierarchical training approach.

preprint2020arXiv

Human Body Model Fitting by Learned Gradient Descent

We propose a novel algorithm for the fitting of 3D human shape to images. Combining the accuracy and refinement capabilities of iterative gradient-based optimization techniques with the robustness of deep neural networks, we propose a gradient descent algorithm that leverages a neural network to predict the parameter update rule for each iteration. This per-parameter and state-aware update guides the optimizer towards a good solution in very few steps, converging in typically few steps. During training our approach only requires MoCap data of human poses, parametrized via SMPL. From this data the network learns a subspace of valid poses and shapes in which optimization is performed much more efficiently. The approach does not require any hard to acquire image-to-3D correspondences. At test time we only optimize the 2D joint re-projection error without the need for any further priors or regularization terms. We show empirically that this algorithm is fast (avg. 120ms convergence), robust to initialization and dataset, and achieves state-of-the-art results on public evaluation datasets including the challenging 3DPW in-the-wild benchmark (improvement over SMPLify 45%) and also approaches using image-to-3D correspondences

preprint2020arXiv

Joint Multi-User DNN Partitioning and Computational Resource Allocation for Collaborative Edge Intelligence

Mobile Edge Computing (MEC) has emerged as a promising supporting architecture providing a variety of resources to the network edge, thus acting as an enabler for edge intelligence services empowering massive mobile and Internet of Things (IoT) devices with AI capability. With the assistance of edge servers, user equipments (UEs) are able to run deep neural network (DNN) based AI applications, which are generally resource-hungry and compute-intensive, such that an individual UE can hardly afford by itself in real time. However the resources in each individual edge server are typically limited. Therefore, any resource optimization involving edge servers is by nature a resource-constrained optimization problem and needs to be tackled in such realistic context. Motivated by this observation, we investigate the optimization problem of DNN partitioning (an emerging DNN offloading scheme) in a realistic multi-user resource-constrained condition that rarely considered in previous works. Despite the extremely large solution space, we reveal several properties of this specific optimization problem of joint multi-UE DNN partitioning and computational resource allocation. We propose an algorithm called Iterative Alternating Optimization (IAO) that can achieve the optimal solution in polynomial time. In addition, we present rigorous theoretic analysis of our algorithm in terms of time complexity and performance under realistic estimation error. Moreover, we build a prototype that implements our framework and conduct extensive experiments using realistic DNN models, whose results demonstrate its effectiveness and efficiency.

preprint2020arXiv

Knowledge Distillation for Mobile Edge Computation Offloading

Edge computation offloading allows mobile end devices to put execution of compute-intensive task on the edge servers. End devices can decide whether offload the tasks to edge servers, cloud servers or execute locally according to current network condition and devices' profile in an online manner. In this article, we propose an edge computation offloading framework based on Deep Imitation Learning (DIL) and Knowledge Distillation (KD), which assists end devices to quickly make fine-grained decisions to optimize the delay of computation tasks online. We formalize computation offloading problem into a multi-label classification problem. Training samples for our DIL model are generated in an offline manner. After model is trained, we leverage knowledge distillation to obtain a lightweight DIL model, by which we further reduce the model's inference delay. Numerical experiment shows that the offloading decisions made by our model outperforms those made by other related policies in latency metric. Also, our model has the shortest inference delay among all policies.

preprint2020arXiv

Leveraging the Power of Prediction: Predictive Service Placement for Latency-Sensitive Mobile Edge Computing

Mobile edge computing (MEC) is emerging to support delay-sensitive 5G applications at the edge of mobile networks. When a user moves erratically among multiple MEC nodes, the challenge of how to dynamically migrate its service to maintain service performance (i.e., user-perceived latency) arises. However, frequent service migration can significantly increase operational cost, incurring the conflict between improving performance and reducing cost. To address these mis-aligned objectives, this paper studies the performance optimization of mobile edge service placement under the constraint of long-term cost budget. It is challenging because the budget involves the future uncertain information (e.g., user mobility). To overcome this difficulty, we devote to leveraging the power of prediction and advocate predictive service placement with predicted near-future information. By using two-timescale Lyapunov optimization method, we propose a T-slot predictive service placement (PSP) algorithm to incorporate the prediction of user mobility based on a frame-based design. We characterize the performance bounds of PSP in terms of cost-delay trade-off theoretically. Furthermore, we propose a new weight adjustment scheme for the queue in each frame named PSP-WU to exploit the historical queue information, which greatly reduces the length of queue while improving the quality of user-perceived latency. Rigorous theoretical analysis and extensive evaluations using realistic data traces demonstrate the superior performance of the proposed predictive schemes.

preprint2020arXiv

Liability Design for Autonomous Vehicles and Human-Driven Vehicles: A Hierarchical Game-Theoretic Approach

Autonomous vehicles (AVs) are inevitably entering our lives with potential benefits for improved traffic safety, mobility, and accessibility. However, AVs' benefits also introduce a serious potential challenge, in the form of complex interactions with human-driven vehicles (HVs). The emergence of AVs introduces uncertainty in the behavior of human actors and in the impact of the AV manufacturer on autonomous driving design. This paper thus aims to investigate how AVs affect road safety and to design socially optimal liability rules for AVs and human drivers. A unified game is developed, including a Nash game between human drivers, a Stackelberg game between the AV manufacturer and HVs, and a Stackelberg game between the law maker and other users. We also establish the existence and uniqueness of the equilibrium of the game. The game is then simulated with numerical examples to investigate the emergence of human drivers' moral hazard, the AV manufacturer's role in traffic safety, and the law maker's role in liability design. Our findings demonstrate that human drivers could develop moral hazard if they perceive their road environment has become safer and an optimal liability rule design is crucial to improve social welfare with advanced transportation technologies. More generally, the game-theoretic model developed in this paper provides an analytical tool to assist policy-makers in AV policymaking and hopefully mitigate uncertainty in the existing regulation landscape about AV technologies.

preprint2020arXiv

Personalized Federated Learning for Intelligent IoT Applications: A Cloud-Edge based Framework

Internet of Things (IoT) have widely penetrated in different aspects of modern life and many intelligent IoT services and applications are emerging. Recently, federated learning is proposed to train a globally shared model by exploiting a massive amount of user-generated data samples on IoT devices while preventing data leakage. However, the device, statistical and model heterogeneities inherent in the complex IoT environments pose great challenges to traditional federated learning, making it unsuitable to be directly deployed. In this article we advocate a personalized federated learning framework in a cloud-edge architecture for intelligent IoT applications. To cope with the heterogeneity issues in IoT environments, we investigate emerging personalized federated learning methods which are able to mitigate the negative effects caused by heterogeneity in different aspects. With the power of edge computing, the requirements for fast-processing capacity and low latency in intelligent IoT applications can also be achieved. We finally provide a case study of IoT based human activity recognition to demonstrate the effectiveness of personalized federated learning for intelligent IoT applications.

preprint2020arXiv

The first light curve modeling and orbital period change investigation of nine contact binaries around the short period cut-off

In this paper, we present the first light curve synthesis and orbital period change analysis of nine contact binaries around the short period limit. It is found that all these systems are W-subtype contact binaries. One of them is a medium contact system while the others are shallow contact ones. Four of them manifest obvious O'Connell effect explained by a dark spot or hot spot on one of the component stars. Third light was detected in three systems. By investigating orbital period variations, we found that four of the targets display a secular period decrease while the others exhibit a long-term period increase. The secular period decrease is more likely caused by angular momentum loss while the long-term period increase is due to mass transfer from the less massive component to the more massive one. Based on the statistic of 19 ultrashort period contact binaries with known orbital period changes, we found that seven of them display long-term decrease (three of them also exhibit cyclic variations), ten of them manifest long-term increase while two of them only show cyclic variation and that most of them are shallow contact binaries supporting the long timescale angular momentum loss theory suggested by Stepien. For the three deep contact systems, we found that they are probably triple systems. The tertiary companion plays an essential role during their formation and evolution.

preprint2020arXiv

When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network

Ultra-dense edge computing (UDEC) has great potential, especially in the 5G era, but it still faces challenges in its current solutions, such as the lack of: i) efficient utilization of multiple 5G resources (e.g., computation, communication, storage and service resources); ii) low overhead offloading decision making and resource allocation strategies; and iii) privacy and security protection schemes. Thus, we first propose an intelligent ultra-dense edge computing (I-UDEC) framework, which integrates blockchain and Artificial Intelligence (AI) into 5G ultra-dense edge computing networks. First, we show the architecture of the framework. Then, in order to achieve real-time and low overhead computation offloading decisions and resource allocation strategies, we design a novel two-timescale deep reinforcement learning (\textit{2Ts-DRL}) approach, consisting of a fast-timescale and a slow-timescale learning process, respectively. The primary objective is to minimize the total offloading delay and network resource usage by jointly optimizing computation offloading, resource allocation and service caching placement. We also leverage federated learning (FL) to train the \textit{2Ts-DRL} model in a distributed manner, aiming to protect the edge devices' data privacy. Simulation results corroborate the effectiveness of both the \textit{2Ts-DRL} and FL in the I-UDEC framework and prove that our proposed algorithm can reduce task execution time up to 31.87%.

preprint2019arXiv

Locations of optical and $γ$-ray emitting regions in the jet of PMN J2345-1555

We collect long term $γ$-ray, optical and radio $15$ GHz light curves of quasar object PMN J2345-1555. The correlation analyses between them are performed via the local cross-correlation function (LCCF). We found that all the optical $V$, $R$ band and the infrared $J$ band are correlated with the radio 15 GHz at beyond $3σ$ significance level, and the lag times are $-221.81^{+6.26}_{-6.72}$, $-201.38^{+6.42}_{-6.02}$ and $-192.27^{+8.26}_{-7.37}$ days, respectively. The $γ$-ray is strongly correlated with optical, but weakly correlated with the radio. We present that time lags between different frequencies can be used as an alternative parameter to derive the core-shift measurement. For this target, the magnetic field and particle density at 1 parsec in jet are derived to be $0.61$ Gauss and $1533/γ_{\rm min}$ cm$^{-3}$, respectively. The black hole mass and the 15 GHz core position in jet are estimated to be $10^{8.44} {\rm M}_{\odot}$ and $30$ parsec, respectively. The lag times enable us to derive that the optical and the $γ$-ray emitting regions coincide, which are located at $4.26^{+0.83}_{-0.79}$ pc away from 15 GHz core position in jet and beyond the broad line region (BLR). We found that a $3σ$ correlation between the color index and the radio light curve, which indicates that opacity may play an important role in the variation. The $δV-δR$ behaviors are complex, while the $R-J$ shows a bluer when brighter trend. As hinted from radio images, we proposed a positional dependent spectral index model to explain the color index behaviors, which is complementary for the shock in jet model. The curvature effects and contribution from accretion disk may also affect variables of blazars in many aspects.

preprint2016arXiv

A Generalized LDPC Framework for Robust and Sublinear Compressive Sensing

Compressive sensing aims to recover a high-dimensional sparse signal from a relatively small number of measurements. In this paper, a novel design of the measurement matrix is proposed. The design is inspired by the construction of generalized low-density parity-check codes, where the capacity-achieving point-to-point codes serve as subcodes to robustly estimate the signal support. In the case that each entry of the $n$-dimensional $k$-sparse signal lies in a known discrete alphabet, the proposed scheme requires only $O(k \log n)$ measurements and arithmetic operations. In the case of arbitrary, possibly continuous alphabet, an error propagation graph is proposed to characterize the residual estimation error. With $O(k \log^2 n)$ measurements and computational complexity, the reconstruction error can be made arbitrarily small with high probability.

preprint2016arXiv

Amazon in the White Space: Social Recommendation Aided Distributed Spectrum Access

Distributed spectrum access (DSA) is challenging since an individual secondary user often has limited sensing capabilities only. One key insight is that channel recommendation among secondary users can help to take advantage of the inherent correlation structure of spectrum availability in both time and space, and enable users to obtain more informed spectrum opportunities. With this insight, we advocate to leverage the wisdom of crowds, and devise social recommendation aided DSA mechanisms to orient secondary users to make more intelligent spectrum access decisions, for both strong and weak network information cases. We start with the strong network information case where secondary users have the statistical information. To mitigate the difficulty due to the curse of dimensionality in the stochastic game approach, we take the one-step Nash approach and cast the social recommendation aided DSA decision making problem at each time slot as a strategic game. We show that it is a potential game, and then devise an algorithm to achieve the Nash equilibrium by exploiting its finite improvement property. For the weak information case where secondary users do not have the statistical information, we develop a distributed reinforcement learning mechanism for social recommendation aided DSA based on the local observations of secondary users only. Appealing to the maximum-norm contraction mapping, we also derive the conditions under which the distributed mechanism converges and characterize the equilibrium therein. Numerical results reveal that the proposed social recommendation aided DSA mechanisms can achieve superior performance using real social data traces and its performance loss in the weak network information case is insignificant, compared with the strong network information case.

preprint2016arXiv

Astronomical Observing Conditions at Xinglong Observatory from 2007 to 2014

Xinglong Observatory of the National Astronomical Observatories, Chinese Academy of Sciences (NAOC), is one of the major optical observatories in China, which hosts nine optical telescopes including the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) and the 2.16 m reflector. Scientific research from these telescopes is focused on stars, galaxies, and exoplanets using multicolor photometry and spectroscopic observations. Therefore, it is important to provide the observing conditions of the site, in detail, to the astronomers for an efficient use of these facilities. In this article, we present the characterization of observing conditions at Xinglong Observatory based on the monitoring of meteorology, seeing and sky brightness during the period from 2007 to 2014. Results suggest that Xinglong Observatory is still a good site for astronomical observations. Our analysis of the observing conditions at Xinglong Observatory can be used as a reference to the observers on targets selection, observing strategy, and telescope operation.

preprint2016arXiv

Bipartite quantum coherence in noninertial frames

Quantum coherence as the fundamental characteristic of quantum physics, provides the valuable resource for quantum computation in exceeding the power of classical algorithms. The exploration of quantum coherence in relativistic systems is of significance from both the fundamental points of view and practical applications. We investigate the quantum coherence of two free modes of scalar and Dirac fields as detected by two relatively accelerated observers by resorting to the relative entropy of coherence. We show that the relative entropy of coherence monotonically decreases when acceleration goes up, as a consequence of the Unruh effect. Specifically, the initial states with parameters $α=b$ and $α=\sqrt{1-b^2}$ have the same initial relative entropy coherence at $a=0$ (with $a$ the acceleration), but degrade along two different trajectories. The relative entropy of coherence reaches vanishing value in the scalar field in the infinite acceleration limit, but non-vanishing value in the Dirac field. This suggests that in the Dirac field, the bipartite state possesses quantum coherence to some extent with the variation of the relative acceleration, and may lead to potential applications in quantum computation performed by observers in motion relatively.

preprint2016arXiv

Content Retrieval At the Edge: A Social-aware and Named Data Cooperative Framework

Recent years with the popularity of mobile devices have witnessed an explosive growth of mobile multimedia contents which dominate more than 50\% of mobile data traffic. This significant growth poses a severe challenge for future cellular networks. As a promising approach to overcome the challenge, we advocate Content Retrieval At the Edge, a content-centric cooperative service paradigm via device-to-device (D2D) communications to reduce cellular traffic volume in mobile networks. By leveraging the Named Data Networking (NDN) principle, we propose sNDN, a social-aware named data framework to achieve efficient cooperative content retrieval. Specifically, sNDN introduces Friendship Circle by grouping a user with her close friends of both high mobility similarity and high content similarity. We construct NDN routing tables conditioned on Friendship Circle encounter frequency to navigate a content request and a content reply packet between Friendship Circles, and leverage social properties in Friendship Circle to search for the final target as inner-Friendship Circle routing. The evaluation results demonstrate that sNDN can save cellular capacity greatly and outperform other content retrieval schemes significantly.

preprint2016arXiv

Content-Centric and Software-Defined Networking with Big Data

Many communities have researched the application of novel network architectures such as Content-Centric Networking (CCN) and Software-Defined Networking (SDN) to build the future Internet. Another emerging technology which is big data analysis has also won lots of attentions from academia to industry. Many splendid researches have been done on CCN, SDN, and big data, which all have addressed separately in the traditional literature. In this paper, we propose a novel network paradigm to jointly consider CCN, SDN, and big data, and provide the architecture internal data flow, big data processing and use cases which indicate the benefits and applicability. Simulation results are exhibited to show the potential benefits relating to the proposed network paradigm. We refer to this novel paradigm as Data-Driven Networking (DDN).

preprint2016arXiv

Exploiting Social Tie Structure for Cooperative Wireless Networking: A Social Group Utility Maximization Framework

In this paper, we develop a social group utility maximization (SGUM) framework for cooperative wireless networking that takes into account both social relationships and physical coupling among users. We show that this framework provides rich modeling flexibility and spans the continuum between non-cooperative game and network utility maximization (NUM) -- two traditionally disjoint paradigms for network optimization. Based on this framework, we study three important applications of SGUM, in database assisted spectrum access, power control, and random access control, respectively. For the case of database assisted spectrum access, we show that the SGUM game is a potential game and always admits a socially-aware Nash equilibrium (SNE). We develop a randomized distributed spectrum access algorithm that can asymptotically converge to the optimal SNE, derive upper bounds on the convergence time, and also quantify the trade-off between the performance and convergence time of the algorithm. We further show that the performance gap of SNE by the algorithm from the NUM solution decreases as the strength of social ties among users increases and the performance gap is zero when the strengths of social ties among users reach the maximum values. For the cases of power control and random access control, we show that there exists a unique SNE. Furthermore, as the strength of social ties increases from the minimum to the maximum, a player's SNE strategy migrates from the Nash equilibrium strategy in a standard non-cooperative game to the socially-optimal strategy in network utility maximization. Furthermore, we show that the SGUM framework can be generalized to take into account both positive and negative social ties among users and can be a useful tool for studying network security problems.

preprint2016arXiv

Feedback-Controlled Sequential Lasso Screening

One way to solve lasso problems when the dictionary does not fit into available memory is to first screen the dictionary to remove unneeded features. Prior research has shown that sequential screening methods offer the greatest promise in this endeavor. Most existing work on sequential screening targets the context of tuning parameter selection, where one screens and solves a sequence of $N$ lasso problems with a fixed grid of geometrically spaced regularization parameters. In contrast, we focus on the scenario where a target regularization parameter has already been chosen via cross-validated model selection, and we then need to solve many lasso instances using this fixed value. In this context, we propose and explore a feedback controlled sequential screening scheme. Feedback is used at each iteration to select the next problem to be solved. This allows the sequence of problems to be adapted to the instance presented and the number of intermediate problems to be automatically selected. We demonstrate our feedback scheme using several datasets including a dictionary of approximate size 100,000 by 300,000.

preprint2016arXiv

Paired-move multiple-try stochastic search for Bayesian variable selection

Variable selection is a key issue when analyzing high-dimensional data. The explosion of data with large sample sizes and dimensionality brings new challenges to this problem in both inference accuracy and computational complexity. To alleviate these problems, we propose a new scalable Markov chain Monte Carlo (MCMC) sampling algorithm for "large $p$ small $n$" scenarios by generalizing multiple-try Metropolis to discrete model spaces and further incorporating neighborhood-based stochastic search. The proof of reversibility of the proposed MCMC algorithm is provided. Extensive simulation studies are performed to examine the efficiency of the new algorithm compared with existing methods. A real data example is provided to illustrate the prediction performances of the new algorithm.

preprint2016arXiv

Performance of new 8-inch photomultiplier tube used for the Tibet muon-detector array

A new hybrid experiment has been constructed to measure the chemical composition of cosmic rays around the "knee" in the wide energy range by the Tibet AS$γ$ collaboration at Tibet, China, since 2014. They consist of a high-energy air-shower-core array (YAC-II), a high-density air-shower array (Tibet-III) and a large underground water-Cherenkov muon-detector array (MD). In order to obtain the primary proton, helium and iron spectra and their "knee" positions in the energy range lower than $10^{16}$ eV, each of PMTs equipped to the MD cell is required to measure the number of photons capable of covering a wide dynamic range of 100 - $10^{6}$ photoelectrons (PEs) according to Monte Carlo simulations. In this paper, we firstly compare the characteristic features between R5912-PMT made by Japan Hamamatsu and CR365-PMT made by Beijing Hamamatsu. This is the first comparison between R5912-PMT and CR365-PMT. If there exists no serious difference, we will then add two 8-inch-in-diameter PMTs to meet our requirements in each MD cell, which are responsible for the range of 100 - 10000 PEs and 2000 - 1000000 PEs, respectively. That is, MD cell is expected to be able to measure the number of muons over 6 orders of magnitude.

preprint2016arXiv

Sensitivity of YAC to measure the light-component spectrum of primary cosmic rays at the "knee" energies

A new air-shower core-detector array (YAC : Yangbajing Air-shower Core-detector array) has been developed to measure the primary cosmic-ray composition at the "knee" energies in Tibet, China, focusing mainly on the light components. The prototype experiment (YAC-I) consisting of 16 detectors has been constructed and operated at Yangbajing (4300 m a.s.l.) in Tibet since May 2009. YAC-I is installed in the Tibet-III AS array and operates together. In this paper, we performed a Monte Carlo simulation to check the sensitivity of YAC-I+Tibet-III array to the cosmic-ray light component of cosmic rays around the knee energies, taking account of the observation conditions of actual YAC-I+Tibet-III array. The selection of light component from others was made by use of an artificial neural network (ANN). The simulation shows that the light-component spectrum estimated by our methods can well reproduce the input ones within 10\% error, and there will be about 30\% systematic errors mostly induced by the primary and interaction models used. It is found that the full-scale YAC and the Tibet-III array is powerful to study the cosmic-ray composition, in particular, to obtain the energy spectra of protons and helium nuclei around the knee energies.

preprint2016arXiv

Sparse Channel Estimation for Massive MIMO with 1-bit Feedback per Dimension

In massive multiple-input multiple-output (MIMO) systems, acquisition of the channel state information at the transmitter side (CSIT) is crucial. In this paper, a practical CSIT estimation scheme is proposed for frequency division duplexing (FDD) massive MIMO systems. Specifically, each received pilot symbol is first quantized to one bit per dimension at the receiver side and then the quantized bits are fed back to the transmitter. A joint one-bit compressed sensing algorithm is implemented at the transmitter to recover the channel matrices. The algorithm leverages the hidden joint sparsity structure in the user channel matrices to minimize the training and feedback overhead, which is considered to be a major challenge for FDD systems. Moreover, the one-bit compressed sensing algorithm accurately recovers the channel directions for beamforming. The one-bit feedback mechanism can be implemented in practical systems using the uplink control channel. Simulation results show that the proposed scheme nearly achieves the maximum output signal-to-noise-ratio for beamforming based on the estimated CSIT.

preprint2016arXiv

The active W UMa type binary star V781 Tau revisited

In this paper, new determined BVR_cI_c light curves and radial velocities of V781 Tau are presented. By analyzing the light curves and radial velocities simultaneously, we found that V781 Tau is a W-subtype medium contact binary star with a mass ratio of q=2.207+-0.005 and a contact degree of f=21.6(+-1.0)%. The difference between the two light maxima was explained by a dark spot on the less massive primary component. The orbital period change of V781 Tau was also investigated. A secular decrease at a rate of $-6.01(+-2.28)*10^{-8} d/yr and a cyclic modulation with a period of 44.8+-5.7 yr and an amplitude of 0.0064+-0.0011 day were discovered. The continuous period decrease may be caused by angular momentum loss due to magnetic stellar wind. Applegate mechanism failed to explain the cyclic modulation. It is highly possible that the cyclic oscillation is the result of the light travel time effect by a third companion.

preprint2016arXiv

The Cartan Model for Equivariant Cohomology

In this article, we will discuss a new operator $d_{C}$ on $W(\mathfrak{g})\otimesΩ^{*}(M)$ and to construct a new Cartan model for equivariant cohomology. We use the new Cartan model to construct the corresponding BRST model and Weil model, and discuss the relations between them.

preprint2015arXiv

Connecting quantum contextuality and genuine multipartite nonlocality with the quantumness witness

The Clauser-Horne-Shimony-Holt-type noncontextuality inequality and the Svetlichny inequality are derived from the Alicki-Van Ryn quantumness witness. Thus a connection between quantumness and quantum contextuality, and that between quantumness and genuine multipartite nonlocality, are established.

preprint2015arXiv

Decentralized Computation Offloading Game For Mobile Cloud Computing

Mobile cloud computing is envisioned as a promising approach to augment computation capabilities of mobile devices for emerging resource-hungry mobile applications. In this paper, we propose a game theoretic approach for achieving efficient computation offloading for mobile cloud computing. We formulate the decentralized computation offloading decision making problem among mobile device users as a decentralized computation offloading game. We analyze the structural property of the game and show that the game always admits a Nash equilibrium. We then design a decentralized computation offloading mechanism that can achieve a Nash equilibrium of the game and quantify its efficiency ratio over the centralized optimal solution. Numerical results demonstrate that the proposed mechanism can achieve efficient computation offloading performance and scale well as the system size increases.

preprint2015arXiv

Deep Haar Scattering Networks

An orthogonal Haar scattering transform is a deep network, computed with a hierarchy of additions, subtractions and absolute values, over pairs of coefficients. It provides a simple mathematical model for unsupervised deep network learning. It implements non-linear contractions, which are optimized for classification, with an unsupervised pair matching algorithm, of polynomial complexity. A structured Haar scattering over graph data computes permutation invariant representations of groups of connected points in the graph. If the graph connectivity is unknown, unsupervised Haar pair learning can provide a consistent estimation of connected dyadic groups of points. Classification results are given on image data bases, defined on regular grids or graphs, with a connectivity which may be known or unknown.

preprint2015arXiv

Design of a 325MHz Half Wave Resonator prototype at IHEP

A 325MHz beta=0.14 superconducting half wave resonator(HWR) prototype has been developed at the Institute of High Energy Physics(IHEP), which can be applied in continuous wave (CW) high beam proton accelerators. In this paper, the electromagnetic (EM) design, multipacting simulation, mechanical optimization, and fabrication are introduced in details. In vertical test at 4.2K, the cavity reached Eacc=7MV/m with Q0=1.4*10^9 and Eacc=15.9MV/m with Q0=4.3*10^8.

preprint2015arXiv

Development of Yangbajing Air shower Core detector array for a new EAS hybrid Experiment

Aiming at the observation of cosmic-ray chemical composition at the "knee" energy region, we have been developinga new type air-shower core detector (YAC, Yangbajing Air shower Core detector array) to be set up at Yangbajing (90.522$^\circ$ E, 30.102$^\circ$ N, 4300 m above sea level, atmospheric depth: 606 g/m$^2$) in Tibet, China. YAC works together with the Tibet air-shower array (Tibet-III) and an underground water cherenkov muon detector array (MD) as a hybrid experiment. Each YAC detector unit consists of lead plates of 3.5 cm thick and a scintillation counter which detects the burst size induced by high energy particles in the air-shower cores. The burst size can be measured from 1 MIP (Minimum Ionization Particle) to $10^{6}$ MIPs. The first phase of this experiment, named "YAC-I", consists of 16 YAC detectors each having the size 40 cm $\times$ 50 cm and distributing in a grid with an effective area of 10 m$^{2}$. YAC-I is used to check hadronic interaction models. The second phase of the experiment, called "YAC-II", consists of 124 YAC detectors with coverage about 500 m$^2$. The inner 100 detectors of 80 cm $\times $ 50 cm each are deployed in a 10 $\times$ 10 matrix from with a 1.9 m separation and the outer 24 detectors of 100 cm $\times$ 50 cm each are distributed around them to reject non-core events whose shower cores are far from the YAC-II array. YAC-II is used to study the primary cosmic-ray composition, in particular, to obtain the energy spectra of proton, helium and iron nuclei between 5$\times$$10^{13}$ eV and $10^{16}$ eV covering the "knee" and also being connected with direct observations at energies around 100 TeV. We present the design and performance of YAC-II in this paper.

preprint2015arXiv

Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing

Mobile-edge cloud computing is a new paradigm to provide cloud computing capabilities at the edge of pervasive radio access networks in close proximity to mobile users. In this paper, we first study the multi-user computation offloading problem for mobile-edge cloud computing in a multi-channel wireless interference environment. We show that it is NP-hard to compute a centralized optimal solution, and hence adopt a game theoretic approach for achieving efficient computation offloading in a distributed manner. We formulate the distributed computation offloading decision making problem among mobile device users as a multi-user computation offloading game. We analyze the structural property of the game and show that the game admits a Nash equilibrium and possesses the finite improvement property. We then design a distributed computation offloading algorithm that can achieve a Nash equilibrium, derive the upper bound of the convergence time, and quantify its efficiency ratio over the centralized optimal solutions in terms of two important performance metrics. We further extend our study to the scenario of multi-user computation offloading in the multi-channel wireless contention environment. Numerical results corroborate that the proposed algorithm can achieve superior computation offloading performance and scale well as the user size increases.

preprint2015arXiv

Exploiting Social Trust Assisted Reciprocity (STAR) towards Utility-Optimal Socially-aware Crowdsensing

Mobile crowdsensing takes advantage of pervasive mobile devices to collect and process data for a variety of applications (e.g., traffic monitoring, spectrum sensing). In this study, a socially-aware crowdsensing system is advocated, in which a cloud-based platform incentivizes mobile users to participate in sensing tasks} by leveraging social trust among users, upon receiving sensing requests. For this system, social trust assisted reciprocity (STAR) - a synergistic marriage of social trust and reciprocity, is exploited to design an incentive mechanism that stimulates users' participation. Given the social trust structure among users, the efficacy of STAR for satisfying users' sensing requests is thoroughly investigated. Specifically, it is first shown that all requests can be satisfied if and only if sufficient social credit can be "transferred" from users who request more sensing service than they can provide to users who can provide more than they request. Then utility maximization for sensing services under STAR is investigated, and it is shown that it boils down to maximizing the utility of a circulation flow in the combined social graph and request graph. Accordingly, an algorithm that iteratively cancels a cycle of positive weight in the residual graph is developed, which computes the optimal solution efficiently, for both cases of divisible and indivisible sensing service. Extensive simulation results corroborate that STAR can significantly outperform the mechanisms using social trust only or reciprocity only.

preprint2015arXiv

Multi-lingual Geoparsing based on Machine Translation

Our method for multi-lingual geoparsing uses monolingual tools and resources along with machine translation and alignment to return location words in many languages. Not only does our method save the time and cost of developing geoparsers for each language separately, but also it allows the possibility of a wide range of language capabilities within a single interface. We evaluated our method in our LanguageBridge prototype on location named entities using newswire, broadcast news and telephone conversations in English, Arabic and Chinese data from the Linguistic Data Consortium (LDC). Our results for geoparsing Chinese and Arabic text using our multi-lingual geoparsing method are comparable to our results for geoparsing English text with our English tools. Furthermore, experiments using our machine translation approach results in accuracy comparable to results from the same data that was translated manually.

preprint2015arXiv

Quantum Nonlocality Enhanced by Homogenization

Homogenization proposed in [Y.-C Wu and M. Żukowski, Phys. Rev. A 85, 022119 (2012)] is a procedure to transform a tight Bell inequality with partial correlations into a full-correlation form that is also tight. In this paper, we check the homogenizations of two families of $n$-partite Bell inequalities: the Hardy inequality and the tight Bell inequality without quantum violation. For Hardy's inequalities, their homogenizations bear stronger quantum violation for the maximally entangled state; the tight Bell inequalities without quantum violation give the boundary of quantum and supra-quantum, but their homogenizations do not have the similar properties. We find their homogenization are violated by the maximally entangled state. Numerically computation shows the the domains of quantum violation of homogenized Hardy's inequalities for the generalized GHZ states are smaller than those of Hardy's inequalities.

preprint2014arXiv

Classical demonstration of frequency dependent noise ellipse rotation using Optomechanically Induced Transparency

Cavities with extremely narrow linewidth of 10-100 Hz are required for realizing frequency dependent squeezing to enable gravitational wave detectors to surpass the free mass standard quantum limit over a broad frequency range. Hundred-meter-scale high finesse cavities have been proposed for this purpose. Optomechanically induced transparency (OMIT) enables the creation of optomechanical cavities in which the linewidth limit is set by the extremely narrow linewidth of a high Q factor mechanical resonator. Using an 85mm OMIT cavity with a silicon nitride membrane, we demonstrate a tunable linewidth from 3Hz up to several hundred Hz and frequency dependent noise ellipse rotation using classical light with squeezed added noise to simulate quantum squeezed light. The frequency dependent noise ellipse angle is rotated in close agreement with predictions.

preprint2014arXiv

EEG Spatial Decoding and Classification with Logit Shrinkage Regularized Directed Information Assessment (L-SODA)

There is an increasing interest in studying the neural interaction mechanisms behind patterns of cognitive brain activity. This paper proposes a new approach to infer such interaction mechanisms from electroencephalographic (EEG) data using a new estimator of directed information (DI) called logit shrinkage optimized directed information assessment (L-SODA). Unlike previous directed information measures applied to neural decoding, L-SODA uses shrinkage regularization on multinomial logistic regression to deal with the high dimensionality of multi-channel EEG signals and the small sizes of many real-world datasets. It is designed to make few a priori assumptions and can handle both non-linear and non-Gaussian flows among electrodes. Our L-SODA estimator of the DI is accompanied by robust statistical confidence intervals on the true DI that make it especially suitable for hypothesis testing on the information flow patterns. We evaluate our work in the context of two different problems where interaction localization is used to determine highly interactive areas for EEG signals spatially and temporally. First, by mapping the areas that have high DI into Brodmann area, we identify that the areas with high DI are associated with motor-related functions. We demonstrate that L-SODA provides better accuracy for neural decoding of EEG signals as compared to several state-of-the-art approaches on the Brain Computer Interface (BCI) EEG motor activity dataset. Second, the proposed L-SODA estimator is evaluated on the CHB-MIT Scalp EEG database. We demonstrate that compared to the state-of-the-art approaches, the proposed method provides better performance in detecting the epileptic seizure.

preprint2014arXiv

Imitation-based Social Spectrum Sharing

Dynamic spectrum sharing is a promising technology for improving the spectrum utilization. In this paper, we study how secondary users can share the spectrum in a distributed fashion based on social imitations. The imitation-based mechanism leverages the social intelligence of the secondary user crowd and only requires a low computational power for each individual user. We introduce the information sharing graph to model the social information sharing relationship among the secondary users. We propose an imitative spectrum access mechanism on a general information sharing graph such that each secondary user first estimates its expected throughput based on local observations, and then imitates the channel selection of another neighboring user who achieves a higher throughput. We show that the imitative spectrum access mechanism converges to an imitation equilibrium, where no beneficial imitation can be further carried out on the time average. Numerical results show that the imitative spectrum access mechanism can achieve efficient spectrum utilization and meanwhile provide good fairness across secondary users.

preprint2014arXiv

Many-Access Channels: The Gaussian Case with Random User Activities

Classical multiuser information theory studies the fundamental limits of models with a fixed (often small) number of users as the coding blocklength goes to infinity. This work proposes a new paradigm, referred to as many-user information theory, where the number of users is allowed to grow with the blocklength. This paradigm is motivated by emerging systems with a massive number of users in an area, such as machine-to-machine communication systems and sensor networks. The focus of the current paper is the many-access channel model, which consists of a single receiver and many transmitters, whose number increases unboundedly with the blocklength. Moreover, an unknown subset of transmitters may transmit in a given block and need to be identified. A new notion of capacity is introduced and characterized for the Gaussian many-access channel with random user activities. The capacity can be achieved by first detecting the set of active users and then decoding their messages.

preprint2014arXiv

Many-Broadcast Channels: Definition and Capacity in the Degraded Case

Classical multiuser information theory studies the fundamental limits of models with a fixed (often small) number of users as the coding blocklength goes to infinity. Motivated by emerging systems with a massive number of users, this paper studies the new {\em many-user paradigm}, where the number of users is allowed to grow with the blocklength. The focus of this paper is the degraded many-broadcast channel model, whose number of users may grow as fast as linearly with the blocklength. A notion of capacity in terms of message length is defined and an example of Gaussian degraded many-broadcast channel is studied. In addition, a numerical example for the Gaussian degraded many-broadcast channel with fixed transmit power constraint is solved, where every user achieves strictly positive message length asymptotically.

preprint2014arXiv

Optical Monitoring of OT 546 in 2009

We reported the monitoring results of OT 546 in V, R and I bands, observed on 22 nights from February 16 to July 1 in 2009 at Weihai Observatory of Shandong University. During our monitoring, its variability amplitude was small and a possible microvariability was detected on one night using both C and F tests

preprint2014arXiv

Optical Monitoring of the Seyfert Galaxy NGC 4151 and Possible Periodicities in the Historical Light Curve

We report B, V, and R band CCD photometry of the Seyfert galaxy NGC 4151 obtained with the 1.0-m telescope at Weihai Observatory of Shandong University and the 1.56-m telescope at Shanghai Astronomical Observatory from 2005 December to 2013 February. Combining all available data from literature, we have constructed a historical light curve from 1910 to 2013 to study the periodicity of the source using three different methods (the Jurkevich method, the Lomb-Scargle periodogram method and the Discrete Correlation Function method). We find possible periods of P_1=4\pm0.1, P_2=7.5\pm0.3 and P_3=15.9\pm0.3 yr.

preprint2014arXiv

Shrinkage Optimized Directed Information using Pictorial Structures for Action Recognition

In this paper, we propose a novel action recognition framework. The method uses pictorial structures and shrinkage optimized directed information assessment (SODA) coupled with Markov Random Fields called SODA+MRF to model the directional temporal dependency and bidirectional spatial dependency. As a variant of mutual information, directional information captures the directional information flow and temporal structure of video sequences across frames. Meanwhile, within each frame, Markov random fields are utilized to model the spatial relations among different parts of a human body and the body parts of different people. The proposed SODA+MRF model is robust to view point transformations and detect complex interactions accurately. We compare the proposed method against several baseline methods to highlight the effectiveness of the SODA+MRF model. We demonstrate that our algorithm has superior action recognition performance on the UCF action recognition dataset, the Olympic sports dataset and the collective activity dataset over several state-of-the-art methods.

preprint2014arXiv

Sky Brightness at Weihai Observatory of Shandong University

In this paper, a total of about 28000 images in $V$ and $R$ band obtained on 161 nights using the one-meter optical telescope at Weihai Observatory (WHO) of Shandong University since 2008 to 2012 have been processed to measure the sky brightness. It provides us with an unprecedented database, which can be used to study the variation of the sky brightness with the sky position, the moonlight contribution, and the twilight sky brightness. The darkest sky brightness is about 19.0 and 18.6 $mag$ $arcsec^{-2}$ in $V$ and $R$ band, respectively. An obvious darkening trend is found at the first half of the night at WHO, and the variation rate is much larger in summer than that in other seasons. The sky brightness variation depends more on the azimuth than on the altitude of the telescope pointing for WHO. Our results indicate that the sky brightness at WHO is seriously influenced by the urban light.

preprint2014arXiv

Spatial Spectrum Access Game

A key feature of wireless communications is the spatial reuse. However, the spatial aspect is not yet well understood for the purpose of designing efficient spectrum sharing mechanisms. In this paper, we propose a framework of spatial spectrum access games on directed interference graphs, which can model quite general interference relationship with spatial reuse in wireless networks. We show that a pure Nash equilibrium exists for the two classes of games: (1) any spatial spectrum access games on directed acyclic graphs, and (2) any games satisfying the congestion property on directed trees and directed forests. Under mild technical conditions, the spatial spectrum access games with random backoff and Aloha channel contention mechanisms on undirected graphs also have a pure Nash equilibrium. We also quantify the price of anarchy of the spatial spectrum access game. We then propose a distributed learning algorithm, which only utilizes users' local observations to adaptively adjust the spectrum access strategies. We show that the distributed learning algorithm can converge to an approximate mixed-strategy Nash equilibrium for any spatial spectrum access games. Numerical results demonstrate that the distributed learning algorithm achieves up to superior performance improvement over a random access algorithm.

preprint2014arXiv

Unsupervised Deep Haar Scattering on Graphs

The classification of high-dimensional data defined on graphs is particularly difficult when the graph geometry is unknown. We introduce a Haar scattering transform on graphs, which computes invariant signal descriptors. It is implemented with a deep cascade of additions, subtractions and absolute values, which iteratively compute orthogonal Haar wavelet transforms. Multiscale neighborhoods of unknown graphs are estimated by minimizing an average total variation, with a pair matching algorithm of polynomial complexity. Supervised classification with dimension reduction is tested on data bases of scrambled images, and for signals sampled on unknown irregular grids on a sphere.

preprint2013arXiv

A 2.0 Gb/s Throughput Decoder for QC-LDPC Convolutional Codes

This paper propose a decoder architecture for low-density parity-check convolutional code (LDPCCC). Specifically, the LDPCCC is derived from a quasi-cyclic (QC) LDPC block code. By making use of the quasi-cyclic structure, the proposed LDPCCC decoder adopts a dynamic message storage in the memory and uses a simple address controller. The decoder efficiently combines the memories in the pipelining processors into a large memory block so as to take advantage of the data-width of the embedded memory in a modern field-programmable gate array (FPGA). A rate-5/6 QC-LDPCCC has been implemented on an Altera Stratix FPGA. It achieves up to 2.0 Gb/s throughput with a clock frequency of 100 MHz. Moreover, the decoder displays an excellent error performance of lower than $10^{-13}$ at a bit-energy-to-noise-power-spectral-density ratio ($E_b/N_0$) of 3.55 dB.

preprint2013arXiv

Coupler Conditioning and High Power Testing of ADS Spoke Cavity

Power couplers, used in China-ADS proton linac injector I, are required to transfer 6kW RF power to the superconducting Spoke cavities. At present, first two couplers of coaxial design have been fabricated and accomplished high power test at IHEP. The test results indicated that couplers of this design are qualified to deliver 10kW RF power in continuous travelling wave mode. This paper described the coupler's room temperature test procedures and results, discussed the original high power test terminated due to serious out-gassing and after some modifications. In the final test, the couplers smoothly exceeded the design power level.

preprint2013arXiv

Database-assisted Distributed Spectrum Sharing

According to FCC's ruling for white-space spectrum access, white-space devices are required to query a database to determine the spectrum availability. In this paper, we study the database-assisted distributed white-space access point (AP) network design. We first model the cooperative and non-cooperative channel selection problems among the APs as the system-wide throughput optimization and non-cooperative AP channel selection games, respectively, and design distributed AP channel selection algorithms that achieve system optimal point and Nash equilibrium, respectively. We then propose a state-based game formulation for the distributed AP association problem of the secondary users by taking the cost of mobility into account. We show that the state-based distributed AP association game has the finite improvement property, and design a distributed AP association algorithm that can converge to a state-based Nash equilibrium. Numerical results show that the algorithm is robust to the perturbation by secondary users' dynamical leaving and entering the system.

preprint2013arXiv

Microvariability Detection of Mrk 421

The BL Lac object Mrk 421 was observed in optical bands from 2009 April to 2012 May with the 1.0 m telescope at Weihai Observatory of Shandong University. Microvariability was analysed by C and F tests, but no significant microvariability was detected during our observations.

preprint2013arXiv

Observation of Three-Mode Parametric Instability

Three-mode parametric interactions can occur in triply-resonant opto-mechanical systems in which two orthogonal optical modes are coupled with an appropriate mechanical mode. Using an optical cavity with a membrane inside, we report the first observation of three-mode parametric instability in a Fabry-Perot cavity, a phenomenon predicted to occur in long baseline advanced gravitational wave detectors. We present a large signal model for the phenomenon, which predicts exponential growth of mechanical oscillation followed by saturation. Our experimental results are consistent with this model. Contrary to expectations, parametric instability does not lead to loss of cavity lock, a fact which may make it easier to implement control techniques in Advanced gravitational wave detectors.

preprint2013arXiv

Predicting a User's Next Cell With Supervised Learning Based on Channel States

Knowing a user's next cell allows more efficient resource allocation and enables new location-aware services. To anticipate the cell a user will hand-over to, we introduce a new machine learning based prediction system. Therein, we formulate the prediction as a classification problem based on information that is readily available in cellular networks. Using only Channel State Information (CSI) and handover history, we perform classification by embedding Support Vector Machines (SVMs) into an efficient pre-processing structure. Simulation results from a Manhattan Grid scenario and from a realistic radio map of downtown Frankfurt show that our system provides timely prediction at high accuracy.

preprint2013arXiv

Quality of Service Games for Spectrum Sharing

Today's wireless networks are increasingly crowded with an explosion of wireless users, who have greater and more diverse quality of service (QoS) demands than ever before. However, the amount of spectrum that can be used to satisfy these demands remains finite. This leads to a great challenge for wireless users to effectively share the spectrum to achieve their QoS requirements. This paper presents a game theoretic model for spectrum sharing, where users seek to satisfy their QoS demands in a distributed fashion. Our spectrum sharing model is quite general, because we allow different wireless channels to provide different QoS, depending upon their channel conditions and how many users are trying to access them. Also, users can be highly heterogeneous, with different QoS demands, depending upon their activities, hardware capabilities, and technology choices. Under such a general setting, we show that it is NP hard to find a spectrum allocation which satisfies the maximum number of users' QoS requirements in a centralized fashion. We also show that allowing users to self-organize through distributed channel selections is a viable alternative to the centralized optimization, because better response updating is guaranteed to reach a pure Nash equilibria in polynomial time. By bounding the price of anarchy, we demonstrate that the worst case pure Nash equilibrium can be close to optimal, when users and channels are not very heterogenous. We also extend our model by considering the frequency spatial reuse, and consider the user interactions as a game upon a graph where players only contend with their neighbors. We prove that better response updating is still guaranteed to reach a pure Nash equilibrium in this more general spatial QoS satisfaction game.

preprint2013arXiv

The Public Safety Broadband Network: A Novel Architecture with Mobile Base Stations

A nationwide interoperable public safety broadband network is being planned by the United States government. The network will be based on long term evolution (LTE) standards and use recently designated spectrum in the 700 MHz band. The public safety network has different objectives and traffic patterns than commercial wireless networks. In particular, the public safety network puts more emphasis on coverage, reliability and latency in the worst case scenario. Moreover, the routine public safety traffic is relatively light, whereas when a major incident occurs, the traffic demand at the incident scene can be significantly heavier than that in a commercial network. Hence it is prohibitively costly to build the public safety network using conventional cellular network architecture consisting of an infrastructure of stationary base transceiver stations. A novel architecture is proposed in this paper for the public safety broadband network. The architecture deploys stationary base stations sparsely to serve light routine traffic and dispatches mobile base stations to incident scenes along with public safety personnel to support heavy traffic. The analysis shows that the proposed architecture can potentially offer more than 75% reduction in terms of the total number of base stations needed.

preprint2013arXiv

The study and design of RF coupler for Chinese ADS HWR Superconducting Cavity

RF power coupler is a key component of the superconducting accelerating system in Chinese ADS proton linac injector I, which is used to transmit 15kW RF power from the power source to the superconducting HWR cavity. According to the requirement of working frequency, power level, transmission capability and cooling condition, the physics design of coupler has been finished, which includes RF structure optimization, thermal simulation, thermal stress analysis and so on. Based on this design, the prototype of HWR coupler has been fabricated, and then has passed the high power test successfully.

preprint2013arXiv

Variability of OI 090.4

OI 090.4 was monitored on 21 nights from 2006 to 2012 for studying the variability. Strong variations occurred during the past 6 years. The long-term variability amplitude is consistent with previous results. Microvariability was analyzed for 43 intra-night light curves. 30 out of 43 light curves showed microvariability by C and F tests analysis.

preprint2012arXiv

Distributed Spectrum Access with Spatial Reuse

Efficient distributed spectrum sharing mechanism is crucial for improving the spectrum utilization. The spatial aspect of spectrum sharing, however, is less understood than many other aspects. In this paper, we generalize a recently proposed spatial congestion game framework to design efficient distributed spectrum access mechanisms with spatial reuse. We first propose a spatial channel selection game to model the distributed channel selection problem with fixed user locations. We show that the game is a potential game, and develop a distributed learning mechanism that converges to a Nash equilibrium only based on users' local observations. We then formulate the joint channel and location selection problem as a spatial channel selection and mobility game, and show that it is also a potential game. We next propose a distributed strategic mobility algorithm, jointly with the distributed learning mechanism, that can converge to a Nash equilibrium.

preprint2012arXiv

Evolutionarily Stable Spectrum Access

In this paper, we design distributed spectrum access mechanisms with both complete and incomplete network information. We propose an evolutionary spectrum access mechanism with complete network information, and show that the mechanism achieves an equilibrium that is globally evolutionarily stable. With incomplete network information, we propose a distributed learning mechanism, where each user utilizes local observations to estimate the expected throughput and learns to adjust its spectrum access strategy adaptively over time. We show that the learning mechanism converges to the same evolutionary equilibrium on the time average. Numerical results show that the proposed mechanisms are robust to the perturbations of users' channel selections.

preprint2012arXiv

Evolutionary Game and Learning for Dynamic Spectrum Access

Efficient dynamic spectrum access mechanism is crucial for improving the spectrum utilization. In this paper, we consider the dynamic spectrum access mechanism design with both complete and incomplete network information. When the network information is available, we propose an evolutionary spectrum access mechanism. We use the replicator dynamics to study the dynamics of channel selections, and show that the mechanism achieves an equilibrium that is an evolutionarily stable strategy and is also max-min fair. With incomplete network information, we propose a distributed reinforcement learning mechanism for dynamic spectrum access. Each secondary user applies the maximum likelihood estimation method to estimate its expected payoff based on the local observations, and learns to adjust its mixed strategy for channel selections adaptively over time. We study the convergence of the learning mechanism based on the theory of stochastic approximation, and show that it globally converges to an approximate Nash equilibrium. Numerical results show that the proposed evolutionary spectrum access and distributed reinforcement learning mechanisms achieve up to 82% and 70% performance improvement than a random access mechanism, respectively, and are robust to random perturbations of channel selections.

preprint2011arXiv

Adaptive Channel Recommendation For Opportunistic Spectrum Access

We propose a dynamic spectrum access scheme where secondary users recommend "good" channels to each other and access accordingly. We formulate the problem as an average reward based Markov decision process. We show the existence of the optimal stationary spectrum access policy, and explore its structure properties in two asymptotic cases. Since the action space of the Markov decision process is continuous, it is difficult to find the optimal policy by simply discretizing the action space and use the policy iteration, value iteration, or Q-learning methods. Instead, we propose a new algorithm based on the Model Reference Adaptive Search method, and prove its convergence to the optimal policy. Numerical results show that the proposed algorithms achieve up to 18% and 100% performance improvement than the static channel recommendation scheme in homogeneous and heterogeneous channel environments, respectively, and is more robust to channel dynamics.

Xu Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

117 published item(s)

CurEvo: Curriculum-Guided Self-Evolution for Video Understanding

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Offline Imitation Learning with Variational Counterfactual Reasoning

Professional Network Matters: Connections Empower Person-Job Fit

Real-Time High-Resolution Pedestrian Detection in Crowded Scenes via Parallel Edge Offloading

Superconductivity in an Orbital-reoriented SnAs Square Lattice: a Case Study of Li0.6Sn2As2 and NaSnAs

3D Dense Face Alignment with Fused Features by Aggregating CNNs and GCNs

Analog MIMO Communication for One-shot Distributed Principal Component Analysis

Cluster extent inference revisited: quantification and localization of brain activity

Collaboration in Participant-Centric Federated Learning: A Game-Theoretical Perspective

Debiased Recommendation with Neural Stratification

Debiased Recommendation with User Feature Balancing

Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping

Enabling Long-Term Cooperation in Cross-Silo Federated Learning: A Repeated Game Perspective

Explainable Legal Case Matching via Inverse Optimal Transport-based Rationale Extraction

Exploration of the origin of 2020 X-ray outburst in OJ 287

FastRE: Towards Fast Relation Extraction with Convolutional Encoder and Improved Cascade Binary Tagging Framework

gDNA: Towards Generative Detailed Neural Avatars

Generalizable Information Theoretic Causal Representation

GraphAD: A Graph Neural Network for Entity-Wise Multivariate Time-Series Anomaly Detection

Gumble Softmax For User Behavior Modeling

Knowledge-Guided Learning for Transceiver Design in Over-the-Air Federated Learning

Learning to Identify Top Elo Ratings: A Dueling Bandits Approach

Measuring "Why" in Recommender Systems: a Comprehensive Survey on the Evaluation of Explainable Recommendation

Multi-Agent Reinforcement Learning for Markov Routing Games: A New Modeling Paradigm For Dynamic Traffic Assignment

Neural Message Passing for Visual Relationship Detection

PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence

RecBole 2.0: Towards a More Up-to-Date Recommendation Library

Robust PCA Unrolling Network for Super-resolution Vessel Extraction in X-ray Coronary Angiography

Sequential Recommendation with User Evolving Preference Decomposition

Three-Dimensional Spectrum Occupancy Measurement using UAV: Performance Analysis and Algorithm Design

TIPS: Transaction Inclusion Protocol with Signaling in DAG-based Blockchain

Towards Equivalent Transformation of User Preferences in Cross Domain Recommendation

User behavior understanding in real world settings

Working memory inspired hierarchical video decomposition with transformative representations

Zero Trust Architecture for 6G Security

Constraining Evolution of Magnetic Field Strength in Dissipation Region of Two BL Lac Objects

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

Discovery of two families of VSb-based compounds with V-kagome lattice

Discrete Knowledge Graph Embedding based on Discrete Optimization

EC-SAGINs: Edge Computing-enhanced Space-Air-Ground Integrated Networks for Internet of Vehicles

Generate Natural Language Explanations for Recommendation

Generating Multi-scale Maps from Remote Sensing Images via Series Generative Adversarial Networks

Joint Radar and Communication: A Survey

Learning Post-Hoc Causal Explanations for Recommendation

Visually-aware Recommendation with Aesthetic Features

Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

An Edge Computing-based Photo Crowdsourcing Framework for Real-time 3D Reconstruction

AxeChain: A Secure and Decentralized blockchain for solving Easily-Verifiable problems

Category Level Object Pose Estimation via Neural Analysis-by-Synthesis

Collaborative Adversarial Learning for RelationalLearning on Multiple Bipartite Graphs

Convergence of Edge Computing and Deep Learning: A Comprehensive Survey

Curvature induced polarization and spectral index behavior for PKS 1502+106

Decoupled Variational Embedding for Signed Directed Networks

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

Dual Graph Embedding for Object-Tag LinkPrediction on the Knowledge Graph

Explainable Recommendation: A Survey and New Perspectives

HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning

HierTrain: Fast Hierarchical Edge AI Learning with Hybrid Parallelism in Mobile-Edge-Cloud Computing

Human Body Model Fitting by Learned Gradient Descent

Joint Multi-User DNN Partitioning and Computational Resource Allocation for Collaborative Edge Intelligence

Knowledge Distillation for Mobile Edge Computation Offloading

Leveraging the Power of Prediction: Predictive Service Placement for Latency-Sensitive Mobile Edge Computing

Liability Design for Autonomous Vehicles and Human-Driven Vehicles: A Hierarchical Game-Theoretic Approach

Personalized Federated Learning for Intelligent IoT Applications: A Cloud-Edge based Framework

The first light curve modeling and orbital period change investigation of nine contact binaries around the short period cut-off

When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network

Locations of optical and $γ$-ray emitting regions in the jet of PMN J2345-1555

A Generalized LDPC Framework for Robust and Sublinear Compressive Sensing

Amazon in the White Space: Social Recommendation Aided Distributed Spectrum Access

Astronomical Observing Conditions at Xinglong Observatory from 2007 to 2014

Bipartite quantum coherence in noninertial frames

Content Retrieval At the Edge: A Social-aware and Named Data Cooperative Framework