Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
71works
0followers
38topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

71 published item(s)

preprint2026arXiv

Uncertainty-Aware Trip Purpose Inference from GPS Trajectories via POI Semantic Zones and Pareto Calibration

Large-scale GPS trajectory data offer rich observations of human mobility, yet assigning trip purposes to detected stops remains challenging due to the absence of individual-level ground truth, spatial uncertainty from GPS noise and incomplete points of interest (POIs) coverage, and fundamental behavioral differences across trip purposes. We propose a weakly supervised framework integrating neighborhood-level POI semantic zones with distance-weighted spatial likelihoods, differentiated inference strategies for mandatory and non-mandatory activities, and a multi-phase Pareto optimization that jointly minimizes distributional divergence from household travel survey statistics and maximizes inference reliability without requiring annotated labels. Evaluated on over 81 million staypoints in Los Angeles, the framework reduces activity type frequency Jensen-Shannon distance (JSD) by 23%, start time JSD by 48%, and duration JSD by 12% respectively relative to a comparable baseline. The proposed approach provides a scalable and uncertainty-aware path from raw GPS trajectories to semantically annotated mobility data for travel demand modeling and transportation policy analysis.

preprint2024arXiv

Exploring Adversarial Robustness of LiDAR-Camera Fusion Model in Autonomous Driving

Our study assesses the adversarial robustness of LiDAR-camera fusion models in 3D object detection. We introduce an attack technique that, by simply adding a limited number of physically constrained adversarial points above a car, can make the car undetectable by the fusion model. Experimental results reveal that even without changes to the image data channel, the fusion model can be deceived solely by manipulating the LiDAR data channel. This finding raises safety concerns in the field of autonomous driving. Further, we explore how the quantity of adversarial points, the distance between the front-near car and the LiDAR-equipped car, and various angular factors affect the attack success rate. We believe our research can contribute to the understanding of multi-sensor robustness, offering insights and guidance to enhance the safety of autonomous driving.

preprint2024arXiv

Multi-stages attention Breast cancer classification based on nonlinear spiking neural P neurons with autapses

Breast cancer(BC) is a prevalent type of malignant tumor in women. Early diagnosis and treatment are vital for enhancing the patients' survival rate. Downsampling in deep networks may lead to loss of information, so for compensating the detail and edge information and allowing convolutional neural networks to pay more attention to seek the lesion region, we propose a multi-stages attention architecture based on NSNP neurons with autapses. First, unlike the single-scale attention acquisition methods of existing methods, we set up spatial attention acquisition at each feature map scale of the convolutional network to obtain an fusion global information on attention guidance. Then we introduce a new type of NSNP variants called NSNP neurons with autapses. Specifically, NSNP systems are modularized as feature encoders, recoding the features extracted from convolutional neural network as well as the fusion of attention information and preserve the key characteristic elements in feature maps. This ensures the retention of valuable data while gradually transforming high-dimensional complicated info into low-dimensional ones. The proposed method is evaluated on the public dataset BreakHis at various magnifications and classification tasks. It achieves a classification accuracy of 96.32% at all magnification cases, outperforming state-of-the-art methods. Ablation studies are also performed, verifying the proposed model's efficacy. The source code is available at XhuBobYoung/Breast-cancer-Classification.

preprint2024arXiv

SLP-Net:An efficient lightweight network for segmentation of skin lesions

Prompt treatment for melanoma is crucial. To assist physicians in identifying lesion areas precisely in a quick manner, we propose a novel skin lesion segmentation technique namely SLP-Net, an ultra-lightweight segmentation network based on the spiking neural P(SNP) systems type mechanism. Most existing convolutional neural networks achieve high segmentation accuracy while neglecting the high hardware cost. SLP-Net, on the contrary, has a very small number of parameters and a high computation speed. We design a lightweight multi-scale feature extractor without the usual encoder-decoder structure. Rather than a decoder, a feature adaptation module is designed to replace it and implement multi-scale information decoding. Experiments at the ISIC2018 challenge demonstrate that the proposed model has the highest Acc and DSC among the state-of-the-art methods, while experiments on the PH2 dataset also demonstrate a favorable generalization ability. Finally, we compare the computational complexity as well as the computational speed of the models in experiments, where SLP-Net has the highest overall superiority

preprint2022arXiv

3PSDF: Three-Pole Signed Distance Function for Learning Surfaces with Arbitrary Topologies

Recent advances in learning 3D shapes using neural implicit functions have achieved impressive results by breaking the previous barrier of resolution and diversity for varying topologies. However, most of such approaches are limited to closed surfaces as they require the space to be divided into inside and outside. More recent works based on unsigned distance function have been proposed to handle complex geometry containing both the open and closed surfaces. Nonetheless, as their direct outputs are point clouds, robustly obtaining high-quality meshing results from discrete points remains an open question. We present a novel learnable implicit representation, called the three-pole signed distance function (3PSDF), that can represent non-watertight 3D shapes with arbitrary topologies while supporting easy field-to-mesh conversion using the classic Marching Cubes algorithm. The key to our method is the introduction of a new sign, the NULL sign, in addition to the conventional in and out labels. The existence of the null sign could stop the formation of a closed isosurface derived from the bisector of the in/out regions. Further, we propose a dedicated learning framework to effectively learn 3PSDF without worrying about the vanishing gradient due to the null labels. Experimental results show that our approach outperforms the previous state-of-the-art methods in a wide range of benchmarks both quantitatively and qualitatively.

preprint2022arXiv

A Comparative Study of Speaker Role Identification in Air Traffic Communication Using Deep Learning Approaches

Automatic spoken instruction understanding (SIU) of the controller-pilot conversations in the air traffic control (ATC) requires not only recognizing the words and semantics of the speech but also determining the role of the speaker. However, few of the published works on the automatic understanding systems in air traffic communication focus on speaker role identification (SRI). In this paper, we formulate the SRI task of controller-pilot communication as a binary classification problem. Furthermore, the text-based, speech-based, and speech and text based multi-modal methods are proposed to achieve a comprehensive comparison of the SRI task. To ablate the impacts of the comparative approaches, various advanced neural network architectures are applied to optimize the implementation of text-based and speech-based methods. Most importantly, a multi-modal speaker role identification network (MMSRINet) is designed to achieve the SRI task by considering both the speech and textual modality features. To aggregate modality features, the modal fusion module is proposed to fuse and squeeze acoustic and textual representations by modal attention mechanism and self-attention pooling layer, respectively. Finally, the comparative approaches are validated on the ATCSpeech corpus collected from a real-world ATC environment. The experimental results demonstrate that all the comparative approaches are worked for the SRI task, and the proposed MMSRINet shows the competitive performance and robustness than the other methods on both seen and unseen data, achieving 98.56%, and 98.08% accuracy, respectively.

preprint2022arXiv

Asynchronous Decentralized Federated Learning for Collaborative Fault Diagnosis of PV Stations

Due to the different losses caused by various photovoltaic (PV) array faults, accurate diagnosis of fault types is becoming increasingly important. Compared with a single one, multiple PV stations collect sufficient fault samples, but their data is not allowed to be shared directly due to potential conflicts of interest. Therefore, federated learning can be exploited to train a collaborative fault diagnosis model. However, the modeling efficiency is seriously affected by the model update mechanism since each PV station has a different computing capability and amount of data. Moreover, for the safe and stable operation of the PV system, the robustness of collaborative modeling must be guaranteed rather than simply being processed on a central server. To address these challenges, a novel asynchronous decentralized federated learning (ADFL) framework is proposed. Each PV station not only trains its local model but also participates in collaborative fault diagnosis by exchanging model parameters to improve the generalization without losing accuracy. The global model is aggregated distributedly to avoid central node failure. By designing the asynchronous update scheme, the communication overhead and training time are greatly reduced. Both the experiments and numerical simulations are carried out to verify the effectiveness of the proposed method.

preprint2022arXiv

Bidirectional Pricing and Demand Response for Nanogrids with HVAC Systems

Owing to the fluctuant renewable generation and power demand, the energy surplus or deficit in each nanogrid is embodied differently across time. To stimulate local renewable energy consumption and minimize the long-term energy cost, some issues still remain to be explored: when and how the energy demand and bidirectional trading prices are scheduled considering personal comfort preferences and environmental factors. For this purpose, the demand response and two-way pricing problems concurrently for nanogrids and a public monitoring entity (PME) are studied with exploiting the large potential thermal elastic ability of heating, ventilation and air-conditioning (HVAC) units. Different from nanogrids, in terms of minimizing time-average costs, PME aims to set reasonable prices and optimize profits by trading with nanogrids and the main grid bi-directionally. In particular, such bilevel energy management problem is formulated as a stochastic form in a long-term horizon. Since there are uncertain system parameters, time-coupled queue constraints and the interplay of bilevel decision-making, it is challenging to solve the formulated problems. To this end, we derive a form of relaxation based on Lyapunov optimization technique to make the energy management problem tractable without forecasting the related system parameters. The transaction between nanogrids and PME is captured by a one-leader and multi-follower Stackelberg game framework. Then, theoretical analysis of the existence and uniqueness of Stackelberg equilibrium (SE) is developed based on the proposed game property. Following that, we devise an optimization algorithm to reach the SE with less information exchange. Numerical experiments validate the effectiveness of the proposed approach.

preprint2022arXiv

CDRec: Cayley-Dickson Recommender

In this paper, we propose a recommendation framework named Cayley-Dickson Recommender. We introduce Cayley-Dickson construction which uses a recursive process to define hypercomplex algebras and their mathematical operations. We also design a graph convolution operator to learn representations in the hypercomplex space. To the best of our knowledge, it is the first time that Cayley-Dickson construction and graph convolution techniques have been used in hypercomplex recommendation. Compared with the state-of-the-art recommendation methods, our method achieves superior performance on real-world datasets.

preprint2022arXiv

Driving in Real Life with Inverse Reinforcement Learning

In this paper, we introduce the first learning-based planner to drive a car in dense, urban traffic using Inverse Reinforcement Learning (IRL). Our planner, DriveIRL, generates a diverse set of trajectory proposals, filters these trajectories with a lightweight and interpretable safety filter, and then uses a learned model to score each remaining trajectory. The best trajectory is then tracked by the low-level controller of our self-driving vehicle. We train our trajectory scoring model on a 500+ hour real-world dataset of expert driving demonstrations in Las Vegas within the maximum entropy IRL framework. DriveIRL's benefits include: a simple design due to only learning the trajectory scoring function, relatively interpretable features, and strong real-world performance. We validated DriveIRL on the Las Vegas Strip and demonstrated fully autonomous driving in heavy traffic, including scenarios involving cut-ins, abrupt braking by the lead vehicle, and hotel pickup/dropoff zones. Our dataset will be made public to help further research in this area.

preprint2022arXiv

Energy Management Based on Multi-Agent Deep Reinforcement Learning for A Multi-Energy Industrial Park

Owing to large industrial energy consumption, industrial production has brought a huge burden to the grid in terms of renewable energy access and power supply. Due to the coupling of multiple energy sources and the uncertainty of renewable energy and demand, centralized methods require large calculation and coordination overhead. Thus, this paper proposes a multi-energy management framework achieved by decentralized execution and centralized training for an industrial park. The energy management problem is formulated as a partially-observable Markov decision process, which is intractable by dynamic programming due to the lack of the prior knowledge of the underlying stochastic process. The objective is to minimize long-term energy costs while ensuring the demand of users. To solve this issue and improve the calculation speed, a novel multi-agent deep reinforcement learning algorithm is proposed, which contains the following key points: counterfactual baseline for facilitating contributing agents to learn better policies, soft actor-critic for improving robustness and exploring optimal solutions. A novel reward is designed by Lagrange multiplier method to ensure the capacity constraints of energy storage. In addition, considering that the increase in the number of agents leads to performance degradation due to large observation spaces, an attention mechanism is introduced to enhance the stability of policy and enable agents to focus on important energy-related information, which improves the exploration efficiency of soft actor-critic. Numerical results based on actual data verify the performance of the proposed algorithm with high scalability, indicating that the industrial park can minimize energy costs under different demands.

preprint2022arXiv

Fast Distributed Stochastic Scheduling for A Multi-Energy Industrial Park

The multi-energy management framework of industrial parks advocates energy conversion and scheduling, which takes full advantage of the compensation and temporal availability of multiple energy. However, how to exploit elastic loads and compensate inelastic loads to match multiple generators and storage is still a key problem under the uncertainty of demand and supply. To solve the issue, the energy management problem is constructed as a stochastic optimization problem. The optimization aims are to minimize the time-averaged energy cost and improve the energy efficiency while respecting the energy constraints. To achieve the distributed implementation in real time without knowing any priori knowledge of underlying stochastic process, a distributed stochastic gradient algorithm based on dual decomposition and a fast scheme are proposed. The numerical results based on real data show that the industrial park, by adopting the proposed algorithm, can achieve social welfare maximization asymptotically.

preprint2022arXiv

Federated Spectrum Learning for Reconfigurable Intelligent Surfaces-Aided Wireless Edge Networks

Increasing concerns on intelligent spectrum sensing call for efficient training and inference technologies. In this paper, we propose a novel federated learning (FL) framework, dubbed federated spectrum learning (FSL), which exploits the benefits of reconfigurable intelligent surfaces (RISs) and overcomes the unfavorable impact of deep fading channels. Distinguishingly, we endow conventional RISs with spectrum learning capabilities by leveraging a fully-trained convolutional neural network (CNN) model at each RIS controller, thereby helping the base station to cooperatively infer the users who request to participate in FL at the beginning of each training iteration. To fully exploit the potential of FL and RISs, we address three technical challenges: RISs phase shifts configuration, user-RIS association, and wireless bandwidth allocation. The resulting joint learning, wireless resource allocation, and user-RIS association design is formulated as an optimization problem whose objective is to maximize the system utility while considering the impact of FL prediction accuracy. In this context, the accuracy of FL prediction interplays with the performance of resource optimization. In particular, if the accuracy of the trained CNN model deteriorates, the performance of resource allocation worsens. The proposed FSL framework is tested by using real radio frequency (RF) traces and numerical results demonstrate its advantages in terms of spectrum prediction accuracy and system utility: a better CNN prediction accuracy and FL system utility can be achieved with a larger number of RISs and reflecting elements.

preprint2022arXiv

GR-GAN: Gradual Refinement Text-to-image Generation

A good Text-to-Image model should not only generate high quality images, but also ensure the consistency between the text and the generated image. Previous models failed to simultaneously fix both sides well. This paper proposes a Gradual Refinement Generative Adversarial Network (GR-GAN) to alleviates the problem efficiently. A GRG module is designed to generate images from low resolution to high resolution with the corresponding text constraints from coarse granularity (sentence) to fine granularity (word) stage by stage, a ITM module is designed to provide image-text matching losses at both sentence-image level and word-region level for corresponding stages. We also introduce a new metric Cross-Model Distance (CMD) for simultaneously evaluating image quality and image-text consistency. Experimental results show GR-GAN significant outperform previous models, and achieve new state-of-the-art on both FID and CMD. A detailed analysis demonstrates the efficiency of different generation stages in GR-GAN.

preprint2022arXiv

How to Share: Balancing Layer and Chain Sharing in Industrial Microservice Deployment

With the rapid development of smart manufacturing, edge computing-oriented microservice platforms are emerging as an important part of production control. In the containerized deployment of microservices, layer sharing can reduce the huge bandwidth consumption caused by image pulling, and chain sharing can reduce communication overhead caused by communication between microservices. The two sharing methods use the characteristics of each microservice to share resources during deployment. However, due to the limited resources of edge servers, it is difficult to meet the optimization goals of the two methods at the same time. Therefore, it is of critical importance to realize the improvement of service response efficiency by balancing the two sharing methods. This paper studies the optimal microservice deployment strategy that can balance layer sharing and chain sharing of microservices. We build a problem that minimizes microservice image pull delay and communication overhead and transform the problem into a linearly constrained integer quadratic programming problem through model reconstruction. A deployment strategy is obtained through the successive convex approximation (SCA) method. Experimental results show that the proposed deployment strategy can balance the two resource sharing methods. When the two sharing methods are equally considered, the average image pull delay can be reduced to 65% of the baseline, and the average communication overhead can be reduced to 30% of the baseline.

preprint2022arXiv

Inferring origin-destination distribution of agent transfer in a complex network using deep gated recurrent units

Predicting the origin-destination (OD) probability distribution of agent transfer is an important problem for managing complex systems. However, prediction accuracy of associated statistical estimators suffer from underdetermination. While specific techniques have been proposed to overcome this deficiency, there still lacks a general approach. Here, we propose a deep neural network framework with gated recurrent units (DNNGRU) to address this gap. Our DNNGRU is \emph{network-free}, as it is trained by supervised learning with time-series data on the volume of agents passing through edges. We use it to investigate how network topologies affect OD prediction accuracy, where performance enhancement is observed to depend on the degree of overlap between paths taken by different ODs. By comparing against methods that give exact results, we demonstrate the near-optimal performance of our DNNGRU, which we found to consistently outperform existing methods and alternative neural network architectures, under diverse data generation scenarios.

preprint2022arXiv

Joint Offloading Decision and Resource Allocation for Vehicular Fog-Edge Computing Networks: A Contract-Stackelberg Approach

With the popularity of mobile devices and development of computationally intensive applications, researchers are focusing on offloading computation to Mobile Edge Computing (MEC) server due to its high computational efficiency and low communication delay. As the computing resources of an MEC server are limited, vehicles in the urban area who have abundant idle resources should be fully utilized. However, offloading computing tasks to vehicles faces many challenging issues. In this paper, we introduce a vehicular fog-edge computing paradigm and formulate it as a multi-stage Stackelberg game to deal with these issues. Specifically, vehicles are not obligated to share resources, let alone disclose their private information (e.g., stay time and the amount of resources). Therefore, in the first stage, we design a contract-based incentive mechanism to motivate vehicles to contribute their idle resources. Next, due to the complicated interactions among vehicles, road-side unit (RSU), MEC server and mobile device users, it is challenging to coordinate the resources of all parties and design a transaction mechanism to make all entities benefit. In the second and third stages, based on Stackelberg game, we develop pricing strategies that maximize the utilities of all parties. The analytical forms of optimal strategies for each stage are given. Simulation results demonstrate the effectiveness of our proposed incentive mechanism, reveal the trends of energy consumption and offloading decisions of users with various parameters, and present the performance comparison between our framework and existing MEC offloading paradigm in vehicular networks.

preprint2022arXiv

Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach

Rule sets are highly interpretable logical models in which the predicates for decision are expressed in disjunctive normal form (DNF, OR-of-ANDs), or, equivalently, the overall model comprises an unordered collection of if-then decision rules. In this paper, we consider a submodular optimization based approach for learning rule sets. The learning problem is framed as a subset selection task in which a subset of all possible rules needs to be selected to form an accurate and interpretable rule set. We employ an objective function that exhibits submodularity and thus is amenable to submodular optimization techniques. To overcome the difficulty arose from dealing with the exponential-sized ground set of rules, the subproblem of searching a rule is casted as another subset selection task that asks for a subset of features. We show it is possible to write the induced objective function for the subproblem as a difference of two submodular (DS) functions to make it approximately solvable by DS optimization algorithms. Overall, the proposed approach is simple, scalable, and likely to be benefited from further research on submodular optimization. Experiments on real datasets demonstrate the effectiveness of our method.

preprint2022arXiv

Monolithic Integration of Embedded III-V Lasers on SOI

Silicon photonic integration has gained great success in many application fields owing to the excellent optical device properties and complementary metal-oxide semiconductor (CMOS) compatibility. Realizing monolithic integration of III-V lasers and silicon photonic components on single silicon wafer is recognized as a long-standing obstacle for ultra-dense photonic integration, which can provide considerable economical, energy efficient and foundry-scalable on-chip light sources, that has not been reported yet. Here, we demonstrate embedded InAs/GaAs quantum dot (QD) lasers directly grown on trenched silicon-on-insulator (SOI) substrate, enabling monolithic integration with butt-coupled silicon waveguides. By utilizing the patterned grating structures inside pre-defined SOI trenches and unique epitaxial method via molecular beam epitaxy (MBE), high-performance embedded InAs QD lasers with out-coupled silicon waveguide are achieved on such template. By resolving the epitaxy and fabrication challenges in such monolithic integrated architecture, embedded III-V lasers on SOI with continuous-wave lasing up to 85 oC are obtained. The maximum output power of 6.8 mW can be measured from the end tip of the butt-coupled silicon waveguides, with estimated coupling efficiency of approximately -7.35 dB. The results presented here provide a scalable and low-cost epitaxial method for realization of on-chip light sources directly coupling to the silicon photonic components for future high-density photonic integration.

preprint2022arXiv

Multi-level Coordinated Energy Management for Energy Hub in Hybrid Markets with Distributionally Robust Scheduling

Maintaining energy balance and economical operation is significant for multi-energy systems such as the energy hub. However, it is usually challenged by the frequently changing and unpredictable uncertainties at different timescales. Under this scope, this paper investigates the coordinated energy management problem for day-ahead and intra-day conditions considering uncertainties of source-load and market prices concurrently. Note that the precise knowledge of distributions about uncertainties may be unaccessible before the decision-making in day-ahead phase. A two-stage chance-constrained model based on distributionally robust approach with ambiguous moment information is proposed to immunize scheduling strategies against the worst-case probability distributions. The first stage is dedicated to obtaining more energy arbitrage and operation flexibility by optimizing bidding strategies in day-ahead power, natural gas and carbon markets. The second stage focuses on the optimization of worst-case expected operation cost. It provides a robust energy equipment and load scheduling strategy for the reference of subsequent intra-day arrangements. With respect to different variations of electrical and thermal components, an intra-day two-timescale coordination is implemented step by step. The energy scheduling is re-dispatched circularly to minimize the operation and penalty costs. The possible energy imbalance is also compensated by this way. As the energy management program is nonlinear, chance-constrained and multi-stage, linearization and dual transformation techniques are designed to enhance tractability of programs. Experimental results show that the developed multi-level framework results in a carbon emission decrease of 37%, and reduces energy cost averagely 3% compared with corresponding contrasting cases. The obtained strategy validates a good tradeoff between robustness and optimality.

preprint2022arXiv

On the Advances and Challenges of Adaptive Online Testing

In recent years, the interest in developing adaptive solutions for online testing has grown significantly in the industry. While the advances related to this relative new technology have been developed in multiple domains, it lacks in the literature a systematic and complete treatment of the procedure that involves exploration, inference, and analysis. This short paper aims to develop a comprehensive understanding of adaptive online testing, including various building blocks and analytical results. We also address the latest developments, research directions, and challenges that have been less mentioned in the literature.

preprint2022arXiv

RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds

We present RangeUDF, a new implicit representation based framework to recover the geometry and semantics of continuous 3D scene surfaces from point clouds. Unlike occupancy fields or signed distance fields which can only model closed 3D surfaces, our approach is not restricted to any type of topology. Being different from the existing unsigned distance fields, our framework does not suffer from any surface ambiguity. In addition, our RangeUDF can jointly estimate precise semantics for continuous surfaces. The key to our approach is a range-aware unsigned distance function together with a surface-oriented semantic segmentation module. Extensive experiments show that RangeUDF clearly surpasses state-of-the-art approaches for surface reconstruction on four point cloud datasets. Moreover, RangeUDF demonstrates superior generalization capability across multiple unseen datasets, which is nearly impossible for all existing approaches.

preprint2022arXiv

Rogue wave patterns associated with Okamoto polynomial hierarchies

We show that new types of rogue wave patterns exist in integrable systems, and these rogue patterns are described by root structures of Okamoto polynomial hierarchies. These rogue patterns arise when the $τ$ functions of rogue wave solutions are determinants of Schur polynomials with index jumps of three, and an internal free parameter in these rogue waves gets large. We demonstrate these new rogue patterns in the Manakov system and the three-wave resonant interaction system. For each system, we derive asymptotic predictions of its rogue patterns under a large internal parameter through Okamoto polynomial hierarchies. Unlike the previously reported rogue patterns associated with the Yablonskii-Vorob'ev hierarchy, a new feature in the present rogue patterns is that, the mapping from the root structure of Okamoto-hierarchy polynomials to the shape of the rogue pattern is linear only to the leading order, but becomes nonlinear to the next order. As a consequence, the current rogue patterns are often deformed, sometimes strongly deformed, from Okamoto root structures, unless the underlying free parameter is very large. Our analytical predictions of rogue patterns are compared to true solutions, and excellent agreement is observed, even when rogue patterns are strongly deformed from Okamoto root structures.

preprint2022arXiv

Rogue waves in the massive Thirring model

In this paper, general rogue wave solutions in the massive Thirring (MT) model are derived by using the Kadomtsev-Petviashvili (KP) hierarchy reduction method and these rational solutions are presented explicitly in terms of determinants whose matrix elements are elementary Schur polynomials. In the reduction process, three reduction conditions including one index- and two dimension-ones are proved to be consistent by only one constraint relation on parameters of tau-functions of the KP-Toda hierarchy.It is found that the rogue wave solutions in the MT model depend on two background parameters, which influence their orientation and duration. Differing from many other coupled integrable systems, the MT model only admits the rogue waves of bright-type, and the higher-order rogue waves represent the superposition of fundamental ones in which the non-reducible parameters determine the arrangement patterns of fundamental rogue waves. Particularly, the super rogue wave at each order can be achieved simply by setting all internal parameters to be zero, resulting in the amplitude of the sole huge peak of order $N$ being $2N+1$ times the background.Finally, rogue wave patterns are discussed when one of the internal parameters is large. Similar to other integrable equations, the patterns are shown to be associated with the root structures of the Yablonskii-Vorob'ev polynomial hierarchy through a linear transformation.

preprint2022arXiv

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

With the recent availability and affordability of commercial depth sensors and 3D scanners, an increasing number of 3D (i.e., RGBD, point cloud) datasets have been publicized to facilitate research in 3D computer vision. However, existing datasets either cover relatively small areas or have limited semantic annotations. Fine-grained understanding of urban-scale 3D scenes is still in its infancy. In this paper, we introduce SensatUrban, an urban-scale UAV photogrammetry point cloud dataset consisting of nearly three billion points collected from three UK cities, covering 7.6 km^2. Each point in the dataset has been labelled with fine-grained semantic annotations, resulting in a dataset that is three times the size of the previous existing largest photogrammetric point cloud dataset. In addition to the more commonly encountered categories such as road and vegetation, urban-level categories including rail, bridge, and river are also included in our dataset. Based on this dataset, we further build a benchmark to evaluate the performance of state-of-the-art segmentation algorithms. In particular, we provide a comprehensive analysis and identify several key challenges limiting urban-scale point cloud understanding. The dataset is available at http://point-cloud-analysis.cs.ox.ac.uk.

preprint2022arXiv

Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition

We propose a novel multi-task pre-training method for Speech Emotion Recognition (SER). We pre-train SER model simultaneously on Automatic Speech Recognition (ASR) and sentiment classification tasks to make the acoustic ASR model more ``emotion aware''. We generate targets for the sentiment classification using text-to-sentiment model trained on publicly available data. Finally, we fine-tune the acoustic ASR on emotion annotated speech data. We evaluated the proposed approach on the MSP-Podcast dataset, where we achieved the best reported concordance correlation coefficient (CCC) of 0.41 for valence prediction.

preprint2022arXiv

Stack operation of tensor networks

The tensor network, as a facterization of tensors, aims at performing the operations that are common for normal tensors, such as addition, contraction and stacking. However, due to its non-unique network structure, only the tensor network contraction is so far well defined. In this paper, we propose a mathematically rigorous definition for the tensor network stack approach, that compress a large amount of tensor networks into a single one without changing their structures and configurations. We illustrate the main ideas with the matrix product states based machine learning as an example. Our results are compared with the for loop and the efficient coding method on both CPU and GPU.

preprint2022arXiv

Stochastic Gradient-based Fast Distributed Multi-Energy Management for an Industrial Park with Temporally-Coupled Constraints

Contemporary industrial parks are challenged by the growing concerns about high cost and low efficiency of energy supply. Moreover, in the case of uncertain supply/demand, how to mobilize delay-tolerant elastic loads and compensate real-time inelastic loads to match multi-energy generation/storage and minimize energy cost is a key issue. Since energy management is hardly to be implemented offline without knowing statistical information of random variables, this paper presents a systematic online energy cost minimization framework to fulfill the complementary utilization of multi-energy with time-varying generation, demand and price. Specifically to achieve charging/discharging constraints due to storage and short-term energy balancing, a fast distributed algorithm based on stochastic gradient with two-timescale implementation is proposed to ensure online implementation. To reduce the peak loads, an incentive mechanism is implemented by estimating users' willingness to shift. Analytical results on parameter setting are also given to guarantee feasibility and optimality of the proposed design. Numerical results show that when the bid-ask spread of electricity is small enough, the proposed algorithm can achieve the close-to-optimal cost asymptotically.

preprint2022arXiv

The quasi-Keplerian motion in the regular Bardeen spacetime

The second post-Newtonian solution for the quasi-Keplerian motion of a test particle in the gravitational field of regular Bardeen black hole is derived. The solution is formulated in terms of the test particle's orbital energy and angular momentum, as well as the mass and magnetic charge of the Bardeen black hole. The leading effects of the magnetic charge on the test particle's orbit and motion including perihelion precession are displayed explicitly. In particular, it is shown that to the second post-Newtonian order the magnetic charge does not affect the test particle's orbital period.

preprint2022arXiv

The second post-Newtonian motion in Reissner-Nordström spacetime

We derived the second post-Newtonian solution for the quasi-Keplerian motion of a charged test particle in the Reissner-Nordström spacetime under the harmonic coordinates. The solution is formulated in terms of the test particle's orbital energy and angular momentum, both of which are constants at the second post-Newtonian order. The charge effects on the test particle's motion including the orbital period and perihelion precession are displayed explicitly. Our results can be applied to the cases in which the test particle has small charge-to-mass ratio, or the test particle has arbitrary charge-to-mass ratio but the multiplication of the test particle and the gravitational source's charge-to-mass-ratios is much smaller than 1.

preprint2022arXiv

Towards Robust Off-policy Learning for Runtime Uncertainty

Off-policy learning plays a pivotal role in optimizing and evaluating policies prior to the online deployment. However, during the real-time serving, we observe varieties of interventions and constraints that cause inconsistency between the online and offline settings, which we summarize and term as runtime uncertainty. Such uncertainty cannot be learned from the logged data due to its abnormality and rareness nature. To assert a certain level of robustness, we perturb the off-policy estimators along an adversarial direction in view of the runtime uncertainty. It allows the resulting estimators to be robust not only to observed but also unexpected runtime uncertainties. Leveraging this idea, we bring runtime-uncertainty robustness to three major off-policy learning methods: the inverse propensity score method, reward-model method, and doubly robust method. We theoretically justify the robustness of our methods to runtime uncertainty, and demonstrate their effectiveness using both the simulation and the real-world online experiments.

preprint2021arXiv

Adversarial example generation with AdaBelief Optimizer and Crop Invariance

Deep neural networks are vulnerable to adversarial examples, which are crafted by applying small, human-imperceptible perturbations on the original images, so as to mislead deep neural networks to output inaccurate predictions. Adversarial attacks can thus be an important method to evaluate and select robust models in safety-critical applications. However, under the challenging black-box setting, most existing adversarial attacks often achieve relatively low success rates on adversarially trained networks and advanced defense models. In this paper, we propose AdaBelief Iterative Fast Gradient Method (ABI-FGM) and Crop-Invariant attack Method (CIM) to improves the transferability of adversarial examples. ABI-FGM and CIM can be readily integrated to build a strong gradient-based attack to further boost the success rates of adversarial examples for black-box attacks. Moreover, our method can also be naturally combined with other gradient-based attack methods to build a more robust attack to generate more transferable adversarial examples against the defense models. Extensive experiments on the ImageNet dataset demonstrate the method's effectiveness. Whether on adversarially trained networks or advanced defense models, our method has higher success rates than state-of-the-art gradient-based attack methods.

preprint2021arXiv

Analytic exposition of the graviton modes in fractional quantum Hall effects and its physical implications

Neutral excitations in a fractional quantum Hall droplet define the incompressibility gap of the topological phase. In this work, we derived a set of analytical results for the energy gap of the graviton modes with two-body and three-body Hamiltonians in both the long-wavelength and thermodynamic limit. These allow us to construct model Hamiltonians for the graviton modes in different FQH phases, and to elucidate a hierarchical structure of conformal Hilbert spaces (nullspaces of model Hamiltonians) with respect to the graviton modes and their corresponding ground states. Using the analytical tools developed, we perform numerical analysis with a particular focus on the Laughlin $ν= 1/5$ and the Gaffnian $ν= 2/5$ phases. Our calculation shows that for gapped phases, low-lying neutral excitations can undergo a "phase transition" even when the ground state is invariant. We discuss the compressibility of the Gaffnian phase, the possibility of multiple graviton modes, and the transition from the graviton modes to the "hollow-core" modes, as well as their experimental consequences.

preprint2021arXiv

ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems

In this paper, a multilingual end-to-end framework, called as ATCSpeechNet, is proposed to tackle the issue of translating communication speech into human-readable text in air traffic control (ATC) systems. In the proposed framework, we focus on integrating the multilingual automatic speech recognition (ASR) into one model, in which an end-to-end paradigm is developed to convert speech waveform into text directly, without any feature engineering or lexicon. In order to make up for the deficiency of the handcrafted feature engineering caused by ATC challenges, a speech representation learning (SRL) network is proposed to capture robust and discriminative speech representations from the raw wave. The self-supervised training strategy is adopted to optimize the SRL network from unlabeled data, and further to predict the speech features, i.e., wave-to-feature. An end-to-end architecture is improved to complete the ASR task, in which a grapheme-based modeling unit is applied to address the multilingual ASR issue. Facing the problem of small transcribed samples in the ATC domain, an unsupervised approach with mask prediction is applied to pre-train the backbone network of the ASR model on unlabeled data by a feature-to-feature process. Finally, by integrating the SRL with ASR, an end-to-end multilingual ASR framework is formulated in a supervised manner, which is able to translate the raw wave into text in one model, i.e., wave-to-text. Experimental results on the ATCSpeech corpus demonstrate that the proposed approach achieves a high performance with a very small labeled corpus and less resource consumption, only 4.20% label error rate on the 58-hour transcribed corpus. Compared to the baseline model, the proposed approach obtains over 100% relative performance improvement which can be further enhanced with the increasing of the size of the transcribed samples.

preprint2021arXiv

Contrastive Unsupervised Learning for Speech Emotion Recognition

Speech emotion recognition (SER) is a key technology to enable more natural human-machine communication. However, SER has long suffered from a lack of public large-scale labeled datasets. To circumvent this problem, we investigate how unsupervised representation learning on unlabeled datasets can benefit SER. We show that the contrastive predictive coding (CPC) method can learn salient representations from unlabeled datasets, which improves emotion recognition performance. In our experiments, this method achieved state-of-the-art concordance correlation coefficient (CCC) performance for all emotion primitives (activation, valence, and dominance) on IEMOCAP. Additionally, on the MSP- Podcast dataset, our method obtained considerable performance improvements compared to baselines.

preprint2021arXiv

D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential Analysis

Static analysis tools are widely used for vulnerability detection as they understand programs with complex behavior and millions of lines of code. Despite their popularity, static analysis tools are known to generate an excess of false positives. The recent ability of Machine Learning models to understand programming languages opens new possibilities when applied to static analysis. However, existing datasets to train models for vulnerability identification suffer from multiple limitations such as limited bug context, limited size, and synthetic and unrealistic source code. We propose D2A, a differential analysis based approach to label issues reported by static analysis tools. The D2A dataset is built by analyzing version pairs from multiple open source projects. From each project, we select bug fixing commits and we run static analysis on the versions before and after such commits. If some issues detected in a before-commit version disappear in the corresponding after-commit version, they are very likely to be real bugs that got fixed by the commit. We use D2A to generate a large labeled dataset to train models for vulnerability identification. We show that the dataset can be used to build a classifier to identify possible false alarms among the issues reported by static analysis, hence helping developers prioritize and investigate potential true positives first.

preprint2021arXiv

DEFT: Distilling Entangled Factors by Preventing Information Diffusion

Disentanglement is a highly desirable property of representation owing to its similarity to human understanding and reasoning. Many works achieve disentanglement upon information bottlenecks (IB). Despite their elegant mathematical foundations, the IB branch usually exhibits lower performance. In order to provide an insight into the problem, we develop an annealing test to calculate the information freezing point (IFP), which is a transition state to freeze information into the latent variables. We also explore these clues or inductive biases for separating the entangled factors according to the differences in the IFP distributions. We found the existing approaches suffer from the information diffusion problem, according to which the increased information diffuses in all latent variables. Based on this insight, we propose a novel disentanglement framework, termed the distilling entangled factor (DEFT), to address the information diffusion problem by scaling backward information. DEFT applies a multistage training strategy, including multigroup encoders with different learning rates and piecewise disentanglement pressure, to disentangle the factors stage by stage. We evaluate DEFT on three variants of dSprite and SmallNORB, which show low-variance and high-level disentanglement scores. Furthermore, the experiment under the correlative factors shows incapable of TC-based approaches. DEFT also exhibits a competitive performance in the unsupervised setting.

preprint2021arXiv

Elementary Excitations in Fractional Quantum Hall Effect from Classical Constraints

Classical constraints on the reduced density matrix of quantum fluids in a single Landau level, termed as local exclusion conditions (LECs) [B. Yang, arXiv:1901.00047], have recently been shown to characterize the ground state of many FQH phases. In this work, we extend the LEC construction to build the elementary excitations, namely quasiholes and quasielectrons, of these FQH phases. In particular, we elucidate the quasihole counting, categorize various types of quasielectrons, and construct their microscopic wave functions. Our extensive numerical calculations indicate that the undressed quasielectron excitations of the Laughlin state obtained from LECs are topologically equivalent to those obtained from the composite fermion theory. Intriguingly, the LEC construction unveils interesting connections between different FQH phases and offers a novel perspective on exotic states such as the Gaffnian and the Fibonacci state.

preprint2021arXiv

General rogue waves in the three-wave resonant interaction systems

General rogue waves in (1+1)-dimensional three-wave resonant interaction systems are derived by the bilinear method. These solutions are divided into three families, which correspond to a simple root, two simple roots and a double root of a certain quartic equation arising from the dimension reduction respectively. It is shown that while the first family of solutions associated with a simple root exist for all signs of the nonlinear coefficients in the three-wave interaction equations, the other two families of solutions associated with two simple roots and a double root can only exist in the so-called soliton-exchange case, where the nonlinear coefficients have certain signs. Many of these rogue wave solutions, such as those associated with two simple roots, and higher-order solutions associated with a simple root, are new solutions which have not been reported before. Technically, our bilinear derivation of rogue waves for the double-root case is achieved by a generalization to the previous dimension reduction procedure in the bilinear method, and this generalized procedure allows us to treat roots of arbitrary multiplicities. Dynamics of the derived rogue waves is also examined, and new rogue-wave patterns are presented. Connection between these bilinear rogue waves and those derived earlier by Darboux transformation is also explained.

preprint2021arXiv

HSR: Hyperbolic Social Recommender

With the prevalence of online social media, users' social connections have been widely studied and utilized to enhance the performance of recommender systems. In this paper, we explore the use of hyperbolic geometry for social recommendation. We present Hyperbolic Social Recommender (HSR), a novel social recommendation framework that utilizes hyperbolic geometry to boost the performance. With the help of hyperbolic spaces, HSR can learn high-quality user and item representations for better modeling user-item interaction and user-user social relations. Via a series of extensive experiments, we show that our proposed HSR outperforms its Euclidean counterpart and state-of-the-art social recommenders in click-through rate prediction and top-K recommendation, demonstrating the effectiveness of social recommendation in the hyperbolic space.

preprint2021arXiv

Improving speech recognition models with small samples for air traffic control systems

In the domain of air traffic control (ATC) systems, efforts to train a practical automatic speech recognition (ASR) model always faces the problem of small training samples since the collection and annotation of speech samples are expert- and domain-dependent task. In this work, a novel training approach based on pretraining and transfer learning is proposed to address this issue, and an improved end-to-end deep learning model is developed to address the specific challenges of ASR in the ATC domain. An unsupervised pretraining strategy is first proposed to learn speech representations from unlabeled samples for a certain dataset. Specifically, a masking strategy is applied to improve the diversity of the sample without losing their general patterns. Subsequently, transfer learning is applied to fine-tune a pretrained or other optimized baseline models to finally achieves the supervised ASR task. By virtue of the common terminology used in the ATC domain, the transfer learning task can be regarded as a sub-domain adaption task, in which the transferred model is optimized using a joint corpus consisting of baseline samples and new transcribed samples from the target dataset. This joint corpus construction strategy enriches the size and diversity of the training samples, which is important for addressing the issue of the small transcribed corpus. In addition, speed perturbation is applied to augment the new transcribed samples to further improve the quality of the speech corpus. Three real ATC datasets are used to validate the proposed ASR model and training strategies. The experimental results demonstrate that the ASR performance is significantly improved on all three datasets, with an absolute character error rate only one-third of that achieved through the supervised training. The applicability of the proposed strategies to other ASR approaches is also validated.

preprint2021arXiv

Intelligent Spectrum Learning for Wireless Networks with Reconfigurable Intelligent Surfaces

Reconfigurable intelligent surface (RIS) has become a promising technology for enhancing the reliability of wireless communications, which is capable of reflecting the desired signals through appropriate phase shifts. However, the intended signals that impinge upon an RIS are often mixed with interfering signals, which are usually dynamic and unknown. In particular, the received signal-to-interference-plus-noise ratio (SINR) may be degraded by the signals reflected from the RISs that originate from non-intended users. To tackle this issue, we introduce the concept of intelligent spectrum learning (ISL), which uses an appropriately trained convolutional neural network (CNN) at the RIS controller to help the RISs infer the interfering signals directly from the incident signals. By capitalizing on the ISL, a distributed control algorithm is proposed to maximize the received SINR by dynamically configuring the active/inactive binary status of the RIS elements. Simulation results validate the performance improvement offered by deep learning and demonstrate the superiority of the proposed ISL-aided approach.

preprint2021arXiv

Pattern transformation in higher-order lumps of the Kadomtsev-Petviashvili I equation

Pattern formation in higher-order lumps of the Kadomtsev-Petviashvili I equation at large time is analytically studied. For a broad class of these higher-order lumps, we show that two types of solution patterns appear at large time. The first type of patterns comprise fundamental lumps arranged in triangular shapes, which are described analytically by root structures of the Yablonskii-Vorob'ev polynomials. As time evolves from large negative to large positive, this triangular pattern reverses itself along the x-direction. The second type of patterns comprise fundamental lumps arranged in non-triangular shapes in the outer region, which are described analytically by nonzero-root structures of the Wronskian-Hermit polynomials, together with possible fundamental lumps arranged in triangular shapes in the inner region, which are described analytically by root structures of the Yablonskii-Vorob'ev polynomials. When time evolves from large negative to large positive, the non-triangular pattern in the outer region switches its x and y directions, while the triangular pattern in the inner region, if it arises, reverses its direction along the x-axis. Our predicted patterns at large time are compared to true solutions, and excellent agreement is observed.

preprint2021arXiv

Random Transformation of Image Brightness for Adversarial Attack

Deep neural networks are vulnerable to adversarial examples, which are crafted by adding small, human-imperceptible perturbations to the original images, but make the model output inaccurate predictions. Before deep neural networks are deployed, adversarial attacks can thus be an important method to evaluate and select robust models in safety-critical applications. However, under the challenging black-box setting, the attack success rate, i.e., the transferability of adversarial examples, still needs to be improved. Based on image augmentation methods, we found that random transformation of image brightness can eliminate overfitting in the generation of adversarial examples and improve their transferability. To this end, we propose an adversarial example generation method based on this phenomenon, which can be integrated with Fast Gradient Sign Method (FGSM)-related methods to build a more robust gradient-based attack and generate adversarial examples with better transferability. Extensive experiments on the ImageNet dataset demonstrate the method's effectiveness. Whether on normally or adversarially trained networks, our method has a higher success rate for black-box attacks than other attack methods based on data augmentation. We hope that this method can help to evaluate and improve the robustness of models.

preprint2021arXiv

Rogue wave patterns in the nonlinear Schrödinger equation

Rogue wave patterns in the nonlinear Schrödinger equation are analytically studied. It is shown that when an internal parameter in the rogue waves (which controls the shape of initial weak perturbations to the uniform background) is large, these waves would exhibit clear geometric structures, which are formed by Peregrine waves in shapes such as triangle, pentagon, heptagon and nonagon, with a possible lower-order rogue wave at its center. These rogue patterns are analytically determined by the root structures of the Yablonskii-Vorob'ev polynomial hierarchy, and their orientations are controlled by the phase of the large parameter. It is also shown that when multiple internal parameters in the rogue waves are large but satisfy certain constraints, similar rogue patterns would still hold. Comparison between true rogue patterns and our analytical predictions shows excellent agreement.

preprint2021arXiv

Rogue Waves in (2+1)-Dimensional Three-Wave Resonant Interactions

Rogue waves in (2+1)-dimensional three-wave resonant interactions are studied. General rogue waves arising from a constant background, from a lump-soliton background and from a dark-soliton background have been derived, and their dynamics illustrated. For rogue waves arising from a constant background, fundamental rogue waves are line-shaped, and multi-rogue waves exhibit multiple intersecting lines. Higher-order rogue waves could also be line-shaped, but they exhibit multiple parallel lines. For rogue waves arising from a lump-soliton background, they could exhibit distinctive patterns due to their interaction with the lump soliton. For rogue waves arising from a dark-soliton background, their intensity pattern could feature half-line shapes or lump shapes, which are very novel.

preprint2020arXiv

3D Human Pose Estimation using Spatio-Temporal Networks with Explicit Occlusion Training

Estimating 3D poses from a monocular video is still a challenging task, despite the significant progress that has been made in recent years. Generally, the performance of existing methods drops when the target person is too small/large, or the motion is too fast/slow relative to the scale and speed of the training data. Moreover, to our knowledge, many of these methods are not designed or trained under severe occlusion explicitly, making their performance on handling occlusion compromised. Addressing these problems, we introduce a spatio-temporal network for robust 3D human pose estimation. As humans in videos may appear in different scales and have various motion speeds, we apply multi-scale spatial features for 2D joints or keypoints prediction in each individual frame, and multi-stride temporal convolutional net-works (TCNs) to estimate 3D joints or keypoints. Furthermore, we design a spatio-temporal discriminator based on body structures as well as limb motions to assess whether the predicted pose forms a valid pose and a valid movement. During training, we explicitly mask out some keypoints to simulate various occlusion cases, from minor to severe occlusion, so that our network can learn better and becomes robust to various degrees of occlusion. As there are limited 3D ground-truth data, we further utilize 2D video data to inject a semi-supervised learning capability to our network. Experiments on public datasets validate the effectiveness of our method, and our ablation studies show the strengths of our networkś individual submodules.

preprint2020arXiv

Analysis of Truck Driver Behavior to Design Different Lane Change Styles in Automated Driving

Lane change is a very demanding driving task and number of traffic accidents are induced by mistaken maneuvers. An automated lane change system has the potential to reduce driver workload and to improve driving safety. One challenge is how to improve driver acceptance on the automated system. From the viewpoint of human factors, an automated system with different styles would improve user acceptance as the drivers can adapt the style to different driving situations. This paper proposes a method to design different lane change styles in automated driving by analysis and modeling of truck driver behavior. A truck driving simulator experiment with 12 participants was conducted to identify the driver model parameters and three lane change styles were classified as the aggressive, medium, and conservative ones. The proposed automated lane change system was evaluated by another truck driving simulator experiment with the same 12 participants. Moreover, the effect of different driving styles on driver experience and acceptance was evaluated. The evaluation results demonstrate that the different lane change styles could be distinguished by the drivers; meanwhile, the three styles were overall evaluated as acceptable on safety issues and reliable by the human drivers. This study provides insight into designing the automated driving system with different driving styles and the findings can be applied to commercial automated trucks.

preprint2020arXiv

Combining quantum spin hall effect and superconductivity in few-layer stanene

Stanene was proposed to be a quantum spin hall insulator containing topological edges states and a time reversal invariant topological superconductor hosting helical Majorana edge mode. Recently, experimental evidences of existence of topological edge states have been found in monolayer stanene films and superconductivity has been observed in few-layer stanene films excluding single layer. An integrated system with both topological edge states and superconductivity are higly pursued as a possible platform to realize topological superconductivity. Few-layer stanene show great potential to meet this requirement and is highly desired in experiment. Here we successfully grow few-layer stanene on bismuth (111) substrate. Both topological edge states and superconducting gaps are observed by in-situ scanning tunneling microscopy/spectroscopy (STM/STS). Our results take a further step towards topological superconductivity by stanene films.

preprint2020arXiv

Computation Offloading in Multi-Access Edge Computing Networks: A Multi-Task Learning Approach

Multi-access edge computing (MEC) has already shown the potential in enabling mobile devices to bear the computation-intensive applications by offloading some tasks to a nearby access point (AP) integrated with a MEC server (MES). However, due to the varying network conditions and limited computation resources of the MES, the offloading decisions taken by a mobile device and the computational resources allocated by the MES may not be efficiently achieved with the lowest cost. In this paper, we propose a dynamic offloading framework for the MEC network, in which the uplink non-orthogonal multiple access (NOMA) is used to enable multiple devices to upload their tasks via the same frequency band. We formulate the offloading decision problem as a multiclass classification problem and formulate the MES computational resource allocation problem as a regression problem. Then a multi-task learning based feedforward neural network (MTFNN) model is designed to jointly optimize the offloading decision and computational resource allocation. Numerical results illustrate that the proposed MTFNN outperforms the conventional optimization method in terms of inference accuracy and computation complexity.

preprint2020arXiv

DeepPCO: End-to-End Point Cloud Odometry through Deep Parallel Neural Network

Odometry is of key importance for localization in the absence of a map. There is considerable work in the area of visual odometry (VO), and recent advances in deep learning have brought novel approaches to VO, which directly learn salient features from raw images. These learning-based approaches have led to more accurate and robust VO systems. However, they have not been well applied to point cloud data yet. In this work, we investigate how to exploit deep learning to estimate point cloud odometry (PCO), which may serve as a critical component in point cloud-based downstream tasks or learning-based systems. Specifically, we propose a novel end-to-end deep parallel neural network called DeepPCO, which can estimate the 6-DOF poses using consecutive point clouds. It consists of two parallel sub-networks to estimate 3-D translation and orientation respectively rather than a single neural network. We validate our approach on KITTI Visual Odometry/SLAM benchmark dataset with different baselines. Experiments demonstrate that the proposed approach achieves good performance in terms of pose accuracy.

preprint2020arXiv

Double-Layer Game Based Wireless Charging Scheduling for Electric Vehicles

Wireless charging technology provides a solution to the insufficient battery life of electric vehicles (EVs). However, the conflict of interests between wireless charging lanes (WCLs) and EVs is difficult to resolve. In the day-ahead electricity market, considering the revenue of WCLs caused by the deviation between actual electricity sales and pre-purchased electricity, as well as endurance and traveling experience of EVs, this paper proposes a charging scheduling algorithm based on a double-layer game model. In lower layer, the potential game is used to model the multi-vehicle game of vehicle charging planning. A shortest path algorithm based on the three-way greedy strategy is designed to solve in dynamic charging sequence problem, and the improved particle swarm optimization algorithm are used to solve the variable ordered potential game. In the upper layer, the reverse Stackelberg game is adopted to harmonize the cost of wireless charging lanes and electric vehicles. As the leader, WCLs stimulate EVs to carry out reasonable charing action by electricity price regulation. As the follower, EVs make the best charging decisions for a given electricity price. An iteration algorithm is designed to ensure the Nash equilibrium convergence of this game. The simulation results show that the double-layer game model proposed in this paper can effectively suppress the deviation between the actual electricity sales and the pre-sale of the charging lane caused by the disorderly charging behavior of the vehicle, and ensure the high endurance and traveling experience of EVs.

preprint2020arXiv

Dynamics of descending knots in a solar prominence and their possible contributions to the heating of the local corona

The knots in solar prominences are often observed to fall with nearly constant velocity, but the associated physical mechanism is currently not well understood. In this letter, we presented a prominence observed by New Vacuum Solar Telescope (NVST) in H-alpha wavelength. Knots that rose within the prominence appear to have been preferentially located at higher altitude, whereas those that fell were found throughout the entire prominence structure. The descending speed of the knots near the solar surface was higher than that far away from the solar surface. We noted that the knots near the solar surface may run along a set of coronal loops observed from the Atmospheric Imaging Assembly. Elsewhere, the majority of knots are interpreted to have descended across more horizontal magnetic field with a nearly constant speed. This lack of acceleration indicates that the liberated gravitational potential energy may not manifest as an increase in kinetic energy. Assuming instead that the descending knots were capable of exciting Alfven waves that could then dissipate within the local corona, the gravitational potential energy of the knots may have been converted into thermal energy. Assuming a perfectly elastic system, we therefore estimate that the gravitational energy loss rate of these observed knots amounts to 1/2000 of that required to heat the entire quiet-Sun, increasing to 1/320 when considering possibly further downward motions of the knots having disappeared in the H-alpha observations. This result suggests such a mechanism may contribute to the heating of the corona local to these prominences.

preprint2020arXiv

Energy Trading in Microgrids for Synergies among Electricity, Hydrogen and Heat Networks

The emerging paradigm of interconnected microgrids advocates energy trading or sharing among multiple microgrids. It helps make full use of the temporal availability of energy and diversity in operational costs when meeting various energy loads. However, energy trading might not completely absorb excess renewable energy. A multi-energy management framework including fuel cell vehicles, energy storage, combined heat and power system, and renewable energy is proposed, and the characteristics and scheduling arrangements of fuel cell vehicles are considered to further improve the local absorption of the renewable energy and enhance the economic benefits of microgrids. While intensive research has been conducted on energy scheduling and trading problem, a fundamental question still remains unanswered on microgrid economics. Namely, due to multi-energy coupling, stochastic renewable energy generation and demands, when and how a microgrid should schedule and trade energy with others, which maximizes its long-term benefit. This paper designs a joint energy scheduling and trading algorithm based on Lyapunov optimization and a double-auction mechanism. Its purpose is to determine the valuations of energy in the auction, optimally schedule energy distribution, and strategically purchase and sell energy with the current electricity prices. Simulations based on real data show that each individual microgrid, under the management of the proposed algorithm, can achieve a time-averaged profit that is arbitrarily close to an optimum value, while avoiding compromising its own comfort.

preprint2020arXiv

Fermi surface reconstruction and electron dynamics at the charge-density-wave transition in TiSe2

The evolution of the charge carrier concentrations and mobilities are examined across the charge-density-wave (CDW) transition in TiSe2. Combined quantum oscillation and magnetotransport measurements show that a small electron pocket dominates the electronic properties at low temperatures whilst an electron and hole pocket contribute at room temperature. At the CDW transition, an abrupt Fermi surface reconstruction and a minimum in the electron and hole mobilities are extracted from two-band and Kohler analysis of magnetotransport measurements. The minimum in the mobilities is associated with the overseen role of scattering from the softening CDW mode. With the carrier concentrations and dynamics dominated by the CDW and the associated bosonic mode, our results highlight TiSe2 as a prototypical system to study the Fermi surface reconstruction at a density-wave transition.

preprint2020arXiv

Geom-GCN: Geometric Graph Convolutional Networks

Message-passing neural networks (MPNNs) have been successfully applied to representation learning on graphs in a variety of real-world applications. However, two fundamental weaknesses of MPNNs' aggregators limit their ability to represent graph-structured data: losing the structural information of nodes in neighborhoods and lacking the ability to capture long-range dependencies in disassortative graphs. Few studies have noticed the weaknesses from different perspectives. From the observations on classical neural network and network geometry, we propose a novel geometric aggregation scheme for graph neural networks to overcome the two weaknesses. The behind basic idea is the aggregation on a graph can benefit from a continuous space underlying the graph. The proposed aggregation scheme is permutation-invariant and consists of three modules, node embedding, structural neighborhood, and bi-level aggregation. We also present an implementation of the scheme in graph convolutional networks, termed Geom-GCN (Geometric Graph Convolutional Networks), to perform transductive learning on graphs. Experimental results show the proposed Geom-GCN achieved state-of-the-art performance on a wide range of open datasets of graphs. Code is available at https://github.com/graphdml-uiuc-jlu/geom-gcn.

preprint2020arXiv

GRATE: Granular Recovery of Aggregated Tensor Data by Example

In this paper, we address the challenge of recovering an accurate breakdown of aggregated tensor data using disaggregation examples. This problem is motivated by several applications. For example, given the breakdown of energy consumption at some homes, how can we disaggregate the total energy consumed during the same period at other homes? In order to address this challenge, we propose GRATE, a principled method that turns the ill-posed task at hand into a constrained tensor factorization problem. Then, this optimization problem is tackled using an alternating least-squares algorithm. GRATE has the ability to handle exact aggregated data as well as inexact aggregation where some unobserved quantities contribute to the aggregated data. Special emphasis is given to the energy disaggregation problem where the goal is to provide energy breakdown for consumers from their monthly aggregated consumption. Experiments on two real datasets show the efficacy of GRATE in recovering more accurate disaggregation than state-of-the-art energy disaggregation methods.

preprint2020arXiv

Green Resource Allocation and Energy Management in Heterogeneous Small Cell Networks Powered by Hybrid Energy

In heterogeneous networks (HetNets), how to improve spectrum efficiency is a crucial issue. Meanwhile increased energy consumption inspires network operators to deploy renewable energy sources as assistance to traditional electricity. Based on above aspects, we allow base stations (BSs) to share their licensed spectrum resource with each other and adjust transmission power to adapt to the renewable energy level. Considering the sharing fairness among BSs, we formulate a multi-person bargaining problem as a stochastic optimization problem. We divide the optimization problem into three parts: data rate control, resource allocation and energy management. An online dynamic control algorithm is proposed to control admission rate and resource allocation to maximize the transmission and sharing profits with the least grid energy consumption. Simulation results investigate the time-varying data control and energy management of BSs and demonstrate the effectiveness of the proposed scheme.

preprint2020arXiv

Joint Optimization of the Deployment and Resource Allocation of UAVs in Vehicular Edge Computing and Networks

With the development of smart vehicles, computing-intensive tasks are widely and rapidly generated. To alleviate the burden of on-board CPU, connected vehicles can offload tasks to or make request from nearby edge server thanks to the emerging Mobile Edge Computing (MEC). However, such approach may sharply increase the workload of an edge server, and cause network congestion, especially in rural and mountain areas where there are few edge servers. To this end, a UAV-assisted MEC system is proposed in this paper, and joint optimization algorithm of the deployment and resource allocation of UAVs (JOAoDR) is proposed to decide the location and balance the resource and rewards of the UAVs. We solve a long-term profit maximization problem in terms of the operator. Numerical results demonstrated that our algorithm outperforms other benchmarks algorithm, and validated our solution.

preprint2020arXiv

Lessons Learned from Accident of Autonomous Vehicle Testing: An Edge Learning-aided Offloading Framework

This letter proposes an edge learning-based offloading framework for autonomous driving, where the deep learning tasks can be offloaded to the edge server to improve the inference accuracy while meeting the latency constraint. Since the delay and the inference accuracy are incurred by wireless communications and computing, an optimization problem is formulated to maximize the inference accuracy subject to the offloading probability, the pre-braking probability, and data quality. Simulations demonstrate the superiority of the proposed offloading framework.

preprint2020arXiv

Microscopic theory for nematic fractional quantum Hall effect

We analyse various microscopic properties of the nematic fractional quantum Hall effect (FQHN) in the thermodynamic limit, and present necessary conditions required of the microscopic Hamiltonians for the nematic FQHE to be robust. Analytical expressions for the degenerate ground state manifold, ground state energies, and gapless nematic modes are given in compact forms with the input interaction and the corresponding ground state structure factors. We relate the long wavelength limit of the neutral excitations to the guiding center metric deformation, and show explicitly the family of trial wavefunctions for the nematic modes with spatially varying nematic order near the quantum critical point. For short range interactions, the dynamics of the FQHN is completely determined by the long wavelength part of the ground state structure factor. The special case of the FQHN at $ν=1/3$ is discussed with new theoretical insights from the Haffnian parent Hamiltonian, leading to a number of rigorous statements and experimental implications.

preprint2020arXiv

Offloading Optimization in Edge Computing for Deep Learning Enabled Target Tracking by Internet-of-UAVs

The empowering unmanned aerial vehicles (UAVs) have been extensively used in providing intelligence such as target tracking. In our field experiments, a pre-trained convolutional neural network (CNN) is deployed at the UAV to identify a target (a vehicle) from the captured video frames and enable the UAV to keep tracking. However, this kind of visual target tracking demands a lot of computational resources due to the desired high inference accuracy and stringent delay requirement. This motivates us to consider offloading this type of deep learning (DL) tasks to a mobile edge computing (MEC) server due to limited computational resource and energy budget of the UAV, and further improve the inference accuracy. Specifically, we propose a novel hierarchical DL tasks distribution framework, where the UAV is embedded with lower layers of the pre-trained CNN model, while the MEC server with rich computing resources will handle the higher layers of the CNN model. An optimization problem is formulated to minimize the weighted-sum cost including the tracking delay and energy consumption introduced by communication and computing of the UAVs, while taking into account the quality of data (e.g., video frames) input to the DL model and the inference errors. Analytical results are obtained and insights are provided to understand the tradeoff between the weighted-sum cost and inference error rate in the proposed framework. Numerical results demonstrate the effectiveness of the proposed offloading framework.

preprint2020arXiv

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

We study the problem of efficient semantic segmentation for large-scale 3D point clouds. By relying on expensive sampling techniques or computationally heavy pre/post-processing steps, most existing approaches are only able to be trained and operate over small-scale point clouds. In this paper, we introduce RandLA-Net, an efficient and lightweight neural architecture to directly infer per-point semantics for large-scale point clouds. The key to our approach is to use random point sampling instead of more complex point selection approaches. Although remarkably computation and memory efficient, random sampling can discard key features by chance. To overcome this, we introduce a novel local feature aggregation module to progressively increase the receptive field for each 3D point, thereby effectively preserving geometric details. Extensive experiments show that our RandLA-Net can process 1 million points in a single pass with up to 200X faster than existing approaches. Moreover, our RandLA-Net clearly surpasses state-of-the-art approaches for semantic segmentation on two large-scale benchmarks Semantic3D and SemanticKITTI.

preprint2020arXiv

Universal patterns of rogue waves

Rogue wave patterns in the nonlinear Schrödinger (NLS) equation and the derivative NLS equation are analytically studied. It is shown that when the free parameters in the analytical expressions of these rogue waves are large, these waves would exhibit the same patterns, comprising fundamental rogue waves forming clear geometric structures such as triangle, pentagon, heptagon and nonagon, with a possible lower-order rogue wave at its center. These rogue patterns are analytically determined by the root structures of the Yablonskii-Vorob'ev polynomial hierarchy, and their orientations are controlled by the phase of the large free parameter. This connection of rogue wave patterns to the root structures of the Yablonskii-Vorob'ev polynomial hierarchy goes beyond the NLS and derivative NLS equations, and it gives rise to universal rogue wave patterns in integrable systems.

preprint2019arXiv

ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment

Automatic Speech Recognition (ASR) is greatly developed in recent years, which expedites many applications on other fields. For the ASR research, speech corpus is always an essential foundation, especially for the vertical industry, such as Air Traffic Control (ATC). There are some speech corpora for common applications, public or paid. However, for the ATC, it is difficult to collect raw speeches from real systems due to safety issues. More importantly, for a supervised learning task like ASR, annotating the transcription is a more laborious work, which hugely restricts the prospect of ASR application. In this paper, a multilingual speech corpus (ATCSpeech) from real ATC systems, including accented Mandarin Chinese and English, is built and released to encourage the non-commercial ASR research in ATC domain. The corpus is detailly introduced from the perspective of data amount, speaker gender and role, speech quality and other attributions. In addition, the performance of our baseline ASR models is also reported. A community edition for our speech database can be applied and used under a special contrast. To our best knowledge, this is the first work that aims at building a real and multilingual ASR corpus for the air traffic related research.

preprint2019arXiv

Electron and positron spectra in the three dimensional spatial-dependent propagation model

The spatial-dependent propagation model has been successfully used to explain diverse observational phenomena, including the spectral hardening of cosmic-ray nuclei above $200$ GV, the large-scale dipole anisotropy and the diffusive gamma distribution. In this work, we further apply the spatial-dependent propagation model to both electrons and positrons. To account for the excess of positrons above $10$ GeV, an additional local source is introduced. And we also consider a more realistic spiral distribution of background sources. We find that due to the gradual hardening above $10$ GeV, the hardening of electron spectrum above tens of GeV can be explained in the SDP model and both positron and electron spectra less than TeV energies could be naturally described. The spatial-dependent propagation with spiral-distributed sources could conforms with the total electron spectrum in the whole energy. Meanwhile compared with the conventional model, the spatial-dependent propagation with spiral-distributed sources could produce larger background positron flux, so that the multiplier of background positron flux is $1.42$, which is much smaller than the required value by the conventional model. Thus the shortage of background positron flux could be solved. Furthermore we compute the anisotropy of electron under spatial-dependent propagation model, which is well below the observational limit of Fermi-LAT experiment.

preprint2019arXiv

General rogue waves in the Boussinesq equation

We derive general rogue wave solutions of arbitrary orders in the Boussinesq equation by the bilinear Kadomtsev-Petviashvili (KP) reduction method. These rogue solutions are given as Gram determinants with $2N-2$ free irreducible real parameters, where $N$ is the order of the rogue wave. Tuning these free parameters, rogue waves of various patterns are obtained, many of which have not been seen before. Compared to rogue waves in other integrable equations, a new feature of rogue waves in the Boussinesq equation is that the rogue wave of maximum amplitude at each order is generally asymmetric in space. On the technical aspect, our contribution to the bilinear KP-reduction method for rogue waves is a new judicious choice of differential operators in the procedure, which drastically simplifies the dimension reduction calculation as well as the analytical expressions of rogue wave solutions.

preprint2019arXiv

Learning Nonlinear Mixtures: Identifiability and Algorithm

Linear mixture models have proven very useful in a plethora of applications, e.g., topic modeling, clustering, and source separation. As a critical aspect of the linear mixture models, identifiability of the model parameters is well-studied, under frameworks such as independent component analysis and constrained matrix factorization. Nevertheless, when the linear mixtures are distorted by an unknown nonlinear functions -- which is well-motivated and more realistic in many cases -- the identifiability issues are much less studied. This work proposes an identification criterion for a nonlinear mixture model that is well grounded in many real-world applications, and offers identifiability guarantees. A practical implementation based on a judiciously designed neural network is proposed to realize the criterion, and an effective learning algorithm is proposed. Numerical results on synthetic and real-data corroborate effectiveness of the proposed method.

preprint2019arXiv

Pathwise Optimization for Merchant Energy Production

We study merchant energy production modeled as a compound switching and timing option. The resulting Markov decision process is intractable. State-of-the-art approximate dynamic programming methods applied to realistic instances of this model yield policies with large optimality gaps that are attributed to a weak upper (dual) bound on the optimal policy value. We extend pathwise optimization from stopping models to merchant energy production to investigate this issue. We apply principal component analysis and block coordinate descent in novel ways to respectively precondition and solve the ensuing ill conditioned and large scale linear program, which even a cutting-edge commercial solver is unable to handle directly. Compared to standard methods, our approach leads to substantially tighter dual bounds and smaller optimality gaps at the expense of considerably larger computational effort. Specifically, we provide numerical evidence for the near optimality of the operating policies based on least squares Monte Carlo and compute slightly better ones using our approach on a set of existing benchmark ethanol production instances. These findings suggest that both these policies are effective for the class of models we investigate. Our research has potential relevance for other commodity merchant operations settings.

preprint2019arXiv

Recurrent Two-Sided Loop Jets Caused by Magnetic Reconnection between Erupting Minifilaments and Nearby Large Filament

Using high spatial and temporal data from the New Vacuum Solar Telescope (NVST) and the Solar Dynamics Observatory (SDO), we present unambiguous observations of recurrent two-sided loop jets caused by magnetic reconnection between erupting minifilaments and nearby large filament. The observations demonstrate that three two-sided loop jets, which ejected along the large filament in opposite directions, had similar appearance and originated from the same region. We find that a minifilament erupted and drove the first jet. It reformed at the same neutral line later, and then underwent partial and total eruptions, drove the second and third jets, respectively. In the course of the jets, cool plasma was injected into the large filament. Furthermore, persistent magnetic flux cancelation occurred at the neutral line under the minifilament before its eruption and continued until the end of the observation. We infer that magnetic flux cancellation may account for building and then triggering the minifilament to erupt to produce the two-sided loop jets. This observation not only indicates that two-sided loop jets can be driven by minifilament eruptions, but also sheds new light on our understanding of the recurrent mechanism of two-sided loop jets.

preprint2019arXiv

Rogue waves in the generalized derivative nonlinear Schrodinger equations

General rogue waves are derived for the generalized derivative nonlinear Schrodinger (GDNLS) equations by a bilinear Kadomtsev-Petviashvili (KP) reduction method. These GDNLS equations contain the Kaup-Newell equation, the Chen-Lee-Liu equation and the Gerdjikov-Ivanov equation as special cases. In this bilinear framework, it is shown that rogue waves to all members of these equations are expressed by the same bilinear solution. Compared to previous bilinear KP reduction methods for rogue waves in other integrable equations, an important improvement in our current KP reduction procedure is a new parameterization of internal parameters in rogue waves. Under this new parameterization, the rogue wave expressions through elementary Schur polynomials are much simpler. In addition, the rogue wave with the highest peak amplitude at each order can be obtained by setting all those internal parameters to zero, and this maximum peak amplitude at order $N$ turns out to be $2N+1$ times the background amplitude, independent of the individual GDNLS equation and the background wavenumber. It is also reported that these GDNLS equations can be decomposed into two different bilinear systems which require different KP reductions, but the resulting rogue waves remain the same. Dynamics of rogue waves in the GDNLS equations is also analyzed. It is shown that the wavenumber of the constant background strongly affects the orientation and duration of the rogue wave. In addition, some new rogue patterns are presented.