Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
38works
0followers
34topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

38 published item(s)

preprint2026arXiv

M\textsuperscript{4}Fuse: Lightweight State-Space MoE with a Cross-Scale Gating Bridge for Brain Tumor Segmentation

Encoder-decoder imbalance and the reliance on large input volumes make many 3D brain tumor segmentation models both compute-heavy and brittle. We present M\textsuperscript{4}Fuse, a lightweight network that prioritizes discriminative brain tumor cues over exhaustive appearance reconstruction. Our method balances encoder and decoder capacity and replaces depth expansion with a synergistic design: it propagates long-range context with linear complexity via a grouped state space mixer, denoises and aligns skip features using a cross-scale dual-stage gating bridge, and absorbs cross-site acquisition shifts with a sample-level mixture-of-experts. On the BraTS2019 and BraTS2021 benchmarks, M\textsuperscript{4}Fuse outperforms other lightweight excellent methods in both parameter count and performance. Even at a challenging input resolution of \(64\times128\times128\) (half that of existing excellent models), M\textsuperscript{4}Fuse reduces parameters by 62.63\% and improves average performance by 0.09\%. Ablations of key components validate the method's exceptional parameter-to-accuracy efficiency and robustness across diverse data centers.

preprint2024arXiv

Point Cloud in the Air

Acquisition and processing of point clouds (PCs) is a crucial enabler for many emerging applications reliant on 3D spatial data, such as robot navigation, autonomous vehicles, and augmented reality. In most scenarios, PCs acquired by remote sensors must be transmitted to an edge server for fusion, segmentation, or inference. Wireless transmission of PCs not only puts on increased burden on the already congested wireless spectrum, but also confronts a unique set of challenges arising from the irregular and unstructured nature of PCs. In this paper, we meticulously delineate these challenges and offer a comprehensive examination of existing solutions while candidly acknowledging their inherent limitations. In response to these intricacies, we proffer four pragmatic solution frameworks, spanning advanced techniques, hybrid schemes, and distributed data aggregation approaches. In doing so, our goal is to chart a path toward efficient, reliable, and low-latency wireless PC transmission.

preprint2023arXiv

Feasibility Analysis of Grover-meets-Simon Algorithm

Quantum algorithm is a key tool for cryptanalysis. At present, people are committed to building powerful quantum algorithms and tapping the potential of quantum algorithms, so as to further analyze the security of cryptographic algorithms under quantum computing. Recombining classical quantum algorithms is one of the current ideas to construct quantum algorithms. However, they cannot be easily combined, the feasibility of quantum algorithms needs further analysis in quantum environment. This paper reanalyzes the existing combined algorithm Grover-meets-Simon algorithm in terms of the principle of deferred measurement. First of all, due to the collapse problem caused by the measurement, we negate the measurement process of Simon's algorithm during the process of the Grover-meets-Simon algorithm. Second, since the output of the unmeasured Simon algorithm is quantum linear systems of equations, we discuss the solution of quantum linear systems of equations and find it feasible to consider the deferred measurement of the parallel Simon algorithm alone. Finally, since the Grover-meets-Simon algorithm involves an iterative problem, we reconsider the feasibility of the algorithm when placing multiple measurements at the end. According to the maximum probability of success and query times, we get that the Grover-meets-Simon algorithm is not an effective attack algorithm when putting the measurement process of the Simon algorithm in the iterative process at the end of Grover-meets-Simon algorithm.

preprint2023arXiv

LostNet: A smart way for lost and find

Due to the enormous population growth of cities in recent years, objects are frequently lost and unclaimed on public transportation, in restaurants, or any other public areas. While services like Find My iPhone can easily identify lost electronic devices, more valuable objects cannot be tracked in an intelligent manner, making it impossible for administrators to reclaim a large number of lost and found items in a timely manner. We present a method that significantly reduces the complexity of searching by comparing previous images of lost and recovered things provided by the owner with photos taken when registered lost and found items are received. In this research, we will primarily design a photo matching network by combining the fine-tuning method of MobileNetv2 with CBAM Attention and using the Internet framework to develop an online lost and found image identification system. Our implementation gets a testing accuracy of 96.8% using only 665.12M GLFOPs and 3.5M training parameters. It can recognize practice images and can be run on a regular laptop.

preprint2023arXiv

Room-Temperature Highly-Tunable Coercivity and Highly-Efficient Nonvolatile Multi-States Magnetization Switching by Small Current in Single 2D Ferromagnet Fe$_3$GaTe$_2$

Room-temperature electrically-tuned coercivity and nonvolatile multi-states magnetization switching is crucial for next-generation low-power 2D spintronics. However, most methods have limited ability to adjust the coercivity of ferromagnetic systems, and room-temperature electrically-driven magnetization switching shows high critical current density and high power dissipation. Here, highly-tunable coercivity and highly-efficient nonvolatile multi-states magnetization switching are achieved at room temperature in single-material based devices by 2D van der Waals itinerant ferromagnet Fe$_3$GaTe$_2$. The coercivity can be readily tuned up to ~98.06% at 300 K by a tiny in-plane electric field that is 2-5 orders of magnitude smaller than that of other ferromagnetic systems. Moreover, the critical current density and power dissipation for room-temperature magnetization switching in 2D Fe$_3$GaTe$_2$ are down to ~1.7E5 A cm$^{-2}$ and ~4E12 W m$^{-3}$, respectively. Such switching power dissipation is 2-6 orders of magnitude lower than that of other 2D ferromagnetic systems. Meanwhile, multi-states magnetization switching are presented by continuously controlling the current, which can dramatically enhance the information storage capacity and develop new computing methodology. This work opens the avenue for room-temperature electrical control of ferromagnetism and potential applications for vdW-integrated 2D spintronics.

preprint2022arXiv

A Transfer Learning and Optimized CNN Based Intrusion Detection System for Internet of Vehicles

Modern vehicles, including autonomous vehicles and connected vehicles, are increasingly connected to the external world, which enables various functionalities and services. However, the improving connectivity also increases the attack surfaces of the Internet of Vehicles (IoV), causing its vulnerabilities to cyber-threats. Due to the lack of authentication and encryption procedures in vehicular networks, Intrusion Detection Systems (IDSs) are essential approaches to protect modern vehicle systems from network attacks. In this paper, a transfer learning and ensemble learning-based IDS is proposed for IoV systems using convolutional neural networks (CNNs) and hyper-parameter optimization techniques. In the experiments, the proposed IDS has demonstrated over 99.25% detection rates and F1-scores on two well-known public benchmark IoV security datasets: the Car-Hacking dataset and the CICIDS2017 dataset. This shows the effectiveness of the proposed IDS for cyber-attack detection in both intra-vehicle and external vehicular networks.

preprint2022arXiv

AUGER: Automatically Generating Review Comments with Pre-training Models

Code review is one of the best practices as a powerful safeguard for software quality. In practice, senior or highly skilled reviewers inspect source code and provide constructive comments, considering what authors may ignore, for example, some special cases. The collaborative validation between contributors results in code being highly qualified and less chance of bugs. However, since personal knowledge is limited and varies, the efficiency and effectiveness of code review practice are worthy of further improvement. In fact, it still takes a colossal and time-consuming effort to deliver useful review comments. This paper explores a synergy of multiple practical review comments to enhance code review and proposes AUGER (AUtomatically GEnerating Review comments): a review comments generator with pre-training models. We first collect empirical review data from 11 notable Java projects and construct a dataset of 10,882 code changes. By leveraging Text-to-Text Transfer Transformer (T5) models, the framework synthesizes valuable knowledge in the training stage and effectively outperforms baselines by 37.38% in ROUGE-L. 29% of our automatic review comments are considered useful according to prior studies. The inference generates just in 20 seconds and is also open to training further. Moreover, the performance also gets improved when thoroughly analyzed in case study.

preprint2022arXiv

Brain Cortical Functional Gradients Predict Cortical Folding Patterns via Attention Mesh Convolution

Since gyri and sulci, two basic anatomical building blocks of cortical folding patterns, were suggested to bear different functional roles, a precise mapping from brain function to gyro-sulcal patterns can provide profound insights into both biological and artificial neural networks. However, there lacks a generic theory and effective computational model so far, due to the highly nonlinear relation between them, huge inter-individual variabilities and a sophisticated description of brain function regions/networks distribution as mosaics, such that spatial patterning of them has not been considered. we adopted brain functional gradients derived from resting-state fMRI to embed the "gradual" change of functional connectivity patterns, and developed a novel attention mesh convolution model to predict cortical gyro-sulcal segmentation maps on individual brains. The convolution on mesh considers the spatial organization of functional gradients and folding patterns on a cortical sheet and the newly designed channel attention block enhances the interpretability of the contribution of different functional gradients to cortical folding prediction. Experiments show that the prediction performance via our model outperforms other state-of-the-art models. In addition, we found that the dominant functional gradients contribute less to folding prediction. On the activation maps of the last layer, some well-studied cortical landmarks are found on the borders of, rather than within, the highly activated regions. These results and findings suggest that a specifically designed artificial neural network can improve the precision of the mapping between brain functions and cortical folding patterns, and can provide valuable insight of brain anatomy-function relation for neuroscience.

preprint2022arXiv

Cointegration of SARS-CoV-2 Transmission with Weather Conditions and Mobility during the First Year of the COVID-19 Pandemic in the United States

Correlation between weather and the transmission of SARS-CoV-2 may suggest its seasonality. Cointegration analysis can avoid spurious correlation among time series data. We examined the cointegration of virus transmission with daily temperature, dewpoint, and confounding factors of mobility measurements during the first year of the pandemic in the United States. We examined the cointegration of the effective reproductive rate, Rt, of the virus with the dewpoint at two meters, the temperature at two meters, Apple driving mobility, and Google workplace mobility measurements. We found that dewpoint and Apple driving mobility are the best factors to cointegrate with Rt, although temperature and Google workplace mobility also cointegrate with Rt at substantial levels. We found that the optimal lag is two days for cointegration between Rt and weather variables, and three days for Rt and mobility. We observed clusters of states that share similar cointegration results of Rt, weather, and mobility, suggesting regional patterns. Our results support the correlation of weather with the spread of SARS-CoV-2 and its potential seasonality.

preprint2022arXiv

DeepRelease: Language-agnostic Release Notes Generation from Pull Requests of Open-source Software

The release note is an essential software artifact of open-source software that documents crucial information about changes, such as new features and bug fixes. With the help of release notes, both developers and users could have a general understanding of the latest version without browsing the source code. However, it is a daunting and time-consuming job for developers to produce release notes. Although prior studies have provided some automatic approaches, they generate release notes mainly by extracting information from code changes. This will result in language-specific and not being general enough to be applicable. Therefore, helping developers produce release notes effectively remains an unsolved challenge. To address the problem, we first conduct a manual study on the release notes of 900 GitHub projects, which reveals that more than 54% of projects produce their release notes with pull requests. Based on the empirical finding, we propose a deep learning based approach named DeepRelease (Deep learning based Release notes generator) to generate release notes according to pull requests. The process of release notes generation in DeepRelease includes the change entries generation and the change category (i.e., new features or bug fixes) generation, which are formulated as a text summarization task and a multi-class classification problem, respectively. Since DeepRelease fully employs text information from pull requests to summarize changes and identify the change category, it is language-agnostic and can be used for projects in any language. We build a dataset with over 46K release notes and evaluate DeepRelease on the dataset. The experimental results indicate that DeepRelease outperforms four baselines and can generate release notes similar to those manually written ones in a fraction of the time.

preprint2022arXiv

Event-based EV Charging Scheduling in A Microgrid of Buildings

With the popularization of the electric vehicles (EVs), EV charging demand is becoming an important load in the building. Considering the mobility of EVs from building to building and their uncertain charging demand, it is of great practical interest to control the EV charging process in a microgrid of buildings to optimize the total operation cost while ensuring the transmission safety between the microgrid and the main grid. We consider this important problem in this paper and make the following contributions. First, we formulate this problem as a Markov decision process to capture the uncertain supply and EV charging demand in the microgrid of buildings. Besides reducing the total operation cost of buildings, the model also considers the power exchange limitation to ensure transmission safety. Second, this model is reformulated under event-based optimization framework to alleviate the impact of large state and action space. By appropriately defining the event and event-based action, the EV charging process can be optimized by searching a randomized parametric event-based control policy in the microgrid controller and implementing a selecting-to-charging rule in each building controller. Third, a constrained gradient-based policy optimzation method with adjusting mechanism is proposed to iteratively find the optimal event-based control policy for EV charging demand in each building. Numerical experiments considering a microgrid of three buildings are conducted to analyze the structure and the performance of the event-based control policy for EV charging.

preprint2022arXiv

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning

Visual grounding is a task to locate the target indicated by a natural language expression. Existing methods extend the generic object detection framework to this problem. They base the visual grounding on the features from pre-generated proposals or anchors, and fuse these features with the text embeddings to locate the target mentioned by the text. However, modeling the visual features from these predefined locations may fail to fully exploit the visual context and attribute information in the text query, which limits their performance. In this paper, we propose a transformer-based framework for accurate visual grounding by establishing text-conditioned discriminative features and performing multi-stage cross-modal reasoning. Specifically, we develop a visual-linguistic verification module to focus the visual features on regions relevant to the textual descriptions while suppressing the unrelated areas. A language-guided feature encoder is also devised to aggregate the visual contexts of the target object to improve the object's distinctiveness. To retrieve the target from the encoded visual features, we further propose a multi-stage cross-modal decoder to iteratively speculate on the correlations between the image and text for accurate target localization. Extensive experiments on five widely used datasets validate the efficacy of our proposed components and demonstrate state-of-the-art performance. Our code is public at https://github.com/yangli18/VLTVG.

preprint2022arXiv

LCCDE: A Decision-Based Ensemble Framework for Intrusion Detection in The Internet of Vehicles

Modern vehicles, including autonomous vehicles and connected vehicles, have adopted an increasing variety of functionalities through connections and communications with other vehicles, smart devices, and infrastructures. However, the growing connectivity of the Internet of Vehicles (IoV) also increases the vulnerabilities to network attacks. To protect IoV systems against cyber threats, Intrusion Detection Systems (IDSs) that can identify malicious cyber-attacks have been developed using Machine Learning (ML) approaches. To accurately detect various types of attacks in IoV networks, we propose a novel ensemble IDS framework named Leader Class and Confidence Decision Ensemble (LCCDE). It is constructed by determining the best-performing ML model among three advanced ML algorithms (XGBoost, LightGBM, and CatBoost) for every class or type of attack. The class leader models with their prediction confidence values are then utilized to make accurate decisions regarding the detection of various types of cyber-attacks. Experiments on two public IoV security datasets (Car-Hacking and CICIDS2017 datasets) demonstrate the effectiveness of the proposed LCCDE for intrusion detection on both intra-vehicle and external networks.

preprint2022arXiv

LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning

Efficient exploration is important for reinforcement learners to achieve high rewards. In multi-agent systems, coordinated exploration and behaviour is critical for agents to jointly achieve optimal outcomes. In this paper, we introduce a new general framework for improving coordination and performance of multi-agent reinforcement learners (MARL). Our framework, named Learnable Intrinsic-Reward Generation Selection algorithm (LIGS) introduces an adaptive learner, Generator that observes the agents and learns to construct intrinsic rewards online that coordinate the agents' joint exploration and joint behaviour. Using a novel combination of MARL and switching controls, LIGS determines the best states to learn to add intrinsic rewards which leads to a highly efficient learning process. LIGS can subdivide complex tasks making them easier to solve and enables systems of MARL agents to quickly solve environments with sparse rewards. LIGS can seamlessly adopt existing MARL algorithms and, our theory shows that it ensures convergence to policies that deliver higher system performance. We demonstrate its superior performance in challenging tasks in Foraging and StarCraft II.

preprint2022arXiv

Manipulating Interlayer Excitons for Ultra-pure Near-infrared Quantum Light Generation

Interlayer excitons (IXs) formed at the interface of atomically-thin semiconductors possess various novel properties. In a parallel development, nanoscale strain engineering has emerged as an effective means for creating 2D quantum light sources. Exploring the intersection of these two exciting areas, where strain and defects are exploited for the manipulation of IX toward the emergence of new functionalities, is currently at a nascent stage. Here, using MoS2/WSe2 heterostructure as a model system, we demonstrate how strain, defects, and layering can be utilized to create defect-bound IXs capable of bright, robust, and tunable quantum light emission in the technologically important near-infrared spectral range. We were able to achieve ultra-high single-photon purity with g(2)(0) = 0.01. Our strategy of creating site-controlled QEs from the defect-bound IXs represents a paradigm shift in 2D quantum photonics research, from engineering intralayer exciton in monolayer structures towards IXs at the interface of 2D heterostructures.

preprint2022arXiv

Metal Lines Associated with the Lyman-alpha Forest from eBOSS Data

We investigate the metal species associated with the Ly$α$ forest in the eBOSS quasar spectra. Metal absorption lines are revealed in the stacked spectra from cross-correlating the selected Ly$α$ absorbers in the forest and the flux fluctuation field. Up to 13 metal species are identified associated with relatively strong Ly$α$ absorbers (those with flux fluctuation $-1.0<δ_{\rm Lyα}<-0.6$ and with neutral hydrogen column density of ~ $10^{15-16}$ $\rm cm^{-2}$) over absorber redshift range of $2<z_{\rm abs}<4$. The column densities of these species decrease toward higher redshift and for weaker Ly$α$ absorbers. From modelling the column densities of various species, we find that the column density pattern suggests contributions from multiple gas components both in the circumgalactic medium (CGM) and in the intergalactic medium (IGM). While the low-ionization species (e.g., C II, Si II, and Mg II) can be explained by high-density, cool gas ($T \sim 10^4$ K) from the CGM, the high-ionization species may reside in low-density or high-temperature gas in the IGM. The measurements provide inputs to model metal contamination in the Ly$α$ forest baryon acoustic oscillations measurement. Comparison with metal absorptions in high-resolution quasar spectra and with hydrodynamic galaxy formation simulations can further elucidate the physical conditions of these Ly$α$ absorbers.

preprint2022arXiv

Shifting Trends of COVID-19 Tweet Sentiment with Respect to Voting Preferences in the 2020 Election Year of the United States

COVID-19 related policies were extensively politicized during the 2020 election year of the United States, resulting in polarizing viewpoints. Twitter users were particularly engaged during the 2020 election year. Here we investigated whether COVID-19 related tweets were associated with the overall election results at the state level during the period leading up to the election day. We observed weak correlations between the average sentiment of COVID-19 related tweets and popular votes in two-week intervals, and the trends gradually become opposite. We then compared the average sentiments of COVID-19 related tweets between states called in favor of Republican (red states) or Democratic parties (blue states). We found that at the beginning of lockdowns sentiments in the blue states were much more positive than those in the red states. However, sentiments in the red states gradually become more positive during the summer of 2020 and persisted until the election day.

preprint2022arXiv

TRGP: Trust Region Gradient Projection for Continual Learning

Catastrophic forgetting is one of the major challenges in continual learning. To address this issue, some existing methods put restrictive constraints on the optimization space of the new task for minimizing the interference to old tasks. However, this may lead to unsatisfactory performance for the new task, especially when the new task is strongly correlated with old tasks. To tackle this challenge, we propose Trust Region Gradient Projection (TRGP) for continual learning to facilitate the forward knowledge transfer based on an efficient characterization of task correlation. Particularly, we introduce a notion of `trust region&#39; to select the most related old tasks for the new task in a layer-wise and single-shot manner, using the norm of gradient projection onto the subspace spanned by task inputs. Then, a scaled weight projection is proposed to cleverly reuse the frozen weights of the selected old tasks in the trust region through a layer-wise scaling matrix. By jointly optimizing the scaling matrices and the model, where the model is updated along the directions orthogonal to the subspaces of old tasks, TRGP can effectively prompt knowledge transfer without forgetting. Extensive experiments show that our approach achieves significant improvement over related state-of-the-art methods.

preprint2021arXiv

Big Bird: Transformers for Longer Sequences

Transformers-based models, such as BERT, have been one of the most successful deep learning models for NLP. Unfortunately, one of their core limitations is the quadratic dependency (mainly in terms of memory) on the sequence length due to their full attention mechanism. To remedy this, we propose, BigBird, a sparse attention mechanism that reduces this quadratic dependency to linear. We show that BigBird is a universal approximator of sequence functions and is Turing complete, thereby preserving these properties of the quadratic, full attention model. Along the way, our theoretical analysis reveals some of the benefits of having $O(1)$ global tokens (such as CLS), that attend to the entire sequence as part of the sparse attention mechanism. The proposed sparse attention can handle sequences of length up to 8x of what was previously possible using similar hardware. As a consequence of the capability to handle longer context, BigBird drastically improves performance on various NLP tasks such as question answering and summarization. We also propose novel applications to genomics data.

preprint2021arXiv

Concept Drift Detection in Federated Networked Systems

As next-generation networks materialize, increasing levels of intelligence are required. Federated Learning has been identified as a key enabling technology of intelligent and distributed networks; however, it is prone to concept drift as with any machine learning application. Concept drift directly affects the model&#39;s performance and can result in severe consequences considering the critical and emergency services provided by modern networks. To mitigate the adverse effects of drift, this paper proposes a concept drift detection system leveraging the federated learning updates provided at each iteration of the federated training process. Using dimensionality reduction and clustering techniques, a framework that isolates the system&#39;s drifted nodes is presented through experiments using an Intelligent Transportation System as a use case. The presented work demonstrates that the proposed framework is able to detect drifted nodes in a variety of non-iid scenarios at different stages of drift and different levels of system exposure.

preprint2021arXiv

Fast Outage Analysis of Large-scale Production Clouds with Service Correlation Mining

Cloud-based services are surging into popularity in recent years. However, outages, i.e., severe incidents that always impact multiple services, can dramatically affect user experience and incur severe economic losses. Locating the root-cause service, i.e., the service that contains the root cause of the outage, is a crucial step to mitigate the impact of the outage. In current industrial practice, this is generally performed in a bootstrap manner and largely depends on human efforts: the service that directly causes the outage is identified first, and the suspected root cause is traced back manually from service to service during diagnosis until the actual root cause is found. Unfortunately, production cloud systems typically contain a large number of interdependent services. Such a manual root cause analysis is often time-consuming and labor-intensive. In this work, we propose COT, the first outage triage approach that considers the global view of service correlations. COT mines the correlations among services from outage diagnosis data. After learning from historical outages, COT can infer the root cause of emerging ones accurately. We implement COT and evaluate it on a real-world dataset containing one year of data collected from Microsoft Azure, one of the representative cloud computing platforms in the world. Our experimental results show that COT can reach a triage accuracy of 82.1%~83.5%, which outperforms the state-of-the-art triage approach by 28.0%~29.7%.

preprint2021arXiv

PWPAE: An Ensemble Framework for Concept Drift Adaptation in IoT Data Streams

As the number of Internet of Things (IoT) devices and systems have surged, IoT data analytics techniques have been developed to detect malicious cyber-attacks and secure IoT systems; however, concept drift issues often occur in IoT data analytics, as IoT data is often dynamic data streams that change over time, causing model degradation and attack detection failure. This is because traditional data analytics models are static models that cannot adapt to data distribution changes. In this paper, we propose a Performance Weighted Probability Averaging Ensemble (PWPAE) framework for drift adaptive IoT anomaly detection through IoT data stream analytics. Experiments on two public datasets show the effectiveness of our proposed PWPAE method compared against state-of-the-art methods.

preprint2021arXiv

Testing for Treatment Effect in Covariate-Adaptive Randomized Clinical Trials with Generalized Linear Models and Omitted Covariates

Concerns have been expressed over the validity of statistical inference under covariate-adaptive randomization despite the extensive use in clinical trials. In the literature, the inferential properties under covariate-adaptive randomization have been mainly studied for continuous responses; in particular, it is well known that the usual two sample t-test for treatment effect is typically conservative, in the sense that the actual test size is smaller than the nominal level. This phenomenon of invalid tests has also been found for generalized linear models without adjusting for the covariates and are sometimes more worrisome due to inflated Type I error. The purpose of this study is to examine the unadjusted test for treatment effect under generalized linear models and covariate-adaptive randomization. For a large class of covariate-adaptive randomization methods, we obtain the asymptotic distribution of the test statistic under the null hypothesis and derive the conditions under which the test is conservative, valid, or anti-conservative. Several commonly used generalized linear models, such as logistic regression and Poisson regression, are discussed in detail. An adjustment method is also proposed to achieve a valid size based on the asymptotic results. Numerical studies confirm the theoretical findings and demonstrate the effectiveness of the proposed adjustment method.

preprint2020arXiv

A Fast Radio Burst discovered in FAST drift scan survey

We report the discovery of a highly dispersed fast radio burst, FRB~181123, from an analysis of $\sim$1500~hr of drift-scan survey data taken using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The pulse has three distinct emission components, which vary with frequency across our 1.0--1.5~GHz observing band. We measure the peak flux density to be $>0.065$~Jy and the corresponding fluence $>0.2$~Jy~ms. Based on the observed dispersion measure of 1812~cm$^{-3}$~pc, we infer a redshift of $\sim 1.9$. From this, we estimate the peak luminosity and isotropic energy to be $\lesssim 2\times10^{43}$~erg~s$^{-1}$ and $\lesssim 2\times10^{40}$~erg, respectively. With only one FRB from the survey detected so far, our constraints on the event rate are limited. We derive a 95\% confidence lower limit for the event rate of 900 FRBs per day for FRBs with fluences $>0.025$~Jy~ms. We performed follow-up observations of the source with FAST for four hours and have not found a repeated burst. We discuss the implications of this discovery for our understanding of the physical mechanisms of FRBs.

preprint2020arXiv

A Progressive Sub-Network Searching Framework for Dynamic Inference

Many techniques have been developed, such as model compression, to make Deep Neural Networks (DNNs) inference more efficiently. Nevertheless, DNNs still lack excellent run-time dynamic inference capability to enable users trade-off accuracy and computation complexity (i.e., latency on target hardware) after model deployment, based on dynamic requirements and environments. Such research direction recently draws great attention, where one realization is to train the target DNN through a multiple-term objective function, which consists of cross-entropy terms from multiple sub-nets. Our investigation in this work show that the performance of dynamic inference highly relies on the quality of sub-net sampling. With objective to construct a dynamic DNN and search multiple high quality sub-nets with minimal searching cost, we propose a progressive sub-net searching framework, which is embedded with several effective techniques, including trainable noise ranking, channel group and fine-tuning threshold setting, sub-nets re-selection. The proposed framework empowers the target DNN with better dynamic inference capability, which outperforms prior works on both CIFAR-10 and ImageNet dataset via comprehensive experiments on different network structures. Taken ResNet18 as an example, our proposed method achieves much better dynamic inference accuracy compared with prior popular Universally-Slimmable-Network by 4.4%-maximally and 2.3%-averagely in ImageNet dataset with the same model size.

preprint2020arXiv

Artificial Multiferroics and Enhanced Magnetoelectric Effect in van der Waals Heterostructures

Multiferroic materials with coupled ferroelectric and ferromagnetic properties are important for multifunctional devices due to their potential ability of controlling magnetism via electric field, and vice versa. The recent discoveries of two-dimensional ferromagnetic and ferroelectric materials have ignited tremendous research interest and aroused hope to search for two-dimensional multiferroics. However, intrinsic two-dimensional multiferroic materials and, particularly, those with strong magnetoelectric couplings are still rare to date. In this paper, using first-principles simulations, we propose artificial two-dimensional multiferroics via a van der Waals heterostructure formed by ferromagnetic bilayer chromium triiodide (CrI3) and ferroelectric monolayer Sc2CO2. In addition to the coexistence of ferromagnetism and ferroelectricity, our calculations show that, by switching the electric polarization of Sc2CO2, we can tune the interlayer magnetic couplings of bilayer CrI3 between ferromagnetic and antiferromagnetic states. We further reveal that such a strong magnetoelectric effect is from a dramatic change of the band alignment induced by the strong build-in electric polarization in Sc2CO2 and the subsequent change of the interlayer magnetic coupling of bilayer CrI3. These artificial multiferroics and enhanced magnetoelectric effect give rise to realizing multifunctional nanoelectronics by van der Waals heterostructures.

preprint2020arXiv

Exciton Transport under Periodic Potential in MoSe2/WSe2 Heterostructures

The predicted formation of moire superlattices leading to confined excitonic states in heterostructures formed by stacking two lattice mismatched transition metal dichalcogenide (TMD) monolayers was recently experimentally confirmed. Such periodic potential in TMD heterostructure functions as a diffusion barrier that affects the energy transport and dynamics of interlayer excitons (electron and hole spatially concentrated in different monolayers). Understanding the transport of excitons under such condition is essential to establish the material system as a next generation device platform. In this work, we experimentally quantify the diffusion barrier experienced by the interlayer excitons in a hexagonal boron nitride (hBN) encapsulated, molybdenum diselenide tungsten/diselenide (MoSe2/WSe2) heterostructure by studying the exciton transport at various temperatures.

preprint2020arXiv

First-Principles Studies of Second-Order Nonlinear Optical Properties of Organic-Inorganic Hybrid Halide Perovskites

Organic-inorganic hybrid halide perovskites have ignited tremendous interests for photovoltaic applications. However, their nonlinear optical response has not been studied although many of these structures lack the centrosymmetry and exhibit ferroelectricity. In this work, we employ our developed large-scale parallel, first-principles simulation tool (ArchNLO) to explore second-order nonlinear optical properties of a typical family of organic-inorganic hybrid halide perovskites, CH3NH3MX3 (M= Ge, Sn, Pb; X=Cl, Br, I). We find that these hybrid perovskites exhibit second harmonic generation and linear electro-optic effect. The nonlinear optical effects are strongly influenced by the types and positions of cations/anions and corresponding band gaps. Particularly, the distorted cubic phase, which is essentially triclinic, of CH3NH3SnI3 shows significant second harmonic generation and electro-optic effect, which are comparable with those widely used materials, such as GaAs. These second-order optical properties of organic-inorganic hybrid halide perovskites and their low-temperature, solution-based fabrication pave the way for achieving and implementing nonlinear optical devices with low cost.

preprint2020arXiv

Giant linearly-polarized photogalvanic effect and second harmonic generation in two-dimensional axion insulators

The second-order nonlinear optical (NLO) processes, such as the photogalvanic effect and second-order harmonic generation (SHG), play crucial roles in probing and controlling light-matter interactions for energy and device applications. To date, most studies of second-order NLO processes focus on materials with broken spatial inversion symmetry, such as proper ferroelectrics and noncentrosymmetric Weyl semimetals. Nevertheless, inversion symmetry of Shubnikov groups can be broken via spin-ordering in centrosymmetric crystals. Unfortunately, these materials are less common, and their NLO responses are usually weak. Combining quantum perturbation theory and first-principles simulations, we predict a giant injection-current photogalvanic effect and SHG in a family of emerging axion insulators, the even septuple layers of MnBi2Te4 (MBT) materials that exhibit the zero-plateau quantum anomalous Hall (QAH) effect. Their amplitudes of injection current and SHG are about two orders of magnitude larger than those of widely used ferroelectrics, such as BiFeO3 and LiNbO3. Moreover, unlike the usual injection current observed under circularly-polarized light, the injection photocurrent of MBTs only emerges under linearly polarized light, making it convenient for device applications. These unique characters are from a combination effect of parity-time symmetry, three-fold rotation symmetry, and significant spin-orbit coupling. These enhanced NLO effects are valuable for characterizing subtle topological orders in QAH systems and also shed light on novel infrared photo-detector and photovoltaic applications based on magnetic topological materials.

preprint2020arXiv

Intrinsic Spin Photogalvanic Effect in Nonmagnetic Insulator

We show that with the help of spin-orbit coupling, nonlinear light-matter interactions can efficiently couple with spin and valley degrees of freedom. This revealed spin photogalvanic effect can generate the long-time pursued intrinsic pure spin current (PSC) in non-centrosymmetric nonmagnetic insulators. Different from the spin and valley Hall effect, such a photo-driven spin current is universal and can be generated without external bias field. Using first-principles simulation, we study monolayer transition metal dichalcogenides (TMDs) to demonstrate this effect and confirm an enhanced PSC under linearly polarized photoexcitation. The amplitude of the PSC is one order larger than that of the charge current observed in monolayer TMDs. This exotic nonlinear light-spin interaction indicates that light can be utilized as a rapid fashion to manipulate the spin-polarized current, which is crucial for future low-dissipation nanodevices.

preprint2020arXiv

KSM: Fast Multiple Task Adaption via Kernel-wise Soft Mask Learning

Deep Neural Networks (DNN) could forget the knowledge about earlier tasks when learning new tasks, and this is known as \textit{catastrophic forgetting}. While recent continual learning methods are capable of alleviating the catastrophic problem on toy-sized datasets, some issues still remain to be tackled when applying them in real-world problems. Recently, the fast mask-based learning method (e.g. piggyback \cite{mallya2018piggyback}) is proposed to address these issues by learning only a binary element-wise mask in a fast manner, while keeping the backbone model fixed. However, the binary mask has limited modeling capacity for new tasks. A more recent work \cite{hung2019compacting} proposes a compress-grow-based method (CPG) to achieve better accuracy for new tasks by partially training backbone model, but with order-higher training cost, which makes it infeasible to be deployed into popular state-of-the-art edge-/mobile-learning. The primary goal of this work is to simultaneously achieve fast and high-accuracy multi task adaption in continual learning setting. Thus motivated, we propose a new training method called \textit{kernel-wise Soft Mask} (KSM), which learns a kernel-wise hybrid binary and real-value soft mask for each task, while using the same backbone model. Such a soft mask can be viewed as a superposition of a binary mask and a properly scaled real-value tensor, which offers a richer representation capability without low-level kernel support to meet the objective of low hardware overhead. We validate KSM on multiple benchmark datasets against recent state-of-the-art methods (e.g. Piggyback, Packnet, CPG, etc.), which shows good improvement in both accuracy and training cost.

preprint2020arXiv

Nonreciprocal Second-Harmonic Generation in Few-Layer Chromium Triiodide

It is of fundamental importance but challenging to simultaneously identify atomic and magnetic configurations of two-dimensional van der Waals materials. In this work, we show that the nonreciprocal second-harmonic generation (SHG) can be a powerful tool to answer this challenge. Despite the preserved lattice inversion symmetry, the interlayer antiferromagnetic order and spin-orbit coupling generate enhanced SHG in PT-symmetric bilayer chromium triiodide (CrI3). Importantly, the in-plane polarization-resolved SHG is sensitive to subtly different interlayer structures that cannot be told by linear optical spectra. Beyond bilayer, we further predict that the intensity and angle-resolved SHG can be employed to identify both interlayer atomic and magnetic configurations of trilayer CrI3. Our first-principles results agree with available measurements and show the potential of SHG as a non-contacting approach to explore correlations between interlayer structures and magnetic orders of emerging ultra-thin magnetic materials.

preprint2020arXiv

Photo-degradation Protection in 2D In-Plane Heterostructures Revealed by Hyperspectral Nanoimaging: the Role of Nano-Interface 2D Alloys

Single-layer heterostructures exhibit striking quasiparticle properties and many-body interaction effects that hold promise for a range of applications. However, their properties can be altered by intrinsic and extrinsic defects, thus diminishing their applicability. Therefore, it is of paramount importance to identify defects and understand 2D materials&#39; degradation over time using advanced multimodal imaging techniques as well as stabilize degradation via built-in interface protection. Here we implemented a liquid-phase precursor approach to synthesize 2D in-plane MoS2-WS2 heterostructures exhibiting nanoscale alloyed interfaces and map exotic interface effects during photo-degradation using a novel combination of hyperspectral tip-enhanced photoluminescence, Raman and near-field nanoscopy. Surprisingly, 2D alloyed regions exhibit remarkable thermal and photo-degradation stability providing protection against oxidation. Coupled with surface and interface strain, 2D alloy regions create localized potential wells that concentrate excitonic species via a charge carrier funneling effect. These results provide a clear understanding of the importance of 2D alloys as systems able to withstand degradation effects over time, and could be now used to stabilize optoelectronic devices based on 2D materials.

preprint2020arXiv

Quasiparticle Energies and Excitonic Effects of Chromium Trichloride: from Two Dimensions to Bulk

Layered van der Waals (vdW) magnetic materials have attracted significant research interest to date. In this work, we employ the first-principles many-body perturbation theory to calculate excited-state properties of a prototype vdW magnet, chromium trichloride (CrCl3), covering monolayer, bilayer, and bulk structures. Unlike usual non-magnetic vdW semiconductors, in which many-electron interactions and excited states are sensitive to dimensionality, many-electron interactions are always enhanced and dominate quasiparticle energies and optical responses of both two-dimensional and bulk CrCl3. The electron-hole (e-h) binding energy can reach 3 eV in monolayer and remains as high as 2 eV in bulk. Because of the cancellation effect between self-energy corrections and e-h binding energies, the lowest-energy exciton (optical gap) is almost not affected by the change of dimensionality. Besides, for the excitons with similar e-h binding energies, their dipole oscillator strength can differ by a few orders of magnitude.Our analysis shows that such a big difference is from a unique interference effect between complex exciton wavefunctions and interband transitions. Finally, we find that the interlayer stacking sequence and magnetic coupling barely change quasiparticle band gaps and optical absorption spectra of CrCl3. Our calculated low-energy exciton peak positions agree with available measurements. These findings give insight into the understanding of many-electron interactions and the interplay between magnetic orders and optical excitations in vdW magnetic materials.

preprint2020arXiv

Raman Response and Transport Properties of One-Dimensional van der Waals Tellurium Nanowires

Tellurium can form nanowires of helical atomic chains. Given their unique one-dimensional van der Waals structure, these nanowires are expected to show remarkably different physical and electronic properties than bulk tellurium. Here we show that few-chain and single-chain van der Waals tellurium nanowires can be isolated using carbon nanotube and boron nitride nanotube encapsulation. With the approach, the number of atomic chains can be controlled by the inner diameter of the nanotube. The Raman response of the structures suggests that the interaction between a single-atomic tellurium chain and a carbon nanotube is weak, and that the inter-chain interaction becomes stronger as the number of chains increases. Compared with bare tellurium nanowires on SiO2, nanowires encapsulated in boron nitride nanotubes exhibit a dramatically enhanced current-carrying capacity, with a current density of 1.5*10^8 A cm-2, which exceeds that of most semiconducting nanowires. We also use our tellurium nanowires encapsulated in boron nitride nanotubes to create field-effect transistors that have a diameter of only 2 nm.

preprint2020arXiv

Study on $η_{c2}(η_{b2})$ electromagnetic decay into double photons

Within the framework of nonrelativistic QCD (NRQCD) factorization formalism, we compute the helicity amplitude as well as the decay width of $η_{Q2}$ ($Q=c,b$) electromagnetic decay into two photons up to next-to-next-to-leading order (NNLO) in $α_s$ expansion. For the first time, we verify the validity of NRQCD factorization for the D-wave quarkonium decay at NNLO. We find that the $\mathcal{O}(α_s)$ and $\mathcal{O}(α_s^2)$ corrections to the helicity amplitude are negative and moderate, nevertheless both corrections combine to suppress the leading-order prediction for the decay width significantly. By approximating the total decay width of $η_{Q2}$ as the sum of those for the hadronic decay and the electric $E1$ transition, we obtain the branching ratios ${\rm Br}(η_{c2}\to 2γ)\approx 5\times10^{-6}$ and ${\rm Br}(η_{b2}\to 2γ)\approx 4\times10^{-7}$. To explore the potential measurement on $η_{Q2}$, we further evaluate the production cross section of $η_{Q2}$ at LHCb at the lowest order in $α_s$ expansion. With the kinematic constraint on the longitudinal rapidity $4.5>y>2$ and transverse momentum $P_T>(2-4)m_Q$ for $η_{Q2}$, we find the cross section can reach $2-50$ nb for $η_{c2}$, and $1-22$ pb for $η_{b2}$. Considering the integrated luminosity $\mathcal{L}=10\, {\rm fb}^{-1}$ at $\sqrt{s}=7$ TeV and $\sqrt{s}=13$ TeV, we estimate that there are several hundreds events of $pp\to η_{c2}\to 2γ$. Since the background is relatively clean, it is promising to reconstruct $η_{c2}$ through its electromagnetic decay. On the contrary, due to small branching ratio and production cross section, it is quite challenging to detect $η_{b2}\to 2γ$ at LHCb.

preprint2019arXiv

Curie Temperature of Emerging Two-Dimensional Magnetic Structures

Recent realizations of intrinsic, long-range magnetic orders in two-dimensional (2D) van der Waals materials have ignited tremendous research interests. In this work, we employ the XXZ Heisenberg model and Monte Carlo simulations to study a fundamental property of these emerging 2D magnetic materials, the Curie temperature (Tc). By including both onsite and neighbor couplings extracted from first-principles simulations, we have calculated Tc of monolayer chromium trihalides and Cr2Ge2Te6, which are of broad interests currently, and the simulation results agree with available measurements. We also clarify the roles played by anisotropic and isotropic interactions in deciding Tc of magnetic orders. Particularly, we find a universal, linear dependence between Tc and magnetic interactions within the parameter space of realistic materials. With this linear dependence, we can predict Tc of general 2D lattice structures, omitting the Monte Carlo simulations. Compared with the widely used Ising model, mean-field theory, and spin-wave theory, this work provides a convenient and quantitative estimation of Tc, giving hope to speeding up the search for novel 2D materials with higher Curie temperatures.

preprint2019arXiv

Deep Learning-Enhanced Variational Monte Carlo Method for Quantum Many-Body Physics

Artificial neural networks have been successfully incorporated into variational Monte Carlo method (VMC) to study quantum many-body systems. However, there have been few systematic studies of exploring quantum many-body physics using deep neural networks (DNNs), despite of the tremendous success enjoyed by DNNs in many other areas in recent years. One main challenge of implementing DNN in VMC is the inefficiency of optimizing such networks with large number of parameters. We introduce an importance sampling gradient optimization (ISGO) algorithm, which significantly improves the computational speed of training DNN in VMC. We design an efficient convolutional DNN architecture to compute the ground state of a one-dimensional (1D) SU($N$) spin chain. Our numerical results of the ground-state energies with up to 16 layers of DNN show excellent agreement with the Bethe-Ansatz exact solution. Furthermore, we also calculate the loop correlation function using the wave function obtained. Our work demonstrates the feasibility and advantages of applying DNNs to numerical quantum many-body calculations.