Source author record

Peng Xu

Peng Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

94works

47topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Ada-MK: Adaptive MegaKernel Optimization via Automated DAG-based Search for LLM Inference

When large language models (LLMs) serve real-time inference in commercial online advertising systems, end-to-end latency must be strictly bounded to the millisecond range. Yet every token generated during the decode phase triggers thousands of kernel launches, and kernel launch overhead alone can account for 14.6% of end-to-end inference time. MegaKernel eliminates launch overhead and inter-operator HBM round-trips by fusing multiple operators into a single persistent kernel. However, existing MegaKernel implementations face a fundamental tension between portability and efficiency on resource-constrained GPUs such as NVIDIA Ada: hand-tuned solutions are tightly coupled to specific architectures and lack portability, while auto-compiled approaches introduce runtime dynamic scheduling whose branch penalties are unacceptable in latency-critical settings. We observe that under a fixed deployment configuration, the optimal execution path of a MegaKernel is uniquely determined, and runtime dynamic decision-making can be entirely hoisted to compile time. Building on this insight, we propose Ada-MK: (1) a three-dimensional shared-memory constraint model combined with K-dimension splitting that reduces peak shared memory usage by 50%; (2) MLIR-based fine-grained DAG offline search that solidifies the optimal execution path, completely eliminating runtime branching; and (3) a heterogeneous hybrid inference engine that embeds MegaKernel as a plugin into TensorRT-LLM, combining high-throughput Prefill with low-latency Decode. On an NVIDIA L20, Ada-MK improves single-batch throughput by up to 23.6% over vanilla TensorRT-LLM and 50.2% over vLLM, achieving positive gains across all tested scenarios--the first industrial deployment of MegaKernel in a commercial online advertising system.

preprint2024arXiv

Deep Learning Driven Buffer-Aided Cooperative Networks for B5G/6G: Challenges, Solutions, and Future Opportunities

Buffer-aided cooperative networks (BACNs) have garnered significant attention due to their potential applications in beyond fifth generation (B5G) or sixth generation (6G) critical scenarios. This article explores various typical application scenarios of buffer-aided relaying in B5G/6G networks to emphasize the importance of incorporating BACN. Additionally, we delve into the crucial technical challenges in BACN, including stringent delay constraints, high reliability, imperfect channel state information (CSI), transmission security, and integrated network architecture. To address the challenges, we propose leveraging deep learning-based methods for the design and operation of B5G/6G networks with BACN, deviating from conventional buffer-aided relay selection approaches. In particular, we present two case studies to demonstrate the efficacy of centralized deep reinforcement learning (DRL) and decentralized DRL in buffer-aided non-terrestrial networks. Finally, we outline future research directions in B5G/6G that pertain to the utilization of BACN.

preprint2022arXiv

$r$-Hyperideals and Generalizations of $r$-Hyperideals in Krasner Hyperrings

In this study, we examine some properties of $r$-hyperideals in the commutative Krasner hyperrings. Some properties of $pr$-hyperideals are also studied. The relation between prime hyperideals and $r$-hyperideals is investigated. We show that the image and the inverse image of an $r$-hyperideal is also an $r$-hyperideal. We also introduce a generalization of r-hyperideals and we prove some properties of them.

preprint2022arXiv

$δ$-$r$-Hyperideals and $ϕ$-$δ$-$r$-Hyperideals of Commutative Krasner Hyperrings

In this paper, our purpose is to define the expansion of $r$-hyperideals and extend this concept to $ϕ$-$δ$-$r$-hyperideal. Let $\Re$ be a commutative Krasner hyperring with nonzero identity. Given an expansion $δ$ of hyperideals, a proper hyperideal $N$ of $\Re$ is called $δ$-$r$-hyperideal if $a\cdot b\in N$ with $ann(a)=0$ implies that $b\in δ(N)$, for all $a,b\in\Re$. Therefore, given an expansion $δ$ of hyperideals and a hyperideal reduction $ϕ$, a proper hyperideal $N$ of $\Re$ is called $ϕ$-$δ$-$r$-hyperideal if $a\cdot b\in N-ϕ(N)$ with $ann(a)=0$ implies that $b\inδ(N)$, for all $a,b\in\Re$. We investigate some of their properties and give some examples.

preprint2022arXiv

A UCB-based Tree Search Approach to Joint Verification-Correction Strategy for Large Scale Systems

Verification planning is a sequential decision-making problem that specifies a set of verification activities (VA) and correction activities (CA) at different phases of system development. While VAs are used to identify errors and defects, CAs also play important roles in system verification as they correct the identified errors and defects. However, current planning methods only consider VAs as decision choices. Because VAs and CAs have different activity spaces, planning a joint verification-correction strategy (JVCS) is still challenging, especially for large-size systems. Here we introduce a UCB-based tree search approach to search for near-optimal JVCSs. First, verification planning is simplified as repeatable bandit problems and an upper confidence bound rule for repeatable bandits (UCBRB) is presented with the optimal regret bound. Next, a tree search algorithm is proposed to search for feasible JVCSs. A tree-based ensemble learning model is also used to extend the tree search algorithm to handle local optimality issues. The proposed approach is evaluated on the notional case of a communication system.

preprint2022arXiv

Android-COCO: Android Malware Detection with Graph Neural Network for Byte- and Native-Code

With the popularity of Android growing exponentially, the amount of malware has significantly exploded. It is arguably one of the most viral problems on mobile platforms. Recently, various approaches have been introduced to detect Android malware, the majority of these are either based on the Manifest File features or the structural information, such as control flow graph and API calls. Among those methods, nearly all of them only consider the Java byte-code as the target to detect malicious behaviors. However, Recent research and our own statistics show that native payloads are commonly used in both benign and malicious apps. Current state-of-the-art Android static analysis tools avoid handling native method invocation. None of those tools have the capability to capture the inter-language behaviors. In this work, we explore an ensemble mechanism, which presents how the combination of byte-code and native-code analysis of Android applications can be efficiently used to cope with the advanced sophistication of Android malware. We, therefore, present a multi-layer approach that utilizes deep learning, natural language processing (NLP), as well as graph embedding techniques to handle the threats of Android malware, both from the Java byte-code and native code. After that, we design an ensemble algorithm to get the final result of malware detection system. To be specific, the first layer of our detection approach operates on the byte-code of application and the native code level, whereas the second layer focuses on the ensemble algorithm. Large-scale experiments on 100,113 samples (35,113 malware and 65,000 benign) show that only byte-code sub-system yields 99.8% accuracy and native-code sub-system yields an accuracy of 96.6%, whereas the Android-COCO method attains an accuracy of 99.86% which outperforms various related works.

preprint2022arXiv

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

Code-switching is a speech phenomenon occurring when a speaker switches language during a conversation. Despite the spontaneous nature of code-switching in conversational spoken language, most existing works collect code-switching data from read speech instead of spontaneous speech. ASCEND (A Spontaneous Chinese-English Dataset) is a high-quality Mandarin Chinese-English code-switching corpus built on spontaneous multi-turn conversational dialogue sources collected in Hong Kong. We report ASCEND's design and procedure for collecting the speech data, including annotations. ASCEND consists of 10.62 hours of clean speech, collected from 23 bilingual speakers of Chinese and English. Furthermore, we conduct baseline experiments using pre-trained wav2vec 2.0 models, achieving a best performance of 22.69\% character error rate and 27.05% mixed error rate.

preprint2022arXiv

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

Automatic speech recognition (ASR) on low resource languages improves the access of linguistic minorities to technological advantages provided by artificial intelligence (AI). In this paper, we address the problem of data scarcity for the Hong Kong Cantonese language by creating a new Cantonese dataset. Our dataset, Multi-Domain Cantonese Corpus (MDCC), consists of 73.6 hours of clean read speech paired with transcripts, collected from Cantonese audiobooks from Hong Kong. It comprises philosophy, politics, education, culture, lifestyle and family domains, covering a wide range of topics. We also review all existing Cantonese datasets and analyze them according to their speech type, data source, total size and availability. We further conduct experiments with Fairseq S2T Transformer, a state-of-the-art ASR model, on the biggest existing dataset, Common Voice zh-HK, and our proposed MDCC, and the results show the effectiveness of our dataset. In addition, we create a powerful and robust Cantonese ASR model by applying multi-dataset learning on MDCC and Common Voice zh-HK.

preprint2022arXiv

Charactering instrumental noises and stochastic gravitational wave signals from combined time-delay interferometry

LISA will detect gravitational waves (GWs) in the milli-Hz frequency band in space. Time-delay interferometry (TDI) is developed to suppress laser frequency noise beneath the acceleration noise and optical metrology noise. To identify stochastic GW signals, it would be required to characterize these noise components entangled in TDI data streams. In this work, we investigate noises characterization by combining the first-generation TDI channels from Michelson and Relay configurations. The Michelson channels are helpful to characterize acceleration noises in the lower frequency band, and the Relay configuration could effectively resolve optical path noises in the higher frequencies. Synergy could be achieved from their combination to determine these instrumental noises. Based on the characterized noises, we further reconstruct the power spectrum of noise in the selected TDI channel. Two cases are performed to characterize the spectrum shape of a stochastic GW signal. For a modeled signal, its parameter(s) could be directly estimated from the TDI data, and its spectrum could be recovered from the inferred values. And for an unexpected signal, its spectrum may be recognized and retrieved from noise-subtracted residual in which its power spectral density surpasses the noise level.

preprint2022arXiv

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition

With the rise of deep learning and intelligent vehicle, the smart assistant has become an essential in-car component to facilitate driving and provide extra functionalities. In-car smart assistants should be able to process general as well as car-related commands and perform corresponding actions, which eases driving and improves safety. However, there is a data scarcity issue for low resource languages, hindering the development of research and applications. In this paper, we introduce a new dataset, Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR), for in-car command recognition in the Cantonese language with both video and audio data. It consists of 4,984 samples (8.3 hours) of 200 in-car commands recorded by 30 native Cantonese speakers. Furthermore, we augment our dataset using common in-car background noises to simulate real environments, producing a dataset 10 times larger than the collected one. We provide detailed statistics of both the clean and the augmented versions of our dataset. Moreover, we implement two multimodal baselines to demonstrate the validity of CI-AVSR. Experiment results show that leveraging the visual signal improves the overall performance of the model. Although our best model can achieve a considerable quality on the clean test set, the speech recognition quality on the noisy data is still inferior and remains as an extremely challenging task for real in-car speech recognition systems. The dataset and code will be released at https://github.com/HLTCHKUST/CI-AVSR.

preprint2022arXiv

Deep Learning for Free-Hand Sketch: A Survey

Free-hand sketches are highly illustrative, and have been widely used by humans to depict objects or stories from ancient times to the present. The recent prevalence of touchscreen devices has made sketch creation a much easier task than ever and consequently made sketch-oriented applications increasingly popular. The progress of deep learning has immensely benefited free-hand sketch research and applications. This paper presents a comprehensive survey of the deep learning techniques oriented at free-hand sketch data, and the applications that they enable. The main contents of this survey include: (i) A discussion of the intrinsic traits and unique challenges of free-hand sketch, to highlight the essential differences between sketch data and other data modalities, e.g., natural photos. (ii) A review of the developments of free-hand sketch research in the deep learning era, by surveying existing datasets, research topics, and the state-of-the-art methods through a detailed taxonomy and experimental evaluation. (iii) Promotion of future work via a discussion of bottlenecks, open problems, and potential research directions for the community.

preprint2022arXiv

DeepChange: A Large Long-Term Person Re-Identification Benchmark with Clothes Change

Existing person re-identification (re-id) works mostly consider short-term application scenarios without clothes change. In real-world, however, we often dress differently across space and time. To solve this contrast, a few recent attempts have been made on long-term re-id with clothes change. Currently, one of the most significant limitations in this field is the lack of a large realistic benchmark. In this work, we contribute a large, realistic long-term person re-identification benchmark, named as DeepChange. It has several unique characteristics: (1) Realistic and rich personal appearance (e.g., clothes and hair style) and variations: Highly diverse clothes change and styles, with varying reappearing gaps in time from minutes to seasons, different weather conditions (e.g., sunny, cloudy, windy, rainy, snowy, extremely cold) and events (e.g., working, leisure, daily activities). (2) Rich camera setups: Raw videos were recorded by 17 outdoor varying resolution cameras operating in a real-world surveillance system. (3) The currently largest number of (17) cameras, (1, 121) identities, and (178, 407) bounding boxes, over the longest time span (12 months). Further, we investigate multimodal fusion strategies for tackling the clothes change challenge. Extensive experiments show that our fusion models outperform a wide variety of state-of-the-art models on DeepChange. Our dataset and documents are available at https://github.com/PengBoXiangShang/deepchange.

preprint2022arXiv

Density functional theory plus dynamical mean field theory within the framework of linear combination of numerical atomic orbitals: Formulation and benchmarks

The combination of density functional theory with dynamical mean-field theory (DFT+DMFT) has become a powerful first-principles approach to tackle strongly correlated materials in condensed matter physics. The wide use of this approach relies on robust and easy-to-use implementations, and its implementation in various numerical frameworks will increase its applicability on the one hand and help crosscheck the validity of the obtained results on the other. In the work, we develop a formalism within the linear combination of numerical atomic orbital (NAO) basis set framework, which allows for merging NAO-based DFT codes with DMFT quantum impurity solvers. The formalism is implemented by interfacing two NAO-based DFT codes with three DMFT impurity solvers, and its validity is testified by benchmark calculations for a wide range of strongly correlated materials, including 3\textit{d} transition metal compounds, lanthanides, and actinides. Our work not only enables DFT+DMFT calculations using popular and rapidly developing NAO-based DFT code packages, but also facilitates the combination of more advanced beyond-DFT methodologies available in this codes with the DMFT machinery.

preprint2022arXiv

DFT+$U$ within the framework of linear combination of numerical atomic orbitals

We present a formulation and implementation of the DFT+\textit{U} method within the framework of linear combination of numerical atomic orbitals (NAO). Our implementation not only enables single-point total energy and electronic-structure calculations but also provides access to atomic forces and stresses, hence allowing for full structure relaxations of periodic systems. Furthermore, our implementation allows one to deal with non-collinear spin texture, with the spin-orbit coupling (SOC) effect treated self-consistently. The key aspect behind our implementation is a suitable definition of the correlated subspace when multiple atomic orbitals with the same angular momentum are used, and this is addressed via the "Mulliken charge projector" constructed in terms of the first (most localized) atomic orbital within the $d/f$ angular momentum channel. The important Hubbard $U$ and Hund $J$ parameters can be estimated from a screened Coulomb potential of the Yukawa type, with the screening parameter either chosen semi-empirically or determined from the Thomas-Fermi screening model. Benchmark calculations are performed for four late transition metal monoxide bulk systems, i.e., MnO, FeO, CoO, and NiO, and for the 5$d$-electron compounds IrO$_2$. For the former type of systems, we check the performance of our DFT+$U$ implementation for calculating band gaps, magnetic moments, electronic band structures, as well as forces and stresses; for the latter, the efficacy of our DFT+$U$+SOC implementation is assessed. Systematic comparisons with available experimental results, and especially with the results from other implementation schemes are carried out, which demonstrate the validity of our NAO-based DFT+$U$ formalism and implementation.

preprint2022arXiv

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

Large language models can encode a wealth of semantic knowledge about the world. Such knowledge could be extremely useful to robots aiming to act upon high-level, temporally extended instructions expressed in natural language. However, a significant weakness of language models is that they lack real-world experience, which makes it difficult to leverage them for decision making within a given embodiment. For example, asking a language model to describe how to clean a spill might result in a reasonable narrative, but it may not be applicable to a particular agent, such as a robot, that needs to perform this task in a particular environment. We propose to provide real-world grounding by means of pretrained skills, which are used to constrain the model to propose natural language actions that are both feasible and contextually appropriate. The robot can act as the language model's "hands and eyes," while the language model supplies high-level semantic knowledge about the task. We show how low-level skills can be combined with large language models so that the language model provides high-level knowledge about the procedures for performing complex and temporally-extended instructions, while value functions associated with these skills provide the grounding necessary to connect this knowledge to a particular physical environment. We evaluate our method on a number of real-world robotic tasks, where we show the need for real-world grounding and that this approach is capable of completing long-horizon, abstract, natural language instructions on a mobile manipulator. The project's website and the video can be found at https://say-can.github.io/.

preprint2022arXiv

Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner

Large language models have achieved high performance on various question answering (QA) benchmarks, but the explainability of their output remains elusive. Structured explanations, called entailment trees, were recently suggested as a way to explain and inspect a QA system's answer. In order to better generate such entailment trees, we propose an architecture called Iterative Retrieval-Generation Reasoner (IRGR). Our model is able to explain a given hypothesis by systematically generating a step-by-step explanation from textual premises. The IRGR model iteratively searches for suitable premises, constructing a single entailment step at a time. Contrary to previous approaches, our method combines generation steps and retrieval of premises, allowing the model to leverage intermediate conclusions, and mitigating the input size limit of baseline encoder-decoder models. We conduct experiments using the EntailmentBank dataset, where we outperform existing benchmarks on premise retrieval and entailment tree generation, with around 300% gain in overall correctness.

preprint2022arXiv

Global Gravity Field Model from Taiji-1 Observations

Taiji-1 is the first technology demonstration satellite of the Taiji program of China's space-borne gravitational wave antenna. After the demonstration of the key individual technologies, Taiji-1 continues collecting the data of the precision orbit determinations, satellite attitudes, and non-conservative forces exerted on the S/C. Therefore, during its free-fall, Taiji-1 can be viewed as operating in the high-low satellite-to-satellite tracking mode of a gravity recovery mission. In this work, we have selected and analyzed the one month data from Taiji-1's observations, and developed the techniques to resolve the long term interruptions and disturbances in the data due to the scheduled technology demonstration experiments. The first global gravity model \texttt{TJGM-r1911}, that independently derived from China's own satellite mission, is successfully built from Taiji-1's observations. Compared with gravity models from CHAMP and other satellite gravity missions, the accuracy discrepancies exist, which is mainly caused by the data discontinuity problem. As the extended free-falling phase been approved, Taiji-1 could serve as a gravity recovery mission for China since 2022 and it will provide us the independent measurement of both the static and the monthly time-variable global gravity field.

preprint2022arXiv

How to Understand Masked Autoencoders

"Masked Autoencoders (MAE) Are Scalable Vision Learners" revolutionizes the self-supervised learning method in that it not only achieves the state-of-the-art for image pre-training, but is also a milestone that bridges the gap between visual and linguistic masked autoencoding (BERT-style) pre-trainings. However, to our knowledge, to date there are no theoretical perspectives to explain the powerful expressivity of MAE. In this paper, we, for the first time, propose a unified theoretical framework that provides a mathematical understanding for MAE. Specifically, we explain the patch-based attention approaches of MAE using an integral kernel under a non-overlapping domain decomposition setting. To help the research community to further comprehend the main reasons of the great success of MAE, based on our framework, we pose five questions and answer them with mathematical rigor using insights from operator theory.

preprint2022arXiv

hybrid-Falcon: Hybrid Pattern Malware Detection and Categorization with Network Traffic and Program Code

Nowadays, Android is the most dominant operating system in the mobile ecosystem, with billions of people using its apps daily. As expected, this trend did not go unnoticed by miscreants, and Android became the favorite platform for discovering new victims through malicious apps. Moreover, these apps have become so sophisticated that they can bypass anti-malware measures to protect the users. Therefore, it is safe to admit that traditional anti-malware techniques have become cumbersome, sparking the urge to develop an efficient way to detect Android malware. This paper presents hybrid-Flacon, a hybrid pattern Android malware detection and categorization framework. It combines dynamic and static features of Android malware, which are from network traffic and code graph structure. In hybrid-Flacon, we treat network traffic as a dynamic feature and process it as a 2D image sequence. Meanwhile, hybrid-Flacon handles each network flow in the packet as a 2D image and uses a bidirectional LSTM network to process those 2D-image sequences to obtain vectors representing network packets. We use the program code graph for a static feature and introduce natural language processing (NLP) inspired techniques on function call graph (FCG). We design a graph neural network-based approach to convert the whole graph structure of Android apps to vectors. Finally, We utilize those converted vectors, both network and program code features, and concatenate them to detect and categorize the malware. Our results reveal that hybrid-Flacon yields better results as we get 97.16% accuracy on average for malware detection and 88.32% accuracy for malware categorization. Additionally, we release a dataset AndroNetMnist, which converts the network traffic to a 2D-image sequence and helps to accomplish malware detection on a 2D-image sequence.

preprint2022arXiv

Inexact Newton-CG Algorithms With Complexity Guarantees

We consider variants of a recently-developed Newton-CG algorithm for nonconvex problems \citep{royer2018newton} in which inexact estimates of the gradient and the Hessian information are used for various steps. Under certain conditions on the inexactness measures, we derive iteration complexity bounds for achieving $ε$-approximate second-order optimality that match best-known lower bounds. Our inexactness condition on the gradient is adaptive, allowing for crude accuracy in regions with large gradients. We describe two variants of our approach, one in which the step-size along the computed search direction is chosen adaptively and another in which the step-size is pre-defined. To obtain second-order optimality, our algorithms will make use of a negative curvature direction on some steps. These directions can be obtained, with high-probability, using a certain randomized algorithm. In this sense, all of our results hold with high-probability over the run of the algorithm. We evaluate the performance of our proposed algorithms empirically on several machine learning models.

preprint2022arXiv

Newton-MR: Inexact Newton Method With Minimum Residual Sub-problem Solver

We consider a variant of inexact Newton Method, called Newton-MR, in which the least-squares sub-problems are solved approximately using Minimum Residual method. By construction, Newton-MR can be readily applied for unconstrained optimization of a class of non-convex problems known as invex, which subsumes convexity as a sub-class. For invex optimization, instead of the classical Lipschitz continuity assumptions on gradient and Hessian, Newton-MR's global convergence can be guaranteed under a weaker notion of joint regularity of Hessian and gradient. We also obtain Newton-MR's problem-independent local convergence to the set of minima. We show that fast local/global convergence can be guaranteed under a novel inexactness condition, which, to our knowledge, is much weaker than the prior related works. Numerical results demonstrate the performance of Newton-MR as compared with several other Newton-type alternatives on a few machine learning problems.

preprint2022arXiv

QA4QG: Using Question Answering to Constrain Multi-Hop Question Generation

Multi-hop question generation (MQG) aims to generate complex questions which require reasoning over multiple pieces of information of the input passage. Most existing work on MQG has focused on exploring graph-based networks to equip the traditional Sequence-to-sequence framework with reasoning ability. However, these models do not take full advantage of the constraint between questions and answers. Furthermore, studies on multi-hop question answering (QA) suggest that Transformers can replace the graph structure for multi-hop reasoning. Therefore, in this work, we propose a novel framework, QA4QG, a QA-augmented BART-based framework for MQG. It augments the standard BART model with an additional multi-hop QA module to further constrain the generated question. Our results on the HotpotQA dataset show that QA4QG outperforms all state-of-the-art models, with an increase of 8 BLEU-4 and 8 ROUGE points compared to the best results previously reported. Our work suggests the advantage of introducing pre-trained language models and QA module for the MQG task.

preprint2022arXiv

TOD-CNN: An Effective Convolutional Neural Network for Tiny Object Detection in Sperm Videos

The detection of tiny objects in microscopic videos is a problematic point, especially in large-scale experiments. For tiny objects (such as sperms) in microscopic videos, current detection methods face challenges in fuzzy, irregular, and precise positioning of objects. In contrast, we present a convolutional neural network for tiny object detection (TOD-CNN) with an underlying data set of high-quality sperm microscopic videos (111 videos, $>$ 278,000 annotated objects), and a graphical user interface (GUI) is designed to employ and test the proposed model effectively. TOD-CNN is highly accurate, achieving $85.60\%$ AP$_{50}$ in the task of real-time sperm detection in microscopic videos. To demonstrate the importance of sperm detection technology in sperm quality analysis, we carry out relevant sperm quality evaluation metrics and compare them with the diagnosis results from medical doctors.

preprint2022arXiv

Topological Micromotion of Floquet Quantum Systems

The Floquet Hamiltonian has often been used to describe a time-periodic system. Nevertheless, because the Floquet Hamiltonian depends on a micro-motion parameter, the Floquet Hamiltonian with a fixed micro-motion parameter cannot faithfully represent a driven system, which manifests as the anomalous edge states. Here we show that an accurate description of a Floquet system requires a set of Hamiltonian exhausting all values of the micro-motion parameter, and this micro-motion parameter can be viewed as an extra synthetic dimension of the system. Therefore, we show that a $d$-dimensional Floquet system can be described by a $d+1$-dimensional static Hamiltonian, and the advantage of this representation is that the periodic boundary condition is automatically imposed along the extra-dimension, which enables a straightforward definition of topological invariants. The topological invariant in the $d+1$-dimensional system can ensure a $d-1$-dimensional edge state of the $d$-dimensional Floquet system. Here we show two examples where the topological invariant is a three-dimensional Hopf invariant. We highlight that our scheme of classifying Floquet topology on the micro-motion space is different from the previous classification of Floquet topology on the time space.

preprint2022arXiv

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning

Reinforcement learning can train policies that effectively perform complex tasks. However for long-horizon tasks, the performance of these methods degrades with horizon, often necessitating reasoning over and chaining lower-level skills. Hierarchical reinforcement learning aims to enable this by providing a bank of low-level skills as action abstractions. Hierarchies can further improve on this by abstracting the space states as well. We posit that a suitable state abstraction should depend on the capabilities of the available lower-level policies. We propose Value Function Spaces: a simple approach that produces such a representation by using the value functions corresponding to each lower-level skill. These value functions capture the affordances of the scene, thus forming a representation that compactly abstracts task relevant information and robustly ignores distractors. Empirical evaluations for maze-solving and robotic manipulation tasks demonstrate that our approach improves long-horizon performance and enables better zero-shot generalization than alternative model-free and model-based methods.

preprint2021arXiv

Defect-free arbitrary-geometry assembly of mixed-species atom arrays

Optically trapped mixed-species single atom arrays with arbitrary geometries are an attractive and promising platform for various applications, because tunable quantum systems with multiple components provide extra degrees of freedom for experimental control. Here, we report the first demonstration of two-dimensional $6\times4$ dual-species atom assembly with a filling fraction of 0.88 (0.89) for $^{85}$Rb ($^{87}$Rb) atoms. This mixed-species atomic synthetic is achieved via rearranging initially randomly distributed atoms using a sorting algorithm (heuristic heteronuclear algorithm) which is proposed for bottom-up atom assembly with both user-defined geometries and two-species atom number ratios. Our fully tunable hybrid-atom system of scalable advantages is a good starting point for high-fidelity quantum logic, many-body quantum simulation and forming defect-free single molecule arrays.

preprint2021arXiv

High fidelity entanglement of neutral atoms via a Rydberg-mediated single-modulated-pulse controlled-PHASE gate

Neutral atom platform has become an attractive choice to study the science of quantum information and quantum simulation, where intense efforts have been devoted to the entangling processes between individual atoms. For the development of this area, two-qubit controlled-PHASE gate via Rydberg blockade is one of the most essential elements. Recent theoretical studies have suggested the advantages of introducing non-trivial waveform modulation into the gate protocol, which is anticipated to improve its performance towards the next stage. We report our recent experimental results in realizing a two-qubit controlled-PHASE($C_Z$) gate via off-resonant modulated driving(ORMD) embedded in two-photon transition for Rb atoms. It relies upon a single modulated driving pulse with a carefully calculated smooth waveform to gain the appropriate phase accumulations required by the two-qubit gate. Combining this $C_Z$ gate with global microwave pulses, two-atom entanglement is generated with the raw fidelity of 0.945(6). Accounting for state preparation and measurement (SPAM) errors, we extract the entanglement operation fidelity to be 0.980(7). Our work features completing the $C_Z$ gate operation within a single pulse to avoid shelved Rydberg population, thus demonstrate another promising route for realizing high-fidelity two-qubit gate for neutral atom platform.

preprint2021arXiv

On the criteria of large cavitation bubbles in a tube during a transient process

Extreme cavitation scenarios such as water column separations in hydraulic systems during transient processes caused by large cavitation bubbles can lead to catastrophic destruction. In the present paper, we study the onset criteria and dynamics of large cavitation bubbles in a tube. A new cavitation number $Ca_2 = {l^*}^{-1} Ca_0$ is proposed to describe the maximum length $L_{\max}$ of the cavitation bubble, where $l^*$ is a non-dimensional length of the water column indicating its slenderness, and $Ca_0$ is the classic cavitation number. Combined with the onset criteria for acceleration-induced cavitation ($Ca_1<1$, Pan et al. (2017)), we show that the occurrence of large cylindrical cavitation bubbles requires both $Ca_2<1$ and $Ca_1<1$ simultaneously. We also establish a Rayleigh-type model for the dynamics of large cavitation bubbles in a tube. The bubbles collapse at a finite end speed, and the time from the maximum bubble size to collapse is $T_c=\sqrt{2}\sqrt{lL_{\max}}\sqrt{\fracρ{p_\infty}}$, where $l$ is the length of the water column, $L_{\max}$ is the maximum bubble length, $ρ$ is the liquid density, and $p_{\infty}$ is the reference pressure in the far field. The analytical results are validated against systematic experiments using a modified 'tube-arrest' apparatus, which can decouple acceleration and velocity. The results in the current work can guide design and operation of hydraulic systems encountering transient processes.

preprint2021arXiv

The Twelvefold Way of Non-Sequential Lossless Compression

Many information sources are not just sequences of distinguishable symbols but rather have invariances governed by alternative counting paradigms such as permutations, combinations, and partitions. We consider an entire classification of these invariances called the twelvefold way in enumerative combinatorics and develop a method to characterize lossless compression limits. Explicit computations for all twelve settings are carried out for i.i.d. uniform and Bernoulli distributions. Comparisons among settings provide quantitative insight.

preprint2020arXiv

Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer

In this work, we develop a novel regularizer to improve the learning of long-range dependency of sequence data. Applied on language modelling, our regularizer expresses the inductive bias that sequence variables should have high mutual information even though the model might not see abundant observations for complex long-range dependency. We show how the `next sentence prediction (classification)' heuristic can be derived in a principled way from our mutual information estimation framework, and be further extended to maximize the mutual information of sequence variables. The proposed approach not only is effective at increasing the mutual information of segments under the learned model but more importantly, leads to a higher likelihood on holdout data, and improved generation quality. Code is released at https://github.com/BorealisAI/BMI.

preprint2020arXiv

CAiRE: An Empathetic Neural Chatbot

In this paper, we present an end-to-end empathetic conversation agent CAiRE. Our system adapts TransferTransfo (Wolf et al., 2019) learning approach that fine-tunes a large-scale pre-trained language model with multi-task objectives: response language modeling, response prediction and dialogue emotion detection. We evaluate our model on the recently proposed empathetic-dialogues dataset (Rashkin et al., 2019), the experiment results show that CAiRE achieves state-of-the-art performance on dialogue emotion detection and empathetic response generation.

preprint2020arXiv

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling

As an essential task in task-oriented dialog systems, slot filling requires extensive training data in a certain domain. However, such data are not always available. Hence, cross-domain slot filling has naturally arisen to cope with this data scarcity problem. In this paper, we propose a Coarse-to-fine approach (Coach) for cross-domain slot filling. Our model first learns the general pattern of slot entities by detecting whether the tokens are slot entities or not. It then predicts the specific types for the slot entities. In addition, we propose a template regularization approach to improve the adaptation robustness by regularizing the representation of utterances based on utterance templates. Experimental results show that our model significantly outperforms state-of-the-art approaches in slot filling. Furthermore, our model can also be applied to the cross-domain named entity recognition task, and it achieves better adaptation performance than other existing baselines. The code is available at https://github.com/zliucr/coach.

preprint2020arXiv

Deep Self-Supervised Representation Learning for Free-Hand Sketch

In this paper, we tackle for the first time, the problem of self-supervised representation learning for free-hand sketches. This importantly addresses a common problem faced by the sketch community -- that annotated supervisory data are difficult to obtain. This problem is very challenging in that sketches are highly abstract and subject to different drawing styles, making existing solutions tailored for photos unsuitable. Key for the success of our self-supervised learning paradigm lies with our sketch-specific designs: (i) we propose a set of pretext tasks specifically designed for sketches that mimic different drawing styles, and (ii) we further exploit the use of a textual convolution network (TCN) in a dual-branch architecture for sketch feature learning, as means to accommodate the sequential stroke nature of sketches. We demonstrate the superiority of our sketch-specific designs through two sketch-related applications (retrieval and recognition) on a million-scale sketch dataset, and show that the proposed approach outperforms the state-of-the-art unsupervised representation learning methods, and significantly narrows the performance gap between with supervised representation learning.

preprint2020arXiv

EEG-based Brain-Computer Interfaces (BCIs): A Survey of Recent Studies on Signal Sensing Technologies and Computational Intelligence Approaches and their Applications

Brain-Computer Interface (BCI) is a powerful communication tool between users and systems, which enhances the capability of the human brain in communicating and interacting with the environment directly. Advances in neuroscience and computer science in the past decades have led to exciting developments in BCI, thereby making BCI a top interdisciplinary research area in computational neuroscience and intelligence. Recent technological advances such as wearable sensing devices, real-time data streaming, machine learning, and deep learning approaches have increased interest in electroencephalographic (EEG) based BCI for translational and healthcare applications. Many people benefit from EEG-based BCIs, which facilitate continuous monitoring of fluctuations in cognitive states under monotonous tasks in the workplace or at home. In this study, we survey the recent literature of EEG signal sensing technologies and computational intelligence approaches in BCI applications, compensated for the gaps in the systematic summary of the past five years (2015-2019). In specific, we first review the current status of BCI and its significant obstacles. Then, we present advanced signal sensing and enhancement technologies to collect and clean EEG signals, respectively. Furthermore, we demonstrate state-of-art computational intelligence techniques, including interpretable fuzzy models, transfer learning, deep learning, and combinations, to monitor, maintain, or track human cognitive states and operating performance in prevalent applications. Finally, we deliver a couple of innovative BCI-inspired healthcare applications and discuss some future research directions in EEG-based BCIs.

preprint2020arXiv

Embedding-based Zero-shot Retrieval through Query Generation

Passage retrieval addresses the problem of locating relevant passages, usually from a large corpus, given a query. In practice, lexical term-matching algorithms like BM25 are popular choices for retrieval owing to their efficiency. However, term-based matching algorithms often miss relevant passages that have no lexical overlap with the query and cannot be finetuned to downstream datasets. In this work, we consider the embedding-based two-tower architecture as our neural retrieval model. Since labeled data can be scarce and because neural retrieval models require vast amounts of data to train, we propose a novel method for generating synthetic training data for retrieval. Our system produces remarkable results, significantly outperforming BM25 on 5 out of 6 datasets tested, by an average of 2.45 points for Recall@1. In some cases, our model trained on synthetic data can even outperform the same model trained on real data

preprint2020arXiv

EmoGraph: Capturing Emotion Correlations using Graph Networks

Most emotion recognition methods tackle the emotion understanding task by considering individual emotion independently while ignoring their fuzziness nature and the interconnections among them. In this paper, we explore how emotion correlations can be captured and help different classification tasks. We propose EmoGraph that captures the dependencies among different emotions through graph networks. These graphs are constructed by leveraging the co-occurrence statistics among different emotion categories. Empirical results on two multi-label classification datasets demonstrate that EmoGraph outperforms strong baselines, especially for macro-F1. An additional experiment illustrates the captured emotion correlations can also benefit a single-label classification task.

preprint2020arXiv

Extension of causal decomposition in the mutual complex dynamic process

Causal decomposition depicts a cause-effect relationship that is not based on the concept of prediction, but based on the phase dependence of time series. It has been validated in both stochastic and deterministic systems and is now anticipated for its application in the complex dynamic process. Here, we present an extension of causal decomposition in the mutual complex dynamic process: cause and effect of time series are inherited in the decomposition of intrinsic components in a similar time scale. Furthermore, we illustrate comparative studies with predominate methods used in neuroscience, and show the applicability of the method particularly to physiological time series in brain-muscle interactions, implying the potential to the causality analysis in the complex physiological process.

preprint2020arXiv

Fault Detection and Identification using Bayesian Recurrent Neural Networks

In processing and manufacturing industries, there has been a large push to produce higher quality products and ensure maximum efficiency of processes. This requires approaches to effectively detect and resolve disturbances to ensure optimal operations. While the control system can compensate for many types of disturbances, there are changes to the process which it still cannot handle adequately. It is therefore important to further develop monitoring systems to effectively detect and identify those faults such that they can be quickly resolved by operators. In this paper, a novel probabilistic fault detection and identification method is proposed which adopts a newly developed deep learning approach using Bayesian recurrent neural networks~(BRNNs) with variational dropout. The BRNN model is general and can model complex nonlinear dynamics. Moreover, compared to traditional statistic-based data-driven fault detection and identification methods, the proposed BRNN-based method yields uncertainty estimates which allow for simultaneous fault detection of chemical processes, direct fault identification, and fault propagation analysis. The outstanding performance of this method is demonstrated and contrasted to (dynamic) principal component analysis, which are widely applied in the industry, in the benchmark Tennessee Eastman process~(TEP) and a real chemical manufacturing dataset.

preprint2020arXiv

Fine-Grained Instance-Level Sketch-Based Video Retrieval

Existing sketch-analysis work studies sketches depicting static objects or scenes. In this work, we propose a novel cross-modal retrieval problem of fine-grained instance-level sketch-based video retrieval (FG-SBVR), where a sketch sequence is used as a query to retrieve a specific target video instance. Compared with sketch-based still image retrieval, and coarse-grained category-level video retrieval, this is more challenging as both visual appearance and motion need to be simultaneously matched at a fine-grained level. We contribute the first FG-SBVR dataset with rich annotations. We then introduce a novel multi-stream multi-modality deep network to perform FG-SBVR under both strong and weakly supervised settings. The key component of the network is a relation module, designed to prevent model over-fitting given scarce training data. We show that this model significantly outperforms a number of existing state-of-the-art models designed for video analysis.

preprint2020arXiv

Hierarchical emotion-recognition framework based on discriminative brain neural network topology and ensemble co-decision strategy

Brain neural networks characterize various information propagation patterns for different emotional states. However, the statistical features based on traditional graph theory may ignore the spacial network difference. To reveal these inherent spatial features and increase the stability of emotional recognition, we proposed a hierarchical framework that can perform the multiple emotion recognitions with the multiple emotion-related spatial network topology patterns (MESNP) by combining a supervised learning with ensemble co-decision strategy. To evaluate the performance of our proposed MESNP approach, we conduct both off-line and simulated on-line experiments with two public datasets i.e., MAHNOB and DEAP. The experiment results demonstrated that MESNP can significantly enhance the classification performance for the multiple emotions. The highest accuracies of off-line experiments for MAHNOB-HCI and DEAP achieved 99.93% (3 classes) and 83.66% (4 classes), respectively. For simulated on-line experiments, we also obtained the best classification accuracies with 100% (3 classes) for MAHNOB and 99.22% (4 classes) for DEAP by proposed MESNP. These results further proved the efficiency of MESNP for structured feature extraction in mult-classification emotional task.

preprint2020arXiv

Learning Fast Adaptation on Cross-Accented Speech Recognition

Local dialects influence people to pronounce words of the same language differently from each other. The great variability and complex characteristics of accents creates a major challenge for training a robust and accent-agnostic automatic speech recognition (ASR) system. In this paper, we introduce a cross-accented English speech recognition task as a benchmark for measuring the ability of the model to adapt to unseen accents using the existing CommonVoice corpus. We also propose an accent-agnostic approach that extends the model-agnostic meta-learning (MAML) algorithm for fast adaptation to unseen accents. Our approach significantly outperforms joint training in both zero-shot, few-shot, and all-shot in the mixed-region and cross-region settings in terms of word error rate.

preprint2020arXiv

LUDA: Boost LSM Key Value Store Compactions with GPUs

Log-Structured-Merge (LSM) tree-based key value stores are facing critical challenges of fully leveraging the dramatic performance improvements of the underlying storage devices, which makes the compaction operations of LSM key value stores become CPU-bound, and slow compactions significantly degrade key value store performance. To address this issue, we propose LUDA, an LSM key value store with CUDA, which uses a GPU to accelerate compaction operations of LSM key value stores. How to efficiently parallelize compaction procedures as well as accommodate the optimal performance contract of the GPU architecture challenge LUDA. Specifically, LUDA overcomes these challenges by exploiting the data independence between compaction procedures and using cooperative sort mechanism and judicious data movements. Running on a commodity GPU under different levels of CPU overhead, evaluation results show that LUDA provides up to 2x higher throughput and 2x data processing speed, and achieves more stable 99th percentile latencies than LevelDB and RocksDB.

preprint2020arXiv

Meta-Transfer Learning for Code-Switched Speech Recognition

An increasing number of people in the world today speak a mixed-language as a result of being multilingual. However, building a speech recognition system for code-switching remains difficult due to the availability of limited resources and the expense and significant effort required to collect mixed-language data. We therefore propose a new learning method, meta-transfer learning, to transfer learn on a code-switched speech recognition system in a low-resource setting by judiciously extracting information from high-resource monolingual datasets. Our model learns to recognize individual languages, and transfer them so as to better recognize mixed-language speech by conditioning the optimization on the code-switching data. Based on experimental results, our model outperforms existing baselines on speech recognition and language modeling tasks, and is faster to converge.

preprint2020arXiv

On certain Iwahori--Hecke modules of $GL_3$ in characteristic $p$

In this note, we show that the natural analogue of certain finiteness result of Barthel--Livn$\acute{\text{e}}$ on $GL_2$ fails for $GL_3$. More precisely, within the pro-$p$-Iwahori invariants of a maximal compact induction of $GL_3$, we show there exist non-zero Iwahori--Hecke submodules of \emph{infinite} codimension.

preprint2020arXiv

On Learning Semantic Representations for Million-Scale Free-Hand Sketches

In this paper, we study learning semantic representations for million-scale free-hand sketches. This is highly challenging due to the domain-unique traits of sketches, e.g., diverse, sparse, abstract, noisy. We propose a dual-branch CNNRNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we further explore learning the sketch-oriented semantic representations in two challenging yet practical settings, i.e., hashing retrieval and zero-shot recognition on million-scale sketches. Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to accommodate both the abstract and messy traits of sketches. (ii) We propose a deep embedding model for sketch zero-shot recognition, via collecting a large-scale edge-map dataset and proposing to extract a set of semantic vectors from edge-maps as the semantic knowledge for sketch zero-shot domain alignment. Both deep models are evaluated by comprehensive experiments on million-scale sketches and outperform the state-of-the-art competitors.

preprint2020arXiv

On Variational Learning of Controllable Representations for Text without Supervision

The variational autoencoder (VAE) can learn the manifold of natural images on certain datasets, as evidenced by meaningful interpolating or extrapolating in the continuous latent space. However, on discrete data such as text, it is unclear if unsupervised learning can discover similar latent space that allows controllable manipulation. In this work, we find that sequence VAEs trained on text fail to properly decode when the latent codes are manipulated, because the modified codes often land in holes or vacant regions in the aggregated posterior latent space, where the decoding network fails to generalize. Both as a validation of the explanation and as a fix to the problem, we propose to constrain the posterior mean to a learned probability simplex, and performs manipulation within this simplex. Our proposed method mitigates the latent vacancy problem and achieves the first success in unsupervised learning of controllable representations for text. Empirically, our method outperforms unsupervised baselines and strong supervised approaches on text style transfer, and is capable of performing more flexible fine-grained control over text generation than existing methods.

preprint2020arXiv

Quantum-enhanced interferometry with asymmetric beam splitters

In this paper, we investigate the phase sensitivities in two-path optical interferometry with asymmetric beam splitters. Here, we present the optimal conditions for the transmission ratio and the phase of the beam splitter to gain the highest sensitivities for a general class of non-classical states with parity symmetry. Additionally, we address the controversial question of whether the scheme with a combination of coherent state and photon-added or photon-subtracted squeezed vacuum state is better or worse than the most celebrated one using a combination of coherent state and squeezed vacuum state.

preprint2020arXiv

Reconfigurable Intelligent Surfaces Assisted Communications with Discrete Phase Shifts: How Many Quantization Levels are Required to Achieve Full Diversity?

Due to hardware limitations, the phase shifts of the reflecting elements of reconfigurable intelligent surfaces (RISs) need to be quantized into discrete values. This letter aims to unveil the minimum required number of phase quantization levels $L$ in order to achieve the full diversity order in RIS-assisted wireless communication systems. With the aid of an upper bound of the outage probability, we first prove that the full diversity order is achievable provided that $L$ is not less than three. If $L=2$, on the other hand, we prove that the achievable diversity order cannot exceed $(N+1)/2$, where $N$ is the number of reflecting elements. This is obtained with the aid of a lower bound of the outage probability. Therefore, we prove that the minimum required value of $L$ to achieve the full diversity order is $L=3$. Simulation results verify the theoretical analysis and the impact of phase quantization levels on RIS-assisted communication systems.

preprint2020arXiv

Scalable quantum computing with qudits on a graph

We show a significant reduction of the number of quantum operations and the improvement of the circuit depth for the realization of the Toffoli gate by using qudits. This is done by establishing a general relation between the dimensionality of qudits and their topology of connections for a scalable multi-qudit processor, where higher qudit levels are used for substituting ancillas. The suggested model is of importance for the realization of quantum algorithms and as a method of quantum error correction codes for single-qubit operations.

preprint2020arXiv

Switchable next-nearest-neighbor coupling for controlled two-qubit operations

In a superconducting quantum processor with nearest neighbor coupling, the dispersive interaction between adjacent qubits can result in an effective next-nearest-neighbor coupling whose strength depends on the state of the intermediary qubit. Here, we theoretically explore the possibility of engineering this next-nearest-neighbor coupling to implement controlled two-qubit operations where the intermediary qubit controls the operation on the next-nearest neighbor pair of qubits. In particular, in a system comprising two types of superconducting qubits with anharmonicities of opposite-sign arranged in an -A-B-A- pattern, where the unwanted static ZZ coupling between adjacent qubits could be heavily suppressed, a switchable coupling between the next-nearest-neighbor qubits can be achieved via the intermediary qubit, the qubit state of which functions as an on/off switch for this coupling. Therefore, depending on the adopted activating scheme, various controlled two-qubit operations such as controlled-iSWAP gate can be realized, potentially enabling circuit depth reductions as to a standard decomposition approach for implementing generic quantum algorithms.

preprint2020arXiv

TRANS-BLSTM: Transformer with Bidirectional LSTM for Language Understanding

Bidirectional Encoder Representations from Transformers (BERT) has recently achieved state-of-the-art performance on a broad range of NLP tasks including sentence classification, machine translation, and question answering. The BERT model architecture is derived primarily from the transformer. Prior to the transformer era, bidirectional Long Short-Term Memory (BLSTM) has been the dominant modeling architecture for neural machine translation and question answering. In this paper, we investigate how these two modeling techniques can be combined to create a more powerful model architecture. We propose a new architecture denoted as Transformer with BLSTM (TRANS-BLSTM) which has a BLSTM layer integrated to each transformer block, leading to a joint modeling framework for transformer and BLSTM. We show that TRANS-BLSTM models consistently lead to improvements in accuracy compared to BERT baselines in GLUE and SQuAD 1.1 experiments. Our TRANS-BLSTM model obtains an F1 score of 94.01% on the SQuAD 1.1 development dataset, which is comparable to the state-of-the-art result.

preprint2020arXiv

Variational Transformers for Diverse Response Generation

Despite the great promise of Transformers in many sequence modeling tasks (e.g., machine translation), their deterministic nature hinders them from generalizing to high entropy tasks such as dialogue response generation. Previous work proposes to capture the variability of dialogue responses with a recurrent neural network (RNN)-based conditional variational autoencoder (CVAE). However, the autoregressive computation of the RNN limits the training efficiency. Therefore, we propose the Variational Transformer (VT), a variational self-attentive feed-forward sequence model. The VT combines the parallelizability and global receptive field of the Transformer with the variational nature of the CVAE by incorporating stochastic latent variables into Transformers. We explore two types of the VT: 1) modeling the discourse-level diversity with a global latent variable; and 2) augmenting the Transformer decoder with a sequence of fine-grained latent variables. Then, the proposed models are evaluated on three conversational datasets with both automatic metric and human evaluation. The experimental results show that our models improve standard Transformers and other baselines in terms of diversity, semantic relevance, and human judgment.

preprint2019arXiv

Balanced Coherence Times of Mixed-Species Atomic Qubits in a Dual $3\times3$ Magic-Intensity Optical Dipole Trap Array

In this work, we construct a polarization-mediated magic-intensity (MI) optical dipole trap (ODT) array, in which the detrimental effects of light shifts on the mixed-species qubits are efficiently mitigated so that the coherence times of the mixed-species qubits are both substantially enhanced and balanced for the first time. This mixed-species magic trapping technique relies on the tunability of the coefficient of the third-order cross term and ground state hyperpolarizability, which are inherently dependent on the degree of circular polarization of the trap laser. Experimentally, polarization of the ODT array for $^{85}$Rb qubits is finely adjusted to a definite value so that its working magnetic field required for magic trapping amounts to the one required for magically trapping $^{87}$Rb qubits in another ODT array with fully circular polarization. Ultimately, in such a polarization-mediated MI-ODT array, the coherence times of $^{87}$Rb and $^{85}$Rb qubits are respectively enhanced up to 891$\pm$47 ms and 943$\pm$35 ms. Furthermore, a new source of dephasing effect is revealed, which arises from the noise of the elliptic polarization, and the reduction in corresponding dephasing effect on the $^{85}$Rb qubits is attainable by use of shallow magic intensity. It is anticipated that the novel mixed-species MI-ODT array is a versatile platform for building scalable quantum computers with neutral atoms.

preprint2019arXiv

Integration of the Berry curvature on a qubit state manifold by coupling to a quantum meter system

We present a scheme that allows integration of the Berry curvature and thus determination of the Chern number of a qubit eigenstate manifold. Our proposal continuously couples the qubit with a meter system while it explores a quasi-adiabatic path in the manifold. The accumulated change of one of the meter observables then provides an estimate of the Chern number. By varying the initial state of the meter, we explore the delicate interplay between the measurement precision and the disturbance of the qubit. A simple argument yields a correction factor that allows estimation of the Chern number, even when the qubit is significantly disturbed during the probing. The Chern number arises from the geometric phase accumulated during the exploration, while we observe the dynamic phase to produce a broadening of the meter wave function. We show that a protocol, relying on three subsequent explorations, allows cancellation of the dynamic phase while the geometric phase is retained.

preprint2019arXiv

Preparation of a Heteronuclear Two-atom System in the 3D Motional Ground State in an Optical Tweezer

We report the realization of a heteronuclear two-atom of $^{87}$Rb-$^{85}$Rb in the ground state of an optical tweezer (OT). Starting by trapping two different isotopic single atoms, a $^{87}$Rb and a $^{85}$Rb in two strongly focused and linearly polarized OT with 4 $μ$m apart, we perform simultaneously three dimensional Raman sideband cooling for both atoms and the obtained 3D ground state probabilities of $^{87}$Rb and $^{85}$Rb are 0.91(5) and 0.91(10) respectively. There is no obvious crosstalk observed during the cooling process. We then merge them into one tweezer via a species-dependent transport, where the species-dependent potentials are made by changing the polarization of the OTs for each species from linear polarization to the desired circular polarization. The measurable increment of vibrational quantum due to merging is $0.013(1)$ for the axial dimension. This two-atom system can be used to investigate cold collisional physics, to form quantum logic gates, and to build a single heteronuclear molecule. It can also be scaled up to few-atom regime and extended to other atomic species and molecules, and thus to ultracold chemistry.

preprint2019arXiv

Two-qubit controlled-PHASE Rydberg blockade gate protocol for neutral atoms via off-resonant modulated driving within a single pulse

Neutral atom array serves as an ideal platform to study the quantum logic gates, where intense efforts have been devoted to improve the two-qubit gate fidelity. We report our recent findings in constructing a different type of two-qubit controlled-PHASE quantum gate protocol with neutral atoms enabled by Rydberg blockade, which aims at both robustness and high-fidelity. It relies upon modulated driving pulse with specially tailored smooth waveform to gain appropriate phase accumulations for quantum gates. The major features include finishing gate operation within a single pulse, not necessarily requiring individual site addressing, not sensitive to the exact value of blockade shift while suppressing population leakage error and rotation error. We anticipate its fidelity to be reasonably high under realistic considerations for errors such as atomic motion, laser power fluctuation, power imbalance, spontaneous emission and so on. Moreover, we hope that such type of protocol may inspire future improvements in quantum gate designs for other categories of qubit platforms and new applications in other areas of quantum optimal control.

preprint2019arXiv

ZAIGA: Zhaoshan Long-baseline Atom Interferometer Gravitation Antenna

The Zhaoshan long-baseline Atom Interferometer Gravitation Antenna (ZAIGA) is a new type of underground laser-linked interferometer facility, and is currently under construction. It is in the 200-meter-on-average underground of a mountain named Zhaoshan which is about 80 km southeast to Wuhan. ZAIGA will be equipped with long-baseline atom interferometers, high-precision atom clocks, and large-scale gyros. ZAIGA facility will take an equilateral triangle configuration with two 1-km-apart atom interferometers in each arm, a 300-meter vertical tunnel with atom fountain and atom clocks mounted, and a tracking-and-ranging 1-km-arm-length prototype with lattice optical clocks linked by locked lasers. The ZAIGA facility will be used for experimental research on gravitation and related problems including gravitational wave detection, high-precision test of the equivalence principle of micro-particles, clock based gravitational red-shift measurement, rotation measurement and gravito-magnetic effect.

preprint2018arXiv

Reconfiguration of Brain Network between Resting-state and Oddball Paradigm

The oddball paradigm is widely applied to the investigation of multiple cognitive functions. Prior studies have explored the cortical oscillation and power spectral differing from the resting-state conduction to oddball paradigm, but whether brain networks existing the significant difference is still unclear. Our study addressed how the brain reconfigures its architecture from a resting-state condition (i.e., baseline) to P300 stimulus task in the visual oddball paradigm. In this study, electroencephalogram (EEG) datasets were collected from 24 postgraduate students, who were required to only mentally count the number of target stimulus; afterwards the functional EEG networks constructed in different frequency bands were compared between baseline and oddball task conditions to evaluate the reconfiguration of functional network in the brain. Compared to the baseline, our results showed the significantly (p < 0.05) enhanced delta/theta EEG connectivity and decreased alpha default mode network in the progress of brain reconfiguration to the P300 task. Furthermore, the reconfigured coupling strengths were demonstrated to relate to P300 amplitudes, which were then regarded as input features to train a classifier to differentiate the high and low P300 amplitudes groups with an accuracy of 77.78%. The findings of our study help us to understand the changes of functional brain connectivity from resting-state to oddball stimulus task, and the reconfigured network pattern has the potential for the selection of good subjects for P300-based brain- computer interface.

preprint2016arXiv

A New Line Defect in NdTiO3 Perovskite

Perovskite oxides form an eclectic class of materials owing to their structural flexibility in accommodating cations of different sizes and valences. They host well known point and planar defects, but so far no line defect has been identified other than dislocations. Using analytical scanning transmission electron microscopy (STEM) and ab initio calculations we have detected and characterized the atomic and electronic structures of a novel line defect in NdTiO3 perovskite. It appears in STEM images as a perovskite cell rotated by 45 degrees. It consists of self-organized Ti-O vacancy lines replaced by Nd columns surrounding a central Ti-O octahedral chain containing Ti4+ ions, as opposed to Ti3+ in the host. The distinct Ti valence in this line defect introduces the possibility of engineering exotic conducting properties in a single preferred direction and tailoring novel desirable functionalities in this Mott insulator.

preprint2016arXiv

Coherence preservation of a single neutral atom qubit transferred between magic-intensity optical traps

We demonstrate that the coherence of a single mobile atomic qubit can be well preserved during a transfer process among different optical dipole traps (ODTs). This is a prerequisite step in realizing a large-scale neutral atom quantum information processing platform. A qubit encoded in the hyperfine manifold of $^{87}$Rb atom is dynamically extracted from the static quantum register by an auxiliary moving ODT and reinserted into the static ODT. Previous experiments were limited by decoherences induced by the differential light shifts of qubit states. Here we apply a magic-intensity trapping technique which mitigates the detrimental effects of light shifts and substantially enhances the coherence time to $225 \pm 21\,\mathrm{ms}$. The experimentally demonstrated magic trapping technique relies on the previously neglected hyperpolarizability contribution to the light shifts, which makes the light shift dependence on the trapping laser intensity to be parabolic. Because of the parabolic dependence, at a certain "magic" intensity, the first order sensitivity to trapping light intensity variations over ODT volume is eliminated. We experimentally demonstrate the utility of this approach and measure hyperpolarizability for the first time. Our results pave the way for constructing a scalable quantum-computing architectures with single atoms trapped in an array of magic ODTs.

preprint2016arXiv

Firing regulation of fast-spiking interneurons by autaptic inhibition

Fast-spiking (FS) interneurons in the brain are self-innervated by powerful inhibitory GABAergic autaptic connections. By computational modelling, we investigate how autaptic inhibition regulates the firing response of such interneurons. Our results indicate that autaptic inhibition both boosts the current threshold for action potential generation as well as modulates the input-output gain of FS interneurons. The autaptic transmission delay is identified as a key parameter that controls the firing patterns and determines multistability regions of FS interneurons. Furthermore, we observe that neuronal noise influences the firing regulation of FS interneurons by autaptic inhibition and extends their dynamic range for encoding inputs. Importantly, autaptic inhibition modulates noise-induced irregular firing of FS interneurons, such that coherent firing appears at an optimal autaptic inhibition level. Our result reveal the functional roles of autaptic inhibition in taming the firing dynamics of FS interneurons.

preprint2016arXiv

First-order post-Newtonian analysis of the relativistic tidal effects for satellite gradiometry and the Mashhoon-Theiss anomaly

With continuous advances in technology, future satellite gradiometry missions will be capable of performing precision relativistic experiments and imposing constraints on modern gravity theories. To this end, the full first-order post-Newtonian tidal tensor under inertially guided and Earth-pointing local frames along post-Newtonian orbits is worked out. The physical picture behind the "Mashhoon-Theiss anomaly" is explained at the post-Newtonian level. The relativistic precession of the local frame with respect to the sidereal frame will produce modulations of Newtonian tidal forces along certain bases, which gives rise to two different kinds of secular tidal tensors. The measurements of the secular tidal force from the frame-dragging effect is also discussed.

preprint2016arXiv

Forward and Reverse Entropy Power Inequalities in Convex Geometry

The entropy power inequality, which plays a fundamental role in information theory and probability, may be seen as an analogue of the Brunn-Minkowski inequality. Motivated by this connection to Convex Geometry, we survey various recent developments on forward and reverse entropy power inequalities not just for the Shannon-Boltzmann entropy but also more generally for Rényi entropy. In the process, we discuss connections between the so-called functional (or integral) and probabilistic (or entropic) analogues of some classical inequalities in geometric functional analysis

preprint2016arXiv

Gravitational wave astronomy: the current status

In the centenary year of Einstein's General Theory of Relativity, this paper reviews the current status of gravitational wave astronomy across a spectrum which stretches from attohertz to kilohertz frequencies. Sect. 1 of this paper reviews the historical development of gravitational wave astronomy from Einstein's first prediction to our current understanding the spectrum. It is shown that detection of signals in the audio frequency spectrum can be expected very soon, and that a north-south pair of next generation detectors would provide large scientific benefits. Sect. 2 reviews the theory of gravitational waves and the principles of detection using laser interferometry. The state of the art Advanced LIGO detectors are then described. These detectors have a high chance of detecting the first events in the near future. Sect. 3 reviews the KAGRA detector currently under development in Japan, which will be the first laser interferometer detector to use cryogenic test masses. Sect. 4 of this paper reviews gravitational wave detection in the nanohertz frequency band using the technique of pulsar timing. Sect. 5 reviews the status of gravitational wave detection in the attohertz frequency band, detectable in the polarisation of the cosmic microwave background, and discusses the prospects for detection of primordial waves from the big bang. The techniques described in sects. 1-5 have already placed significant limits on the strength of gravitational wave sources. Sects. 6 and 7 review ambitious plans for future space based gravitational wave detectors in the millihertz frequency band. Sect. 6 presents a roadmap for development of space based gravitational wave detectors by China while sect. 7 discusses a key enabling technology for space interferometry known as time delay interferometry.

preprint2016arXiv

Learning Subclass Representations for Visually-varied Image Classification

In this paper, we present a subclass-representation approach that predicts the probability of a social image belonging to one particular class. We explore the co-occurrence of user-contributed tags to find subclasses with a strong connection to the top level class. We then project each image on to the resulting subclass space to generate a subclass representation for the image. The novelty of the approach is that subclass representations make use of not only the content of the photos themselves, but also information on the co-occurrence of their tags, which determines membership in both subclasses and top-level classes. The novelty is also that the images are classified into smaller classes, which have a chance of being more visually stable and easier to model. These subclasses are used as a latent space and images are represented in this space by their probability of relatedness to all of the subclasses. In contrast to approaches directly modeling each top-level class based on the image content, the proposed method can exploit more information for visually diverse classes. The approach is evaluated on a set of $2$ million photos with 10 classes, released by the Multimedia 2013 Yahoo! Large-scale Flickr-tag Image Classification Grand Challenge. Experiments show that the proposed system delivers sound performance for visually diverse classes compared with methods that directly model top classes.

preprint2016arXiv

On the Outage Performance of Non-Orthogonal Multiple Access with One-Bit Feedback

In this paper, the outage performance of downlink non-orthogonal multiple access (NOMA) is investigated for the case where each user feeds back only one bit of its channel state information (CSI) to the base station. Conventionally, opportunistic one-bit feedback has been used in fading broadcast channels to select only one user for transmission. In contrast, the considered NOMA scheme adopts superposition coding to serve all users simultaneously in order to improve user fairness. A closed-form expression for the common outage probability (COP) is derived, along with the optimal diversity gains under two types of power constraints. Particularly, it is demonstrated that the diversity gain under a long-term power constraint is twice as large as that under a short-term power constraint. Furthermore, we study dynamic power allocation optimization for minimizing the COP, based on one-bit CSI feedback. This problem is challenging since the objective function is non-convex; however, under the short-term power constraint, we demonstrate that the original problem can be transformed into a set of convex problems. Under the long-term power constraint, an asymptotically optimal solution is obtained for high signal-to-noise ratio.

preprint2016arXiv

Preserving coherent spin and squeezed spin states of a spin-1 Bose-Einstein condensate with rotary echoes

A challenge in precision measurement with squeezed spin state arises from the spin dephasing due to stray magnetic fields. To suppress such environmental noises, we employ a continuous driving protocol, rotary echo, to enhance the spin coherence of a spin-1 Bose-Einstein condensate in stray magnetic fields. Our analytical and numerical results show that the coherent and the squeezed spin states are preserved for a significantly long time, compared to the free induction decay time, if the condition $hτ= mπ$ is met with $h$ the pulse amplitude and $τ$ pulse width. In particular, both the spin average and the spin squeezing, including the direction and the amplitude, are simultaneously fixed for a squeezed spin state. Our results point out a practical way to implement quantum measurements based on a spin-1 condensate beyond the standard quantum limit.

preprint2016arXiv

Regulation of Irregular Neuronal Firing by Autaptic Transmission

The importance of self-feedback autaptic transmission in modulating spike-time irregularity is still poorly understood. By using a biophysical model that incorporates autaptic coupling, we here show that self-innervation of neurons participates in the modulation of irregular neuronal firing, primarily by regulating the occurrence frequency of burst firing. In particular, we find that both excitatory and electrical autapses increase the occurrence of burst firing, thus reducing neuronal firing regularity. In contrast, inhibitory autapses suppress burst firing and therefore tend to improve the regularity of neuronal firing. Importantly, we show that these findings are independent of the firing properties of individual neurons, and as such can be observed for neurons operating in different modes. Our results provide an insightful mechanistic understanding of how different types of autapses shape irregular firing at the single-neuron level, and they highlight the functional importance of autaptic self-innervation in taming and modulating neurodynamics.

preprint2016arXiv

Relativistic Dynamics of Relative Motions (I): Post-Newtonian Extension of the Hill-Clohessy-Wiltshire Equations

With continuous advances in technologies related to deep space ranging and satellite gravity gradiometry, corrections from general relativity to the dynamics of relative orbital motions will certainly become important. In this work, we extend,in a systematic way, the Hill-Clohessy-Wiltshire Equations to include the complete first order post-Newtonian effects from general relativity. Within certain short time limit, post-Newtonian corrections to general periodic solutions of the Hill-Clohessy-Wiltshire Equations are also worked out.

preprint2016arXiv

Roadmap for gravitational wave detection in space - a preliminary study

Part of a review paper entitled "Gravitational wave astronomy: the current status.", appeared in " Science China Physics, Mechanics & Astronomy 58.12 (2015): 1-41.

preprint2016arXiv

Role of electron-electron interactions in the charge dynamics of rare-earth-doped CaFe2As2

We have investigated the charge dynamics and the nature of many-body interactions in La- and Pr- doped CaFe2As2. From the infrared part of the optical conductivity, we discover that the scattering rate of mobile carriers above 200 K exhibits saturation at the Mott-Ioffe-Regel limit of metallic transport. However, the dc resistivity continues to increase with temperature above 200 K due to the loss of Drude spectral weight. The loss of Drude spectral weight with increasing temperature is seen in a wide temperature range in the uncollapsed tetragonal phase, and this spectral weight is recovered at energy scales about one order of magnitude larger than the Fermi energy scale in these semimetals. The phenomena noted above have been observed previously in other correlated metals in which the dominant interactions are electronic in origin. Further evidence of significant electron-electron interactions is obtained from the presence of quadratic temperature and frequency-dependent terms in the scattering rate at low temperatures and frequencies in the uncollapsed tetragonal structures of La-doped and Pr-doped CaFe2As2. For temperatures below the structure collapse transition in Pr-doped CaFe2As2 at 70 K, the scattering rate decreases due to weakening of electronic correlations, and the Drude spectral weight decreases due to modification of the low-energy electronic structure.

preprint2016arXiv

Sub-sampled Newton Methods with Non-uniform Sampling

We consider the problem of finding the minimizer of a convex function $F: \mathbb R^d \rightarrow \mathbb R$ of the form $F(w) := \sum_{i=1}^n f_i(w) + R(w)$ where a low-rank factorization of $\nabla^2 f_i(w)$ is readily available. We consider the regime where $n \gg d$. As second-order methods prove to be effective in finding the minimizer to a high-precision, in this work, we propose randomized Newton-type algorithms that exploit \textit{non-uniform} sub-sampling of $\{\nabla^2 f_i(w)\}_{i=1}^{n}$, as well as inexact updates, as means to reduce the computational complexity. Two non-uniform sampling distributions based on {\it block norm squares} and {\it block partial leverage scores} are considered in order to capture important terms among $\{\nabla^2 f_i(w)\}_{i=1}^{n}$. We show that at each iteration non-uniformly sampling at most $\mathcal O(d \log d)$ terms from $\{\nabla^2 f_i(w)\}_{i=1}^{n}$ is sufficient to achieve a linear-quadratic convergence rate in $w$ when a suitable initial point is provided. In addition, we show that our algorithms achieve a lower computational complexity and exhibit more robustness and better dependence on problem specific quantities, such as the condition number, compared to similar existing methods, especially the ones based on uniform sampling. Finally, we empirically demonstrate that our methods are at least twice as fast as Newton's methods with ridge logistic regression on several real datasets.

preprint2015arXiv

Band alignment of epitaxial SrTiO3 thin films with (LaAlO3)0.3-(Sr2AlTaO6)0.7 (001)

SrTiO$_{3}$ (STO) epitaxial thin films and heterostructures are of considerable interest due to the wide range of functionalities they exhibit. The alloy perovskite (LaAlO$_{3}$)$_{0.3}$-(Sr$_{2}$AlTaO$_{6}$)$_{0.7}$ (LSAT) is commonly used as a substrate for these material structures due to its structural compatibility with STO and the strain-induced ferroelectric response in STO films grown on LSAT. However, surprisingly little is known about the electronic properties of the STO/LSAT interface despite its potentially important role in affecting the overall electronic structure of system. We examine the band alignment of STO/LSAT heterostructures using x-ray photoelectron spectroscopy for epitaxial STO films deposited using two different molecular beam epitaxy approaches. We find that the valence band offset ranges from +0.2(1) eV to -0.2(1) eV depending on the film surface termination. From these results we extract a conduction band offset from -2.4(1) eV to -2.8(1) eV, indicating that the conduction band edge is more deeply bound in STO and that LSAT will not act as a sink or trap for electrons in the supported film or multilayer.

preprint2015arXiv

Controllable high-fidelity quantum state transfer and entanglement generation in circuit QED

We propose a scheme to realize controllable quantum state transfer and entanglement generation among transmon qubits in the typical circuit QED setup based on adiabatic passage. Through designing the time-dependent driven pulses applied on the transmon qubits, we find that fast quantum sate transfer can be achieved between arbitrary two qubits and quantum entanglement among the qubits also can also be engineered. Furthermore, we numerically analyzed the influence of the decoherence on our scheme with the current experimental accessible systematical parameters. The result shows that our scheme is very robust against both the cavity decay and qubit relaxation, the fidelities of the state transfer and entanglement preparation process could be very high. In addition, our scheme is also shown to be insensitive to the inhomogeneous of qubit-resonator coupling strengths.

preprint2015arXiv

Interaction induced decay of a heteronuclear two-atom system

Two-atom systems in small traps are of fundamental interest, first of all for understanding the role of interactions in degenerate cold gases and for the creation of quantum gates in quantum information processing with single-atom traps. One of the key quantities is the inelastic relaxation (decay) time when one of the atoms or both are in a higher hyperfine state. Here we measure this quantity in a heteronuclear system of $^{87}$Rb and $^{85}$Rb in a micro optical trap and demonstrate experimentally and theoretically the presence of both fast and slow relaxation processes, depending on the choice of the initial hyperfine states. The developed experimental method allows us to single out a particular relaxation process and, in this sense, our experiment is a "superclean platform" for collisional physics studies. Our results have also implications for engineering of quantum states via controlled collisions and creation of two-qubit quantum gates.

preprint2015arXiv

Mixing Rates of Random Walks with Little Backtracking

Many regular graphs admit a natural partition of their edge set into cliques of the same order such that each vertex is contained in the same number of cliques. In this paper, we study the mixing rate of certain random walks on such graphs and we generalize previous results of Alon, Benjamini, Lubetzky and Sodin regarding the mixing rates of non-backtracking random walks on regular graphs.

preprint2015arXiv

NOMA: An Information Theoretic Perspective

In this letter, the performance of non-orthogonal multiple access (NOMA) is investigated from an information theoretic perspective. The relationships among the capacity region of broadcast channels and two rate regions achieved by NOMA and time-division multiple access (TDMA) are illustrated first. Then, the performance of NOMA is evaluated by considering TDMA as the benchmark, where both the sum rate and the individual user rates are used as the criteria. In a wireless downlink scenario with user pairing, the developed analytical results show that NOMA can outperform TDMA not only for the sum rate but also for each user's individual rate, particularly when the difference between the users' channels is large.

preprint2015arXiv

Signature of biased range in the non-dynamical Chern-Simons modified gravity and its measurements with satellite-satellite tracking missions: Theoretical studies

Having great accuracy in the range and range rate measurements,he GRACE mission and the planed GRACE Follow On mission can in principle be employed to place strong constraints on certain relativistic gravitational theories. In this paper, we work out the range observable of the non-dynamical Chern-Simons modified gravity for the Satellite-to-Satellite Tracking (SST) measurements. We find out that a characteristic time accumulating range signal appears in non-dynamical Chern-Simons gravity, which has no analogue found in the standard parity-preserving metric theories of gravity. The magnitude of this Chern-Simons range signal will reach a few times of $χcm$ for each free flight of these SST missions, here $χ$ is the dimensionless post-Newtonian parameter of the non-dynamical Chern-Simons theory. Therefore, with the 12 years data of the GRACE mission, one expects that the mass scale $M_{CS}=\frac{4\hbar c}{χa}$ of the non-dynamical Chern-Simons gravity could be constrained to be larger than $1.9\times10^{-9}eV$. For the GRACE FO mission that scheduled to be launched in 2017, the much stronger bound that $M_{CS}\geq5\times10^{-7}eV$ is expected.

preprint2015arXiv

Simulating dynamical quantum Hall effect with superconducting qubits

We propose an experimental scheme to simulate the dynamical quantum Hall effect and the related interaction-induced topological transition with a superconducting-qubit array. We show that a one-dimensional Heisenberg model with tunable parameters can be realized in an array of superconducting qubits. The quantized plateaus, which is a feature of the dynamical quantum Hall effect, will emerge in the Berry curvature of the superconducting qubits as a function of the coupling strength between nearest neighbor qubits. We numerically calculate the Berry curvatures of two-, four- and six-qubit arrays, and find that the interaction-induced topological transition can be easily observed with the simplest two-qubit array. Furthermore, we analyze some practical conditions in typical experiments for observing such dynamical quantum Hall effect.

preprint2015arXiv

Spacing Distribution of a Bernoulli Sampled Sequence

We investigate the spacing distribution of sequence \[S_n=\left\{0,\frac{1}{n},\frac{2}{n},\dots,\frac{n-1}{n},1\right\}\] after Bernoulli sampling. We describe the closed form expression of the probability mass function of the spacings, and show that the spacings converge in distribution to a geometric random variable.

preprint2015arXiv

Testing Chern-Simons modified gravity with orbiting superconductive gravity gradiometers --- The non-dynamical formulation

High precision Superconductivity Gravity Gradiometers (SGG) are powerful tools for relativistic experiments. In this paper, we work out the tidal signals in non-dynamical Chern-Simons modified gravity, which could be measured by orbiting SGGs around Earth. We find that, with proper orientations of multi-axes SGGs, the tidal signals from the Chern-Simons modification can be isolated in the combined data of different axes. Furthermore, for three-axes SGGs, such combined data is the trace of the total tidal matrix, which is invariant under the rotations of SGG axes and thus free from axis pointing errors. Following nearly circular orbits, the tests of the parity-violating Chern-Simons modification and the measurements of the gravitomagnetic sector in parity-conserving metric theories can be carried out independently in the same time. A first step analysis on noise sources is also included.

preprint2015arXiv

The Private Key Capacity of a Cooperative Pairwise-Independent Network

This paper studies the private key generation of a cooperative pairwise-independent network (PIN) with M+2 terminals (Alice, Bob and M relays), M >= 2. In this PIN, the correlated sources observed by every pair of terminals are independent of those sources observed by any other pair of terminal. All the terminals can communicate with each other over a public channel which is also observed by Eve noiselessly. The objective is to generate a private key between Alice and Bob under the help of the M relays; such a private key needs to be protected not only from Eve but also from individual relays simultaneously. The private key capacity of this PIN model is established, whose lower bound is obtained by proposing a novel random binning (RB) based key generation algorithm, and the upper bound is obtained based on the construction of M enhanced source models. The two bounds are shown to be exactly the same. Then, we consider a cooperative wireless network and use the estimates of fading channels to generate private keys. It has been shown that the proposed RB-based algorithm can achieve a multiplexing gain M-1, an improvement in comparison with the existing XOR- based algorithm whose achievable multiplexing gain is about [M]/2.

preprint2014arXiv

A remark on the simple cuspidal representations of GL(n, F)

Let $F$ be a non-archimedean local field of residue characteristic $p$, $G$ be the group $GL(n, F)$. In this note, under the assumption $(n, p)=1$, we show a simple cuspidal representation $π$ (that with normalized level $\frac{1}{n}$) of $G$ is determined uniquely up to isomorphism by the local constants of $χ\circ \text{det}\otimes π$ for all characters $χ$ of $F^\times$.

preprint2014arXiv

Bidirectional Control of Absence Seizures by the Basal Ganglia: A Computational Evidence

Absence epilepsy is believed to be associated with the abnormal interactions between the cerebral cortex and thalamus. Besides the direct coupling, anatomical evidence indicates that the cerebral cortex and thalamus also communicate indirectly through an important intermediate bridge--basal ganglia. It has been thus postulated that the basal ganglia might play key roles in the modulation of absence seizures, but the relevant biophysical mechanisms are still not completely established. Using a biophysically based model, we demonstrate here that the typical absence seizure activities can be controlled and modulated by the direct GABAergic projections from the substantia nigra pars reticulata (SNr) to either the thalamic reticular nucleus (TRN) or the specific relay nuclei (SRN) of thalamus, through different biophysical mechanisms. Under certain conditions, these two types of seizure control are observed to coexist in the same network. More importantly, due to the competition between the inhibitory SNr-TRN and SNr-SRN pathways, we find that both decreasing and increasing the activation of SNr neurons from the normal level may considerably suppress the generation of SWDs in the coexistence region. Overall, these results highlight the bidirectional functional roles of basal ganglia in controlling and modulating absence seizures, and might provide novel insights into the therapeutic treatments of this brain disorder.

preprint2014arXiv

Descope of the ALIA mission

The present work reports on a feasibility study commissioned by the Chinese Academy of Sciences of China to explore various possible mission options to detect gravitational waves in space alternative to that of the eLISA/LISA mission concept. Based on the relative merits assigned to science and technological viability, a few representative mission options descoped from the ALIA mission are considered. A semi-analytic Monte Carlo simulation is carried out to understand the cosmic black hole merger histories starting from intermediate mass black holes at high redshift as well as the possible scientific merits of the mission options considered in probing the light seed black holes and their coevolution with galaxies in early Universe. The study indicates that, by choosing the armlength of the interferometer to be three million kilometers and shifting the sensitivity floor to around one-hundredth Hz, together with a very moderate improvement on the position noise budget, there are certain mission options capable of exploring light seed, intermediate mass black hole binaries at high redshift that are not readily accessible to eLISA/LISA, and yet the technological requirements seem to within reach in the next few decades for China.

preprint2014arXiv

Hecke modules and supersingular representations of U(2,1)

Let F be a nonarchimedean local field of odd residual characteristic p. We classify finite-dimensional simple right modules for the pro-p-Iwahori-Hecke algebra $\mathcal{H}_C(G,I(1))$, where G is the unramified unitary group U(2,1)(E/F) in three variables. Using this description when C is the algebraic closure of $\mathbb{F}_p$, we define supersingular Hecke modules and show that the functor of I(1)-invariants induces a bijection between irreducible nonsupersingular mod-p representations of G and nonsupersingular simple right $\mathcal{H}_C(G,I(1))$-modules. We then use an argument of Paskunas to construct supersingular representations of G.

preprint2014arXiv

On the Jacquet Conjecture on the Local Converse Problem for p-adic GL_n

Based on previous results of Jiang, Nien and the third author, we prove that any two minimax unitarizable supercuspidals of GL_N that have the same depth and central character admit a special pair of Whittaker functions. This result gives a new reduction towards a final proof of Jacquet's conjecture on the local converse problem for GL_N. As a corollary of our result, we prove Jacquet's conjecture for GL_N, when N is prime.

preprint2014arXiv

Self-Organized Platinum Nanoparticles on Freestanding Graphene

Freestanding graphene membranes were successfully functionalized with platinum nanoparticles (Pt NPs). High-resolution transmission electron microscopy revealed a homogeneous distribution of single-crystal Pt NPs that tend to exhibit a preferred orientation. Unexpectedly, the NPs were also found to be partially exposed to the vacuum with the top Pt surface raised above the graphene substrate, as deduced from atomic-scale scanning tunneling microscopy images and detailed molecular dynamics simulations. Local strain accumulation during the growth process is thought to be the origin of the NP self-organization. These findings are expected to shape future approaches in developing Pt NP catalysts for fuel cells as well as NP-functionalized graphene based high-performance electronics.

preprint2013arXiv

Anisotropic Infrared Response of Vanadium Dioxide Microcrystals

Vanadium dioxide (VO2) undergoes a phase transition at a temperature of 340 K between an insulating monoclinic M1 phase and a conducting rutile phase. Accurate measurements of possible anisotropy of the electronic properties and phonon features of VO2 in the insulating monoclinic M1 and metallic rutile phases are a prerequisite for understanding the phase transition in this correlated system. Recently, it has become possible to grow single domain untwinned VO2 microcrystals which makes it possible to investigate the true anisotropy of VO2. We performed polarized transmission infrared micro-spectroscopy on these untwinned microcrystals in the spectral range between 200 cm-1 and 6000 cm-1 and have obtained the anisotropic phonon parameters and low frequency electronic properties in the insulating monoclinic M1 and metallic rutile phases. We have also performed ab initio GGA+U total energy calculations of phonon frequencies for both phases. We find our measurements and calculations to be in good agreement.

preprint2013arXiv

Large Scale Distributed Acoustic Modeling With Back-off N-grams

The paper revives an older approach to acoustic modeling that borrows from n-gram language modeling in an attempt to scale up both the amount of training data and model size (as measured by the number of parameters in the model), to approximately 100 times larger than current sizes used in automatic speech recognition. In such a data-rich setting, we can expand the phonetic context significantly beyond triphones, as well as increase the number of Gaussian mixture components for the context-dependent states that allow it. We have experimented with contexts that span seven or more context-independent phones, and up to 620 mixture components per state. Dealing with unseen phonetic contexts is accomplished using the familiar back-off technique used in language modeling due to implementation simplicity. The back-off acoustic model is estimated, stored and served using MapReduce distributed computing infrastructure. Speech recognition experiments are carried out in an N-best list rescoring framework for Google Voice Search. Training big models on large amounts of data proves to be an effective way to increase the accuracy of a state-of-the-art automatic speech recognition system. We use 87,000 hours of training data (speech along with transcription) obtained by filtering utterances in Voice Search logs on automatic speech recognition confidence. Models ranging in size between 20--40 million Gaussians are estimated using maximum likelihood training. They achieve relative reductions in word-error-rate of 11% and 6% when combined with first-pass models trained using maximum likelihood, and boosted maximum mutual information, respectively. Increasing the context size beyond five phones (quinphones) does not help.

preprint2013arXiv

Suppressing phase decoherence of a single atom qubit with CPMG sequence

We experimentally demonstrate the strong suppression of dephasing of a qubit stored in a single \textsuperscript{87}Rb atom in an optical dipole trap by using Carr-Purcell-Meiboom-Gill(CPMG) pulse sequences. Regarded as a repetition of spin echo, CPMG sequence is an optimal choice for suppressing both inhomogeneous and homogeneous phase decoherence mechanisms. In the trap with 830 nm wavelength and 0.7 mK potential depth, the spin relaxation time of single atoms is showed to be 830.8 ms. We obtain the reversible inhomogeneous dephasing time of $T_{2}^{\ast}$=1.4 ms. The homogeneous dephasing time is $T_{2}^{\prime}$ =102.7 ms in the spin echo process, by employing CPMG sequence with pulse number n = 6 the homogeneous dephasing is further suppressed by a factor of 3, and its corresponding coherence time is extended to $T_{2}^{\prime}$=304.5 ms.

preprint2012arXiv

Combining red- and blue-detuned optical potentials to form a Lamb-Dicke trap for a single neutral atom

We propose and demonstrate a scheme for strongly radially confining a single neutral atom in a bichromatic far-off resonance optical dipole trap(BFORT) . BFORT is composed of a blue-detuned Laguerre-Gaussian $LG^1_ 0$ beam and a red-detuned Gaussian beam. The trapping radial dimension of a single atom trapped in the Gaussian FORT can be greatly compressed by imposing a blue-detuned Laguerre-Gaussian $LG^1_ 0$ beam with moderate potential depth. By modulating the potential depth of the Gaussian FORT we observed that the resonant and parametric excitation of the oscillatory motion of a single atom in this BFORT and obtained the oscillation frequency that well fits prediction from the theoretical model. The frequency measurement shows that effective trapping dimension can be greatly sharper than that diffraction limited of microscopic objective we used. Then we show that the excess scattering rate due to imposing blue detuned light can be eliminated when single atoms is close to ground-state theoretically. So BFORT suits the purpose of acting as a Lamb-Dicke trap for further cooling a single neutral atom to motion ground-state and finding application in quantum information progressing.

preprint2012arXiv

On the Application of Noisy Network Coding to the Relay-Eavesdropper Channel

In this paper, we consider the design of a new secrecy transmission scheme for a four-node relay-eavesdropper channel. The key idea of the proposed scheme is to combine noisy network coding with the interference assisted strategy for wiretap channel with a helping interferer. A new achievable secrecy rate is characterized for both discrete memoryless and Gaussian channels. Such a new rate can be viewed as a general framework, where the existing interference assisted schemes such as noisy-forwarding and cooperative jamming approaches can be shown as special cases of the proposed scheme. In addition, under some channel condition where the existing schemes can only achieve zero secrecy rate, the proposed secrecy scheme can still offer significant performance gains.

preprint2009arXiv

Emergence of scalar matter from spinfoam model

A spinfoam model of 3D gravity non-minimally coupled with a scalar field is studied. By discretization of the scalar field, the model is worked out precisely in a purely combinational way. It is shown that the quantum physics of the scalar matter are totally encoded into the modified dynamics of SU(2) spin-network states which describe the quantum geometry of space. It turns out that the physics of the scalar matter coupled with gravity manifested in the low energy scale can be viewed as the phenomena emerged from this microscopical construction. This gives rise to a radical observation on the issue of the unification of geometry and matter.

Peng Xu

What is connected

Connect this record

See the researcher in context

Building this map preview

94 published item(s)

Ada-MK: Adaptive MegaKernel Optimization via Automated DAG-based Search for LLM Inference

Deep Learning Driven Buffer-Aided Cooperative Networks for B5G/6G: Challenges, Solutions, and Future Opportunities

$r$-Hyperideals and Generalizations of $r$-Hyperideals in Krasner Hyperrings

$δ$-$r$-Hyperideals and $ϕ$-$δ$-$r$-Hyperideals of Commutative Krasner Hyperrings

A UCB-based Tree Search Approach to Joint Verification-Correction Strategy for Large Scale Systems

Android-COCO: Android Malware Detection with Graph Neural Network for Byte- and Native-Code

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset

Charactering instrumental noises and stochastic gravitational wave signals from combined time-delay interferometry

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition

Deep Learning for Free-Hand Sketch: A Survey

DeepChange: A Large Long-Term Person Re-Identification Benchmark with Clothes Change

Density functional theory plus dynamical mean field theory within the framework of linear combination of numerical atomic orbitals: Formulation and benchmarks

DFT+$U$ within the framework of linear combination of numerical atomic orbitals

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner

Global Gravity Field Model from Taiji-1 Observations

How to Understand Masked Autoencoders

hybrid-Falcon: Hybrid Pattern Malware Detection and Categorization with Network Traffic and Program Code

Inexact Newton-CG Algorithms With Complexity Guarantees

Newton-MR: Inexact Newton Method With Minimum Residual Sub-problem Solver

QA4QG: Using Question Answering to Constrain Multi-Hop Question Generation

TOD-CNN: An Effective Convolutional Neural Network for Tiny Object Detection in Sperm Videos

Topological Micromotion of Floquet Quantum Systems

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning

Defect-free arbitrary-geometry assembly of mixed-species atom arrays

High fidelity entanglement of neutral atoms via a Rydberg-mediated single-modulated-pulse controlled-PHASE gate

On the criteria of large cavitation bubbles in a tube during a transient process

The Twelvefold Way of Non-Sequential Lossless Compression

Better Long-Range Dependency By Bootstrapping A Mutual Information Regularizer

CAiRE: An Empathetic Neural Chatbot

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling

Deep Self-Supervised Representation Learning for Free-Hand Sketch

EEG-based Brain-Computer Interfaces (BCIs): A Survey of Recent Studies on Signal Sensing Technologies and Computational Intelligence Approaches and their Applications

Embedding-based Zero-shot Retrieval through Query Generation

EmoGraph: Capturing Emotion Correlations using Graph Networks

Extension of causal decomposition in the mutual complex dynamic process

Fault Detection and Identification using Bayesian Recurrent Neural Networks

Fine-Grained Instance-Level Sketch-Based Video Retrieval

Hierarchical emotion-recognition framework based on discriminative brain neural network topology and ensemble co-decision strategy

Learning Fast Adaptation on Cross-Accented Speech Recognition

LUDA: Boost LSM Key Value Store Compactions with GPUs

Meta-Transfer Learning for Code-Switched Speech Recognition

On certain Iwahori--Hecke modules of $GL_3$ in characteristic $p$

On Learning Semantic Representations for Million-Scale Free-Hand Sketches

On Variational Learning of Controllable Representations for Text without Supervision

Quantum-enhanced interferometry with asymmetric beam splitters

Reconfigurable Intelligent Surfaces Assisted Communications with Discrete Phase Shifts: How Many Quantization Levels are Required to Achieve Full Diversity?

Scalable quantum computing with qudits on a graph

Switchable next-nearest-neighbor coupling for controlled two-qubit operations

TRANS-BLSTM: Transformer with Bidirectional LSTM for Language Understanding

Variational Transformers for Diverse Response Generation

Balanced Coherence Times of Mixed-Species Atomic Qubits in a Dual $3\times3$ Magic-Intensity Optical Dipole Trap Array

Integration of the Berry curvature on a qubit state manifold by coupling to a quantum meter system

Preparation of a Heteronuclear Two-atom System in the 3D Motional Ground State in an Optical Tweezer

Two-qubit controlled-PHASE Rydberg blockade gate protocol for neutral atoms via off-resonant modulated driving within a single pulse

ZAIGA: Zhaoshan Long-baseline Atom Interferometer Gravitation Antenna

Reconfiguration of Brain Network between Resting-state and Oddball Paradigm

A New Line Defect in NdTiO3 Perovskite

Coherence preservation of a single neutral atom qubit transferred between magic-intensity optical traps

Firing regulation of fast-spiking interneurons by autaptic inhibition

First-order post-Newtonian analysis of the relativistic tidal effects for satellite gradiometry and the Mashhoon-Theiss anomaly

Forward and Reverse Entropy Power Inequalities in Convex Geometry

Gravitational wave astronomy: the current status

Learning Subclass Representations for Visually-varied Image Classification

On the Outage Performance of Non-Orthogonal Multiple Access with One-Bit Feedback

Preserving coherent spin and squeezed spin states of a spin-1 Bose-Einstein condensate with rotary echoes

Regulation of Irregular Neuronal Firing by Autaptic Transmission

Relativistic Dynamics of Relative Motions (I): Post-Newtonian Extension of the Hill-Clohessy-Wiltshire Equations

Roadmap for gravitational wave detection in space - a preliminary study

Role of electron-electron interactions in the charge dynamics of rare-earth-doped CaFe2As2

Sub-sampled Newton Methods with Non-uniform Sampling

Band alignment of epitaxial SrTiO3 thin films with (LaAlO3)0.3-(Sr2AlTaO6)0.7 (001)

Controllable high-fidelity quantum state transfer and entanglement generation in circuit QED