Source author record

Xin Wang

Xin Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

quant-ph Computer Vision Machine Learning Information Theory math.IT astro-ph.CO eess.AS cond-mat.mes-hall Sound Artificial Intelligence astro-ph.SR eess.SP Computation and Language Cryptography and Security astro-ph.GA math-ph math.MP cond-mat.str-el astro-ph.HE cond-mat.mtrl-sci hep-th hep-ph Networking and Internet Architecture physics.soc-ph Social and Information Networks Databases Distributed, Parallel, and Cluster Computing eess.IV cond-mat.supr-con physics.atom-ph physics.plasm-ph math.CO math.OC Systems and Control Biological Physics eess.SY gr-qc physics.optics cond-mat.dis-nn cond-mat.quant-gas Data Structures and Algorithms Information Retrieval math.AG Multimedia nlin.SI physics.chem-ph astro-ph.EP cond-mat.soft Human-Computer Interaction math.DG Neurons and Cognition nucl-th Operating Systems physics.app-ph physics.geo-ph Programming Languages Robotics Applications Emerging Technologies Graphics math.NT math.ST Methodology Multiagent Systems nucl-ex Populations and Evolution q-fin.GN Software Engineering Statistics Theory

Catalog footprint

What is connected

319works

69topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces

Vision-language model (VLM) based web agents demonstrate impressive autonomous GUI interaction but remain vulnerable to deceptive interface elements. Existing approaches either detect deception without task integration or document attacks without proposing defenses. We formalize deception-aware web agent defense and propose DUDE (Deceptive UI Detector & Evaluator), a two-stage framework combining hybrid-reward learning with asymmetric penalties and experience summarization to distill failure patterns into transferable guidance. We introduce RUC (Real UI Clickboxes), a benchmark of 1,407 scenarios spanning four domains and deception categories. Experiments show DUDE reduces deception susceptibility by 53.8% while maintaining task performance, establishing an effective foundation for robust web agent deployment.

preprint2025arXiv

Continuous Angular Power Spectrum Recovery From Channel Covariance via Chebyshev Polynomials

This paper proposes a Chebyshev polynomial expansion framework for the recovery of a continuous angular power spectrum (APS) from channel covariance. By exploiting the orthogonality of Chebyshev polynomials in a transformed domain, we derive an exact series representation of the covariance and reformulate the inherently ill-posed APS inversion as a finite-dimensional linear regression problem via truncation. The associated approximation error is directly controlled by the tail of the APS's Chebyshev series and decays rapidly with increasing angular smoothness. Building on this representation, we derive an exact semidefinite characterization of nonnegative APS and introduce a derivative-based regularizer that promotes smoothly varying APS profiles while preserving transitions of clusters. Simulation results show that the proposed Chebyshev-based framework yields accurate APS reconstruction, and enables reliable downlink (DL) covariance prediction from uplink (UL) measurements in a frequency division duplex (FDD) setting. These findings indicate that jointly exploiting smoothness and nonnegativity in a Chebyshev domain provides an effective tool for covariance-domain processing in multi-antenna systems.

preprint2024arXiv

Collaborative Watermarking for Adversarial Speech Synthesis

Advances in neural speech synthesis have brought us technology that is not only close to human naturalness, but is also capable of instant voice cloning with little data, and is highly accessible with pre-trained models available. Naturally, the potential flood of generated content raises the need for synthetic speech detection and watermarking. Recently, considerable research effort in synthetic speech detection has been related to the Automatic Speaker Verification and Spoofing Countermeasure Challenge (ASVspoof), which focuses on passive countermeasures. This paper takes a complementary view to generated speech detection: a synthesis system should make an active effort to watermark the generated speech in a way that aids detection by another machine, but remains transparent to a human listener. We propose a collaborative training scheme for synthetic speech watermarking and show that a HiFi-GAN neural vocoder collaborating with the ASVspoof 2021 baseline countermeasure models consistently improves detection performance over conventional classifier training. Furthermore, we demonstrate how collaborative training can be paired with augmentation strategies for added robustness against noise and time-stretching. Finally, listening tests demonstrate that collaborative training has little adverse effect on perceptual quality of vocoded speech.

preprint2024arXiv

Early Results from GLASS-JWST XXIII: The transmission of Lyman-alpha from UV-faint z ~ 3-6 galaxies

Lyman-alpha (Ly$α$) emission from galaxies can be used to trace neutral hydrogen in the epoch of reionization, however, there is a degeneracy between the attenuation of Ly$α$ in the intergalactic medium (IGM) and the line profile emitted from the galaxy. Large shifts of Ly$α$ redward of systemic due to scattering in the interstellar medium can boost Ly$α$ transmission in the IGM during reionization. The relationship between Ly$α$ velocity offset from systemic and other galaxy properties is not well-established at high-redshift or low luminosities, due to the difficulty of observing emission lines which trace systemic redshift. Rest-frame optical spectroscopy with JWST/NIRSpec has opened a new window into understanding of Ly$α$ at z>3. We present a sample of 12 UV-faint galaxies ($-20 \lesssim$ MUV $\lesssim -16$) at $3 \lesssim z \lesssim 6$, with Ly$α$ velocity offsets, $Δv_{\mathrm{Ly}α}$, measured from VLT/MUSE and JWST/NIRSpec from the GLASS-JWST Early Release Program. We find median $Δv_{\mathrm{Ly}α}$ of 205 km s$^{-1}$ and standard deviation 75 km s$^{-1}$, compared to 320 and 170km s$^{-1}$ for MUV < -20 galaxies in the literature. Our new sample demonstrates the previously observed trend of decreasing Ly$α$ velocity offset with decreasing UV luminosity and optical line velocity dispersion, extends to MUV $\gtrsim$ -20, consistent with a picture where the Ly$α$ profile is shaped by gas close to the systemic redshift. Our results imply that during reionization Ly$α$ from UV-faint galaxies will be preferentially attenuated, but that detecting Ly$α$ with low $Δv_{\mathrm{Ly}α}$ can be an indicator of large ionized bubbles.

preprint2024arXiv

High-Efficiency Resonant Beam Charging and Communication

With the development of Internet of Things (IoT), demands of power and data for IoT devices increase drastically. In order to resolve the supply-demand contradiction, simultaneous wireless information and power transfer (SWIPT) has been envisioned as an enabling technology by providing high-power energy transfer and high-rate data delivering concurrently. In this paper, we introduce a high-efficiency resonant beam (RB) charging and communication scheme. The scheme utilizes the semiconductor materials as gain medium, which has a better energy absorption capacity compared with the traditional solid-state one. Moreover, the telescope internal modulator (TIM) are adopted in the scheme which can concentrate beams to match the gain size, reducing the transmission loss. To evaluate the scheme SWIPT performance, we establish an analytical model and study the influence factors of its beam transmission, energy conversion, output power, and spectral efficiency. Numerical results shows that the proposed RB system can realize 16 W electric power output with 11 % end-to-end conversion efficiency, and support 18 bit/s/Hz spectral efficiency for communication.

preprint2024arXiv

Lyman Continuum Emission from AGN at 2.3$\lesssim$z$\lesssim$3.7 in the UVCANDELS Fields

We present the results of our search for Lyman continuum (LyC) emitting AGN at redshifts 2.3$\lesssim$z$\lesssim$4.9 from HST WFC3 F275W observations in the UVCANDELS fields. We also include LyC emission from AGN using HST WFC3 F225W, F275W, and F336W found in the ERS and HDUV data. We performed exhaustive queries of the Vizier database to locate AGN with high quality spectroscopic redshifts. In total, we found 51 AGN that met our criteria within the UVCANDELS and ERS footprints. Of these 51, we find 12 AGN had $\geq$4$σ$ detected LyC flux in the WFC3/UVIS images. Using space- and ground-based data from X-ray to radio, we fit the multi-wavelength photometric data of each AGN to a CIGALE SED and correlate various SED parameters to the LyC flux. KS-tests of the SED parameter distributions for the LyC-detected and non-detected AGN showed they are likely not distinct samples. However, we find that X-ray luminosity, star-formation onset age, and disk luminosity show strong correlations relative to their emitted LyC flux. We also find strong correlation of the LyC flux to several dust parameters, i.e., polar and toroidal dust emission, 6 $μm$ luminosity, and anti-correlation with metallicity and $A_{FUV}$. We simulate the LyC escape fraction ($f_{esc}$) using the CIGALE and IGM transmission models for the LyC-detected AGN and find an average $f_{esc}$$\simeq$18%, weighted by uncertainties. We stack the LyC flux of subsamples of AGN according to the wavelength continuum region in which they are detected and find no significant distinctions in their LyC emission, although our $sub-mm\ detected$ F336W sample shows the brightest stacked LyC flux. These findings indicate that LyC-production and -escape in AGN is more complicated than the simple assumption of thermal emission and a 100% escape fraction. Further testing of AGN models with larger samples than presented here is needed.

preprint2024arXiv

Spoofing attack augmentation: can differently-trained attack models improve generalisation?

A reliable deepfake detector or spoofing countermeasure (CM) should be robust in the face of unpredictable spoofing attacks. To encourage the learning of more generaliseable artefacts, rather than those specific only to known attacks, CMs are usually exposed to a broad variety of different attacks during training. Even so, the performance of deep-learning-based CM solutions are known to vary, sometimes substantially, when they are retrained with different initialisations, hyper-parameters or training data partitions. We show in this paper that the potency of spoofing attacks, also deep-learning-based, can similarly vary according to training conditions, sometimes resulting in substantial degradations to detection performance. Nevertheless, while a RawNet2 CM model is vulnerable when only modest adjustments are made to the attack algorithm, those based upon graph attention networks and self-supervised learning are reassuringly robust. The focus upon training data generated with different attack algorithms might not be sufficient on its own to ensure generaliability; some form of spoofing attack augmentation at the algorithm level can be complementary.

preprint2023arXiv

Learning-based Intelligent Surface Configuration, User Selection, Channel Allocation, and Modulation Adaptation for Jamming-resisting Multiuser OFDMA Systems

Reconfigurable intelligent surfaces (RISs) can potentially combat jamming attacks by diffusing jamming signals. This paper jointly optimizes user selection, channel allocation, modulation-coding, and RIS configuration in a multiuser OFDMA system under a jamming attack. This problem is non-trivial and has never been addressed, because of its mixed-integer programming nature and difficulties in acquiring channel state information (CSI) involving the RIS and jammer. We propose a new deep reinforcement learning (DRL)-based approach, which learns only through changes in the received data rates of the users to reject the jamming signals and maximize the sum rate of the system. The key idea is that we decouple the discrete selection of users, channels, and modulation-coding from the continuous RIS configuration, hence facilitating the RIS configuration with the latest twin delayed deep deterministic policy gradient (TD3) model. Another important aspect is that we show a winner-takes-all strategy is almost surely optimal for selecting the users, channels, and modulation-coding, given a learned RIS configuration. Simulations show that the new approach converges fast to fulfill the benefit of the RIS, due to its substantially small state and action spaces. Without the need of the CSI, the approach is promising and offers practical value.

preprint2023arXiv

Suppression of laser beam's polarization and intensity fluctuation via a Mach-Zehnder interferometer with proper feedback

Long ground-Rydberg coherence lifetime is interesting for implementing high-fidelity quantum logic gates, many-body physics, and other quantum information protocols. However, the potential well formed by a conventional far-off-resonance red-detuned optical-dipole trap that is attractive for ground-state cold atoms is usually repulsive for Rydberg atoms, which will result in the rapid loss of atoms and low repetition rate of the experimental sequence. Moreover, the coherence time will be sharply shortened due to the residual thermal motion of cold atoms. These issues can be addressed by a one-dimensional magic lattice trap, which can form a deeper potential trap than the traveling wave optical dipole trap when the output power is limited. In addition, these common techniques for atomic confinement generally have certain requirements for the polarization and intensity stability of the laser. Here, we demonstrated a method to suppress both the polarization drift and power fluctuation only based on the phase management of the Mach-Zehnder interferometer for a one-dimensional magic lattice trap. With the combination of three wave plates and the interferometer, we used the instrument to collect data in the time domain, analyzed the fluctuation of laser intensity, and calculated the noise power spectral density. We found that the total intensity fluctuation comprising laser power fluctuation and polarization drift was significantly suppressed, and the noise power spectral density after closed-loop locking with a typical bandwidth of 1-3000 Hz was significantly lower than that under the free running of the laser system. Typically, at 1000 Hz, the noise power spectral density after locking was about 10 dB lower than that under the free running of a master oscillator power amplifier system.The intensity-polarization control technique provides potential applications.

preprint2022arXiv

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach

Temporal Sentence Grounding in Videos (TSGV), which aims to ground a natural language sentence in an untrimmed video, has drawn widespread attention over the past few years. However, recent studies have found that current benchmark datasets may have obvious moment annotation biases, enabling several simple baselines even without training to achieve SOTA performance. In this paper, we take a closer look at existing evaluation protocols, and find both the prevailing dataset and evaluation metrics are the devils that lead to untrustworthy benchmarking. Therefore, we propose to re-organize the two widely-used datasets, making the ground-truth moment distributions different in the training and test splits, i.e., out-of-distribution (OOD) test. Meanwhile, we introduce a new evaluation metric "dR@n,IoU@m" that discounts the basic recall scores to alleviate the inflating evaluation caused by biased datasets. New benchmarking results indicate that our proposed evaluation protocols can better monitor the research progress. Furthermore, we propose a novel causality-based Multi-branch Deconfounding Debiasing (MDD) framework for unbiased moment prediction. Specifically, we design a multi-branch deconfounder to eliminate the effects caused by multiple confounders with causal intervention. In order to help the model better align the semantics between sentence queries and video moments, we enhance the representations during feature encoding. Specifically, for textual information, the query is parsed into several verb-centered phrases to obtain a more fine-grained textual feature. For visual information, the positional information has been decomposed from moment features to enhance representations of moments with diverse locations. Extensive experiments demonstrate that our proposed approach can achieve competitive results among existing SOTA approaches and outperform the base model with great gains.

preprint2022arXiv

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

Clustering is a fundamental machine learning task which has been widely studied in the literature. Classic clustering methods follow the assumption that data are represented as features in a vectorized form through various representation learning techniques. As the data become increasingly complicated and complex, the shallow (traditional) clustering methods can no longer handle the high-dimensional data type. With the huge success of deep learning, especially the deep unsupervised learning, many representation learning techniques with deep architectures have been proposed in the past decade. Recently, the concept of Deep Clustering, i.e., jointly optimizing the representation learning and clustering, has been proposed and hence attracted growing attention in the community. Motivated by the tremendous success of deep learning in clustering, one of the most fundamental machine learning tasks, and the large number of recent advances in this direction, in this paper we conduct a comprehensive survey on deep clustering by proposing a new taxonomy of different state-of-the-art approaches. We summarize the essential components of deep clustering and categorize existing methods by the ways they design interactions between deep representation learning and clustering. Moreover, this survey also provides the popular benchmark datasets, evaluation metrics and open-source implementations to clearly illustrate various experimental settings. Last but not least, we discuss the practical applications of deep clustering and suggest challenging topics deserving further investigations as future directions.

preprint2022arXiv

A Practical Guide to Logical Access Voice Presentation Attack Detection

Voice-based human-machine interfaces with an automatic speaker verification (ASV) component are commonly used in the market. However, the threat from presentation attacks is also growing since attackers can use recent speech synthesis technology to produce a natural-sounding voice of a victim. Presentation attack detection (PAD) for ASV, or speech anti-spoofing, is therefore indispensable. Research on voice PAD has seen significant progress since the early 2010s, including the advancement in PAD models, benchmark datasets, and evaluation campaigns. This chapter presents a practical guide to the field of voice PAD, with a focus on logical access attacks using text-to-speech and voice conversion algorithms and spoofing countermeasures based on artifact detection. It introduces the basic concept of voice PAD, explains the common techniques, and provides an experimental study using recent methods on a benchmark dataset. Code for the experiments is open-sourced.

preprint2022arXiv

A Theoretical View on Sparsely Activated Networks

Deep and wide neural networks successfully fit very complex functions today, but dense models are starting to be prohibitively expensive for inference. To mitigate this, one promising direction is networks that activate a sparse subgraph of the network. The subgraph is chosen by a data-dependent routing function, enforcing a fixed mapping of inputs to subnetworks (e.g., the Mixture of Experts (MoE) paradigm in Switch Transformers). However, prior work is largely empirical, and while existing routing functions work well in practice, they do not lead to theoretical guarantees on approximation ability. We aim to provide a theoretical explanation for the power of sparse networks. As our first contribution, we present a formal model of data-dependent sparse networks that captures salient aspects of popular architectures. We then introduce a routing function based on locality sensitive hashing (LSH) that enables us to reason about how well sparse networks approximate target functions. After representing LSH-based sparse networks with our model, we prove that sparse networks can match the approximation power of dense networks on Lipschitz functions. Applying LSH on the input vectors means that the experts interpolate the target function in different subregions of the input space. To support our theory, we define various datasets based on Lipschitz target functions, and we show that sparse networks give a favorable trade-off between number of active units and approximation quality.

preprint2022arXiv

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

In this paper, we propose a dynamic cascaded encoder Automatic Speech Recognition (ASR) model, which unifies models for different deployment scenarios. Moreover, the model can significantly reduce model size and power consumption without loss of quality. Namely, with the dynamic cascaded encoder model, we explore three techniques to maximally boost the performance of each model size: 1) Use separate decoders for each sub-model while sharing the encoders; 2) Use funnel-pooling to improve the encoder efficiency; 3) Balance the size of causal and non-causal encoders to improve quality and fit deployment constraints. Overall, the proposed large-medium model has 30% smaller size and reduces power consumption by 33%, compared to the baseline cascaded encoder model. The triple-size model that unifies the large, medium, and small models achieves 37% total size reduction with minimal quality loss, while substantially reducing the engineering efforts of having separate models.

preprint2022arXiv

Accidental symmetries in the scalar potential of the Standard Model extended with two Higgs triplets

The extension of the Standard Model (SM) with two Higgs triplets offers an appealing way to account for both tiny Majorana neutrino masses via the type-II seesaw mechanism and the cosmological matter-antimatter asymmetry via the triplet leptogenesis. In this paper, we classify all possible accidental symmetries in the scalar potential of the two-Higgs-triplet model (2HTM). Based on the bilinear-field formalism, we show that the maximal symmetry group of the 2HTM potential is ${\rm SO(4)}$ and eight types of accidental symmetries in total can be identified. Furthermore, we examine the impact of the couplings between the SM Higgs doublet and the Higgs triplets on the accidental symmetries. The bounded-from-below conditions on the scalar potential with specific accidental symmetries are also derived. Taking the ${\rm SO(4)}$-invariant scalar potential as an example, we investigate the vacuum structures and the scalar mass spectra of the 2HTM.

preprint2022arXiv

Adaptive Worker Grouping For Communication-Efficient and Straggler-Tolerant Distributed SGD

Wall-clock convergence time and communication load are key performance metrics for the distributed implementation of stochastic gradient descent (SGD) in parameter server settings. Communication-adaptive distributed Adam (CADA) has been recently proposed as a way to reduce communication load via the adaptive selection of workers. CADA is subject to performance degradation in terms of wall-clock convergence time in the presence of stragglers. This paper proposes a novel scheme named grouping-based CADA (G-CADA) that retains the advantages of CADA in reducing the communication load, while increasing the robustness to stragglers at the cost of additional storage at the workers. G-CADA partitions the workers into groups of workers that are assigned the same data shards. Groups are scheduled adaptively at each iteration, and the server only waits for the fastest worker in each selected group. We provide analysis and experimental results to elaborate the significant gains on the wall-clock time, as well as communication load and computation load, of G-CADA over other benchmark schemes.

preprint2022arXiv

Adversarial Attack Framework on Graph Embedding Models with Limited Knowledge

With the success of the graph embedding model in both academic and industry areas, the robustness of graph embedding against adversarial attack inevitably becomes a crucial problem in graph learning. Existing works usually perform the attack in a white-box fashion: they need to access the predictions/labels to construct their adversarial loss. However, the inaccessibility of predictions/labels makes the white-box attack impractical to a real graph learning system. This paper promotes current frameworks in a more general and flexible sense -- we demand to attack various kinds of graph embedding models with black-box driven. We investigate the theoretical connections between graph signal processing and graph embedding models and formulate the graph embedding model as a general graph signal process with a corresponding graph filter. Therefore, we design a generalized adversarial attacker: GF-Attack. Without accessing any labels and model predictions, GF-Attack can perform the attack directly on the graph filter in a black-box fashion. We further prove that GF-Attack can perform an effective attack without knowing the number of layers of graph embedding models. To validate the generalization of GF-Attack, we construct the attacker on four popular graph embedding models. Extensive experiments validate the effectiveness of GF-Attack on several benchmark datasets.

preprint2022arXiv

An Edge-Cloud Integrated Framework for Flexible and Dynamic Stream Analytics

With the popularity of Internet of Things (IoT), edge computing and cloud computing, more and more stream analytics applications are being developed including real-time trend prediction and object detection on top of IoT sensing data. One popular type of stream analytics is the recurrent neural network (RNN) deep learning model based time series or sequence data prediction and forecasting. Different from traditional analytics that assumes data are available ahead of time and will not change, stream analytics deals with data that are being generated continuously and data trend/distribution could change (a.k.a. concept drift), which will cause prediction/forecasting accuracy to drop over time. One other challenge is to find the best resource provisioning for stream analytics to achieve good overall latency. In this paper, we study how to best leverage edge and cloud resources to achieve better accuracy and latency for stream analytics using a type of RNN model called long short-term memory (LSTM). We propose a novel edge-cloud integrated framework for hybrid stream analytics that supports low latency inference on the edge and high capacity training on the cloud. To achieve flexible deployment, we study different approaches of deploying our hybrid learning framework including edge-centric, cloud-centric and edge-cloud integrated. Further, our hybrid learning framework can dynamically combine inference results from an LSTM model pre-trained based on historical data and another LSTM model re-trained periodically based on the most recent data. Using real-world and simulated stream datasets, our experiments show the proposed edge-cloud deployment is the best among all three deployment types in terms of latency. For accuracy, the experiments show our dynamic learning approach performs the best among all learning approaches for all three concept drift scenarios.

preprint2022arXiv

Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions

In our previous work, we proposed a language-independent speaker anonymization system based on self-supervised learning models. Although the system can anonymize speech data of any language, the anonymization was imperfect, and the speech content of the anonymized speech was distorted. This limitation is more severe when the input speech is from a domain unseen in the training data. This study analyzed the bottleneck of the anonymization system under unseen conditions. It was found that the domain (e.g., language and channel) mismatch between the training and test data affected the neural waveform vocoder and anonymized speaker vectors, which limited the performance of the whole system. Increasing the training data diversity for the vocoder was found to be helpful to reduce its implicit language and channel dependency. Furthermore, a simple correlation-alignment-based domain adaption strategy was found to be significantly effective to alleviate the mismatch on the anonymized speaker vectors. Audio samples and source code are available online.

preprint2022arXiv

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

The performance of spoofing countermeasure systems depends fundamentally upon the use of sufficiently representative training data. With this usually being limited, current solutions typically lack generalisation to attacks encountered in the wild. Strategies to improve reliability in the face of uncontrolled, unpredictable attacks are hence needed. We report in this paper our efforts to use self-supervised learning in the form of a wav2vec 2.0 front-end with fine tuning. Despite initial base representations being learned using only bona fide data and no spoofed data, we obtain the lowest equal error rates reported in the literature for both the ASVspoof 2021 Logical Access and Deepfake databases. When combined with data augmentation,these results correspond to an improvement of almost 90% relative to our baseline system.

preprint2022arXiv

Chiral Quantum Network with Giant Atoms

In superconducting quantum circuits (SQCs), chiral routing quantum information is often realized with the ferrite circulators, which are usually bulky, lossy and require strong magnetic fields. To overcome those problems, we propose a novel method to realize chiral quantum networks by exploiting giant atom effects in SQC platforms. By assuming each coupling point being modulated with time, the interaction becomes momentum-dependent, and giant atoms will chirally emit photons due to interference effects. The chiral factor can approach 1, and both the emission direction and rate can be freely tuned by the modulating signals. We demonstrate that a high-fidelity state transfer between remote giant atoms can be realized. Our proposal can be integrated on the superconducting chip easily, and has the potential to work as a tunable toolbox for quantum information processing in future chiral quantum networks.

preprint2022arXiv

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training

Recent years have witnessed increasing interest in code representation learning, which aims to represent the semantics of source code into distributed vectors. Currently, various works have been proposed to represent the complex semantics of source code from different views, including plain text, Abstract Syntax Tree (AST), and several kinds of code graphs (e.g., Control/Data Flow Graph). However, most of them only consider a single view of source code independently, ignoring the correspondences among different views. In this paper, we propose to integrate different views with the natural-language description of source code into a unified framework with Multi-View contrastive Pre-training, and name our model as CODE-MVP. Specifically, we first extract multiple code views using compiler tools, and learn the complementary information among them under a contrastive learning framework. Inspired by the type checking in compilation, we also design a fine-grained type inference objective in the pre-training. Experiments on three downstream tasks over five datasets demonstrate the superiority of CODE-MVP when compared with several state-of-the-art baselines. For example, we achieve 2.4/2.3/1.1 gain in terms of MRR/MAP/Accuracy metrics on natural language code retrieval, code similarity, and code defect detection tasks, respectively.

preprint2022arXiv

Communication-Efficient Local SGD with Age-Based Worker Selection

A major bottleneck of distributed learning under parameter-server (PS) framework is communication cost due to frequent bidirectional transmissions between the PS and workers. To address this issue, local stochastic gradient descent (SGD) and worker selection have been exploited by reducing the communication frequency and the number of participating workers at each round, respectively. However, partial participation can be detrimental to convergence rate, especially for heterogeneous local datasets. In this paper, to improve communication efficiency and speed up the training process, we develop a novel worker selection strategy named AgeSel. The key enabler of AgeSel is utilization of the ages of workers to balance their participation frequencies. The convergence of local SGD with the proposed age-based partial worker participation is rigorously established. Simulation results demonstrate that the proposed AgeSel strategy can significantly reduce the number of training rounds needed to achieve a targeted accuracy, as well as the communication cost. The influence of the algorithm hyper-parameter is also explored to manifest the benefit of age-based worker selection.

preprint2022arXiv

Compilable Neural Code Generation with Compiler Feedback

Automatically generating compilable programs with (or without) natural language descriptions has always been a touchstone problem for computational linguistics and automated software engineering. Existing deep-learning approaches model code generation as text generation, either constrained by grammar structures in decoder, or driven by pre-trained language models on large-scale code corpus (e.g., CodeGPT, PLBART, and CodeT5). However, few of them account for compilability of the generated programs. To improve compilability of the generated programs, this paper proposes COMPCODER, a three-stage pipeline utilizing compiler feedback for compilable code generation, including language model fine-tuning, compilability reinforcement, and compilability discrimination. Comprehensive experiments on two code generation tasks demonstrate the effectiveness of our proposed approach, improving the success rate of compilation from 44.18 to 89.18 in code completion on average and from 70.3 to 96.2 in text-to-code generation, respectively, when comparing with the state-of-the-art CodeGPT.

preprint2022arXiv

Context-Aware Streaming Perception in Dynamic Environments

Efficient vision works maximize accuracy under a latency budget. These works evaluate accuracy offline, one image at a time. However, real-time vision applications like autonomous driving operate in streaming settings, where ground truth changes between inference start and finish. This results in a significant accuracy drop. Therefore, a recent work proposed to maximize accuracy in streaming settings on average. In this paper, we propose to maximize streaming accuracy for every environment context. We posit that scenario difficulty influences the initial (offline) accuracy difference, while obstacle displacement in the scene affects the subsequent accuracy degradation. Our method, Octopus, uses these scenario properties to select configurations that maximize streaming accuracy at test time. Our method improves tracking performance (S-MOTA) by 7.4% over the conventional static approach. Further, performance improvement using our method comes in addition to, and not instead of, advances in offline accuracy.

preprint2022arXiv

Cosmological constraints from the density gradient weighted correlation function

The mark weighted correlation function (MCF) $W(s,μ)$ is a computationally efficient statistical measure which can probe clustering information beyond that of the conventional 2-point statistics. In this work, we extend the traditional mark weighted statistics by using powers of the density field gradient $|\nabla ρ/ρ|^α$ as the weight, and use the angular dependence of the scale-averaged MCFs to constrain cosmological parameters. The analysis shows that the gradient based weighting scheme is statistically more powerful than the density based weighting scheme, while combining the two schemes together is more powerful than separately using either of them. Utilising the density weighted or the gradient weighted MCFs with $α=0.5,\ 1$, we can strengthen the constraint on $Ω_m$ by factors of 2 or 4, respectively, compared with the standard 2-point correlation function, while simultaneously using the MCFs of the two weighting schemes together can be $1.25$ times more statistically powerful than using the gradient weighting scheme alone. The mark weighted statistics may play an important role in cosmological analysis of future large-scale surveys. Many issues, including the possibility of using other types of weights, the influence of the bias on this statistics, as well as the usage of MCFs in the tomographic Alcock-Paczynski method, are worth further investigations.

preprint2022arXiv

Coupling two charge qubits via a superconducting resonator operating in the resonant and dispersive regimes

A key challenge for semiconductor quantum-dot charge qubits is the realization of long-range qubit coupling and performing high-fidelity gates based on it. Here, we describe a new type of charge qubit formed by an electron confined in a triple-quantum-dot system, enabling single and two-qubit gates working in the dipolar and quadrupolar detuning sweet spots. We further present the form for the long-range dipolar coupling between the charge qubit and the superconducting resonator. Based on the hybrid system composed of the qubits and the resonator, we present two types of entangling gates: the dynamical iSWAP gate and holonomic entangling gate, which are operating in the dispersive and resonant regimes, respectively. We find that the fidelity for the iSWAP gate can reach fidelity higher than 99\% for the noise level typical in experiments. Meanwhile, the fidelity for the holonomic gate can surpass 98\% if the anharmonicity in the resonator is large enough. Our proposal offers an alternative useful way to build up high-fidelity quantum computation for charge qubits in semiconductor quantum dot.

preprint2022arXiv

Covering Grassmannian Codes: Bounds and Constructions

Grassmannian $\mathcal{G}_q(n,k)$ is the set of all $k$-dimensional subspaces of the vector space $\mathbb{F}_q^n.$ Recently, Etzion and Zhang introduced a new notion called covering Grassmannian code which can be used in network coding solutions for generalized combination networks. An $α$-$(n,k,δ)_q^c$ covering Grassmannian code $\mathcal{C}$ is a subset of $\mathcal{G}_q(n,k)$ such that every set of $α$ codewords of $\mathcal{C}$ spans a subspace of dimension at least $δ+k$ in $\mathbb{F}_q^n.$ In this paper, we derive new upper and lower bounds on the size of covering Grassmannian codes. These bounds improve and extend the parameter range of known bounds.

preprint2022arXiv

Decentralized Stochastic Proximal Gradient Descent with Variance Reduction over Time-varying Networks

In decentralized learning, a network of nodes cooperate to minimize an overall objective function that is usually the finite-sum of their local objectives, and incorporates a non-smooth regularization term for the better generalization ability. Decentralized stochastic proximal gradient (DSPG) method is commonly used to train this type of learning models, while the convergence rate is retarded by the variance of stochastic gradients. In this paper, we propose a novel algorithm, namely DPSVRG, to accelerate the decentralized training by leveraging the variance reduction technique. The basic idea is to introduce an estimator in each node, which tracks the local full gradient periodically, to correct the stochastic gradient at each iteration. By transforming our decentralized algorithm into a centralized inexact proximal gradient algorithm with variance reduction, and controlling the bounds of error sequences, we prove that DPSVRG converges at the rate of $O(1/T)$ for general convex objectives plus a non-smooth term with $T$ as the number of iterations, while DSPG converges at the rate $O(\frac{1}{\sqrt{T}})$. Our experiments on different applications, network topologies and learning models demonstrate that DPSVRG converges much faster than DSPG, and the loss function of DPSVRG decreases smoothly along with the training epochs.

preprint2022arXiv

Deep Learning-based Massive MIMO CSI Acquisition for 5G Evolution and 6G

Recently, inspired by successful applications in many fields, deep learning (DL) technologies for CSI acquisition have received considerable research interest from both academia and industry. Considering the practical feedback mechanism of 5th generation (5G) New radio (NR) networks, we propose two implementation schemes for artificial intelligence for CSI (AI4CSI), the DL-based receiver and end-to-end design, respectively. The proposed AI4CSI schemes were evaluated in 5G NR networks in terms of spectrum efficiency (SE), feedback overhead, and computational complexity, and compared with legacy schemes. To demonstrate whether these schemes can be used in real-life scenarios, both the modeled-based channel data and practically measured channels were used in our investigations. When DL-based CSI acquisition is applied to the receiver only, which has little air interface impact, it provides approximately 25\% SE gain at a moderate feedback overhead level. It is feasible to deploy it in current 5G networks during 5G evolutions. For the end-to-end DL-based CSI enhancements, the evaluations also demonstrated their additional performance gain on SE, which is 6% -- 26% compared with DL-based receivers and 33% -- 58% compared with legacy CSI schemes. Considering its large impact on air-interface design, it will be a candidate technology for 6th generation (6G) networks, in which an air interface designed by artificial intelligence can be used.

preprint2022arXiv

Dichotomic Pattern Mining with Applications to Intent Prediction from Semi-Structured Clickstream Datasets

We introduce a pattern mining framework that operates on semi-structured datasets and exploits the dichotomy between outcomes. Our approach takes advantage of constraint reasoning to find sequential patterns that occur frequently and exhibit desired properties. This allows the creation of novel pattern embeddings that are useful for knowledge extraction and predictive modeling. Finally, we present an application on customer intent prediction from digital clickstream data. Overall, we show that pattern embeddings play an integrator role between semi-structured data and machine learning models, improve the performance of the downstream task and retain interpretability.

preprint2022arXiv

Domain Shift-oriented Machine Anomalous Sound Detection Model Based on Self-Supervised Learning

Thanks to the development of deep learning, research on machine anomalous sound detection based on self-supervised learning has made remarkable achievements. However, there are differences in the acoustic characteristics of the test set and the training set under different operating conditions of the same machine (domain shifts). It is challenging for the existing detection methods to learn the domain shifts features stably with low computation overhead. To address these problems, we propose a domain shift-oriented machine anomalous sound detection model based on self-supervised learning (TranSelf-DyGCN) in this paper. Firstly, we design a time-frequency domain feature modeling network to capture global and local spatial and time-domain features, thus improving the stability of machine anomalous sound detection stability under domain shifts. Then, we adopt a Dynamic Graph Convolutional Network (DyGCN) to model the inter-dependence relationship between domain shifts features, enabling the model to perceive domain shifts features efficiently. Finally, we use a Domain Adaptive Network (DAN) to compensate for the performance decrease caused by domain shifts, making the model adapt to anomalous sound better in the self-supervised environment. The performance of the suggested model is validated on DCASE 2020 task 2 and DCASE 2022 task 2.

preprint2022arXiv

Early results from GLASS-JWST. IX: First spectroscopic confirmation of low-mass quiescent galaxies at $z>2$ with NIRISS

How passive galaxies form, and the physical mechanisms which prevent star formation over long timescales, are some of the most outstanding questions in understanding galaxy evolution. The properties of quiescent galaxies over cosmic time provide crucial information to identify the quenching mechanisms. Passive galaxies have been confirmed and studied out to $z\sim4$, but all of these studies have been limited to massive systems (mostly with $\log{(M_{\rm star}/M_{\odot})}>10.8$). Using James Webb Space Telescope (JWST) NIRISS grism slitless spectroscopic data from the GLASS JWST ERS program, we present spectroscopic confirmation of two quiescent galaxies at $z_{\rm spec}=2.650^{+0.004}_{-0.006}$ and $z_{\rm spec}=2.433^{+0.032}_{-0.016}$ (3$σ$ errors) with stellar masses of $\log{(M_{\rm star}/M_{\odot})}=10.53^{+0.18}_{-0.06}$ and $\log{(M_{\rm star}/M_{\odot})}=9.93^{+0.06}_{-0.07}$ (corrected for magnification factors of $μ=2.0$ and $μ=2.1$, respectively). The latter represents the first spectroscopic confirmation of the existence of low-mass quiescent galaxies at cosmic noon, showcasing the power of JWST to identify and characterize this enigmatic population.

preprint2022arXiv

Early results from GLASS-JWST. XI: Stellar masses and mass-to-light ratio of z>7 galaxies

We exploit James Webb Space Telescope (JWST) NIRCam observations from the GLASS-JWST-Early Release Science program to investigate galaxy stellar masses at z>7. We first show that JWST observations reduce the uncertainties on the stellar mass by a factor of at least 5-10, when compared with the highest quality data sets available to date. We then study the UV mass-to-light ratio, finding that galaxies exhibit a two orders of magnitude range of M/L_UV values for a given luminosity, indicative of a broad variety of physical conditions and star formation histories. As a consequence, previous estimates of the cosmic star stellar mass density - based on an average correlation between UV luminosity and stellar mass - can be biased by as much as a factor of ~6. Our first exploration demonstrates that JWST represents a new era in our understanding of stellar masses at z>7, and therefore of the growth of galaxies prior to cosmic reionization.

preprint2022arXiv

Energy-Efficient UAV-Mounted RIS Assisted Mobile Edge Computing

Unmanned aerial vehicle (UAV) and reconfigurable intelligent surface (RIS) have been recently applied in the field of mobile edge computing (MEC) to improve the data exchange environment by proactively changing the wireless channels through maneuverable location deployment and intelligent signals reflection, respectively. Nevertheless, they may suffer from inherent limitations in practical scenarios. UAV-mounted RIS (U-RIS), as a promising integrated approach, can combine the advantages of UAV and RIS to break the limit. Inspired by this, we consider a novel U-RIS assisted MEC system, where a U-RIS is deployed to assist the communication between the ground users and an MEC server. The joint UAV trajectory, RIS passive beamforming and MEC resource allocation design is developed to maximize the energy efficiency (EE) of the system. To tackle the intractable non-convex problem, we divide it into two subproblems and solve them iteratively based on successive convex approximation (SCA) and the Dinkelbach method. Finally we obtain a high-performance suboptimal solution. Simulation results show that the proposed algorithm significantly improves the energy efficiency of the MEC system.

preprint2022arXiv

Enhanced brain structure-function tethering in transmodal cortex revealed by high-frequency eigenmodes

The brain's structural connectome supports signal propagation between neuronal elements, shaping diverse coactivation patterns that can be captured as functional connectivity. While the link between structure and function remains an ongoing challenge, the prevailing hypothesis is that the structure-function relationship may itself be gradually decoupled along a macroscale functional gradient spanning unimodal to transmodal regions. However, this hypothesis is strongly constrained by the underlying models which may neglect requisite signaling mechanisms. Here, we transform the structural connectome into a set of orthogonal eigenmodes governing frequency-specific diffusion patterns and show that regional structure-function relationships vary markedly under different signaling mechanisms. Specifically, low-frequency eigenmodes, which are considered sufficient to capture the essence of the functional network, contribute little to functional connectivity reconstruction in transmodal regions, resulting in structure-function decoupling along the unimodal-transmodal gradient. In contrast, high-frequency eigenmodes, which are usually on the periphery of attention due to their association with noisy and random dynamical patterns, contribute significantly to functional connectivity prediction in transmodal regions, inducing gradually convergent structure-function relationships from unimodal to transmodal regions. Although the information in high-frequency eigenmodes is weak and scattered, it effectively enhances the structure-function correspondence by 35% in unimodal regions and 56% in transmodal regions. Altogether, our findings suggest that the structure-function divergence in transmodal areas may not be an intrinsic property of brain organization, but can be narrowed through multiplexed and regionally specialized signaling mechanisms.

preprint2022arXiv

Estimating the confidence of speech spoofing countermeasure

Conventional speech spoofing countermeasures (CMs) are designed to make a binary decision on an input trial. However, a CM trained on a closed-set database is theoretically not guaranteed to perform well on unknown spoofing attacks. In some scenarios, an alternative strategy is to let the CM defer a decision when it is not confident. The question is then how to estimate a CM's confidence regarding an input trial. We investigated a few confidence estimators that can be easily plugged into a CM. On the ASVspoof2019 logical access database, the results demonstrate that an energy-based estimator and a neural-network-based one achieved acceptable performance in identifying unknown attacks in the test set. On a test set with additional unknown attacks and bona fide trials from other databases, the confidence estimators performed moderately well, and the CMs better discriminated bona fide and spoofed trials that had a high confidence score. Additional results also revealed the difficulty in enhancing a confidence estimator by adding unknown attacks to the training set.

preprint2022arXiv

Eyes Tell All: Irregular Pupil Shapes Reveal GAN-generated Faces

Generative adversary network (GAN) generated high-realistic human faces have been used as profile images for fake social media accounts and are visually challenging to discern from real ones. In this work, we show that GAN-generated faces can be exposed via irregular pupil shapes. This phenomenon is caused by the lack of physiological constraints in the GAN models. We demonstrate that such artifacts exist widely in high-quality GAN-generated faces and further describe an automatic method to extract the pupils from two eyes and analysis their shapes for exposing the GAN-generated faces. Qualitative and quantitative evaluations of our method suggest its simplicity and effectiveness in distinguishing GAN-generated faces.

preprint2022arXiv

Facility Location with Congestion and Priority in Drone-Based Emergency Delivery

Thanks to their fast delivery, reduced traffic restrictions, and low manpower need, drones have been increasingly deployed to deliver time-critical materials, such as medication, blood, and exam kits, in emergency situations. This paper considers a facility location model of using drones as mobile servers in emergency delivery. The model jointly optimizes the location of facilities, the capacity of drones deployed at opened facilities, and the allocation of demands, with an objective of equitable response times among all demand sites. To this end, we employ queues to model the system congestion of drone requests and consider three queuing disciplines: non-priority, static priority, and dynamic priority. For each discipline, we approximate the model as a mixed-integer second-order conic program (MISOCP), which can readily be solved in commercial solvers. We conduct extensive computational experiments to demonstrate the effectiveness and accuracy of our approach. Additionally, we compare the system performance under the three queuing disciplines and various problem parameters, from which we produce operational recommendations to decision makers in emergency delivery.

preprint2022arXiv

Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards

Generating accurate descriptions for online fashion items is important not only for enhancing customers' shopping experiences, but also for the increase of online sales. Besides the need of correctly presenting the attributes of items, the expressions in an enchanting style could better attract customer interests. The goal of this work is to develop a novel learning framework for accurate and expressive fashion captioning. Different from popular work on image captioning, it is hard to identify and describe the rich attributes of fashion items. We seed the description of an item by first identifying its attributes, and introduce attribute-level semantic (ALS) reward and sentence-level semantic (SLS) reward as metrics to improve the quality of text descriptions. We further integrate the training of our model with maximum likelihood estimation (MLE), attribute embedding, and Reinforcement Learning (RL). To facilitate the learning, we build a new FAshion CAptioning Dataset (FACAD), which contains 993K images and 130K corresponding enchanting and diverse descriptions. Experiments on FACAD demonstrate the effectiveness of our model.

preprint2022arXiv

First Census of Gas-phase Metallicity Gradients of Star-forming Galaxies in Overdense Environments at Cosmic Noon

We report the first spatially resolved measurements of gas-phase metallicity radial gradients in star-forming galaxies in overdense environments at $z\gtrsim2$. The spectroscopic data are acquired by the \mg\ survey, a Hubble Space Telescope (HST) cycle-28 medium program. This program is obtaining 45 orbits of WFC3/IR grism spectroscopy in the density peak regions of three massive galaxy protoclusters (BOSS 1244, BOSS 1542 and BOSS 1441) at $z=2-3$. Our sample in the BOSS 1244 field consists of 20 galaxies with stellar-mass ranging from $10^{9.0}$ to $10^{10.3}$ \Msun\ , star formation rate (SFR) from 10 to 240 \Msun\,yr$^{-1}$, and global gas-phase metallicity (\oh) from 8.2 to 8.6. At $1σ$ confidence level, 2/20 galaxies in our sample show positive (inverted) gradients -- the relative abundance of oxygen increasing with galactocentric radius, opposite the usual trend. Furthermore, 1/20 shows negative gradients and 17/20 are consistent with flat gradients. This high fraction of flat/inverted gradients is uncommon in simulations and previous observations conducted in blank fields at similar redshifts. To understand this, we investigate the correlations among various observed properties of our sample galaxies. We find an anticorrelation between metallicity gradient and global metallicity of our galaxies residing in extreme overdensities, and a marked deficiency of metallicity in our massive galaxies as compared to their coeval field counterparts. We conclude that the cold-mode gas accretion plays an active role in shaping the chemical evolution of galaxies in the protocluster environments, diluting their central chemical abundance, and flattening/inverting their metallicity gradients.

preprint2022arXiv

Fluorination Increases Hydrophobicity at the Macroscopic Level but not at the Microscopic Level

Hydrophobic interactions have been studied in detail in the past based on hydrophobic polymers, such as polystyrene (PS). Because fluorinated materials have relatively low surface energy, they often show both oleophobicity and hydrophobicity at the macroscopic level. However, it remains unknown how fluorination of hydrophobic polymer influences hydrophobicity at the microscopic level. In this work, we synthesized PS and fluorine-substituted PS (FPS) by reversible addition-fragmentation chain transfer polymerization method. Contact angle measurements confirmed that FPS is more hydrophobic than PS at the macroscopic level due to the introduction of fluorine. However, single molecule force spectroscopy experiments showed that the forces required to unfold the PS and FPS nanoparticles in water are indistinguishable, indicating that the strength of the hydrophobic ffect that drives the self-assembly of PS and FPS nanoparticles is the same at the microscopic level. The divergence of hydrophobic effect at the macroscopic and microscopic level may hint different underlying mechanisms: the hydrophobicity is dominated by the solvent hydration at the microscopic level and the surface-associated interaction at the macroscopic level.

preprint2022arXiv

Fundamental limitations on optimization in variational quantum algorithms

Exploring quantum applications of near-term quantum devices is a rapidly growing field of quantum information science with both theoretical and practical interests. A leading paradigm to establish such near-term quantum applications is variational quantum algorithms (VQAs). These algorithms use a classical optimizer to train a parameterized quantum circuit to accomplish certain tasks, where the circuits are usually randomly initialized. In this work, we prove that for a broad class of such random circuits, the variation range of the cost function via adjusting any local quantum gate within the circuit vanishes exponentially in the number of qubits with a high probability. This result can unify the restrictions on gradient-based and gradient-free optimizations in a natural manner and reveal extra harsh constraints on the training landscapes of VQAs. Hence a fundamental limitation on the trainability of VQAs is unraveled, indicating the essential mechanism of the optimization hardness in the Hilbert space with exponential dimension. We further showcase the validity of our results with numerical simulations of representative VQAs. We believe that these results would deepen our understanding of the scalability of VQAs and shed light on the search for near-term quantum applications with advantages.

preprint2022arXiv

Hermite-Gaussian-mode coherently composed states and deep learning based free-space optical communication link

In laser-based free-space optical communication, besides OAM beams, Hermite-Gaussian (HG) modes or HG-mode coherently composed states (HG-MCCS) can also be adopted as the information carrier to extend the channel capacity with the spatial pattern based encoding and decoding link. The light field of HG-MCCS is mainly determined by three independent parameters, including indexes of HG modes, relative initial phases between two eigenmodes, and scale coefficients of the eigenmodes, which can obtain a large number of effective coding modes at a low mode order. The beam intensity distributions of the HG-MCCSs have obvious distinguishable spatial characteristics and can keep propagation invariance, which are convenient to be decoded by the convolutional neural network (CNN) based image recognition method. We experimentally utilize HG-MCCS to realize a communication link including encoding, transmission under atmospheric turbulence (AT), and decoding based on CNN. With the index order of eigenmodes within six, 125 HG-MCCS are generated and used for information encoding, and the average recognition accuracy reached 99.5% for non-AT conditions. For the 125-level color images transmission, the error rate of the system is less than 1.8% even under the weak AT condition. Our work provides a useful basis for the future combination of dense data communication and artificial intelligence technology.

preprint2022arXiv

Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentiment Analysis

Document-level Sentiment Analysis (DSA) is more challenging due to vague semantic links and complicate sentiment information. Recent works have been devoted to leveraging text summarization and have achieved promising results. However, these summarization-based methods did not take full advantage of the summary including ignoring the inherent interactions between the summary and document. As a result, they limited the representation to express major points in the document, which is highly indicative of the key sentiment. In this paper, we study how to effectively generate a discriminative representation with explicit subject patterns and sentiment contexts for DSA. A Hierarchical Interaction Networks (HIN) is proposed to explore bidirectional interactions between the summary and document at multiple granularities and learn subject-oriented document representations for sentiment classification. Furthermore, we design a Sentiment-based Rethinking mechanism (SR) by refining the HIN with sentiment label information to learn a more sentiment-aware document representation. We extensively evaluate our proposed models on three public datasets. The experimental results consistently demonstrate the effectiveness of our proposed models and show that HIN-SR outperforms various state-of-the-art methods.

preprint2022arXiv

Hybrid subconvexity bounds for twists of $\rm GL(3)$ $L$-functions

Let $π$ be a $SL(3,\mathbb Z)$ Hecke-Maass cusp form and $χ$ a primitive Dirichlet character of prime power conductor $\mathfrak{q}=p^k$ with $p$ prime. In this paper we will prove the following subconvexity bound $$ L\left(\frac{1}{2}+it,π\times χ\right)\ll_{π,\varepsilon} p^{3/4}\big(\mathfrak{q}(1+|t|)\big)^{3/4-3/40+\varepsilon}, $$ for any $\varepsilon >0$ and $t \in \mathbb{R}$.

preprint2022arXiv

Hydrodynamic Relaxation in a Strongly Interacting Fermi Gas

We measure the free decay of a spatially periodic density profile in a normal fluid strongly interacting Fermi gas, which is confined in a box potential. This spatial profile is initially created in thermal equilibrium by a perturbing potential. After the perturbation is abruptly extinguished, the dominant spatial Fourier component exhibits an exponentially decaying (thermally diffusive) mode and a decaying oscillatory (first sound) mode, enabling independent measurement of the thermal conductivity and the shear viscosity directly from the time-dependent evolution.

preprint2022arXiv

Incremental Graph Computation: Anchored Vertex Tracking in Dynamic Social Networks

User engagement has recently received significant attention in understanding the decay and expansion of communities in many online social networking platforms. When a user chooses to leave a social networking platform, it may cause a cascading dropping out among her friends. In many scenarios, it would be a good idea to persuade critical users to stay active in the network and prevent such a cascade because critical users can have significant influence on user engagement of the whole network. Many user engagement studies have been conducted to find a set of critical (anchored) users in the static social network. However, social networks are highly dynamic and their structures are continuously evolving. In order to fully utilize the power of anchored users in evolving networks, existing studies have to mine multiple sets of anchored users at different times, which incurs an expensive computational cost. To better understand user engagement in evolving network, we target a new research problem called Anchored Vertex Tracking (AVT) in this paper, aiming to track the anchored users at each timestamp of evolving networks. Nonetheless, it is nontrivial to handle the AVT problem which we have proved to be NP-hard. To address the challenge, we develop a greedy algorithm inspired by the previous anchored k-core study in the static networks. Furthermore, we design an incremental algorithm to efficiently solve the AVT problem by utilizing the smoothness of the network structure's evolution. The extensive experiments conducted on real and synthetic datasets demonstrate the performance of our proposed algorithms and the effectiveness in solving the AVT problem.

preprint2022arXiv

Investigating self-supervised front ends for speech spoofing countermeasures

Self-supervised speech model is a rapid progressing research topic, and many pre-trained models have been released and used in various down stream tasks. For speech anti-spoofing, most countermeasures (CMs) use signal processing algorithms to extract acoustic features for classification. In this study, we use pre-trained self-supervised speech models as the front end of spoofing CMs. We investigated different back end architectures to be combined with the self-supervised front end, the effectiveness of fine-tuning the front end, and the performance of using different pre-trained self-supervised models. Our findings showed that, when a good pre-trained front end was fine-tuned with either a shallow or a deep neural network-based back end on the ASVspoof 2019 logical access (LA) training set, the resulting CM not only achieved a low EER score on the 2019 LA test set but also significantly outperformed the baseline on the ASVspoof 2015, 2021 LA, and 2021 deepfake test sets. A sub-band analysis further demonstrated that the CM mainly used the information in a specific frequency band to discriminate the bona fide and spoofed trials across the test sets.

preprint2022arXiv

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Conventional automatic speaker verification systems can usually be decomposed into a front-end model such as time delay neural network (TDNN) for extracting speaker embeddings and a back-end model such as statistics-based probabilistic linear discriminant analysis (PLDA) or neural network-based neural PLDA (NPLDA) for similarity scoring. However, the sequential optimization of the front-end and back-end models may lead to a local minimum, which theoretically prevents the whole system from achieving the best optimization. Although some methods have been proposed for jointly optimizing the two models, such as the generalized end-to-end (GE2E) model and NPLDA E2E model, all of these methods are designed for use with a single enrollment utterance. In this paper, we propose a new E2E joint method for speaker verification especially designed for the practical case of multiple enrollment utterances. In order to leverage the intra-relationship among multiple enrollment utterances, our model comes equipped with frame-level and utterance-level attention mechanisms. We also utilize several data augmentation techniques, including conventional noise augmentation using MUSAN and RIRs datasets and a unique speaker embedding-level mixup strategy for better optimization.

preprint2022arXiv

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Speaker anonymization aims to protect the privacy of speakers while preserving spoken linguistic information from speech. Current mainstream neural network speaker anonymization systems are complicated, containing an F0 extractor, speaker encoder, automatic speech recognition acoustic model (ASR AM), speech synthesis acoustic model and speech waveform generation model. Moreover, as an ASR AM is language-dependent, trained on English data, it is hard to adapt it into another language. In this paper, we propose a simpler self-supervised learning (SSL)-based method for language-independent speaker anonymization without any explicit language-dependent model, which can be easily used for other languages. Extensive experiments were conducted on the VoicePrivacy Challenge 2020 datasets in English and AISHELL-3 datasets in Mandarin to demonstrate the effectiveness of our proposed SSL-based language-independent speaker anonymization method.

preprint2022arXiv

Learning to Solve Travelling Salesman Problem with Hardness-adaptive Curriculum

Various neural network models have been proposed to tackle combinatorial optimization problems such as the travelling salesman problem (TSP). Existing learning-based TSP methods adopt a simple setting that the training and testing data are independent and identically distributed. However, the existing literature fails to solve TSP instances when training and testing data have different distributions. Concretely, we find that different training and testing distribution will result in more difficult TSP instances, i.e., the solution obtained by the model has a large gap from the optimal solution. To tackle this problem, in this work, we study learning-based TSP methods when training and testing data have different distributions using adaptive-hardness, i.e., how difficult a TSP instance can be for a solver. This problem is challenging because it is non-trivial to (1) define hardness measurement quantitatively; (2) efficiently and continuously generate sufficiently hard TSP instances upon model training; (3) fully utilize instances with different levels of hardness to learn a more powerful TSP solver. To solve these challenges, we first propose a principled hardness measurement to quantify the hardness of TSP instances. Then, we propose a hardness-adaptive generator to generate instances with different hardness. We further propose a curriculum learner fully utilizing these instances to train the TSP solver. Experiments show that our hardness-adaptive generator can generate instances ten times harder than the existing methods, and our proposed method achieves significant improvement over state-of-the-art models in terms of the optimality gap.

preprint2022arXiv

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Although deep neural networks are capable of achieving performance superior to humans on various tasks, they are notorious for requiring large amounts of data and computing resources, restricting their success to domains where such resources are available. Metalearning methods can address this problem by transferring knowledge from related tasks, thus reducing the amount of data and computing resources needed to learn new tasks. We organize the MetaDL competition series, which provide opportunities for research groups all over the world to create and experimentally assess new meta-(deep)learning solutions for real problems. In this paper, authored collaboratively between the competition organizers and the top-ranked participants, we describe the design of the competition, the datasets, the best experimental results, as well as the top-ranked methods in the NeurIPS 2021 challenge, which attracted 15 active teams who made it to the final phase (by outperforming the baseline), making over 100 code submissions during the feedback phase. The solutions of the top participants have been open-sourced. The lessons learned include that learning good representations is essential for effective transfer learning.

preprint2022arXiv

Lyman Continuum Galaxy Candidates in COSMOS

Star-forming galaxies are the sources likely to have reionized the universe. As we cannot observe them directly due to the opacity of the intergalactic medium at $z\gtrsim5$, we study $z\sim3\text{--}5$ galaxies as proxies to place observational constraints on cosmic reionization. Using new deep \textit{Hubble Space Telescope} rest-frame UV F336W and F435W imaging (30-orbit, $\sim40$~arcmin$^2$, $\sim29\text{--}30$~mag depth at 5$σ$), we attempt to identify a sample of Lyman continuum galaxies (LCGs). These are individual sources that emit ionizing flux below the Lyman break ($<912~\textÅ$). This population would allow us to constrain cosmic reionization parameters such as the number density and escape fraction ($f_{\rm esc}$) of ionizing sources. We compile a comprehensive parent sample that does not rely on the Lyman-break technique for redshifts. We present three new spectroscopic candidates at $z\sim3.7\text{--}4.4$, and 32 new photometric candidates. The high-resolution multi-band HST imaging and new Keck/Low Resolution Imaging Spectrometer (LRIS) redshifts make these promising spectroscopic LCG candidates. Using both a traditional and probabilistic approach, we find the most likely $f_{\rm esc}$ values for the three spectroscopic LCG candidates are $>100\%$, and therefore not physical. We are unable to confirm the true nature of these sources with the best available imaging and direct blue Keck/LRIS spectroscopy. More spectra, especially from the new class of 30 m telescopes, will be required to build a statistical sample of LCGs to place firm observational constraints on cosmic reionization.

preprint2022arXiv

Mask Wearing Status Estimation with Smartwatches

We present MaskReminder, an automatic mask-wearing status estimation system based on smartwatches, to remind users who may be exposed to the COVID-19 virus transmission scenarios, to wear a mask. MaskReminder with the powerful MLP-Mixer deep learning model can effectively learn long-short range information from the inertial measurement unit readings, and can recognize the mask-related hand movements such as wearing a mask, lowering the metal strap of the mask, removing the strap from behind one side of the ears, etc. Extensive experiments on 20 volunteers and 8000+ data samples show that the average recognition accuracy is 89%. Moreover, MaskReminder is capable to remind a user to wear with a success rate of 90% even in the user-independent setting.

preprint2022arXiv

Medical Matting: A New Perspective on Medical Segmentation with Uncertainty

It is difficult to accurately label ambiguous and complex shaped targets manually by binary masks. The weakness of binary mask under-expression is highlighted in medical image segmentation, where blurring is prevalent. In the case of multiple annotations, reaching a consensus for clinicians by binary masks is more challenging. Moreover, these uncertain areas are related to the lesions' structure and may contain anatomical information beneficial to diagnosis. However, current studies on uncertainty mainly focus on the uncertainty in model training and data labels. None of them investigate the influence of the ambiguous nature of the lesion itself.Inspired by image matting, this paper introduces alpha matte as a soft mask to represent uncertain areas in medical scenes and accordingly puts forward a new uncertainty quantification method to fill the gap of uncertainty research for lesion structure. In this work, we introduce a new architecture to generate binary masks and alpha mattes in a multitasking framework, which outperforms all state-of-the-art matting algorithms compared. The proposed uncertainty map is able to highlight the ambiguous regions and a novel multitasking loss weighting strategy we presented can improve performance further and demonstrate their concrete benefits. To fully-evaluate the effectiveness of our proposed method, we first labelled three medical datasets with alpha matte to address the shortage of available matting datasets in medical scenes and prove the alpha matte to be a more efficient labeling method than a binary mask from both qualitative and quantitative aspects.

preprint2022arXiv

Microscopic theory on magnetic-field-tuned sweet spot of exchange interactions in multielectron quantum-dot systems

The exchange interaction in a singlet-triplet qubit defined by two-electron states in the double-quantum-dot system ("two-electron singlet-triplet qubit") typically varies monotonically with the exchange interaction and thus carries no sweet spot. Here we study a singlet-triplet qubit defined by four-electron states in the double-quantum-dot system ("four-electron singlet-triplet qubit"). We demonstrate, using configuration-interaction calculations, that in the four-electron singlet-triplet qubit the exchange energy as a function of detuning can be non-monotonic, suggesting existence of sweet spots. We further show that the tuning of the sweet spot and the corresponding exchange energy by perpendicular magnetic field can be related to the variation of orbital splitting. Our results suggest that a singlet-triplet qubit with more than two electrons can have advantages in the realization of quantum computing.

preprint2022arXiv

Mitigating barren plateaus of variational quantum eigensolvers

Variational quantum algorithms (VQAs) are expected to establish valuable applications on near-term quantum computers. However, recent works have pointed out that the performance of VQAs greatly relies on the expressibility of the ansatzes and is seriously limited by optimization issues such as barren plateaus (i.e., vanishing gradients). This work proposes the state efficient ansatz (SEA) for accurate ground state preparation with improved trainability. We show that the SEA can generate an arbitrary pure state with much fewer parameters than a universal ansatz, making it efficient for tasks like ground state estimation. Then, we prove that barren plateaus can be efficiently mitigated by the SEA and the trainability can be further improved most quadratically by flexibly adjusting the entangling capability of the SEA. Finally, we investigate a plethora of examples in ground state estimation where we obtain significant improvements in the magnitude of cost gradient and the convergence speed.

preprint2022arXiv

Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing

Deep learning (DL) techniques are proven effective in many challenging tasks, and become widely-adopted in practice. However, previous work has shown that DL libraries, the basis of building and executing DL models, contain bugs and can cause severe consequences. Unfortunately, existing testing approaches still cannot comprehensively exercise DL libraries. They utilize existing trained models and only detect bugs in model inference phase. In this work we propose Muffin to address these issues. To this end, Muffin applies a specifically-designed model fuzzing approach, which allows it to generate diverse DL models to explore the target library, instead of relying only on existing trained models. Muffin makes differential testing feasible in the model training phase by tailoring a set of metrics to measure the inconsistencies between different DL libraries. In this way, Muffin can best exercise the library code to detect more bugs. To evaluate the effectiveness of Muffin, we conduct experiments on three widely-used DL libraries. The results demonstrate that Muffin can detect 39 new bugs in the latest release versions of popular DL libraries, including Tensorflow, CNTK, and Theano.

preprint2022arXiv

Muon $(g-2)$ and Flavor Puzzles in the $U(1)^{}_{X}$-gauged Leptoquark Model

We present an economical model where an $S^{}_1$ leptoquark and an anomaly-free $U(1)^{}_X$ gauge symmetry with $X = B^{}_3-2L^{}_μ/3-L^{}_τ/3$ are introduced, to account for the muon anomalous magnetic moment $a^{}_μ\equiv (g^{}_μ-2)$ and flavor puzzles including $R^{}_{K^{(\ast)_{}}}$ and $R^{}_{D^{(\ast)_{}}}$ anomalies together with quark and lepton flavor mixing. The $Z^\prime_{}$ gauge boson associated with the $U(1)^{}_X$ symmetry is responsible for the $R^{}_{K^{(\ast)_{}}}$ anomaly. Meanwhile, the specific flavor mixing patterns of quarks and leptons can be generated after the spontaneous breakdown of the $U(1)^{}_X$ gauge symmetry via the Froggatt-Nielsen mechanism. The $S^{}_1$ leptoquark which is also charged under the $U(1)^{}_X$ gauge symmetry can simultaneously explain the latest muon $(g-2)$ result and the $R^{}_{D^{(\ast)_{}}}$ anomaly. In addition, we also discuss several other experimental constraints on our model.

preprint2022arXiv

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

We present the design and baseline results for a new challenge in the ChaLearn meta-learning series, accepted at NeurIPS'22, focusing on "cross-domain" meta-learning. Meta-learning aims to leverage experience gained from previous tasks to solve new tasks efficiently (i.e., with better performance, little training data, and/or modest computational resources). While previous challenges in the series focused on within-domain few-shot learning problems, with the aim of learning efficiently N-way k-shot tasks (i.e., N class classification problems with k training examples), this competition challenges the participants to solve "any-way" and "any-shot" problems drawn from various domains (healthcare, ecology, biology, manufacturing, and others), chosen for their humanitarian and societal impact. To that end, we created Meta-Album, a meta-dataset of 40 image classification datasets from 10 domains, from which we carve out tasks with any number of "ways" (within the range 2-20) and any number of "shots" (within the range 1-20). The competition is with code submission, fully blind-tested on the CodaLab challenge platform. The code of the winners will be open-sourced, enabling the deployment of automated machine learning solutions for few-shot image classification across several domains.

preprint2022arXiv

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language

The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in this matter or with limited resources. To address these hurdles, we develop an automatic tool, NL2GDPR, which can generate policies from natural language descriptions from the developer while also ensuring the app's functionalities are compliant with General Data Protection Regulation (GDPR). NL2GDPR is developed by leveraging an information extraction tool, OIA (Open Information Annotation), developed by Baidu Cognitive Computing Lab. At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator. We perform a comprehensive study to grasp the challenges in extracting privacy-centric information and generating privacy policies, while exploiting optimizations for this specific task. With NL2GDPR, we can achieve 92.9%, 95.2%, and 98.4% accuracy in correctly identifying GDPR policies related to personal data storage, process, and share types, respectively. To the best of our knowledge, NL2GDPR is the first tool that allows a developer to automatically generate GDPR compliant policies, with only the need of entering the natural language for describing the app features. Note that other non-GDPR-related features might be integrated with the generated features to build a complex app.

preprint2022arXiv

Nonadiabatic geometric quantum computation with cat qubits via invariant-based reverse engineering

We propose a protocol to realize nonadiabatic geometric quantum computation of small-amplitude Schrödinger cat qubits via invariant-based reverse engineering. We consider a system with a two-photon driven Kerr nonlinearity, which provides a pair of dressed even and odd coherent states, i.e., Schrödinger cat states for fault-tolerant quantum computations. An additional coherent field is applied to linearly drive a cavity mode, to induce oscillations between dressed cat states. By designing this linear drive with invariant-based reverse engineering, nonadiabatic geometric quantum computation with cat qubits can be implemented. The performance of the protocol is estimated by taking into account the influence of systematic errors, additive white Gaussian noise, and decoherence including photon loss and dephasing. Numerical results demonstrate that our protocol is robust against these negative factors. Therefore, this protocol may provide a feasible method for nonadiabatic geometric quantum computation in bosonic systems.

preprint2022arXiv

Nonreciprocal waveguide-QED for spinning cavities with multiple coupling points

We investigate chiral emission and the single-photon scattering of spinning cavities coupled to a meandering waveguide at multiple coupling points. It is shown that nonreciprocal photon transmissions occur in the cavities-waveguide system, which stems from interference effects among different coupling points, and frequency shifts induced by the Sagnac effect. The nonlocal interference is akin to the mechanism in giant atoms. In the single-cavity setup, by optimizing the spinning velocity and number of coupling points, the chiral factor can approach 1, and the chiral direction can be freely switched. Moreover, destructive interference gives rise to the complete photon transmission in one direction over the whole optical frequency band, with no analogy in other quantum setups. In the multiple-cavity system, we also investigate the photon transport properties. The results indicate a directional information flow between different nodes. Our proposal provides a novel way to achieve quantum nonreciprocal devices, which can be applied in large-scale quantum chiral networks with optical waveguides.

preprint2022arXiv

One dimensional reduced model for ITER relevant energetic particle transport

We set up a mapping procedure able to translate the evolution of the radial profile of fast ions, interacting with Toroidal Alfvén Eigenmodes, into the dynamics of an equivalent one dimensional bump-on-tail system. We apply this mapping technique to reproduce ITER relevant simulations, which clearly outlined deviations from the diffusive quasi-linear (QL) model. Our analysis demonstrates the capability of the one-dimensional beam-plasma dynamics to predict the relevant features of the non-linear hybrid LIGKA/HAGIS simulations. In particular, we clearly identify how the deviation from the QL evolutive profiles is due to the presence of avalanche processes. A detailed analysis regarding the reduced dimensionality is also addressed, by means of phase-space slicing based on constants of motion. In the conclusions, we outline the main criticalities and outcomes of the procedure, which must be satisfactorily addressed to make quantitative prediction on the observed outgoing fluxes in a Tokamak device.

preprint2022arXiv

Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces

AI-synthesized faces are visually challenging to discern from real ones. They have been used as profile images for fake social media accounts, which leads to high negative social impacts. Although progress has been made in developing automatic methods to detect AI-synthesized faces, there is no open platform to study the human performance of AI-synthesized faces detection. In this work, we develop an online platform called Open-eye to study the human performance of AI-synthesized face detection. We describe the design and workflow of the Open-eye in this paper.

preprint2022arXiv

Out-Of-Distribution Generalization on Graphs: A Survey

Graph machine learning has been extensively studied in both academia and industry. Although booming with a vast number of emerging methods and techniques, most of the literature is built on the in-distribution hypothesis, i.e., testing and training graph data are identically distributed. However, this in-distribution hypothesis can hardly be satisfied in many real-world graph scenarios where the model performance substantially degrades when there exist distribution shifts between testing and training graph data. To solve this critical problem, out-of-distribution (OOD) generalization on graphs, which goes beyond the in-distribution hypothesis, has made great progress and attracted ever-increasing attention from the research community. In this paper, we comprehensively survey OOD generalization on graphs and present a detailed review of recent advances in this area. First, we provide a formal problem definition of OOD generalization on graphs. Second, we categorize existing methods into three classes from conceptually different perspectives, i.e., data, model, and learning strategy, based on their positions in the graph machine learning pipeline, followed by detailed discussions for each category. We also review the theories related to OOD generalization on graphs and introduce the commonly used graph datasets for thorough evaluations. Finally, we share our insights on future research directions. This paper is the first systematic and comprehensive review of OOD generalization on graphs, to the best of our knowledge.

preprint2022arXiv

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation

Real human conversation data are complicated, heterogeneous, and noisy, from which building open-domain dialogue systems remains a challenging task. In fact, such dialogue data still contains a wealth of information and knowledge, however, they are not fully explored. In this paper, we show existing open-domain dialogue generation methods that memorize context-response paired data with autoregressive or encode-decode language models underutilize the training data. Different from current approaches, using external knowledge, we explore a retrieval-generation training framework that can take advantage of the heterogeneous and noisy training data by considering them as "evidence". In particular, we use BERTScore for retrieval, which gives better qualities of the evidence and generation. Experiments over publicly available datasets demonstrate that our method can help models generate better responses, even such training data are usually impressed as low-quality data. Such performance gain is comparable with those improved by enlarging the training set, even better. We also found that the model performance has a positive correlation with the relevance of the retrieved evidence. Moreover, our method performed well on zero-shot experiments, which indicates that our method can be more robust to real-world data.

preprint2022arXiv

Receiver Design for MIMO Unsourced Random Access with SKP Coding

In this letter, we extend the sparse Kronecker-product (SKP) coding scheme, originally designed for the additive white Gaussian noise (AWGN) channel, to multiple input multiple output (MIMO) unsourced random access (URA). With the SKP coding adopted for MIMO transmission, we develop an efficient Bayesian iterative receiver design to solve the intended challenging trilinear factorization problem. Numerical results show that the proposed design outperforms the existing counterparts, and that it performs well in all simulated settings with various antenna sizes and active-user numbers.

preprint2022arXiv

ReFormer: The Relational Transformer for Image Captioning

Image captioning is shown to be able to achieve a better performance by using scene graphs to represent the relations of objects in the image. The current captioning encoders generally use a Graph Convolutional Net (GCN) to represent the relation information and merge it with the object region features via concatenation or convolution to get the final input for sentence decoding. However, the GCN-based encoders in the existing methods are less effective for captioning due to two reasons. First, using the image captioning as the objective (i.e., Maximum Likelihood Estimation) rather than a relation-centric loss cannot fully explore the potential of the encoder. Second, using a pre-trained model instead of the encoder itself to extract the relationships is not flexible and cannot contribute to the explainability of the model. To improve the quality of image captioning, we propose a novel architecture ReFormer -- a RElational transFORMER to generate features with relation information embedded and to explicitly express the pair-wise relationships between objects in the image. ReFormer incorporates the objective of scene graph generation with that of image captioning using one modified Transformer model. This design allows ReFormer to generate not only better image captions with the bene-fit of extracting strong relational image features, but also scene graphs to explicitly describe the pair-wise relation-ships. Experiments on publicly available datasets show that our model significantly outperforms state-of-the-art methods on image captioning and scene graph generation

preprint2022arXiv

Robust Attentive Deep Neural Network for Exposing GAN-generated Faces

GAN-based techniques that generate and synthesize realistic faces have caused severe social concerns and security problems. Existing methods for detecting GAN-generated faces can perform well on limited public datasets. However, images from existing public datasets do not represent real-world scenarios well enough in terms of view variations and data distributions (where real faces largely outnumber synthetic faces). The state-of-the-art methods do not generalize well in real-world problems and lack the interpretability of detection results. Performance of existing GAN-face detection models degrades significantly when facing imbalanced data distributions. To address these shortcomings, we propose a robust, attentive, end-to-end network that can spot GAN-generated faces by analyzing their eye inconsistencies. Specifically, our model learns to identify inconsistent eye components by localizing and comparing the iris artifacts between the two eyes automatically. Our deep network addresses the imbalance learning issues by considering the AUC loss and the traditional cross-entropy loss jointly. Comprehensive evaluations of the FFHQ dataset in terms of both balanced and imbalanced scenarios demonstrate the superiority of the proposed method.

preprint2022arXiv

Robust Contrastive Learning against Noisy Views

Contrastive learning relies on an assumption that positive pairs contain related views, e.g., patches of an image or co-occurring multimodal signals of a video, that share certain underlying information about an instance. But what if this assumption is violated? The literature suggests that contrastive learning produces suboptimal representations in the presence of noisy views, e.g., false positive pairs with no apparent shared information. In this work, we propose a new contrastive loss function that is robust against noisy views. We provide rigorous theoretical justifications by showing connections to robust symmetric losses for noisy binary classification and by establishing a new contrastive bound for mutual information maximization based on the Wasserstein distance measure. The proposed loss is completely modality-agnostic and a simple drop-in replacement for the InfoNCE loss, which makes it easy to apply to existing contrastive frameworks. We show that our approach provides consistent improvements over the state-of-the-art on image, video, and graph contrastive learning benchmarks that exhibit a variety of real-world noise patterns.

preprint2022arXiv

Robust entangling gate for capacitively coupled few-electron singlet-triplet qubits

The search of a sweet spot, locus in qubit parameters where quantum control is first-order insensitive to noises, is key to achieve high-fidelity quantum gates. Efforts to search for such a sweet spot in conventional double-quantum-dot singlet-triplet qubits where each dot hosts one electron ("two-electron singlet-triplet qubit"), especially for two-qubit operations, have been unsuccessful. Here we consider singlet-triplet qubits allowing each dot to host more than one electron, with a total of four electrons in the double quantum dots ("four-electron singlet-triplet qubit"). We theoretically demonstrate, using configuration-interaction calculations, that sweet spots appear in this coupled qubit system. We further demonstrate that, under realistic charge noise and hyperfine noise, two-qubit operation at the proposed sweet spot could offer gate fidelities ($\sim99\%$) that are higher than conventional two-electron singlet-triplet qubit system ($\sim90\%$). Our results should facilitate realization of high-fidelity two-qubit gates in singlet-triplet qubit systems.

preprint2022arXiv

Scene Recognition with Objectness, Attribute and Category Learning

Scene classification has established itself as a challenging research problem. Compared to images of individual objects, scene images could be much more semantically complex and abstract. Their difference mainly lies in the level of granularity of recognition. Yet, image recognition serves as a key pillar for the good performance of scene recognition as the knowledge attained from object images can be used for accurate recognition of scenes. The existing scene recognition methods only take the category label of the scene into consideration. However, we find that the contextual information that contains detailed local descriptions are also beneficial in allowing the scene recognition model to be more discriminative. In this paper, we aim to improve scene recognition using attribute and category label information encoded in objects. Based on the complementarity of attribute and category labels, we propose a Multi-task Attribute-Scene Recognition (MASR) network which learns a category embedding and at the same time predicts scene attributes. Attribute acquisition and object annotation are tedious and time consuming tasks. We tackle the problem by proposing a partially supervised annotation strategy in which human intervention is significantly reduced. The strategy provides a much more cost-effective solution to real world scenarios, and requires considerably less annotation efforts. Moreover, we re-weight the attribute predictions considering the level of importance indicated by the object detected scores. Using the proposed method, we efficiently annotate attribute labels for four large-scale datasets, and systematically investigate how scene and attribute recognition benefit from each other. The experimental results demonstrate that MASR learns a more discriminative representation and achieves competitive recognition performance compared to the state-of-the-art methods

preprint2022arXiv

Self-directed Machine Learning

Conventional machine learning (ML) relies heavily on manual design from machine learning experts to decide learning tasks, data, models, optimization algorithms, and evaluation metrics, which is labor-intensive, time-consuming, and cannot learn autonomously like humans. In education science, self-directed learning, where human learners select learning tasks and materials on their own without requiring hands-on guidance, has been shown to be more effective than passive teacher-guided learning. Inspired by the concept of self-directed human learning, we introduce the principal concept of Self-directed Machine Learning (SDML) and propose a framework for SDML. Specifically, we design SDML as a self-directed learning process guided by self-awareness, including internal awareness and external awareness. Our proposed SDML process benefits from self task selection, self data selection, self model selection, self optimization strategy selection and self evaluation metric selection through self-awareness without human guidance. Meanwhile, the learning performance of the SDML process serves as feedback to further improve self-awareness. We propose a mathematical formulation for SDML based on multi-level optimization. Furthermore, we present case studies together with potential applications of SDML, followed by discussing future research directions. We expect that SDML could enable machines to conduct human-like self-directed learning and provide a new perspective towards artificial general intelligence.

preprint2022arXiv

Sensitivity tests of cosmic velocity fields to massive neutrinos

We investigate impacts of massive neutrinos on the cosmic velocity fields, employing high-resolution cosmological $N$-body simulations provided by the information-optimized CUBE code, where cosmic neutrinos are evolved using collisionless hydrodynamics and their perturbations can be accurately resolved. In this study we focus, for the first time, on the analysis of massive-neutrino induced suppression effects in various cosmic velocity field components of velocity magnitude, divergence, vorticity and dispersion. By varying the neutrino mass sum $M_ν$ from 0 -- 0.4 eV, the simulations show that, the power spectra of vorticity -- exclusively sourced by non-linear structure formation that is affected by massive neutrinos significantly -- is very sensitive to the mass sum, which potentially provide novel signatures in detecting massive neutrinos. Furthermore, using the chi-square statistic, we quantitatively test the sensitivity of the density and velocity power spectra to the neutrino mass sum. Indeed, we find that, the vorticity spectrum has the highest sensitivity, and the null hypothesis of massless neutrinos is incompatible with both vorticity and divergence spectra from $M_ν=0.1$ eV at high significance ($p$-value $= 0.03$ and $0.07$, respectively). These results demonstrate clearly the importance of peculiar velocity field measurements, in particular of vorticity and divergence components, in determination of neutrino mass and mass hierarchy.

preprint2022arXiv

Sign-switching of superexchange mediated by few electrons under non-uniform magnetic field

Long range interaction between distant spins is an important building block for the realization of large quantum-dot network in which couplings between pairs of spins can be selectively addressed. Recent experiments on coherent logical states oscillation between remote spins facilitated by intermediate electron states has paved the first step for large scale quantum information processing. Reaching this ultimate goal requires extensive studies on the superexchange interaction on different quantum-dot spatial arrangements and electron configurations. Here, we consider a linear triple-quantum-dot with two anti-parallel spins in the outer dots forming the logical states while various number of electrons in the middle dot forming a mediator, which facilitates the superexchange interaction. We show that the superexchange is enhanced when the number of mediating electrons increases. In addition, we show that forming a four-electron triplet in the mediator dot further enhance the superexchange strength. Our work can be a guide to scale up the quantum-dot array with controllable and dense connectivity.

preprint2022arXiv

Social Distancing Alert with Smartwatches

Social distancing is an efficient public health practice during the COVID-19 pandemic. However, people would violate the social distancing practice unconsciously when they conduct some social activities such as handshaking, hugging, kissing on the face or forehead, etc. In this paper, we present SoDA, a social distancing practice violation alert system based on smartwatches, for preventing COVID-19 virus transmission. SoDA utilizes recordings of accelerometers and gyroscopes to recognize activities that may violate social distancing practice with simple yet effective Vision Transformer models. Extensive experiments over 10 volunteers and 1800+ samples demonstrate that SoDA achieves social activity recognition with the accuracy of 94.7%, 1.8% negative alert, and 2.2% missing alert.

preprint2022arXiv

Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

Large deformations of organs, caused by diverse shapes and nonlinear shape changes, pose a significant challenge for medical image registration. Traditional registration methods need to iteratively optimize an objective function via a specific deformation model along with meticulous parameter tuning, but which have limited capabilities in registering images with large deformations. While deep learning-based methods can learn the complex mapping from input images to their respective deformation field, it is regression-based and is prone to be stuck at local minima, particularly when large deformations are involved. To this end, we present Stochastic Planner-Actor-Critic (SPAC), a novel reinforcement learning-based framework that performs step-wise registration. The key notion is warping a moving image successively by each time step to finally align to a fixed image. Considering that it is challenging to handle high dimensional continuous action and state spaces in the conventional reinforcement learning (RL) framework, we introduce a new concept `Plan' to the standard Actor-Critic model, which is of low dimension and can facilitate the actor to generate a tractable high dimensional action. The entire framework is based on unsupervised training and operates in an end-to-end manner. We evaluate our method on several 2D and 3D medical image datasets, some of which contain large deformations. Our empirical results highlight that our work achieves consistent, significant gains and outperforms state-of-the-art methods.

preprint2022arXiv

Submillimetre galaxies in two massive protoclusters at z = 2.24: witnessing the enrichment of extreme starbursts in the outskirts of HAE density peaks

Submillimetre galaxies represent a rapid growth phase of both star formation and massive galaxies. Mapping SMGs in galaxy protoclusters provides key insights into where and how these extreme starbursts take place in connections with the assembly of the large-scale structure in the early Universe. We search for SMGs at 850$\,μm$ using JCMT/SCUBA-2 in two massive protoclusters at $z=2.24$, BOSS1244 and BOSS1542, and detect 43 and 54 sources with $S_{850}>4\,$mJy at the $4σ$ level within an effective area of 264$\,$arcmin$^2$, respectively. We construct the intrinsic number counts and find that the abundance of SMGs is $2.0\pm0.3$ and $2.1\pm0.2$ times that of the general fields, confirming that BOSS1244 and BOSS1542 contain a higher fraction of dusty galaxies with strongly enhanced star formation. The volume densities of the SMGs are estimated to be $\sim15-$30 times the average, significantly higher than the overdensity factor ($\sim 6$) traced by H$α$ emission-line galaxies (HAEs). More importantly, we discover a prominent offset between the spatial distributions of the two populations in these two protoclusters -- SMGs are mostly located around the high-density regions of HAEs, and few are seen inside these regions. This finding may have revealed for the first time the occurrence of violent star formation enhancement in the outskirts of the HAE density peaks, likely driven by the boosting of gas supplies and/or starburst triggering events. Meanwhile, the lack of SMGs inside the most overdense regions at $z\sim2$ implies a transition to the environment disfavouring extreme starbursts.

preprint2022arXiv

Sum of Ranked Range Loss for Supervised Learning

In forming learning objectives, one oftentimes needs to aggregate a set of individual values to a single output. Such cases occur in the aggregate loss, which combines individual losses of a learning model over each training sample, and in the individual loss for multi-label learning, which combines prediction scores over all class labels. In this work, we introduce the sum of ranked range (SoRR) as a general approach to form learning objectives. A ranked range is a consecutive sequence of sorted values of a set of real numbers. The minimization of SoRR is solved with the difference of convex algorithm (DCA). We explore two applications in machine learning of the minimization of the SoRR framework, namely the AoRR aggregate loss for binary/multi-class classification at the sample level and the TKML individual loss for multi-label/multi-class classification at the label level. A combination loss of AoRR and TKML is proposed as a new learning objective for improving the robustness of multi-label learning in the face of outliers in sample and labels alike. Our empirical results highlight the effectiveness of the proposed optimization frameworks and demonstrate the applicability of proposed losses using synthetic and real data sets.

preprint2022arXiv

SWIPENET: Object detection in noisy underwater images

In recent years, deep learning based object detection methods have achieved promising performance in controlled environments. However, these methods lack sufficient capabilities to handle underwater object detection due to these challenges: (1) images in the underwater datasets and real applications are blurry whilst accompanying severe noise that confuses the detectors and (2) objects in real applications are usually small. In this paper, we propose a novel Sample-WeIghted hyPEr Network (SWIPENET), and a robust training paradigm named Curriculum Multi-Class Adaboost (CMA), to address these two problems at the same time. Firstly, the backbone of SWIPENET produces multiple high resolution and semantic-rich Hyper Feature Maps, which significantly improve small object detection. Secondly, a novel sample-weighted detection loss function is designed for SWIPENET, which focuses on learning high weight samples and ignore learning low weight samples. Moreover, inspired by the human education process that drives the learning from easy to hard concepts, we here propose the CMA training paradigm that first trains a clean detector which is free from the influence of noisy data. Then, based on the clean detector, multiple detectors focusing on learning diverse noisy data are trained and incorporated into a unified deep ensemble of strong noise immunity. Experiments on two underwater robot picking contest datasets (URPC2017 and URPC2018) show that the proposed SWIPENET+CMA framework achieves better accuracy in object detection against several state-of-the-art approaches.

preprint2022arXiv

Synergistic Network Learning and Label Correction for Noise-robust Image Classification

Large training datasets almost always contain examples with inaccurate or incorrect labels. Deep Neural Networks (DNNs) tend to overfit training label noise, resulting in poorer model performance in practice. To address this problem, we propose a robust label correction framework combining the ideas of small loss selection and noise correction, which learns network parameters and reassigns ground truth labels iteratively. Taking the expertise of DNNs to learn meaningful patterns before fitting noise, our framework first trains two networks over the current dataset with small loss selection. Based on the classification loss and agreement loss of two networks, we can measure the confidence of training data. More and more confident samples are selected for label correction during the learning process. We demonstrate our method on both synthetic and real-world datasets with different noise types and rates, including CIFAR-10, CIFAR-100 and Clothing1M, where our method outperforms the baseline approaches.

preprint2022arXiv

Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis

Speech synthesis and music audio generation from symbolic input differ in many aspects but share some similarities. In this study, we investigate how text-to-speech synthesis techniques can be used for piano MIDI-to-audio synthesis tasks. Our investigation includes Tacotron and neural source-filter waveform models as the basic components, with which we build MIDI-to-audio synthesis systems in similar ways to TTS frameworks. We also include reference systems using conventional sound modeling techniques such as sample-based and physical-modeling-based methods. The subjective experimental results demonstrate that the investigated TTS components can be applied to piano MIDI-to-audio synthesis with minor modifications. The results also reveal the performance bottleneck -- while the waveform model can synthesize high quality piano sound given natural acoustic features, the conversion from MIDI to acoustic features is challenging. The full MIDI-to-audio synthesis system is still inferior to the sample-based or physical-modeling-based approaches, but we encourage TTS researchers to test their TTS models for this new task and improve the performance.

preprint2022arXiv

The VoicePrivacy 2020 Challenge Evaluation Plan

The VoicePrivacy Challenge aims to promote the development of privacy preservation tools for speech technology by gathering a new community to define the tasks of interest and the evaluation methodology, and benchmarking solutions through a series of challenges. In this document, we formulate the voice anonymization task selected for the VoicePrivacy 2020 Challenge and describe the datasets used for system development and evaluation. We also present the attack models and the associated objective and subjective evaluation metrics. We introduce two anonymization baselines and report objective evaluation results.

preprint2022arXiv

Theory on electron-phonon spin dehphasing in GaAs multi-electron double quantum dots

Recent studies reveal that a double-quantum-dot system hosting more than two electrons may be superior in certain aspects as compared to the traditional case in which only two electrons are confined (a singlet-triplet qubit). We study the electron-phonon dephasing occurring in a GaAs multi-electron double-quantum-dot system, in a biased case in which the singlet state is hybridized, as well as in an unbiased case in which the hybridization is absent. We have found that while the electron-phonon dephasing rate increases with the number of electrons confined in the unbiased case, this does not hold in the biased case. We define a merit figure as a ratio between the exchange energy and the dephasing rate, and have shown that in experimentally relevant range of the exchange energy, the merit figure actually increases with the number of electrons in the biased case. Our results show that the multi-electron quantum-dot system has another advantage in mitigating the effect of electron-phonon dephasing, which is previously under-appreciated in the literature.

preprint2022arXiv

Topological strings and Wilson loops

We propose the refined topological string correspondence to the expectation values of half-BPS Wilson loop operators in 5d $\mathcal{N}=1$ gauge theory partition function on the Omega-deformed background $\mathbb{R}^4_{ε_{1,2}}\times S^1$. We provide the refined topological vertex method and the refined holomorphic anomaly equation method in the topological string theory, from which we have exact computations on the 5d Wilson loops partition functions in both A- and B-models. Finally, with the exact results we have in B-model, we recover the quantum periods of local $\mathbb{P}^1\times\mathbb{P}^1$ model and local $\mathbb{P}^2$ model in the study of quantum geometry and we further give a refined generalization of A-period.

preprint2022arXiv

Trajectory Planning of Cellular-Connected UAV for Communication-assisted Radar Sensing

Being a key technology for beyond fifth-generation wireless systems, joint communication and radar sensing (JCAS) utilizes the reflections of communication signals to detect foreign objects and deliver situational awareness. A cellular-connected unmanned aerial vehicle (UAV) is uniquely suited to form a mobile bistatic synthetic aperture radar (SAR) with its serving base station (BS) to sense over large areas with superb sensing resolutions at no additional requirement of spectrum. This paper designs this novel BS-UAV bistatic SAR platform, and optimizes the flight path of the UAV to minimize its propulsion energy and guarantee the required sensing resolutions on a series of interesting landmarks. A new trajectory planning algorithm is developed to convexify the propulsion energy and resolution requirements by using successive convex approximation and block coordinate descent. Effective trajectories are obtained with a polynomial complexity. Extensive simulations reveal that the proposed trajectory planning algorithm outperforms significantly its alternative that minimizes the flight distance of cellular-aided sensing missions in terms of energy efficiency and effective consumption fluctuation. The energy saving offered by the proposed algorithm can be as significant as 55\%.

preprint2022arXiv

Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild

Building reliable object detectors that can detect out-of-distribution (OOD) objects is critical yet underexplored. One of the key challenges is that models lack supervision signals from unknown data, producing overconfident predictions on OOD objects. We propose a new unknown-aware object detection framework through Spatial-Temporal Unknown Distillation (STUD), which distills unknown objects from videos in the wild and meaningfully regularizes the model's decision boundary. STUD first identifies the unknown candidate object proposals in the spatial dimension, and then aggregates the candidates across multiple video frames to form a diverse set of unknown objects near the decision boundary. Alongside, we employ an energy-based uncertainty regularization loss, which contrastively shapes the uncertainty space between the in-distribution and distilled unknown objects. STUD establishes the state-of-the-art performance on OOD detection tasks for object detection, reducing the FPR95 score by over 10% compared to the previous best method. Code is available at https://github.com/deeplearning-wisc/stud.

preprint2022arXiv

Unsupervised Domain Adaptive Fundus Image Segmentation with Category-level Regularization

Existing unsupervised domain adaptation methods based on adversarial learning have achieved good performance in several medical imaging tasks. However, these methods focus only on global distribution adaptation and ignore distribution constraints at the category level, which would lead to sub-optimal adaptation performance. This paper presents an unsupervised domain adaptation framework based on category-level regularization that regularizes the category distribution from three perspectives. Specifically, for inter-domain category regularization, an adaptive prototype alignment module is proposed to align feature prototypes of the same category in the source and target domains. In addition, for intra-domain category regularization, we tailored a regularization technique for the source and target domains, respectively. In the source domain, a prototype-guided discriminative loss is proposed to learn more discriminative feature representations by enforcing intra-class compactness and inter-class separability, and as a complement to traditional supervised loss. In the target domain, an augmented consistency category regularization loss is proposed to force the model to produce consistent predictions for augmented/unaugmented target images, which encourages semantically similar regions to be given the same label. Extensive experiments on two publicly fundus datasets show that the proposed approach significantly outperforms other state-of-the-art comparison algorithms.

preprint2022arXiv

VerSe: A Vertebrae Labelling and Segmentation Benchmark for Multi-detector CT Images

Vertebral labelling and segmentation are two fundamental tasks in an automated spine processing pipeline. Reliable and accurate processing of spine images is expected to benefit clinical decision-support systems for diagnosis, surgery planning, and population-based analysis on spine and bone health. However, designing automated algorithms for spine processing is challenging predominantly due to considerable variations in anatomy and acquisition protocols and due to a severe shortage of publicly available data. Addressing these limitations, the Large Scale Vertebrae Segmentation Challenge (VerSe) was organised in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2019 and 2020, with a call for algorithms towards labelling and segmentation of vertebrae. Two datasets containing a total of 374 multi-detector CT scans from 355 patients were prepared and 4505 vertebrae have individually been annotated at voxel-level by a human-machine hybrid algorithm (https://osf.io/nqjyw/, https://osf.io/t98fz/). A total of 25 algorithms were benchmarked on these datasets. In this work, we present the the results of this evaluation and further investigate the performance-variation at vertebra-level, scan-level, and at different fields-of-view. We also evaluate the generalisability of the approaches to an implicit domain shift in data by evaluating the top performing algorithms of one challenge iteration on data from the other iteration. The principal takeaway from VerSe: the performance of an algorithm in labelling and segmenting a spine scan hinges on its ability to correctly identify vertebrae in cases of rare anatomical variations. The content and code concerning VerSe can be accessed at: https://github.com/anjany/verse.

preprint2021arXiv

A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

Deep learning has made breakthroughs and substantial in many fields due to its powerful automatic representation capabilities. It has been proven that neural architecture design is crucial to the feature representation of data and the final performance. However, the design of the neural architecture heavily relies on the researchers' prior knowledge and experience. And due to the limitations of human' inherent knowledge, it is difficult for people to jump out of their original thinking paradigm and design an optimal model. Therefore, an intuitive idea would be to reduce human intervention as much as possible and let the algorithm automatically design the neural architecture. Neural Architecture Search (NAS) is just such a revolutionary algorithm, and the related research work is complicated and rich. Therefore, a comprehensive and systematic survey on the NAS is essential. Previously related surveys have begun to classify existing work mainly based on the key components of NAS: search space, search strategy, and evaluation strategy. While this classification method is more intuitive, it is difficult for readers to grasp the challenges and the landmark work involved. Therefore, in this survey, we provide a new perspective: beginning with an overview of the characteristics of the earliest NAS algorithms, summarizing the problems in these early NAS algorithms, and then providing solutions for subsequent related research work. Besides, we conduct a detailed and comprehensive analysis, comparison, and summary of these works. Finally, we provide some possible future research directions.

preprint2021arXiv

A Marching Cube Algorithm Based on Edge Growth

Marching Cube algorithm is currently one of the most popular 3D reconstruction surface rendering algorithms. It forms cube voxels through the input image, and then uses 15 basic topological configurations to extract the iso-surfaces in the voxels. It processes each cube voxel in a traversal manner, but it does not consider the relationship between iso-surfaces in adjacent cubes. Due to ambiguity, the final reconstructed model may have holes. We propose a Marching Cube algorithm based on edge growth. The algorithm first extracts seed triangles, then grows the seed triangles and reconstructs the entire 3D model. According to the position of the growth edge, we propose 17 topological configurations with iso-surfaces. From the reconstruction results, the algorithm can reconstruct the 3D model well. When only the main contour of the 3D model needs to be organized, the algorithm performs well. In addition, when there are multiple scattered parts in the data, the algorithm can extract only the 3D contours of the parts connected to the seed by setting the region selected by the seed.

preprint2021arXiv

ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech

The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of bi-annual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge.

preprint2021arXiv

Deep Learning to Quantify Pulmonary Edema in Chest Radiographs

Purpose: To develop a machine learning model to classify the severity grades of pulmonary edema on chest radiographs. Materials and Methods: In this retrospective study, 369,071 chest radiographs and associated radiology reports from 64,581 (mean age, 51.71; 54.51% women) patients from the MIMIC-CXR chest radiograph dataset were included. This dataset was split into patients with and without congestive heart failure (CHF). Pulmonary edema severity labels from the associated radiology reports were extracted from patients with CHF as four different ordinal levels: 0, no edema; 1, vascular congestion; 2, interstitial edema; and 3, alveolar edema. Deep learning models were developed using two approaches: a semi-supervised model using a variational autoencoder and a pre-trained supervised learning model using a dense neural network. Receiver operating characteristic curve analysis was performed on both models. Results: The area under the receiver operating characteristic curve (AUC) for differentiating alveolar edema from no edema was 0.99 for the semi-supervised model and 0.87 for the pre-trained models. Performance of the algorithm was inversely related to the difficulty in categorizing milder states of pulmonary edema (shown as AUCs for semi-supervised model and pre-trained model, respectively): 2 versus 0, 0.88 and 0.81; 1 versus 0, 0.79 and 0.66; 3 versus 1, 0.93 and 0.82; 2 versus 1, 0.69 and 0.73; and, 3 versus 2, 0.88 and 0.63. Conclusion: Deep learning models were trained on a large chest radiograph dataset and could grade the severity of pulmonary edema on chest radiographs with high performance.

preprint2021arXiv

Elliptic Quantum Curves of 6d SO(N) theories

We discuss supersymmetric defects in 6d $\mathcal{N}=(1,0)$ SCFTs with $\mathrm{SO}(N_c)$ gauge group and $N_c-8$ fundamental flavors. The codimension 2 and 4 defects are engineered by coupling the 6d gauge fields to charged free fields in four and two dimensions, respectively. We find that the partition function in the presence of the codimension 2 defect on $\mathbb{R}^4\times \mathbb{T}^2$ in the Nekrasov-Shatashvili limit satisfies an elliptic difference equation which quantizes the Seiberg-Witten curve of the 6d theory. The expectation value of the codimension 4 defect appearing in the difference equation is an even (under reflection) degree $N_c$ section over the elliptic curve when $N_c$ is even, and an odd section when $N_c$ is odd. We also find that RG-flows of the defects and the associated difference equations in the 6d $\mathrm{SO}(2N+1)$ gauge theories triggered by Higgs VEVs of KK-momentum states provide quantum Seiberg-Witten curves for $\mathbb{Z}_2$ twisted compactifications of the 6d $\mathrm{SO}(2N)$ gauge theories.

preprint2021arXiv

Experimental demonstration of adversarial examples in learning topological phases

Classification and identification of different phases and the transitions between them is a central task in condensed matter physics. Machine learning, which has achieved dramatic success in a wide range of applications, holds the promise to bring unprecedented perspectives for this challenging task. However, despite the exciting progress made along this direction, the reliability of machine-learning approaches likewise demands further investigation. Here, with the nitrogen-vacancy center platform, we report the first proof-of-principle experimental demonstration of adversarial examples in learning topological phases. We show that, after adding a tiny amount of carefully-designed perturbations, the experimentally observed adversarial examples can successfully deceive a splendid phase classifier, whose prediction accuracy is larger than $99.2\%$ on legitimate samples, with a notably high confidence. Our results explicitly showcase the crucial vulnerability aspect of applying machine learning techniques in classifying phases of matter, which provides an indispensable guide for future studies in this interdisciplinary field.

preprint2021arXiv

Fast generation of Cat states in Kerr nonlinear resonators via optimal adiabatic control

Macroscopic cat states have been widely studied to illustrate fundamental principles of quantum physics as well as their application in quantum information processing. In this paper, we propose a quantum speedup method for adiabatic creation of cat states in a Kerr nonlinear resonator via gradient-descent optimal adiabatic control. By simultaneously adiabatic tuning the the cavity detuning and driving field strength, the width of minimum energy gap between the target trajectory and non-adiabatic trajectory can be widen, which allows us to speed up the evolution along the adiabatic path. Compared with the previous proposal of preparing the cat state by only controlling two-photon pumping strength in a Kerr nonlinear resonator, our method can prepare the target state with much shorter time, as well as a high fidelity and a large non-classical volume. It is worth noting that the cat state prepared by our method is also robust against single-photon loss very well. Moreover, when our proposal has a large initial detuning, it will creates a large-size cat state successfully. This proposal of preparing cat states can be implemented in superconducting quantum circuits, which provides a quantum state resource for quantum information encoding and fault-tolerant quantum computing.

preprint2021arXiv

Floquet Spin Amplification

Detection of weak electromagnetic waves and hypothetical particles aided by quantum amplification is important for fundamental physics and applications. However, demonstrations of quantum amplification are still limited; in particular, the physics of quantum amplification is not fully explored in periodically driven (Floquet) systems, which are generally defined by time-periodic Hamiltonians and enable observation of many exotic quantum phenomena such as time crystals. Here we investigate the magnetic-field signal amplification by periodically driven $^{129}$Xe spins and observe signal amplification at frequencies of transitions between Floquet spin states. This "Floquet amplification" allows to simultaneously enhance and measure multiple magnetic fields with at least one order of magnitude improvement, offering the capability of femtotesla-level measurements. Our findings extend the physics of quantum amplification to Floquet systems and can be generalized to a wide variety of existing amplifiers, enabling a previously unexplored class of "Floquet amplifiers".

preprint2021arXiv

Instance-Aware Predictive Navigation in Multi-Agent Environments

In this work, we aim to achieve efficient end-to-end learning of driving policies in dynamic multi-agent environments. Predicting and anticipating future events at the object level are critical for making informed driving decisions. We propose an Instance-Aware Predictive Control (IPC) approach, which forecasts interactions between agents as well as future scene structures. We adopt a novel multi-instance event prediction module to estimate the possible interaction among agents in the ego-centric view, conditioned on the selected action sequence of the ego-vehicle. To decide the action at each step, we seek the action sequence that can lead to safe future states based on the prediction module outputs by repeatedly sampling likely action sequences. We design a sequential action sampling strategy to better leverage predicted states on both scene-level and instance-level. Our method establishes a new state of the art in the challenging CARLA multi-agent driving simulation environments without expert demonstration, giving better explainability and sample efficiency.

preprint2021arXiv

Leveraging Regular Fundus Images for Training UWF Fundus Diagnosis Models via Adversarial Learning and Pseudo-Labeling

Recently, ultra-widefield (UWF) 200\degree~fundus imaging by Optos cameras has gradually been introduced because of its broader insights for detecting more information on the fundus than regular 30 degree - 60 degree fundus cameras. Compared with UWF fundus images, regular fundus images contain a large amount of high-quality and well-annotated data. Due to the domain gap, models trained by regular fundus images to recognize UWF fundus images perform poorly. Hence, given that annotating medical data is labor intensive and time consuming, in this paper, we explore how to leverage regular fundus images to improve the limited UWF fundus data and annotations for more efficient training. We propose the use of a modified cycle generative adversarial network (CycleGAN) model to bridge the gap between regular and UWF fundus and generate additional UWF fundus images for training. A consistency regularization term is proposed in the loss of the GAN to improve and regulate the quality of the generated data. Our method does not require that images from the two domains be paired or even that the semantic labels be the same, which provides great convenience for data collection. Furthermore, we show that our method is robust to noise and errors introduced by the generated unlabeled data with the pseudo-labeling technique. We evaluated the effectiveness of our methods on several common fundus diseases and tasks, such as diabetic retinopathy (DR) classification, lesion detection and tessellated fundus segmentation. The experimental results demonstrate that our proposed method simultaneously achieves superior generalizability of the learned representations and performance improvements in multiple tasks.

preprint2021arXiv

MetaDelta: A Meta-Learning System for Few-shot Image Classification

Meta-learning aims at learning quickly on novel tasks with limited data by transferring generic experience learned from previous tasks. Naturally, few-shot learning has been one of the most popular applications for meta-learning. However, existing meta-learning algorithms rarely consider the time and resource efficiency or the generalization capacity for unknown datasets, which limits their applicability in real-world scenarios. In this paper, we propose MetaDelta, a novel practical meta-learning system for the few-shot image classification. MetaDelta consists of two core components: i) multiple meta-learners supervised by a central controller to ensure efficiency, and ii) a meta-ensemble module in charge of integrated inference and better generalization. In particular, each meta-learner in MetaDelta is composed of a unique pretrained encoder fine-tuned by batch training and parameter-free decoder used for prediction. MetaDelta ranks first in the final phase in the AAAI 2021 MetaDL Challenge\footnote{https://competitions.codalab.org/competitions/26638}, demonstrating the advantages of our proposed system. The codes are publicly available at https://github.com/Frozenmad/MetaDelta.

preprint2021arXiv

MOSNet: Deep Learning based Objective Assessment for Voice Conversion

Existing objective evaluation metrics for voice conversion (VC) are not always correlated with human perception. Therefore, training VC models with such criteria may not effectively improve naturalness and similarity of converted speech. In this paper, we propose deep learning-based assessment models to predict human ratings of converted speech. We adopt the convolutional and recurrent neural network models to build a mean opinion score (MOS) predictor, termed as MOSNet. The proposed models are tested on large-scale listening test results of the Voice Conversion Challenge (VCC) 2018. Experimental results show that the predicted scores of the proposed MOSNet are highly correlated with human MOS ratings at the system level while being fairly correlated with human MOS ratings at the utterance level. Meanwhile, we have modified MOSNet to predict the similarity scores, and the preliminary results show that the predicted scores are also fairly correlated with human ratings. These results confirm that the proposed models could be used as a computational evaluator to measure the MOS of VC systems to reduce the need for expensive human rating.

preprint2021arXiv

Multimodal Gait Recognition for Neurodegenerative Diseases

In recent years, single modality based gait recognition has been extensively explored in the analysis of medical images or other sensory data, and it is recognised that each of the established approaches has different strengths and weaknesses. As an important motor symptom, gait disturbance is usually used for diagnosis and evaluation of diseases; moreover, the use of multi-modality analysis of the patient's walking pattern compensates for the one-sidedness of single modality gait recognition methods that only learn gait changes in a single measurement dimension. The fusion of multiple measurement resources has demonstrated promising performance in the identification of gait patterns associated with individual diseases. In this paper, as a useful tool, we propose a novel hybrid model to learn the gait differences between three neurodegenerative diseases, between patients with different severity levels of Parkinson's disease and between healthy individuals and patients, by fusing and aggregating data from multiple sensors. A spatial feature extractor (SFE) is applied to generating representative features of images or signals. In order to capture temporal information from the two modality data, a new correlative memory neural network (CorrMNN) architecture is designed for extracting temporal features. Afterwards, we embed a multi-switch discriminator to associate the observations with individual state estimations. Compared with several state-of-the-art techniques, our proposed framework shows more accurate classification results.

preprint2021arXiv

New bounds and constructions for constant weighted $X$-codes

As a crucial technique for integrated circuits (IC) test response compaction, $X$-compact employs a special kind of codes called $X$-codes for reliable compressions of the test response in the presence of unknown logic values ($X$s). From a combinatorial view point, Fujiwara and Colbourn \cite{FC2010} introduced an equivalent definition of $X$-codes and studied $X$-codes of small weights that have good detectability and $X$-tolerance. An $(m,n,d,x)$ $X$-code is an $m\times n$ binary matrix with column vectors as its codewords. The parameters $d,x$ correspond to the test quality of the code. In this paper, bounds and constructions for constant weighted $X$-codes are investigated. First, we obtain a general result on the maximum number of codewords $n$ for an $(m,n,d,x)$ $X$-code of weight $w$, and we further improve this lower bound for the case with $x=2$ and $w=3$ through the probabilistic method. Then, using tools from additive combinatorics and finite fields, we present some explicit constructions for constant weighted $X$-codes with $d=3,7$ and $x=2$, which are optimal for the case when $d=3, w=4$ and nearly optimal for the case when $d=3,w=3$. We also consider a special class of $X$-codes introduced in \cite{FC2010} and improve the best known lower bound on the maximum number of codewords for this kind of $X$-codes.

preprint2021arXiv

Quasi one-dimensional diffuse laser cooling of atoms

We demonstrate experimentally the generation of one-dimensional cold gases of $^{87}$Rb atoms by diffuse laser cooling (DLC). A horizontal slender vacuum glass tube with length of 105~cm and diameter of 2~cm is used in our experiment. The diffuse laser light inside the tube, which is generated by multi-reflection of injected lasers, cools the background vapor atoms. With 250~mW of cooling light and 50~mW of repumping light, an evenly distributed meter-long profile of atom cloud is obtained. We observe a factor 4 improvement on the atomic OD for a typical cooling duration of 170~ms and a sub-Doppler atomic temperature of 25~$μ$k. The maximum number of detected cold atoms remain constant for a free-fall duration of 30~ms. Such samples are ideal for many quantum optical experiments involving electromagnetically induced transparency, electronically highly excited (Rydberg) atoms and quantum precision measurements.

preprint2021arXiv

Shortcuts to Adiabaticity for the Quantum Rabi Model: Efficient Generation of Giant Entangled cat States via Parametric Amplification

We propose a method for the fast generation of nonclassical ground states of the Rabi model in the ultrastrong and deep-strong coupling regimes via the shortcuts-to-adiabatic (STA) dynamics. The time-dependent quantum Rabi model is simulated by applying parametric amplification to the Jaynes-Cummings model. Using experimentally feasible parametric drive, this STA protocol can generate large-size Schrödinger cat states, through a process that is 10 times faster compared to adiabatic protocols. Such fast evolution increases the robustness of our protocol against dissipation. Our method enables one to freely design the parametric drive, so that the target state can be generated in the lab frame. A largely detuned light-matter coupling makes the protocol robust against imperfections of the operation times in experiments.

preprint2021arXiv

Spontaneous imbibition in porous media: from pore scale to Darcy scale

Spontaneous imbibition has been receiving much attention due to its significance in many subsurface and industrial applications. Unveiling pore-scale wetting dynamics, and particularly its upscaling to the Darcy scale are still unresolved. In this work, we conduct image-based pore-network modeling of cocurrent spontaneous imbibition and the corresponding quasi-static imbibition, in homogeneous sintered glass beads as well as heterogeneous Estaillades. A wide range of viscosity ratios and wettability conditions are taken into account. Based on our pore-scale results, we show the influence of pore-scale heterogeneity on imbibition dynamics and nonwetting entrapment. We elucidate different pore-filling mechanisms in imbibition, which helps us understand wetting dynamics. Most importantly, we develop a non-equilibrium model for relative permeability of the wetting phase, which adequately incorporates wetting dynamics. This is crucial to the final goal of developing a two-phase imbibition model with measurable material properties such as capillary pressure and relative permeability. Finally, we propose some future work on both numerical and experimental verifications of the developed non-equilibrium permeability model.

preprint2021arXiv

Synergic Adversarial Label Learning for Grading Retinal Diseases via Knowledge Distillation and Multi-task Learning

The need for comprehensive and automated screening methods for retinal image classification has long been recognized. Well-qualified doctors annotated images are very expensive and only a limited amount of data is available for various retinal diseases such as age-related macular degeneration (AMD) and diabetic retinopathy (DR). Some studies show that AMD and DR share some common features like hemorrhagic points and exudation but most classification algorithms only train those disease models independently. Inspired by knowledge distillation where additional monitoring signals from various sources is beneficial to train a robust model with much fewer data. We propose a method called synergic adversarial label learning (SALL) which leverages relevant retinal disease labels in both semantic and feature space as additional signals and train the model in a collaborative manner. Our experiments on DR and AMD fundus image classification task demonstrate that the proposed method can significantly improve the accuracy of the model for grading diseases. In addition, we conduct additional experiments to show the effectiveness of SALL from the aspects of reliability and interpretability in the context of medical imaging application.

preprint2021arXiv

The dynamic energy balance in earthquakes expressed by fault surface morphology

The dynamic energy balance is essential for earthquake studies. The energy balance approach is one of the most famous developments in fracture mechanics. To interpret seismological data, crack models and sliding on a frictional surface (fault) models are widely used. The macroscopically observable energy budget and the microscopic processes can be related through the fracture energy $G_c$. The fault surface morphology is the direct result of the microscopic processes near the crack tip or on the frictional interface. Here we show that the dynamic energy balance in earthquakes can be expressed by fault surface morphology, and that they are quantitatively linked. The direct shear experiments proves the predictions of the theoretical discussions, and show that the strain rate has crucial influence on the dynamic energy balance.

preprint2021arXiv

The mass-metallicity relation at cosmic noon in overdense environments: first results from the MAMMOTH-Grism HST slitless spectroscopic survey

The MAMMOTH-Grism slitless spectroscopic survey is a Hubble Space Telescope (HST) cycle-28 medium program, which is obtaining 45 orbits of WFC3/IR grism spectroscopy in the density peak regions of three massive galaxy protoclusters at $z=2-3$ discovered using the MAMMOTH technique. We introduce this survey by presenting the first measurement of the mass-metallicity relation (MZR) at high redshift in overdense environments via grism spectroscopy. From the completed MAMMOTH-Grism observations in the field of the BOSS1244 protocluster at $z=2.24\pm0.02$, We secure a sample of 36 protocluster member galaxies at $z\sim2.24$, showing strong nebular emission lines ([O III], H$β$ and [O II]) in their G141 spectra. Using the multi-wavelength broad-band deep imaging from HST and ground-based telescopes, we measure their stellar masses in the range of $[10^{9},10^{10.4}]M_\odot$, instantaneous star formation rates (SFR) from 10 to 240$M_\odot yr^{-1}$, and global gas-phase metallicities [$\frac{1}{3}$,1] of solar. Compared with similarly selected field galaxy sample at the same redshift, our galaxies show on average increased SFRs by $\sim$0.06dex and $\sim$0.18dex at $\sim$10$^{10.1}M_\odot$ and $\sim$10$^{9.8}M_\odot$, respectively. Using the stacked spectra of our sample galaxies, we derive the MZR in the BOSS1244 protocluster core as $12+\log({\rm O/H})=(0.136\pm0.018)\times\log(M_\ast/M_\odot)+(7.082\pm0.175)$, showing significantly shallower slope than that in the field. This shallow MZR slope is likely caused by the combined effects of efficient recycling of feedback-driven winds and cold-mode gas accretion in protocluster environments. The former effect helps low-mass galaxies residing in overdensities retain their metal production, whereas the latter effect dilutes the metal content of high-mass galaxies, making them more metal poor than their coeval field counterparts.

preprint2021arXiv

Tunable Chiral Bound States with Giant Atoms

We propose tunable chiral bound states in a system composed of superconducting giant atoms and a Josephson photonic-crystal waveguide (PCW), with no analog in other quantum setups. The chiral bound states arise due to interference in the nonlocal coupling of a giant atom to multiple points of the waveguide. The chirality can be tuned by changing either the atom-waveguide coupling or the external bias of the PCW. Furthermore, the chiral bound states can induce directional dipole-dipole interactions between multiple giant atoms coupling to the same waveguide. Our proposal is ready to be implemented in experiments with superconducting circuits, where it can be used as a tunable toolbox to realize topological phase transitions and quantum simulations.

preprint2020arXiv

A Census of Sub-kiloparsec Resolution Metallicity Gradients in Star-forming Galaxies at Cosmic Noon from HST Slitless Spectroscopy

We present hitherto the largest sample of gas-phase metallicity radial gradients measured at sub-kiloparsec resolution in star-forming galaxies in the redshift range of $z\in[1.2, 2.3]$. These measurements are enabled by the synergy of slitless spectroscopy from the Hubble Space Telescope near-infrared channels and the lensing magnification from foreground galaxy clusters. Our sample consists of 76 galaxies with stellar mass ranging from 10$^7$ to 10$^{10}$ $M_\odot$, instantaneous star-formation rate in the range of [1, 100] $M_\odot$/yr, and global metallicity [$\frac{1}{12}$, 2] solar. At 2-$σ$ confidence level, 15/76 galaxies in our sample show negative radial gradients, whereas 7/76 show inverted gradients. Combining ours and all other metallicity gradients obtained at similar resolution currently available in the literature, we measure a negative mass dependence of $Δ\log({\rm O/H})/Δr~ [\mathrm{dex~kpc^{-1}}] = \left(-0.020\pm0.007\right) + \left(-0.016\pm0.008\right) \log(M_\ast/10^{9.4} M_\odot)$ with the intrinsic scatter being $σ=0.060\pm0.006$ over four orders of magnitude in stellar mass. Our result is consistent with strong feedback, not secular processes, being the primary governor of the chemo-structural evolution of star-forming galaxies during the disk mass assembly at cosmic noon. We also find that the intrinsic scatter of metallicity gradients increases with decreasing stellar mass and increasing specific star-formation rate. This increase in the intrinsic scatter is likely caused by the combined effect of cold-mode gas accretion and merger-induced starbursts, with the latter more predominant in the dwarf mass regime of $M_\ast\lesssim10^9 M_\odot$.

preprint2020arXiv

A SLAM Map Restoration Algorithm Based on Submaps and an Undirected Connected Graph

Many visual simultaneous localization and mapping (SLAM) systems have been shown to be accurate and robust, and have real-time performance capabilities on both indoor and ground datasets. However, these methods can be problematic when dealing with aerial frames captured by a camera mounted on an unmanned aerial vehicle (UAV) because the flight height of the UAV can be difficult to control and is easily affected by the environment.To cope with the case of lost tracking, many visual SLAM systems employ a relocalization strategy. This involves the tracking thread continuing the online working by inspecting the connections between the subsequent new frames and the generated map before the tracking was lost. To solve the missing map problem, which is an issue in many applications , after the tracking is lost, based on monocular visual SLAM, we present a method of reconstructing a complete global map of UAV datasets by sequentially merging the submaps via the corresponding undirected connected graph. Specifically, submaps are repeatedly generated, from the initialization process to the place where the tracking is lost, and a corresponding undirected connected graph is built by considering these submaps as nodes and the common map points within two submaps as edges. The common map points are then determined by the bag-of-words (BoW) method, and the submaps are merged if they are found to be connected with the online map in the undirect connected graph. To demonstrate the performance of the proposed method, we first investigated the performance on a UAV dataset, and the experimental results showed that, in the case of several tracking failures, the integrity of the mapping was significantly better than that of the current mainstream SLAM method.

preprint2020arXiv

An Efficient Index Method for the Optimal Route Query over Multi-Cost Networks

Smart city has been consider the wave of the future and the route recommendation in networks is a fundamental problem in it. Most existing approaches for the shortest route problem consider that there is only one kind of cost in networks. However, there always are several kinds of cost in networks and users prefer to select an optimal route under the global consideration of these kinds of cost. In this paper, we study the problem of finding the optimal route in the multi-cost networks. We prove this problem is NP-hard and the existing index techniques cannot be used to this problem. We propose a novel partition-based index with contour skyline techniques to find the optimal route. We propose a vertex-filtering algorithm to facilitate the query processing. We conduct extensive experiments on six real-life networks and the experimental results show that our method has an improvement in efficiency by an order of magnitude compared to the previous heuristic algorithms.

preprint2020arXiv

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Automatic speaker verification (ASV) is one of the most natural and convenient means of biometric person recognition. Unfortunately, just like all other biometric systems, ASV is vulnerable to spoofing, also referred to as "presentation attacks." These vulnerabilities are generally unacceptable and call for spoofing countermeasures or "presentation attack detection" systems. In addition to impersonation, ASV systems are vulnerable to replay, speech synthesis, and voice conversion attacks. The ASVspoof 2019 edition is the first to consider all three spoofing attack types within a single challenge. While they originate from the same source database and same underlying protocol, they are explored in two specific use case scenarios. Spoofing attacks within a logical access (LA) scenario are generated with the latest speech synthesis and voice conversion technologies, including state-of-the-art neural acoustic and waveform model techniques. Replay spoofing attacks within a physical access (PA) scenario are generated through carefully controlled simulations that support much more revealing analysis than possible previously. Also new to the 2019 edition is the use of the tandem detection cost function metric, which reflects the impact of spoofing and countermeasures on the reliability of a fixed ASV system. This paper describes the database design, protocol, spoofing attack implementations, and baseline ASV and countermeasure results. It also describes a human assessment on spoofed data in logical access. It was demonstrated that the spoofing data in the ASVspoof 2019 database have varied degrees of perceived quality and similarity to the target speakers, including spoofed data that cannot be differentiated from bona-fide utterances even by human subjects.

preprint2020arXiv

Automated Pavement Crack Segmentation Using U-Net-based Convolutional Neural Network

Automated pavement crack image segmentation is challenging because of inherent irregular patterns, lighting conditions, and noise in images. Conventional approaches require a substantial amount of feature engineering to differentiate crack regions from non-affected regions. In this paper, we propose a deep learning technique based on a convolutional neural network to perform segmentation tasks on pavement crack images. Our approach requires minimal feature engineering compared to other machine learning techniques. We propose a U-Net-based network architecture in which we replace the encoder with a pretrained ResNet-34 neural network. We use a "one-cycle" training schedule based on cyclical learning rates to speed up the convergence. Our method achieves an F1 score of 96% on the CFD dataset and 73% on the Crack500 dataset, outperforming other algorithms tested on these datasets. We perform ablation studies on various techniques that helped us get marginal performance boosts, i.e., the addition of spatial and channel squeeze and excitation (SCSE) modules, training with gradually increasing image sizes, and training various neural network layers with different learning rates.

preprint2020arXiv

BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning

Datasets drive vision progress, yet existing driving datasets are impoverished in terms of visual content and supported tasks to study multitask learning for autonomous driving. Researchers are usually constrained to study a small set of problems on one dataset, while real-world computer vision applications require performing tasks of various complexities. We construct BDD100K, the largest driving video dataset with 100K videos and 10 tasks to evaluate the exciting progress of image recognition algorithms on autonomous driving. The dataset possesses geographic, environmental, and weather diversity, which is useful for training models that are less likely to be surprised by new conditions. Based on this diverse dataset, we build a benchmark for heterogeneous multitask learning and study how to solve the tasks together. Our experiments show that special training strategies are needed for existing models to perform such heterogeneous tasks. BDD100K opens the door for future studies in this important venue.

preprint2020arXiv

Bridge the Domain Gap Between Ultra-wide-field and Traditional Fundus Images via Adversarial Domain Adaptation

For decades, advances in retinal imaging technology have enabled effective diagnosis and management of retinal disease using fundus cameras. Recently, ultra-wide-field (UWF) fundus imaging by Optos camera is gradually put into use because of its broader insights on fundus for some lesions that are not typically seen in traditional fundus images. Research on traditional fundus images is an active topic but studies on UWF fundus images are few. One of the most important reasons is that UWF fundus images are hard to obtain. In this paper, for the first time, we explore domain adaptation from the traditional fundus to UWF fundus images. We propose a flexible framework to bridge the domain gap between two domains and co-train a UWF fundus diagnosis model by pseudo-labelling and adversarial learning. We design a regularisation technique to regulate the domain adaptation. Also, we apply MixUp to overcome the over-fitting issue from incorrect generated pseudo-labels. Our experimental results on either single or both domains demonstrate that the proposed method can well adapt and transfer the knowledge from traditional fundus images to UWF fundus images and improve the performance of retinal disease recognition.

preprint2020arXiv

Category-wise Attack: Transferable Adversarial Examples for Anchor Free Object Detection

Deep neural networks have been demonstrated to be vulnerable to adversarial attacks: subtle perturbations can completely change the classification results. Their vulnerability has led to a surge of research in this direction. However, most works dedicated to attacking anchor-based object detection models. In this work, we aim to present an effective and efficient algorithm to generate adversarial examples to attack anchor-free object models based on two approaches. First, we conduct category-wise instead of instance-wise attacks on the object detectors. Second, we leverage the high-level semantic information to generate the adversarial examples. Surprisingly, the generated adversarial examples it not only able to effectively attack the targeted anchor-free object detector but also to be transferred to attack other object detectors, even anchor-based detectors such as Faster R-CNN.

preprint2020arXiv

Channel-Dependent Scheduling in Wireless Energy Transfer for Mobile Devices

Resonant Beam Charging (RBC) is the Wireless Power Transfer (WPT) technology, which can provide high-power, long-distance, mobile, and safe wireless charging for Internet of Things (IoT) devices. Supporting multiple IoT devices charging simultaneously is a significant feature of the RBC system. To optimize the multi-user charging performance, the transmitting power should be scheduled for charging all IoT devices simultaneously. In order to keep all IoT devices working as long as possible for fairness, we propose the First Access First Charge (FAFC) scheduling algorithm. Then, we formulate the scheduling parameters quantitatively for algorithm implementation. Finally, we analyze the performance of FAFC scheduling algorithm considering the impacts of the receiver number, the transmitting power and the charging time. Based on the analysis, we summarize the methods of improving the WPT performance for multiple IoT devices, which include limiting the receiver number, increasing the transmitting power, prolonging the charging time and improving the single-user's charging efficiency. The FAFC scheduling algorithm design and analysis provide a fair WPT solution for the multi-user RBC system.

preprint2020arXiv

Community detection based on first passage probabilities

Community detection is of fundamental significance for understanding the topology characters and the spreading dynamics on complex networks. While random walk is widely used and is proven effective in many community detection algorithms, there still exists two major defects: (i) the maximal length of random walk is too large to distinguish the clustering information if using the average step of all possible random walks; (ii) the useful community information at all other step lengths are missed if using a pre-assigned maximal length. In this paper, we propose a novel community detection method based on the first passage probabilities (FPPM), equipped with a new similarity measure that incorporates the complete structural information within the maximal step length. Here the diameter of the network is chosen as an appropriate boundary of random walks which is adaptive to different networks. Then we use the hierarchical clustering to group the vertices into communities and further select the best division through the corresponding modularity values. Finally, a post-processing strategy is designed to integrate the unreasonable small communities, which significantly improves the accuracy of community division. Surprisingly, the numerical simulations show that FPPM performs best compared to several classic algorithms on both synthetic benchmarks and real-world networks, which reveals the universality and effectiveness of our method.

preprint2020arXiv

Cooling-Aware Resource Allocation and Load Management for Mobile Edge Computing Systems

Driven by explosive computation demands of Internet of Things (IoT), mobile edge computing (MEC) provides a promising technique to enhance the computation capability for mobile users. In this paper, we propose a joint resource allocation and load management mechanism in an MEC system with wireless power transfer (WPT), by jointly optimizing the transmit power for WPT, the local/edge computing load, the offloading time, and the frequencies of the central processing units (CPUs) at the access point (AP) and the users. To achieve an energy-efficient and sustainable WPT-MEC system, we minimize the total energy consumption of the AP, while meeting computation latency requirements. Cooling energy which is non-negligible, is taken into account in minimizing the energy consumption of the MEC system. By rigorously orchestrating the state-of-the-art optimization techniques, we design an iterative algorithm and obtain the optimal solution in a semi-closed form. Based on the solution, interesting properties and insights are summarized. Extensive numerical tests show that the proposed algorithm can save up to 90.4% the energy of existing benchmarks.

preprint2020arXiv

Cosmological constraints from the redshift dependence of the Alcock-Paczynski effect: Possibility of estimateing the non-linear systematics using fast simulations

The tomographic AP method is so far the best method in separating the Alcock-Paczynski (AP) signal from the redshift space distortion (RSD) effects and deriving powerful constraints on cosmological parameters using the $\lesssim40h^{-1}\ \rm Mpc$ clustering region. To guarantee that the method can be easily applied to the future large scale structure (LSS) surveys, we study the possibility of estimating the systematics of the method using fast simulation method. The major contribution of the systematics comes from the non-zero redshift evolution of the RSD effects, which is quantified by $\hatξ_{Δs}(μ,z)$ in our analysis, and estimated using the BigMultidark exact N-body simulation and approximate COLA simulation samples. We find about 5\%/10\% evolution when comparing the $\hatξ_{Δs}(μ,z)$ measured as $z=0.5$/$z=1$ to the measurements at $z=0$. We checked the inaccuracy in the 2pCFs computed using COLA, and find it 5-10 times smaller than the intrinsic systematics of the tomographic AP method, indicating that using COLA to estimate the systematics is good enough. Finally, we test the effect of halo bias, and find $\lesssim$1.5\% change in $\hatξ_{Δs}$ when varying the halo mass within the range of $2\times 10^{12}$ to $10^{14}$ $M_{\odot}$. We will perform more studies to achieve an accurate and efficient estimation of the systematics in redshift range of $z=0-1.5$.

preprint2020arXiv

Cost of quantum entanglement simplified

Quantum entanglement is a key physical resource in quantum information processing that allows for performing basic quantum tasks such as teleportation and quantum key distribution, which are impossible in the classical world. Ever since the rise of quantum information theory, it has been an open problem to quantify entanglement in an information-theoretically meaningful way. In particular, every previously defined entanglement measure bearing a precise information-theoretic meaning is not known to be efficiently computable, or if it is efficiently computable, then it is not known to have a precise information-theoretic meaning. In this Letter, we meet this challenge by introducing an entanglement measure that has a precise information-theoretic meaning as the exact cost required to prepare an entangled state when two distant parties are allowed to perform quantum operations that completely preserve the positivity of the partial transpose. Additionally, this entanglement measure is efficiently computable by means of a semidefinite program, and it bears a number of useful properties such as additivity and faithfulness. Our results bring key insights into the fundamental entanglement structure of arbitrary quantum states, and they can be used directly to assess and quantify the entanglement produced in quantum-physical experiments.

preprint2020arXiv

Cross-Channel Intragroup Sparsity Neural Network

Modern deep neural networks rely on overparameterization to achieve state-of-the-art generalization. But overparameterized models are computationally expensive. Network pruning is often employed to obtain less demanding models for deployment. Fine-grained pruning removes individual weights in parameter tensors and can achieve a high model compression ratio with little accuracy degradation. However, it introduces irregularity into the computing dataflow and often does not yield improved model inference efficiency in practice. Coarse-grained model pruning, while realizing satisfactory inference speedup through removal of network weights in groups, e.g. an entire filter, often lead to significant accuracy degradation. This work introduces the cross-channel intragroup (CCI) sparsity structure, which can prevent the inference inefficiency of fine-grained pruning while maintaining outstanding model performance. We then present a novel training algorithm designed to perform well under the constraint imposed by the CCI-Sparsity. Through a series of comparative experiments we show that our proposed CCI-Sparsity structure and the corresponding pruning algorithm outperform prior art in inference efficiency by a substantial margin given suited hardware acceleration in the future.

preprint2020arXiv

Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation

State-of-the-art techniques in Generative Adversarial Networks (GANs) have shown remarkable success in image-to-image translation from peer domain X to domain Y using paired image data. However, obtaining abundant paired data is a non-trivial and expensive process in the majority of applications. When there is a need to translate images across n domains, if the training is performed between every two domains, the complexity of the training will increase quadratically. Moreover, training with data from two domains only at a time cannot benefit from data of other domains, which prevents the extraction of more useful features and hinders the progress of this research area. In this work, we propose a general framework for unsupervised image-to-image translation across multiple domains, which can translate images from domain X to any a domain without requiring direct training between the two domains involved in image translation. A byproduct of the framework is the reduction of computing time and computing resources, since it needs less time than training the domains in pairs as is done in state-of-the-art works. Our proposed framework consists of a pair of encoders along with a pair of GANs which learns high-level features across different domains to generate diverse and realistic samples from. Our framework shows competing results on many image-to-image tasks compared with state-of-the-art techniques.

preprint2020arXiv

Deep Learning for Learning Graph Representations

Mining graph data has become a popular research topic in computer science and has been widely studied in both academia and industry given the increasing amount of network data in the recent years. However, the huge amount of network data has posed great challenges for efficient analysis. This motivates the advent of graph representation which maps the graph into a low-dimension vector space, keeping original graph structure and supporting graph inference. The investigation on efficient representation of a graph has profound theoretical significance and important realistic meaning, we therefore introduce some basic ideas in graph representation/network embedding as well as some representative models in this chapter.

preprint2020arXiv

Deep Visual Odometry with Adaptive Memory

We propose a novel deep visual odometry (VO) method that considers global information by selecting memory and refining poses. Existing learning-based methods take the VO task as a pure tracking problem via recovering camera poses from image snippets, leading to severe error accumulation. Global information is crucial for alleviating accumulated errors. However, it is challenging to effectively preserve such information for end-to-end systems. To deal with this challenge, we design an adaptive memory module, which progressively and adaptively saves the information from local to global in a neural analogue of memory, enabling our system to process long-term dependency. Benefiting from global information in the memory, previous results are further refined by an additional refining module. With the guidance of previous outputs, we adopt a spatial-temporal attention to select features for each view based on the co-visibility in feature domain. Specifically, our architecture consisting of Tracking, Remembering and Refining modules works beyond tracking. Experiments on the KITTI and TUM-RGBD datasets demonstrate that our approach outperforms state-of-the-art methods by large margins and produces competitive results against classic approaches in regular scenes. Moreover, our model achieves outstanding performance in challenging scenarios such as texture-less regions and abrupt motions, where classic algorithms tend to fail.

preprint2020arXiv

Design Choices for X-vector Based Speaker Anonymization

The recently proposed x-vector based anonymization scheme converts any input voice into that of a random pseudo-speaker. In this paper, we present a flexible pseudo-speaker selection technique as a baseline for the first VoicePrivacy Challenge. We explore several design choices for the distance metric between speakers, the region of x-vector space where the pseudo-speaker is picked, and gender selection. To assess the strength of anonymization achieved, we consider attackers using an x-vector based speaker verification system who may use original or anonymized speech for enrollment, depending on their knowledge of the anonymization scheme. The Equal Error Rate (EER) achieved by the attackers and the decoding Word Error Rate (WER) over anonymized data are reported as the measures of privacy and utility. Experiments are performed using datasets derived from LibriSpeech to find the optimal combination of design choices in terms of privacy and utility.

preprint2020arXiv

Domain Embedded Multi-model Generative Adversarial Networks for Image-based Face Inpainting

Prior knowledge of face shape and structure plays an important role in face inpainting. However, traditional face inpainting methods mainly focus on the generated image resolution of the missing portion without consideration of the special particularities of the human face explicitly and generally produce discordant facial parts. To solve this problem, we present a domain embedded multi-model generative adversarial model for inpainting of face images with large cropped regions. We firstly represent only face regions using the latent variable as the domain knowledge and combine it with the non-face parts textures to generate high-quality face images with plausible contents. Two adversarial discriminators are finally used to judge whether the generated distribution is close to the real distribution or not. It can not only synthesize novel image structures but also explicitly utilize the embedded face domain knowledge to generate better predictions with consistency on structures and appearance. Experiments on both CelebA and CelebA-HQ face datasets demonstrate that our proposed approach achieved state-of-the-art performance and generates higher quality inpainting results than existing ones.

preprint2020arXiv

Duplication of Windows Services

OS-level virtualization techniques virtualize system resources at the system call interface, has the distinct advantage of smaller run-time resource requirements as compared to HAL-level virtualization techniques, and thus forms an important building block for virtualizing parallel and distributed applications such as a HPC clusters. Because the Windows operating system puts certain critical functionalities in privileged user-level system service processes, a complete OS-level virtualization solution for the Windows platform requires duplication of such Windows service as Remote Procedure Call Server Service (RPCSS). As many implementation details of the Windows system services are proprietary, duplicating Windows system services becomes the key technical challenge for virtualizing the Windows platform at the OS level. Moreover, as a core component of cloud computing, IIS web server-related services need to be duplicated in containers (i.e., OS-level virtual machines), but so far there is no such scheme. In this paper, we thoroughly identify all issues that affect service duplication, and then propose the first known methodology to systematically duplicate both system and ordinary Windows services. Our experiments show that the methodology can duplicate a set of system and ordinary services on different versions of Windows OS.

preprint2020arXiv

Eco-evolutionary dynamics with environmental feedback: cooperation in a changing world

Eco-evolutionary game dynamics which characterizes the mutual interactions and the coupled evolutions of strategies and environments has been of growing interests in very recent years. Since such feedback loops widely exist in a range of coevolutionary systems, such as microbial systems, social-ecological system and psychological-economic system, recent modeling frameworks that unveil the oscillating dynamics of social dilemmas have great potential for practical applications. In this perspective article, we overview the latest progress of evolutionary game theory in this direction. We describe both mathematical methods and interdisciplinary applications across different fields. The ideas worthy of further consideration are discussed in prospects, with the central role of promoting cooperations in a changing world.

preprint2020arXiv

Efficiently computable bounds for magic state distillation

Magic-state distillation (or non-stabilizer state manipulation) is a crucial component in the leading approaches to realizing scalable, fault-tolerant, and universal quantum computation. Related to non-stabilizer state manipulation is the resource theory of non-stabilizer states, for which one of the goals is to characterize and quantify non-stabilizerness of a quantum state. In this paper, we introduce the family of thauma measures to quantify the amount of non-stabilizerness in a quantum state, and we exploit this family of measures to address several open questions in the resource theory of non-stabilizer states. As a first application, we establish the hypothesis testing thauma as an efficiently computable benchmark for the one-shot distillable non-stabilizerness, which in turn leads to a variety of bounds on the rate at which non-stabilizerness can be distilled, as well as on the overhead of magic-state distillation. We then prove that the max-thauma can be used as an efficiently computable tool in benchmarking the efficiency of magic-state distillation and that it can outperform pervious approaches based on mana. Finally, we use the min-thauma to bound a quantity known in the literature as the "regularized relative entropy of magic." As a consequence of this bound, we find that two classes of states with maximal mana, a previously established non-stabilizerness measure, cannot be interconverted in the asymptotic regime at a rate equal to one. This result resolves a basic question in the resource theory of non-stabilizer states and reveals a difference between the resource theory of non-stabilizer states and other resource theories such as entanglement and coherence.

preprint2020arXiv

Eigen-GNN: A Graph Structure Preserving Plug-in for GNNs

Graph Neural Networks (GNNs) are emerging machine learning models on graphs. Although sufficiently deep GNNs are shown theoretically capable of fully preserving graph structures, most existing GNN models in practice are shallow and essentially feature-centric. We show empirically and analytically that the existing shallow GNNs cannot preserve graph structures well. To overcome this fundamental challenge, we propose Eigen-GNN, a simple yet effective and general plug-in module to boost GNNs ability in preserving graph structures. Specifically, we integrate the eigenspace of graph structures with GNNs by treating GNNs as a type of dimensionality reduction and expanding the initial dimensionality reduction bases. Without needing to increase depths, Eigen-GNN possesses more flexibilities in handling both feature-driven and structure-driven tasks since the initial bases contain both node features and graph structures. We present extensive experimental results to demonstrate the effectiveness of Eigen-GNN for tasks including node classification, link prediction, and graph isomorphism tests.

preprint2020arXiv

Energy Management and Trajectory Optimization for UAV-Enabled Legitimate Monitoring Systems

Thanks to their quick placement and high flexibility, unmanned aerial vehicles (UAVs) can be very useful in the current and future wireless communication systems. With a growing number of smart devices and infrastructure-free communication networks, it is necessary to legitimately monitor these networks to prevent crimes. In this paper, a novel framework is proposed to exploit the flexibility of the UAV for legitimate monitoring via joint trajectory design and energy management. The system includes a suspicious transmission link with a terrestrial transmitter and a terrestrial receiver, and a UAV to monitor the suspicious link. The UAV can adjust its positions and send jamming signal to the suspicious receiver to ensure successful eavesdropping. Based on this model, we first develop an approach to minimize the overall jamming energy consumption of the UAV. Building on a judicious (re-)formulation, an alternating optimization approach is developed to compute a locally optimal solution in polynomial time. Furthermore, we model and include the propulsion power to minimize the overall energy consumption of the UAV. Leveraging the successive convex approximation method, an effective iterative approach is developed to find a feasible solution fulfilling the Karush-Kuhn-Tucker (KKT) conditions. Extensive numerical results are provided to verify the merits of the proposed schemes.

preprint2020arXiv

Entangling Nuclear Spins by Dissipation in a Solid-state System

Entanglement is a fascinating feature of quantum mechanics and a key ingredient in most quantum information processing tasks. Yet the generation of entanglement is usually hampered by undesired dissipation owing to the inevitable coupling of a system with its environment. Here, we report an experiment on how to entangle two $^{13}$C nuclear spins via engineered dissipation in a nitrogen-vacancy system. We utilize the electron spin as an ancilla, and combine unitary processes together with optical pumping of the ancilla to implement the engineered dissipation and deterministically produce an entangled state of the two nuclear spins, independent of their initial states. Our experiment demonstrates the power of engineered dissipation as a tool for generation of multi-qubit entanglement in solid-state systems.

preprint2020arXiv

Evolution of Ethereum: A Temporal Graph Perspective

Ethereum is one of the most popular blockchain systems that supports more than half a million transactions every day and fosters miscellaneous decentralized applications with its Turing-complete smart contract machine. Whereas it remains mysterious what the transaction pattern of Ethereum is and how it evolves over time. In this paper, we study the evolutionary behavior of Ethereum transactions from a temporal graph point of view. We first develop a data analytics platform to collect external transactions associated with users as well as internal transactions initiated by smart contracts. Three types of temporal graphs, user-to-user, contract-to-contract and user-contract graphs, are constructed according to trading relationship and are segmented with an appropriate time window. We observe a strong correlation between the size of user-to-user transaction graph and the average Ether price in a time window, while no evidence of such linkage is shown at the average degree, average edge weights and average triplet closure duration. The macroscopic and microscopic burstiness of Ethereum transactions is validated. We analyze the Gini indexes of the transaction graphs and the user wealth in which Ethereum is found to be very unfair since the very beginning, in a sense, "the rich is already very rich".

preprint2020arXiv

Ferroelastic-switching-driven colossal shear strain and piezoelectricity in a hybrid ferroelectric

Materials that can produce large controllable strains are widely used in shape memory devices, actuators and sensors. Great efforts have been made to improve the strain outputs of various material systems. Among them, ferroelastic transitions underpin giant reversible strains in electrically-driven ferro/piezoelectrics and thermally- or magneticallydriven shape memory alloys. However, large-strain ferroelastic switching in conventional ferroelectrics is very challenging while magnetic and thermal controls are not desirable for applications. Here, we demonstrate an unprecedentedly large shear strain up to 21.5 % in a hybrid ferroelectric, C6H5N(CH3)3CdCl3. The strain response is about two orders of magnitude higher than those of top-performing conventional ferroelectric polymers and oxides. It is achieved via inorganic bond switching and facilitated by the structural confinement of the large organic moieties, which prevents the undesired 180-degree polarization switching. Furthermore, Br substitution can effectively soften the bonds and result in giant shear piezoelectric coefficient (d35 ~ 4800 pm/V) in Br-rich end of the solid solution, C6H5N(CH3)3CdBr3xCl3(1-x). The superior electromechanical properties of the compounds promise their potential in lightweight and high energy density devices, and the strategy described here should inspire the development of next-generation piezoelectrics and electroactive materials based on hybrid ferroelectrics.

preprint2020arXiv

Forecast for FAST: from Galaxies Survey to Intensity Mapping

The Five-Hundred-Meter Aperture Spherical Radio Telescope(FAST) is the largest single-dish radio telescope in the world. In this paper, we make forecast on the FAST HI large scale structure survey by mock observations. We consider a drift scan survey with the L-band 19 beam receiver, which may be commensal with the pulsar search and Galactic HI survey. We also consider surveys at lower frequency, either using the current single feed wide band receiver, or a future multi-beam phased array feed (PAF) in the UHF band. We estimate the number density of detected HI galaxies and the measurement error in positions, the precision of the surveys are evaluated using both Fisher matrix and simulated observations. The measurement error in the HI galaxy power spectrum is estimated, and we find that the error is relatively large even at moderate redshifts, as the number of positively detected galaxies drops drastically with increasing redshift. However, good cosmological measurement could be obtained with the intensity mapping technique where the large scale HI distribution is measured without resolving individual galaxies. The figure of merit (FoM) for the dark energy equation of state with different observation times are estimated, we find that with the existing L-band multi-beam receiver, a good measurement of low redshift large scale structure can be obtained, which complements the existing optical surveys. With a PAF in the UHF band, the constraint can be much stronger, reaching the level of a dark energy task force (DETF) stage IV experiment.

preprint2020arXiv

Frustratingly Simple Few-Shot Object Detection

Detecting rare objects from a few examples is an emerging problem. Prior works show meta-learning is a promising approach. But, fine-tuning techniques have drawn scant attention. We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task. Such a simple approach outperforms the meta-learning methods by roughly 2~20 points on current benchmarks and sometimes even doubles the accuracy of the prior methods. However, the high variance in the few samples often leads to the unreliability of existing benchmarks. We revise the evaluation protocols by sampling multiple groups of training examples to obtain stable comparisons and build new benchmarks based on three datasets: PASCAL VOC, COCO and LVIS. Again, our fine-tuning approach establishes a new state of the art on the revised benchmarks. The code as well as the pretrained models are available at https://github.com/ucbdrive/few-shot-object-detection.

preprint2020arXiv

Generating Synthetic Magnetism via Floquet Engineering Auxiliary Qubits in Phonon-Cavity-Based Lattice

Gauge magnetic fields have a close relation to breaking time-reversal symmetry in condensed matter. In the present of the gauge fields, we might observe nonreciprocal and topological transport. Inspired by these, there is a growing effort to realize exotic transport phenomena in optical and acoustic systems. However, due to charge neutrality, realizing analog magnetic flux for phonons in nanoscale systems is still challenging in both theoretical and experimental studies. Here we propose a novel mechanism to generate synthetic magnetic field for phonon lattice by Floquet engineering auxiliary qubits. We find that, a longitudinal Floquet drive on the qubit will produce a resonant coupling between two detuned acoustic cavities. Specially, the phase encoded into the longitudinal drive can exactly be transformed into the phonon-phonon hopping. Our proposal is general and can be realized in various types of artificial hybrid quantum systems. Moreover, by taking surface-acoustic-wave (SAW) cavities for example, we propose how to generate synthetic magnetic flux for phonon transport. In the present of synthetic magnetic flux, the time-reversal symmetry will be broken, which allows to realize the circulator transport and analog Aharonov-Bohm effects for acoustic waves. Last, we demonstrate that our proposal can be scaled to simulate topological states of matter in quantum acoustodynamics system.

preprint2020arXiv

Generative Adversarial Zero-Shot Relational Learning for Knowledge Graphs

Large-scale knowledge graphs (KGs) are shown to become more important in current information systems. To expand the coverage of KGs, previous studies on knowledge graph completion need to collect adequate training instances for newly-added relations. In this paper, we consider a novel formulation, zero-shot learning, to free this cumbersome curation. For newly-added relations, we attempt to learn their semantic features from their text descriptions and hence recognize the facts of unseen relations with no examples being seen. For this purpose, we leverage Generative Adversarial Networks (GANs) to establish the connection between text and knowledge graph domain: The generator learns to generate the reasonable relation embeddings merely with noisy text descriptions. Under this setting, zero-shot learning is naturally converted to a traditional supervised classification task. Empirically, our method is model-agnostic that could be potentially applied to any version of KG embeddings, and consistently yields performance improvements on NELL and Wiki dataset.

preprint2020arXiv

High-fidelity geometric gate for silicon-based spin qubits

High-fidelity manipulation is the key for the physical realization of fault-tolerant quantum computation. Here, we present a protocol to realize universal nonadiabatic geometric gates for silicon-based spin qubits. We find that the advantage of geometric gates over dynamical gates depends crucially on the evolution loop for the construction of the geometric phase. Under appropriate evolution loops, both the geometric single-qubit gates and the CNOT gate can outperform their dynamical counterparts for both systematic and detuning noises. We also perform randomized benchmarking using noise amplitudes consistent with experiments in silicon. For the static noise model, the averaged fidelities of geometric gates are around 99.90\% or above, while for the time-dependent $1/f$-type noise, the fidelities are around 99.98\% when only the detuning noise is present. We also show that the improvement in fidelities of the geometric gates over dynamical ones typically increases with the exponent $α$ of the $1/f$ noise, and the ratio can be as high as 4 when $α\approx 3$. Our results suggest that geometric gates with judiciously chosen evolution loops can be a powerful way to realize high-fidelity quantum gates.

preprint2020arXiv

How fine can fine-tuning be? Learning efficient language models

State-of-the-art performance on language understanding tasks is now achieved with increasingly large networks; the current record holder has billions of parameters. Given a language model pre-trained on massive unlabeled text corpora, only very light supervised fine-tuning is needed to learn a task: the number of fine-tuning steps is typically five orders of magnitude lower than the total parameter count. Does this mean that fine-tuning only introduces small differences from the pre-trained model in the parameter space? If so, can one avoid storing and computing an entire model for each task? In this work, we address these questions by using Bidirectional Encoder Representations from Transformers (BERT) as an example. As expected, we find that the fine-tuned models are close in parameter space to the pre-trained one, with the closeness varying from layer to layer. We show that it suffices to fine-tune only the most critical layers. Further, we find that there are surprisingly many good solutions in the set of sparsified versions of the pre-trained model. As a result, fine-tuning of huge language models can be achieved by simply setting a certain number of entries in certain layers of the pre-trained parameters to zero, saving both task-specific parameter storage and computational cost.

preprint2020arXiv

Influence of Laser Intensity Fluctuation on Single-Cesium Atom Trapping Lifetime in a 1064-nm Microscopic Optical Tweezer

An optical tweezer composed of a strongly focused single-spatial-mode Gaussian beam of a red-detuned 1064-nm laser can confine a single-cesium (Cs) atom at the strongest point of the light intensity. We can use this for coherent manipulation of single-quantum bits and single-photon sources. The trapping lifetime of the atoms in the optical tweezers is very short due to the impact of the background atoms, the laser intensity fluctuation of optical tweezer and the residual thermal motion of the atoms. In this paper, we analyzed the influence of the background pressure, the trap frequency of optical tweezers and the parametric heating of the optical tweezer on the atomic trapping lifetime. Combined with the external feedback loop based on an acousto-optical modulator (AOM), the intensity fluctuation of the 1064-nm laser in the time domain was suppressed from $\pm$ 3.360$\%$ to $\pm$ 0.064$\%$, and the suppression bandwidth reached approximately 33 kHz. The trapping lifetime of a single Cs atom in the microscopic optical tweezer was extended from 4.04 s to 6.34 s.

preprint2020arXiv

Interpretable CNNs for Object Classification

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part. Our method does not require additional annotations of object parts or textures for supervision. Instead, we use the same training data as traditional CNNs. Our method automatically assigns each interpretable filter in a high conv-layer with an object part of a certain category during the learning process. Such explicit knowledge representations in conv-layers of CNN help people clarify the logic encoded in the CNN, i.e., answering what patterns the CNN extracts from an input image and uses for prediction. We have tested our method using different benchmark CNNs with various structures to demonstrate the broad applicability of our method. Experiments have shown that our interpretable filters are much more semantically meaningful than traditional filters.

preprint2020arXiv

Joint Modeling of Chest Radiographs and Radiology Reports for Pulmonary Edema Assessment

We propose and demonstrate a novel machine learning algorithm that assesses pulmonary edema severity from chest radiographs. While large publicly available datasets of chest radiographs and free-text radiology reports exist, only limited numerical edema severity labels can be extracted from radiology reports. This is a significant challenge in learning such models for image classification. To take advantage of the rich information present in the radiology reports, we develop a neural network model that is trained on both images and free-text to assess pulmonary edema severity from chest radiographs at inference time. Our experimental results suggest that the joint image-text representation learning improves the performance of pulmonary edema assessment compared to a supervised model trained on images only. We also show the use of the text for explaining the image classification by the joint model. To the best of our knowledge, our approach is the first to leverage free-text radiology reports for improving the image model performance in this application. Our code is available at https://github.com/RayRuizhiLiao/joint_chestxray.

preprint2020arXiv

Joint User Identification, Channel Estimation, and Signal Detection for Grant-Free NOMA

For massive machine-type communications, centralized control may incur a prohibitively high overhead. Grant-free non-orthogonal multiple access (NOMA) provides possible solutions, yet poses new challenges for efficient receiver design. In this paper, we develop a joint user identification, channel estimation, and signal detection (JUICESD) algorithm. We divide the whole detection scheme into two modules: slot-wise multi-user detection (SMD) and combined signal and channel estimation (CSCE). SMD is designed to decouple the transmissions of different users by leveraging the approximate message passing (AMP) algorithms, and CSCE is designed to deal with the nonlinear coupling of activity state, channel coefficient and transmit signal of each user separately. To address the problem that the exact calculation of the messages exchanged within CSCE and between the two modules is complicated due to phase ambiguity issues, this paper proposes a rotationally invariant Gaussian mixture (RIGM) model, and develops an efficient JUICESD-RIGM algorithm. JUICESD-RIGM achieves a performance close to JUICESD with a much lower complexity. Capitalizing on the feature of RIGM, we further analyze the performance of JUICESD-RIGM with state evolution techniques. Numerical results demonstrate that the proposed algorithms achieve a significant performance improvement over the existing alternatives, and the derived state evolution method predicts the system performance accurately.

preprint2020arXiv

Learning Tuple Compatibility for Conditional OutfitRecommendation

Outfit recommendation requires the answers of some challenging outfit compatibility questions such as 'Which pair of boots and school bag go well with my jeans and sweater?'. It is more complicated than conventional similarity search, and needs to consider not only visual aesthetics but also the intrinsic fine-grained and multi-category nature of fashion items. Some existing approaches solve the problem through sequential models or learning pair-wise distances between items. However, most of them only consider coarse category information in defining fashion compatibility while neglecting the fine-grained category information often desired in practical applications. To better define the fashion compatibility and more flexibly meet different needs, we propose a novel problem of learning compatibility among multiple tuples (each consisting of an item and category pair), and recommending fashion items following the category choices from customers. Our contributions include: 1) Designing a Mixed Category Attention Net (MCAN) which integrates both fine-grained and coarse category information into recommendation and learns the compatibility among fashion tuples. MCAN can explicitly and effectively generate diverse and controllable recommendations based on need. 2) Contributing a new dataset IQON, which follows eastern culture and can be used to test the generalization of recommendation systems. Our extensive experiments on a reference dataset Polyvore and our dataset IQON demonstrate that our method significantly outperforms state-of-the-art recommendation methods.

preprint2020arXiv

Lepton Flavor Mixing and CP Violation in the Minimal Type-(I+II) Seesaw Model with a Modular $A_4$ Symmetry

In this paper, we study the implications of the modular $A^{}_4$ flavor symmetry in constructing a supersymmetric minimal type-(I+II) seesaw model, in which only one right-handed neutrino and two Higgs triplets are introduced to account for the tiny neutrino masses, flavor mixing and CP violation. The right-handed neutrino as well as the Higgs triplets in this model are assigned into the trivial one-dimensional irreducible representation of the modular group $A^{}_{4}$. We show that the individual contributions to the neutrino masses from the right-handed neutrino and the Higgs triplet are comparable. We also find that the neutrino mass matrix can possess an approximate $μ-τ$ reflection symmetry for some specific values of free model parameters. Moreover, our model predicts relatively large masses of three light neutrinos, thus can be easily tested in future neutrino experiments.

preprint2020arXiv

Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences

We have been investigating rakugo speech synthesis as a challenging example of speech synthesis that entertains audiences. Rakugo is a traditional Japanese form of verbal entertainment similar to a combination of one-person stand-up comedy and comic storytelling and is popular even today. In rakugo, a performer plays multiple characters, and conversations or dialogues between the characters make the story progress. To investigate how close the quality of synthesized rakugo speech can approach that of professionals' speech, we modeled rakugo speech using Tacotron 2, a state-of-the-art speech synthesis system that can produce speech that sounds as natural as human speech albeit under limited conditions, and an enhanced version of it with self-attention to better consider long-term dependencies. We also used global style tokens and manually labeled context features to enrich speaking styles. Through a listening test, we measured not only naturalness but also distinguishability of characters, understandability of the content, and the degree of entertainment. Although we found that the speech synthesis models could not yet reach the professional level, the results of the listening test provided interesting insights: 1) we should not focus only on the naturalness of synthesized speech but also the distinguishability of characters and the understandability of the content to further entertain audiences; 2) the fundamental frequency (fo) expressions of synthesized speech are poorer than those of human speech, and more entertaining speech should have richer fo expression. Although there is room for improvement, we believe this is an important stepping stone toward achieving entertaining speech synthesis at the professional level.

preprint2020arXiv

More Practical and Adaptive Algorithms for Online Quantum State Learning

Online quantum state learning is a recently proposed problem by Aaronson et al. (2018), where the learner sequentially predicts $n$-qubit quantum states based on given measurements on states and noisy outcomes. In the previous work, the algorithms are worst-case optimal in general but fail in achieving tighter bounds in certain simpler or more practical cases. In this paper, we develop algorithms to advance the online learning of quantum states. First, we show that Regularized Follow-the-Leader (RFTL) method with Tallis-2 entropy can achieve an $O(\sqrt{MT})$ total loss with perfect hindsight on the first $T$ measurements with maximum rank $M$. This regret bound depends only on the maximum rank $M$ of measurements rather than the number of qubits, which takes advantage of low-rank measurements. Second, we propose a parameter-free algorithm based on a classical adjusting learning rate schedule that can achieve a regret depending on the loss of best states in hindsight, which takes advantage of low noisy outcomes. Besides these more adaptive bounds, we also show that our RFTL with Tallis-2 entropy algorithm can be implemented efficiently on near-term quantum computing devices, which is not achievable in previous works.

preprint2020arXiv

Multi-modal Deep Analysis for Multimedia

With the rapid development of Internet and multimedia services in the past decade, a huge amount of user-generated and service provider-generated multimedia data become available. These data are heterogeneous and multi-modal in nature, imposing great challenges for processing and analyzing them. Multi-modal data consist of a mixture of various types of data from different modalities such as texts, images, videos, audios etc. In this article, we present a deep and comprehensive overview for multi-modal analysis in multimedia. We introduce two scientific research problems, data-driven correlational representation and knowledge-guided fusion for multimedia analysis. To address the two scientific problems, we investigate them from the following aspects: 1) multi-modal correlational representation: multi-modal fusion of data across different modalities, and 2) multi-modal data and knowledge fusion: multi-modal fusion of data with domain knowledge. More specifically, on data-driven correlational representation, we highlight three important categories of methods, such as multi-modal deep representation, multi-modal transfer learning, and multi-modal hashing. On knowledge-guided fusion, we discuss the approaches for fusing knowledge with data and four exemplar applications that require various kinds of domain knowledge, including multi-modal visual question answering, multi-modal video summarization, multi-modal visual pattern mining and multi-modal recommendation. Finally, we bring forward our insights and future research directions.

preprint2020arXiv

Nearly nondestructive thermometry of labeled cold atoms and application to isotropic laser cooling

We have designed and implemented a straightforward method to deterministically measure the temperature of the selected segment of a cold atom ensemble, and we have also developed an upgrade in the form of nondestructive thermometry. The essence is to monitor the thermal expansion of the targeted cold atoms after labeling them through manipulating the internal states, and the nondestructive property relies upon the nearly lossless detection via driving a cycling transition. For cold atoms subject to isotropic laser cooling, this method has the unique capability of addressing only the atoms on the optical detection axis within the enclosure, which is exactly the part we care about in major applications such as atomic clock or quantum sensing. Furthermore, our results confirm the sub-Doppler cooling features in isotropic laser cooling, and we have investigated the relevant cooling properties. Meanwhile, we have applied the recently developed optical configuration with the cooling laser injection in the form of hollow beams, which helps to enhance the cooling performance and accumulate more cold atoms in the central regions.

preprint2020arXiv

New Constructions of Optimal Locally Repairable Codes with Super-Linear Length

As an important coding scheme in modern distributed storage systems, locally repairable codes (LRCs) have attracted a lot of attentions from perspectives of both practical applications and theoretical research. As a major topic in the research of LRCs, bounds and constructions of the corresponding optimal codes are of particular concerns. In this work, codes with $(r,δ)$-locality which have optimal minimal distance w.r.t. the bound given by Prakash et al. \cite{Prakash2012Optimal} are considered. Through parity check matrix approach, constructions of both optimal $(r,δ)$-LRCs with all symbol locality ($(r,δ)_a$-LRCs) and optimal $(r,δ)$-LRCs with information locality ($(r,δ)_i$-LRCs) are provided. As a generalization of a work of Xing and Yuan \cite{XY19}, these constructions are built on a connection between sparse hypergraphs and optimal $(r,δ)$-LRCs. With the help of constructions of large sparse hypergraphs, the length of codes constructed can be super-linear in the alphabet size. This improves upon previous constructions when the minimal distance of the code is at least $3δ+1$. As two applications, optimal H-LRCs with super-linear length and GSD codes with unbounded length are also constructed.

preprint2020arXiv

On Lattice Packings and Coverings of Asymmetric Limited-Magnitude Balls

We construct integer error-correcting codes and covering codes for the limited-magnitude error channel with more than one error. The codes are lattices that pack or cover the space with the appropriate error ball. Some of the constructions attain an asymptotic packing/covering density that is constant. The results are obtained via various methods, including the use of codes in the Hamming metric, modular $B_t$-sequences, $2$-fold Sidon sets, and sets avoiding arithmetic progression.

preprint2020arXiv

Post-Heat Treatment Design of High-Strength Low-Alloy Steels Processed by Laser Powder Bed Fusion

In this study, a post-heat treatment design for additively manufactured copper-bearing high-strength low-alloy (HSLA)-100 steel is performed by understanding the process-structure-property relationships. Hot isostatic pressing (HIP) is designed to reduce the porosity from 3% to less than 1% for the HSLA-100 steel processed by laser powder bed fusion (LPBF). Quenching dilatometry is employed to design the HIP parameters with the optimized cooling rate for the maximum amount of martensite transformed after HIP. Afterward, a post-heat treatment step with cyclic re-austenitization is introduced for an effective grain refinement to compensate the coarsened microstructure after HIP. Finally, tempering is optimized through microstructure characterization and microhardness. A two-fold increase in the yield strength of the HSLA with tailored microstructure during post-heat treatment is achieved in comparison with the as-built HSLA.

preprint2020arXiv

Public discourse and social network echo chambers driven by socio-cognitive biases

In recent years, social media has increasingly become an important platform for political campaigns, especially elections. It remains elusive how exactly public discourse is driven by the intricate interplay between individual socio-cognitive biases, dueling campaign efforts, and social media platforms. We examine this complex socio-political process by integrating observed retweet networks from the 2016 political networks with an agent-based model of political opinion formation and network structure. Here we show that the range of political viewpoints individuals are willing to consider is a key determinant in the formation of polarized networks and the emergence of echo chambers. We also find that winning majority support in public discourse is determined by both the effort exerted by campaigns and the relative ideological positioning of opposing campaigns. Our results demonstrate how public discourse and political polarization can be modeled as an interactive process of shifting individual opinions, evolving social networks, and political campaigns.

preprint2020arXiv

Quantum algorithms for hedging and the learning of Ising models

A paradigmatic algorithm for online learning is the Hedge algorithm by Freund and Schapire. An allocation into different strategies is chosen for multiple rounds and each round incurs corresponding losses for each strategy. The algorithm obtains a favorable guarantee for the total losses even in an adversarial situation. This work presents quantum algorithms for such online learning in an oracular setting. For $T$ time steps and $N$ strategies, we exhibit run times of about $O \left ({\rm poly} (T) \sqrt{N} \right)$ for estimating the losses and for betting on individual strategies by sampling. In addition, we discuss a quantum analogue of the Sparsitron, a machine learning algorithm based on the Hedge algorithm. The quantum algorithm inherits the provable learning guarantees from the classical algorithm and exhibits polynomial speedups. The speedups may find relevance in finance, for example for hedging risks, and machine learning, for example for learning generalized linear models or Ising models.

preprint2020arXiv

Reject Illegal Inputs with Generative Classifier Derived from Any Discriminative Classifier

Generative classifiers have been shown promising to detect illegal inputs including adversarial examples and out-of-distribution samples. Supervised Deep Infomax~(SDIM) is a scalable end-to-end framework to learn generative classifiers. In this paper, we propose a modification of SDIM termed SDIM-\emph{logit}. Instead of training generative classifier from scratch, SDIM-\emph{logit} first takes as input the logits produced any given discriminative classifier, and generate logit representations; then a generative classifier is derived by imposing statistical constraints on logit representations. SDIM-\emph{logit} could inherit the performance of the discriminative classifier without loss. SDIM-\emph{logit} incurs a negligible number of additional parameters, and can be efficiently trained with base classifiers fixed. We perform \emph{classification with rejection}, where test samples whose class conditionals are smaller than pre-chosen thresholds will be rejected without predictions. Experiments on illegal inputs, including adversarial examples, samples with common corruptions, and out-of-distribution~(OOD) samples show that allowed to reject a portion of test samples, SDIM-\emph{logit} significantly improves the performance on the left test sets.

preprint2020arXiv

Resonant Beam Communications with Photovoltaic Receiver for Optical Data and Power Transfer

The vision and requirements of the sixth generation (6G) mobile communication systems are expected to adopt freespace optical communication (FSO) and wireless power transfer (WPT). The laser-based WPT or wireless information transfer (WIT) usually faces the challenges of mobility and safety. We present a mobile and safe resonant beam communication (RBCom) system, which can realize high-rate simultaneous wireless information and power transfer (SWIPT). We propose an analytical model to depict its carrier beam and information transfer procedures. The numerical results show that RBCom can achieve more than 40 mW charging power and 1:6 Gbit/s channel capacity with orthogonal frequency division multiplexing (OFDM) scheme, which can be applied in future scenario where power and high-rate data are simultaneously desired.

preprint2020arXiv

Reverberation Modeling for Source-Filter-based Neural Vocoder

This paper presents a reverberation module for source-filter-based neural vocoders that improves the performance of reverberant effect modeling. This module uses the output waveform of neural vocoders as an input and produces a reverberant waveform by convolving the input with a room impulse response (RIR). We propose two approaches to parameterizing and estimating the RIR. The first approach assumes a global time-invariant (GTI) RIR and directly learns the values of the RIR on a training dataset. The second approach assumes an utterance-level time-variant (UTV) RIR, which is invariant within one utterance but varies across utterances, and uses another neural network to predict the RIR values. We add the proposed reverberation module to the phase spectrum predictor (PSP) of a HiNet vocoder and jointly train the model. Experimental results demonstrate that the proposed module was helpful for modeling the reverberation effect and improving the perceived quality of generated reverberant speech. The UTV-RIR was shown to be more robust than the GTI-RIR to unknown reverberation conditions and achieved a perceptually better reverberation effect.

preprint2020arXiv

REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments

One of the long-term challenges of robotics is to enable robots to interact with humans in the visual world via natural language, as humans are visual animals that communicate through language. Overcoming this challenge requires the ability to perform a wide variety of complex tasks in response to multifarious instructions from humans. In the hope that it might drive progress towards more flexible and powerful human interactions with robots, we propose a dataset of varied and complex robot tasks, described in natural language, in terms of objects visible in a large set of real images. Given an instruction, success requires navigating through a previously-unseen environment to identify an object. This represents a practical challenge, but one that closely reflects one of the core visual problems in robotics. Several state-of-the-art vision-and-language navigation, and referring-expression models are tested to verify the difficulty of this new task, but none of them show promising results because there are many fundamental differences between our task and previous ones. A novel Interactive Navigator-Pointer model is also proposed that provides a strong baseline on the task. The proposed model especially achieves the best performance on the unseen test split, but still leaves substantial room for improvement compared to the human performance.

preprint2020arXiv

Rydberg level shift due to the electric field generated by Rydberg atom collision induced ionization in cesium atomic ensemble

We experimentally studied the Rydberg level shift caused by the electric field, which is generated by Rydberg atom collision induced ionization in a cesium atomic ensemble. The density of charged particles caused by collisions between Rydberg atoms is changed by controlling the ground-state atomic density and optical excitation process. We measured the Rydberg level shift using Rydberg electromagnetically-induced-transparency (EIT) spectroscopy, and interpreted the physical origin using a semi-classical model. The experimental results are in good agreement with the numerical simulation. These energy shifts are important for the self-calibrated sensing of microwave field by the employing of Rydberg EIT. Moreover, in contrast to the resonant excitation case, narrow-linewidth spectroscopy with high signal-to-noise ratio would be useful for high-precision measurements.

preprint2020arXiv

Scattering medium: randomly packed pinhole cameras

When light travels through scattering media, speckles (spatially random distribution of fluctuated intensities) are formed due to the interference of light travelling along different optical paths, preventing the perception of structure, absolute location and dimension of a target within or on the other side of the medium. Currently, the prevailing techniques such as wavefront shaping, optical phase conjugation, scattering matrix measurement, and speckle autocorrelation imaging can only picture the target structure in the absence of prior information. Here we show that a scattering medium can be conceptualized as an assembly of randomly packed pinhole cameras, and the corresponding speckle pattern is a superposition of randomly shifted pinhole images. This provides a new perspective to bridge target, scattering medium, and speckle pattern, allowing one to localize and profile a target quantitatively from speckle patterns perceived from the other side of the scattering medium, which is impossible with all existing methods. The method also allows us to interpret some phenomena of diffusive light that are otherwise challenging to understand. For example, why the morphological appearance of speckle patterns changes with the target, why information is difficult to be extracted from thick scattering media, and what determines the capability of seeing through scattering media. In summary, the concept, whilst in its infancy, opens a new door to unveiling scattering media and information extraction from scattering media in real time.

preprint2020arXiv

Self-Supervised Deep Visual Odometry with Online Adaptation

Self-supervised VO methods have shown great success in jointly estimating camera pose and depth from videos. However, like most data-driven methods, existing VO networks suffer from a notable decrease in performance when confronted with scenes different from the training data, which makes them unsuitable for practical applications. In this paper, we propose an online meta-learning algorithm to enable VO networks to continuously adapt to new environments in a self-supervised manner. The proposed method utilizes convolutional long short-term memory (convLSTM) to aggregate rich spatial-temporal information in the past. The network is able to memorize and learn from its past experience for better estimation and fast adaptation to the current frame. When running VO in the open world, in order to deal with the changing environment, we propose an online feature alignment method by aligning feature distributions at different time. Our VO network is able to seamlessly adapt to different environments. Extensive experiments on unseen outdoor scenes, virtual to real world and outdoor to indoor environments demonstrate that our method consistently outperforms state-of-the-art self-supervised VO baselines considerably.

preprint2020arXiv

Spin-orbit coupling and spin-triplet pairing symmetry in $\mathrm{Sr_2 Ru O_4}$

Spin-orbit coupling (SOC) plays a crucial role in determining the spin structure of an odd parity psedospin-triplet Cooper pairing state. Here, we present a thorough study of how SOC lifts the degeneracy among different p-wave pseudospin-triplet pairing states in a widely used microscopic model for $\mathrm{Sr_2 Ru O_4}$, combining a Ginzburg-Landau (GL) free energy expansion, a symmetry analysis of the model, and numerical weak-coupling renormalization group (RG) and random phase approximation (RPA) calculations. These analyses are then used to critically re-examine previous numerical results on the stability of chiral p-wave pairing. The symmetry analysis can serve as a guide for future studies, especially numerical calculations, on the pairing instability in $\mathrm{Sr_2 Ru O_4}$ and can be useful for studying other multi-band spin-triplet superconductors where SOC plays an important role.

preprint2020arXiv

Stacking fault energy prediction for austenitic steels: thermodynamic modeling vs. machine learning

Stacking fault energy (SFE) is of the most critical microstructure attribute for controlling the deformation mechanism and optimizing mechanical properties of austenitic steels, while there are no accurate and straightforward computational tools for modeling it. In this work, we applied both thermodynamic modeling and machine learning to predict the stacking fault energy (SFE) for more than 300 austenitic steels. The comparison indicates a high need of improving low-temperature CALPHAD (CALculation of PHAse Diagrams) databases and interfacial energy prediction to enhance thermodynamic model reliability. The ensembled machine learning algorithms provide a more reliable prediction compared with thermodynamic and empirical models. Based on the statistical analysis of experimental results, only Ni and Fe have a moderate monotonic influence on SFE, while many other elements exhibit a complex effect that their influence on SFE may change with the alloy composition.

preprint2020arXiv

Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals

Recent years have seen growing efforts to develop spoofing countermeasures (CMs) to protect automatic speaker verification (ASV) systems from being deceived by manipulated or artificial inputs. The reliability of spoofing CMs is typically gauged using the equal error rate (EER) metric. The primitive EER fails to reflect application requirements and the impact of spoofing and CMs upon ASV and its use as a primary metric in traditional ASV research has long been abandoned in favour of risk-based approaches to assessment. This paper presents several new extensions to the tandem detection cost function (t-DCF), a recent risk-based approach to assess the reliability of spoofing CMs deployed in tandem with an ASV system. Extensions include a simplified version of the t-DCF with fewer parameters, an analysis of a special case for a fixed ASV system, simulations which give original insights into its interpretation and new analyses using the ASVspoof 2019 database. It is hoped that adoption of the t-DCF for the CM assessment will help to foster closer collaboration between the anti-spoofing and ASV research communities.

preprint2020arXiv

Task-Aware Feature Generation for Zero-Shot Compositional Learning

Visual concepts (e.g., red apple, big elephant) are often semantically compositional and each element of the compositions can be reused to construct novel concepts (e.g., red elephant). Compositional feature synthesis, which generates image feature distributions exploiting the semantic compositionality, is a promising approach to sample-efficient model generalization. In this work, we propose a task-aware feature generation (TFG) framework for compositional learning, which generates features of novel visual concepts by transferring knowledge from previously seen concepts. These synthetic features are then used to train a classifier to recognize novel concepts in a zero-shot manner. Our novel TFG design injects task-conditioned noise layer-by-layer, producing task-relevant variation at each level. We find the proposed generator design improves classification accuracy and sample efficiency. Our model establishes a new state of the art on three zero-shot compositional learning (ZSCL) benchmarks, outperforming the previous discriminative models by a large margin. Our model improves the performance of the prior arts by over 2x in the generalized ZSCL setting.

preprint2020arXiv

The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain

In this essay, we explore a point of intersection between deep learning and neuroscience, through the lens of large language models, transfer learning and network compression. Just like perceptual and cognitive neurophysiology has inspired effective deep neural network architectures which in turn make a useful model for understanding the brain, here we explore how biological neural development might inspire efficient and robust optimization procedures which in turn serve as a useful model for the maturation and aging of the brain.

preprint2020arXiv

The three-level coupled Maxwell-Bloch equations: rogue waves, semirational rogue waves and W-shaped solitons

In this paper the coupled Maxwell-Bloch equations which describe the propagation of two optical pulses in an optical medium with coherent three-level atoms are studied by Darboux transformation. The general nth-order rogue wave solution involving two different choices of multiple roots for the spectral characteristic equation and the multiparametric nth-order semirational solution are both obtained in terms of Schur polynomials. The explicit rogue wave solutions and semirational solutions from first to second order are provided. In contrast to the known Peregrine soliton, dark and four-petaled structures, some unusual patterns such as triple-hole, twisted-pair, composite four-petaled and composite dark rogue waves are put forward. Moreover, the interaction between dark-bright soliton and dark rogue wave and interaction between breather and dark rogue wave are shown. Further, the higher-order nonlinear superposition modes which feature triple and quadruple temporal-spatial distributions are presented. Finally, the state transition between rogue wave and W-shaped soliton is found where the modulation instability growth rate tends to zero under the low perturbation frequency. Particularly, the dark and double-peak W-shaped solitons are examined.

preprint2020arXiv

Time-dependent Hamiltonian simulation with $L^1$-norm scaling

The difficulty of simulating quantum dynamics depends on the norm of the Hamiltonian. When the Hamiltonian varies with time, the simulation complexity should only depend on this quantity instantaneously. We develop quantum simulation algorithms that exploit this intuition. For sparse Hamiltonian simulation, the gate complexity scales with the $L^1$ norm $\int_{0}^{t}\mathrm{d}τ\left\lVert H(τ)\right\lVert_{\max}$, whereas the best previous results scale with $t\max_{τ\in[0,t]}\left\lVert H(τ)\right\lVert_{\max}$. We also show analogous results for Hamiltonians that are linear combinations of unitaries. Our approaches thus provide an improvement over previous simulation algorithms that can be substantial when the Hamiltonian varies significantly. We introduce two new techniques: a classical sampler of time-dependent Hamiltonians and a rescaling principle for the Schrödinger equation. The rescaled Dyson-series algorithm is nearly optimal with respect to all parameters of interest, whereas the sampling-based approach is easier to realize for near-term simulation. These algorithms could potentially be applied to semi-classical simulations of scattering processes in quantum chemistry.

preprint2020arXiv

Tunable optomechanically induced transparency by controlling the dark-mode effect

We study tunable optomechanically induced transparency by controlling the dark-mode effect induced by two mechanical modes coupled to a common cavity field. This is realized by introducing a phase-dependent phonon-exchange interaction, which is used to form a loop-coupled configuration. Combining this phase-dependent coupling with the optomechanical interactions, the dark-mode effect can be controlled by the quantum interference effect. In particular, the dark-mode effect in this two-mechanical-mode optomechanical system can lead to a double-amplified optomechanically induced transparency (OMIT) window and a higher efficiency of the second-order sideband in comparison with the standard optomechanical system. This is because the effective mechanical decay rate related to the linewidth of the OMIT window becomes a twofold increase in the weak-coupling limit. When the dark-mode effect is broken, controllable double transparency windows appear and the second-order sideband, as well as the light delay or advance, is significantly enhanced. For an N-mechanical-mode optomechanical system, we find that in the presence of the dark-mode effect, the amplification multiple of the linewidth of the OMIT window is nearly proportional to the number of mechanical modes, and that the OMIT with a single window becomes the one with N tunable windows by breaking the dark-mode effect. The study will be useful in optical information storage within a large-frequency bandwidth and multichannel optical communication based on optomechanical systems.

preprint2020arXiv

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation

Visual navigation is a task of training an embodied agent by intelligently navigating to a target object (e.g., television) using only visual observations. A key challenge for current deep reinforcement learning models lies in the requirements for a large amount of training data. It is exceedingly expensive to construct sufficient 3D synthetic environments annotated with the target object information. In this paper, we focus on visual navigation in the low-resource setting, where we have only a few training environments annotated with object information. We propose a novel unsupervised reinforcement learning approach to learn transferable meta-skills (e.g., bypass obstacles, go straight) from unannotated environments without any supervisory signals. The agent can then fast adapt to visual navigation through learning a high-level master policy to combine these meta-skills, when the visual-navigation-specified reward is provided. Evaluation in the AI2-THOR environments shows that our method significantly outperforms the baseline by 53.34% relatively on SPL, and further qualitative analysis demonstrates that our method learns transferable motor primitives for visual navigation.

preprint2020arXiv

Using Cyclic Noise as the Source Signal for Neural Source-Filter-based Speech Waveform Model

Neural source-filter (NSF) waveform models generate speech waveforms by morphing sine-based source signals through dilated convolution in the time domain. Although the sine-based source signals help the NSF models to produce voiced sounds with specified pitch, the sine shape may constrain the generated waveform when the target voiced sounds are less periodic. In this paper, we propose a more flexible source signal called cyclic noise, a quasi-periodic noise sequence given by the convolution of a pulse train and a static random noise with a trainable decaying rate that controls the signal shape. We further propose a masked spectral loss to guide the NSF models to produce periodic voiced sounds from the cyclic noise-based source signal. Results from a large-scale listening test demonstrated the effectiveness of the cyclic noise and the masked spectral loss on speaker-independent NSF models in copy-synthesis experiments on the CMU ARCTIC database.

preprint2020arXiv

VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research

We present a new large-scale multilingual video description dataset, VATEX, which contains over 41,250 videos and 825,000 captions in both English and Chinese. Among the captions, there are over 206,000 English-Chinese parallel translation pairs. Compared to the widely-used MSR-VTT dataset, VATEX is multilingual, larger, linguistically complex, and more diverse in terms of both video and natural language descriptions. We also introduce two tasks for video-and-language research based on VATEX: (1) Multilingual Video Captioning, aimed at describing a video in various languages with a compact unified captioning model, and (2) Video-guided Machine Translation, to translate a source language description into the target language using the video information as additional spatiotemporal context. Extensive experiments on the VATEX dataset show that, first, the unified multilingual model can not only produce both English and Chinese descriptions for a video more efficiently, but also offer improved performance over the monolingual models. Furthermore, we demonstrate that the spatiotemporal video context can be effectively utilized to align source and target languages and thus assist machine translation. In the end, we discuss the potentials of using VATEX for other video-and-language research.

preprint2020arXiv

Zero-Shot Multi-Speaker Text-To-Speech with State-of-the-art Neural Speaker Embeddings

While speaker adaptation for end-to-end speech synthesis using speaker embeddings can produce good speaker similarity for speakers seen during training, there remains a gap for zero-shot adaptation to unseen speakers. We investigate multi-speaker modeling for end-to-end text-to-speech synthesis and study the effects of different types of state-of-the-art neural speaker embeddings on speaker similarity for unseen speakers. Learnable dictionary encoding-based speaker embeddings with angular softmax loss can improve equal error rates over x-vectors in a speaker verification task; these embeddings also improve speaker similarity and naturalness for unseen speakers when used for zero-shot adaptation to new speakers in end-to-end speech synthesis.

preprint2019arXiv

Elliptic Blowup Equations for 6d SCFTs. II: Exceptional Cases

The building blocks of 6d $(1,0)$ SCFTs include certain rank one theories with gauge group $G=SU(3),SO(8),F_4,E_{6,7,8}$. In this paper, we propose a universal recursion formula for the elliptic genera of all such theories. This formula is solved from the elliptic blowup equations introduced in our previous paper. We explicitly compute the elliptic genera and refined BPS invariants, which recover all previous results from topological string theory, modular bootstrap, Hilbert series, 2d quiver gauge theories and 4d $\mathcal{N}=2$ superconformal $H_{G}$ theories. We also observe an intriguing relation between the $k$-string elliptic genus and the Schur indices of rank $k$ $H_{G}$ SCFTs, as a generalization of Lockhart-Zotto's conjecture at the rank one cases. In a subsequent paper, we deal with all other non-Higgsable clusters with matters.

preprint2019arXiv

Elliptic Blowup Equations for 6d SCFTs. III: E-strings, M-strings and Chains

We establish the elliptic blowup equations for E-strings and M-strings and solve elliptic genera and refined BPS invariants from them. Such elliptic blowup equations can be derived from a path integral interpretation. We provide toric hypersurface construction for the Calabi-Yau geometries of M-strings and those of E-strings with up to three mass parameters turned on, as well as an approach to derive the perturbative prepotential directly from the local description of the Calabi-Yau threefolds. We also demonstrate how to systematically obtain blowup equations for all rank one 5d SCFTs from E-string by blow-down operations. Finally, we present blowup equations for E-M and M string chains.

preprint2019arXiv

Experimental Test of Leggett's Inequalities with Solid-State Spins

Bell's theorem states that no local hidden variable model is compatible with quantum mechanics. Surprisingly, even if we release the locality constraint, certain nonlocal hidden variable models, such as the one proposed by Leggett, may still be at variance with the predictions of quantum physics. Here, we report an experimental test of Leggett's nonlocal model with solid-state spins in a diamond nitrogen-vacancy center. We entangle an electron spin with a surrounding weakly coupled $^{13}C$ nuclear spin and observe that the entangled states violate Leggett-type inequalities by more than four and seven standard deviations for six and eight measurement settings, respectively. Our experimental results are in full agreement with quantum predictions and violate Leggett's nonlocal hidden variable inequality with a high level of confidence.

preprint2019arXiv

Hamiltonian bump-on-tail model: interpretation of EP/AE interaction

The Bump-on-Tail (BoT) model is often adopted to characterize the non-linear interaction between fast ions and Alfvén Eigenmodes. A multi-beam Hamiltonian approach to the BoT model is tested here as paradigm for the description of these phenomena.

preprint2019arXiv

Homophily on social networks changes evolutionary advantage in competitive information diffusion

Competitive information diffusion on large-scale social networks reveals fundamental characteristics of rumor contagions and has profound influence on public opinion formation. There has been growing interest in exploring dynamical mechanisms of the competing evolutions recently. Nevertheless, the impacts of population homophily, which determines powerful collective human behaviors, remains unclear. In this paper, we incorporate homophily effects into a modified competitive ignorant-spreader-ignorant (SIS) rumor diffusion model with generalized population preference. Using microscopic Markov chain approach, we first derive the phase diagram of competing diffusion results and examine how competitive information spreads and evolves on social networks. We then explore the detailed effects of homophily, which is modeled by a rewiring mechanism. Results show that homophily promotes the formation of divided "echo chambers" and protects the disadvantaged information from extinction, which further changes or even reverses the evolutionary advantage, i.e., the difference of final proportions of the competitive information. We highlight the conclusion that the reversals may happen only when the initially disadvantaged information has stronger transmission ability, owning diffusion advantage over the other one. Our framework provides profound insight into competing dynamics with population homophily, which may pave ways for further controlling misinformation and guiding public belief systems. Moreover, the reversing condition sheds light on designing effective competing strategies in many real scenarios.

preprint2019arXiv

Modeling and Analysis of Energy Harvesting and Smart Grid-Powered Wireless Communication Networks: A Contemporary Survey

The advancements in smart power grid and the advocation of ``green communications'' have inspired the wireless communication networks to harness energy from ambient environments and operate in an energy-efficient manner for economic and ecological benefits. This article presents a contemporary review of recent breakthroughs on the utilization, redistribution, trading and planning of energy harvested in future wireless networks interoperating with smart grids. This article starts with classical models of renewable energy harvesting technologies. We embark on constrained operation and optimization of different energy harvesting wireless systems, such as point-to-point, multipoint-to-point, multipoint-to-multipoint, multi-hop, and multi-cell systems. We also review wireless power and information transfer technologies which provide a special implementation of energy harvesting wireless communications. A significant part of the article is devoted to the redistribution of redundant (unused) energy harvested within cellular networks, the energy planning under dynamic pricing when smart grids are in place, and two-way energy trading between cellular networks and smart grids. Applications of different optimization tools, such as convex optimization, Lagrangian dual-based method, subgradient method, and Lyapunov-based online optimization, are compared. This article also collates the potential applications of energy harvesting techniques in emerging (or upcoming) 5G/B5G communication systems. It is revealed that an effective redistribution and two-way trading of energy can significantly reduce the electricity bills of wireless service providers and decrease the consumption of brown energy. A list of interesting research directions are provided, requiring further investigation.

preprint2019arXiv

On the Properties of the Effective Jarlskog Invariant for Three-flavor Neutrino Oscillations in Matter

In this paper, we show that the ratio of the effective Jarlskog invariant $\widetilde{\cal J}$ for leptonic CP violation in three-flavor neutrino oscillations in matter to its counterpart ${\cal J}$ in vacuum $\widetilde{\cal J}/{\cal J} \approx 1/(\hat{C}^{}_{12} \hat{C}^{}_{13})$ holds as an excellent approximation, where $\hat{C}^{}_{12} \equiv \sqrt{1 - 2 \hat{A}^{}_* \cos 2θ^{}_{12} + \hat{A}^2_*}$ with $\hat{A}^{}_* \equiv a\cos^2 θ^{}_{13}/Δ^{}_{21}$ and $\hat{C}^{}_{13} \equiv \sqrt{1 - 2 A^{}_{\rm c} \cos 2θ^{}_{13} + A^2_{\rm c}}$ with $A^{}_{\rm c} \equiv a/Δ^{}_{\rm c}$. Here $Δ^{}_{ij} \equiv m^2_i - m^2_j$ (for $ij = 21, 31, 32$) stand for the neutrino mass-squared differences in vacuum and $θ^{}_{ij}$ (for $ij = 12, 13, 23$) are the neutrino mixing angles in vacuum, while $Δ^{}_{\rm c} \equiv Δ^{}_{31}\cos^2θ^{}_{12} + Δ^{}_{32} \sin^2 θ^{}_{12}$ and the matter parameter $a \equiv 2\sqrt{2}G^{}_{\rm F} N^{}_e E$ are defined. This result has been explicitly derived by improving the previous analytical solutions to the renormalization-group equations of effective neutrino masses and mixing parameters in matter. Furthermore, as a practical application, such a simple analytical formula has been implemented to understand the existence and location of the extrema of $\widetilde{\cal J}$.

preprint2019arXiv

Overcome Competitive Exclusion in Ecosystems

Explaining biodiversity in nature is a fundamental problem in ecology. An outstanding challenge is embodied in the so-called Competitive Exclusion Principle: two species competing for one limiting resource cannot coexist at constant population densities, or more generally, the number of consumer species in steady coexistence cannot exceed that of resources. The fact that competitive exclusion is rarely observed in natural ecosystems has not been fully understood. Here we show that by forming chasing triplets among the consumers and resources in the consumption process, the Competitive Exclusion Principle can be naturally violated. The modeling framework developed here is broadly applicable and can be used to explain the biodiversity of many consumer-resource ecosystems and hence deepens our understanding of biodiversity in nature.

preprint2019arXiv

Privacy-preserving Distributed Machine Learning via Local Randomization and ADMM Perturbation

With the proliferation of training data, distributed machine learning (DML) is becoming more competent for large-scale learning tasks. However, privacy concerns have to be given priority in DML, since training data may contain sensitive information of users. In this paper, we propose a privacy-preserving ADMM-based DML framework with two novel features: First, we remove the assumption commonly made in the literature that the users trust the server collecting their data. Second, the framework provides heterogeneous privacy for users depending on data's sensitive levels and servers' trust degrees. The challenging issue is to keep the accumulation of privacy losses over ADMM iterations minimal. In the proposed framework, a local randomization approach, which is differentially private, is adopted to provide users with self-controlled privacy guarantee for the most sensitive information. Further, the ADMM algorithm is perturbed through a combined noise-adding method, which simultaneously preserves privacy for users' less sensitive information and strengthens the privacy protection of the most sensitive information. We provide detailed analyses on the performance of the trained model according to its generalization error. Finally, we conduct extensive experiments using real-world datasets to validate the theoretical results and evaluate the classification performance of the proposed framework.

preprint2019arXiv

Quantum Channel Simulation and the Channel's Smooth Max-Information

We study the general framework of quantum channel simulation, that is, the ability of a quantum channel to simulate another one using different classes of codes. First, we show that the minimum error of simulation and the one-shot quantum simulation cost under no-signalling assisted codes are given by semidefinite programs. Second, we introduce the channel's smooth max-information, which can be seen as a one-shot generalization of the mutual information of a quantum channel. We provide an exact operational interpretation of the channel's smooth max-information as the one-shot quantum simulation cost under no-signalling assisted codes, which significantly simplifies the study of channel simulation and provides insights and bounds for the case under entanglement-assisted codes. Third, we derive the asymptotic equipartition property of the channel's smooth max-information; i.e., it converges to the quantum mutual information of the channel in the independent and identically distributed asymptotic limit. This implies the quantum reverse Shannon theorem in the presence of no-signalling correlations. Finally, we explore the simulation cost of various quantum channels.

preprint2019arXiv

Steering Eco-Evolutionary Games Dynamics with Manifold Control

Feedback loops between population dynamics of individuals and their ecological environment are ubiquitously found in nature, and have shown profound effects on the resulting eco-evolutionary dynamics. Incorporating linear environmental feedback law into replicator dynamics of two-player games, recent theoretical studies shed light on understanding the oscillating dynamics of social dilemma. However, detailed effects of more general nonlinear feedback loops in multi-player games, which is more common especially in microbial systems, remain unclear. Here, we focus on ecological public goods games with environmental feedbacks driven by nonlinear selection gradient. Unlike previous models, multiple segments of stable and unstable equilibrium manifolds can emerge from the population dynamical systems. We find that a larger relative asymmetrical feedback speed for group interactions centered on cooperators not only accelerates the convergence of stable manifolds, but also increases the attraction basin of these stable manifolds. Furthermore, our work offers an innovative manifold control approach: by designing appropriate switching control laws, we are able to steer the eco-evolutionary dynamics to any desired population states. Our mathematical framework is an important generalization and complement to coevolutionary game dynamics, and also fills the theoretical gap in guiding the widespread problem of population state control in microbial experiments.

preprint2017arXiv

A strengthened inequality of Alon-Babai-Suzuki's conjecture on set systems with restricted intersections modulo p

Let $K=\{k_1,k_2,\ldots,k_r\}$ and $L=\{l_1,l_2,\ldots,l_s\}$ be disjoint subsets of $\{0,1,\ldots,p-1\}$, where $p$ is a prime and $A=\{A_1,A_2,\ldots,A_m\}$ be a family of subsets of $[n]$ such that $|A_i|\pmod{p}\in K$ for all $A_i\in A$ and $|A_i\cap A_j|\pmod{p}\in L$ for $i\ne j$. In 1991, Alon, Babai and Suzuki conjectured that if $n\geq s+\max_{1\leq i\leq r} k_i$, then $|A|\leq {n\choose s}+{n\choose s-1}+\cdots+{n\choose s-r+1}$. In 2000, Qian and Ray-Chaudhuri proved the conjecture under the condition $n\geq 2s-r$. In 2015, Hwang and Kim verified the conjecture of Alon, Babai and Suzuki. In this paper, we will prove that if $n\geq 2s-2r+1$ or $n\geq s+\max_{1\leq i\leq r}k_i$, then \[ |A|\leq{n-1\choose s}+{n-1\choose s-1}+\cdots+{n-1\choose s-2r+1}. \] This result strengthens the upper bound of Alon, Babai and Suzuki's conjecture when $n\geq 2s-2$.

preprint2016arXiv

A genus-4 topological recursion relation for Gromov-Witten invariants

In this paper, we give a new genus-4 topological recursion relation for Gromov-Witten invariants of compact symplectic manifolds via Pixton's relations on the moduli space of curves. As an application, we prove Pixton's relations imply a known topological recursion relation on $\bar{\mathcal{M}}_{g,1}$ for genus $g\leq4$.

preprint2016arXiv

A multi-task learning model for malware classification with useful file access pattern from API call sequence

Based on API call sequences, semantic-aware and machine learning (ML) based malware classifiers can be built for malware detection or classification. Previous works concentrate on crafting and extracting various features from malware binaries, disassembled binaries or API calls via static or dynamic analysis and resorting to ML to build classifiers. However, they tend to involve too much feature engineering and fail to provide interpretability. We solve these two problems with the recent advances in deep learning: 1) RNN-based autoencoders (RNN-AEs) can automatically learn low-dimensional representation of a malware from its raw API call sequence. 2) Multiple decoders can be trained under different supervisions to give more information, other than the class or family label of a malware. Inspired by the works of document classification and automatic sentence summarization, each API call sequence can be regarded as a sentence. In this paper, we make the first attempt to build a multi-task malware learning model based on API call sequences. The model consists of two decoders, one for malware classification and one for $\emph{file access pattern}$ (FAP) generation given the API call sequence of a malware. We base our model on the general seq2seq framework. Experiments show that our model can give competitive classification results as well as insightful FAP information.

preprint2016arXiv

A semidefinite programming upper bound of quantum capacity

Recently the power of positive partial transpose preserving (PPTp) and no-signalling (NS) codes in quantum communication has been studied. We continue with this line of research and show that the NS/PPTp/NS$\cap$PPTp codes assisted zero-error quantum capacity depends only on the non-commutative bipartite graph of the channel and the one-shot case can be computed efficiently by semidefinite programming (SDP). As an example, the activated PPTp codes assisted zero-error quantum capacity is carefully studied. We then present a general SDP upper bound $Q_Γ$ of quantum capacity and show it is always smaller than or equal to the "Partial transposition bound" introduced by Holevo and Werner, and the inequality could be strict. This upper bound is found to be additive, and thus is an upper bound of the potential PPTp assisted quantum capacity as well. We further demonstrate that $Q_Γ$ is strictly better than several previously known upper bounds for an explicit class of quantum channels. Finally, we show that $Q_Γ$ can be used to bound the super-activation of quantum capacity.

preprint2016arXiv

Accelerating Data Regeneration for Distributed Storage Systems with Heterogeneous Link Capacities

Distributed storage systems provide large-scale reliable data storage services by spreading redundancy across a large group of storage nodes. In such a large system, node failures take place on a regular basis. When a storage node breaks down, a replacement node is expected to regenerate the redundant data as soon as possible in order to maintain the same level of redundancy. Previous results have been mainly focused on the minimization of network traffic in regeneration. However, in practical networks, where link capacities vary in a wide range, minimizing network traffic does not always yield the minimum regeneration time. In this paper, we investigate two approaches to the problem of minimizing regeneration time in networks with heterogeneous link capacities. The first approach is to download different amounts of repair data from the helping nodes according to the link capacities. The second approach generalizes the conventional star-structured regeneration topology to tree-structured topologies so that we can utilize the links between helping nodes with bypassing low-capacity links. Simulation results show that the flexible tree-structured regeneration scheme that combines the advantages of both approaches can achieve a substantial reduction in the regeneration time.

preprint2016arXiv

Benchmarking of dynamically corrected gates for the exchange-only spin qubit in $1/f$ noise environment

We study theoretically the responses of the dynamically corrected gates to time-dependent noises in the exchange-only spin qubit system. We consider $1/f$ noises having spectra proportional to $1/ω^α$, where the exponent $α$ indicates the strength of correlation within the noise. The quantum gate errors due to noises are extracted from a numerical simulation of Randomized Benchmarking, and are compared between the application of uncorrected operations and that of dynamically corrected gates robust against the hyperfine noise. We have found that for $α\gtrsim1.5$, the dynamically corrected gates offer considerable reduction in the gate error and such reduction is approximately two orders of magnitude for the experimentally relevant noise exponent. On the other hand, no improvement of the gate fidelity is provided for $α\lesssim1.5$. This critical value $α_c\approx1.5$ is comparatively larger than that for the cases for the singlet-triplet qubits. The filter transfer functions corresponding to the dynamically corrected gates are also computed and compared to those derived from uncorrected pulses. Our results suggest that the dynamically corrected gates are useful measures to suppress the hyperfine noise when operating the exchange-only qubits.

preprint2016arXiv

Context-Free Path Queries on RDF Graphs

Navigational graph queries are an important class of queries that canextract implicit binary relations over the nodes of input graphs. Most of the navigational query languages used in the RDF community, e.g. property paths in W3C SPARQL 1.1 and nested regular expressions in nSPARQL, are based on the regular expressions. It is known that regular expressions have limited expressivity; for instance, some natural queries, like same generation-queries, are not expressible with regular expressions. To overcome this limitation, in this paper, we present cfSPARQL, an extension of SPARQL query language equipped with context-free grammars. The cfSPARQL language is strictly more expressive than property paths and nested expressions. The additional expressivity can be used for modelling graph similarities, graph summarization and ontology alignment. Despite the increasing expressivity, we show that cfSPARQL still enjoys a low computational complexity and can be evaluated efficiently.

preprint2016arXiv

Detection of Lyman-Alpha Emission From a Triple Imaged z=6.85 Galaxy Behind MACS J2129.4-0741

We report the detection of Ly$α$ emission at $\sim9538$Å in the Keck/DEIMOS and \HST WFC3 G102 grism data from a triply-imaged galaxy at $z=6.846\pm0.001$ behind galaxy cluster MACS J2129.4$-$0741. Combining the emission line wavelength with broadband photometry, line ratio upper limits, and lens modeling, we rule out the scenario that this emission line is \oii at $z=1.57$. After accounting for magnification, we calculate the weighted average of the intrinsic Ly$α$ luminosity to be $\sim1.3\times10^{42}~\mathrm{erg}~\mathrm{s}^{-1}$ and Ly$α$ equivalent width to be $74\pm15$Å. Its intrinsic UV absolute magnitude at 1600Å is $-18.6\pm0.2$ mag and stellar mass $(1.5\pm0.3)\times10^{7}~M_{\odot}$, making it one of the faintest (intrinsic $L_{UV}\sim0.14~L_{UV}^*$) galaxies with Ly$α$ detection at $z\sim7$ to date. Its stellar mass is in the typical range for the galaxies thought to dominate the reionization photon budget at $z\gtrsim7$; the inferred Ly$α$ escape fraction is high ($\gtrsim 10$\%), which could be common for sub-$L^*$ $z\gtrsim7$ galaxies with Ly$α$ emission. This galaxy offers a glimpse of the galaxy population that is thought to drive reionization, and it shows that gravitational lensing is an important avenue to probe the sub-$L^*$ galaxy population.

preprint2016arXiv

Direct Meissner Effect Observation of Superconductivity in Compressed H2S

Recently, an extremely high superconducting temperature (Tc) of ~200 K has been reported in the sulfur hydride system above 100 GPa. This result is supported by theoretical predictions and verified experimentally. The crystal structure of the superconducting phase was also identified experimentally, confirming the theoretically predicted structure as well as a decomposition mechanism from H2S to H3S+S. Even though nuclear resonant scattering has been successfully used to provide magnetic evidence for a superconducting state, a direct measurement of the important Meissner effect is still lacking. Here we report in situ alternating-current magnetic susceptibility measurements on compressed H2S under high pressures. It is shown that superconductivity suddenly appears at 117 GPa and that Tc reaches 183 K at 149 GPa before decreasing monotonically with a further increase in pressure. This evolution agrees with both theoretical calculations and earlier experimental measurements. The idea of conventional high temperature superconductivity in hydrogen-dominant compounds has thus been realized in the sulfur hydride system under hydrostatic pressure, opening further exciting perspectives for possibly realizing room temperature superconductivity in hydrogen-based compounds.

preprint2016arXiv

Efficient Approximation of Well-Designed SPARQL Queries

Query response time often influences user experience in the real world. However, it possibly takes more time to answer a query with its all exact solutions, especially when it contains the OPT operations since the OPT operation is the least conventional operator in SPARQL. So it becomes essential to make a trade-off between the query response time and the accuracy of their solutions. In this paper, based on the depth of the OPT operation occurring in a query, we propose an approach to obtain its all approximate queries with less depth of the OPT operation. This paper mainly discusses those queries with well-designed patterns since the OPT operation in a well-designed pattern is really "optional". Firstly, we transform a well-designed pattern in OPT normal form into a well-designed tree, whose inner nodes are labeled by OPT operation and leaf nodes are labeled by patterns containing other operations such as the AND operation and the FILTER operation. Secondly, based on this well-designed tree, we remove "optional" well-designed subtrees with less depth of the OPT operation and then obtain approximate queries with different depths of the OPT operation. Finally, we evaluate the approximate query efficiency with the degree of approximation.

preprint2016arXiv

Fast control of semiconductor qubits beyond the rotating-wave approximation

We present a theoretical study of single-qubit operations by oscillatory fields on various semiconductor platforms. We explicitly show how to perform faster gate operations by going beyond the universally-used rotating wave approximation (RWA) regime, while using only two sinusoidal pulses. No complicated pulse shaping or optimal control sequences are required. We first show for specific published experiments how much error is currently incurred by implementing pulses designed using standard RWA. We then show that an even modest increase in gate speed would cause problems in using RWA for gate design in the singlet-triplet (ST) and resonant-exchange (RX) qubits. We discuss the extent to which analytically keeping higher orders in the perturbation theory would address the problem. More strikingly, we give a new prescription for gating with strong coupling far beyond the RWA regime. We perform numerical calculations for the phases and the durations of two consecutive pulses to realize the key Hadamard and $\fracπ{8}$ gates with coupling strengths up to several times the qubit splitting. Working in this manifestly non-RWA regime, the gate operation speeds up by two to three orders of magnitude.

preprint2016arXiv

Improved Semidefinite Programming Upper Bound on Distillable Entanglement

A new additive and semidefinite programming (SDP) computable entanglement measure is introduced to upper bound the amount of distillable entanglement in bipartite quantum states by operations completely preserving the positivity of partial transpose (PPT). This quantity is always smaller than or equal to the logarithmic negativity, the previously best known SDP bound on distillable entanglement, and the inequality is strict in general. Furthermore, a succinct SDP characterization of the one-copy PPT deterministic distillable entanglement for any given state is also obtained, which provides a simple but useful lower bound on the PPT distillable entanglement. Remarkably, there is a genuinely mixed state of which both bounds coincide with the distillable entanglement while being strictly less than the logarithmic negativity.

preprint2016arXiv

Invertible binary matrix with maximum number of $2$-by-$2$ invertible submatrices

The problem is related to all-or-nothing transforms (AONT) suggested by Rivest as a preprocessing for encrypting data with a block cipher. Since then there have been various applications of AONTs in cryptography and security. D'Arco, Esfahani and Stinson posed the problem on the constructions of binary matrices for which the desired properties of an AONT hold with the maximum probability. That is, for given integers $t\le s$, what is the maximum number of $t$-by-$t$ invertible submatrices in a binary matrix of order $s$? For the case $t=2$, let $R_2(s)$ denote the maximal proportion of 2-by-2 invertible submatrices. D'Arco, Esfahani and Stinson conjectured that the limit is between 0.492 and 0.625. We completely solve the case $t=2$ by showing that $\lim_{s\rightarrow\infty}R_2(s)=0.5$.

preprint2016arXiv

Method for observing robust and tunable phonon blockade in a nanomechanical resonator coupled to a charge qubit

Phonon blockade is a purely quantum phenomenon, analogous to Coulomb and photon blockades, in which a single phonon in an anharmonic mechanical resonator can impede the excitation of a second phonon. We propose an experimental method to realize phonon blockade in a driven harmonic nanomechanical resonator coupled to a qubit, where the coupling is proportional to the second-order nonlinear susceptibility $χ^{(2)}$. This is in contrast to the standard realizations of phonon and photon blockade effects in Kerr-type $χ^{(3)}$ nonlinear systems. The nonlinear coupling strength can be adjusted conveniently by changing the coherent drive field.As an example, we apply this model to predict and describe phonon blockade in a nanomechanical resonator coupled to a Cooper-pair box (i.e., a charge qubit) with a linear longitudinal coupling. By obtaining the solutions of the steady state for this composite system, we give the conditions forobserving strong antibunching and sub-Poissonian phonon-number statistics in this induced second-order nonlinear system. Besides using the qubit to produce phonon blockade states, the qubit itself can also be employed to detect blockade effects by measuring its states. Numerical simulations indicate that the robustness of the phonon blockade, and the sensitivity of detecting it, will benefit from this strong induced nonlinear coupling.

preprint2016arXiv

Multi-output microwave single-photon source using superconducting circuits with longitudinal and transverse couplings

Single-photon devices at microwave frequencies are important for applications in quantum information processing and communication in the microwave regime. In this work, we describe a proposal of a multi-output single-photon device. We consider two superconducting resonators coupled to a gap-tunable qubit via both its longitudinal and transverse degrees of freedom. Thus, this qubit-resonator coupling differs from the coupling in standard circuit quantum-electrodynamic systems described by the Jaynes-Cummings model. We demonstrate that an effective quadratic coupling between one of the normal modes and the qubit can be induced, and this induced second-order nonlinearity is much larger than that for conventional Kerr-type systems exhibiting photon blockade. Assuming that a coupled normal mode is resonantly driven, we observe that the output fields from the resonators exhibit strong sub-Poissonian photon-number statistics and photon antibunching. Contrary to previous studies on resonant photon blockade, the first-excited state of our device is a pure single-photon Fock state rather than a polariton state, i.e., a highly hybridized qubit-photon state. In addition, it is found that the optical state truncation caused by the strong qubit-induced nonlinearity can lead to an entanglement between the two resonators, even in their steady state under the Markov approximation.

preprint2016arXiv

New bounds of permutation codes under Hamming metric and Kendall's $τ$-metric

Permutation codes are widely studied objects due to their numerous applications in various areas, such as power line communications, block ciphers, and the rank modulation scheme for flash memories. Several kinds of metrics are considered for permutation codes according to their specific applications. This paper concerns some improvements on the bounds of permutation codes under Hamming metric and Kendall's $τ$-metric respectively, using mainly a graph coloring approach. Specifically, under Hamming metric, we improve the Gilbert-Varshamov bound asymptotically by a factor $n$, when the minimum Hamming distance $d$ is fixed and the code length $n$ goes to infinity. Under Kendall's $τ$-metric, we narrow the gap between the known lower bounds and upper bounds. Besides, we also obtain some sporadic results under Kendall's $τ$-metric for small parameters.

preprint2016arXiv

Noise filtering of composite pulses for singlet-triplet qubits

Semiconductor quantum dot spin qubits are promising candidates for quantum computing. In these systems, the dynamically corrected gates offer considerable reduction of gate errors and are therefore of great interest both theoretically and experimentally. They are, however, designed under the static-noise model and may be considered as low-frequency filters. In this work, we perform a comprehensive theoretical study of the response of a type of dynamically corrected gates, namely the {\sc supcode} for singlet-triplet qubits, to realistic $1/f$ noises with frequency spectra $1/ω^α$. Through randomized benchmarking, we have found that {\sc supcode} offers improvement of the gate fidelity for $α\gtrsim1$ and the improvement becomes exponentially more pronounced with the increase of the noise exponent in the range $1\lesssimα\leq3$ studied. On the other hand, for small $α$, {\sc supcode} will not offer any improvement. The $δJ$-{\sc supcode}, specifically designed for systems where the nuclear noise is absent, is found to offer additional error reduction than the full {\sc supcode} for charge noises. The computed filter transfer functions of the {\sc supcode} gates are also presented.

preprint2016arXiv

On Multiplicative Multitask Feature Learning

We investigate a general framework of multiplicative multitask feature learning which decomposes each task's model parameters into a multiplication of two components. One of the components is used across all tasks and the other component is task-specific. Several previous methods have been proposed as special cases of our framework. We study the theoretical properties of this framework when different regularization conditions are applied to the two decomposed components. We prove that this framework is mathematically equivalent to the widely used multitask feature learning methods that are based on a joint regularization of all model parameters, but with a more general form of regularizers. Further, an analytical formula is derived for the across-task component as related to the task-specific component for all these regularizers, leading to a better understanding of the shrinkage effect. Study of this framework motivates new multitask learning algorithms. We propose two new learning formulations by varying the parameters in the proposed framework. Empirical studies have revealed the relative advantages of the two new formulations by comparing with the state of the art, which provides instructive insights into the feature learning problem with multiple tasks.

preprint2016arXiv

On private information retrieval array codes

Given a database, the private information retrieval (PIR) protocol allows a user to make queries to several servers and retrieve a certain item of the database via the feedbacks, without revealing the privacy of the specific item to any single server. Classical models of PIR protocols require that each server stores a whole copy of the database. Recently new PIR models are proposed with coding techniques arising from distributed storage system. In these new models each server only stores a fraction $1/s$ of the whole database, where $s>1$ is a given rational number. PIR array codes are recently proposed by Fazeli, Vardy and Yaakobi to characterize the new models. Consider a PIR array code with $m$ servers and the $k$-PIR property (which indicates that these $m$ servers may emulate any efficient $k$-PIR protocol). The central problem is to design PIR array codes with optimal rate $k/m$. Our contribution to this problem is three-fold. First, for the case $1<s\le 2$, although PIR array codes with optimal rate have been constructed recently by Blackburn and Etzion, the number of servers in their construction is impractically large. We determine the minimum number of servers admitting the existence of a PIR array code with optimal rate for a certain range of parameters. Second, for the case $s>2$, we derive a new upper bound on the rate of a PIR array code. Finally, for the case $s>2$, we analyze a new construction by Blackburn and Etzion and show that its rate is better than all the other existing constructions.

preprint2016arXiv

On the quantum no-signalling assisted zero-error classical simulation cost of non-commutative bipartite graphs

Using one channel to simulate another exactly with the aid of quantum no-signalling correlations has been studied recently. The one-shot no-signalling assisted classical zero-error simulation cost of non-commutative bipartite graphs has been formulated as semidefinite programms [Duan and Winter, IEEE Trans. Inf. Theory 62, 891 (2016)]. Before our work, it was unknown whether the one-shot (or asymptotic) no-signalling assisted zero-error classical simulation cost for general non-commutative graphs is multiplicative (resp. additive) or not. In this paper we address these issues and give a general sufficient condition for the multiplicativity of the one-shot simulation cost and the additivity of the asymptotic simulation cost of non-commutative bipartite graphs, which include all known cases such as extremal graphs and classical-quantum graphs. Applying this condition, we exhibit a large class of so-called \emph{cheapest-full-rank graphs} whose asymptotic zero-error simulation cost is given by the one-shot simulation cost. Finally, we disprove the multiplicativity of one-shot simulation cost by explicitly constructing a special class of qubit-qutrit non-commutative bipartite graphs.

preprint2016arXiv

On the Statistical Analysis of Practical SPARQL Queries

In this paper, we analyze some basic features of SPARQL queries coming from our practical world in a statistical way. These features include three statistic features such as the occurrence frequency of triple patterns, fragments, well-designed patterns and four semantic features such as monotonicity, non-monotonicity, weak monotonicity (old solutions are still served as parts of new solutions when some new triples are added) and satisfiability. All these features contribute to characterize SPARQL queries in different dimensions. We hope that this statistical analysis would provide some useful observation for researchers and engineers who are interested in what practical SPARQL queries look like, so that they could develop some practical heuristics for processing SPARQL queries and build SPARQL query processing engines and benchmarks. Besides, they can narrow the scope of their problems by avoiding those cases that do possibly not happen in our practical world.

preprint2016arXiv

PIWD: A Plugin-based Framework for Well-Designed SPARQL

In the real world datasets (e.g.,DBpedia query log), queries built on well-designed patterns containing only AND and OPT operators (for short, WDAO-patterns) account for a large proportion among all SPARQL queries. In this paper, we present a plugin-based framework for all SELECT queries built on WDAO-patterns, named PIWD. The framework is based on a parse tree called \emph{well-designed AND-OPT tree} (for short, WDAO-tree) whose leaves are basic graph patterns (BGP) and inner nodes are the OPT operators. We prove that for any WDAO-pattern, its parse tree can be equivalently transformed into a WDAO-tree. Based on the proposed framework, we can employ any query engine to evaluate BGP for evaluating queries built on WDAO-patterns in a convenient way. Theoretically, we can reduce the query evaluation of WDAO-patterns to subgraph homomorphism as well as BGP since the query evaluation of BGP is equivalent to subgraph homomorphism. Finally, our preliminary experiments on gStore and RDF-3X show that PIWD can answer all queries built on WDAO-patterns effectively and efficiently.

preprint2016arXiv

Robust Learning with Kernel Mean p-Power Error Loss

Correntropy is a second order statistical measure in kernel space, which has been successfully applied in robust learning and signal processing. In this paper, we define a nonsecond order statistical measure in kernel space, called the kernel mean-p power error (KMPE), including the correntropic loss (CLoss) as a special case. Some basic properties of KMPE are presented. In particular, we apply the KMPE to extreme learning machine (ELM) and principal component analysis (PCA), and develop two robust learning algorithms, namely ELM-KMPE and PCA-KMPE. Experimental results on synthetic and benchmark data show that the developed algorithms can achieve consistently better performance when compared with some existing methods.

preprint2016arXiv

RORS: Enhanced Rule-based OWL Reasoning on Spark

The rule-based OWL reasoning is to compute the deductive closure of an ontology by applying RDF/RDFS and OWL entailment rules. The performance of the rule-based OWL reasoning is often sensitive to the rule execution order. In this paper, we present an approach to enhancing the performance of the rule-based OWL reasoning on Spark based on a locally optimal executable strategy. Firstly, we divide all rules (27 in total) into four main classes, namely, SPO rules (5 rules), type rules (7 rules), sameAs rules (7 rules), and schema rules (8 rules) since, as we investigated, those triples corresponding to the first three classes of rules are overwhelming (e.g., over 99% in the LUBM dataset) in our practical world. Secondly, based on the interdependence among those entailment rules in each class, we pick out an optimal rule executable order of each class and then combine them into a new rule execution order of all rules. Finally, we implement the new rule execution order on Spark in a prototype called RORS. The experimental results show that the running time of RORS is improved by about 30% as compared to Kim & Park's algorithm (2015) using the LUBM200 (27.6 million triples).

preprint2016arXiv

Statistical Decoupling of Lagrangian Fluid Parcel in Newtonian Cosmology

The Lagrangian dynamics of a single fluid element within a self-gravitational matter field is intrinsically non-local due to the presence of the tidal force. This complicates the theoretical investigation of the non-linear evolution of various cosmic objects, e.g. dark matter halos, in the context of Lagrangian fluid dynamics, since a fluid parcel with given initial density and shape may evolve differently depending on their environments. In this paper, we provide a statistical solution that could decouple this environmental dependence. After deriving the probability distribution evolution equation of the matter field, our method produces a set of closed ordinary differential equations whose solution is uniquely determined by the initial condition of the fluid element. Mathematically, it corresponds to the projected characteristic curve of the transport equation of the density-weighted probability density function (PDF). Consequently it is guaranteed that the one-point PDF would be preserved by evolving these local, yet non-linear, curves with the same set of initial data as the real system. Physically, these trajectories describe the mean evolution averaged over all environments by substituting the tidal tensor with its conditional average. For Gaussian distributed dynamical variables, this mean tidal tensor is simply proportional to the velocity shear tensor, and the dynamical system would recover the prediction of Zel'dovich approximation (ZA) with the further assumption of the linearized continuity equation. For Weakly non-Gaussian field, the averaged tidal tensor could be expanded perturbatively as a function of all relevant dynamical variables whose coefficients are determined by the statistics of the field.

preprint2016arXiv

Stochastic Online Control for Energy-Harvesting Wireless Networks with Battery Imperfections

In energy harvesting (EH) network, the energy storage devices (i.e., batteries) are usually not perfect. In this paper, we consider a practical battery model with finite battery capacity, energy (dis-)charging loss, and energy dissipation. Taking into account such battery imperfections, we rely on the Lyapunov optimization technique to develop a stochastic online control scheme that aims to maximize the utility of data rates for EH multi-hop wireless networks. It is established that the proposed algorithm can provide a feasible and efficient data admission, power allocation, routing and scheduling solution, without requiring any statistical knowledge of the stochastic channel, data-traffic, and EH processes. Numerical results demonstrate the merit of the proposed scheme.

preprint2016arXiv

The Grism Lens-Amplified Survey from Space (GLASS). VI. Comparing the Mass and Light in MACSJ0416.1-2403 using Frontier Field imaging and GLASS spectroscopy

We present a strong and weak gravitational lens model of the galaxy cluster MACSJ0416.1-2403, constrained using spectroscopy from the Grism Lens-Amplified Survey from Space (GLASS) and Hubble Frontier Fields (HFF) imaging data. We search for emission lines in known multiply imaged sources in the GLASS spectra, obtaining secure spectroscopic redshifts of 31 multiple images belonging to 16 distinct source galaxies. The GLASS spectra provide the first spectroscopic measurements for 6 of the source galaxies. The weak lensing signal is acquired from 884 galaxies in the F606W HFF image. By combining the weak lensing constraints with 15 multiple image systems with spectroscopic redshifts and 9 multiple image systems with photometric redshifts, we reconstruct the gravitational potential of the cluster on an adaptive grid. The resulting total mass density map is compared with a stellar mass density map obtained from the deep Spitzer Frontier Fields imaging data to study the relative distribution of stellar and total mass in the cluster. We find that the projected stellar mass to total mass ratio, $f_{\star}$, varies considerably with the stellar surface mass density. The mean projected stellar mass to total mass ratio is $\langle f_{\star} \rangle= 0.009 \pm 0.003 $ (stat.), but with a systematic error as large as $0.004-0.005$, dominated by the choice of the IMF. We find agreement with several recent measurements of $f_{\star}$ in massive cluster environments. The lensing maps of convergence, shear, and magnification are made available to the broader community in the standard HFF format.

preprint2016arXiv

Two-Scale Stochastic Control for Multipoint Communication Systems with Renewables

Increasing threats of global warming and climate changes call for an energy-efficient and sustainable design of future wireless communication systems. To this end, a novel two-scale stochastic control framework is put forth for smart-grid powered coordinated multi-point (CoMP) systems. Taking into account renewable energy sources (RES), dynamic pricing, two-way energy trading facilities and imperfect energy storage devices, the energy management task is formulated as an infinite-horizon optimization problem minimizing the time-average energy transaction cost, subject to the users' quality of service (QoS) requirements. Leveraging the Lyapunov optimization approach as well as the stochastic subgradient method, a two-scale online control (TS-OC) approach is developed for the resultant smart-grid powered CoMP systems. Using only historical data, the proposed TS-OC makes online control decisions at two timescales, and features a provably feasible and asymptotically near-optimal solution. Numerical tests further corroborate the theoretical analysis, and demonstrate the merits of the proposed approach.

preprint2016arXiv

Virtualizing System and Ordinary Services in Windows-based OS-Level Virtual Machines

OS-level virtualization incurs smaller start-up and run-time overhead than HAL-based virtualization and thus forms an important building block for developing fault-tolerant and intrusion-tolerant applications. A complete implementation of OS-level virtualization on the Windows platform requires virtualization of Windows services, such as system services like the Remote Procedure Call Server Service (RPCSS), because they are essentially extensions of the kernel. As Windows system services work very differently from their counterparts on UNIX-style OS, i.e., daemons, and many of their implementation details are proprietary, virtualizing Windows system services turned out to be the most challenging technical barrier for OS-level virtualization for the Windows platform. In this paper, we describe a general technique to virtualize Windows services, and demonstrate its effectiveness by applying it to successfully virtualize a set of important Windows system services and ordinary services on different versions of Windows OS, including RPCSS, DcomLaunch, IIS service group, Tlntsvr, MySQL, Apache2.2, CiSvc, ImapiService, etc.

preprint2015arXiv

Analyses of microstructural and elastic properties of porous SOFC cathodes based on focused ion beam tomography

Mechanical properties of porous SOFC electrodes are largely determined by their microstructures. Measurements of the elastic properties and microstructural parameters can be achieved by modelling of the digitally reconstructed 3D volumes based on the real electrode microstructures. However, the reliability of such measurements is greatly dependent on the processing of raw images acquired for reconstruction. In this work, the actual microstructures of La0.6Sr0.4Co0.2Fe0.8O3-d (LSCF) cathodes sintered at an elevated temperature were reconstructed based on dual-beam FIB/SEM tomography. Key microstructural and elastic parameters were estimated and correlated. Analyses of their sensitivity to the grayscale threshold value applied in the image segmentation were performed. The important microstructural parameters included porosity, tortuosity, specific surface area, particle and pore size distributions, and inter-particle neck size distribution, which may have varying extent of effect on the elastic properties simulated from the microstructures using FEM. Results showed that different threshold value range would result in different degree of sensitivity for a specific parameter. The estimated porosity and tortuosity were more sensitive than surface area to volume ratio. Pore and neck size were found to be less sensitive than particle size. Results also showed that the modulus was essentially sensitive to the porosity which was largely controlled by the threshold value.

preprint2015arXiv

Crack Formation in Ceramic Films Used in Solid Oxide Fuel Cells

The manufacture of solid oxide fuel cells (SOFCs) involves fabrication of a multilayer ceramic structure, for which constrained sintering is a key processing step in many cases. Defects are often observed in the sintered structure, but their formation during sintering is not well understood. In this work, various ceramic films were fabricated by screen printing and a variety of defects observed. Some films showed mud-cracking defects, whereas others presented distributed large pores. Mud cracking defects were found to originate from a network of fine cracks present in the green film and formed during drying and binder burn-out. Control of these early stages is essential for producing crack-free films. In order to investigate how defects evolve during sintering, artificial cracks were introduced in the green films using indentation. It was observed that crack opening always increased during constrained sintering. In contrast, similar initial cracks could be closed and healed during co-sintering.

preprint2015arXiv

Energy Spectral Property in an Isolated CME-driven Shock

Observations from multiple spacecraft show that there are energy spectral "breaks" at 1-10MeV in some large CME-driven shocks. However, numerical models can hardly simulate this property due to high computational expense. The present paper focuses on analyzing these energy spectral "breaks" by Monte Carlo particle simulations of an isolated CME-driven shock. Taking the Dec 14 2006 CME-driven shock as an example, we investigate the formation of this energy spectral property. For this purpose, we apply different values for the scattering time in our isolated shock model to obtain the highest energy "tails", which can potentially exceed the "break" energy range. However, we have not found the highest energy "tails" beyond the "break" energy range, but instead find that the highest energy "tails" reach saturation near the range of energy at 5MeV. So, we believe that there exists an energy spectral "cut off" in an isolated shock. If there is no interaction with another shock, there would not be formation of the energy spectral "break" property.

preprint2015arXiv

Energy-Efficient Transmission Schedule for Delay-Limited Bursty Data Arrivals under Non-Ideal Circuit Power Consumption

This paper develops a novel approach to obtaining energy-efficient transmission schedules for delay-limited bursty data arrivals under non-ideal circuit power consumption. Assuming a-prior knowledge of packet arrivals, deadlines and channel realizations, we show that the problem can be formulated as a convex program. For both time-invariant and time-varying fading channels, it is revealed that the optimal transmission between any two consecutive channel or data state changing instants, termed epoch, can only take one of the three strategies: (i) no transmission, (ii) transmission with an energy-efficiency (EE) maximizing rate over part of the epoch, or (iii) transmission with a rate greater than the EE-maximizing rate over the whole epoch. Based on this specific structure, efficient algorithms are then developed to find the optimal policies that minimize the total energy consumption with a low computational complexity. The proposed approach can provide the optimal benchmarks for practical schemes designed for transmissions of delay-limited data arrivals, and can be employed to develop efficient online scheduling schemes which require only causal knowledge of data arrivals and deadline requirements.

preprint2015arXiv

Formation of Rotational Discontinuities in Compressive three-dimensional MHD Turbulence

Measurements of solar wind turbulence reveal the ubiquity of discontinuities. In this study, we investigate how the discontinuities, especially rotational discontinuities (RDs), are formed in magnetohydrodynamic (MHD) turbulence. In a simulation of the decaying compressive three-dimensional (3-D) MHD turbulence with an imposed uniform background magnetic field, we detect RDs with sharp field rotations and little variations of magnetic field intensity as well as mass density. At the same time, in the de Hoffman-Teller (HT) frame, the plasma velocity is nearly in agreement with the Alfvén speed, and is field-aligned on both sides of the discontinuity. We take one of the identified RDs to analyze in details its 3-D structure and temporal evolution. By checking the magnetic field and plasma parameters, we find that the identified RD evolves from the steepening of the Alfvén wave with moderate amplitude, and that steepening is caused by the nonuniformity of the Alfvén speed in the ambient turbulence.

preprint2015arXiv

Illuminating a Dark Lens : A Type Ia Supernova Magnified by the Frontier Fields Galaxy Cluster Abell 2744

SN HFF14Tom is a Type Ia Supernova (SN) discovered at z = 1.3457 +- 0.0001 behind the galaxy cluster Abell 2744 (z = 0.308). In a cosmology-independent analysis, we find that HFF14Tom is 0.77 +- 0.15 magnitudes brighter than unlensed Type Ia SNe at similar redshift, implying a lensing magnification of mu_obs = 2.03 +- 0.29. This observed magnification provides a rare opportunity for a direct empirical test of galaxy cluster lens models. Here we test 17 lens models, 13 of which were generated before the SN magnification was known, qualifying as pure "blind tests". The models are collectively fairly accurate: 8 of the models deliver median magnifications that are consistent with the measured mu to within 1-sigma. However, there is a subtle systematic bias: the significant disagreements all involve models overpredicting the magnification. We evaluate possible causes for this mild bias, and find no single physical or methodological explanation to account for it. We do find that model accuracy can be improved to some extent with stringent quality cuts on multiply-imaged systems, such as requiring that a large fraction have spectroscopic redshifts. In addition to testing model accuracies as we have done here, Type Ia SN magnifications could also be used as inputs for future lens models of Abell 2744 and other clusters, providing valuable constraints in regions where traditional strong- and weak-lensing information is unavailable.

preprint2015arXiv

Improving the gate fidelity of capacitively coupled spin qubits

Capacitively coupled semiconductor spin qubits hold promise as the building blocks of a scalable quantum computing architecture with long-range coupling between distant qubits. However, the two-qubit gate fidelities achieved in experiments to date have been severely limited by decoherence originating from charge noise and hyperfine interactions with nuclear spins, and are currently unacceptably low for any conceivable multi-qubit gate operations. Here, we present control protocols that implement two-qubit entangling gates while substantially suppressing errors due to both types of noise. These protocols are obtained by making simple modifications to control sequences already used in the laboratory and should thus be easy enough for immediate experimental realization. Together with existing control protocols for robust single-qubit gates, our results constitute an important step toward scalable quantum computation using spin qubits in semiconductor platforms.

preprint2015arXiv

Modulation instability and controllable rogue waves with multiple compression points for periodically modulated coupled Hirota equations

Based on modulation instability analysis and generalized Darboux transformation, we derive a hierarchy of rogue wave solutions for a variable-coefficients coupled Hirota equations. The explicit first-order rogue wave solution is presented, and the dark-bright and composite rogue waves with multiple compression points are shown by choosing sufficiently large periodic modulation amplitudes in the coefficients of the coupled equations. Also, the dark-bright and composite Peregrine combs are generated from the multiple compression points. For the second-order case, the darkbright three sisters, rogue wave quartets and sextets structures with one or more rogue waves involving multiple compression points are put forward, respectively. Furthermore, some wave characteristics such as the difference between light intensity and continuous wave background, and pulse energy evolution of the dark rogue wave solution features multiple compression points are discussed.

preprint2015arXiv

Nanoindentation of Porous Bulk and Thin Films of LSCF

In this paper we show how reliable measurements on porous ceramic films can be made by appropriate nanoindentation experiments and analysis. Room-temperature mechanical properties of the mixed-conducting perovskite material LSCF6428 were investigated by nanoindentation of porous bulk samples and porous films sintered at temperatures from 900-1200C. A spherical indenter was used so that the contact area was much greater than the scale of the porous microstructure. The elastic modulus of the bulk samples was found to increase from 33.8-174.3 GPa and hardness from 0.64-5.32 GPa as the porosity decreased from 45-5% after sintering at 900-1200C. Densification under the indenter was found to have little influence on the measured elastic modulus. The residual porosity in the dense sample was found to account for the discrepancy between the elastic moduli measured by indentation and by impulse excitation. Crack-free LSCF6428 films of acceptable surface roughness for indentation were also prepared by sintering at 900-1200C. Reliable measurements of the true properties of the films were obtained by data extrapolation provided that the ratio of indentation depth to film thickness was in the range 0.1 to 0.2. The elastic moduli of the films and bulk materials were approximately equal for a given porosity. The 3D microstructures of films before and after indentation were characterized using FIB-SEM tomography. Finite element modelling of the elastic deformation of the actual microstructures showed excellent agreement with the nanoindentation results.

preprint2015arXiv

New bounds and constructions for multiply constant-weight codes

Multiply constant-weight codes (MCWCs) were introduced recently to improve the reliability of certain physically unclonable function response. In this paper, the bounds of MCWCs and the constructions of optimal MCWCs are studied. Firstly, we derive three different types of upper bounds which improve the Johnson-type bounds given by Chee {\sl et al.} in some parameters. The asymptotic lower bound of MCWCs is also examined. Then we obtain the asymptotic existence of two classes of optimal MCWCs, which shows that the Johnson-type bounds for MCWCs with distances $2\sum_{i=1}^mw_i-2$ or $2mw-w$ are asymptotically exact. Finally, we construct a class of optimal MCWCs with total weight four and distance six by establishing the connection between such MCWCs and a new kind of combinatorial structures. As a consequence, the maximum sizes of MCWCs with total weight less than or equal to four are determined almost completely.

preprint2015arXiv

New Exact Quantization Condition for Toric Calabi-Yau Geometries

We propose a new exact quantization condition for a class of quantum mechanical systems derived from local toric Calabi-Yau three-folds. Our proposal includes all contributions to the energy spectrum which are non-perturbative in the Planck constant, and is much simpler than the available quantization condition in the literature. We check that our proposal is consistent with previous works and implies non-trivial relations among the topological Gopakumar-Vafa invariants of the toric Calabi-Yau geometries. Together with the recent developments, our proposal opens a new avenue in the long investigations at the interface of geometry, topology and quantum mechanics.

preprint2015arXiv

Occurrence Rates and Heating Effects of Tangential and Rotational Discontinuities as Obtained from Three-dimensional Simulation of Magnetohydrodynamic Turbulence

In solar wind, magnetohydrodynamic (MHD) discontinuities are ubiquitous and often found to be at the origin of turbulence intermittency. They may also play a key role in the turbulence dissipation and heating of the solar wind. The tangential (TD) and rotational (RD) discontinuities are the two most important types of discontinuities. Recently, the connection between turbulence intermittency and proton thermodynamics has been being investigated observationally. Here we present numerical results from three-dimensional MHD simulation with pressure anisotropy and define new methods to identify and to distinguish TDs and RDs. Three statistical results obtained about the relative occurrence rates and heating effects are highlighted: (1) RDs tend to take up the majority of the discontinuities along with time; (2) the thermal states embedding TDs tend to be associated with extreme plasma parameters or instabilities, while RDs do not; (3) TDs have a higher average T as well as perpendicular temperature $T_\perp$. The simulation shows that TDs and RDs evolve and contribute to solar wind heating differently. These results will inspire our understanding of the mechanisms that generate discontinuities and cause plasma heating.

preprint2015arXiv

On the Optimal Provider Selection for Repair in Distributed Storage System with Network Coding

In large-scale distributed storage systems (DSS), reliability is provided by redundancy spread over storage servers across the Internet. Network coding (NC) has been widely studied in DSS because it can improve the reliability with low repair time. To maintain reliability, an unavailable storage server should be firstly replaced by a new server, named new comer. Then, multiple storage servers, called providers, should be selected from surviving servers and send their coded data through the Internet to the new comer for regenerating the lost data. Therefore, in a large-scale DSS, provider selection and data routing during the regeneration phase have great impact on the performance of regeneration time. In this paper, we investigate a problem of optimal provider selection and data routing for minimizing the regeneration time in the DSS with NC. Specifically, we first define the problem in the DSS with NC. For the case that the providers are given, we model the problem as a mathematical programming. Based on the mathematical programming, we then formulate the optimal provider selection and data routing problem as an integer linear programming problem and develop an efficient near-optimal algorithm based on linear programming relaxation (BLP). Finally, extensive simulation experiments have been conducted, and the results show the effectiveness of the proposed algorithm.

preprint2015arXiv

Particles Acceleration in Converged Two Shocks

Observations show that there is a proton spectral "break" with E$_{break}$ at 1-10MeV in some large CME-driven shocks. Theoretical model usually attribute this phenomenon to a diffusive shock acceleration. However, the underlying physics of the shock acceleration still remains uncertain. Although previous numerical models can hardly predict this "break" due to either high computational expense or shortcomings of current models, the present paper focuses on simulating this energy spectrum in converged two shocks by Monte Carlo numerical method. Considering the Dec 13 2006 CME-driven shock interaction with an Earth bow shock, we examine whether the energy spectral "break" could occur on an interaction between two shocks. As result, we indeed obtain the maximum proton energy up to 10MeV, which is the premise to investigate the existence of the energy spectral "break". Unexpectedly, we further find a proton spectral "break" appears distinctly at the energy $\sim$5MeV.

preprint2015arXiv

Porous LSCF/Dense 3YSZ Interface Fracture Toughness Measured by Single Cantilever Beam Wedge Test

Sandwich specimens were prepared by firing a thin inter-layer of porous La0.6Sr0.4Co0.2Fe0.8O3-d (LSCF) to bond a thin tetragonal yttria-stabilised zirconia (YSZ) beam to a thick YSZ substrate. Fracture of the joint was evaluated by introducing a wedge between the two YSZ adherands so that the stored energy in the thin YSZ cantilever beam drives a stable crack in the adhesive bond and allows the critical energy release rate for crack extension (fracture toughness) to be measured. The crack path in most specimens showed a mixture of adhesive failure (at the YSZ-LSCF interface) and cohesive failure (within the LSCF). It was found that the extent of adhesive fracture increased with firing temperature and decreased with LSCF layer thickness. The adhesive failures were mainly at the interface between the LSCF and the thin YSZ beam and FEM modelling revealed that this is due to asymmetric stresses in the LSCF. Within the firing temperature range of 1000-1150C, the bonding fracture toughness appears to have a strong dependence on firing temperature. However, the intrinsic adhesive fracture toughness of the LSCF/YSZ interface was estimated to be 11 Jm2 and was not firing temperature dependent within the temperature range investigated.

preprint2015arXiv

Pushing towards the Limit of Sampling Rate: Adaptive Chasing Sampling

Measurement samples are often taken in various monitoring applications. To reduce the sensing cost, it is desirable to achieve better sensing quality while using fewer samples. Compressive Sensing (CS) technique finds its role when the signal to be sampled meets certain sparsity requirements. In this paper we investigate the possibility and basic techniques that could further reduce the number of samples involved in conventional CS theory by exploiting learning-based non-uniform adaptive sampling. Based on a typical signal sensing application, we illustrate and evaluate the performance of two of our algorithms, Individual Chasing and Centroid Chasing, for signals of different distribution features. Our proposed learning-based adaptive sampling schemes complement existing efforts in CS fields and do not depend on any specific signal reconstruction technique. Compared to conventional sparse sampling methods, the simulation results demonstrate that our algorithms allow $46\%$ less number of samples for accurate signal reconstruction and achieve up to $57\%$ smaller signal reconstruction error under the same noise condition.

preprint2015arXiv

Robust quantum control using smooth pulses and topological winding

The greatest challenge in achieving the high level of control needed for future technologies based on coherent quantum systems is the decoherence induced by the environment. Here, we present an analytical approach that yields explicit constraints on the driving field which are necessary and sufficient to ensure that the leading-order noise-induced errors in a qubit's evolution cancel exactly. We derive constraints for two of the most common types of noise that arise in qubits: slow fluctuations of the qubit energy splitting and fluctuations in the driving field itself. By theoretically recasting a phase in the qubit's wavefunction as a topological winding number, we can satisfy the noise-cancelation conditions by adjusting driving field parameters without altering the target state or quantum evolution. We demonstrate our method by constructing robust quantum gates for two types of spin qubit: phosphorous donors in silicon and nitrogen-vacancy centers in diamond.

preprint2015arXiv

Robust Smart-Grid Powered Cooperative Multipoint Systems

A framework is introduced to integrate renewable energy sources (RES) and dynamic pricing capabilities of the smart grid into beamforming designs for coordinated multi-point (CoMP) downlink communication systems. To this end, novel models are put forth to account for harvesting, storage of nondispatchable RES, time-varying energy pricing, as well as stochastic wireless channels. Building on these models, robust energy management and transmit-beamforming designs are developed to minimize the worst-case energy cost subject to the worst-case user QoS guarantees for the CoMP downlink. Leveraging pertinent tools, this task is formulated as a convex problem. A Lagrange dual based subgradient iteration is then employed to find the desired optimal energy-management strategy and transmit-beamforming vectors. Numerical results are provided to demonstrate the merits of the proposed robust designs.

preprint2015arXiv

Rule Optimization for Real-Time Query Service in Software-Defined Internet of Vehicles

Internet of Vehicles (IoV) has recently gained considerable attentions from both industry and research communities since the development of communication technology and smart city. However, a proprietary and closed way of operating hardwares in network equipments slows down the progress of new services deployment and extension in IoV. Moreover, the tightly coupled control and data planes in traditional networks significantly increase the complexity and cost of network management. By proposing a novel architecture, called Software-Defined Internet of Vehicles (SDIV), we adopt the software-defined network (SDN) architecture to address these problems by leveraging its separation of the control plane from the data plane and a uniform way to configure heterogeneous switches. However, the characteristics of IoV introduce the very challenges in rule installation due to the limited size of Flow Tables at OpenFlow-enabled switches which are the main component of SDN. It is necessary to build compact Flow Tables for the scalability of IoV. Accordingly, we develop a rule optimization approach for real-time query service in SDIV. Specifically, we separate wired data plane from wireless data plane and use multicast address in wireless data plane. Furthermore, we introduce a destination-driven model in wired data plane for reducing the number of rules at switches. Experiments show that our rule optimization strategy reduces the number of rules while keeping the performance of data transmission.

preprint2015arXiv

Sampling Online Social Networks via Heterogeneous Statistics

Most sampling techniques for online social networks (OSNs) are based on a particular sampling method on a single graph, which is referred to as a statistics. However, various realizing methods on different graphs could possibly be used in the same OSN, and they may lead to different sampling efficiencies, i.e., asymptotic variances. To utilize multiple statistics for accurate measurements, we formulate a mixture sampling problem, through which we construct a mixture unbiased estimator which minimizes asymptotic variance. Given fixed sampling budgets for different statistics, we derive the optimal weights to combine the individual estimators; given fixed total budget, we show that a greedy allocation towards the most efficient statistics is optimal. In practice, the sampling efficiencies of statistics can be quite different for various targets and are unknown before sampling. To solve this problem, we design a two-stage framework which adaptively spends a partial budget to test different statistics and allocates the remaining budget to the inferred best statistics. We show that our two-stage framework is a generalization of 1) randomly choosing a statistics and 2) evenly allocating the total budget among all available statistics, and our adaptive algorithm achieves higher efficiency than these benchmark strategies in theory and experiment.

preprint2015arXiv

SEARS: Space Efficient And Reliable Storage System in the Cloud

Today's cloud storage services must offer storage reliability and fast data retrieval for large amount of data without sacrificing storage cost. We present SEARS, a cloud-based storage system which integrates erasure coding and data deduplication to support efficient and reliable data storage with fast user response time. With proper association of data to storage server clusters, SEARS provides flexible mixing of different configurations, suitable for real-time and archival applications. Our prototype implementation of SEARS over Amazon EC2 shows that it outperforms existing storage systems in storage efficiency and file retrieval time. For 3 MB files, SEARS delivers retrieval time of $2.5$ s compared to $7$ s with existing systems.

preprint2015arXiv

Service Provisioning and Profit Maximization in Network-assisted Adaptive HTTP Streaming

Adaptive HTTP streaming with centralized consideration of multiple streams has gained increasing interest. It poses a special challenge that the interests of both content provider and network operator need to be deliberately balanced. More importantly, the adaptation strategy is required to be flexible enough to be ported to various systems that work under different network environments, QoE levels, and economic objectives. To address these challenges, we propose a Markov Decision Process (MDP) based network-assisted adaptation framework, wherein cost of buffering, significant playback variation, bandwidth management and income of playback are jointly investigated. We then demonstrate its promising service provisioning and maximal profit for a mobile network in which fair or differentiated service is required.

preprint2015arXiv

Some Improvements on Locally Repairable Codes

The locally repairable codes (LRCs) were introduced to correct erasures efficiently in distributed storage systems. LRCs are extensively studied recently. In this paper, we first deal with the open case remained in \cite{q} and derive an improved upper bound for the minimum distances of LRCs. We also give an explicit construction for LRCs attaining this bound. Secondly, we consider the constructions of LRCs with any locality and availability which have high code rate and minimum distance as large as possible. We give a graphical model for LRCs. By using the deep results from graph theory, we construct a family of LRCs with any locality $r$ and availability $2$ with code rate $\frac{r-1}{r+1}$ and optimal minimum distance $O(\log n)$ where $n$ is the length of the code.

preprint2015arXiv

Spectral Anisotropy of Elsässer Variables in Two Dimensional Wave-vector Space as Observed in the Fast Solar Wind Turbulence

Intensive studies have been conducted to understand the anisotropy of solar wind turbulence. However, the anisotropy of Elsässer variables ($\textbf{Z}^\pm$) in 2D wave-vector space has yet to be investigated. Here we first verify the transformation based on the projection-slice theorem between the power spectral density PSD$_{2D}(k_\parallel,k_\perp )$ and the spatial correlation function CF$_{2D} (r_\parallel,r_\perp )$. Based on the application of the transformation to the magnetic field and the particle measurements from the WIND spacecraft, we investigate the spectral anisotropy of Elsässer variables ($\textbf{Z}^\pm$), and the distribution of residual energy E$_{R}$, Alfvén ratio R$_{A}$ and Elsässer ratio R$_{E}$ in the $(k_\parallel,k_\perp)$ space. The spectra PSD$_{2D}(k_\parallel,k_\perp )$ of $\textbf{B}$, $\textbf{V}$, and $\textbf{Z}_{major}$ (the larger of $\textbf{Z}^\pm$) show a similar pattern that PSD$_{2D}(k_\parallel,k_\perp )$ is mainly distributed along a ridge inclined toward the $k_\perp$ axis. This is probably the signature of the oblique Alfvénic fluctuations propagating outwardly. Unlike those of $\textbf{B}$, $\textbf{V}$, and $\textbf{Z}_{major}$, the spectrum PSD$_{2D}(k_\parallel,k_\perp )$ of $\textbf{Z}_{minor}$ is distributed mainly along the $k_\perp$ axis. Close to the $k_\perp$ axis, $\left| {E}_{R}\right|$ becomes larger while R$_{A}$ becomes smaller, suggesting that the dominance of magnetic energy over kinetic energy becomes more significant at small $k_\parallel$. R$_{E}$ is larger at small $k_\parallel$, implying that PSD$_{2D}(k_\parallel,k_\perp )$ of $\textbf{Z}_{minor}$ is more concentrated along the $k_\perp$ direction as compared to that of $\textbf{Z}_{major}$. The residual energy condensate at small $k_\parallel$ is consistent with simulation results in which E$_{R}$ is spontaneously generated by Alfvén wave interaction.

preprint2015arXiv

Surface quality improvement of porous thin films suitable for nanoindentation

The reliability of perovskite material LSCF to be used as cathode parts in solid oxide fuel cells also relies on its mechanical properties. Adequate surface conditions are desired when the as-sintered porous thin films are subjected to nanoindentation for mechanical property determination. In this study, extensive cracks and considerable surface roughness were found in the LSCF films after sintering at high temperatures. This would significantly scatter the nanoindentation data and result in unreliable measurements. Various attempts including the comparison of film deposition methods, drying and sintering processes, and reformulating the ink were made to improve the surface quality. Results revealed little dependence of cracking and surface roughness on deposition methods, drying or sintering processes. It was found that the critical factor for obtaining crack-free and smooth LSCF films was the ability of the ink to be self-levelling in the earlier wet state. Reproducible nanoindentation measurements were obtained for the films with improved surface quality.

preprint2015arXiv

The Grism Lens-Amplified Survey from Space (GLASS). II. Gas-phase metallicity and radial gradients in an interacting system at z~2

We present spatially resolved gas-phase metallicity for a system of three galaxies at z=1.85 detected in the Grism Lens-Amplified Survey from Space (GLASS). The combination of HST's diffraction limit and strong gravitational lensing by the cluster MACS J0717+3745 results in a spatial resolution of ~200-300 pc, enabling good spatial sampling despite the intrinsically small galaxy sizes. The galaxies in this system are separated by 50-200 kpc in projection and are likely in an early stage of interaction, evidenced by relatively high specific star formation rates. Their gas-phase metallicities are consistent with larger samples at similar redshift, star formation rate, and stellar mass. We obtain a precise measurement of the metallicity gradient for one galaxy and find a shallow slope compared to isolated galaxies at high redshift, consistent with a flattening of the gradient due to gravitational interaction. An alternative explanation for the shallow metallicity gradient and elevated star formation rate is rapid recycling of metal-enriched gas, but we find no evidence for enhanced gas-phase metallicities which should result from this effect. Notably, the measured stellar masses log(M/Msun) = 7.2-9.1 probe to an order of magnitude below previous mass-metallicity studies at this redshift. The lowest mass galaxy has properties similar to those expected for Fornax at this redshift, indicating that GLASS is able to directly study the progenitors of local group dwarf galaxies on spatially resolved scales. Larger samples from the full GLASS survey will be ideal for studying the effects of feedback, and the time evolution of metallicity gradients. These initial results demonstrate the utility of HST spectroscopy combined with gravitational lensing for characterizing resolved physical properties of galaxies at high redshift.

preprint2015arXiv

Topological effects of charge transfer in telomere G-quadruplex: Mechanism on telomerase activation and inhibition

We explore charge transfer in the telomere G-Quadruplex (TG4) DNA theoretically by the nonequilibrium Green's function method, and reveal the topological effect of charge transport in TG4 DNA. The consecutive TG4(CTG4) is semiconducting with 0.2 ~ 0.3eV energy gap. Charges transfers favorably in the consecutive TG4, but are trapped in the non-consecutive TG4 (NCTG4). The global conductance is inversely proportional to the local conductance for NCTG4. The topological structure transition from NCTG4 to CTG4 induces abruptly ~ 3nA charge current, which provide a microscopic clue to understand the telomerase activated or inhibited by TG4. Our findings reveal the fundamental property of charge transfer in TG4 and its relationship with the topological structure of TG4.

preprint2015arXiv

Topology of neutral hydrogen distribution with the Square Kilometer Array

Morphology of the complex HI gas distribution can be quantified by statistics like the Minkowski functionals, and can provide a way to statistically study the large scale structure in the HI maps both at low redshifts, and during the epoch of reionization (EoR). At low redshifts, the 21cm emission traces the underlying matter distribution. Topology of the HI gas distribution, as measured by the genus, could be used as a "standard ruler". This enables the determination of distance-redshift relation and also the discrimination of various models of dark energy and of modified gravity. The topological analysis is also sensitive to certain primordial non-Gaussian features. Compared with two-point statistics, the topological statistics are more robust against the nonlinear gravitational evolution, bias, and redshift-space distortion. The HI intensity map observation naturally avoids the sparse sampling distortion, which is an important systematic in optical galaxy survey. The large cosmic volume accessible to SKA would provide unprecedented accuracy using such a measurement... [abridged]

preprint2015arXiv

Topology-Aware Node Selection for Data Regeneration in Heterogeneous Distributed Storage Systems

Distributed storage systems introduce redundancy to protect data from node failures. After a storage node fails, the lost data should be regenerated at a replacement storage node as soon as possible to maintain the same level of redundancy. Minimizing such a regeneration time is critical to the reliability of distributed storage systems. Existing work commits to reduce the regeneration time by either minimizing the regenerating traffic, or adjusting the regenerating traffic patterns, whereas nodes participating data regeneration are generally assumed to be given beforehand. However, such regeneration time also depends heavily on the selection of the participating nodes. Selecting different participating nodes actually involve different data links between the nodes. Real-world distributed storage systems usually exhibit heterogeneous link capacities. It is possible to further reduce the regeneration time via exploiting such link capacity differences and avoiding the link bottlenecks. In this paper, we consider the minimization of the regeneration time by selecting the participating nodes in heterogeneous networks. We analyze the regeneration time and propose node selection algorithms for overlay networks and real-world topologies. Considering that the flexible amount of data blocks from each provider may deeply influence the regeneration time, several techniques are designed to enhance our schemes in overlay networks. Experimental results show that our node selection schemes can significantly reduce the regeneration time for each topology, especially in practical networks with heterogeneous link capacities.

preprint2015arXiv

Tunable electromagnetically induced transparency with a coupled superconducting system

Electromagnetically induced transparency (EIT) has usually been demonstrated by using three-level atomic systems. In this paper, we theoretically proposed an efficient method to realize EIT in microwave regime through a coupled system consisting of a flux qubit and a superconducting LC resonator with relatively high quality factor. In the present composed system, the working levels are the dressed states of a two-level flux qubit and the resonators with a probe pump field. There exits a second order coherent transfer between the dressed states. By comparing the results with those in the conventional atomic system we have revealed the physical origin of the EIT phenomenon in this composed system. Since the whole system is artificial and tunable, our scheme may have potential applications in various domains.

preprint2015arXiv

Utility of observational Hubble parameter data on dark energy evolution

Aiming at exploring the nature of dark energy, we use thirty-six observational Hubble parameter data (OHD) in the redshift range $0 \leqslant z \leqslant 2.36$ to make a cosmological model-independent test of the two-point $Omh^2(z_{2};z_{1})$ diagnostic. In $Λ$CDM, we have $Omh^2 \equiv Ω_{m}h^2$, where $Ω_{m}$ is the matter density parameter at present. We bin all the OHD into four data points to mitigate the observational contaminations. By comparing with the value of $Ω_{m}h^2$ which is constrained tightly by the Planck observations, our results show that in all six testing pairs of $Omh^2$ there are two testing pairs are consistent with $Λ$CDM at $1σ$ confidence level (CL), whereas for another two of them $Λ$CDM can only be accommodated at $2σ$ CL. Particularly, for remaining two pairs, $Λ$CDM is not compatible even at $2σ$ CL. Therefore it is reasonable that although deviations from $Λ$CDM exist for some pairs, cautiously, we cannot rule out the validity of $Λ$CDM. We further apply two methods to derive the value of Hubble constant $H_0$ utilizing the two-point $Omh^2(z_{2};z_{1})$ diagnostic. We obtain $H_0 = 71.23\pm1.54$ ${\mathrm{km \ s^{-1} \ Mpc^{-1}}}$ from inverse variance weighted $Omh^2$ value (method (I)) and $H_0 = 69.37\pm1.59$ ${\mathrm{km \ s^{-1} \ Mpc^{-1}}}$ that the $Omh^2$ value originates from Planck measurement (method (II)), both at $1σ$ CL. Finally, we explore how the error in OHD propagate into $w(z)$ at certain redshift during the reconstruction of $w(z)$. We argue that the current precision on OHD is not sufficient small to ensure the reconstruction of $w(z)$ in an acceptable error range, especially at the low redshift

preprint2014arXiv

A halo bias function measured deeply into voids without stochasticity

We study the relationship between dark-matter haloes and matter in the MIP $N$-body simulation ensemble, which allows precision measurements of this relationship, even deeply into voids. What enables this is a lack of discreteness, stochasticity, and exclusion, achieved by averaging over hundreds of possible sets of initial small-scale modes, while holding fixed large-scale modes that give the cosmic web. We find (i) that dark-matter-halo formation is greatly suppressed in voids; there is an exponential downturn at low densities in the otherwise power-law matter-to-halo density bias function. Thus, the rarity of haloes in voids is akin to the rarity of the largest clusters, and their abundance is quite sensitive to cosmological parameters. The exponential downturn appears both in an excursion-set model, and in a model in which fluctuations evolve in voids as in an open universe with an effective $Ω_m$ proportional to a large-scale density. We also find that (ii) haloes typically populate the average halo-density field in a super-Poisson way, i.e. with a variance exceeding the mean; and (iii) the rank-order-Gaussianized halo and dark-matter fields are impressively similar in Fourier space. We compare both their power spectra and cross-correlation, supporting the conclusion that one is roughly a strictly-increasing mapping of the other. The MIP ensemble especially reveals how halo abundance varies with `environmental' quantities beyond the local matter density; (iv) we find a visual suggestion that at fixed matter density, filaments are more populated by haloes than clusters.

preprint2014arXiv

A Note on Instanton Effects in ABJM Theory

We consider the quantum spectral problem appearing the Fermi gas formulation of the ABJM (Aharony-Bergman-Jafferis-Maldacena) matrix model. This is known to related to the refined topological string on local P^1*P^1 Calabi-Yau geometry. In the ABJM setting the problem is formulated by an integral equation, and is somewhat different from the one formulated directly in terms of the Calabi-Yau geometry and studied in our earlier paper. We use the similar method in our earlier paper to determine the non-perturbative contributions to the quantum phase volume in the ABJM case from the Bohr-Sommerfeld quantization condition. As in our earlier paper, the non-perturbative contributions contain higher order smooth corrections beyond those required by singularity cancellations with the perturbative contributions proposed by Kallen and Marino. Our results imply possible new contributions to the grand potential of the ABJM matrix model.

preprint2014arXiv

BRVST: Efficient and Content-Expressive Information Matching Overlay in Wireless Networks

Efficient and flexible information matching over wireless networks has become increasingly important and challenging with the popularity of smart devices and the growth of social-network-based applications. Some existing approaches designed for wired networks are not applicable to wireless networks, due to their overwhelming control overheads. In this paper, we propose a reliable and scalable binary range vector summary tree (BRVST) infrastructure for flexible information expression support, effective content matching and timely information dissemination over the dynamic wireless network. A novel attribute range vector structure has been introduced for efficient and accurate content representation and a summary tree structure to facilitate information aggregation. For robust and scalable operations over dynamic wireless network, the proposed overlay system exploits a virtual hierarchical geographic management framework. Extensive simulations demonstrate that BRVST has a significantly faster event matching speed, while incurs very low storage and traffic overhead, as compared with peer schemes tested.

preprint2014arXiv

Conditions for the vanishing of the genus-2 G-function

In this paper we give some sufficient conditions for the vanishing of the genus-2 G-function, which was introduced by B. Dubrovin, S. Liu and Y. Zhang in [DLZ]. As a corollary we prove their conjecture for the vanishing of the genus-2 G-function for ADE singularities.

preprint2014arXiv

Deciphering Solar Magnetic Activity I: On The Relationship Between The Sunspot Cycle And The Evolution Of Small Magnetic Features

Sunspots are a canonical marker of the Sun's internal magnetic field which flips polarity every ~22-years. The principal variation of sunspots, an ~11-year variation in number, modulates the amount of magnetic field that pierces the solar surface and drives significant variations in our Star's radiative, particulate and eruptive output over that period. This paper presents observations from the Solar and Heliospheric Observatory and Solar Dynamics Observatory indicating that the 11-year sunspot variation is intrinsically tied it to the spatio-temporal overlap of the activity bands belonging to the 22-year magnetic activity cycle. Using a systematic analysis of ubiquitous coronal brightpoints, and the magnetic scale on which they appear to form, we show that the landmarks of sunspot cycle 23 can be explained by considering the evolution and interaction of the overlapping activity bands of the longer scale variability.

preprint2014arXiv

Ferromagnetic response of a "high-temperature" quantum antiferromagnet

We study the finite temperature antiferromagnetic phase of the ionic Hubbard model in the strongly interacting limit using quantum Monte Carlo based dynamical mean field theory. We find that the ionic potential plays a dual role in determining the antiferromagnetic order. A small ionic potential (compared to Hubbard repulsion) increases the super-exchange coupling in the projected sector of the model, leading to an increase in the Neel temperature of the system. A large ionic potential leads to resonance between projected antiferromagnetically ordered configurations and density ordered configurations with double occupancies, thereby killing antiferromagnetism in the system. This novel way of degrading antiferromagnetism leads to spin polarization of the low energy single particle density of states. The dynamic response of the system thus mimics ferromagnetic behaviour, although the system is still an antiferromagnet in terms of the static spin order.

preprint2014arXiv

Forecasts on the Dark Energy and Primordial Non-Gaussianity Observations with the Tianlai Cylinder Array

The Tianlai experiment is dedicated to the observation of large scale structures (LSS) by the 21 cm intensity mapping technique. In this paper we make forecasts on its capability at observing or constraining the dark energy parameters and the primordial non-Gaussianity. From the LSS data one can use the baryon acoustic oscillation (BAO) and the growth rate derived from the redshift space distortion (RSD) to measure the dark energy density and equation of state. The primordial non-Gaussianity can be constrained either by looking for scale-dependent bias in the power spectrum, or by using the bispectrum. Here we consider three cases: the Tianlai cylinder array pathfinder which is currently being built, an upgrade of the pathfinder array with more receiver units, and the full-scale Tianlai cylinder array. Using the full-scale Tianlai experiment, we expect $σ_{w_0} \sim 0.082$ and $σ_{w_a} \sim 0.21$ from the BAO and RSD measurements, $σ_{\rm f_{NL}}^{\rm local} \sim 14$ from the power spectrum measurements with scale-dependent bias, and $σ_{\rm f_{NL}}^{\rm local} \sim 22$ and $σ_{\rm f_{NL}}^{\rm equil} \sim 157$ from the bispectrum measurements.

preprint2014arXiv

Generalized Darboux transformation and higher-order rogue wave solutions of the coupled Hirota equations

This paper is dedicated to study higher-order rogue wave solutions of the coupled Hirota equations with high-order nonlinear effects like the third dispersion, self-steepening and stimulated Raman scattering terms. By using the generalized Darboux transformation, a unified representation of Nth-order rogue wave solution with 3N+1 free parameters is obtained. In particular, the first-order rogue wave solution containing polynomials of fourth order, and the second-order rogue wave solution consisting of polynomials of eighth order are explicitly presented. Through the numerical plots, we show that four or six fundamental rogue waves can coexist in the second-order rogue waves. By adjusting the values of some free parameters, different kinds of spatial-temporal distribution structures such as circular, quadrilateral, triangular, line and fundamental patterns are exhibited. Moreover, we see that nine or twelve fundamental rogue waves can synchronously emerge in the third-order rogue waves. The more intricate spatialtemporal distribution shapes are shown via adequate choices of the free parameters. Several wave characteristics such as the amplitudes and the coordinate positions of the highest peaks in the rogue waves are discussed.

preprint2014arXiv

Generalized Darboux transformation and localized waves in coupled Hirota equations

In this paper, we construct a generalized Darboux transformation to the coupled Hirota equations with high-order nonlinear effects like the third dispersion, self-steepening and inelastic Raman scattering terms. As application, an Nth-order localized wave solution on the plane backgrounds with the same spectral parameter is derived through the direct iterative rule. In particular, some semi-rational, multi-parametric localized wave solutions are obtained: (1) Vector generalization of the first- and the second-order rogue wave solution; (2) Interactional solutions between a dark-bright soliton and a rogue wave, two dark-bright solitons and a second-order rogue wave; (3) Interactional solutions between a breather and a rogue wave, two breathers and a second-order rogue wave. The results further reveal the striking dynamic structures of localized waves in complex coupled systems.

preprint2014arXiv

Genus-2 G-function for $P^1$ orbifolds

In this paper we prove that for Gromov-Witten theory of $P^1$ orbifolds of ADE type the genus-2 G-function introduced by B. Dubrovin, S. Liu, and Y. Zhang vanishes. Together with our results in [LW], this completely solves the main conjecture in their paper [DLZ]. In the process, we also found a sufficient condition for the vanishing of the genus-2 G-function which is weaker than the condition given in our previous paper [LW].

preprint2014arXiv

Identifying Potential Markers of the Sun's Giant Convective Scale

Line-of-sight magnetograms from the Helioseismic and Magnetic Imager (HMI) of the Solar Dynamics Observatory (SDO) are analyzed using a diagnostic known as the "Magnetic Range of Influence," or MRoI. The MRoI is a measure of the length over which a photospheric magnetogram is balanced and so its application gives the user a sense of the connective length scales in the outer solar atmosphere. The MRoI maps and histograms inferred from the SDO/HMI magnetograms primarily exhibit four scales: a scale of a few megameters that can be associated with granulation, a scale of a few tens of megameters that can be associated with super-granulation, a scale of many hundreds to thousands of megameters that can be associated with coronal holes and active regions, and a hitherto unnoticed scale that ranges from 100 to 250 megameters. We infer that this final scale is an imprint of the (rotationally-driven) giant convective scale on photospheric magnetism. This scale appears in MRoI maps as well-defined, spatially distributed, concentrations that we have dubbed "g-nodes." Furthermore, using coronal observations from the Atmospheric Imaging Assembly (AIA) on SDO, we see that the vicinity of these g-nodes appears to be a preferred location for the formation of extreme ultraviolet (EUV, and likely X-Ray) brightpoints. These observations and straightforward diagnostics offer the potential of a near-real-time mapping of the Sun's largest convective scale, a scale that possibly reaches to the very bottom of the convective zone.

preprint2014arXiv

Kinematic Morphology of Large-scale Structure: Evolution from Potential to Rotational Flow

As an alternative way of describing the cosmological velocity field, we discuss the evolution of rotational invariants constructed from the velocity gradient tensor. Compared with the traditional divergence-vorticity decomposition, these invariants, defined as coefficients of characteristic equation of the velocity gradient tensor, enable a complete classification of all possible flow patterns in the dark-matter comoving frame, including both potential and vortical flows. We show that this tool, first introduced in turbulence two decades ago, proves to be very useful in understanding the evolution of the cosmic web structure, and in classifying its morphology. Before shell-crossing, different categories of potential flow are highly associated with cosmic web structure, because of the coherent evolution of density and velocity. This correspondence is even preserved at some level when vorticity is generated after shell-crossing. The evolution from the potential to vortical flow can be traced continuously by these invariants. With the help of this tool, we show that the vorticity is generated in a particular way that is highly correlated with the large-scale structure. This includes a distinct spatial distribution and different types of alignment between cosmic web and vorticity direction for various vortical flows. Incorporating shell-crossing into closed dynamical systems is highly non-trivial, but we propose a possible statistical explanation for some of these phenomena relating to the internal structure of the three-dimensional invariants space.

preprint2014arXiv

Logical gaps in the approximate solutions of the social learning game and an exact solution

After the social learning models were proposed, finding the solutions of the games becomes a well-defined mathematical question. However, almost all papers on the games and their applications are based on solutions built upon either an add-hoc argument or a twisted Bayesian analysis of the games. Here, we present logical gaps in those solutions and an exact solution of our own. We also introduced a minor extension to the original game such that not only logical difference but also difference in action outcomes among those solutions become visible.

preprint2014arXiv

MDR Codes: A New Class of RAID-6 Codes with Optimal Rebuilding and Encoding

As storage systems grow in size, device failures happen more frequently than ever before. Given the commodity nature of hard drives employed, a storage system needs to tolerate a certain number of disk failures while maintaining data integrity, and to recover lost data with minimal interference to normal disk I/O operations. RAID-6, which can tolerate up to two disk failures with the minimum redundancy, is becoming widespread. However, traditional RAID-6 codes suffer from high disk I/O overhead during recovery. In this paper, we propose a new family of RAID-6 codes, the Minimum Disk I/O Repairable (MDR) codes, which achieve the optimal disk I/O overhead for single failure recoveries. Moreover, we show that MDR codes can be encoded with the minimum number of bit-wise XOR operations. Simulation results show that MDR codes help to save about half of disk read operations than traditional RAID-6 codes, and thus can reduce the recovery time by up to 40%.

preprint2014arXiv

New Bounds For Frameproof Codes

Frameproof codes are used to fingerprint digital data. It can prevent copyrighted materials from unauthorized use. In this paper, we study upper and lower bounds for $w$-frameproof codes of length $N$ over an alphabet of size $q$. The upper bound is based on a combinatorial approach and the lower bound is based on a probabilistic construction. Both bounds can improve previous results when $q$ is small compared to $w$, say $cq\leq w$ for some constant $c\leq q$. Furthermore, we pay special attention to binary frameproof codes. We show a binary $w$-frameproof code of length $N$ can not have more than $N$ codewords if $N<\binom{w+1}{2}$.

preprint2014arXiv

Noise-compensating pulses for electrostatically controlled silicon spin qubits

We study the performance of SUPCODE---a family of dynamically correcting pulses designed to cancel simultaneously both Overhauser and charge noise for singlet-triplet spin qubits---adapted to silicon devices with electrostatic control. We consider both natural Si and isotope-enriched Si systems, and in each case we investigate the behavior of individual gates under static noise and perform randomized benchmarking to obtain the average gate error under realistic 1/f noise. We find that in most cases SUPCODE pulses offer roughly an order of magnitude reduction in gate error, and especially in the case of isotope-enriched Si, SUPCODE yields gate operations of very high fidelity. We also develop a version of SUPCODE that cancels the charge noise only, "$δJ$-SUPCODE", which is particularly beneficial for isotope-enriched Si devices where charge noise dominates Overhauser noise, offering a level of error reduction comparable to the original SUPCODE while yielding gate times that are 30% to 50% shorter. Our results show that the SUPCODE noise-compensating pulses provide a fast, simple, and effective approach to error suppression, bringing gate errors well below the quantum error correction threshold in principle.

preprint2014arXiv

Nonlinear dynamics of phase space zonal structures and energetic particle physics in fusion plasmas

A general theoretical framework for investigating nonlinear dynamics of phase space zonal structures is presented in this work. It is then, more specifically, applied to the limit where the nonlinear evolution time scale is smaller or comparable to the wave-particle trapping period. In this limit, both theoretical and numerical simulation studies show that non-adiabatic frequency chirping and phase locking could lead to secular resonant particle transport on meso- or macro-scales. The interplay between mode structures and resonant particles then provides the crucial ingredient to properly understand and analyze the nonlinear dynamics of Alfvén wave instabilities excited by non-perturbative energetic particles in burning fusion plasmas. Analogies with autoresonance in nonlinear dynamics and with superradiance in free electron lasers are also briefly discussed.

preprint2014arXiv

On the Nonlinear Evolution of Cosmic Web: Lagrangian Dynamics Revisited

We investigate the nonlinear evolution of cosmic morphologies of the large-scale structure by examining the Lagrangian dynamics of various tensors of a cosmic fluid element, including the velocity gradient tensor, the Hessian matrix of the gravitational potential as well as the deformation tensor. Instead of the eigenvalue representation, the first two tensors, which associate with the "kinematic" and "dynamical" cosmic web classification algorithm respectively, are studied in a more convenient parameter space. These parameters are defined as the rotational invariant coefficients of the characteristic equation of the tensor. In the nonlinear local model (NLM) where the magnetic part of Weyl tensor vanishes, these invariants are fully capable of characterizing the dynamics. Unlike the Zeldovich approximation (ZA), where various morphologies do not change before approaching a one-dimensional singularity, the sheets in NLM are unstable for both overdense and underdense perturbations. While it has long been known that the coupling between tidal tensor and velocity shear would cause a filamentary final configuration of a collapsing region, we show that the underdense perturbation are more subtle, as the balance between the shear rate (tidal force) and the divergence (density) could lead to different morphologies. Interestingly, this instability also sets the basis for understanding some distinctions of the cosmic web identified dynamically and kinematically. We show that the sheets with negative density perturbation in the potential based algorithm would turn to filaments faster than in the kinematic method, which could explain the distorted dynamical filamentary structure observed in the simulation.

preprint2014arXiv

Preparing ground states and squeezed states of nanomechanical cantilevers by fast dissipation

We propose a protocol that enables strong coupling between a flux qubit and the quantized motion of a magnetized nanomechanical cantilever. The flux qubit is driven by microwave fields with suitable parameters to induce sidebands, which will lead to the desired coupling. We show that the nanomechanical modes can be cooled to the ground states and the single-mode squeezed vacuum states can be generated via fast dissipation of the flux qubit. In our scheme, the qubit decay plays a positive role and can help drive the system to the target states.

preprint2014arXiv

Prometheus: LT Codes Meet Cooperative Transmission in Cellular Networks

Following fast growth of cellular networks, more users have drawn attention to the contradiction between dynamic user data traffic and static data plans. To address this important but largely unexplored issue, in this paper, we design a new data plan sharing system named Prometheus, which is based on the scenario that some smartphone users have surplus data traffic and are willing to help others download data. To realize this system, we first propose a mechanism that incorporates LT codes into UDP. It is robust to transmission errors and encourages more concurrent transmissions and forwardings. It also can be implemented easily with low implementation complexity. Then we design an incentive mechanism using a Stackelberg game to choose assistant users ($AUs$), all participants will gain credits in return, which can be used to ask for future help when they need to download something. Finally real environment experiments are conducted and the results show that users in our Prometheus not only can manage their surplus data plan more efficiently, but also achieve a higher speed download rate.

preprint2014arXiv

Quark magnetar in three-flavor Nambu--Jona-Lasinio model with vector interaction and magnetized gluon potential

We investigate properties of strange quark matter in the framework of SU(3) Nambu--Jona-Lasinio(NJL) model with vector interaction under strong magnetic fields. The effects of vector-isoscalar and vector-isovector interaction on the equation of state of strange quark matter are investigated, and it is found that the equation of state is not sensitive to the vector-isovector interaction, however, a repulsive interaction in the vector-isoscalar channel gives a stiffer equation of state for cold dense quark matter. In the presence of magnetic field, gluons will be magnetized via quark loops, and the contribution from magnetized gluons to the equation of state is also estimated. The sound velocity square is a quantity to measure the hardness or softness of dense quark matter, and in the NJL model without vector interaction at zero magnetic field the sound velocity square is always less than 1/3. It is found that a repulsive vector-isoscalar interaction and a positive pressure contribution from magnetized gluons can enhance the sound velocity square, which can even reach 1. To construct quark magnetars under strong magnetic fields, we consider anisotropic pressures and use a density-dependent magnetic field profile to mimic the magnetic field distribution in a quark star. We also analyze the parameter region for the magnitude of vector-isoscalar interaction and the contribution from magnetized gluons in order to produce 2 solar mass quark magnetars.

preprint2014arXiv

Quark stars under strong magnetic fields

Within the confined isospin- and density-dependent mass model, we study the properties of strange quark matter (SQM) and quark stars (QSs) under strong magnetic fields. The equation of state of SQM under a constant magnetic field is obtained self-consistently and the pressure perpendicular to the magnetic field is shown to be larger than that parallel to the magnetic field, implying that the properties of magnetized QSs generally depend on both the strength and the orientation of the magnetic fields distributed inside the stars. Using a density-dependent magnetic field profile which is introduced to mimic the magnetic field strength distribution in a star, we study the properties of static spherical QSs by assuming two extreme cases for the magnetic field orientation in the stars, i.e., the radial orientation in which the local magnetic fields are along the radial direction and the transverse orientation in which the local magnetic fields are randomly oriented but perpendicular to the radial direction. Our results indicate that including the magnetic fields with radial (transverse) orientation can significantly decrease (increase) the maximum mass of QSs, demonstrating the importance of the magnetic field orientation inside the magnetized compact stars.

preprint2014arXiv

Reconstructing evolving signalling networks by hidden Markov nested effects models

Inferring time-varying networks is important to understand the development and evolution of interactions over time. However, the vast majority of currently used models assume direct measurements of node states, which are often difficult to obtain, especially in fields like cell biology, where perturbation experiments often only provide indirect information of network structure. Here we propose hidden Markov nested effects models (HM-NEMs) to model the evolving network by a Markov chain on a state space of signalling networks, which are derived from nested effects models (NEMs) of indirect perturbation data. To infer the hidden network evolution and unknown parameter, a Gibbs sampler is developed, in which sampling network structure is facilitated by a novel structural Metropolis--Hastings algorithm. We demonstrate the potential of HM-NEMs by simulations on synthetic time-series perturbation data. We also show the applicability of HM-NEMs in two real biological case studies, in one capturing dynamic crosstalk during the progression of neutrophil polarisation, and in the other inferring an evolving network underlying early differentiation of mouse embryonic stem cells.

preprint2014arXiv

Reconstructing primordial power spectrum using Planck and SDSS-III measurements

We develop an accurate and efficient Bayesian method to reconstruct the primordial power spectrum in a model-independent way, and apply it to the latest cosmic microwave background measurement from Planck mission, and the large scale structure observation of SDSS-III BOSS (CMASS) sample, combined with the type Ia supernovae sample (SNLS 3-year) and the measurements of baryon acoustic oscillations from SDSS-II, 6dF, and WiggleZ survey. We confirm that the scale-invariant primordial power spectrum is strongly disfavored, and a model with suppressed power on horizon scales is supported by current data. We also find that a modulation on scales $5\times10^{-4}~\textrm{Mpc}^{-1} \lesssim k \lesssim 0.01~\textrm{Mpc}^{-1}$ is mildly preferred at $2σ$ confidence level, whose origin needs further investigation.

preprint2014arXiv

Robust quantum gates for singlet-triplet spin qubits using composite pulses

We present a comprehensive theoretical treatment of SUPCODE, a method for generating dynamically corrected quantum gate operations, which are immune to random noise in the environment, by using carefully designed sequences of soft pulses. SUPCODE enables dynamical error suppression even when the control field is constrained to be positive and uniaxial, making it particularly suited to counteracting the effects of noise in systems subject to these constraints such as singlet-triplet qubits. We describe and explain in detail how to generate SUPCODE pulse sequences for arbitrary single-qubit gates and provide several explicit examples of sequences that implement commonly used gates, including the single-qubit Clifford gates. We develop sequences for noise-resistant two-qubit gates for two exchanged-coupled singlet-triplet qubits by cascading robust single-qubit gates, leading to a 35% reduction in gate time compared to previous works. This cascade approach can be scaled up to produce gates for an arbitrary-length spin qubit array, and is thus relevant to scalable quantum computing architectures. To more accurately describe real spin qubit experiments, we show how to design sequences that incorporate additional features and practical constraints such as sample-specific charge noise models and finite pulse rise times. We provide a detailed analysis based on randomized benchmarking to show how SUPCODE gates perform under realistic $1/f^α$ noise and find a strong dependence of gate fidelity on the exponent $α$, with best performance for $α>1$. Our SUPCODE sequences can therefore be used to implement robust universal quantum computation while accommodating the fundamental constraints and experimental realities of singlet-triplet qubits.

preprint2014arXiv

Robust Two-Qubit Gates for Exchange-Coupled Qubits

We present composite pulse sequences that perform fault-tolerant two-qubit gate operations on exchange-only quantum dot spin qubits in various experimentally relevant geometries. We show how to perform dynamically corrected two-qubit gates in exchange-only systems with the leading hyperfine error term cancelled. These pulse sequences are constructed to conform to the realistic experimental constraint of strictly non-negative couplings. We establish that our proposed pulse sequences lead to several orders of magnitude improvement in the gate fidelity compared with their uncorrected counterparts. Together with single-qubit dynamically corrected gates, our results enable noise-resistant universal quantum operations with exchange-only qubits.

preprint2014arXiv

Towards the Asymptotic Sum Capacity of the MIMO Cellular Two-Way Relay Channel

In this paper, we consider the transceiver and relay design for multiple-input multiple-output (MIMO) cellular two-way relay channel (cTWRC), where a multi-antenna base station (BS) exchanges information with multiple multi-antenna mobile stations via a multi-antenna relay station (RS). We propose a novel two-way relaying scheme to approach the sum capacity of the MIMO cTWRC.

preprint2014arXiv

Vibration-assisted coherent excitation energy transfer in a detuning system

The roles of the vibration motions played in the excitation energy transfer process are studied. It is found that a strong coherent transfer in the hybrid system emerges when the detuning between the donor and the acceptor equals the intrinsic frequency of the vibrational mode, and as a result the energy can be transferred into the acceptor much effectively. Three cases of the donor and the acceptor coupling with vibrational modes are investigated respectively. We find that the quantum interference between the two different transfer channels via the vibrational modes can affects the dynamics of the system significantly.

preprint2013arXiv

A Graph Minor Perspective to Multicast Network Coding

Network Coding encourages information coding across a communication network. While the necessity, benefit and complexity of network coding are sensitive to the underlying graph structure of a network, existing theory on network coding often treats the network topology as a black box, focusing on algebraic or information theoretic aspects of the problem. This work aims at an in-depth examination of the relation between algebraic coding and network topologies. We mathematically establish a series of results along the direction of: if network coding is necessary/beneficial, or if a particular finite field is required for coding, then the network must have a corresponding hidden structure embedded in its underlying topology, and such embedding is computationally efficient to verify. Specifically, we first formulate a meta-conjecture, the NC-Minor Conjecture, that articulates such a connection between graph theory and network coding, in the language of graph minors. We next prove that the NC-Minor Conjecture is almost equivalent to the Hadwiger Conjecture, which connects graph minors with graph coloring. Such equivalence implies the existence of $K_4$, $K_5$, $K_6$, and $K_{O(q/\log{q})}$ minors, for networks requiring $\mathbb{F}_3$, $\mathbb{F}_4$, $\mathbb{F}_5$ and $\mathbb{F}_q$, respectively. We finally prove that network coding can make a difference from routing only if the network contains a $K_4$ minor, and this minor containment result is tight. Practical implications of the above results are discussed.

preprint2013arXiv

Beamforming Design for Multiuser Two-Way Relaying: A Unified Approach via Max-Min SINR

In this paper, we develop a unified framework for beamforming designs in non-regenerative multiuser two-way relaying (TWR).

preprint2013arXiv

Dynamically corrected gates for an exchange-only qubit

We provide analytical composite pulse sequences that perform dynamical decoupling concurrently with arbitrary rotations for a qubit coded in the spin state of a triple quantum dot. The sequences are designed to respect realistic experimental constraints such as strictly nonnegative couplings. Logical errors and leakage errors are simultaneously corrected. A short pulse sequence is presented to compensate nuclear noise and a longer sequence is presented to simultaneously compensate nuclear and charge noise. The capability developed in this work provides a clear prescription for combatting the relevant sources of noise that currently hinder exchange-only qubit experiments.

preprint2013arXiv

Hot Electron and Pair Production from the Texas Petawatt Laser Irradiating Thick Gold Targets

We present data for relativistic hot electron production by the Texas Petawatt Laser irradiating solid Au targets with thickness between 1 and 4 mm. The experiment was performed at the short focus target chamber TC1 in July 2011, with laser energies around 50 J. We measured hot electron spectra out to 50 MeV which show a narrow peak around 10 - 20 MeV plus high energy exponential tail. The hot electron spectral shape differs from those reported for other PW lasers. We did not observe direct evidence of positron production above background.

preprint2013arXiv

Noise-resistant control for a spin qubit array

We develop a systematic method of performing corrected gate operations on an array of exchange-coupled singlet-triplet qubits in the presence of both fluctuating nuclear Overhauser field gradients and charge noise. The single-qubit control sequences we present have a simple form, are relatively short, and form the building blocks of a corrected CNOT gate when also implemented on the inter-qubit exchange link. This is a key step towards enabling large-scale quantum computation in a semiconductor-based architecture by facilitating error reduction below the quantum error correction threshold for both single-qubit and multi-qubit gate operations.

preprint2013arXiv

Rogue wave solutions in AB system

In this paper, the generalized Darboux transformation is established to the AB system, which mainly describes marginally unstable baroclinic wave packets in geophysical fluids and ultra-short pulses in nonlinear optics. We find a unified formula of Nth-order rogue wave solution for the AB system by the direct iterative rule. In particular, rogue waves possessing several free parameters from first to second order are calculated. The dynamics and some interesting structures of the rogue waves are illustrated through some figures.

preprint2013arXiv

Sublinear expectation linear regression

Nonlinear expectation, including sublinear expectation as its special case, is a new and original framework of probability theory and has potential applications in some scientific fields, especially in finance risk measure and management. Under the nonlinear expectation framework, however, the related statistical models and statistical inferences have not yet been well established. The goal of this paper is to construct the sublinear expectation regression and investigate its statistical inference. First, a sublinear expectation linear regression is defined and its identifiability is given. Then, based on the representation theorem of sublinear expectation and the newly defined model, several parameter estimations and model predictions are suggested, the asymptotic normality of estimations and the mini-max property of predictions are obtained. Furthermore, new methods are developed to realize variable selection for high-dimensional model. Finally, simulation studies and a real-life example are carried out to illustrate the new models and methodologies. All notions and methodologies developed are essentially different from classical ones and can be thought of as a foundation for general nonlinear expectation statistics.

preprint2013arXiv

Testing X-ray Measurements of Galaxy Cluster Gas Mass Fraction Using the Cosmic Distance-Duality Relation

We propose a consistency test of some recent X-ray gas mass fraction ($f_{\rm{gas}}$) measurements in galaxy clusters, using the cosmic distance-duality relation, $η_{\rm{theory}}=\dl(1+z)^{-2}/\da$, with luminosity distance ($\dl$) data from the Union2 compilation of type Ia supernovae. We set $η_{\rm{theory}}\equiv1$, instead of assigning any redshift parameterizations to it, and constrain the cosmological information preferred by $f_{\rm{gas}}$ data along with supernova observations. We adopt a new binning method in the reduction of the Union2 data, in order to minimize the statistical errors. Four data sets of X-ray gas mass fraction, which are reported by Allen et al. (2 samples), LaRoque et al. and Ettori et al., are detailedly analyzed against two theoretical modelings of $f_{\rm{gas}}$. The results from the analysis of Allen et al.'s samples prove the feasibility of our method. It is found that the preferred cosmology by LaRoque et al.'s sample is consistent with its reference cosmology within 1-$σ$ confidence level. However, for Ettori et al.'s $f_{\rm{gas}}$ sample, the inconsistency can reach more than 3-$σ$ confidence level and this dataset shows special preference to an $\Ol=0$ cosmology.

preprint2013arXiv

The effect of aberration on partial-sky measurements of the cosmic microwave background temperature power spectrum

Our motion relative to the cosmic-microwave-background (CMB) rest frame deflects light rays giving rise to shifts as large as L -> L(1+beta), where beta=0.00123 is our velocity (in units of the speed of light) on measurements CMB fluctuations. Here we present a novel harmonic-space approach to this CMB aberration that improves upon prior work by allowing us to (i) go to higher orders in beta, thus extending the validity of the analysis to measurements at L > 1/beta ~ 800; and (ii) treat the effects of window functions and pixelization in a more accurate and computationally efficient manner. We calculate precisely the magnitude of the systematic bias in the power spectrum inferred from the partial sky, and show that aberration shifts the multipole moment by Delta L/L ~ beta<cos(theta)>, with <cos(theta)> averaged over the survey footprint. Such a shift, if ignored, would bias the measurement of the sound-horizon size theta_* at the 0.01%-level, which is comparable to the measurement uncertainties of Planck. The bias can then propagate into cosmological parameters such as the angular-diameter distance, Hubble parameter and dark-energy equation of state. We study the effect of aberration for current Planck, South Pole Telescope (SPT) and Atacama Cosmology Telescope (ACT) data and show that the bias cannot be neglected. We suggest that the small tension between Planck, ACT, and SPT may be due partially to aberration. An Appendix shows how the near constancy of the full-sky power spectrum under aberration follows from unitarity of the aberration kernel.

preprint2012arXiv

A CME-driven shock analysis of the 14-Dec-2006 SEP event

Observations of the interplanetary shock provide us with strong evidence of particle acceleration to multi-MeV energies, even up to GeV energy, in a solar flare or coronal mass ejection (CME). Diffusive shock acceleration is an efficient mechanism for particle acceleration. For investigating the shock structure, the energy injection and energy spectrum of a CME-driven shock, we perform dynamical Monte Carlo simulation of the 14-Dec-2006 CME-driven shock using an anisotropic scattering law. The simulated results of the shock fine structure, particle injection, and energy spectrum are presented. We find that our simulation results give a good fit to the observations from multiple spacecraft.

preprint2012arXiv

Covalency, double-counting and the metal-insulator phase diagram in transition metal oxides

Dynamical mean field theory calculations are used to show that for late transition-metal-oxides a critical variable for the Mott/charge-transfer transition is the number of d-electrons, which is determined by charge transfer from oxygen ions. Insulating behavior is found only for a narrow range of d-occupancy, irrespective of the size of the intra-d Coulomb repulsion. The result is useful in interpreting 'density functional +U' and 'density functional plus dynamical mean field' methods in which additional correlations are applied to a specific set of orbitals and an important role is played by the 'double counting correction' which dictates the occupancy of these correlated orbitals. General considerations are presented and are illustrated by calculations for two representative transition metal oxide systems: layered perovskite Cu-based "high-Tc" materials, an orbitally non-degenerate electronically quasi-two dimensional systems, and pseudocubic rare earch nickelates, an orbitally degenerate electronically three dimensional system. Density functional calculations yield d-occupancies very far from the Mott metal-insulator phase boundary in the nickelate materials, but closer to it in the cuprates, indicating the sensitivity of theoretical models of the cuprates to the choice of double counting correction and corroborating the critical role of lattice distortions in attaining the experimentally observed insulating phase in the nickelates.

preprint2012arXiv

Electrostatic field acceleration of laser-driven ion bunch by using double layer thin foils

Monoenergetic ion bunch generation and acceleration from double layer thin foil target irradiated by intense linearly polarized (LP) laser pulse is investigated using two-dimensional (2D) particle-in-cell (PIC) simulations. The low-Z ions in the front layer of the target are accelerated by the laser-driven hot electrons and penetrate through the high-Z ion layer to generate a quasi-monoenergetic ion bunch, and this bunch will continue to be accelerated by the quasi-stable electrostatic sheath field which is formed by the immobile high-Z ions and the hot electrons. This mechanism offers possibility to generate monoenergetic ion bunch without ultrahigh-contrast and ultrahigh gradient laser pulses in beam generation experiments, which is confirmed by our simulations.

preprint2012arXiv

Observational constraints on cosmic neutrinos and dark energy revisited

Using several cosmological observations, i.e. the cosmic microwave background anisotropies (WMAP), the weak gravitational lensing (CFHTLS), the measurements of baryon acoustic oscillations (SDSS+WiggleZ), the most recent observational Hubble parameter data, the Union2.1 compilation of type Ia supernovae, and the HST prior, we impose constraints on the sum of neutrino masses ($\mnu$), the effective number of neutrino species ($\neff$) and dark energy equation of state ($w$), individually and collectively. We find that a tight upper limit on $\mnu$ can be extracted from the full data combination, if $\neff$ and $w$ are fixed. However this upper bound is severely weakened if $\neff$ and $w$ are allowed to vary. This result naturally raises questions on the robustness of previous strict upper bounds on $\mnu$, ever reported in the literature. The best-fit values from our most generalized constraint read $\mnu=0.556^{+0.231}_{-0.288}\rm eV$, $\neff=3.839\pm0.452$, and $w=-1.058\pm0.088$ at 68% confidence level, which shows a firm lower limit on total neutrino mass, favors an extra light degree of freedom, and supports the cosmological constant model. The current weak lensing data are already helpful in constraining cosmological model parameters for fixed $w$. The dataset of Hubble parameter gains numerous advantages over supernovae when $w=-1$, particularly its illuminating power in constraining $\neff$. As long as $w$ is included as a free parameter, it is still the standardizable candles of type Ia supernovae that play the most dominant role in the parameter constraints.

preprint2012arXiv

Performance Guarantees for Distributed Reachability Queries

In the real world a graph is often fragmented and distributed across different sites. This highlights the need for evaluating queries on distributed graphs. This paper proposes distributed evaluation algorithms for three classes of queries: reachability for determining whether one node can reach another, bounded reachability for deciding whether there exists a path of a bounded length between a pair of nodes, and regular reachability for checking whether there exists a path connecting two nodes such that the node labels on the path form a string in a given regular expression. We develop these algorithms based on partial evaluation, to explore parallel computation. When evaluating a query Q on a distributed graph G, we show that these algorithms possess the following performance guarantees, no matter how G is fragmented and distributed: (1) each site is visited only once; (2) the total network traffic is determined by the size of Q and the fragmentation of G, independent of the size of G; and (3) the response time is decided by the largest fragment of G rather than the entire G. In addition, we show that these algorithms can be readily implemented in the MapReduce framework. Using synthetic and real-life data, we experimentally verify that these algorithms are scalable on large graphs, regardless of how the graphs are distributed.

preprint2012arXiv

Resummed Perturbation Theory of Galaxy Clustering

The relationship between observed tracers such as galaxies and the underlying dark matter distribution is crucial in extracting cosmological information. As the linear bias model breaks down at quasi-linear scales, the standard perturbative approach of the nonlinear Eulerian bias model (EBM) is not accurate enough in describing galaxy clustering. In this paper, we discuss such a model in the context of resummed perturbation theory, and further generalize it to incorporate the subsequent gravitational evolution by combining with a Lagrangian description of galaxies' motion. The multipoint propagators we constructed for such model also exhibit exponential damping similar to their dark matter counterparts, therefore the convergence property of statistics built upon these quantities is improved. This is achieved by applying both Eulerian and Lagrangian resummation techniques of dark matter field developed in recent years. As inherited from the Lagrangian description of galaxy density evolution, our approach automatically incorporates the non-locality induced by gravitational evolution after the formation of the tracer, and also allows us to include a continuous galaxy formation history by temporally weighted-averaging relevant quantities with the galaxy formation rate.

preprint2012arXiv

What can we learn about solar coronal mass ejections, coronal dimmings, and Extreme-Ultraviolet jets through spectroscopic observations?

We analyze several data sets obtained by Hinode/EIS and find various types of flows during CMEs and EUV jet eruptions. CME-induced dimming regions are found to be characterized by significant blueshift and enhanced line width by using a single Gaussian fit. While a red-blue (RB) asymmetry analysis and a RB-guided double Gaussian fit of the coronal line profiles indicate that these are likely caused by the superposition of a strong background emission component and a relatively weak (~10%) high-speed (~100 km s-1) upflow component. This finding suggests that the outflow velocity in the dimming region is probably of the order of 100 km s-1, not ~20 km s-1 as reported previously. Density and temperature diagnostics suggest that dimming is primarily an effect of density decrease rather than temperature change. The mass losses in dimming regions as estimated from different methods are roughly consistent with each other and they are 20%-60% of the masses of the associated CMEs. With the guide of RB asymmetry analysis, we also find several temperature-dependent outflows (speed increases with temperature) immediately outside the (deepest) dimming region. In an erupted CME loop and an EUV jet, profiles of emission lines formed at coronal and transition region temperatures are found to exhibit two well-separated components, an almost stationary component accounting for the background emission and a highly blueshifted (~200 km s-1) component representing emission from the erupting material. The two components can easily be decomposed through a double Gaussian fit and we can diagnose the electron density, temperature and mass of the ejecta. Combining the speed of the blueshifted component and the projected speed of the erupting material derived from simultaneous imaging observations, we can calculate the real speed of the ejecta.

preprint2011arXiv

Cosmological models with Lagrange Multiplier Field

We first consider the Einstein-aether theory with a gravitational coupling and a Lagrange multiplier field, and then consider the non-minimally coupled quintessence field theory with Lagrange multiplier field. We study the influence of the Lagrange multiplier field on these models. We show that the energy density evolution of the Einstein-aether field and the quintessence field are significantly modified. The energy density of the Einstein-aether is nearly a constant during the entire history of the Universe. The energy density of the quintessence field can also be kept nearly constant in the matter dominated Universe, or even exhibit a phantom-like behavior for some models. This suggests a possible dynamical origin of the cosmological constant or dark energy. Further more, for the canonical quintessence in the absence of gravitational coupling, we find that the quintessence scalar field can play the role of cold dark matter with the introduction of a Lagrange multiplier field. We conclude that the Lagrange multiplier field could play a very interesting and important role in the construction of cosmological models.

preprint2011arXiv

d_{3z^2-r^2} Orbital in high-Tc cuprates: Excitonic spectrum, metal-insulator phase diagram, optical conductivity and orbital character of doped holes

The single-site dynamical mean-field approximation is used to solve a model of high-Tc cuprate superconductors which includes both d_{x^2-y^2} and d_{3z^2-r^2} orbitals on the Cu as well as the relevant oxygen states. Both T (with apical oxygen) and T' (without apical oxygen) crystal structures are considered. In both phases, inclusion of the d_{3z^2-r^2} orbital is found to broaden the range of stability of the charge transfer insulating phase. For equal charge transfer energies and interaction strengths, the T' phase is found to be less strongly correlated than the T phase. For both structures, d-d excitons are found within the charge-transfer gap. However, for all physically relevant dopings the Fermi surface is found to have only one sheet and the admixture of d_{3z^2-r^2} into ground state wave function remains negligible (<5%). Inclusion of the extra orbitals is found not to resolve the discrepancy between computed and observed conductivity in the insulating state.

preprint2011arXiv

Diagrammatic Quantum Monte Carlo solution of the two-dimensional Cooperon-Fermion model

We investigate the two-dimensional cooperon-fermion model in the correlated regime with a new continuous-time diagrammatic determinant quantum Monte Carlo (DDQMC) algorithm. We estimate the transition temperature $T_{c}$, examine the effectively reduced band gap and cooperon mass, and find that delocalization of the cooperons enhances the diamagnetism. When applied to diamagnetism of the pseudogap phase in high-$T_{c}$ cuprates, we obtain results in a qualitative agreement with recent torque magnetization measurements.

preprint2011arXiv

Generic Hubbard model description of semiconductor quantum dot spin qubits

We introduce a Hubbard model as the simple quantum generalization of the classical capacitance circuit model to study semiconductor quantum-dot spin qubits. We prove theoretically that our model is equivalent to the usual capacitance circuit model in the absence of quantum fluctuations. However, our model naturally includes quantum effects such as hopping and spin exchange. The parameters of the generalized Hubbard model can either be directly read off from the experimental plot of the stability diagram or be calculated from the microscopic theory, establishing a quantitative connection between the two. We show that, while the main topology of the charge stability diagram is determined by the ratio between inter-site and on-site Coulomb repulsion, fine details of the stability diagram reveal information about quantum effects. Extracting quantum information from experiments using our Hubbard model approach is simple, but would require the measurement resolution to increase by an order of magnitude.

preprint2011arXiv

High-frequency asymptotic behavior of self-energies in quantum impurity models

We present explicit expressions for the high-frequency asymptotic behavior of electron self-energy of general quantum impurity models, which may be useful for improving the convergence of dynamical mean-field calculations and for the analytic continuation of the electron self-energy. We also give results, expressed in more physical terms, for the two-orbital and three-orbital rotationally invariant Slater-Kanamori interactions, in order to facilitate calculations of transition metal oxides.

preprint2011arXiv

Hubbard model description of silicon spin qubits: charge stability diagram and tunnel coupling in Si double quantum dots

We apply the recently introduced Hubbard model approach to quantitatively describe the experimental charge stability diagram and tunnel coupling of silicon double quantum dot systems. The results calculated from both the generalized Hubbard model and the microscopic theory are compared with existing experimental data, and excellent agreement between theory and experiment is found. The central approximation of our theory is a reduction of the full multi-electron multi-band system to an effective two-electron model, which is numerically tractable. In the microscopic theory we utilize the Hund-Mulliken approximation to the electron wave functions and compare the results calculated with two different forms of confinement potentials (biquadratic and Gaussian). We discuss the implications of our work for future studies.

preprint2011arXiv

Monte Carlo simulations of a diffusive shock with multiple scattering angular distributions

We independently develop a simulation code following the previous dynamical Monte Carlo simulation of the diffusive shock acceleration under the isotropic scattering law during the scattering process, and the same results are obtained. Since the same results test the validity of the dynamical Monte Carlo method for simulating a collisionless shock, we extend the simulation toward including an anisotropic scattering law for further developing this dynamical Monte Carlo simulation. Under this extended anisotropic scattering law, a Gaussian distribution function is used to describe the variation of scattering angles in the particle's local frame. As a result, we obtain a series of different shock structures and evolutions in terms of the standard deviation values of the given Gaussian scattering angular distributions. We find that the total energy spectral index increases as the standard deviation value of the scattering angular distribution increases, but the subshock's energy spectral index decreases as the standard deviation value of the scattering angular distribution increases.

preprint2011arXiv

Morphology of Galaxy Clusters: A Cosmological Model-Independent Test of the Cosmic Distance-Duality Relation

Aiming at comparing different morphological models of galaxy clusters, we use two new methods to make a cosmological model-independent test of the distance-duality (DD) relation. The luminosity distances come from Union2 compilation of Supernovae Type Ia. The angular diameter distances are given by two cluster models (De Filippis et al. and Bonamente et al.). The advantage of our methods is that it can reduce statistical errors. Concerning the morphological hypotheses for cluster models, it is mainly focused on the comparison between elliptical $β$-model and spherical $β$-model. The spherical $β$-model is divided into two groups in terms of different reduction methods of angular diameter distances, i.e. conservative spherical $β$-model and corrected spherical $β$-model. Our results show that the DD relation is consistent with the elliptical $β$-model at $1σ$ confidence level (CL) for both methods, whereas for almost all spherical $β$-model parameterizations, the DD relation can only be accommodated at $3σ$ CL, particularly for the conservative spherical $β$-model. In order to minimize systematic uncertainties, we also apply the test to the overlap sample, i.e. the same set of clusters modeled by both De Filippis et al. and Bonamente et al.. It is found that the DD relation is compatible with the elliptically modeled overlap sample at $1σ$ CL, however for most of the parameterizations, the DD relation can not be accommodated even at $3σ$ CL for any of the two spherical $β$-models. Therefore it is reasonable that the marked triaxial ellipsoidal model is a better geometrical hypothesis describing the structure of the galaxy cluster compared with the spherical $β$-model if the DD relation is valid in cosmological observations.

preprint2011arXiv

Mott insulating phases and magnetism of fermions in a double-well optical lattice

We theoretically investigate, using non-perturbative strong correlation techniques, Mott insulating phases and magnetic ordering of two-component fermions in a two-dimensional double-well optical lattice. At filling of two fermions per site, there are two types of Mott insulators, one of which is characterized by spin-1 antiferromagnetism below the Neel temperature. The super-exchange interaction in this system is induced by the interplay between the inter-band interaction and the spin degree of freedom. A great advantage of the double-well optical lattice is that the magnetic quantum phase diagram and the Neel temperature can be easily controlled by tuning the orbital energy splitting of the two-level system. Particularly, the Neel temperature can be one order of magnitude larger than that in standard optical lattices, facilitating the experimental search for magnetic ordering in optical lattice systems.

preprint2011arXiv

Perturbation Theory of the Cosmological Log-Density Field

The matter density field exhibits a nearly lognormal probability density distribution (PDF) after entering into the nonlinear regime. Recently, it has been shown that the shape of the power spectrum of a logarithmically transformed density field is very close to the linear density power spectrum, motivating an analytic study of it. In this paper, we develop cosmological perturbation theory for the power spectrum of this field. Our formalism is developed in the context of renormalized perturbation theory, which helps to regulate the convergence behavior of the perturbation series, and of the Taylor- series expansion we use of the logarithmic mapping. This approach allows us to handle the critical issue of density smoothing in a straightforward way. We also compare our perturbative results with simulation measurements.

preprint2011arXiv

Quantum theory of the charge stability diagram of semiconductor double quantum dot systems

We complete our recently introduced theoretical framework treating the double quantum dot system with a generalized form of Hubbard model. The effects of all quantum parameters involved in our model on the charge stability diagram are discussed in detail. A general formulation of the microscopic theory is presented, and truncating at one orbital per site, we study the implication of different choices of the model confinement potential on the Hubbard parameters as well as the charge stability diagram. We calculate the charge stability diagram keeping three orbitals per site and find that the effect of additional higher-lying orbitals on the subspace with lowest-energy orbitals only can be regarded as a small renormalization of Hubbard parameters, thereby justifying our practice of keeping only the lowest-orbital in all other calculations. The role of the harmonic oscillator frequency in the implementation of the Gaussian model potential is discussed, and the effect of an external magnetic field is identified to be similar to choosing a more localized electron wave function in microscopic calculations. The full matrix form of the Hamiltonian including all possible exchange terms, and several peculiar charge stability diagrams due to unphysical parameters are presented in the appendix, thus emphasizing the critical importance of a reliable microscopic model in obtaining the system parameters defining the Hamiltonian.

preprint2011arXiv

Role of oxygen-oxygen hopping in the three-band copper-oxide model: quasiparticle weight, metal insulator and magnetic phase boundaries, gap values and optical conductivity

We investigate the effect of oxygen-oxygen hopping on the three-band copper-oxide model relevant to high-$T_c$ cuprates, finding that the physics is changed only slightly as the oxygen-oxygen hopping is varied. The location of the metal-insulator phase boundary in the plane of interaction strength and charge transfer energy shifts by $\sim 0.5$eV or less along the charge transfer axis, the quasiparticle weight has approximately the same magnitude and doping dependence and the qualitative characteristics of the electron-doped and hole-doped sides of the phase diagram do not change. The results confirm the identification of La$_2$CuO$_4$ as a material with intermediate correlation strength. However, the magnetic phase boundary as well as higher-energy features of the optical spectrum are found to depend on the magnitude of the oxygen-oxygen hopping. We compare our results to previously published one-band and three-band model calculations.

preprint2011arXiv

The "S" Curve Relationship between Export Diversity and Economic Size of Countries

The highly detailed international trade data among all countries in the world during 1971-2000 shows that the kinds of export goods and the logarithmic GDP (gross domestic production) of a country has an S-shaped relationship. This indicates all countries can be divided into three stages accordingly. First, the poor countries always export very few kinds of products as we expect. Second, once the economic size (GDP) of a country is beyond a threshold, its export diversity may increase dramatically. However, this is not the case for rich countries because a ceiling on the export diversity is observed when their GDPs are higher than another threshold. This pattern is very stable for different years although the concrete parameters of the fitting sigmoid functions may change with time. In addition, we also discussed other relationships such as import diversity with respect to logarithmic GDP, diversity of exporters with respect to the number of export goods etc., all of these relationships show S-shaped or power law patterns. Although this paper does not explain the origin of the S-shaped curve, it may provide a basic empirical fact and insights for economic diversity.

preprint2011arXiv

The energy analysis for the monte carlo simulations of a diffusive shock

According to the shock jump conditions, the total fluid's mass, momentum, and energy should be conserved in the entire simulation box. We perform the dynamical Monte Carlo simulations with the multiple scattering law for energy analysis. The various energy functions of time are obtained by monitoring the total particles' mass, momentum, and energy in the simulation box. In conclusion, the energy analysis indicates that the smaller energy losses in the prescribed scattering law are, the harder the energy spectrum produced is.

preprint2011arXiv

The energy injection and losses in the Monte Carlo simulations of a diffusive shock

Although diffusive shock acceleration (DSA) could be simulated by some well-established models, the assumption of the injection rate from the thermal particles to the superthermal population is still a contentious problem. But in the self-consistent Monte Carlo simulations, because of the prescribed scattering law instead of the assumption of the injected function, hence particle injection rate is intrinsically defined by the prescribed scattering law. We expect to examine the correlation of the energy injection with the prescribed multiple scattering angular distributions. According to the Rankine-Hugoniot conditions, the energy injection and the losses in the simulation system can directly decide the shock energy spectrum slope. By the simulations performed with multiple scattering law in the dynamical Monte Carlo model, the energy injection and energy loss functions are obtained. As results, the case applying anisotropic scattering law produce a small energy injection and large energy losses leading to a soft shock energy spectrum, the case applying isotropic scattering law produce a large energy injection and small energy losses leading to a hard shock energy spectrum.

preprint2011arXiv

Topology of large scale structure as test of modified gravity

The genus of the iso-density contours is a robust measure of the topology of large scale structure, and it is relatively insensitive to nonlinear gravitational evolution, galaxy bias and redshift-space distortion. We show that the growth of density fluctuations is scale-dependent even in the linear regime in some modified gravity theories, which opens a new possibility of testing the theories observationally. We propose to use the genus of the iso-density contours, an intrinsic measure of the topology of large scale structure, as a statistic to be used in such tests. In Einstein's general theory of relativity, density fluctuations are growing at the same rate on all scales in the linear regime, and the genus per comoving volume is almost conserved as structures are growing homologously, so we expect that the genus-smoothing scale relation is basically time-independent. However, in some modified gravity models where structures grow with different rates on different scales, the genus-smoothing scale relation should change over time. This can be used to test the gravity models with large scale structure observations. We studied the case of the f(R) theory, DGP braneworld theory as well as the parameterized post-Friedmann (PPF) models. We also forecast how the modified gravity models can be constrained with optical/IR or redshifted 21cm radio surveys in the near future.

preprint2011arXiv

Two components of the coronal emission revealed by EUV spectroscopic observations

Recent spectroscopic observations have revealed the ubiquitous presence of blueward asymmetries of emission lines formed in the solar corona and transition region. These asymmetries are most prominent in loop footpoint regions, where a clear correlation of the asymmetry with the Doppler shift and line width determined from the single Gaussian fit is found. Such asymmetries suggest at least two emission components: a primary component accounting for the background emission and a secondary component associated with high-speed upflows. The latter has been proposed to play a vital role in the coronal heating process and there is no agreement on its properties. Here we slightly modify the initially developed technique of Red-Blue (RB) asymmetry analysis and apply it to both artificial spectra and spectra observed by the EUV Imaging Spectrometer onboard Hinode, and demonstrate that the secondary component usually contributes a few percent of the total emission, has a velocity ranging from 50 to 150 km s-1 and a Gaussian width comparable to that of the primary one in loop footpoint regions. The results of the RB asymmetry analysis are then used to guide a double Gaussian fit and we find that the obtained properties of the secondary component are generally consistent with those obtained from the RB asymmetry analysis. Through a comparison of the location, relative intensity, and velocity distribution of the blueward secondary component with the properties of the upward propagating disturbances revealed in simultaneous images from the Atmospheric Imaging Assembly onboard the Solar Dynamics Observatory, we find a clear association of the secondary component with the propagating disturbances.

preprint2010arXiv

Cosmic microwave background with Brans-Dicke Gravity: I. Covariant Formulation

In the covariant cosmological perturbation theory, a 1+3 decomposition ensures that all variables in the frame-independent equations are covariant, gauge-invariant and have clear physical interpretations. We develop this formalism in the case of Brans-Dicke gravity, and apply this method to the calculation of cosmic microwave background (CMB) anisotropy and large scale structures (LSS). We modify the publicly available Boltzmann code CAMB to calculate numerically the evolution of the background and adiabatic perturbations, and obtain the temperature and polarization spectra of the Brans-Dicke theory for both scalar and tensor mode, the tensor mode result for the Brans-Dicke gravity are obtained numerically for the first time. We first present our theoretical formalism in detail, then explicitly describe the techniques used in modifying the CAMB code. These techniques are also very useful to other gravity models. Next we compare the CMB and LSS spectra in Brans-Dicke theory with those in the standard general relativity theory. At last, we investigate the ISW effect and the CMB lensing effect in the Brans-Dicke theory. Constraints on Brans-Dicke model with current observational data is presented in a companion paper (paper II).

preprint2010arXiv

Decentralized Estimation over Orthogonal Multiple-access Fading Channels in Wireless Sensor Networks - Optimal and Suboptimal Estimators

Optimal and suboptimal decentralized estimators in wireless sensor networks (WSNs) over orthogonal multiple-access fading channels are studied in this paper. Considering multiple-bit quantization before digital transmission, we develop maximum likelihood estimators (MLEs) with both known and unknown channel state information (CSI). When training symbols are available, we derive a MLE that is a special case of the MLE with unknown CSI. It implicitly uses the training symbols to estimate the channel coefficients and exploits the estimated CSI in an optimal way. To reduce the computational complexity, we propose suboptimal estimators. These estimators exploit both signal and data level redundant information to improve the estimation performance. The proposed MLEs reduce to traditional fusion based or diversity based estimators when communications or observations are perfect. By introducing a general message function, the proposed estimators can be applied when various analog or digital transmission schemes are used. The simulations show that the estimators using digital communications with multiple-bit quantization outperform the estimator using analog-and-forwarding transmission in fading channels. When considering the total bandwidth and energy constraints, the MLE using multiple-bit quantization is superior to that using binary quantization at medium and high observation signal-to-noise ratio levels.

preprint2010arXiv

Enhanced surface acceleration of fast electrons by using sub-wavelength grating targets

Surface acceleration of fast electrons in intense laser-plasma interaction is improved by using sub-wavelength grating targets. The fast electron beam emitted along the target surface was enhanced by more than three times relative to that by using planar target. The total number of the fast electrons ejected from the front side of target was also increased by about one time. The method to enhance the surface acceleration of fast electron is effective for various targets with sub-wavelength structured surface, and can be applied widely in the cone-guided fast ignition, energetic ion acceleration, plasma device, and other high energy density physics experiments.

preprint2010arXiv

On the Counter-jet Emission in GRB Afterglows

We investigate the dynamical evolution of double-sided jets and present detailed numerical studies on the emission from the receding jet of gamma-ray bursts. It is found that the receding jet emission is generally very weak and only manifests as a plateau in the late time radio afterglow light curves. Additionally, we find that the effect of synchrotron self-absorption can influence the peak time of the receding jet emission significantly.

preprint2010arXiv

Theory of oxygen K-edge x-ray absorption spectra of cuprates

The dynamical mean-field theory of the three-band model of copper-oxide superconductors is used to calculate the doping dependence of the intensity of the oxygen K-edge x-ray absorption spectra of high-$T_c$ copper oxide superconductors. The model is found not to reproduce the results of a recent experiment, suggesting that at sufficiently high doping the physics beyond the conventional three-band model becomes important.

preprint2009arXiv

On the afterglow from the receding jet of gamma-ray burst

According to popular progenitor models of gamma-ray bursts, twin jets should be launched by the central engine, with a forward jet moving toward the observer and a receding jet (or the counter jet) moving backwardly. However, in calculating the afterglows, usually only the emission from the forward jet is considered. Here we present a detailed numerical study on the afterglow from the receding jet. Our calculation is based on a generic dynamical description, and includes some delicate ingredients such as the effect of the equal arrival time surface. It is found that the emission from the receding jet is generally rather weak. In radio bands, it usually peaks at a time of $t \geq 1000$ d, with the peak flux nearly 4 orders of magnitude lower than the peak flux of the forward jet. Also, it usually manifests as a short plateau in the total afterglow light curve, but not as an obvious rebrightening as once expected. In optical bands, the contribution from the receding jet is even weaker, with the peak flux being $\sim 8$ orders of magnitude lower than the peak flux of the forward jet. We thus argue that the emission from the receding jet is very difficult to detect. However, in some special cases, i.e., when the circum-burst medium density is very high, or if the parameters of the receding jet is quite different from those of the forward jet, the emission from the receding jet can be significantly enhanced and may still emerge as a marked rebrightening. We suggest that the search for receding jet emission should mostly concentrate on nearby gamma-ray bursts, and the observation campaign should last for at least several hundred days for each event.

preprint2009arXiv

Primordial Non-Gaussianity from LAMOST Surveys

The primordial non-Gaussianity (PNG) in matter density perturbation is a very powerful probe of the physics of the very early Universe. The local PNG can induce a distinct scale-dependent bias on the large scale structure distribution of galaxies and quasars, which could be used for constraining it. We study the detection limits on PNG from the surveys of the LAMOST telescope. The cases of the main galaxy survey, the luminous red galaxy (LRG) survey, and the quasar survey of different magnitude limits are considered. We find that the MAIN1 sample (i.e. the main galaxy survey with one magnitude deeper than the SDSS main galaxy survey, or r<18.8) could only provide very weak constraint on PNG. For the MAIN2 sample (r<19.8) and the LRG survey, the 2σ(95.5%) limit on the PNG parameter f_{NL} are |f_{NL}|<145 and |f_{NL}|<114 respectively, comparable to the current limit from cosmic microwave background (CMB) data. The quasar survey could provide much more stringent constraint, and we find that the 2σlimit for |f_{NL}| is between 50 and 103, depending on the magnitude limit of the survey. With Planck-like priors on cosmological parameters, the quasar survey with g<21.65 would improve the constraints to |f_{NL}|<43 (2σ). We also discuss the possibility of further tightening the constraint by using the relative bias method proposed by Seljak(2008).

preprint2009arXiv

Quantum criticality and non-Fermi-liquid behavior in a two-level two-lead quantum dot

Analytical and continuous-time quantum Monte Carlo methods are used to investigate the possibility of occupation switching and quantum criticality in a model of two quantum impurities coupled to two leads. A general discussion of potential occupancy-switching related quantum critical points is given, and a detailed analysis is made of a specific model which has been recently discussed. For spinless electrons, no phase transition is found. For electrons with spin, a critical value of the interaction strength separates a weak coupling regime in which all properties vary smoothly with parameters from a strong coupling phase in which occupation numbers vary discontinuously as level energies are changed. The discontinuity point is characterized by non-Fermi-liquid behavior. Results for self-energies and correlation functions are given. Phase diagrams are presented.

Xin Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

319 published item(s)

Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces

Continuous Angular Power Spectrum Recovery From Channel Covariance via Chebyshev Polynomials

Collaborative Watermarking for Adversarial Speech Synthesis

Early Results from GLASS-JWST XXIII: The transmission of Lyman-alpha from UV-faint z ~ 3-6 galaxies

High-Efficiency Resonant Beam Charging and Communication

Lyman Continuum Emission from AGN at 2.3$\lesssim$z$\lesssim$3.7 in the UVCANDELS Fields

Spoofing attack augmentation: can differently-trained attack models improve generalisation?

Learning-based Intelligent Surface Configuration, User Selection, Channel Allocation, and Modulation Adaptation for Jamming-resisting Multiuser OFDMA Systems

Suppression of laser beam's polarization and intensity fluctuation via a Mach-Zehnder interferometer with proper feedback

A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

A Practical Guide to Logical Access Voice Presentation Attack Detection

A Theoretical View on Sparsely Activated Networks

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

Accidental symmetries in the scalar potential of the Standard Model extended with two Higgs triplets

Adaptive Worker Grouping For Communication-Efficient and Straggler-Tolerant Distributed SGD

Adversarial Attack Framework on Graph Embedding Models with Limited Knowledge

An Edge-Cloud Integrated Framework for Flexible and Dynamic Stream Analytics

Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

Chiral Quantum Network with Giant Atoms

CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training

Communication-Efficient Local SGD with Age-Based Worker Selection

Compilable Neural Code Generation with Compiler Feedback

Context-Aware Streaming Perception in Dynamic Environments

Cosmological constraints from the density gradient weighted correlation function

Coupling two charge qubits via a superconducting resonator operating in the resonant and dispersive regimes

Covering Grassmannian Codes: Bounds and Constructions

Decentralized Stochastic Proximal Gradient Descent with Variance Reduction over Time-varying Networks

Deep Learning-based Massive MIMO CSI Acquisition for 5G Evolution and 6G

Dichotomic Pattern Mining with Applications to Intent Prediction from Semi-Structured Clickstream Datasets

Domain Shift-oriented Machine Anomalous Sound Detection Model Based on Self-Supervised Learning

Early results from GLASS-JWST. IX: First spectroscopic confirmation of low-mass quiescent galaxies at $z>2$ with NIRISS

Early results from GLASS-JWST. XI: Stellar masses and mass-to-light ratio of z>7 galaxies

Energy-Efficient UAV-Mounted RIS Assisted Mobile Edge Computing

Enhanced brain structure-function tethering in transmodal cortex revealed by high-frequency eigenmodes

Estimating the confidence of speech spoofing countermeasure

Eyes Tell All: Irregular Pupil Shapes Reveal GAN-generated Faces

Facility Location with Congestion and Priority in Drone-Based Emergency Delivery

Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards

First Census of Gas-phase Metallicity Gradients of Star-forming Galaxies in Overdense Environments at Cosmic Noon

Fluorination Increases Hydrophobicity at the Macroscopic Level but not at the Microscopic Level

Fundamental limitations on optimization in variational quantum algorithms

Hermite-Gaussian-mode coherently composed states and deep learning based free-space optical communication link

Hierarchical Interaction Networks with Rethinking Mechanism for Document-level Sentiment Analysis

Hybrid subconvexity bounds for twists of $\rm GL(3)$ $L$-functions

Hydrodynamic Relaxation in a Strongly Interacting Fermi Gas

Incremental Graph Computation: Anchored Vertex Tracking in Dynamic Social Networks

Investigating self-supervised front ends for speech spoofing countermeasures

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Learning to Solve Travelling Salesman Problem with Hardness-adaptive Curriculum

Lessons learned from the NeurIPS 2021 MetaDL challenge: Backbone fine-tuning without episodic meta-learning dominates for few-shot learning image classification

Lyman Continuum Galaxy Candidates in COSMOS

Mask Wearing Status Estimation with Smartwatches

Medical Matting: A New Perspective on Medical Segmentation with Uncertainty

Microscopic theory on magnetic-field-tuned sweet spot of exchange interactions in multielectron quantum-dot systems

Mitigating barren plateaus of variational quantum eigensolvers

Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing

Muon $(g-2)$ and Flavor Puzzles in the $U(1)^{}_{X}$-gauged Leptoquark Model

NeurIPS'22 Cross-Domain MetaDL competition: Design and baseline results

NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language

Nonadiabatic geometric quantum computation with cat qubits via invariant-based reverse engineering

Nonreciprocal waveguide-QED for spinning cavities with multiple coupling points

One dimensional reduced model for ITER relevant energetic particle transport

Open-Eye: An Open Platform to Study Human Performance on Identifying AI-Synthesized Faces

Out-Of-Distribution Generalization on Graphs: A Survey

Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation

Receiver Design for MIMO Unsourced Random Access with SKP Coding

ReFormer: The Relational Transformer for Image Captioning

Robust Attentive Deep Neural Network for Exposing GAN-generated Faces

Robust Contrastive Learning against Noisy Views

Robust entangling gate for capacitively coupled few-electron singlet-triplet qubits

Scene Recognition with Objectness, Attribute and Category Learning