Source author record

Dong Wang

Dong Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Computer Vision Computation and Language physics.acc-ph math.PR quant-ph math-ph math.MP physics.optics Neural and Evolutionary Computing Artificial Intelligence math.CA Sound eess.AS math.NA Social and Information Networks Software Engineering cond-mat.mtrl-sci cond-mat.soft cond-mat.stat-mech gr-qc nlin.SI Numerical Analysis cond-mat.str-el hep-th Information Retrieval physics.ins-det Populations and Evolution astro-ph.EP Cryptography and Security hep-ex math.OC physics.soc-ph Biological Physics cs.CY eess.SP Information Theory math.IT nucl-ex physics.app-ph astro-ph.CO Computation Computational Engineering, Finance, and Science Computer Science and Game Theory cond-mat.dis-nn cond-mat.mes-hall cond-mat.quant-gas Distributed, Parallel, and Cluster Computing eess.IV Emerging Technologies Hardware Architecture hep-ph math.ST Multimedia Networking and Internet Architecture nucl-th physics.chem-ph physics.data-an physics.flu-dyn physics.med-ph Statistics Theory

Catalog footprint

What is connected

173works

61topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Dr. Zero: Self-Evolving Search Agents without Training Data

As high-quality data becomes increasingly difficult to obtain, data-free self-evolution has emerged as a promising paradigm. This approach allows large language models (LLMs) to autonomously generate and solve complex problems, thereby improving their reasoning capabilities. However, multi-turn search agents struggle in data-free self-evolution due to the limited question diversity and the substantial compute required for multi-step reasoning and tool using. In this work, we introduce Dr. Zero, a framework enabling search agents to effectively self-evolve without any training data. In particular, we design a self-evolution feedback loop where a proposer generates diverse questions to train a solver initialized from the same base model. As the solver evolves, it incentivizes the proposer to produce increasingly difficult yet solvable tasks, thus establishing an automated curriculum to refine both agents. To enhance training efficiency, we also introduce hop-grouped relative policy optimization (HRPO). This method clusters structurally similar questions to construct group-level baselines, effectively minimizing the sampling overhead in evaluating each query's individual difficulty and solvability. Consequently, HRPO significantly reduces the compute requirements for solver training without compromising performance or stability. Extensive experiment results demonstrate that the data-free Dr. Zero matches or surpasses fully supervised search agents, proving that complex reasoning and search capabilities can emerge solely through self-evolution.

preprint2026arXiv

From Failure to Mastery: Generating Hard Samples for Tool-use Agents

The advancement of LLM agents with tool-use capabilities requires diverse and complex training corpora. Existing data generation methods, which predominantly follow a paradigm of random sampling and shallow generation, often yield simple and homogeneous trajectories that fail to capture complex, implicit logical dependencies. To bridge this gap, we introduce HardGen, an automatic agentic pipeline designed to generate hard tool-use training samples with verifiable reasoning. Firstly, HardGen establishes a dynamic API Graph built upon agent failure cases, from which it samples to synthesize hard traces. Secondly, these traces serve as conditional priors to guide the instantiation of modular, abstract advanced tools, which are subsequently leveraged to formulate hard queries. Finally, the advanced tools and hard queries enable the generation of verifiable complex Chain-of-Thought (CoT), with a closed-loop evaluation feedback steering the continuous refinement of the process. Extensive evaluations demonstrate that a 4B parameter model trained with our curated dataset achieves superior performance compared to several leading open-source and closed-source competitors (e.g., GPT-5.2, Gemini-3-Pro and Claude-Opus-4.5). Our code, models, and dataset will be open-sourced to facilitate future research.

preprint2026arXiv

MambaPanoptic: A Vision Mamba-based Structured State Space Framework for Panoptic Segmentation

Panoptic segmentation requires the simultaneous recognition of countable thing instances and amorphous stuff regions, placing joint demands on long-range context modelling, multi-scale feature representation, and efficient dense prediction. Existing convolutional and transformer-based methods struggle to satisfy all three requirements concurrently: convolutional architectures are limited in their capacity to model long-range dependencies, while transformer-based methods incur quadratic computational cost that is prohibitive at high resolutions. In this paper, we propose MambaPanoptic, a fully Mamba-based panoptic segmentation framework that addresses these limitations through two principal contributions. First, we introduce MambaFPN, a top-down feature pyramid that leverages Mamba blocks to generate globally coherent, multi-scale feature representations with linear computational complexity. Second, we adopt a PanopticFCN-style kernel generator that produces unified thing and stuff kernels for proposal-free panoptic prediction, enhanced by a QuadMamba-based feature refinement module applied at multiple network stages. Experiments on the Cityscapes and COCO panoptic segmentation benchmarks demonstrate that MambaPanoptic consistently outperforms PanopticDeepLab and PanopticFCN under comparable model sizes, and matches or surpasses Mask2Former on Cityscapes in PQ and AP while requiring fewer parameters.

preprint2026arXiv

MASH: A Multiplatform and Multimodal Annotated Dataset for Societal Impact of Hurricane

Natural disasters cause multidimensional threats to human societies, with hurricanes exemplifying one of the most disruptive events that not only caused severe physical damage but also sparked widespread discussion on social media platforms. Existing datasets for studying societal impacts of hurricanes often focus on outdated hurricanes and are limited to a single social media platform, failing to capture the broader societal impact in today's diverse social media environment. Moreover, existing datasets annotate visual and textual content of the post separately, failing to account for the multimodal nature of social media posts. To address these gaps, we present a multiplatform and Multimodal Annotated Dataset for Societal Impact of Hurricane (MASH) that includes 59,607 relevant social media data posts from Reddit, TikTok, and YouTube. In addition, all relevant social media data posts are annotated in a multimodal approach that considers both textual and visual content on three dimensions: Humanitarian Classes, Bias Classes, and Information Integrity Classes. To our best knowledge, MASH is the first large-scale, multi-platform, multimodal, and multi-dimensionally annotated dataset centered on hurricane disasters. In addition, we introduce an online platform that supports interactive data exploration, provides preliminary analytical results, and allows users to share their insights regarding the societal impacts of hurricanes. We envision that MASH can contribute to the study of hurricanes' impact on society, such as disaster response, disaster severity classification, public sentiment analysis, disaster policy making, and bias identification. The dataset is publicly available at https://huggingface.co/datasets/YRC10/MASH under the Creative Commons Attribution 4.0 (CC BY 4.0) license.

preprint2026arXiv

MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion

Mesh models have become increasingly accessible for numerous cities; however, the lack of realistic textures restricts their application in virtual urban navigation and autonomous driving. To address this, this paper proposes MeSS (Meshbased Scene Synthesis) for generating high-quality, styleconsistent outdoor scenes with city mesh models serving as the geometric prior. While image and video diffusion models can leverage spatial layouts (such as depth maps or HD maps) as control conditions to generate street-level perspective views, they are not directly applicable to 3D scene generation. Video diffusion models excel at synthesizing consistent view sequences that depict scenes but often struggle to adhere to predefined camera paths or align accurately with rendered control videos. In contrast, image diffusion models, though unable to guarantee cross-view visual consistency, can produce more geometry-aligned results when combined with ControlNet. Building on this insight, our approach enhances image diffusion models by improving cross-view consistency. The pipeline comprises three key stages: first, we generate geometrically consistent sparse views using Cascaded Outpainting ControlNets; second, we propagate denser intermediate views via a component dubbed AGInpaint; and third, we globally eliminate visual inconsistencies (e.g., varying exposure) using the GCAlign module. Concurrently with generation, a 3D Gaussian Splatting (3DGS) scene is reconstructed by initializing Gaussian balls on the mesh surface. Our method outperforms existing approaches in both geometric alignment and generation quality. Once synthesized, the scene can be rendered in diverse styles through relighting and style transfer techniques. project page: https://albertchen98.github.io/mess/

preprint2026arXiv

RELO: Reinforcement Learning to Localize for Visual Object Tracking

Conventional visual object trackers localize targets using handcrafted spatial priors, often in the form of heatmaps. Such priors provide only surrogate supervision and are poorly aligned with tracking optimization and evaluation metrics, such as intersection over union (IoU) and area under the success curve (AUC). Here, we introduce RELO, a REinforcement-learning-to-LOcalize method for visual object tracking that formulates target localization as a Markov decision process. Specifically, RELO replaces handcrafted spatial priors with a localization policy learned over spatial positions via reinforcement learning, with rewards combining frame-level IoU and sequence-level AUC. We additionally introduce layer-aligned temporal token propagation to improve semantic consistency across frames, with negligible computational overhead. Across multiple benchmarks, RELO achieves superior results, attaining 57.5% AUC on LaSOText without template updates. This confirms that reward-driven localization provides an effective alternative to prior-driven localization for visual object tracking.

preprint2026arXiv

V2P: Visual Attention Calibration for GUI Grounding via Background Suppression and Center Peaking

Precise localization of GUI elements is crucial for the development of GUI agents. Traditional methods rely on bounding box or center-point regression, neglecting spatial interaction uncertainty and visual-semantic hierarchies. Recent methods incorporate attention mechanisms but still face two key issues: (1) ignoring processing background regions causes attention drift from the desired area, and (2) uniform modeling the target UI element fails to distinguish between its center and edges, leading to click imprecision. Inspired by how humans visually process and interact with GUI elements, we propose the Valley-to-Peak (V2P) method to address these issues. To mitigate background distractions, V2P introduces a suppression attention mechanism that minimizes the model's focus on irrelevant regions to highlight the intended region. For the issue of center-edge distinction, V2P applies a Fitts' Law-inspired approach by modeling GUI interactions as 2D Gaussian heatmaps where the weight gradually decreases from the center towards the edges. The weight distribution follows a Gaussian function, with the variance determined by the target's size. Consequently, V2P effectively isolates the target area and teaches the model to concentrate on the most essential point of the UI element. The model trained by V2P achieves the performance with 92.4\% and 52.5\% on two benchmarks ScreenSpot-v2 and ScreenSpot-Pro (see Fig.~\ref{fig:main_results_charts}). Ablations further confirm each component's contribution, underscoring V2P's generalizability in precise GUI grounding tasks and its potential for real-world deployment in future GUI agents.

preprint2025arXiv

Movable Antenna Enhanced Multi-Region Beam Coverage: A Multi-Notch-Filter-Inspired Design

Movable antenna (MA) has emerged as a promising technology to enhance wireless communication performance by exploiting the new degree of freedom (DoF) via antenna position optimization. In this letter, we investigate the MA-enhanced wide beam coverage over multiple subregions in the spatial domain. Specifically, we aim to maximize the minimum beam gain over the desired subregions by jointly optimizing the transmit beamforming and antenna position vector (APV). Although this problem is non-convex, we propose an efficient algorithm to solve it by leveraging the similarity between the considered multi-region coverage and classical multi-notch filter (MNF) design. In particular, we construct a spatial MNF-based transmit beamforming vector by assuming a continuous amplitude and phase-shift profile within the antenna movement region. Based on this continuous profile, we propose a sequential update algorithm to select an optimal subset of MA positions for multi-region coverage, jointly with a Gibbs sampling (GS) procedure to avoid undesired local optimum. Numerical results show that our proposed algorithm can significantly outperform conventional fixed position antennas (FPAs) and achieve a comparable performance to the alternating optimization (AO) algorithm with dramatically lower complexity.

preprint2025arXiv

On the Effectiveness of Training Data Optimization for LLM-based Code Generation: An Empirical Study

Large language models (LLMs) have achieved remarkable progress in code generation, largely driven by the availability of high-quality code datasets for effective training. To further improve data quality, numerous training data optimization techniques have been proposed; however, their overall effectiveness has not been systematically evaluated. To bridge this gap, we conduct the first large-scale empirical study, examining five widely-used training data optimization techniques and their pairwise combinations for LLM-based code generation across three benchmarks and four LLMs. Our results show that data synthesis is the most effective technique for improving functional correctness and reducing code smells, although it performs relatively worse on code maintainability compared to data refactoring, cleaning, and selection. Regarding combinations, we find that most combinations do not further improve functional correctness but can effectively enhance code quality (code smells and maintainability). Among all combinations, data synthesis combined with data refactoring achieves the strongest overall performance. Furthermore, our fine-grained analysis reinforces these findings and provides deeper insights into how individual techniques and their combinations influence code generation effectiveness. Overall, this work represents a first step toward a systematic understanding of training data optimization and combination strategies, offering practical guidance for future research and deployment in LLM-based code generation.

preprint2025arXiv

Particle-scale origin of quadrupolar non-affine displacement fields in granular solids

In this work, we identify the local structural defects that control the non-affine displacement fields in jammed disk packings subjected to athermal, quasistatic (AQS) simple shear. While complex non-affine displacement fields typically occur during simple shear, isolated effective quadrupoles are also observed and their probability increases with increasing pressure. We show that the emergence of an isolated effective quadrupole requires the breaking of an interparticle contact that is aligned with low-frequency, spatially extended vibrational modes. Since the Eshelby inhomogeneity problem gives rise to quadrupolar displacement fields in continuum materials, we reformulate and implement Eshelby's equivalent inclusion method (EIM) for jammed disk packings. Using EIM, we show that we can reconstruct the non-affine displacement fields for jammed disk packings in response to applied shear as a sum of discrete Eshelby-like defects that are caused by mismatches in the local stiffnesses of triangles formed from Delaunay triangulation of the disk centers.

preprint2024arXiv

A prediction-correction based iterative convolution-thresholding method for topology optimization of heat transfer problems

In this paper, we propose an iterative convolution-thresholding method (ICTM) based on prediction-correction for solving the topology optimization problem in steady-state heat transfer equations. The problem is formulated as a constrained minimization problem of the complementary energy, incorporating a perimeter/surface-area regularization term, while satisfying a steady-state heat transfer equation. The decision variables of the optimization problem represent the domains of different materials and are represented by indicator functions. The perimeter/surface-area term of the domain is approximated using Gaussian kernel convolution with indicator functions. In each iteration, the indicator function is updated using a prediction-correction approach. The prediction step is based on the variation of the objective functional by imposing the constraints, while the correction step ensures the monotonically decreasing behavior of the objective functional. Numerical results demonstrate the efficiency and robustness of our proposed method, particularly when compared to classical approaches based on the ICTM.

preprint2024arXiv

Reconfigurable Three-Dimensional Thermal Dome

Thermal metamaterial represents a groundbreaking approach to control heat conduction, and, as a crucial component, thermal invisibility is of utmost importance for heat management. Despite the flourishing development of thermal invisibility schemes, they still face two limitations in practical applications. First, objects are typically completely enclosed in traditional cloaks, making them difficult to use and unsuitable for objects with heat sources. Second, although some theoretical proposals have been put forth to change the thermal conductivity of materials to achieve dynamic invisibility, their designs are complex and rigid, making them unsuitable for large-scale use in real three-dimensional spaces. Here, we propose a concept of a thermal dome to achieve three-dimensional invisibility. Our scheme includes an open functional area, greatly enhancing its usability and applicability. It features a reconfigurable structure, constructed with simple isotropic natural materials, making it suitable for dynamic requirements. The performance of our reconfigurable thermal dome has been confirmed through simulations and experiments, consistent with the theory. The introduction of this concept can greatly advance the development of thermal invisibility technology from theory to engineering and provide inspiration for other physical domains, such as direct current electric fields and magnetic fields.

preprint2023arXiv

RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

Recommender systems aim to recommend the most suitable items to users from a large number of candidates. Their computation cost grows as the number of user requests and the complexity of services (or models) increases. Under the limitation of computation resources (CRs), how to make a trade-off between computation cost and business revenue becomes an essential question. The existing studies focus on dynamically allocating CRs in queue truncation scenarios (i.e., allocating the size of candidates), and formulate the CR allocation problem as an optimization problem with constraints. Some of them focus on single-phase CR allocation, and others focus on multi-phase CR allocation but introduce some assumptions about queue truncation scenarios. However, these assumptions do not hold in other scenarios, such as retrieval channel selection and prediction model selection. Moreover, existing studies ignore the state transition process of requests between different phases, limiting the effectiveness of their approaches. This paper proposes a Reinforcement Learning (RL) based Multi-Phase Computation Allocation approach (RL-MPCA), which aims to maximize the total business revenue under the limitation of CRs. RL-MPCA formulates the CR allocation problem as a Weakly Coupled MDP problem and solves it with an RL-based approach. Specifically, RL-MPCA designs a novel deep Q-network to adapt to various CR allocation scenarios, and calibrates the Q-value by introducing multiple adaptive Lagrange multipliers (adaptive-$λ$) to avoid violating the global CR constraints. Finally, experiments on the offline simulation environment and online real-world recommender system validate the effectiveness of our approach.

preprint2023arXiv

The Security Analysis of Continuous-Variable Quantum Key Distribution under Limited Eavesdropping with Practical Fiber

Research on optimal eavesdropping models under practical conditions will help to evaluate realistic risk when employing quantum key distribution (QKD) system for secure information transmission. Intuitively, fiber loss will lead to the optical energy leaking to the environment, rather than harvested by the eavesdropper, which also limits the eavesdropping ability while improving the QKD system performance in practical use. However, defining the optimal eavesdropping model in the presence of lossy fiber is difficult because the channel is beyond the control of legitimate partners and the leaked signal is undetectable. Here we investigate how the fiber loss influences the eavesdropping ability based on a teleportation-based collective attack model which requires two distant stations and a shared entanglement source. We find that if the distributed entanglement is limited due to the practical loss, the optimal attack occurs when the two teleportation stations are merged to one and placed close to the transmitter site, which performs similar to the entangling-cloning attack but with a reduced wiretapping ratio. Assuming Eve uses the best available hollow-core fiber, the secret key rate in the practical environment can be 20%~40% higher than that under ideal eavesdropping. While if the entanglement distillation technology is mature enough to provide high quality of distributed entanglement, the two teleportation stations should be distantly separated for better eavesdropping performance, where the eavesdropping can even approach the optimal collective attack. Under the current level of entanglement purification technology, the unavoidable fiber loss can still greatly limit the eavesdropping ability as well as enhance the secret key rate and transmission distance of the realistic system, which promotes the development of QKD systems in practical application scenarios.

preprint2022arXiv

A two-stage method for reconstruction of parameters in diffusion equations

Parameter reconstruction for diffusion equations has a wide range of applications. In this paper, we proposed a two-stage scheme to efficiently solve conductivity reconstruction problems for steady-state diffusion equations with solution data measured inside the domain. The first stage is based on total variation regularization of the log diffusivity and the split Bregman iteration method. In the second stage, we apply the K-means clustering for the reconstruction of ``blocky'' conductivity functions. The convergence of the scheme is theoretically proved and extensive numerical examples are shown to demonstrate the performance of the scheme.

preprint2022arXiv

An efficient unconditionally stable method for Dirichlet partitions in arbitrary domains

A Dirichlet $k$-partition of a domain is a collection of $k$ pairwise disjoint open subsets such that the sum of their first Laplace--Dirichlet eigenvalues is minimal. In this paper, we propose a new relaxation of the problem by introducing auxiliary indicator functions of domains and develop a simple and efficient diffusion generated method to compute Dirichlet $k$-partitions for arbitrary domains. The method only alternates three steps: 1. convolution, 2. thresholding, and 3. projection. The method is simple, easy to implement, insensitive to initial guesses and can be effectively applied to arbitrary domains without any special discretization. At each iteration, the computational complexity is linear in the discretization of the computational domain. Moreover, we theoretically prove the energy decaying property of the method. Experiments are performed to show the accuracy of approximation, efficiency and unconditional stability of the algorithm. We apply the proposed algorithms on both 2- and 3-dimensional flat tori, triangle, square, pentagon, hexagon, disk, three-fold star, five-fold star, cube, ball, and tetrahedron domains to compute Dirichlet $k$-partitions for different $k$ to show the effectiveness of the proposed method. Compared to previous work with reported computational time, the proposed method achieves hundreds of times acceleration.

preprint2022arXiv

An Exploration of npm Package Co-Usage Examples from Stack Overflow: A Case Study

Third-party package usage has become a common practice in contemporary software development. Developers often face different challenges, including choosing the right libraries, installing errors, discrepancies, setting up the environment, and building failures during software development. The risks of maintaining a third-party package are well known, but it is unclear how information from Stack Overflow (SO) can be useful. This paper performed an empirical study to explore npm co-usage in SO. From over 30,000 SO posts, we extracted 2,100 SO posts related to npm and matched them to 217,934 npm library packages. We find that, popular and highly used libraries are not discussed as often in SO. However, we can see that the accepted answers may prove useful, as we believe that the usage examples and executable commands could be reused for tool support.

preprint2022arXiv

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Multimodal learning helps to comprehensively understand the world, by integrating different senses. Accordingly, multiple input modalities are expected to boost model performance, but we actually find that they are not fully exploited even when the multimodal model outperforms its uni-modal counterpart. Specifically, in this paper we point out that existing multimodal discriminative models, in which uniform objective is designed for all modalities, could remain under-optimized uni-modal representations, caused by another dominated modality in some scenarios, e.g., sound in blowing wind event, vision in drawing picture event, etc. To alleviate this optimization imbalance, we propose on-the-fly gradient modulation to adaptively control the optimization of each modality, via monitoring the discrepancy of their contribution towards the learning objective. Further, an extra Gaussian noise that changes dynamically is introduced to avoid possible generalization drop caused by gradient modulation. As a result, we achieve considerable improvement over common fusion methods on different multimodal tasks, and this simple strategy can also boost existing multimodal methods, which illustrates its efficacy and versatility. The source code is available at \url{https://github.com/GeWu-Lab/OGM-GE_CVPR2022}.

preprint2022arXiv

C-P Map: A Novel Evaluation Toolkit for Speaker Verification

Evaluation trials are used to probe performance of automatic speaker verification (ASV) systems. In spite of the clear importance and impact, evaluation trials have not been seriously treated in research and engineering practice. This paper firstly presents a theoretical analysis on evaluation trials and highlights potential bias with the most popular cross-pairing approach used in trials design. To interpret and settle this problem, we define the concept of trial config and C-P map derived from it. The C-P map measures the performance of an ASV system on various trial configs in a 2-dimensional map. On the map, each location represents a particular trial config and its corresponding color represents the system performance. Experiments conducted on representative ASV systems show that the proposed C-P map offers a powerful evaluation toolkit for ASV performance analysis and comparison. The source code for C-P map has been release at https://gitlab.com/csltstu/sunine.

preprint2022arXiv

Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection

Detecting mass in mammogram is significant due to the high occurrence and mortality of breast cancer. In mammogram mass detection, modeling pairwise lesion correspondence explicitly is particularly important. However, most of the existing methods build relatively coarse correspondence and have not utilized correspondence supervision. In this paper, we propose a new transformer-based framework CL-Net to learn lesion detection and pairwise correspondence in an end-to-end manner. In CL-Net, View-Interactive Lesion Detector is proposed to achieve dynamic interaction across candidates of cross views, while Lesion Linker employs the correspondence supervision to guide the interaction process more accurately. The combination of these two designs accomplishes precise understanding of pairwise lesion correspondence for mammograms. Experiments show that CL-Net yields state-of-the-art performance on the public DDSM dataset and our in-house dataset. Moreover, it outperforms previous methods by a large margin in low FPI regime.

preprint2022arXiv

Cross DQN: Cross Deep Q Network for Ads Allocation in Feed

E-commerce platforms usually display a mixed list of ads and organic items in feed. One key problem is to allocate the limited slots in the feed to maximize the overall revenue as well as improve user experience, which requires a good model for user preference. Instead of modeling the influence of individual items on user behaviors, the arrangement signal models the influence of the arrangement of items and may lead to a better allocation strategy. However, most of previous strategies fail to model such a signal and therefore result in suboptimal performance. In addition, the percentage of ads exposed (PAE) is an important indicator in ads allocation. Excessive PAE hurts user experience while too low PAE reduces platform revenue. Therefore, how to constrain the PAE within a certain range while keeping personalized recommendation under the PAE constraint is a challenge. In this paper, we propose Cross Deep Q Network (Cross DQN) to extract the crucial arrangement signal by crossing the embeddings of different items and modeling the crossed sequence by multi-channel attention. Besides, we propose an auxiliary loss for batch-level constraint on PAE to tackle the above-mentioned challenge. Our model results in higher revenue and better user experience than state-of-the-art baselines in offline experiments. Moreover, our model demonstrates a significant improvement in the online A/B test and has been fully deployed on Meituan feed to serve more than 300 millions of customers.

preprint2022arXiv

Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation

A mixed list of ads and organic items is usually displayed in feed and how to allocate the limited slots to maximize the overall revenue is a key problem. Meanwhile, modeling user preference with historical behavior is essential in recommendation and advertising (e.g., CTR prediction and ads allocation). Most previous works for user behavior modeling only model user's historical point-level positive feedback (i.e., click), which neglect the page-level information of feedback and other types of feedback. To this end, we propose Deep Page-level Interest Network (DPIN) to model the page-level user preference and exploit multiple types of feedback. Specifically, we introduce four different types of page-level feedback as input, and capture user preference for item arrangement under different receptive fields through the multi-channel interaction module. Through extensive offline and online experiments on Meituan food delivery platform, we demonstrate that DPIN can effectively model the page-level user preference and increase the revenue for the platform.

preprint2022arXiv

Efficient Localness Transformer for Smart Sensor-Based Energy Disaggregation

Modern smart sensor-based energy management systems leverage non-intrusive load monitoring (NILM) to predict and optimize appliance load distribution in real-time. NILM, or energy disaggregation, refers to the decomposition of electricity usage conditioned on the aggregated power signals (i.e., smart sensor on the main channel). Based on real-time appliance power prediction using sensory technology, energy disaggregation has great potential to increase electricity efficiency and reduce energy expenditure. With the introduction of transformer models, NILM has achieved significant improvements in predicting device power readings. Nevertheless, transformers are less efficient due to O(l^2) complexity w.r.t. sequence length l. Moreover, transformers can fail to capture local signal patterns in sequence-to-point settings due to the lack of inductive bias in local context. In this work, we propose an efficient localness transformer for non-intrusive load monitoring (ELTransformer). Specifically, we leverage normalization functions and switch the order of matrix multiplication to approximate self-attention and reduce computational complexity. Additionally, we introduce localness modeling with sparse local attention heads and relative position encodings to enhance the model capacity in extracting short-term local patterns. To the best of our knowledge, ELTransformer is the first NILM model that addresses computational complexity and localness modeling in NILM. With extensive experiments and quantitative analyses, we demonstrate the efficiency and effectiveness of the the proposed ELTransformer with considerable improvements compared to state-of-the-art baselines.

preprint2022arXiv

Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion

Recent research showed that an autoencoder trained with speech of a single speaker, called exemplar autoencoder (eAE), can be used for any-to-one voice conversion (VC). Compared to large-scale many-to-many models such as AutoVC, the eAE model is easy and fast in training, and may recover more details of the target speaker. To ensure VC quality, the latent code should represent and only represent content information. However, this is not easy to attain for eAE as it is unaware of any speaker variation in model training. To tackle the problem, we propose a simple yet effective approach based on a cycle consistency loss. Specifically, we train eAEs of multiple speakers with a shared encoder, and meanwhile encourage the speech reconstructed from any speaker-specific decoder to get a consistent latent code as the original speech when cycled back and encoded again. Experiments conducted on the AISHELL-3 corpus showed that this new approach improved the baseline eAE consistently. The source code and examples are available at the project page: http://project.cslt.org/.

preprint2022arXiv

Evolving Programmable Computational Metamaterials

Granular metamaterials are a promising choice for the realization of mechanical computing devices. As preliminary evidence of this, we demonstrate here how to embed Boolean logic gates (AND and XOR) into a granular metamaterial by evolving where particular grains are placed in the material. Our results confirm the existence of gradients of increasing "AND-ness" and "XOR-ness" within the space of possible materials that can be followed by evolutionary search. We measure the computational functionality of a material by probing how it transforms bits encoded as vibrations with zero or non-zero amplitude. We compared the evolution of materials built from mass-contrasting particles and materials built from stiffness-contrasting particles, and found that the latter were more evolvable. We believe this work may pave the way toward evolutionary design of increasingly sophisticated, programmable, and computationally dense metamaterials with certain advantages over more traditional computational substrates.

preprint2022arXiv

Generation of S-shaped photonic hooks from microcylinders with engineered surface patches

Photonic hooks (PHs) are non-evanescent light beams with a highly concentrated curved optical fields. Since their discovery, PHs always have one single inflection point and thus have a hook-like structure. In this work, a new type of PHs with two inflection points and S-shaped structures (S-PHs) were reported for the first time. We theoretically studied the effects of various physical parameters on the generation of S-PHs. Furthermore, we showed that decorating particles with multiple patches can significantly enhance the curvature and length of the S-PHs. The S-PHs may have potential applications in super-resolution imaging, sub-wavelength micromachining, particle and cell manipulation, etc.

preprint2022arXiv

Giving Back: Contributions Congruent to Library Dependency Changes in a Software Ecosystem

Popular adoption of third-party libraries for contemporary software development has led to the creation of large inter-dependency networks, where sustainability issues of a single library can have widespread network effects. Maintainers of these libraries are often overworked, relying on the contributions of volunteers to sustain these libraries. In this work, we measure contributions that are aligned with dependency changes, to understand where they come from (i.e., non-maintainer, client maintainer, library maintainer, and library and client maintainer), analyze whether they contribute to library dormancy (i.e., a lack of activity), and investigate the similarities between these contributions and developers' typical contributions. Hence, we leverage socio-technical techniques to measure the dependency-contribution congruence (DC congruence), i.e., the degree to which contributions align with dependencies. We conduct a large-scale empirical study to measure the DC congruence for the NPM ecosystem using 1.7 million issues, 970 thousand pull requests (PR), and over 5.3 million commits belonging to 107,242 NPM packages. At the ecosystem level, we pinpoint in time peaks of congruence with dependency changes (i.e., 16% DC congruence score). Surprisingly, these contributions came from the ecosystem itself (i.e., non-maintainers of either client and library). At the project level, we find that DC congruence shares a statistically significant relationship with the likelihood of a package becoming dormant. Finally, by comparing source code of contributions, we find that congruent contributions are statistically different to typical contributions. Our work has implications to encourage and sustain contributions, especially to support library maintainers that require dependency changes.

preprint2022arXiv

Gradient Importance Learning for Incomplete Observations

Though recent works have developed methods that can generate estimates (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks such as classification. This is particularly true if the data have large missingness rates or a small sample size. More importantly, the imputation error could be propagated into the prediction step that follows, which may constrain the capabilities of the prediction model. In this work, we introduce the gradient importance learning (GIL) method to train multilayer perceptrons (MLPs) and long short-term memories (LSTMs) to directly perform inference from inputs containing missing values without imputation. Specifically, we employ reinforcement learning (RL) to adjust the gradients used to train these models via back-propagation. This allows the model to exploit the underlying information behind missingness patterns. We test the approach on real-world time-series (i.e., MIMIC-III), tabular data obtained from an eye clinic, and a standard dataset (i.e., MNIST), where our imputation-free predictions outperform the traditional two-step imputation-based predictions using state-of-the-art imputation methods.

preprint2022arXiv

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation

Ads allocation, which involves allocating ads and organic items to limited slots in feed with the purpose of maximizing platform revenue, has become a research hotspot. Notice that, e-commerce platforms usually have multiple entrances for different categories and some entrances have few visits. Data from these entrances has low coverage, which makes it difficult for the agent to learn. To address this challenge, we propose Similarity-based Hybrid Transfer for Ads Allocation (SHTAA), which effectively transfers samples as well as knowledge from data-rich entrance to data-poor entrance. Specifically, we define an uncertainty-aware similarity for MDP to estimate the similarity of MDP for different entrances. Based on this similarity, we design a hybrid transfer method, including instance transfer and strategy transfer, to efficiently transfer samples and knowledge from one entrance to another. Both offline and online experiments on Meituan food delivery platform demonstrate that the proposed method could achieve better performance for data-poor entrance and increase the revenue for the platform.

preprint2022arXiv

Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks

With the recent prevalence of reinforcement learning (RL), there have been tremendous interests in utilizing RL for ads allocation in recommendation platforms (e.g., e-commerce and news feed sites). To achieve better allocation, the input of recent RL-based ads allocation methods is upgraded from point-wise single item to list-wise item arrangement. However, this also results in a high-dimensional space of state-action pairs, making it difficult to learn list-wise representations with good generalization ability. This further hinders the exploration of RL agents and causes poor sample efficiency. To address this problem, we propose a novel RL-based approach for ads allocation which learns better list-wise representations by leveraging task-specific signals on Meituan food delivery platform. Specifically, we propose three different auxiliary tasks based on reconstruction, prediction, and contrastive learning respectively according to prior domain knowledge on ads allocation. We conduct extensive experiments on Meituan food delivery platform to evaluate the effectiveness of the proposed auxiliary tasks. Both offline and online experimental results show that the proposed method can learn better list-wise representations and achieve higher revenue for the platform compared to the state-of-the-art baselines.

preprint2022arXiv

Neural Topic Modeling with Deep Mutual Information Estimation

The emerging neural topic models make topic modeling more easily adaptable and extendable in unsupervised text mining. However, the existing neural topic models is difficult to retain representative information of the documents within the learnt topic representation. In this paper, we propose a neural topic model which incorporates deep mutual information estimation, i.e., Neural Topic Modeling with Deep Mutual Information Estimation(NTM-DMIE). NTM-DMIE is a neural network method for topic learning which maximizes the mutual information between the input documents and their latent topic representation. To learn robust topic representation, we incorporate the discriminator to discriminate negative examples and positive examples via adversarial learning. Moreover, we use both global and local mutual information to preserve the rich information of the input documents in the topic representation. We evaluate NTM-DMIE on several metrics, including accuracy of text clustering, with topic representation, topic uniqueness and topic coherence. Compared to the existing methods, the experimental results show that NTM-DMIE can outperform in all the metrics on the four datasets.

preprint2022arXiv

Newcomer OSS-Candidates: Characterizing Contributions of Novice Developers to GitHub

The ability of an Open Source Software (OSS) project to attract, onboard, and retain any newcomer is vital to its livelihood. Although, evidence suggests an upsurge in novice developers joining social coding platforms (such as GitHub), the extent to which their activities result in a OSS contribution is unknown. Henceforth, we execute the protocols of a registered report to study activities of a "Newcomer OSS-Candidate", who is a novice developer that is new to that social coding platform, and has the intention to later onboard an OSS project. Using GitHub as a case platform, we analyze 171 identified Newcomer OSS-Candidates to characterize their contribution activities. Results show that Newcomer OSS-Candidates are likely to target software based repositories (i.e., 66%), and their first contributions are mainly associated with development (commits) and maintenance (PRs). Newcomer OSS-Candidates are less likely to practice social coding, but eventually end up onboarding (i.e., 30% quantitative, 70% follow-up survey) an OSS project. Furthermore, they cite finding a way to start as the most challenging barrier to contribute. Our work reveals insights on how newcomers to social coding platforms are potential sources of OSS contributions.

preprint2022arXiv

Novel Valence Transition in Elemental Metal Europium around 80 GPa

Valence transition could induce structural, insulator-metal, nonmagnetic-magnetic and superconducting transitions in rare-earth metals and compounds, while the underlying physics remains unclear due to the complex interaction of localized 4f electrons as well as their coupling with itinerant electrons. The valence transition in the elemental metal europium (Eu) still has remained as a matter of debate. Using resonant x-ray emission scattering and x-ray diffraction, we pressurize the states of 4f electrons in Eu and study its valence and structure transitions up to 160 GPa. We provide compelling evidence for a valence transition around 80 GPa, which coincides with a structural transition from a monoclinic (C2/c) to an orthorhombic phase (Pnma). We show that the valence transition occurs when the pressure-dependent energy gap between 4f and 5d electrons approaches the Coulomb interaction. Our discovery is critical for understanding the electrodynamics of Eu, including magnetism and high-pressure superconductivity.

preprint2022arXiv

Overcoming Van der Waals Forces in reconfigurable nanostructures

Reconfigurable metamaterials require constituent nanostructures to demonstrate switching of shapes with external stimuli. For generality, such nanostructures would touch and stick to other surfaces in one of its configurations. Yet, a longstanding challenge is in overcoming this stiction caused by Van der Waals forces, which impedes shape recovery. Here, we introduce a stiff yet self-recovering material system based on acrylic acid, and tested it in high-aspect ratio structures, where recovery is weak. This designer material has a storage modulus of ~5.2 GPa at room temperature and ~90 MPa in the rubbery state at 150 Celsius, an order of magnitude higher than previous reports. A high-resolution resin for two-photon lithography was developed based on this polymer system, enabling 3D printing of nanopillars with diameters of ~400 nm and aspect ratio as high as ~10. Experimentally, we observed self-recovery as collapsed and touching structures overcome stiction to stand back up. We developed a theoretical model to explain the recoverability of these sub-micron structures. Reconfigurable structural colour prints and holograms were demonstrated, indicating potential applications of the material system as a shape memory polymer suitable for sub-micron reconfigurable metamaterials.

preprint2022arXiv

Pay Attention to Hard Trials

Performance of speaker recognition systems is evaluated on test trials. Although as crucial as rulers for tailors, trials have not been carefully treated so far, and most existing benchmarks compose trials by naive cross-pairing. In this paper, we argue that the cross-pairing approach produces overwhelming easy trials, which in turn leads to potential bias in system and technique comparison. To solve the problem, we advocate more attention to hard trials. We present an SVM-based approach to identifying hard trials and use it to construct new evaluation sets for VoxCeleb1 and SITW. With the new sets, we can re-evaluate the contribution of some recent technologies. The code and the identified hard trials will be published online at http://project.cslt.org.

preprint2022arXiv

PointScatter: Point Set Representation for Tubular Structure Extraction

This paper explores the point set representation for tubular structure extraction tasks. Compared with the traditional mask representation, the point set representation enjoys its flexibility and representation ability, which would not be restricted by the fixed grid as the mask. Inspired by this, we propose PointScatter, an alternative to the segmentation models for the tubular structure extraction task. PointScatter splits the image into scatter regions and parallelly predicts points for each scatter region. We further propose the greedy-based region-wise bipartite matching algorithm to train the network end-to-end and efficiently. We benchmark the PointScatter on four public tubular datasets, and the extensive experiments on tubular structure segmentation and centerline extraction task demonstrate the effectiveness of our approach. Code is available at https://github.com/zhangzhao2022/pointscatter.

preprint2022arXiv

Probabilistic methods for approximate archetypal analysis

Archetypal analysis is an unsupervised learning method for exploratory data analysis. One major challenge that limits the applicability of archetypal analysis in practice is the inherent computational complexity of the existing algorithms. In this paper, we provide a novel approximation approach to partially address this issue. Utilizing probabilistic ideas from high-dimensional geometry, we introduce two preprocessing techniques to reduce the dimension and representation cardinality of the data, respectively. We prove that provided the data is approximately embedded in a low-dimensional linear subspace and the convex hull of the corresponding representations is well approximated by a polytope with a few vertices, our method can effectively reduce the scaling of archetypal analysis. Moreover, the solution of the reduced problem is near-optimal in terms of prediction errors. Our approach can be combined with other acceleration techniques to further mitigate the intrinsic complexity of archetypal analysis. We demonstrate the usefulness of our results by applying our method to summarize several moderately large-scale datasets.

preprint2022arXiv

Reliable Visualization for Deep Speaker Recognition

In spite of the impressive success of convolutional neural networks (CNNs) in speaker recognition, our understanding to CNNs' internal functions is still limited. A major obstacle is that some popular visualization tools are difficult to apply, for example those producing saliency maps. The reason is that speaker information does not show clear spatial patterns in the temporal-frequency space, which makes it hard to interpret the visualization results, and hence hard to confirm the reliability of a visualization tool. In this paper, we conduct an extensive analysis on three popular visualization methods based on CAM: Grad-CAM, Score-CAM and Layer-CAM, to investigate their reliability for speaker recognition tasks. Experiments conducted on a state-of-the-art ResNet34SE model show that the Layer-CAM algorithm can produce reliable visualization, and thus can be used as a promising tool to explain CNN-based speaker models. The source code and examples are available in our project page: http://project.cslt.org/.

preprint2022arXiv

Rényi State Entropy for Exploration Acceleration in Reinforcement Learning

One of the most critical challenges in deep reinforcement learning is to maintain the long-term exploration capability of the agent. To tackle this problem, it has been recently proposed to provide intrinsic rewards for the agent to encourage exploration. However, most existing intrinsic reward-based methods proposed in the literature fail to provide sustainable exploration incentives, a problem known as vanishing rewards. In addition, these conventional methods incur complex models and additional memory in their learning procedures, resulting in high computational complexity and low robustness. In this work, a novel intrinsic reward module based on the Rényi entropy is proposed to provide high-quality intrinsic rewards. It is shown that the proposed method actually generalizes the existing state entropy maximization methods. In particular, a $k$-nearest neighbor estimator is introduced for entropy estimation while a $k$-value search method is designed to guarantee the estimation accuracy. Extensive simulation results demonstrate that the proposed Rényi entropy-based method can achieve higher performance as compared to existing schemes.

preprint2022arXiv

Some Examples of Privacy-preserving Publication and Sharing of COVID-19 Pandemic Data

A considerable amount of various types of data have been collected during the COVID-19 pandemic, the analysis and interpretation of which have been indispensable for curbing the spread of the disease. As the pandemic moves to an endemic state, the data collected during the pandemic will continue to be rich sources for further studying and understanding the impacts of the pandemic on various aspects of our society. On the other hand, naïve release and sharing of the information can be associated with serious privacy concerns. In this study, we use three common but distinct data types collected during the pandemic (case surveillance tabular data, case location data, and contact tracing networks) to illustrate the publication and sharing of granular information and individual-level pandemic data in a privacy-preserving manner. We leverage and build upon the concept of differential privacy to generate and release privacy-preserving data for each data type. We investigate the inferential utility of privacy-preserving information through simulation studies at different levels of privacy guarantees and demonstrate the approaches in real-life data. All the approaches employed in the study are straightforward to apply. Our study generates statistical evidence on the practical feasibility of sharing pandemic data with privacy guarantees and on how to balance the statistical utility of released information during this process.

preprint2022arXiv

Topmetal-M: a novel pixel sensor for compact tracking applications

The Topmetal-M is a large area pixel sensor (18 mm * 23 mm) prototype fabricated in a new 130 nm high-resistivity CMOS process in 2019. It contains 400 rows * 512 columns square pixels with the pitch of 40 μm. In Topmetal-M, a novel charge collection method combing the Monolithic Active Pixel Sensor (MAPS) and the Topmetal sensor has been proposed for the first time. Both the ionized charge deposited by the particle in the sensor and along the track over the sensor can be collected. The in-pixel circuit mainly consists of a low-noise charge sensitive amplifier to establish the signal for the energy reconstruction, and a discriminator with a Time-to-Amplitude Converter (TAC) for the Time of Arrival (TOA) measurement. With this mechanism, the trajectory, particle hit position, energy and arrival time of the particle can be measured. The analog signal from each pixel is accessible through time-shared multiplexing over the entire pixel array. This paper will discuss the design and preliminary test results of the Topmetal-M sensor.

preprint2022arXiv

Towards Grand Unification of Object Tracking

We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters. Due to the fragmented definitions of the object tracking problem itself, most existing trackers are developed to address a single or part of tasks and overspecialize on the characteristics of specific tasks. By contrast, Unicorn provides a unified solution, adopting the same input, backbone, embedding, and head across all tracking tasks. For the first time, we accomplish the great unification of the tracking network architecture and learning paradigm. Unicorn performs on-par or better than its task-specific counterparts in 8 tracking datasets, including LaSOT, TrackingNet, MOT17, BDD100K, DAVIS16-17, MOTS20, and BDD100K MOTS. We believe that Unicorn will serve as a solid step towards the general vision model. Code is available at https://github.com/MasterBin-IIAU/Unicorn.

preprint2022arXiv

Ultra-stable shear jammed granular material

Dry granular materials such as sand, gravel, pills, or agricultural grains, can become rigid when compressed or sheared. At low density, one can distort the shape of a container of granular material without encountering any resistance. Under isotropic compression, the material will reach a certain {\it jamming} density and then resist further compression. {\em Shear jamming} occurs when resistance to shear emerges in a system at a density lower than the jamming density, and the elastic properties of such states have important implications for industrial and geophysical processes. We report on experimental observations of changes in the mechanical properties of a shear-jammed granular material subjected to small-amplitude, quasi-static cyclic shear. We study a layer of plastic discs confined to a shear cell, using photoelasticimetry to measure all inter-particle vector forces. For sufficiently small cyclic shear amplitudes and large enough initial shear, the material evolves to an unexpected "ultra-stable" state in which all the particle positions and inter-particle contact forces remain unchanged after each complete shear cycle for thousands of cycles. The stress response of these states to small imposed shear is nearly elastic, in contrast to the original shear jammed state.

preprint2022arXiv

Universality for random matrices with equi-spaced external source: a case study of a biorthogonal ensemble

We prove the edge and bulk universality of random Hermitian matrices with equi-spaced external source. One feature of our method is that we use neither a Christoffel-Darboux type formula, nor a double-contour formula, which are standard methods to prove universality results for exactly solvable models. This matrix model is an example of a biorthogonal ensemble, which is a special kind of determinantal point process whose kernel generally does not have a Christoffel-Darboux type formula or double-contour representation. Our methods may showcase how to handle universality problems for biorthogonal ensembles in general.

preprint2022arXiv

Variational methods and deep Ritz method for active elastic solids

Variational methods have been widely used in soft matter physics for both static and dynamic problems. These methods are mostly based on two variational principles: the variational principle of minimum free energy (MFEVP) and Onsager's variational principle (OVP). Our interests lie in the applications of these variational methods to active matter physics. In our former work [Soft Matter, 2021, 17, 3634], we have explored the applications of OVP-based variational methods for the modeling of active matter dynamics. In the present work, we explore variational (or energy) methods that are based on MFEVP for static problems in active elastic solids. We show that MFEVP can be used not only to derive equilibrium equations, but also to develop approximate solution methods, such as Ritz method, for active solid statics. Moreover, the power of Ritz-type method can be further enhanced using deep learning methods if we use deep neural networks to construct the trial solutions of the variational problems. We then apply these variational methods and the deep Ritz method to study the spontaneous bending and contraction of a thin active circular plate that is induced by internal asymmetric active contraction. The circular plate is found to be bent towards its contracting side. The study of such a simple toy system gives implications for understanding the morphogenesis of solid-like confluent cell monolayers. In addition, we introduce a so-called activogravity length to characterize the importance of gravitational forces relative to internal active contraction in driving the bending of the active plate. When the lateral plate dimension is larger than the activogravity length (about 100 micron), gravitational forces become important. Such gravitaxis behaviors at multicellular scales may play significant roles in the morphogenesis and in the up-down symmetry broken during tissue development.

preprint2022arXiv

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to achieve robust performance and wider application scenarios with the guidance of objects' temperature information. However, the lack of paired training samples is the main bottleneck for unlocking the power of RGB-T tracking. Since it is laborious to collect high-quality RGB-T sequences, recent benchmarks only provide test sequences. In this paper, we construct a large-scale benchmark with high diversity for visible-thermal UAV tracking (VTUAV), including 500 sequences with 1.7 million high-resolution (1920 $\times$ 1080 pixels) frame pairs. In addition, comprehensive applications (short-term tracking, long-term tracking and segmentation mask prediction) with diverse categories and scenes are considered for exhaustive evaluation. Moreover, we provide a coarse-to-fine attribute annotation, where frame-level attributes are provided to exploit the potential of challenge-specific trackers. In addition, we design a new RGB-T baseline, named Hierarchical Multi-modal Fusion Tracker (HMFT), which fuses RGB-T data in various levels. Numerous experiments on several datasets are conducted to reveal the effectiveness of HMFT and the complement of different fusion types. The project is available at here.

preprint2022arXiv

Vision-based Anti-UAV Detection and Tracking

Unmanned aerial vehicles (UAV) have been widely used in various fields, and their invasion of security and privacy has aroused social concern. Several detection and tracking systems for UAVs have been introduced in recent years, but most of them are based on radio frequency, radar, and other media. We assume that the field of computer vision is mature enough to detect and track invading UAVs. Thus we propose a visible light mode dataset called Dalian University of Technology Anti-UAV dataset, DUT Anti-UAV for short. It contains a detection dataset with a total of 10,000 images and a tracking dataset with 20 videos that include short-term and long-term sequences. All frames and images are manually annotated precisely. We use this dataset to train several existing detection algorithms and evaluate the algorithms' performance. Several tracking methods are also tested on our tracking dataset. Furthermore, we propose a clear and simple tracking algorithm combined with detection that inherits the detector's high precision. Extensive experiments show that the tracking performance is improved considerably after fusing detection, thus providing a new attempt at UAV tracking using our dataset.The datasets and results are publicly available at: https://github.com/wangdongdut/DUT-Anti-UAV

preprint2021arXiv

A Dataset And Benchmark Of Underwater Object Detection For Robot Picking

Underwater object detection for robot picking has attracted a lot of interest. However, it is still an unsolved problem due to several challenges. We take steps towards making it more realistic by addressing the following challenges. Firstly, the currently available datasets basically lack the test set annotations, causing researchers must compare their method with other SOTAs on a self-divided test set (from the training set). Training other methods lead to an increase in workload and different researchers divide different datasets, resulting there is no unified benchmark to compare the performance of different algorithms. Secondly, these datasets also have other shortcomings, e.g., too many similar images or incomplete labels. Towards these challenges we introduce a dataset, Detecting Underwater Objects (DUO), and a corresponding benchmark, based on the collection and re-annotation of all relevant datasets. DUO contains a collection of diverse underwater images with more rational annotations. The corresponding benchmark provides indicators of both efficiency and accuracy of SOTAs (under the MMDtection framework) for academic research and industrial applications, where JETSON AGX XAVIER is used to assess detector speed to simulate the robot-embedded environment.

preprint2021arXiv

A vector Riemann-Hilbert approach to the Muttalib-Borodin ensembles

In this paper, we consider Muttalib-Borodin ensemble of Laguerre type, a determinantal point process over $[0,\infty)$ which depends on the varying weights $x^αe^{-nV(x)}$, $α>-1$, and a parameter $θ$. For $θ$ being a positive integer, we derive asymptotics of the associated biorthogonal polynomials near the origin for a large class of potential functions $V$ as $n\to \infty$. This further allows us to establish the hard edge scaling limit of the correlation kernel, which is previously only known in the special cases and conjectured to be universal. Our proof is based on the Deift/Zhou nonlinear steepest descent analysis of two $1 \times 2$ vector-valued Riemann-Hilbert problems that characterize the biorthogonal polynomials and the explicit constructions of $(θ+1)\times(θ+1)$ local parametrices near the origin in terms of the Meijer G-functions.

preprint2021arXiv

Holographic insulator/superconductor phase transitions with excited states

We construct a family of solutions of the holographic insulator/superconductor phase transitions with the excited states in the AdS soliton background by using both the numerical and analytical methods. The interesting point is that the improved Sturm-Liouville method can not only analytically investigate the properties of the phase transition with the excited states, but also the distributions of the condensed fields in the vicinity of the critical point. We observe that, regardless of the type of the holographic model, the excited state has a higher critical chemical potential than the corresponding ground state, and the difference of the dimensionless critical chemical potential between the consecutive states is around 2.4, which is different from the finding of the metal/superconductor phase transition in the AdS black hole background. Furthermore, near the critical point, we find that the phase transition of the systems is of the second order and a linear relationship exists between the charge density and chemical potential for all the excited states in both s-wave and p-wave insulator/superconductor models.

preprint2021arXiv

Holographic superconductors in 4D Einstein-Gauss-Bonnet gravity

We investigate the neutral AdS black-hole solution in the consistent $D\rightarrow4$ Einstein-Gauss-Bonnet gravity proposed in [K. Aoki, M.A. Gorji, and S. Mukohyama, Phys. Lett. B {\bf 810}, 135843 (2020)] and construct the gravity duals of ($2+1$)-dimensional superconductors with Gauss-Bonnet corrections in the probe limit. We find that the curvature correction has a more subtle effect on the scalar condensates in the s-wave superconductor in ($2+1$)-dimensions, which is different from the finding in the higher-dimensional superconductors that the higher curvature correction makes the scalar hair more difficult to be developed in the full parameter space. However, in the p-wave case, we observe that the higher curvature correction always makes it harder for the vector condensates to form in various dimensions. Moreover, we note that the higher curvature correction results in the larger deviation from the expected relation in the gap frequency $ω_g/T_c\approx 8$ in both ($2+1$)-dimensional s-wave and p-wave models.

preprint2021arXiv

Necessary and sufficient criterion of steering for two-qubit T states

Einstein-Podolsky-Rosen (EPR) steering is the ability that an observer persuades a distant observer to share entanglement by making local measurements. Determining a quantum state is steerable or unsteerable remains an open problem. Here, we derive a new steering inequality with infinite measurements corresponding to an arbitrary two-qubit T state, from consideration of EPR steering inequalities with N projective measurement settings for each side. In fact, the steering inequality is also a sufficient criterion for guaranteering that the T state is unsteerable. Hence, the steering inequality can be viewed as a necessary and sufficient criterion to distinguish whether the T state is steerable or unsteerable. In order to reveal the fact that the set composed of steerable states is the strict subset of the set made up of entangled states, we prove theoretically that all separable T states can not violate the steering inequality. Moreover, we put forward a method to estimate the maximum violation from concurrence for arbitrary two-qubit T states, which indicates that the T state is steerable if its concurrence exceeds 1/4.

preprint2021arXiv

Reconfigurable-intelligent-surface-assisted Downlink Transmission Design via Bayesian Optimization

This paper investigates the transmission design in the reconfigurable-intelligent-surface (RIS)-assisted downlink system. The channel state information (CSI) is usually difficult to be estimated at the base station (BS) when the RIS is not equipped with radio frequency chains. In this paper, we propose a downlink transmission framework with unknown CSI via Bayesian optimization. Since the CSI is not available at the BS, we treat the unknown objective function as the black-box function and take the beamformer, the phase shift, and the receiving filter as the input. Then the objective function is decomposed as the sum of low-dimension subfunctions to reduce the complexity. By re-expressing the power constraint of the BS in spherical coordinates, the original constraint problem is converted into an equivalent unconstrained problem. The users estimate the sum MSE of the training symbols as the objective value and feed it back to the BS. We assume a Gaussian prior of the feedback samples and the next query point is updated by minimizing the constructed acquisition function. Furthermore, this framework can also be applied to the power transfer system and fairness problems. Simulation results validate the effectiveness of the proposed transmission scheme in the downlink data transmission and power transfer.

preprint2021arXiv

Robust stimulated Raman shortcut-to-adiabatic passage by invariant-based optimal control

The stimulated Raman adiabatic passage (STIRAP) shows an efficient technique that accurately transfers population between two discrete quantum states with the same parity, in three-level quantum systems based on adiabatic evolution. This technique has widely theoretical and experimental applications in many fields of physics, chemistry, and beyond. Here, we present a generally robust approach to speed up STIRAP with invariant-based shortcut to adiabaticity. By controlling the dynamical process, we inversely design a family of Hamiltonians that can realize fast and accurate population transfer from the first to the third level, while the systematic error is largely suppressed in general. Furthermore, a detailed trade-off relation between the population of the intermediate state and the amplitudes of Rabi frequencies in the transfer process is illustrated. These results provide an optimal route toward manipulating the evolution of three-level quantum systems in future quantum information processing.

preprint2021arXiv

Self-Amplification of Coherent Energy Modulation in Seeded Free-Electron Lasers

The spectroscopic techniques for time-resolved fine analysis of matter require coherent X-ray radiation with femtosecond duration and high average brightness. Seeded free-electron lasers (FELs), which use the frequency up-conversion of an external seed laser to improve temporal coherence, are ideal for providing fully coherent soft X-ray pulses. However, it is difficult to operate seeded FELs at a high repetition rate due to the limitations of present state-of-the-art laser systems. Here, we report the novel self-modulation method for enhancing laser-induced energy modulation, thereby significantly reducing the requirement of an external laser system. Driven by this scheme, we experimentally realize high harmonic generation in a seeded FEL using an unprecedentedly small energy modulation. An electron beam with a laser-induced energy modulation as small as 1.8 times the slice energy spread is used for lasing at the 7th harmonic of a 266-nm seed laser in a single-stage high-gain harmonic generation (HGHG) setup and the 30th harmonic of the seed laser in a two-stage HGHG setup. The results mark a major step towards a high-repetition-rate, fully coherent X-ray FEL.

preprint2020arXiv

A deep learning approach to multi-track location and orientation in gaseous drift chambers

Accurate measuring the location and orientation of individual particles in a beam monitoring system is of particular interest to researchers in multiple disciplines. Among feasible methods, gaseous drift chambers with hybrid pixel sensors have the great potential to realize long-term stable measurement with considerable precision. In this paper, we introduce deep learning to analyze patterns in the beam projection image to facilitate three-dimensional reconstruction of particle tracks. We propose an end-to-end neural network based on segmentation and fitting for feature extraction and regression. Two segmentation branches, named binary segmentation and semantic segmentation, perform initial track determination and pixel-track association. Then pixels are assigned to multiple tracks, and a weighted least squares fitting is implemented with full back-propagation. Besides, we introduce a center-angle measure to judge the precision of location and orientation by combining two separate factors. The initial position resolution achieves 8.8 $μm$ for the single track and 11.4 $μm$ (15.2 $μm$) for the 1-3 tracks (1-5 tracks), and the angle resolution achieves 0.15$^{\circ}$ and 0.21$^{\circ}$ (0.29$^{\circ}$) respectively. These results show a significant improvement in accuracy and multi-track compatibility compared to traditional methods.

preprint2020arXiv

An efficient iterative method for reconstructing surface from point clouds

Surface reconstruction from point clouds is a fundamental step in many applications in computer vision. In this paper, we develop an efficient iterative method on a variational model for the surface reconstruction from point clouds. The surface is implicitly represented by indicator functions and the energy functional is then approximated based on such representations using heat kernel convolutions. We then develop a novel iterative method to minimize the approximate energy and prove the energy decaying property during each iteration. We then use asymptotic expansion to give a connection between the proposed algorithm and active contour models. Extensive numerical experiments are performed in both 2- and 3- dimensional Euclidean spaces to show that the proposed method is simple, efficient, and accurate.

preprint2020arXiv

ASR-Free Pronunciation Assessment

Most of the pronunciation assessment methods are based on local features derived from automatic speech recognition (ASR), e.g., the Goodness of Pronunciation (GOP) score. In this paper, we investigate an ASR-free scoring approach that is derived from the marginal distribution of raw speech signals. The hypothesis is that even if we have no knowledge of the language (so cannot recognize the phones/words), we can still tell how good a pronunciation is, by comparatively listening to some speech data from the target language. Our analysis shows that this new scoring approach provides an interesting correction for the phone-competition problem of GOP. Experimental results on the ERJ dataset demonstrated that combining the ASR-free score and GOP can achieve better performance than the GOP baseline.

preprint2020arXiv

Backreacting holographic superconductors from the coupling of a scalar field to the Einstein tensor

We investigate the properties of the backreacting holographic superconductors from the coupling of a scalar field to the Einstein tensor in the background of a d-dimensional AdS black hole. Imposing the Dirichlet boundary condition of the trial function without the Neumann boundary conditions, we improve the analytical Sturm-Liouville method with an iterative procedure to explore the pure effect of the Einstein tensor on the holographic superconductors and find that the Einstein tensor hinders the condensate of the scalar field but does not affect the critical phenomena. Our analytical findings are in very good agreement with the numerical results from the "marginally stable modes" method, which implies that the Sturm-Liouville method is still powerful to study the holographic superconductors from the coupling of a scalar field to the Einstein tensor even if we consider the backreactions.

preprint2020arXiv

Consistency of archetypal analysis

Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data. For fixed $k$, the method finds a convex polytope with $k$ vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared distance between the data and the polytope is minimal. In this paper, we prove a consistency result that shows if the data is independently sampled from a probability measure with bounded support, then the archetype points converge to a solution of the continuum version of the problem, of which we identify and establish several properties. We also obtain the convergence rate of the optimal objective values under appropriate assumptions on the distribution. If the data is independently sampled from a distribution with unbounded support, we also prove a consistency result for a modified method that penalizes the dispersion of the archetype points. Our analysis is supported by detailed computational experiments of the archetype points for data sampled from the uniform distribution in a disk, the normal distribution, an annular distribution, and a Gaussian mixture model.

preprint2020arXiv

Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises

Adversarial attack of CNN aims at deceiving models to misbehave by adding imperceptible perturbations to images. This feature facilitates to understand neural networks deeply and to improve the robustness of deep learning models. Although several works have focused on attacking image classifiers and object detectors, an effective and efficient method for attacking single object trackers of any target in a model-free way remains lacking. In this paper, a cooling-shrinking attack method is proposed to deceive state-of-the-art SiameseRPN-based trackers. An effective and efficient perturbation generator is trained with a carefully designed adversarial loss, which can simultaneously cool hot regions where the target exists on the heatmaps and force the predicted bounding box to shrink, making the tracked target invisible to trackers. Numerous experiments on OTB100, VOT2018, and LaSOT datasets show that our method can effectively fool the state-of-the-art SiameseRPN++ tracker by adding small perturbations to the template or the search regions. Besides, our method has good transferability and is able to deceive other top-performance trackers such as DaSiamRPN, DaSiamRPN-UpdateNet, and DiMP. The source codes are available at https://github.com/MasterBin-IIAU/CSA.

preprint2020arXiv

CovidSens: A Vision on Reliable Social Sensing for COVID-19

With the spiraling pandemic of the Coronavirus Disease 2019 (COVID-19), it has becoming inherently important to disseminate accurate and timely information about the disease. Due to the ubiquity of Internet connectivity and smart devices, social sensing is emerging as a dynamic AI-driven sensing paradigm to extract real-time observations from online users. In this paper, we propose CovidSens, a vision of social sensing based risk alert systems to spontaneously obtain and analyze social data to infer COVID-19 propagation. CovidSens can actively help to keep the general public informed about the COVID-19 spread and identify risk-prone areas. The CovidSens concept is motivated by three observations: 1) people actively share their experience of COVID-19 via online social media, 2) official warning channels and news agencies are relatively slower than people reporting on social media, and 3) online users are frequently equipped with powerful mobile devices that can perform data processing and analytics. We envision unprecedented opportunities to leverage posts generated by ordinary people to build real-time sensing and analytic system for gathering and circulating COVID-19 propagation data. Specifically, the vision of CovidSens attempts to answer the questions: How to distill reliable information on COVID-19 with prevailing rumors and misinformation? How to inform the general public about the state of the spread timely and effectively? How to leverage the computational power on edge devices to construct fully integrated edge-based social sensing platforms? In this vision paper, we discuss the roles of CovidSens and identify potential challenges in developing reliable social sensing based risk alert systems. We envision that approaches originating from multiple disciplines can be effective in addressing the challenges. Finally, we outline a few research directions for future work in CovidSens.

preprint2020arXiv

Curriculum Audiovisual Learning

Associating sound and its producer in complex audiovisual scene is a challenging task, especially when we are lack of annotated training data. In this paper, we present a flexible audiovisual model that introduces a soft-clustering module as the audio and visual content detector, and regards the pervasive property of audiovisual concurrency as the latent supervision for inferring the correlation among detected contents. To ease the difficulty of audiovisual learning, we propose a novel curriculum learning strategy that trains the model from simple to complex scene. We show that such ordered learning procedure rewards the model the merits of easy training and fast convergence. Meanwhile, our audiovisual model can also provide effective unimodal representation and cross-modal alignment performance. We further deploy the well-trained model into practical audiovisual sound localization and separation task. We show that our localization model significantly outperforms existing methods, based on which we show comparable performance in sound separation without referring external visual supervision. Our video demo can be found at https://youtu.be/kuClfGG0cFU.

preprint2020arXiv

DASC: Towards A Road Damage-Aware Social-Media-Driven Car Sensing Framework for Disaster Response Applications

While vehicular sensor networks (VSNs) have earned the stature of a mobile sensing paradigm utilizing sensors built into cars, they have limited sensing scopes since car drivers only opportunistically discover new events. Conversely, social sensing is emerging as a new sensing paradigm where measurements about the physical world are collected from humans. In contrast to VSNs, social sensing is more pervasive, but one of its key limitations lies in its inconsistent reliability stemming from the data contributed by unreliable human sensors. In this paper, we present DASC, a road Damage-Aware Social-media-driven Car sensing framework that exploits the collective power of social sensing and VSNs for reliable disaster response applications. However, integrating VSNs with social sensing introduces a new set of challenges: i) How to leverage noisy and unreliable social signals to route the vehicles to accurate regions of interest? ii) How to tackle the inconsistent availability (e.g., churns) caused by car drivers being rational actors? iii) How to efficiently guide the cars to the event locations with little prior knowledge of the road damage caused by the disaster, while also handling the dynamics of the physical world and social media? The DASC framework addresses the above challenges by establishing a novel hybrid social-car sensing system that employs techniques from game theory, feedback control, and Markov Decision Process (MDP). In particular, DASC distills signals emitted from social media and discovers the road damages to effectively drive cars to target areas for verifying emergency events. We implement and evaluate DASC in a reputed vehicle simulator that can emulate real-world disaster response scenarios. The results of a real-world application demonstrate the superiority of DASC over current VSNs-based solutions in detection accuracy and efficiency.

preprint2020arXiv

Development of readout electronics a novel beam monitoring system for ion research facility accelerator

This article presents the readout electronics of a novel beam monitoring system for ion research facility accelerator. The readout electronics are divided into Front-end Card (FEC) and Readout Control Unit (RCU). FEC uses Topmetal II minus to processes the energy of the hitting particles and convert it into a voltage signal. The main function of RCU is to digitize the analog output signal of FEC and format the raw data. On the other hand, the RCU also processes the control commands from the host and distributes the commands according to the mapping. The readout electronic has been characterized and calibrated in the laboratory, and have been installed with the detector. Implementation and testing of readout electronics have been discussed.

preprint2020arXiv

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning

Domain generalization remains a critical problem for speaker recognition, even with the state-of-the-art architectures based on deep neural nets. For example, a model trained on reading speech may largely fail when applied to scenarios of singing or movie. In this paper, we propose a domain-invariant projection to improve the generalizability of speaker vectors. This projection is a simple neural net and is trained following the Model-Agnostic Meta-Learning (MAML) principle, for which the objective is to classify speakers in one domain if it had been updated with speech data in another domain. We tested the proposed method on CNCeleb, a new dataset consisting of single-speaker multi-condition (SSMC) data. The results demonstrated that the MAML-based domain-invariant projection can produce more generalizable speaker vectors, and effectively improve the performance in unseen domains.

preprint2020arXiv

Experimental demonstration of complementarity relations between quantum steering criteria

The ability that one system immediately affects another one by using local measurements is regarded as quantum steering, which can be detected by various steering criteria. Recently, Mondal et al. [Phys. Rev. A 98, 052330 (2018)] derived the complementarity relations of coherence steering criteria, and revealed that the quantum steering of system can be observed through the average coherence of subsystem. Here, we experimentally verify the complementarity relations between quantum steering criteria by employing two-photon Bell-like states and three Pauli operators. The results demonstrate that if prepared quantum states can violate two setting coherence steering criteria and turn out to be steerable states, then it cannot violate the complementary settings criteria. Three measurement settings inequality, which establish a complementarity relation between these two coherence steering criteria, always holds in experiment. Besides, we experimentally certify that the strengths of coherence steering criteria dependent on the choice of coherence measure. In comparison with two setting coherence steering criteria based on l1 norm of coherence and relative entropy of coherence, our experimental results show that the steering criterion based on skew information of coherence is more stronger in detecting the steerability of quantum states. Thus, our experimental demonstrations can deepen the understanding of the relation between the quantum steering and quantum coherence.

preprint2020arXiv

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Applying artificial intelligence techniques in medical imaging is one of the most promising areas in medicine. However, most of the recent success in this area highly relies on large amounts of carefully annotated data, whereas annotating medical images is a costly process. In this paper, we propose a novel method, called FocalMix, which, to the best of our knowledge, is the first to leverage recent advances in semi-supervised learning (SSL) for 3D medical image detection. We conducted extensive experiments on two widely used datasets for lung nodule detection, LUNA16 and NLST. Results show that our proposed SSL methods can achieve a substantial improvement of up to 17.3% over state-of-the-art supervised learning approaches with 400 unlabeled CT scans.

preprint2020arXiv

Graph Representation Learning for Merchant Incentive Optimization in Mobile Payment Marketing

Mobile payment such as Alipay has been widely used in our daily lives. To further promote the mobile payment activities, it is important to run marketing campaigns under a limited budget by providing incentives such as coupons, commissions to merchants. As a result, incentive optimization is the key to maximizing the commercial objective of the marketing campaign. With the analyses of online experiments, we found that the transaction network can subtly describe the similarity of merchants' responses to different incentives, which is of great use in the incentive optimization problem. In this paper, we present a graph representation learning method atop of transaction networks for merchant incentive optimization in mobile payment marketing. With limited samples collected from online experiments, our end-to-end method first learns merchant representations based on an attributed transaction networks, then effectively models the correlations between the commercial objectives each merchant may achieve and the incentives under varying treatments. Thus we are able to model the sensitivity to incentive for each merchant, and spend the most budgets on those merchants that show strong sensitivities in the marketing campaign. Extensive offline and online experimental results at Alipay demonstrate the effectiveness of our proposed approach.

preprint2020arXiv

High-Performance Long-Term Tracking with Meta-Updater

Long-term visual tracking has drawn increasing attention because it is much closer to practical applications than short-term tracking. Most top-ranked long-term trackers adopt the offline-trained Siamese architectures, thus, they cannot benefit from great progress of short-term trackers with online update. However, it is quite risky to straightforwardly introduce online-update-based trackers to solve the long-term problem, due to long-term uncertain and noisy observations. In this work, we propose a novel offline-trained Meta-Updater to address an important but unsolved problem: Is the tracker ready for updating in the current frame? The proposed meta-updater can effectively integrate geometric, discriminative, and appearance cues in a sequential manner, and then mine the sequential information with a designed cascaded LSTM module. Our meta-updater learns a binary output to guide the tracker's update and can be easily embedded into different trackers. This work also introduces a long-term tracking framework consisting of an online local tracker, an online verifier, a SiamRPN-based re-detector, and our meta-updater. Numerous experimental results on the VOT2018LT, VOT2019LT, OxUvALT, TLP, and LaSOT benchmarks show that our tracker performs remarkably better than other competing algorithms. Our project is available on the website: https://github.com/Daikenan/LTMU.

preprint2020arXiv

Improve bone age assessment by learning from anatomical local regions

Skeletal bone age assessment (BAA), as an essential imaging examination, aims at evaluating the biological and structural maturation of human bones. In the clinical practice, Tanner and Whitehouse (TW2) method is a widely-used method for radiologists to perform BAA. The TW2 method splits the hands into Region Of Interests (ROI) and analyzes each of the anatomical ROI separately to estimate the bone age. Because of considering the analysis of local information, the TW2 method shows accurate results in practice. Following the spirit of TW2, we propose a novel model called Anatomical Local-Aware Network (ALA-Net) for automatic bone age assessment. In ALA-Net, anatomical local extraction module is introduced to learn the hand structure and extract local information. Moreover, we design an anatomical patch training strategy to provide extra regularization during the training process. Our model can detect the anatomical ROIs and estimate bone age jointly in an end-to-end manner. The experimental results show that our ALA-Net achieves a new state-of-the-art single model performance of 3.91 mean absolute error (MAE) on the public available RSNA dataset. Since the design of our model is well consistent with the well recognized TW2 method, it is interpretable and reliable for clinical usage.

preprint2020arXiv

Improved tripartite uncertainty relation with quantum memory

Uncertainty principle is a striking and fundamental feature in quantum mechanics distinguishing from classical mechanics. It offers an important lower bound to predict outcomes of two arbitrary incompatible observables measured on a particle. In quantum information theory, this uncertainty principle is popularly formulized in terms of entropy. Here, we present an improvement of tripartite quantum-memory-assisted entropic uncertainty relation. The uncertainty's lower bound is derived by considering mutual information and Holevo quantity. It shows that the bound derived by this method will be tighter than the lower bound in [Phys. Rev. Lett. 103, 020402 (2009)]. Furthermore, regarding a pair of mutual unbiased bases as the incompatibility, our bound will become extremely tight for the three-qubit $\emph{X}$-state system, completely coinciding with the entropy-based uncertainty, and can restore Renes ${\emph{et al.}}$'s bound with respect to arbitrary tripartite pure states. In addition, by applying our lower bound, one can attain the tighter bound of quantum secret key rate, which is of basic importance to enhance the security of quantum key distribution protocols.

preprint2020arXiv

Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking

In this study, we propose a novel RGB-T tracking framework by jointly modeling both appearance and motion cues. First, to obtain a robust appearance model, we develop a novel late fusion method to infer the fusion weight maps of both RGB and thermal (T) modalities. The fusion weights are determined by using offline-trained global and local multimodal fusion networks, and then adopted to linearly combine the response maps of RGB and T modalities. Second, when the appearance cue is unreliable, we comprehensively take motion cues, i.e., target and camera motions, into account to make the tracker robust. We further propose a tracker switcher to switch the appearance and motion trackers flexibly. Numerous results on three recent RGB-T tracking datasets show that the proposed tracker performs significantly better than other state-of-the-art algorithms.

preprint2020arXiv

Metal-free magnetism in chemically doped covalent organic frameworks

Organic and molecule-based magnets are not easily attainable, because to introduce stable paramagnetic centers to pure organic systems is challenging. Crystalline covalent organic frameworks (COFs) with high designability and chemical diversity constitute ideal platforms to access intriguing magnetic phenomena of organic materials. In this work, we proposed a general approach to attain unpaired electron spin and metal-free magnetism in narrow-band COFs by chemical doping. By using density functional theory calculations, we found that dopants with energy-matched frontier orbitals to COFs not only inject charges to them but also further localize the charges through orbital hybridization and formation of supramolecular charge-transfer complex. The localized states enable stable paramagnetic centers introduced to nonmagnetic COFs. Based on this discovery, we designed two new COFs with narrow valence band, which show prospective magnetism after doping with iodine. Further, we unraveled magnetic anisotropy in two-dimensional COFs and showed that both spin-conduction and magnetic interactions can be modulated by manipulating the building blocks of COFs. Our work highlights a practical scenario to attain magnetism in COFs and other organic materials, which hold great promise for applications in organic spintronic devices.

preprint2020arXiv

Metastable atomic layer deposition: 3D self-assembly towards ultra dark materials

Black body materials prove promising candidates to meet future energy demands as they are able to harvest energy from the total bandwidth of solar radiation. Here, we report on high absorption (> 98 %) near-black body-like structures consisting of a silica scaffold and Ag nanoparticles with a layer thickness below 10 um; fabricated using metastable atomic layer deposition (MS-ALD) and to be applied for a wide solar spectrum ranging from 220 nm to 2500 nm. Several effects contribute collectively and in a synergistic manner to the high absorbance, including the pronounced heterogeneity of the nanoparticles in size and shape, particle plasmon hybridization and the trapping of omni-directionally scattered light in the 3D hierarchical hybrid structures. We propose that, in the future, MS-ALD needs to be considered as a simple and promising method to fabricate black-body materials with excellent broadband absorption.

preprint2020arXiv

Multispectral Pan-sharpening via Dual-Channel Convolutional Network with Convolutional LSTM Based Hierarchical Spatial-Spectral Feature Fusion

Multispectral pan-sharpening aims at producing a high resolution (HR) multispectral (MS) image in both spatial and spectral domains by fusing a panchromatic (PAN) image and a corresponding MS image. In this paper, we propose a novel dual-channel network (DCNet) framework for MS pan-sharpening. In our DCNet, the dual-channel backbone involves a spatial channel to capture spatial information with a 2D CNN, and a spectral channel to extract spectral information with a 3D CNN. This heterogeneous 2D/3D CNN architecture can minimize causing spectral information distortion, which typically happens in conventional 2D CNN models. In order to fully integrate the spatial and spectral features captured from different levels, we introduce a multi-level fusion strategy. Specifically, a spatial-spectral CLSTM (S$^2$-CLSTM) module is proposed for fusing the hierarchical spatial and spectral features, which can effectively capture correlations among multi-level features. The S$^2$-CLSTM module attaches two fusion ways: the intra-level fusion via bi-directional lateral connections and inter-level fusion via the cell state in the S$^2$-CLSTM. Finally, the ideal HR-MS image is recovered by a reconstruction module. Extensive experiments have been conducted at both simulated lower scale and the original scale of real-world datasets. Compared with the state-of-the-art methods, the proposed DCNet achieves superior or competitive performance.

preprint2020arXiv

Neural Discriminant Analysis for Deep Speaker Embedding

Probabilistic Linear Discriminant Analysis (PLDA) is a popular tool in open-set classification/verification tasks. However, the Gaussian assumption underlying PLDA prevents it from being applied to situations where the data is clearly non-Gaussian. In this paper, we present a novel nonlinear version of PLDA named as Neural Discriminant Analysis (NDA). This model employs an invertible deep neural network to transform a complex distribution to a simple Gaussian, so that the linear Gaussian model can be readily established in the transformed space. We tested this NDA model on a speaker recognition task where the deep speaker vectors (x-vectors) are presumably non-Gaussian. Experimental results on two datasets demonstrate that NDA consistently outperforms PLDA, by handling the non-Gaussian distributions of the x-vectors.

preprint2020arXiv

Newcomer Candidate: Characterizing Contributions of a Novice Developer to GitHub

Context: To attract, onboard, and retain any new-comer in Open Source Software (OSS) projects is vital to their livelihood. Recent studies conclude that OSS projects risk failure due to abandonment and poor participation of newcomers. Evidence suggests more new users are joining GitHub, however, the extent to which they contribute to OSS projects is unknown. Objective: In this study, we coin the term 'newcomer candidate' to describe new users to the GitHub platform. Our objective is to track and characterize their initial contributions. As a preliminary survey, we collected 208 newcomer candidate contributions in GitHub. Using this dataset, we then plan to track their contributions to reveal insights. Method: We will use a mixed-methods approach, i.e., quantitative and qualitative, to identify whether or not newcomer candidates practice social coding, the kinds of their contributions, projects they target, and the proportion that they eventually onboard to an OSS project. Limitation: The key limitation is that our newcomer candidates are restricted to those that were collected from our preliminary survey.

preprint2020arXiv

Privacy Risk and Preservation For COVID-19 Contact Tracing Apps

Contact tracing in the COVID-19 pandemic is key to prevent the further spread of COVID-19. Countries and regions around the world have developed and deployed or are considering adopting contact-tracing software or mobile apps. While contact tracing apps and software play an important role in the pandemic, red flags have been raised regarding the privacy risk associated with contact tracing. In this short paper, we provide an overview on the GPS and Bluetooth based contact-tracing apps in the framework of both centralized and decentralized models, examine the associated privacy risk and the effectiveness of the privacy-preserving measures adopted in different apps.

preprint2020arXiv

Stabilizing Training of Generative Adversarial Nets via Langevin Stein Variational Gradient Descent

Generative adversarial networks (GANs), famous for the capability of learning complex underlying data distribution, are however known to be tricky in the training process, which would probably result in mode collapse or performance deterioration. Current approaches of dealing with GANs' issues almost utilize some practical training techniques for the purpose of regularization, which on the other hand undermines the convergence and theoretical soundness of GAN. In this paper, we propose to stabilize GAN training via a novel particle-based variational inference -- Langevin Stein variational gradient descent (LSVGD), which not only inherits the flexibility and efficiency of original SVGD but aims to address its instability issues by incorporating an extra disturbance into the update dynamics. We further demonstrate that by properly adjusting the noise variance, LSVGD simulates a Langevin process whose stationary distribution is exactly the target distribution. We also show that LSVGD dynamics has an implicit regularization which is able to enhance particles' spread-out and diversity. At last we present an efficient way of applying particle-based variational inference on a general GAN training procedure no matter what loss function is adopted. Experimental results on one synthetic dataset and three popular benchmark datasets -- Cifar-10, Tiny-ImageNet and CelebA validate that LSVGD can remarkably improve the performance and stability of various GAN models.

preprint2020arXiv

Towards Privacy-aware Task Allocation in Social Sensing based Edge Computing Systems

With the advance in mobile computing, Internet of Things, and ubiquitous wireless connectivity, social sensing based edge computing (SSEC) has emerged as a new computation paradigm where people and their personally owned devices collect sensor measurements from the physical world and process them at the edge of the network. This paper focuses on a privacy-aware task allocation problem where the goal is to optimize the computation task allocation in SSEC systems while respecting the users' customized privacy settings. It introduces a novel Game-theoretic Privacy-aware Task Allocation (G-PATA) framework to achieve the goal. G-PATA includes (i) a bottom-up game-theoretic model to generate the maximum payoffs at end devices while satisfying the end user's privacy settings; (ii) a top-down incentive scheme to adjust the rewards for the tasks to ensure that the task allocation decisions made by end devices meet the Quality of Service (QoS) requirements of the applications. Furthermore, the framework incorporates an efficient load balancing and iteration reduction component to adapt to the dynamic changes in status and privacy configurations of end devices. The G-PATA framework was implemented on a real-world edge computing platform that consists of heterogeneous end devices (Jetson TX1 and TK1 boards, and Raspberry Pi3). We compare G-PATA with state-of-the-art task allocation schemes through two real-world social sensing applications. The results show that G-PATA significantly outperforms existing approaches under various privacy settings (our scheme achieved as much as 47% improvements in delay reduction for the application and 15% more payoffs for end devices compared to the baselines.).

preprint2020arXiv

WANA: Symbolic Execution of Wasm Bytecode for Cross-Platform Smart Contract Vulnerability Detection

Many popular blockchain platforms are supporting smart contracts for building decentralized applications. However, the vulnerabilities within smart contracts have led to serious financial loss to their end users. For the EOSIO blockchain platform, effective vulnerability detectors are still limited. Furthermore, existing vulnerability detection tools can only support one blockchain platform. In this work, we present WANA, a cross-platform smart contract vulnerability detection tool based on the symbolic execution of WebAssembly bytecode. Furthermore, WANA proposes a set of test oracles to detect the vulnerabilities in EOSIO and Ethereum smart contracts based on WebAssembly bytecode analysis. Our experimental analysis shows that WANA can effectively detect vulnerabilities in both EOSIO and Ethereum smart contracts with high efficiency.

preprint2019arXiv

Enlightening force chains: a review of photoelasticimetry in granular matter

A photoelastic material will reveal its internal stresses when observed through polarizing filters. This eye-catching property has enlightened our understanding of granular materials for over half a century, whether in the service of art, education, or scientific research. In this review article in honor of Robert Behringer, we highlight both his pioneering use of the method in physics research, and its reach into the public sphere through museum exhibits and outreach programs. We aim to provide clear protocols for artists, exhibit-designers, educators, and scientists to use in their own endeavors. It is our hope that this will build awareness about the ubiquitous presence of granular matter in our lives, enlighten its puzzling behavior, and promote conversations about its importance in environmental and industrial contexts. To aid in this endeavor, this paper also serves as a front door to a detailed wiki containing open, community-curated guidance on putting these methods into practice.

preprint2019arXiv

Experimental certification of steering criterion based on general entropic uncertainty relation

Quantum steering describes the phenomenon that one system can be immediately influenced by another with local measurements. It can be detected by the violation of a powerful and useful steering criterion from general entropic uncertainty relation. This criterion, in principle, can be evaluated straightforwardly and achieved by only probability distributions from a finite set of measurement settings. Herein, we experimentally verify the steering criterion by means of the two-photon Werner-like states and three Pauli measurements. The results indicate that quantum steering can be verified by the criterion in a convenient way. In particular, it is no need to perform the usual quantum state tomography in experiment, which reduces the required experimental resources greatly. Moreover, we demonstrate that the criterion is stronger than the linear one for the detecting quantum steering of the Werner-like states.

preprint2019arXiv

Experimental investigation of entropic uncertainty relations and coherence uncertainty relations

Uncertainty relation usually is one of the most important features in quantum mechanics, and is the backbone of quantum theory, which distinguishes from the rule in classical counterpart. Specifically, entropy-based uncertainty relations are of fundamental importance in the region of quantum information theory, offering one nontrivial bound of key rate towards quantum key distribution. In this work, we experimentally demonstrate the entropic uncertainty relations and coherence-based uncertainty relations in an all-optics platform. By means of preparing two kinds of bipartite initial states with high fidelity, i.e., Bell-like states and Bell-like diagonal states, we carry on local projective measurements over a complete set of mutually unbiased bases on the measured subsystem. In terms of quantum tomography, the density matrices of the initial states and the post-measurement states are reconstructed. It shows that our experimental results coincide with the theoretical predictions very well. Additionally, we also verify that the lower bounds of both the entropy-based and coherence-based uncertainty can be tightened by imposing the Holevo quantity and mutual information, and the entropic uncertainty is inversely correlated with the coherence. Our demonstrations might offer an insight into their uncertainty relations and their connection to quantum coherence in quantum information science, which might be applicable to the security analysis of quantum key distributions.

preprint2019arXiv

Experimental observation the Einstein-Podolsky-Rosen Steering based on the detection of entanglement

The Einstein-Podolsky-Rosen (EPR) steering is an intermediate quantum nonlocality between entanglement and Bell nonlocality, which plays an important role in quantum information processing tasks. In the past few years, the investigations concerning EPR steering have been demonstrated in a series of experiments. However, these studies rely on the relevant steering inequalities and the choices of measurement settings. Here, we experimentally verify the EPR steering via entanglement detection without using any steering inequality and measurement setting. By constructing two new states from a two-qubit target state, we observe the EPR steering by detecting the entanglement of these new states. The results show that the entanglement of the newly constructed states can be regarded as a new kind of steering witness for target states. Compared to the results of Xiao et al. [Phys. Rev. Lett. 118, 140404 (2017)], we find that the ability of detecting EPR steering in our scenario is stronger than two-setting projective measurements, which can observe more steerable states. Hence, our demonstrations can deepen the understanding of the connection between the EPR steering and entanglement.

preprint2019arXiv

Generalized random matrix model with additional interactions

We introduce a log-gas model that is a generalization of a random matrix ensemble with an additional interaction, whose strength depends on a parameter $γ$. The equilibrium density is computed by numerically solving the Riemann-Hilbert problem associated with the ensemble. The effect of the additional parameter $γ$ associated with the two-body interaction can be understood in terms of an effective $γ$-dependent single-particle confining potential.

preprint2018arXiv

Generalized superconductors from the coupling of a scalar field to the Einstein tensor and their refractive index in massive gravity

We construct the generalized superconductors from the coupling of a scalar field to the Einstein tensor in the massive gravity and investigate their negative refraction in the probe limit. We observe that the larger graviton mass and Einstein tensor coupling parameters both hinder the formation of the condensation, but the larger graviton mass or smaller coupling parameter makes it easier for the emergence of the Cave of Winds. Furthermore, we see that the larger graviton mass but smaller coupling parameter make the range of frequencies or the range of temperatures larger for which a negative Depine-Lakhtakia index occurs, which indicates that the graviton mass and Einstein tensor have completely different effects on the negative refraction. In addition, we find that the larger graviton mass and coupling parameters both can reduce the dissipation and improve the propagation in the holographic setup.

preprint2018arXiv

Optical control of magnetism in NiFe/VO2 heterostructures

Optical methods for magnetism manipulation have been considered as a promising strategy for ultralow-power and ultrahigh-speed spin switches, which becomes a hot spot in the field of spintronics. However, a widely applicable and efficient method to combine optical operation with magnetic modulation is still highly desired. Here, the strongly correlated electron material VO2 is introduced to realize phase-transition based optical control of the magnetism in NiFe. The NiFe/VO2 bilayer heterostructure features appreciable modulations in electrical conductivity (55%), coercivity (60%), and magnetic anisotropy (33.5%). Further analyses indicate that interfacial strain coupling plays a crucial role in this modulation. Utilizing this optically controlled magnetism modulation feature, programmable Boolean logic gates (AND, OR, NAND, NOR, XOR, NXOR and NOT) for high-speed and low-power data processing are demonstrated based on this engineered heterostructure. As a demonstration of phase-transition spintronics, this work may pave the way for next-generation electronics in the post-Moore era.

preprint2016arXiv

A Universal Update-pacing Framework For Visual Tracking

This paper proposes a novel framework to alleviate the model drift problem in visual tracking, which is based on paced updates and trajectory selection. Given a base tracker, an ensemble of trackers is generated, in which each tracker's update behavior will be paced and then traces the target object forward and backward to generate a pair of trajectories in an interval. Then, we implicitly perform self-examination based on trajectory pair of each tracker and select the most robust tracker. The proposed framework can effectively leverage temporal context of sequential frames and avoid to learn corrupted information. Extensive experiments on the standard benchmark suggest that the proposed framework achieves superior performance against state-of-the-art trackers.

preprint2016arXiv

An efficient threshold dynamics method for wetting on rough surfaces

The threshold dynamics method developed by Merriman, Bence and Osher (MBO) is an efficient method for simulating the motion by mean curvature flow when the interface is away from the solid boundary. Direct generalization of MBO-type methods to the wetting problem with interfaces intersecting the solid boundary is not easy because solving the heat equation in a general domain with a wetting boundary condition is not as efficient as it is with the original MBO method. The dynamics of the contact point also follows a different law compared with the dynamics of the interface away from the boundary. In this paper, we develop an efficient volume preserving threshold dynamics method for simulating wetting on rough surfaces. This method is based on minimization of the weighted surface area functional over an extended domain that includes the solid phase. The method is simple, stable with $O(N \log N)$ complexity per time step and is not sensitive to the inhomogeneity or roughness of the solid boundary.

preprint2016arXiv

AP16-OL7: A Multilingual Database for Oriental Languages and A Language Recognition Baseline

We present the AP16-OL7 database which was released as the training and test data for the oriental language recognition (OLR) challenge on APSIPA 2016. Based on the database, a baseline system was constructed on the basis of the i-vector model. We report the baseline results evaluated in various metrics defined by the AP16-OLR evaluation plan and demonstrate that AP16-OL7 is a reasonable data resource for multilingual research.

preprint2016arXiv

Bayesian Neighbourhood Component Analysis

Learning a good distance metric in feature space potentially improves the performance of the KNN classifier and is useful in many real-world applications. Many metric learning algorithms are however based on the point estimation of a quadratic optimization problem, which is time-consuming, susceptible to overfitting, and lack a natural mechanism to reason with parameter uncertainty, an important property useful especially when the training set is small and/or noisy. To deal with these issues, we present a novel Bayesian metric learning method, called Bayesian NCA, based on the well-known Neighbourhood Component Analysis method, in which the metric posterior is characterized by the local label consistency constraints of observations, encoded with a similarity graph instead of independent pairwise constraints. For efficient Bayesian optimization, we explore the variational lower bound over the log-likelihood of the original NCA objective. Experiments on several publicly available datasets demonstrate that the proposed method is able to learn robust metric measures from small size dataset and/or from challenging training set with labels contaminated by errors. The proposed method is also shown to outperform a previous pairwise constrained Bayesian metric learning method.

preprint2016arXiv

Binary Speaker Embedding

The popular i-vector model represents speakers as low-dimensional continuous vectors (i-vectors), and hence it is a way of continuous speaker embedding. In this paper, we investigate binary speaker embedding, which transforms i-vectors to binary vectors (codes) by a hash function. We start from locality sensitive hashing (LSH), a simple binarization approach where binary codes are derived from a set of random hash functions. A potential problem of LSH is that the randomly sampled hash functions might be suboptimal. We therefore propose an improved Hamming distance learning approach, where the hash function is learned by a variable-sized block training that projects each dimension of the original i-vectors to variable-sized binary codes independently. Our experiments show that binary speaker embedding can deliver competitive or even better results on both speaker verification and identification tasks, while the memory usage and the computation cost are significantly reduced.

preprint2016arXiv

Can Machine Generate Traditional Chinese Poetry? A Feigenbaum Test

Recent progress in neural learning demonstrated that machines can do well in regularized tasks, e.g., the game of Go. However, artistic activities such as poem generation are still widely regarded as human's special capability. In this paper, we demonstrate that a simple neural model can imitate human in some tasks of art generation. We particularly focus on traditional Chinese poetry, and show that machines can do as well as many contemporary poets and weakly pass the Feigenbaum Test, a variant of Turing test in professional domains. Our method is based on an attention-based recurrent neural network, which accepts a set of keywords as the theme and generates poems by looking at each keyword during the generation. A number of techniques are proposed to improve the model, including character vector initialization, attention to input and hybrid-style training. Compared to existing poetry generation methods, our model can generate much more theme-consistent and semantic-rich poems.

preprint2016arXiv

Chinese Song Iambics Generation with Neural Attention-based Model

Learning and generating Chinese poems is a charming yet challenging task. Traditional approaches involve various language modeling and machine translation techniques, however, they perform not as well when generating poems with complex pattern constraints, for example Song iambics, a famous type of poems that involve variable-length sentences and strict rhythmic patterns. This paper applies the attention-based sequence-to-sequence model to generate Chinese Song iambics. Specifically, we encode the cue sentences by a bi-directional Long-Short Term Memory (LSTM) model and then predict the entire iambic with the information provided by the encoder, in the form of an attention-based LSTM that can regularize the generation process by the fine structure of the input cues. Several techniques are investigated to improve the model, including global context integration, hybrid style training, character vector initialization and adaptation. Both the automatic and subjective evaluation results show that our model indeed can learn the complex structural and rhythmic patterns of Song iambics, and the generation is rather successful.

preprint2016arXiv

Chromatic Effect for THz Generation in a Novel Wave-front Tilt Scheme

Deriving single or few cycle terahertz pulse (THz) by intense femtosecond laser through cascaded optical rectification in electro-optic crystals is a crucial technique in cutting-edge time-resolved spectroscopy to characterize micro-scale structures and ultrafast dynamics. In the past decade, lithium niobate (LN) crystal implementation of wave-front tilt scheme has been prevalently used, while painstaking efforts have been invested in order to achieve higher THz conversion efficiency. In this research we developed a brand new type of LN crystal possessing dual-face-cut and Brewster coupling, and conducted experimental and simulative investigation systematically to optimize the multi-dimensionally entangled parameters in THz generation, predicting the extreme conversion efficiency of 10% is potentially promising at the THz absorption coefficient of 0.5cm-1. More remarkably, we first discovered that the chirp of the driving laser pulse plays a decisive role in the wave-front tilt scheme, and the THz generation efficiency could be enhanced tremendously by applying an appropriate chirp.

preprint2016arXiv

Deflecting cavity dynamics for time-resolved machine studies of SXFEL user facility

Radio frequency deflectors are widely used for time-resolved electron beam energy, emittance and radiation profile measurements in modern free electron laser facilities. In this paper, we present the beam dynamics aspects of the deflecting cavity of SXFEL user facility, which is located at the exit of the undulator. With a targeted time resolution around 10 fs, it is expected to be an important tool for time-resolved commissioning and machine studies for SXFEL user facility.

preprint2016arXiv

Gender Differences in Communication Behaviors, Spatial Proximity Patterns, and Mobility Habits

The existence of gender differences in the structure and composition of social networks is a well established finding in the social and behavioral sciences, but researchers continue to debate whether structural, dispositional, or life course factors are the primary driver of these differences. In this paper we extend work on gender differences in social networks to patterns of interaction, propinquity, and connectivity captured via a social sensing platform comprised of an ensemble of individuals' phone calls, text messaging, face-to-face interactions, and traces of their mobility activities. We attempt to isolate dispositional from other factors by focusing on a relatively homogeneous population on a relatively closed setting at the same stage in the life course. Analysis across three different networks along with mobility data reveals systematic gender differences in terms of communicative, distributional, mobility, and spatial proximity tendencies. Most importantly, some patterns observed in the communication network (e.g. stronger same-gender preference for women) are found to be reversed in the spatial proximity context, with men displaying a greater tendency to spend time in a narrower (and thus more predictable) range of spaces with same-gender peers than women. These findings provide robust and novel evidence for the powerful effect of gender in structuring behavior across a wide range of communication and mobility behaviors, validating and extending recent work pointing to differences of socio-cultural and evolutionary origin in the styles of sociability and interaction characteristic of men and women.

preprint2016arXiv

Generating coherent soft x-ray pulses in the water window with a high-brightness seeded free-electron laser

We propose a new scheme to generate high-brightness and temporal coherent soft x-ray radiation in a seeded free-electron laser. The proposed scheme is based the coherent harmonic generation (CHG) and superradiant principles. A CHG scheme is first used to generate coherent signal at ultra-high harmonics of the seed. This coherent signal is then amplified by a series of chicane-undulator modules via the fresh bunch and superradiant processes in the following radiator. Using a representative of realistic set of parameters, three-dimensional simulations have been carried out and the simulations results demonstrated that 10 GW-level ultra-short coherent radiation pulses in the water window can be achieved by using the proposed technique.

preprint2016arXiv

Local Training for PLDA in Speaker Verification

PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification. However, PLDA training requires a large amount of labeled development data, which is highly expensive in most cases. A possible approach to mitigate the problem is various unsupervised adaptation methods, which use unlabeled data to adapt the PLDA scattering matrices to the target domain. In this paper, we present a new `local training' approach that utilizes inaccurate but much cheaper local labels to train the PLDA model. These local labels discriminate speakers within a single conversion only, and so are much easier to obtain compared to the normal `global labels'. Our experiments show that the proposed approach can deliver significant performance improvement, particularly with limited globally-labeled data.

preprint2016arXiv

Max-margin Metric Learning for Speaker Recognition

Probabilistic linear discriminant analysis (PLDA) is a popular normalization approach for the i-vector model, and has delivered state-of-the-art performance in speaker recognition. A potential problem of the PLDA model, however, is that it essentially assumes Gaussian distributions over speaker vectors, which is not always true in practice. Additionally, the objective function is not directly related to the goal of the task, e.g., discriminating true speakers and imposters. In this paper, we propose a max-margin metric learning approach to solve the problems. It learns a linear transform with a criterion that the margin between target and imposter trials are maximized. Experiments conducted on the SRE08 core test show that compared to PLDA, the new approach can obtain comparable or even better performance, though the scoring is simply a cosine computation.

preprint2016arXiv

Modeling the disequilibrium species for Jupiter and Saturn: Implications for Juno and Saturn entry probe

Disequilibrium species have been used previously to probe the deep water abundances and the eddy diffusion coefficient for giant planets. In this paper, we present a diffusion-kinetics code that predicts the abundances of disequilibrium species in the tropospheres of Jupiter and Saturn with updated thermodynamic and kinetic data. The dependence on the deep water abundance and the eddy diffusion coefficient is investigated. We quantified the disagreements in CO kinetics that comes from using different reaction networks and identified C$_2$H$_6$ as a useful tracer for the eddy diffusion coefficient. We first apply a H/P/O reaction network to Jupiter and Saturn's atmospheres and suggest a new PH$_3$ destruction pathway. New chemical pathways for SiH$_4$ and GeH$_4$ destruction are also suggested, and another AsH$_3$ destruction pathway is investigated thanks to new thermodynamic and kinetic data. These new models should enhance the interpretation of the measurement of disequilibrium species by JIRAM on board Juno and allow disentangling between methods for constraining the Saturn's deep water abundance with the Saturn entry probes envisaged by NASA or ESA.

preprint2016arXiv

Modular knowledge systems accelerate human migration in asymmetric random environments

Migration is a key mechanism for expansion of communities. In spatially heterogeneous environments, rapidly gaining knowledge about the local environment is key to the evolutionary success of a migrating population. For historical human migration, environmental heterogeneity was naturally asymmetric in the north-south (NS) and east-west (EW) directions. We here consider the human migration process in the Americas, modeled as random, asymmetric, modularly correlated environments. Knowledge about the environments determines the fitness of each individual. We present a phase diagram for asymmetry of migration as a function of carrying capacity and fitness threshold. We find that the speed of migration is proportional to the inverse complement of the spatial environmental gradient, and in particular we find that north-south migration rates are lower than east-west migration rates when the environmental gradient is higher in the north-south direction. Communication of knowledge between individuals can help to spread beneficial knowledge within the population. The speed of migration increases when communication transmits pieces of knowledge that contribute in a modular way to the fitness of individuals. The results for the dependence of migration rate on asymmetry and modularity are consistent with existing archaeological observations. The results for asymmetry of genetic divergence are consistent with patterns of human gene flow.

preprint2016arXiv

Multi-task Recurrent Model for Speech and Speaker Recognition

Although highly correlated, speech and speaker recognition have been regarded as two independent tasks and studied by two communities. This is certainly not the way that people behave: we decipher both speech content and speaker traits at the same time. This paper presents a unified model to perform speech and speaker recognition simultaneously and altogether. The model is based on a unified neural network where the output of one task is fed to the input of the other, leading to a multi-task recurrent network. Experiments show that the joint model outperforms the task-specific models on both the two tasks.

preprint2016arXiv

Multi-task Recurrent Model for True Multilingual Speech Recognition

Research on multilingual speech recognition remains attractive yet challenging. Recent studies focus on learning shared structures under the multi-task paradigm, in particular a feature sharing structure. This approach has been found effective to improve performance on each individual language. However, this approach is only useful when the deployed system supports just one language. In a true multilingual scenario where multiple languages are allowed, performance will be significantly reduced due to the competition among languages in the decoding space. This paper presents a multi-task recurrent model that involves a multilingual speech recognition (ASR) component and a language recognition (LR) component, and the ASR component is informed of the language information by the LR component, leading to a language-aware recognition. We tested the approach on an English-Chinese bilingual recognition task. The results show that the proposed multi-task recurrent model can improve performance of multilingual recognition systems.

preprint2016arXiv

Nonintersecting Brownian bridges between reflecting or absorbing walls

We study a model of nonintersecting Brownian bridges on an interval with either absorbing or reflecting walls at the boundaries, focusing on the point in space-time at which the particles meet the wall. These processes are determinantal, and in different scaling limits when the particles approach the reflecting (resp. absorbing) walls we obtain hard-edge limiting kernels which are the even (resp. odd) parts of the Pearcey and tacnode kernels. We also show that in the single time case, our hard-edge tacnode kernels are equivalent to the ones studied by Delvaux [16], defined in terms of a $4\times 4$ Lax pair for the inhomogeneous Painlevé II equation (PII). As a technical ingredient in the proof, we construct a Schlesinger transform for the $4 \times 4$ Lax pair in [16] which preserves the Hastings--McLeod solutions to PII.

preprint2016arXiv

Nonintersecting Brownian motions on the unit circle

We consider an ensemble of $n$ nonintersecting Brownian particles on the unit circle with diffusion parameter $n^{-1/2}$, which are conditioned to begin at the same point and to return to that point after time $T$, but otherwise not to intersect. There is a critical value of $T$ which separates the subcritical case, in which it is vanishingly unlikely that the particles wrap around the circle, and the supercritical case, in which particles may wrap around the circle. In this paper, we show that in the subcritical and critical cases the probability that the total winding number is zero is almost surely 1 as $n\to\infty$, and in the supercritical case that the distribution of the total winding number converges to the discrete normal distribution. We also give a streamlined approach to identifying the Pearcey and tacnode processes in scaling limits. The formula of the tacnode correlation kernel is new and involves a solution to a Lax system for the Painlevé II equation of size 2 $\times$ 2. The proofs are based on the determinantal structure of the ensemble, asymptotic results for the related system of discrete Gaussian orthogonal polynomials, and a formulation of the correlation kernel in terms of a double contour integral.

preprint2016arXiv

OC16-CE80: A Chinese-English Mixlingual Database and A Speech Recognition Baseline

We present the OC16-CE80 Chinese-English mixlingual speech database which was released as a main resource for training, development and test for the Chinese-English mixlingual speech recognition (MixASR-CHEN) challenge on O-COCOSDA 2016. This database consists of 80 hours of speech signals recorded from more than 1,400 speakers, where the utterances are in Chinese but each involves one or several English words. Based on the database and another two free data resources (THCHS30 and the CMU dictionary), a speech recognition (ASR) baseline was constructed with the deep neural network-hidden Markov model (DNN-HMM) hybrid system. We then report the baseline results following the MixASR-CHEN evaluation rules and demonstrate that OC16-CE80 is a reasonable data resource for mixlingual research.

preprint2016arXiv

PipeCNN: An OpenCL-Based FPGA Accelerator for Large-Scale Convolution Neuron Networks

Convolutional neural networks (CNNs) have been widely employed in many applications such as image classification, video analysis and speech recognition. Being compute-intensive, CNN computations are mainly accelerated by GPUs with high power dissipations. Recently, studies were carried out exploiting FPGA as CNN accelerator because of its reconfigurability and energy efficiency advantage over GPU, especially when OpenCL-based high-level synthesis tools are now available providing fast verification and implementation flows. Previous OpenCL-based design only focused on creating a generic framework to identify performance-related hardware parameters, without utilizing FPGA's special capability of pipelining kernel functions to minimize memory bandwidth requirement. In this work, we propose an FPGA accelerator with a new architecture of deeply pipelined OpenCL kernels. Data reuse and task mapping techniques are also presented to improve design efficiency. The proposed schemes are verified by implementing two representative large-scale CNNs, AlexNet and VGG on Altera Stratix-V A7 FPGA. We have achieved a similar peak performance of 33.9 GOPS with a 34% resource reduction on DSP blocks compared to previous work. Our design is openly accessible and thus can be reused to explore new architectures for neural network accelerators.

preprint2016arXiv

Recurrent Neural Network Training with Dark Knowledge Transfer

Recurrent neural networks (RNNs), particularly long short-term memory (LSTM), have gained much attention in automatic speech recognition (ASR). Although some successful stories have been reported, training RNNs remains highly challenging, especially with limited training data. Recent research found that a well-trained model can be used as a teacher to train other child models, by using the predictions generated by the teacher model as supervision. This knowledge transfer learning has been employed to train simple neural nets with a complex one, so that the final performance can reach a level that is infeasible to obtain by regular training. In this paper, we employ the knowledge transfer learning approach to train RNNs (precisely LSTM) using a deep neural network (DNN) model as the teacher. This is different from most of the existing research on knowledge transfer learning, since the teacher (DNN) is assumed to be weaker than the child (RNN); however, our experiments on an ASR task showed that it works fairly well: without applying any tricks on the learning scheme, this approach can train RNNs successfully even with limited training data.

preprint2016arXiv

System Combination for Short Utterance Speaker Recognition

For text-independent short-utterance speaker recognition (SUSR), the performance often degrades dramatically. This paper presents a combination approach to the SUSR tasks with two phonetic-aware systems: one is the DNN-based i-vector system and the other is our recently proposed subregion-based GMM-UBM system. The former employs phone posteriors to construct an i-vector model in which the shared statistics offers stronger robustness against limited test data, while the latter establishes a phone-dependent GMM-UBM system which represents speaker characteristics with more details. A score-level fusion is implemented to integrate the respective advantages from the two systems. Experimental results show that for the text-independent SUSR task, both the DNN-based i-vector system and the subregion-based GMM-UBM system outperform their respective baselines, and the score-level system combination delivers performance improvement.

preprint2016arXiv

Test of \textit{Topmetal-${II}^-$} In Liquid Nitrogen For Cryogenic Temperature TPCs

\textit{Topmetal-${II}^-$} is a highly pixelated direct charge sensor that contains a 72${\times}$72 pixel array of 83$μ$m pitch size. The key feature of \textit{Topmetal-${II}^-$} is that it can directly collect charges via metal nodes of each pixel to form two-dimensional images of charge cloud distributions. \textit{Topmetal-${II}^-$} was proved to measure charged particles without amplification at room temperature. To measure its performance at cryogenic temperature, a \textit{Topmetal-${II}^-$} sensor is embedded into a liquid nitrogen dewar. The results presented in this paper show that \textit{Topmetal-${II}^-$} can also operate well at this low temperature with a noise (ENC) of 12 e$^-$ lower than that at room temperature (13 e$^-$). From the noise perspective, \textit{Topmetal-${II}^-$} is a promising candidate for the next generation readout of liquid argon and xenon Time Projection Chamber (TPC) used in experiments searching for neutrinoless double beta decay and dark matter.

preprint2016arXiv

The Transition Probability of the $q$-TAZRP ($q$-Bosons) with Inhomogeneous Jump Rates

In this paper we consider the $q$-deformed totally asymmetric zero range process ($q$-TAZRP), also known as the $q$-boson (stochastic) particle system, on the ${\mathbb Z}$ lattice, such that the jump rate of a particle depends on the site where it is on the lattice. We derive the transition probability for an $n$ particle process in Bethe ansatz form as a sum of $n!$ $n$-fold contour integrals. Our result generalizes the transition probability formula by Korhonen and Lee for $q$-TAZRP with a homogeneous lattice, and our method follows the same approach as theirs.

preprint2016arXiv

Two Lax systems for the Painlevé II equation, and two related kernels in random matrix theory

We consider two Lax systems for the homogeneous Painlevé II equation: one of size $2\times 2$ studied by Flaschka and Newell in the early 1980's, and one of size $4\times 4$ introduced by Delvaux-Kuijlaars-Zhang and Duits-Geudens in the early 2010's. We prove that solutions to the $4\times 4$ system can be derived from those to the $2\times 2$ system via an integral transform, and consequently relate the Stokes multipliers for the two systems. As corollaries we are able to express two kernels for determinantal processes as contour integrals involving the Flaschka-Newell Lax system: the tacnode kernel arising in models of nonintersecting paths, and a critical kernel arising in a two-matrix model.

preprint2016arXiv

Wakefields studies for the SXFEL user facility

Besides the original seeded undulator line, in the Soft X-ray free-electron laser (SXFEL) user facility at Shanghai, a second undulator line based on self-amplified spontaneous emission is proposed to achieve 2 nm laser pulse with extremely high brightness. In this paper, the beam energy deviation induced by the undulator wakefields is numerically obtained, and it is verified to have a good agreement between 3D and 2D simulation results. The beam energy loss along the undulator degrades the expected FEL output performance. Impact of wakefields on pulse energy, radiation power and spectrum is discussed, as well as the benefits of compensation obtained with a taper in the undulator field. And using the planned SXFEL diagnostic, a longitudinal wakefields measurement experiment is proposed and simulated.

preprint2015arXiv

Bulk and soft-edge universality for singular values of products of Ginibre random matrices

It has been shown by Akemann, Ipsen and Kieburg that the squared singular values of products of $M$ rectangular random matrices with independent complex Gaussian entries are distributed according to a determinantal point process with a correlation kernel that admits a representation in terms of Meijer G-functions. We prove the universality of the local statistics of the squared singular values, namely, the bulk universality given by the sine kernel and the edge universality given by the Airy kernel. The proof is based on the asymptotic analysis for the double contour integral representation of the correlation kernel. Our strategy can be generalized to deal with other models of products of random matrices introduced recently and to establish similar universal results. Two more examples are investigated, one is the product of $M$ Ginibre matrices and the inverse of $K$ Ginibre matrices studied by Forrester, and the other one is the product of $M-1$ Ginibre matrices with one truncated unitary matrix considered by Kuijlaars and Stivigny.

preprint2015arXiv

Deep Representation of Facial Geometric and Photometric Attributes for Automatic 3D Facial Expression Recognition

In this paper, we present a novel approach to automatic 3D Facial Expression Recognition (FER) based on deep representation of facial 3D geometric and 2D photometric attributes. A 3D face is firstly represented by its geometric and photometric attributes, including the geometry map, normal maps, normalized curvature map and texture map. These maps are then fed into a pre-trained deep convolutional neural network to generate the deep representation. Then the facial expression prediction is simplyachieved by training linear SVMs over the deep representation for different maps and fusing these SVM scores. The visualizations show that the deep representation provides a complete and highly discriminative coding scheme for 3D faces. Comprehensive experiments on the BU-3DFE database demonstrate that the proposed deep representation can outperform the widely used hand-crafted descriptors (i.e., LBP, SIFT, HOG, Gabor) and the state-of-art approaches under the same experimental protocols.

preprint2015arXiv

Deep Speaker Vectors for Semi Text-independent Speaker Verification

Recent research shows that deep neural networks (DNNs) can be used to extract deep speaker vectors (d-vectors) that preserve speaker characteristics and can be used in speaker verification. This new method has been tested on text-dependent speaker verification tasks, and improvement was reported when combined with the conventional i-vector method. This paper extends the d-vector approach to semi text-independent speaker verification tasks, i.e., the text of the speech is in a limited set of short phrases. We explore various settings of the DNN structure used for d-vector extraction, and present a phone-dependent training which employs the posterior features obtained from an ASR system. The experimental results show that it is possible to apply d-vectors on semi text-independent speaker recognition, and the phone-dependent training improves system performance.

preprint2015arXiv

Demonstration of nonlinear-energy-spread compensation in relativistic electron bunches with corrugated structures

High quality electron beams with flat distributions in both energy and current are critical for many accelerator-based scientific facilities such as free-electron lasers and MeV ultrafast electron diffraction and microscopes. In this Letter we report on using corrugated structures to compensate for the beam nonlinear energy chirp imprinted by the curvature of the radio-frequency field, leading to a significant reduction in beam energy spread. By using a pair of corrugated structures with orthogonal orientations, we show that the quadrupole wake fields which otherwise increase beam emittance can be effectively canceled. This work also extends the applications of corrugated structures to the low beam charge (a few pC) and low beam energy (a few MeV) regime and may have a strong impact in many accelerator-based facilities.

preprint2015arXiv

Feedback-optimized Extraordinary Optical Transmission of Continuous-variable Entangled States

We report on the feedback-optimized extraordinary optical transmission of continuous-variable entangled states through a hexagonal metal-hole array. The continuous-variable entanglements from a nondegenerate optical parametric amplifier are first demonstrated to survive after a photon-plasmon-photon conversion process. By controlling the reflected light from the metal-hole array, a significant enhancement of quantum correlations has then been experimentally achieved comparing to the case of without such coherent feedback control. This result presents a useful technique to efficiently recover the substantial reflective losses in the plasmonic circuits for quantum information processing.

preprint2015arXiv

Fermionic non-Abelian fractional Chern insulators from dipolar interactions

We study fermions on a triangular lattice model that exhibits topological flatbands characterized by nonzero Chern numbers. Our scheme stems from the well-known Hofstadter model but the next-nearest-neighbor hopping is introduced, which is crucial for tuning the lowest band to be nearly flat. Differing from previous proposals with the necessity of multiparticle interactions, we consider the more realistic long-range dipolar interaction combined with two-body short-range attractions between fermions. We show the realization of the non-Abelian $ν=1/2$ Moore-Read fractional Chern insulators, and strong evidence for the existence of the more exotic $ν=3/5$ Read-Rezayi fractional Chern insulators. Our results provide insights for the experimental realization of these exotic states by realistic two-body interactions and thus facilitates the implementation of the universal topological quantum computation.

preprint2015arXiv

Fluctuations of TASEP and LPP with general initial data

We prove Airy process variational formulas for the one-point probability distribution of (discrete time parallel update) TASEP with general initial data, as well as last passage percolation from a general lattice path to a point. We also consider variants of last passage percolation with inhomogeneous parameter geometric weights and provide variational formulas of a similar nature. This proves one aspect of the conjectural description of the renormalization fixed point of the Kardar-Parisi-Zhang universality class.

preprint2015arXiv

Generalized Remote Preparation of Arbitrary $m$-qubit Entangled States via Genuine Entanglements

Herein, we present a feasible, general protocol for quantum communication within a network via generalized remote preparation of an arbitrary $m$-qubit entangled state designed with genuine tripartite Greenberger--Horne--Zeilinger-type entangled resources. During the implementations, we construct novel collective unitary operations; these operations are tasked with performing the necessary phase transfers during remote state preparations. We have distilled our implementation methods into a five-step procedure, which can be used to faithfully recover the desired state during transfer. Compared to previous existing schemes, our methodology features a greatly increased success probability. After the consumption of auxiliary qubits and the performance of collective unitary operations, the probability of successful state transfer is increased four-fold and eight-fold for arbitrary two- and three-qubit entanglements when compared to other methods within the literature, respectively. We conclude this paper with a discussion of the presented scheme for state preparation, including: success probabilities, reducibility and generalizability.

preprint2015arXiv

Improved Deep Speaker Feature Learning for Text-Dependent Speaker Recognition

A deep learning approach has been proposed recently to derive speaker identifies (d-vector) by a deep neural network (DNN). This approach has been applied to text-dependent speaker recognition tasks and shows reasonable performance gains when combined with the conventional i-vector approach. Although promising, the existing d-vector implementation still can not compete with the i-vector baseline. This paper presents two improvements for the deep learning approach: a phonedependent DNN structure to normalize phone variation, and a new scoring approach based on dynamic time warping (DTW). Experiments on a text-dependent speaker recognition task demonstrated that the proposed methods can provide considerable performance improvement over the existing d-vector implementation.

preprint2015arXiv

Joint Source Selection and Data Extrapolation in Social Sensing for Disaster Response

This paper complements the large body of social sensing literature by developing means for augmenting sensing data with inference results that "fill-in" missing pieces. It specifically explores the synergy between (i) inference techniques used for filling-in missing pieces and (ii) source selection techniques used to determine which pieces to retrieve in order to improve inference results. We focus on prediction in disaster scenarios, where disruptive trend changes occur. We first discuss our previous conference study that compared a set of prediction heuristics and developed a hybrid prediction algorithm. We then enhance the prediction scheme by considering algorithms for sensor selection that improve inference quality. Our proposed source selection and extrapolation algorithms are tested using data collected during the New York City crisis in the aftermath of Hurricane Sandy in November 2012. The evaluation results show that consistently good predictions are achieved. The work is notable for addressing the bi-modal nature of damage propagation in complex systems subjected to stress, where periods of calm are interspersed with periods of severe change. It is novel in offering a new solution to the problem that jointly leverages source selection and extrapolation components thereby improving the results.

preprint2015arXiv

Knowledge Transfer Pre-training

Pre-training is crucial for learning deep neural networks. Most of existing pre-training methods train simple models (e.g., restricted Boltzmann machines) and then stack them layer by layer to form the deep structure. This layer-wise pre-training has found strong theoretical foundation and broad empirical support. However, it is not easy to employ such method to pre-train models without a clear multi-layer structure,e.g., recurrent neural networks (RNNs). This paper presents a new pre-training approach based on knowledge transfer learning. In contrast to the layer-wise approach which trains model components incrementally, the new approach trains the entire model as a whole but with an easier objective function. This is achieved by utilizing soft targets produced by a prior trained model (teacher model). Compared to the conventional layer-wise methods, this new method does not care about the model structure, so can be used to pre-train very complex models. Experiments on a speech recognition task demonstrated that with this approach, complex RNNs can be well trained with a weaker deep neural network (DNN) model. Furthermore, the new method can be combined with conventional layer-wise pre-training to deliver additional gains.

preprint2015arXiv

Learning from LDA using Deep Neural Networks

Latent Dirichlet Allocation (LDA) is a three-level hierarchical Bayesian model for topic inference. In spite of its great success, inferring the latent topic distribution with LDA is time-consuming. Motivated by the transfer learning approach proposed by~\newcite{hinton2015distilling}, we present a novel method that uses LDA to supervise the training of a deep neural network (DNN), so that the DNN can approximate the costly LDA inference with less computation. Our experiments on a document classification task show that a simple DNN can learn the LDA behavior pretty well, while the inference is speeded up tens or hundreds of times.

preprint2015arXiv

Learning Speech Rate in Speech Recognition

A significant performance reduction is often observed in speech recognition when the rate of speech (ROS) is too low or too high. Most of present approaches to addressing the ROS variation focus on the change of speech signals in dynamic properties caused by ROS, and accordingly modify the dynamic model, e.g., the transition probabilities of the hidden Markov model (HMM). However, an abnormal ROS changes not only the dynamic but also the static property of speech signals, and thus can not be compensated for purely by modifying the dynamic model. This paper proposes an ROS learning approach based on deep neural networks (DNN), which involves an ROS feature as the input of the DNN model and so the spectrum distortion caused by ROS can be learned and compensated for. The experimental results show that this approach can deliver better performance for too slow and too fast utterances, demonstrating our conjecture that ROS impacts both the dynamic and the static property of speech. In addition, the proposed approach can be combined with the conventional HMM transition adaptation method, offering additional performance gains.

preprint2015arXiv

Modularity Enhances the Rate of Evolution in a Rugged Fitness Landscape

Biological systems are modular, and this modularity affects the evolution of biological systems over time and in different environments. We here develop a theory for the dynamics of evolution in a rugged, modular fitness landscape. We show analytically how horizontal gene transfer couples to the modularity in the system and leads to more rapid rates of evolution at short times. The model, in general, analytically demonstrates a selective pressure for the prevalence of modularity in biology. We use this model to show how the evolution of the influenza virus is affected by the modularity of the proteins that are recognized by the human immune system. Approximately 25\% of the observed rate of fitness increase of the virus could be ascribed to a modular viral landscape.

preprint2015arXiv

Nanoscale $β$-Nuclear Magnetic Resonance Depth Imaging of Topological Insulators

Considerable evidence suggests that variations in the properties of topological insulators (TIs) at the nanoscale and at interfaces can strongly affect the physics of topological materials. Therefore, a detailed understanding of surface states and interface coupling is crucial to the search for and applications of new topological phases of matter. Currently, no methods can provide depth profiling near surfaces or at interfaces of topologically inequivalent materials. Such a method could advance the study of interactions. Herein we present a non-invasive depth-profiling technique based on $β$-NMR spectroscopy of radioactive $^8$Li$^+$ ions that can provide "one-dimensional imaging" in films of fixed thickness and generates nanoscale views of the electronic wavefunctions and magnetic order at topological surfaces and interfaces. By mapping the $^8$Li nuclear resonance near the surface and 10 nm deep into the bulk of pure and Cr-doped bismuth antimony telluride films, we provide signatures related to the TI properties and their topological non-trivial characteristics that affect the electron-nuclear hyperfine field, the metallic shift and magnetic order. These nanoscale variations in $β$-NMR parameters reflect the unconventional properties of the topological materials under study, and understanding the role of heterogeneities is expected to lead to the discovery of novel phenomena involving quantum materials.

preprint2015arXiv

Recognize Foreign Low-Frequency Words with Similar Pairs

Low-frequency words place a major challenge for automatic speech recognition (ASR). The probabilities of these words, which are often important name entities, are generally under-estimated by the language model (LM) due to their limited occurrences in the training data. Recently, we proposed a word-pair approach to deal with the problem, which borrows information of frequent words to enhance the probabilities of low-frequency words. This paper presents an extension to the word-pair method by involving multiple `predicting words' to produce better estimation for low-frequency words. We also employ this approach to deal with out-of-language words in the task of multi-lingual speech recognition.

preprint2015arXiv

Relation Classification via Recurrent Neural Network

Deep learning has gained much success in sentence-level relation classification. For example, convolutional neural networks (CNN) have delivered competitive performance without much effort on feature engineering as the conventional pattern-based methods. Thus a lot of works have been produced based on CNN structures. However, a key issue that has not been well addressed by the CNN-based method is the lack of capability to learn temporal features, especially long-distance dependency between nominal pairs. In this paper, we propose a simple framework based on recurrent neural networks (RNN) and compare it with CNN-based model. To show the limitation of popular used SemEval-2010 Task 8 dataset, we introduce another dataset refined from MIMLRE(Angeli et al., 2014). Experiments on two different datasets strongly indicates that the RNN-based model can deliver better performance on relation classification, and it is particularly capable of learning long-distance relation patterns. This makes it suitable for real-world applications where complicated expressions are often involved.

preprint2015arXiv

Self-organized magnetic particles to tune the mechanical behaviour of a granular system

Above a certain density a granular material jams. This property can be controlled by either tuning a global property, such as the packing fraction or by applying shear strain, or at the micro-scale by tuning grain shape, inter-particle friction or externally controlled organization. Here, we introduce a novel way to change a local granular property by adding a weak anisotropic magnetic interaction between particles. We measure the evolution of the pressure, $P$, and coordination number, $Z$, for a packing of 2D photo-elastic disks, subject to uniaxial compression. Some of the particles have embedded cuboidal magnets. The strength of the magnetic interactions between particles are too weak to have a strong direct effect on $P$ or $Z$ when the system is jammed. However, the magnetic interactions play an important role in the evolution of latent force networks when systems containing a large enough fraction of the particles with magnets are driven through unjammed states. In this case, a statistically stable network of magnetic chains self-organizes and overlaps with force chains, strengthening the granular medium. We believe this property can be used to reversibly control mechanical properties of granular materials.

preprint2015arXiv

Stochastic Top-k ListNet

ListNet is a well-known listwise learning to rank model and has gained much attention in recent years. A particular problem of ListNet, however, is the high computation complexity in model training, mainly due to the large number of object permutations involved in computing the gradients. This paper proposes a stochastic ListNet approach which computes the gradient within a bounded permutation subset. It significantly reduces the computation complexity of model training and allows extension to Top-k models, which is impossible with the conventional implementation based on full-set permutations. Meanwhile, the new approach utilizes partial ranking information of human labels, which helps improve model quality. Our experiments demonstrated that the stochastic ListNet method indeed leads to better ranking performance and speeds up the model training remarkably.

preprint2015arXiv

THCHS-30 : A Free Chinese Speech Corpus

Speech data is crucially important for speech recognition research. There are quite some speech databases that can be purchased at prices that are reasonable for most research institutes. However, for young people who just start research activities or those who just gain initial interest in this direction, the cost for data is still an annoying barrier. We support the `free data' movement in speech recognition: research institutes (particularly supported by public funds) publish their data freely so that new researchers can obtain sufficient data to kick of their career. In this paper, we follow this trend and release a free Chinese speech database THCHS-30 that can be used to build a full- edged Chinese speech recognition system. We report the baseline system established with this database, including the performance under highly noisy conditions.

preprint2015arXiv

Transfer Learning for Speech and Language Processing

Transfer learning is a vital technique that generalizes models trained for one setting or task to other settings or tasks. For example in speech recognition, an acoustic model trained for one language can be used to recognize speech in another language, with little or no re-training data. Transfer learning is closely related to multi-task learning (cross-lingual vs. multilingual), and is traditionally studied in the name of `model adaptation'. Recent advance in deep learning shows that transfer learning becomes much easier and more effective with high-level abstract features learned by deep models, and the `transfer' can be conducted not only between data distributions and data types, but also between model structures (e.g., shallow nets and deep nets) or even model types (e.g., Bayesian models and neural models). This review paper summarizes some recent prominent research towards this direction, particularly for speech and language processing. We also report some results from our group and highlight the potential of this very interesting research field.

preprint2015arXiv

Unsupervised Feature Learning with C-SVDDNet

In this paper, we investigate the problem of learning feature representation from unlabeled data using a single-layer K-means network. A K-means network maps the input data into a feature representation by finding the nearest centroid for each input point, which has attracted researchers' great attention recently due to its simplicity, effectiveness, and scalability. However, one drawback of this feature mapping is that it tends to be unreliable when the training data contains noise. To address this issue, we propose a SVDD based feature learning algorithm that describes the density and distribution of each cluster from K-means with an SVDD ball for more robust feature representation. For this purpose, we present a new SVDD algorithm called C-SVDD that centers the SVDD ball towards the mode of local density of each cluster, and we show that the objective of C-SVDD can be solved very efficiently as a linear programming problem. Additionally, traditional unsupervised feature learning methods usually take an average or sum of local representations to obtain global representation which ignore spatial relationship among them. To use spatial information we propose a global representation with a variant of SIFT descriptor. The architecture is also extended with multiple receptive field scales and multiple pooling sizes. Extensive experiments on several popular object recognition benchmarks, such as STL-10, MINST, Holiday and Copydays shows that the proposed C-SVDDNet method yields comparable or better performance than that of the previous state of the art methods.

preprint2015arXiv

VMF-SNE: Embedding for Spherical Data

T-SNE is a well-known approach to embedding high-dimensional data and has been widely used in data visualization. The basic assumption of t-SNE is that the data are non-constrained in the Euclidean space and the local proximity can be modelled by Gaussian distributions. This assumption does not hold for a wide range of data types in practical applications, for instance spherical data for which the local proximity is better modelled by the von Mises-Fisher (vMF) distribution instead of the Gaussian. This paper presents a vMF-SNE embedding algorithm to embed spherical data. An iterative process is derived to produce an efficient embedding. The results on a simulation data set demonstrated that vMF-SNE produces better embeddings than t-SNE for spherical data.

preprint2015arXiv

Wakefield issue and its impact on X-ray photon pulse in the SXFEL test facility

Besides the designed beam acceleration, the energy of electrons changed by the longitudinal wakefields in a real free-electron laser (FEL) facility, which may degrade FEL performances from the theoretical expectation. In this paper, with the help of simulation codes, the wakefields induced beam energy loss in the sophisticated undulator section is calculated for Shanghai soft X-ray FEL, which is a two-stage seeded FEL test facility. While the 1st stage 44 nm FEL output is almost not affected by the wakefields, it is found that a beam energy loss about 0.8 MeV degrades the peak brightness of the 2nd stage 8.8 nm FEL by a factor of 1.6, which however can be compensated by a magnetic field fine tuning of each undulator segment.

preprint2014arXiv

Beam energy distribution influences on density modulation efficiency in seeded free-electron lasers

The beam energy spread at the entrance of undulator system is of paramount importance for efficient density modulation in high-gain seeded free-electron lasers (FELs). In this paper, the dependences of high harmonic micro-bunching in the high-gain harmonic generation (HGHG), echo-enabled harmonic generation (EEHG) and phase-merging enhanced harmonic generation (PEHG) schemes on the electron energy spread distribution are studied. Theoretical investigations and multi-dimensional numerical simulations are applied to the cases of uniform and saddle beam energy distributions and compared to a traditional Gaussian distribution. It shows that the uniform and saddle electron energy distributions significantly enhance the performance of HGHG-FELs, while they almost have no influence on EEHG and PEHG schemes. A numerical example demonstrates that, with about 84keV RMS uniform and/or saddle slice energy spread, the 30th harmonic radiation can be directly generated by a single-stage seeding scheme for a soft x-ray FEL facility.

preprint2014arXiv

Compensating the electron beam energy spread by the natural transverse gradient of laser undulator in all-optical x-ray light sources

All-optical ideas provide a potential to dramatically cut off the size and cost of x-ray light sources to the university-laboratory scale, with the combination of the laser-plasma accelerator and the laser undulator. However, the large longitudinal energy spread of the electron beam from laser-plasma accelerator may hinder the way to high brightness of these all-optical light sources. In this paper, the beam energy spread effect is proposed to be significantly compensated by the natural transverse gradient of a laser undulator when properly transverse-dispersing the electron beam. Theoretical analysis and numerical simulations on conventional laser-Compton scattering sources and high-gain all-optical x-ray free-electron lasers with the electron beams from laser-plasma accelerators are presented.

preprint2014arXiv

Decoy state measurement-device-independent quantum key distribution based on the Clauser-Horne-Shimony-Holt inequality

The measurement-device-independent quantum key distribution (MDI-QKD) protocol is proposed to remove the detector side channel attacks, while its security relies on the assumption that the encoding systems are perfectly characterized. In contrast, the MDI-QKD protocol based on the Clauser-Horne-Shimony-Holt inequality (CHSH-MDI-QKD) weakens this assumption, which only requires the quantum state to be prepared in the two-dimensional Hilbert space and the devices are independent. In experimental realizations, the weak coherent state, which is always used in QKD systems due to the lack of an ideal single photon source, may be prepared in the high-dimensional space. In this paper, we investigate the decoy-state CHSH-MDI-QKD protocol with $s(3 \le s \le 5)$ intensities, including one signal state and $s-1$ decoy states, and we also consider the finite-size effect on the decoy-state CHSH-MDI-QKD protocol with five intensities. Simulation results show that this scheme is very practical.

preprint2014arXiv

Development of a highly pixelated direct charge sensor, Topmetal-I, for ionizing radiation imaging

Using industrial standard 0.35μm CMOS Integrated Circuit process, we realized a highly pixelated sensor that directly collects charge via metal nodes placed on the top of each pixel and forms two dimensional images of charge cloud distribution. The first version, Topmetal-I, features a 64x64 pixel array of 80μm pitch size. Direct charge calibration reveals an average capacitance of 210fF per pixel. The charge collection noise is near the thermal noise limit. With the readout, individual pixel channels exhibit a most probable equivalent noise charge of 330e-.

preprint2014arXiv

Experimental demonstration of longitudinal beam phase space linearizer in a free-electron laser facility by corrugated structures

Removal of residual linear energy chirp and intrinsic nonlinear energy curvature in the relativistic electron beam from radiofrequency linear accelerator is of paramount importance for efficient lasing of a high-gain free-electron laser. Recently, it was theoretically and experimentally demonstrated that the longitudinal wakefield excited by the electrons itself in the corrugated structure allows for precise control of the electron beam phase space. In this Letter, we report the first utilization of a corrugated structure as beam linearizer in the operation of a seeded free-electron laser driven by a 140 MeV linear accelerator, where a gain of ~10,000 over spontaneous emission was achieved at the second harmonic of the 1047 nm seed laser, and a free-electron laser bandwidth narrowing by about 50% was observed, in good agreement with the theoretical expectations.

preprint2014arXiv

Field and long-term demonstration of a wide area quantum key distribution network

A wide area quantum key distribution (QKD) network deployed on communication infrastructures provided by China Mobile Ltd. is demonstrated. Three cities and two metropolitan area QKD networks were linked up to form the Hefei-Chaohu-Wuhu wide area QKD network with over 150 kilometers coverage area, in which Hefei metropolitan area QKD network was a typical full-mesh core network to offer all-to-all interconnections, and Wuhu metropolitan area QKD network was a representative quantum access network with point-to-multipoint configuration. The whole wide area QKD network ran for more than 5000 hours, from 21 December 2011 to 19 July 2012, and part of the network stopped until last December. To adapt to the complex and volatile field environment, the Faraday-Michelson QKD system with several stability measures was adopted when we designed QKD devices. Through standardized design of QKD devices, resolution of symmetry problem of QKD devices, and seamless switching in dynamic QKD network, we realized the effective integration between point-to-point QKD techniques and networking schemes.

preprint2014arXiv

Multiparty-controlled remote preparation of four-qubit cluster-type entangled states

We present a strategy for implementing multiparty-controlled remote state preparation (MCRSP) for a family of four-qubit cluster-type states with genuine entanglements while employing, Greenberg-Horne-Zeilinger-class states as quantum channels. In this scenario, the encoded information is transmitted from the sender to a spatially separated receiver via the control of multi-party. Predicated on the collaboration of all participants, the desired state can be entirely restored within the receiver's place with high success probability by application of appropriate local operations and necessary classical communication . Moreover, this proposal for MCRSP can be faithfully achieved with unit total success probability when the quantum channels are distilled to maximally entangled ones.

preprint2014arXiv

New Insights on Jupiter's Deep Water Abundance from Disequilibrium Species

The bulk water abundance on Jupiter potentially constrains the planet's formation conditions. We improve the chemical constraints on Jupiter's deep water abundance in this paper. The eddy diffusion coefficient is used to model vertical mixing in planetary atmosphere, and based on laboratory studies dedicated to turbulent rotating convection, we propose a new formulation of the eddy diffusion coefficient for the troposphere of giant planets. The new formulation predicts a smooth transition from the slow rotation regime (near the equator) to the rapid rotation regime (near the pole). We estimate an uncertainty for the newly derived coefficient of less than 25$\%$, which is much better than the one order of magnitude uncertainty used in the literature. We then reevaluate the water constraint provided by CO, using the newer eddy diffusion coefficient. We considered two updated CO kinetic models, one model constrains the water enrichment (relative to solar) between 0.1 and 0.75, while the other constrains the water enrichment between 3 and 11.

preprint2014arXiv

Single-shot measurement of free-electron laser polarization at SDUV-FEL

In this paper, a division-of-amplitude photopolarimeter (DOAP) for measuring the polarization state of free-electron laser (FEL) pulse is described. The incident FEL beam is divided into four separate beams, and four Stokes parameters can be measured in a single-shot. In the crossed-planar undulators experiment at Shanghai deep ultraviolet FEL test facility, this DOAP instrument constructed in house responses accurately and timely while the polarization-state of fully coherent FEL pulses are switched, which is helpful for confirming the crossed-planar undulators technique for short-wavelength FELs.

preprint2013arXiv

$ρ- ω$ Mixing in $J/ψ\to VP$ Decays

The study on $ρ-ω$ mixing is mainly focused on vector mesons decay with isospin I=1, namely $ρ(ω)\toπ^+π^-$ process. In this paper, we present the study of $ρ-ω$ mixing in $ρ(ω)\toπ^+π^-π^0$ (I=0) using a flavor parameterization model for $J/ψ\to VP$ process. By fitting theoretical frame to PDG data, we obtain the SU(3)-breaking effect parameters $s_V=0.03\pm 0.12,\ s_P=0.17\pm 0.17$ and the $ρ-ω$ mixing polarization operator $Π_{ρω}=0.006\pm 0.011\text{\text{GeV}}^2$. The branching ratios are also renewed when mixing effect is incorporated: $Br(J/ψ\to ωπ^0) = (3.64 \pm 0.37)\times 10^{-4}$, $Br(J/ψ\to ωη) = (1.48 \pm 0.17)\times 10^{-3}$, $Br(J/ψ\to ωη^{\prime}) = (1.55\pm 0.56)\times 10^{-4}$; they are different from the corresponding PDG2012 values by 19%, 15% and 15%, respectively.

preprint2013arXiv

Fast Polarization Switching Demonstration Using Crossed-Planar Undulator in a Seeded Free Electron Laser

Fast polarization switching of light sources is required over a wide spectral range to investigate the symmetry of matter. In this Letter, we report the first experimental demonstration of the crossed-planar undulator technique at a seeded free-electron laser, which holds great promise for the full control and fast switching of the polarization of short-wavelength radiation. In the experiment, the polarization state of the coherent radiation at the 2nd harmonic of the seed laser is switched successfully. The experiment results confirm the theory, and pave the way for applying the crossed-planar undulator technique for the seeded X-ray free electron lasers.

preprint2013arXiv

FEL Polarization Control Studies on Dalian Coherent Light Source

The polarization switch of a free-electron laser (FEL) is of great importance to the user scientific community. In this paper, we investigate the generation of controllable polarization FEL from two well-known approaches for Dalian coherent light source, i.e., crossed planar undulator and elliptical permanent undulator. In order to perform a fair comparative study, a one-dimensional time-dependent FEL code has been developed, in which the imperfection effects of an elliptical permanent undulator are taken into account. Comprehensive simulation results indicate that the residual beam energy chirp and the intrinsic FEL gain may contribute to the degradation of the polarization performance for the crossed planar undulator. And the elliptical permanent undulator is not very sensitive to the undulator errors and beam imperfections. Meanwhile, with proper configurations of the main planar undulators and additional elliptical permanent undulator section, circular polarized FEL with pulse energy exceeds 100 $μ$J could be achieved at Dalian coherent light source.

preprint2013arXiv

Phase-merging Enhanced Harmonic Generation Free-electron Laser

Together with one of its variants, the recently proposed phase-merging enhanced harmonic generation (PEHG) free-electron lasers (FELs) have been systematically studied in this paper. Different form the standard-HGHG, a transverse gradient undulator is employed for introducing a phase-merging effect into the transversely dispersed electron beam in PEHG. The analytical theory of the phase-merging effect and the physical mechanism behind the phenomenon were presented. Using a representative and realistic set of beam parameters, intensive start-to-end simulations for soft x-ray FEL generation were given to illustrate the performance of PEHG. Moreover, some practical issues that may affect the performance of PEHG were also discussed.

preprint2013arXiv

Proposal for High-harmonic EEHG Lasing at Shanghai Deep Ultra-Violet Free-electron Laser

The echo-enabled harmonic generation (EEHG) free-electron laser (FEL) has been already demonstrated at lower harmonics and the first lasing at third harmonic also has been achieved at Shanghai deep ultra-violet FEL (SDUV-FEL). While the great advantage of much higher harmonic up-conversion efficiency of EEHG over other seeded FELs only shows evidently at much higher harmonics. In this paper, we investigate the possibility of EEHG lasing at 10-th harmonic of the seed laser at SDUV-FEL, both physical designs and numerical simulations have been studied carefully. Two proposals of EEHG at 10-th harmonic have been studied respectively, i.e. with the seed lasers of the same color and two difference colors, the simulation results indicate that both approaches could be the candidate for EEHG lasing at 10-th harmonic at SDUV-FEL, meanwhile the coherent synchrotron radiation does not affect the performance of EEHG-FEL but only slightly shifts the central radiation frequency.

preprint2013arXiv

Random matrices with equispaced external source

We study Hermitian random matrix models with an external source matrix which has equispaced eigenvalues, and with an external field such that the limiting mean density of eigenvalues is supported on a single interval as the dimension tends to infinity. We obtain strong asymptotics for the multiple orthogonal polynomials associated to these models, and as a consequence for the average characteristic polynomials. One feature of the multiple orthogonal polynomials analyzed in this paper is that the number of orthogonality weights of the polynomials grows with the degree. Nevertheless we are able to characterize them in terms of a pair of 2 x 1 vector-valued Riemann-Hilbert problems, and to perform an asymptotic analysis of the Riemann-Hilbert problems.

preprint2013arXiv

Random matrix minor processes related to percolation theory

This paper studies a number of matrix models of size n and the associated Markov chains for the eigenvalues of the models for consecutive n's. They are consecutive principal minors for two of the models, GUE with external source and the multiple Laguerre matrix model, and merely properly defined consecutive matrices for the third one, the Jacobi-Pineiro model; nevertheless the eigenvalues of the consecutive models all interlace. We show: (i) For each of those finite models, we give the transition probability of the associated Markov chain and the joint distribution of the entire interlacing set of eigenvalues; we show this is a determinantal point process whose extended kernels share many common features. (ii) To each of these models and their set of eigenvalues, we associate a last-passage percolation model, either finite percolation or percolation along an infinite strip of finite width, yielding a precise relationship between the last passage times and the eigenvalues. (iii) Finally it is shown that for appropriate choices of exponential distribution on the percolation, with very small means, the rescaled last passage times lead to the Pearcey process; this should connect the Pearcey statistics with random directed polymers.

preprint2013arXiv

Simulation studies on laser pulse stability for Dalian Coherent Light Source

Dalian Coherent Light Source will use a 300MeV LINAC to produce fully coherent photon pulses in the wavelength range between 150-50nm by high gain harmonic generation free electron laser (FEL) scheme. To generate stable FEL pulses, stringent tolerance budget is required for the LINAC output parameters, such as the mean beam energy stability, electron bunch arrival time jitter, peak current variation and the transverse beam position offset. In order to provide guidance for the design of the Dalian Coherent Light Source, in this paper, the sensitivity of FEL pulse energy fluctuation to various error sources of the electron bunch was performed using intensive start-to-end FEL simulations.

preprint2013arXiv

Slippage effect on laser phase error amplification in seeded harmonic generation free-electron lasers

Free-electron lasers (FELs) seeded with external lasers hold great promise for generating high power radiation with nearly transform-limited bandwidth in soft x-ray region. However, it has been pointed out that the initial seed laser noise will be amplified by the frequency up-conversion process, which may degrade the quality of the output radiation produced by a harmonic generation scheme. In this paper, theoretical and simulation studies for laser phase error amplification in seeded FEL schemes with slippage effect taken into account are presented. It is found that, the seed laser imperfection experienced by the electron beam can be significantly smoothed by the slippage effect in the modulator when the slippage length is comparable to the laser pulse length. This smoothing effect allows one to preserve the excellent temporal coherence of seeded FELs in presence of large laser phase errors. For ultra-short UV seed lasers with FWHM around 16 fs, the slippage length in a modulator with ~30 undulator periods is typically comparable to the laser width; for longer seed laser pulses with FWHM around 80 fs, the slippage length can be made comparable to the laser width using a modulator tuned at the sub-harmonic of the UV seed laser. Three-dimensional simulations have been carried out for a soft x-ray facility using seed laser pulses with large frequency chirp and the results show that the sensitivity of the bandwidth of the seeded FEL to the initial frequency chirp can be significantly reduced by a proper design of the modulator such that the slippage length is comparable to the laser width. Our studies show that the tolerance on laser phase error for generating nearly transform-limited soft x-ray pulses in seeded FELs is much looser than previously thought and fully coherent radiation at nanometer wavelength may be reached with current technologies.

preprint2013arXiv

Tunable Band Topology Reflected by Fractional Quantum Hall States in Two-Dimensional Lattices

Two-dimensional lattice models subjected to an external effective magnetic field can form nontrivial band topologies characterized by nonzero integer band Chern numbers. In this Letter, we investigate such a lattice model originating from the Hofstadter model and demonstrate that the band topology transitions can be realized by simply introducing tunable longer-range hopping. The rich phase diagram of band Chern numbers is obtained for the simple rational flux density and a classification of phases is presented. In the presence of interactions, the existence of fractional quantum Hall states in both |C|=1 and |C|>1 bands is confirmed, which can reflect the band topologies in different phases. In contrast, when our model reduces to a one-dimensional lattice, the ground states are crucially different from fractional quantum Hall states. Our results may provide insights into the study of new fractional quantum Hall states and experimental realizations of various topological phases in optical lattices.

preprint2012arXiv

Chemical Constraints on the Oxygen Abundances in Jupiter and Saturn

We perform a comparative analysis of the chemical kinetics of CO and $\rm PH_3$ in Jupiter and Saturn to assess the full set of constraints available on the troposphere water abundance in the two giant planets. For carbon monoxide we employ both a widely used CO kinetic scheme from Yung et al, and a newly identified CO chemical scheme from Visscher and Moses. For $\rm PH_3$ chemical scheme, we use the same chemical scheme as in Visscher and Fegley. Yung's chemical scheme for CO yields a water enrichment of 0.95 - 23.0 times solar abundance on Jupiter, and an upper limit of 14.0 for Saturn. Visscher's chemical scheme in contrast produces a water enrichment of 0.24 - 2.6 times solar abundance in Jupiter, and for Saturn an upper limit for water enrichment of 8.0. From this scheme, which takes advantage of the most up-to-date kinetics, we preclude high water enrichments on Jupiter and Saturn, and show that the kinetics approach yields Jovian bulk abundance in which values of C/O elevated relative to solar are admissible. Our result is consistent with recent reinterpretation of Galileo Probe data in which Jupiter formed in a water-depleted portion of the protoplanetary disk (Mousis et al).

preprint2012arXiv

On a relationship between high rank cases and rank one cases of Hermitian random matrix models with external source

We prove an identity on Hermitian random matrix models with external source relating the high rank cases to the rank 1 cases. This identity was proved and used in a previous paper of ours to study the asymptotics of the top eigenvalues. In this paper, we give an alternative, more conceptual proof of this identity based on a connection between the Hermitian matrix models with external source and the discrete KP hierarchy. This connection is obtained using the vertex operator method of Adler and van Moerbeke. The desired identity then follows from the Fay-like identity of the discrete KP tau vector.

preprint2012arXiv

On the largest eigenvalue of a Hermitian random matrix model with spiked external source II. Higher rank cases

This is the second part of a study of the limiting distributions of the top eigenvalues of a Hermitian matrix model with spiked external source under a general external potential. The case when the external source is of rank one was analyzed in an earlier paper. In the present paper we extend the analysis to the higher rank case. If all the eigenvalues of the external source are less than a critical value, the largest eigenvalue converges to the right end-point of the support of the equilibrium measure as in the case when there is no external source. On the other hand, if an external source eigenvalue is larger than the critical value, then an eigenvalue is pulled off from the support of the equilibrium measure. This transition is continuous, and is universal, including the fluctuation laws, for convex potentials. For non-convex potentials, two types of discontinuous transitions are possible to occur generically. We evaluate the limiting distributions in each case for general potentials including those whose equilibrium measure have multiple intervals for their support.

preprint2012arXiv

Polarization control proposal for Shanghai deep ultraviolet free electron laser

In this paper, a fully coherent radiation option with controllable polarization is proposed for Shanghai deep ultraviolet free electron laser (FEL) test facility. Intensive start-to-end simulation suggests that, the two crossed planar undulators which generate the horizontal and vertical linear polarized FEL respectively, should be placed as close as possible for avoiding the polarization performance degradation of the final combined FEL radiation. With the existence of the phase-shifter between the two crossed radiators, Fourier-Transform-Limited output radiation with 100 nJ order pulse energy, 5 ps full pulse length and circular polarization degree above 90% could be achieved.

preprint2012arXiv

Species Diversity in Rock-Paper-Scissors Game Coupling with Levy Flight

Rock-paper-scissors (RPS) game is a nice model to study the biodiversity in ecosystem. However, the previous studies only consider the nearest- neighbor- interaction among the species. In this paper, taking the long range migration into account, the effects of the interplay between nearest-neighbor-interaction and long-range-interaction of Levy flight obey the power law distance distribution with the exponent h (-0.3<h<-0.1) in spatial RPS game is investigated. Taking the probability of long range Levy flight and the power exponent as parameters, the coexistence conditions of three species are found. The critical curves for stable coexistence of three species in the parameters space are presented. It is also found that long-range-interaction with Levy flight has interesting effects on the final spatiotemporal pattern of the system. The results reveal that the long-range-interaction of Levy flight exhibit pronounced effects on biodiversity of ecosystem.

preprint2012arXiv

Status of polarization control experiment at Shanghai deep ultraviolet free electron laser

A polarization control experiment by utilizing a pair of crossed undulators has been proposed for the Shanghai deep ultraviolet free electron laser test facility. Numerical simulations indicate that, with the electromagnetic phase-shifter located between the two crossed planar undulators, fully coherent radiation with 100 nJ order pulse energy, 5 picoseconds pulse length and circular polarization degree above 90% could be generated. The physical design study and the preparation status of the experiment are presented in the paper.

preprint2012arXiv

The largest eigenvalue of real symmetric, Hermitian and Hermitian self-dual random matrix models with rank one external source, part I

We consider the limiting location and limiting distribution of the largest eigenvalue in real symmetric ($β$ = 1), Hermitian ($β$ = 2), and Hermitian self-dual ($β$ = 4) random matrix models with rank 1 external source. They are analyzed in a uniform way by a contour integral representation of the joint probability density function of eigenvalues. Assuming the one-band condition and certain regularities of the potential function, we obtain the limiting location of the largest eigenvalue when the nonzero eigenvalue of the external source matrix is not the critical value, and further obtain the limiting distribution of the largest eigenvalue when the nonzero eigenvalue of the external source matrix is greater than the critical value. When the nonzero eigenvalue of the external source matrix is less than or equal to the critical value, the limiting distribution of the largest eigenvalue will be analyzed in a subsequent paper. In this paper we also give a definition of the external source model for all $β$ > 0.

preprint2011arXiv

A genuine maximally seven-qubit entangled state

Contrary to A.Borras et al.'s [1] conjecture, a genuine maximally seven-qubit entangled state is presented. We find a seven-qubit state whose marginal density matrices for subsystems of 1,2- qubits are all completely mixed and for subsystems of 3-qubits is almost completely mixed.

preprint2011arXiv

Characterizing Temporal Structure of the Relativistic Electron Bunch

Using proper beam energy chirp and the undulator detuning effect, we propose a modified optical replica synthesizer scheme to characterize the temporal structure of a relativistic electron bunch, which predicts a 100-fs temporal resolution in numerical simulation. The proof of principle experiment demonstrates a peak current of 9 A and a slice energy spread of about 0.5 keV for the uncompressed electron beam of the Shanghai Deep UV Free Electron Laser Facility.

preprint2011arXiv

Continuous variable entanglement measurement without phase locking

A new simple entanglement measurement method is proposed for the bright EPR beams generated from a non-degenerate optical parametric amplifier operating at deamplification. Due to the output signal and idler modes are frequency degenerate and in phase, the needed phase shift of interference for the measurement of the correlated phase quadratures and anti-correlated amplitude quadratures can be accomplished by a quarter-wave plate and a half wave plate without separating the signal and idler beam. Therefore, phase locking and local oscillators are avoided.

preprint2011arXiv

Recoiling Supermassive Black Holes in Spin-flip Radio Galaxies

Numerical relativity simulations predict that coalescence of supermassive black hole (SMBH) binaries not only leads to a spin flip but also to a recoiling of the merger remnant SMBHs. In the literature, X-shaped radio sources are popularly suggested to be candidates for SMBH mergers with spin flip of jet-ejecting SMBHs. Here we investigate the spectral and spatial observational signatures of the recoiling SMBHs in radio sources undergoing black hole spin flip. Our results show that SMBHs in most spin-flip radio sources have mass ratio $q\ga 0.3$ with a minimum possible value $q_{\rm min} \simeq 0.05$. For major mergers, the remnant SMBHs can get a kick velocity as high as $2100 km s^{-1}$ in the direction within an angle $\la 40^\circ$ relative to the spin axes of remnant SMBHs, implying that recoiling quasars are biased to be with high Doppler-shifted broad emission lines while recoiling radio galaxies are biased to large apparent spatial off-center displacements. We also calculate the distribution functions of line-of-sight velocity and apparent spatial off-center for spin-flip radio sources with different apparent jet reorientation angles. Our results show that the larger the apparent jet reorientation angle is, the larger the Doppler-shifting recoiling velocity and apparent spatial off-center displacement will be. We investigate the effects of recoiling velocity on the dust torus in spin-flip radio sources and suggest that recoiling of SMBHs would lead to "dust poor" AGNs. Finally, we collect a sample of 19 X-shaped radio objects and for each object give the probability of detecting the predicted signatures of recoiling SMBH.

preprint2011arXiv

The contribution of spin torque to spin Hall coefficient and spin motive force in spin-orbit coupling system

We derive rigorously the relativistic angular momentum conservation equation by means of quantum electrodynamics. The novel nonrelativistic spin current and torque in the spin-orbit coupling system, up to the order of $1/c^{4}$, are exactly investigated by using Foldy-Wouthuysen transformation. We find a perfect spin Hall coefficient including the contribution of spin torque dipole. A novel spin motive force, analogue to the Lorentz force, is also obtained for understanding of the spin Hall effect.

preprint2010arXiv

On the largest eigenvalue of a Hermitian random matrix model with spiked external source I. Rank one case

Consider a Hermitian matrix model under an external potential with spiked external source. When the external source is of rank one, we compute the limiting distribution of the largest eigenvalue for general, regular, analytic potential for all values of the external source. There is a transitional phenomenon, which is universal for convex potentials. However, for non-convex potentials, new types of transition may occur. The higher rank external source is analyzed in the subsequent paper.

preprint2009arXiv

Random matrices with external source and KP $τ$ functions

In this paper we prove that the partition function in the random matrix model with external source is a KP $τ$ function.

Dong Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

173 published item(s)

Dr. Zero: Self-Evolving Search Agents without Training Data

From Failure to Mastery: Generating Hard Samples for Tool-use Agents

MambaPanoptic: A Vision Mamba-based Structured State Space Framework for Panoptic Segmentation

MASH: A Multiplatform and Multimodal Annotated Dataset for Societal Impact of Hurricane

MeSS: City Mesh-Guided Outdoor Scene Generation with Cross-View Consistent Diffusion

RELO: Reinforcement Learning to Localize for Visual Object Tracking

V2P: Visual Attention Calibration for GUI Grounding via Background Suppression and Center Peaking

Movable Antenna Enhanced Multi-Region Beam Coverage: A Multi-Notch-Filter-Inspired Design

On the Effectiveness of Training Data Optimization for LLM-based Code Generation: An Empirical Study

Particle-scale origin of quadrupolar non-affine displacement fields in granular solids

A prediction-correction based iterative convolution-thresholding method for topology optimization of heat transfer problems

Reconfigurable Three-Dimensional Thermal Dome

RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

The Security Analysis of Continuous-Variable Quantum Key Distribution under Limited Eavesdropping with Practical Fiber

A two-stage method for reconstruction of parameters in diffusion equations

An efficient unconditionally stable method for Dirichlet partitions in arbitrary domains

An Exploration of npm Package Co-Usage Examples from Stack Overflow: A Case Study

Balanced Multimodal Learning via On-the-fly Gradient Modulation

C-P Map: A Novel Evaluation Toolkit for Speaker Verification

Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection

Cross DQN: Cross Deep Q Network for Ads Allocation in Feed

Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation

Efficient Localness Transformer for Smart Sensor-Based Energy Disaggregation

Enhanced exemplar autoencoder with cycle consistency loss in any-to-one voice conversion

Evolving Programmable Computational Metamaterials

Generation of S-shaped photonic hooks from microcylinders with engineered surface patches

Giving Back: Contributions Congruent to Library Dependency Changes in a Software Ecosystem

Gradient Importance Learning for Incomplete Observations

Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation

Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks

Neural Topic Modeling with Deep Mutual Information Estimation

Newcomer OSS-Candidates: Characterizing Contributions of Novice Developers to GitHub

Novel Valence Transition in Elemental Metal Europium around 80 GPa

Overcoming Van der Waals Forces in reconfigurable nanostructures

Pay Attention to Hard Trials

PointScatter: Point Set Representation for Tubular Structure Extraction

Probabilistic methods for approximate archetypal analysis

Reliable Visualization for Deep Speaker Recognition

Rényi State Entropy for Exploration Acceleration in Reinforcement Learning

Some Examples of Privacy-preserving Publication and Sharing of COVID-19 Pandemic Data

Topmetal-M: a novel pixel sensor for compact tracking applications

Towards Grand Unification of Object Tracking

Ultra-stable shear jammed granular material

Universality for random matrices with equi-spaced external source: a case study of a biorthogonal ensemble

Variational methods and deep Ritz method for active elastic solids

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline

Vision-based Anti-UAV Detection and Tracking

A Dataset And Benchmark Of Underwater Object Detection For Robot Picking

A vector Riemann-Hilbert approach to the Muttalib-Borodin ensembles

Holographic insulator/superconductor phase transitions with excited states

Holographic superconductors in 4D Einstein-Gauss-Bonnet gravity

Necessary and sufficient criterion of steering for two-qubit T states

Reconfigurable-intelligent-surface-assisted Downlink Transmission Design via Bayesian Optimization

Robust stimulated Raman shortcut-to-adiabatic passage by invariant-based optimal control

Self-Amplification of Coherent Energy Modulation in Seeded Free-Electron Lasers

A deep learning approach to multi-track location and orientation in gaseous drift chambers

An efficient iterative method for reconstructing surface from point clouds

ASR-Free Pronunciation Assessment

Backreacting holographic superconductors from the coupling of a scalar field to the Einstein tensor

Consistency of archetypal analysis

Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises

CovidSens: A Vision on Reliable Social Sensing for COVID-19

Curriculum Audiovisual Learning

DASC: Towards A Road Damage-Aware Social-Media-Driven Car Sensing Framework for Disaster Response Applications

Development of readout electronics a novel beam monitoring system for ion research facility accelerator

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning

Experimental demonstration of complementarity relations between quantum steering criteria

FocalMix: Semi-Supervised Learning for 3D Medical Image Detection

Graph Representation Learning for Merchant Incentive Optimization in Mobile Payment Marketing

High-Performance Long-Term Tracking with Meta-Updater

Improve bone age assessment by learning from anatomical local regions

Improved tripartite uncertainty relation with quantum memory

Jointly Modeling Motion and Appearance Cues for Robust RGB-T Tracking

Metal-free magnetism in chemically doped covalent organic frameworks