Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
161works
0followers
60topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

161 published item(s)

preprint2026arXiv

Chart-FR1: Visual Focus-Driven Fine-Grained Reasoning on Dense Charts

Multimodal large language models (MLLMs) have shown considerable potential in chart understanding and reasoning tasks. However, they still struggle with high information density (HID) charts characterized by multiple subplots, legends, and dense annotations due to three major challenges: (1) limited fine-grained perception results in the omission of critical visual cues; (2) redundant or noisy visual information undermines the performance of multimodal reasoning; (3) lack of adaptive deep reasoning relative to the amount of visual information. To tackle these challenges, we present a novel focus-driven fine-grained chart reasoning model, Chart-FR1, to improve perception, focusing efficiency, and adaptive deep reasoning on HID charts. Specifically, we propose Focus-CoT, a visual focusing chain-of-thought that enhances fine-grained perception by explicitly linking reasoning steps to key visual cues, such as local image regions and OCR signals. Building on this, we introduce Focus-GRPO, a focus-driven reinforcement learning algorithm with an information-efficiency reward that compresses redundant visual information for efficient focusing, and an adaptive KL penalty mechanism that enables flexible control over reasoning depth as more visual cues are discovered. Furthermore, to fill the gap in benchmarks for HID charts, we build HID-Chart, a challenging benchmark with an information-density metric designed to evaluate fine-grained chart reasoning capabilities. Extensive experiments on multiple chart benchmarks demonstrate that Chart-FR1 outperforms state-of-the-art MLLMs in chart understanding and reasoning. Code is available at https://github.com/phkhub/Chart-FR1.

preprint2026arXiv

SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation

Generative Recommendation (GR) has emerged as a promising paradigm by formulating item recommendation as a sequence-to-sequence generation task over item identifiers. Recent studies have incorporated multimodal signals to provide richer token-level evidence for generation. However, existing approaches largely rely on alignment-centric fusion and underexplore synergistic information across modalities. In practice, synergistic information plays a critical role in capturing emergent item properties that cannot be inferred from any single modality alone. Such properties encode intrinsic item semantics and guide user preferences, enabling models to move beyond surface-level feature matching. To address this limitation, we propose \textbf{SynGR}, a synergistic generative recommendation framework that explicitly encourages the exploitation of cross-modal dependencies during generation. By constraining overreliance on dominant modalities, SynGR enables the model to capture emergent item semantics beyond shared or modality-specific signals. Extensive experiments across three benchmark datasets demonstrate that SynGR achieves superior performance.

preprint2025arXiv

Micro-Macro Tensor Neural Surrogates for Uncertainty Quantification in Collisional Plasma

Plasma kinetic equations exhibit pronounced sensitivity to microscopic perturbations in model parameters and data, making reliable and efficient uncertainty quantification (UQ) essential for predictive simulations. However, the cost of uncertainty sampling, the high-dimensional phase space, and multiscale stiffness pose severe challenges to both computational efficiency and error control in traditional numerical methods. These aspects are further emphasized in presence of collisions where the high-dimensional nonlocal collision integrations and conservation properties pose severe constraints. To overcome this, we present a variance-reduced Monte Carlo framework for UQ in the Vlasov--Poisson--Landau (VPL) system, in which neural network surrogates replace the multiple costly evaluations of the Landau collision term. The method couples a high-fidelity, asymptotic-preserving VPL solver with inexpensive, strongly correlated surrogates based on the Vlasov--Poisson--Fokker--Planck (VPFP) and Euler--Poisson (EP) equations. For the surrogate models, we introduce a generalization of the separable physics-informed neural network (SPINN), developing a class of tensor neural networks based on an anisotropic micro-macro decomposition, to reduce velocity-moment costs, model complexity, and the curse of dimensionality. To further increase correlation with VPL, we calibrate the VPFP model and design an asymptotic-preserving SPINN whose small- and large-Knudsen limits recover the EP and VP systems, respectively. Numerical experiments show substantial variance reduction over standard Monte Carlo, accurate statistics with far fewer high-fidelity samples, and lower wall-clock time, while maintaining robustness to stochastic dimension.

preprint2024arXiv

Digital-SC: Digital Semantic Communication with Adaptive Network Split and Learned Non-Linear Quantization

Semantic communication, an intelligent communication paradigm that aims to transmit useful information in the semantic domain, is facilitated by deep learning techniques. Robust semantic features can be learned and transmitted in an analog fashion, but it poses new challenges to hardware, protocol, and encryption. In this paper, we propose a digital semantic communication system, which consists of an encoding network deployed on a resource-limited device and a decoding network deployed at the edge. To acquire better semantic representation for digital transmission, a novel non-linear quantization module is proposed to efficiently quantize semantic features with trainable quantization levels. Additionally, structured pruning is incorporated to reduce the dimension of the transmitted features. We also introduce a semantic learning loss (SLL) function to reduce semantic error. To adapt to various channel conditions and inputs under constraints of communication and computing resources, a policy network is designed to adaptively choose the split point and the dimension of the transmitted semantic features. Experiments using the CIFAR-10 and ImageNet dataset for image classification are employed to evaluate the proposed digital semantic communication network, and ablation studies are conducted to assess the proposed quantization module, structured pruning and SLL.

preprint2024arXiv

Strong decays of $T_{c\bar s0}(2900)^{++/0}$ as a fully open-flavor tetraquark state

We have studied the strong decay properties of the recently observed $T^a_{c\bar s0}(2900)^{++/0}$ by considering it as a $cu\bar{d}\bar{s}/cd\bar{u}\bar{s}$ fully open-flavor tetraquark state with $I(J^P)=1(0^+)$. In the framework of QCD sum rules, we have calculated the three-point correlation functions of the two-body strong decay processes $T^a_{c\bar s0}(2900)^{++}\rightarrow D_s^+π^+$, $D^+K^+, D_s^{\ast +}ρ^+$ and $D_{s1}^+π^+$. The full width of $T^a_{c\bar s0}(2900)^{++/0}$ is obtained as $161.7\pm94.8$ MeV, which is consistent with the experimental observation. We predict the relative branching ratios as $Γ(T\rightarrow D_sπ):Γ(T\rightarrow DK):Γ(T\rightarrow D_s^{\ast} ρ):Γ(T\rightarrow D_{s1}π)\approx1.00:1.10:0.04:0.43$, implying that the main decay modes of $T^a_{c\bar s0}(2900)^{++/0}$ state are $D_sπ$ and $DK$ channels in our calculations. However, the $P$-wave decay mode $D_{s1}π$ is also comparable and important by including the uncertainties. To further identify the nature of $T^a_{c\bar s0}(2900)^{++/0}$, we suggest confirming them in the $DK$ and $D_{s}π$ final states, and measuring the above ratios in future experiments.

preprint2023arXiv

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

A search for interactions from solar $^8$B neutrinos elastically scattering off xenon nuclei using PandaX-4T commissioning data is reported. The energy threshold of this search is further lowered compared with the previous search for dark matter, with various techniques utilized to suppress the background that emerges from data with the lowered threshold. A blind analysis is performed on the data with an effective exposure of 0.48 tonne$\cdot$year, and no significant excess of events is observed. Among results obtained using the neutrino-nucleus coherent scattering, our results give the best constraint on the solar $^8$B neutrino flux. We further provide a more stringent limit on the cross section between dark matter and nucleon in the mass range from 3 to 9 GeV/c$^2$.

preprint2023arXiv

A Knowledge-based Learning Framework for Self-supervised Pre-training Towards Enhanced Recognition of Biomedical Microscopy Images

Self-supervised pre-training has become the priory choice to establish reliable neural networks for automated recognition of massive biomedical microscopy images, which are routinely annotation-free, without semantics, and without guarantee of quality. Note that this paradigm is still at its infancy and limited by closely related open issues: 1) how to learn robust representations in an unsupervised manner from unlabelled biomedical microscopy images of low diversity in samples? and 2) how to obtain the most significant representations demanded by a high-quality segmentation? Aiming at these issues, this study proposes a knowledge-based learning framework (TOWER) towards enhanced recognition of biomedical microscopy images, which works in three phases by synergizing contrastive learning and generative learning methods: 1) Sample Space Diversification: Reconstructive proxy tasks have been enabled to embed a priori knowledge with context highlighted to diversify the expanded sample space; 2) Enhanced Representation Learning: Informative noise-contrastive estimation loss regularizes the encoder to enhance representation learning of annotation-free images; 3) Correlated Optimization: Optimization operations in pre-training the encoder and the decoder have been correlated via image restoration from proxy tasks, targeting the need for semantic segmentation. Experiments have been conducted on public datasets of biomedical microscopy images against the state-of-the-art counterparts (e.g., SimCLR and BYOL), and results demonstrate that: TOWER statistically excels in all self-supervised methods, achieving a Dice improvement of 1.38 percentage points over SimCLR. TOWER also has potential in multi-modality medical image analysis and enables label-efficient semi-supervised learning, e.g., reducing the annotation cost by up to 99% in pathological classification.

preprint2023arXiv

Automated Sleep Staging via Parallel Frequency-Cut Attention

This paper proposes a novel framework for automatically capturing the time-frequency nature of electroencephalogram (EEG) signals of human sleep based on the authoritative sleep medicine guidance. The framework consists of two parts: the first part extracts informative features by partitioning the input EEG spectrograms into a sequence of time-frequency patches. The second part is constituted by an attention-based architecture to efficiently search for the correlation between partitioned time-frequency patches and defining factors of sleep stages in parallel. The proposed pipeline is validated on the Sleep Heart Health Study dataset with new state-of-the-art results for the stages wake, N2, and N3, obtaining respective F1 scores of 0.93, 0.88, and 0.87, with only EEG signals used. The proposed method also has a high inter-rater reliability of 0.80 kappa. We also visualize the correspondence between sleep staging decisions and features extracted by the proposed method, providing strong interpretability for our model.

preprint2023arXiv

Nonreciprocal Charge and Spin Transport Induced by Non-Hermitian Skin Effect in Mesoscopic Heterojunctions

The pursuit of the non-Hermitian skin effect (NHSE) in various physical systems is of great research interest. Compared with recent progress in non-electronic systems, the implementation of the NHSE in condensed matter physics remains elusive. Here, we show that the NHSE can be engineered in the mesoscopic heterojunctions (system plus reservoir) in which electrons in two channels of the system moving towards each other have asymmetric coupling to those of the reservoir. This makes electrons in the system moving forward and in the opposite direction have unequal lifetimes, and so gives rise to a point-gap spectral topology. Accordingly, the electron eigenstates exhibit NHSE under the open boundary condition, consistent with the description of the generalized Brillouin zone. Such a reservoir-engineered NHSE visibly manifests itself as the nonreciprocal charge current that can be probed by the standard transport measurements. Further, we generalize the scenario to the spin-resolved NHSE, which can be probed by the nonreciprocal spin transport. Our work opens a new research avenue for implementing and detecting the NHSE in electronic mesoscopic systems, which will lead to interesting device applications.

preprint2023arXiv

Proximity-induced zero-energy states indistinguishable from topological edge states

When normal metals (NMs) are attached to topological insulators or topological superconductors, it is conceivable that the quantum states in these finite adjacent materials can intermix. In this case -- and because the NM usually does not possess the same symmetry as the topological material -- it is pertinent to ask whether zero-energy edge states in the topological layer are affected by the presence of the NM. To address this issue, we consider three prototype systems simulated by tight-binding models, namely a Su-Schrieffer-Heeger/NM, a Kitaev/NM, and a Chern insulator/NM. For all junctions investigated, we find that there exist trivial ``fine-tuned'' zero-energy states in the NM layer that can percolate into the topological region, thus mimicking a topological state. These zero-energy states are created by fine-tuning the NM chemical potential such that some of the NM states cross zero energy; they can occur even when the topological material is in the topologically trivial phase, and exist over a large region of the topological phase diagram. Interestingly, the true Majorana end modes of the Kitaev/NM model cannot be crossed by any NM state, as the NM metal layer in this case does not break particle-hole symmetry. On the other hand, when the chiral symmetry of the Su-Schrieffer-Heeger chain is broken by the attached NM, crossings are allowed. In addition, even in Chern insulators that do not preserve nonspatial symmetries, but the topological edge state self-generates a symmetry eigenvalue, such a fine-tuned zero-energy state can still occur. Our results indicate that when a topological material is attached to a metallic layer, one has to be cautious as to identify true topological edge states merely from their energy spectra and wave function profiles near the interface.

preprint2022arXiv

3D large-scale fused silica microfluidic chips enabled by hybrid laser microfabrication for continuous-flow UV photochemical synthesis

We demonstrate a hybrid laser microfabrication approach, which combines the technical merits of ultrafast laser-assisted chemical etching and carbon dioxide laser-induced in-situ melting, for centimeter-scale and bonding-free fabrication of 3D complex hollow microstructures in fused silica glass. With the developed approach, large-scale fused silica microfluidic chips with integrated 3D cascaded micromixing units can be reliably manufactured. High-performance on-chip mixing and continuous-flow photochemical synthesis under UV LEDs irradiation at ~280 nm were demonstrated using the manufactured chip, indicating a powerful capability for versatile fabrication of highly transparent all-glass microfluidic reactors for on-chip photochemical synthesis.

preprint2022arXiv

A Hierarchical Interactive Network for Joint Span-based Aspect-Sentiment Analysis

Recently, some span-based methods have achieved encouraging performances for joint aspect-sentiment analysis, which first extract aspects (aspect extraction) by detecting aspect boundaries and then classify the span-level sentiments (sentiment classification). However, most existing approaches either sequentially extract task-specific features, leading to insufficient feature interactions, or they encode aspect features and sentiment features in a parallel manner, implying that feature representation in each task is largely independent of each other except for input sharing. Both of them ignore the internal correlations between the aspect extraction and sentiment classification. To solve this problem, we novelly propose a hierarchical interactive network (HI-ASA) to model two-way interactions between two tasks appropriately, where the hierarchical interactions involve two steps: shallow-level interaction and deep-level interaction. First, we utilize cross-stitch mechanism to combine the different task-specific features selectively as the input to ensure proper two-way interactions. Second, the mutual information technique is applied to mutually constrain learning between two tasks in the output layer, thus the aspect input and the sentiment input are capable of encoding features of the other task via backpropagation. Extensive experiments on three real-world datasets demonstrate HI-ASA's superiority over baselines.

preprint2022arXiv

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

We report a novel search for the cosmic ray boosted dark matter using the 100~tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$cm$^{2}$ and 10$^{-28}$cm$^{2}$ for a dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.

preprint2022arXiv

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

In the two-component Majorana dark matter model, one dark matter particle can scatter off the target nuclei, and turn into a slightly heavier component. In the framework of a simplified model with a vector boson mediator, both the tree-level and loop-level processes contribute to the signal in direct detection experiment. In this paper, we report the search results for such dark matter from PandaX-II experiment, using total data of the full 100.7 tonne$\cdot$day exposure. No significant excess is observed, so strong constraints on the combined parameter space of mediator mass and dark matter mass are derived. With the complementary search results from collider experiments, a large range of parameter space can be excluded.

preprint2022arXiv

Adaptive Greedy versus Non-adaptive Greedy for Influence Maximization

We consider the *adaptive influence maximization problem*: given a network and a budget $k$, iteratively select $k$ seeds in the network to maximize the expected number of adopters. In the *full-adoption feedback model*, after selecting each seed, the seed-picker observes all the resulting adoptions. In the *myopic feedback model*, the seed-picker only observes whether each neighbor of the chosen seed adopts. Motivated by the extreme success of greedy-based algorithms/heuristics for influence maximization, we propose the concept of *greedy adaptivity gap*, which compares the performance of the adaptive greedy algorithm to its non-adaptive counterpart. Our first result shows that, for submodular influence maximization, the adaptive greedy algorithm can perform up to a $(1-1/e)$-fraction worse than the non-adaptive greedy algorithm, and that this ratio is tight. More specifically, on one side we provide examples where the performance of the adaptive greedy algorithm is only a $(1-1/e)$ fraction of the performance of the non-adaptive greedy algorithm in four settings: for both feedback models and both the *independent cascade model* and the *linear threshold model*. On the other side, we prove that in any submodular cascade, the adaptive greedy algorithm always outputs a $(1-1/e)$-approximation to the expected number of adoptions in the optimal non-adaptive seed choice. Our second result shows that, for the general submodular diffusion model with full-adoption feedback, the adaptive greedy algorithm can outperform the non-adaptive greedy algorithm by an unbounded factor. Finally, we propose a risk-free variant of the adaptive greedy algorithm that always performs no worse than the non-adaptive greedy algorithm.

preprint2022arXiv

Availability Attacks Create Shortcuts

Availability attacks, which poison the training data with imperceptible perturbations, can make the data \emph{not exploitable} by machine learning algorithms so as to prevent unauthorized use of data. In this work, we investigate why these perturbations work in principle. We are the first to unveil an important population property of the perturbations of these attacks: they are almost \textbf{linearly separable} when assigned with the target labels of the corresponding samples, which hence can work as \emph{shortcuts} for the learning objective. We further verify that linear separability is indeed the workhorse for availability attacks. We synthesize linearly-separable perturbations as attacks and show that they are as powerful as the deliberately crafted attacks. Moreover, such synthetic perturbations are much easier to generate. For example, previous attacks need dozens of hours to generate perturbations for ImageNet while our algorithm only needs several seconds. Our finding also suggests that the \emph{shortcut learning} is more widely present than previously believed as deep models would rely on shortcuts even if they are of an imperceptible scale and mixed together with the normal features. Our source code is published at \url{https://github.com/dayu11/Availability-Attacks-Create-Shortcuts}.

preprint2022arXiv

Branching Reinforcement Learning

In this paper, we propose a novel Branching Reinforcement Learning (Branching RL) model, and investigate both Regret Minimization (RM) and Reward-Free Exploration (RFE) metrics for this model. Unlike standard RL where the trajectory of each episode is a single $H$-step path, branching RL allows an agent to take multiple base actions in a state such that transitions branch out to multiple successor states correspondingly, and thus it generates a tree-structured trajectory. This model finds important applications in hierarchical recommendation systems and online advertising. For branching RL, we establish new Bellman equations and key lemmas, i.e., branching value difference lemma and branching law of total variance, and also bound the total variance by only $O(H^2)$ under an exponentially-large trajectory. For RM and RFE metrics, we propose computationally efficient algorithms BranchVI and BranchRFE, respectively, and derive nearly matching upper and lower bounds. Our results are only polynomial in problem parameters despite exponentially-large trajectories.

preprint2022arXiv

Certified Robustness to Word Substitution Ranking Attack for Neural Ranking Models

Neural ranking models (NRMs) have achieved promising results in information retrieval. NRMs have also been shown to be vulnerable to adversarial examples. A typical Word Substitution Ranking Attack (WSRA) against NRMs was proposed recently, in which an attacker promotes a target document in rankings by adding human-imperceptible perturbations to its text. This raises concerns when deploying NRMs in real-world applications. Therefore, it is important to develop techniques that defend against such attacks for NRMs. In empirical defenses adversarial examples are found during training and used to augment the training set. However, such methods offer no theoretical guarantee on the models' robustness and may eventually be broken by other sophisticated WSRAs. To escape this arms race, rigorous and provable certified defense methods for NRMs are needed. To this end, we first define the \textit{Certified Top-$K$ Robustness} for ranking models since users mainly care about the top ranked results in real-world scenarios. A ranking model is said to be Certified Top-$K$ Robust on a ranked list when it is guaranteed to keep documents that are out of the top $K$ away from the top $K$ under any attack. Then, we introduce a Certified Defense method, named CertDR, to achieve certified top-$K$ robustness against WSRA, based on the idea of randomized smoothing. Specifically, we first construct a smoothed ranker by applying random word substitutions on the documents, and then leverage the ranking property jointly with the statistical property of the ensemble to provably certify top-$K$ robustness. Extensive experiments on two representative web search datasets demonstrate that CertDR can significantly outperform state-of-the-art empirical defense methods for ranking models.

preprint2022arXiv

Characterization of BNL and HPK AC-LGAD sensors with a 120 GeV proton beam

We present measurements of AC-LGADs performed at the Fermilab's test beam facility using 120 GeV protons. We studied the performance of various strip and pad AC-LGAD sensors that were produced by BNL and HPK. The measurements are performed with our upgraded test beam setup that utilizes a high precision telescope tracker, and a simultaneous readout of up to 7 channels per sensor, which allows detailed studies of signal sharing characteristics. These measurements allow us to assess the differences in designs between different manufacturers, and optimize them based on experimental performance. We then study several reconstruction algorithms to optimize position and time resolutions that utilize the signal sharing properties of each sensor. We present a world's first demonstration of silicon sensors in a test beam that simultaneously achieve better than 6-10 micron position and 30 ps time resolution. This represents a substantial improvement to the spatial resolution than would be obtained with binary readout of sensors with similar pitch.

preprint2022arXiv

CohortVA: A Visual Analytic System for Interactive Exploration of Cohorts based on Historical Data

In history research, cohort analysis seeks to identify social structures and figure mobilities by studying the group-based behavior of historical figures. Prior works mainly employ automatic data mining approaches, lacking effective visual explanation. In this paper, we present CohortVA, an interactive visual analytic approach that enables historians to incorporate expertise and insight into the iterative exploration process. The kernel of CohortVA is a novel identification model that generates candidate cohorts and constructs cohort features by means of pre-built knowledge graphs constructed from large-scale history databases. We propose a set of coordinated views to illustrate identified cohorts and features coupled with historical events and figure profiles. Two case studies and interviews with historians demonstrate that CohortVA can greatly enhance the capabilities of cohort identifications, figure authentications, and hypothesis generation.

preprint2022arXiv

Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations

We study the problem of coarse-grained response selection in retrieval-based dialogue systems. The problem is equally important with fine-grained response selection, but is less explored in existing literature. In this paper, we propose a Contextual Fine-to-Coarse (CFC) distilled model for coarse-grained response selection in open-domain conversations. In our CFC model, dense representations of query, candidate response and corresponding context is learned based on the multi-tower architecture, and more expressive knowledge learned from the one-tower architecture (fine-grained) is distilled into the multi-tower architecture (coarse-grained) to enhance the performance of the retriever. To evaluate the performance of our proposed model, we construct two new datasets based on the Reddit comments dump and Twitter corpus. Extensive experimental results on the two datasets show that the proposed methods achieve a significant improvement over all evaluation metrics compared with traditional baseline methods.

preprint2022arXiv

Cyclotron quantization and mirror-time transition on nonreciprocal lattices

Unidirectional transport and localized cyclotron motion are two opposite physical phenomena. Here, we study the interplay effects between them on nonreciprocal lattices subject to a magnetic field. We show that, in the long-wavelength limit, the trajectories of the wave packets always form closed orbits in four-dimensional (4D) complex space. Therefore, the semiclassical quantization rules persist despite the nonreciprocity, which preserves real Landau levels. We predict a different type of non-Hermitian spectral transition induced by the spontaneous breaking of the combined mirror-time reversal ($\mathcal{MT}$) symmetry, which generally exists in such systems. An order parameter is proposed to describe the $\mathcal{MT}$ phase transition, not only to determine the $\mathcal{MT}$ phase boundary but also to quantify the degree of $\mathcal{MT}$-symmetry breaking. Such an order parameter can be generally applied to all types of non-Hermitian phase transitions.

preprint2022arXiv

D2-TPred: Discontinuous Dependency for Trajectory Prediction under Traffic Lights

A profound understanding of inter-agent relationships and motion behaviors is important to achieve high-quality planning when navigating in complex scenarios, especially at urban traffic intersections. We present a trajectory prediction approach with respect to traffic lights, D2-TPred, which uses a spatial dynamic interaction graph (SDG) and a behavior dependency graph (BDG) to handle the problem of discontinuous dependency in the spatial-temporal space. Specifically, the SDG is used to capture spatial interactions by reconstructing sub-graphs for different agents with dynamic and changeable characteristics during each frame. The BDG is used to infer motion tendency by modeling the implicit dependency of the current state on priors behaviors, especially the discontinuous motions corresponding to acceleration, deceleration, or turning direction. Moreover, we present a new dataset for vehicle trajectory prediction under traffic lights called VTP-TL. Our experimental results show that our model achieves more than {20.45% and 20.78% }improvement in terms of ADE and FDE, respectively, on VTP-TL as compared to other trajectory prediction algorithms. The dataset and code are available at: https://github.com/VTP-TL/D2-TPred.

preprint2022arXiv

Deep Generative Models for Geometric Design Under Uncertainty

Deep generative models have demonstrated effectiveness in learning compact and expressive design representations that significantly improve geometric design optimization. However, these models do not consider the uncertainty introduced by manufacturing or fabrication. Past work that quantifies such uncertainty often makes simplified assumptions on geometric variations, while the "real-world" uncertainty and its impact on design performance are difficult to quantify due to the high dimensionality. To address this issue, we propose a Generative Adversarial Network-based Design under Uncertainty Framework (GAN-DUF), which contains a deep generative model that simultaneously learns a compact representation of nominal (ideal) designs and the conditional distribution of fabricated designs given any nominal design. We demonstrated the framework on two real-world engineering design examples and showed its capability of finding the solution that possesses better performances after fabrication.

preprint2022arXiv

Deep Joint Source-Channel Coding for CSI Feedback: An End-to-End Approach

The increased throughput brought by MIMO technology relies on the knowledge of channel state information (CSI) acquired in the base station (BS). To make the CSI feedback overhead affordable for the evolution of MIMO technology (e.g., massive MIMO and ultra-massive MIMO), deep learning (DL) is introduced to deal with the CSI compression task. Based on the separation principle in existing communication systems, DL based CSI compression is used as source coding. However, this separate source-channel coding (SSCC) scheme is inferior to the joint source-channel coding (JSCC) scheme in the finite blocklength regime. In this paper, we propose a deep joint source-channel coding (DJSCC) based framework for the CSI feedback task. In particular, the proposed method can simultaneously learn from the CSI source and the wireless channel. Instead of truncating CSI via Fourier transform in the delay domain in existing methods, we apply non-linear transform networks to compress the CSI. Furthermore, we adopt an SNR adaption mechanism to deal with the wireless channel variations. The extensive experiments demonstrate the validity, adaptability, and generality of the proposed framework.

preprint2022arXiv

Does Momentum Change the Implicit Regularization on Separable Data?

The momentum acceleration technique is widely adopted in many optimization algorithms. However, there is no theoretical answer on how the momentum affects the generalization performance of the optimization algorithms. This paper studies this problem by analyzing the implicit regularization of momentum-based optimization. We prove that on the linear classification problem with separable data and exponential-tailed loss, gradient descent with momentum (GDM) converges to the L2 max-margin solution, which is the same as vanilla gradient descent. That means gradient descent with momentum acceleration still converges to a low-complexity model, which guarantees their generalization. We then analyze the stochastic and adaptive variants of GDM (i.e., SGDM and deterministic Adam) and show they also converge to the L2 max-margin solution. Technically, to overcome the difficulty of the error accumulation in analyzing the momentum, we construct new potential functions to analyze the gap between the model parameter and the max-margin solution. Numerical experiments are conducted and support our theoretical results.

preprint2022arXiv

Entanglement on nucleation barrier of polymer crystal

We propose a theoretical approach to quantitatively account for the role of entanglement in the nucleation of polymer melts, which is the unique feature of polymer differentiated from small molecules. By performing molecular dynamics simulations, we obtain the nucleation barriers of polymer systems with different entanglement densities, which exhibits an opposite trend compared to the prediction of the classic nucleation theory (CNT). To amend the deficiency of the CNT in polymer crystallization, we introduce the entanglement free energy to reflect the role of entanglement in polymer nucleation. Specifically, the polymer nucleation not only involves free energies of monomers inside and on the surface of a nucleus as considered in the CNT, but also affects the entanglement network around the nucleus. Our theoretical approach provides a reasonable interpretation for the unsolved nucleation phenomena of polymers in simulations and experiments.

preprint2022arXiv

Epidemic Plateau: A Phenomenon under Adaptive Prevention Strategies

Since the beginning of the COVID-19 spreading, the number of studies on the epidemic models increased dramatically. It is important for policymakers to know how the disease will spread and what are the effects of the policies and environment on the spreading. In this paper, we propose two extensions to the standard SIR model: (a) we consider the prevention measures adopted based on the current severity of the infection. Those measures are adaptive and change over time; (b) multiple cities and regions are considered, with population movements between those cities and regions, while taking into account that each region may have different prevention measures. Although the adaptive measures and mobility of the population were often observed during the pandemic, these effects are rarely explicitly modeled and studied in the classical epidemic models. The model we propose gives rise to a plateau phenomenon: the number of people infected by the disease stays at the same level during an extended period of time. We show what are conditions need to be met in order for the spreading to exhibit a plateau period in a single city. In addition, this phenomenon is interdependent when considering multiple cities. We verify from the real-world data that the plateau phenomenon does exist in many regions of the world in the current COVID-19 development. Finally, we provide theoretical analysis on the plateau phenomenon for the single-city model and derive a series of results on the emergence and the ending of the plateau, as well as on the height and length of the plateau. Our theoretical results match well with our experimental findings.

preprint2022arXiv

Event-by-event jet anisotropy and hard-soft tomography of the quark-gluon plasma

Suppression of jet spectra or jet quenching in high-energy heavy-ion collisions is caused by jet energy loss in the dense medium. The azimuthal anisotropy of jet energy loss in non-central heavy-ion collisions can lead to jet anisotropy which in turn can provide insight into the path-length dependence of jet quenching. This is investigated within the Linear Boltzmann Transport (LBT) model which simulates both elastic scattering and medium-induced gluon radiation based on perturbative QCD for jet shower and medium recoil partons as well as radiated gluons as they propagate through the quark-gluon plasma (QGP). The dynamical evolution of the QGP in each event of heavy-ion collisions is provided by the (3+1)D CLVisc hydrodynamic model with fully fluctuating initial conditions. This framework has been shown to describe the suppression of single inclusive jet spectra well. We calculate in this study the elliptic ($v_{2}^{\rm jet}$) and triangular ($v_{3}^{\rm jet}$) anisotropy coefficients of the single inclusive jet spectra in Pb+Pb collisions at the LHC energies. We investigate the colliding energy, centrality, jet transverse momentum dependence of the jet anisotropy, as well as their event-by-event correlation with the flow coefficients of the soft bulk hadrons. An approximate linear correlation between jet and bulk $v_2$ is found. Effect of the bulk $v_n$ fluctuation on $v_n^{\rm jet}$ is found negligible. The jet-induced medium excitation, which is influenced by radial flow, is shown to enhance $v_{2}^{\rm jet}$ and the enhancement increases with the jet cone size. The jet elliptic anisotropy $v_{2}^{\rm jet}$ is also found to be slightly enhanced by the shear viscosity of the bulk medium in comparison to the LBT results when jets propagate through an ideal hydrodynamic QGP medium.

preprint2022arXiv

Evidence of Spin Frustration in Vanadium Diselenide Monolayer Magnet

Monolayer VSe2, featuring both charge density wave and magnetism phenomena, represents a unique van der Waals magnet in the family of metallic two-dimensional transition-metal dichalcogenides (2D-TMDs). Herein, by means of in-situ microscopic and spectroscopic techniques, including scanning tunneling microscopy/spectroscopy, synchrotron X-ray and angle-resolved photoemission, and X-ray absorption, direct spectroscopic signatures are established, that identify the metallic 1T-phase and vanadium 3d1 electronic configuration in monolayer VSe2 grown on graphite by molecular-beam epitaxy. Element-specific X-ray magnetic circular dichroism, complemented with magnetic susceptibility measurements, further reveals monolayer VSe2 as a frustrated magnet, with its spins exhibiting subtle correlations, albeit in the absence of a long-range magnetic order down to 2 K and up to a 7 T magnetic field. This observation is attributed to the relative stability of the ferromagnetic and antiferromagnetic ground states, arising from its atomic-scale structural features, such as rotational disorders and edges. The results of this study extend the current understanding of metallic 2D-TMDs in the search for exotic low-dimensional quantum phenomena, and stimulate further theoretical and experimental studies on van der Waals monolayer magnets.

preprint2022arXiv

Fabry-Pérot interference in 2D low-density Rashba gas

In mesoscopic electronic systems, the Fabry-Pérot (FP) oscillation is observed in various 1D devices. As for higher dimensions, numerous transverse channels usually lead to dephasing that quenches the overall oscillation of the conductance. Up to now, the FP oscillation in 2D electronic systems is only reported in graphene-based devices, and very recently, the \emph{pn} junctions of inverted InAs/GaSb double quantum well [Phys. Rev. X 10, 031007 (2020)]. In the latter, the band shape of a sombrero hat plays an essential role, which introduces a novel mechanism of electron-hole hybridization for the 2D FP oscillation. In this work, we propose that such a scenario can be generalized to the 2D planar junction composed of low-density Rashba gas, where the band bottom possesses a sombrero hat shape as well. We show that the backscattering between the outer and inner Fermi circles dominates the FP interference and significantly suppresses the dephasing effect between different transverse channels, which leads to a visible oscillation of the tunneling conductance. Specially, the visibility of the oscillating pattern can be enhanced by applying interface barriers, in contrast to that in the InAs/GaSb double quantum well. Our results provide a promising way for the implementation of the FP oscillation in the 2D electron gas.

preprint2022arXiv

Failure behaviors and processing maps with failure domains for hot compression of a powder metallurgy Ni-based superalloy

Processing maps are key to guiding the thermo-mechanical processing (TMP) of superalloys. However, traditional processing maps are incapable of delimiting failure, which is an essential factor to be concerned about during the TMP of superalloys. Employing isothermal hot compression experiments and finite element analysis (FEA), the present study examined the failure behaviors of a powder metallurgy (P/M) Ni-based superalloy and constructed processing maps with failure domains based on the predicted failure threshold. The micromechanical Gurson-Tvergaard-Needleman (GTN) damage model was employed in the FEA to model the cavity-driven intergranular fracture of the superalloy. Deformation temperature and strain rate were considered in the range of 1050 ~ 1150 C and 0.001 ~ 1 s-1, respectively. The FEA results reveal that the maximum tensile stress locates at the outer budging surfaces of the samples, which causes failure initiation and subsequent propagation into longitudinal cracks, being consistent with the experiments. It is further demonstrated that the failure is strain-controlled and the critical failure strain remains nearly insensitive to the range of strain rates considered while increasing with the increase of temperature in a third-order polynomial. Finally, an optimized processing window for hot deformation of the superalloy is formulated to warrant good hot workability while avoiding flow instability and failure. The present study offers direct insights into the failure behaviors of P/M Ni-based superalloys and details a modeling strategy to delineate optimized parametric spaces for the TMP of superalloys.

preprint2022arXiv

Federated Visualization: A Privacy-preserving Strategy for Aggregated Visual Query

We present a novel privacy preservation strategy for decentralized visualization. The key idea is to imitate the flowchart of the federated learning framework, and reformulate the visualization process within a federated infrastructure. The federation of visualization is fulfilled by leveraging a shared global module that composes the encrypted externalizations of transformed visual features of data pieces in local modules. We design two implementations of federated visualization: a prediction-based scheme, and a query-based scheme. We demonstrate the effectiveness of our approach with a set of visual forms, and verify its robustness with evaluations. We report the value of federated visualization in real scenarios with an expert review.

preprint2022arXiv

From hydro to jet quenching, coalescence and hadron cascade: a coupled approach to solving the $R_{AA}\otimes v_2$ puzzle

Hydrodynamics and jet quenching are responsible for the elliptic flow $v_2$ and suppression of large transverse momentum ($p_T$) hadrons, respectively, two of the most important phenomena leading to the discovery of a strongly coupled quark-gluon plasma (QGP) in high-energy heavy-ion collisions. A consistent description of the hadron suppression factor $R_{AA}$ and $v_2$, especially at intermediate $p_T$, however, remains a challenge. We solve this long-standing $R_{AA}\otimes v_2$ puzzle by including quark coalescence for hadronization and final state hadron cascade in the coupled linear Boltzmann transport-hydro model that combines concurrent jet transport and hydrodynamic evolution of the bulk medium. We illustrate that quark coalescence and hadron cascade, two keys to solving the puzzle, also lead to a splitting of $v_2$ for pions, kaons and protons in the intermediate $p_T$ region. We demonstrate for the first time that experimental data on $R_{AA}$, $v_2$ and their hadron flavor dependence from low to intermediate and high $p_T$ in high-energy heavy-ion collisions can be understood within this coupled framework.

preprint2022arXiv

Global Consistent Point Cloud Registration Based on Lie-algebraic Cohomology

We present a novel, effective method for global point cloud registration problems by geometric topology. Based on many point cloud pairwise registration methods (e.g ICP), we focus on the problem of accumulated error for the composition of transformations along any loops. The major technical contribution of this paper is a linear method for the elimination of errors, using only solving a Poisson equation. We demonstrate the consistency of our method from Hodge-Helmhotz decomposition theorem and experiments on multiple RGBD datasets of real-world scenes. The experimental results also demonstrate that our global registration method runs quickly and provides accurate reconstructions.

preprint2022arXiv

HetVis: A Visual Analysis Approach for Identifying Data Heterogeneity in Horizontal Federated Learning

Horizontal federated learning (HFL) enables distributed clients to train a shared model and keep their data privacy. In training high-quality HFL models, the data heterogeneity among clients is one of the major concerns. However, due to the security issue and the complexity of deep learning models, it is challenging to investigate data heterogeneity across different clients. To address this issue, based on a requirement analysis we developed a visual analytics tool, HetVis, for participating clients to explore data heterogeneity. We identify data heterogeneity through comparing prediction behaviors of the global federated model and the stand-alone model trained with local data. Then, a context-aware clustering of the inconsistent records is done, to provide a summary of data heterogeneity. Combining with the proposed comparison techniques, we develop a novel set of visualizations to identify heterogeneity issues in HFL. We designed three case studies to introduce how HetVis can assist client analysts in understanding different types of heterogeneity issues. Expert reviews and a comparative study demonstrate the effectiveness of HetVis.

preprint2022arXiv

HFUL: A Hybrid Framework for User Account Linkage across Location-Aware Social Networks

Sources of complementary information are connected when we link user accounts belonging to the same user across different platforms or devices. The expanded information promotes the development of a wide range of applications, such as cross-platform prediction, cross-platform recommendation, and advertisement. Due to the significance of user account linkage and the widespread popularization of GPS-enabled mobile devices, there are increasing research studies on linking user account with spatio-temporal data across location-aware social networks. Being different from most existing studies in this domain that only focus on the effectiveness, we propose a novel framework entitled HFUL (A Hybrid Framework for User Account Linkage across Location-Aware Social Networks), where efficiency, effectiveness, scalability, robustness, and application of user account linkage are considered. Specifically, to improve the efficiency, we develop a comprehensive index structure from the spatio-temporal perspective, and design novel pruning strategies to reduce the search space. To improve the effectiveness, a kernel density estimation-based method has been proposed to alleviate the data sparsity problem in measuring users' similarities. Additionally, we investigate the application of HFUL in terms of user prediction, time prediction, and location prediction. The extensive experiments conducted on three real-world datasets demonstrate the superiority of HFUL in terms of effectiveness, efficiency, scalability, robustness, and application compared with the state-of-the-art methods.

preprint2022arXiv

Higher order monotonicity and submodularity of influence in social networks: from local to global

Kempe, Kleinberg and Tardos (KKT) proposed the following conjecture about the general threshold model in social networks: local monotonicity and submodularity imply global monotonicity and submodularity. That is, if the threshold function of every node is monotone and submodular, then the spread function $σ(S)$ is monotone and submodular, where $S$ is a seed set and the spread function $σ(S)$ denotes the expected number of active nodes at termination of a diffusion process starting from $S$. The correctness of this conjecture has been proved by Mossel and Roch. In this paper, we first provide the concept AD-k (Alternating Difference-$k$) as a generalization of monotonicity and submodularity. Specifically, a set function $f$ is called \adk if all the $\ell$-th order differences of $f$ on all inputs have sign $(-1)^{\ell+1}$ for every $\ell\leq k$. Note that AD-1 corresponds to monotonicity and AD-2 corresponds to monotonicity and submodularity. We propose a refined version of KKT's conjecture: in the general threshold model, local AD-k implies global AD-k. The original KKT conjecture corresponds to the case for AD-2, and the case for AD-1 is the trivial one of local monotonicity implying global monotonicity. By utilizing continuous extensions of set functions as well as social graph constructions, we prove the correctness of our conjecture when the social graph is a directed acyclic graph (DAG). Furthermore, we affirm our conjecture on general social graphs when $k=\infty$.

preprint2022arXiv

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.

preprint2022arXiv

Measurement of interaction-dressed Berry curvature and quantum metric in solids by optical absorption

The quantum geometric properties of a Bloch state in momentum space are usually described by the Berry curvature and quantum metric. In realistic gapped materials where interactions and disorder render the Bloch state not a viable starting point, we generalize these concepts by introducing dressed Berry curvature and quantum metric at finite temperature, in which the effect of many-body interactions can be included perturbatively. These quantities are extracted from the charge polarization susceptibility caused by linearly or circularly polarized electric fields, whose spectral functions can be measured from momentum-resolved exciton or infrared absorption rate. As a concrete example, we investigate Chern insulators in the presence of impurity scattering, whose results suggest that the quantum geometric properties are protected by the energy gap against many-body interactions.

preprint2022arXiv

Mutual Distillation Learning Network for Trajectory-User Linking

Trajectory-User Linking (TUL), which links trajectories to users who generate them, has been a challenging problem due to the sparsity in check-in mobility data. Existing methods ignore the utilization of historical data or rich contextual features in check-in data, resulting in poor performance for TUL task. In this paper, we propose a novel Mutual distillation learning network to solve the TUL problem for sparse check-in mobility data, named MainTUL. Specifically, MainTUL is composed of a Recurrent Neural Network (RNN) trajectory encoder that models sequential patterns of input trajectory and a temporal-aware Transformer trajectory encoder that captures long-term time dependencies for the corresponding augmented historical trajectories. Then, the knowledge learned on historical trajectories is transferred between the two trajectory encoders to guide the learning of both encoders to achieve mutual distillation of information. Experimental results on two real-world check-in mobility datasets demonstrate the superiority of MainTUL against state-of-the-art baselines. The source code of our model is available at https://github.com/Onedean/MainTUL.

preprint2022arXiv

Network Inference and Influence Maximization from Samples

Influence maximization is the task of selecting a small number of seed nodes in a social network to maximize the influence spread from these seeds. It has been widely investigated in the past two decades. In the canonical setting, the social network and its diffusion parameters are given as input. In this paper, we consider the more realistic sampling setting where the network is unknown and we only have a set of passively observed cascades that record the sets of activated nodes at each diffusion step. We study the task of influence maximization from these cascade samples (IMS) and present constant approximation algorithms for it under mild conditions on the seed set distribution. To achieve the optimization goal, we also provide a novel solution to the network inference problem, that is, learning diffusion parameters and the network structure from the cascade data. Compared with prior solutions, our network inference algorithms require weaker assumptions and do not rely on maximum-likelihood estimation and convex programming. Our IMS algorithms enhance the learning-and-then-optimization approach by allowing a constant approximation ratio even when the diffusion parameters are hard to learn, and we do not need any assumption related to the network structure or diffusion parameters.

preprint2022arXiv

Neural Operator with Regularity Structure for Modeling Dynamics Driven by SPDEs

Stochastic partial differential equations (SPDEs) are significant tools for modeling dynamics in many areas including atmospheric sciences and physics. Neural Operators, generations of neural networks with capability of learning maps between infinite-dimensional spaces, are strong tools for solving parametric PDEs. However, they lack the ability to modeling SPDEs which usually have poor regularity due to the driving noise. As the theory of regularity structure has achieved great successes in analyzing SPDEs and provides the concept model feature vectors that well-approximate SPDEs&#39; solutions, we propose the Neural Operator with Regularity Structure (NORS) which incorporates the feature vectors for modeling dynamics driven by SPDEs. We conduct experiments on various of SPDEs including the dynamic Phi41 model and the 2d stochastic Navier-Stokes equation, and the results demonstrate that the NORS is resolution-invariant, efficient, and achieves one order of magnitude lower error with a modest amount of data.

preprint2022arXiv

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Neutron-induced nuclear recoil background is critical to the dark matter searches in the PandaX-4T liquid xenon experiment. This paper studies the feature of neutron background in liquid xenon and evaluates their contribution in the single scattering nuclear recoil events through three methods. The first method is fully Monte Carlo simulation based. The last two are data-driven methods that also use the multiple scattering signals and high energy signals in the data, respectively. In the PandaX-4T commissioning data with an exposure of 0.63 tonne-year, all these methods give a consistent result that there are $1.15\pm0.57$ neutron-induced background in dark matter signal region within an approximated nuclear recoil energy window between 5 and 100 keV.

preprint2022arXiv

New hadron configuration: The double-gluon hybrid state

This is the first study on the double-gluon hybrid, which consists of one valence quark and one valence antiquark together with two valence gluons. We concentrate on the one with the exotic quantum number $J^{PC} = 2^{+-}$ that conventional $\bar q q$ mesons can not reach. We apply QCD sum rule method to evaluate its mass to be $2.26^{+0.20}_{-0.25}$ GeV, and study its possible decay patterns. Especially, its three-meson decay patterns are generally not suppressed severely compared to two-meson decay patterns, so the $S$-wave three-meson decay channels $f_1ωπ/f_1ρπ$ can be useful in identifying its nature, which is of particular importance to the direct test of QCD in the low energy sector.

preprint2022arXiv

Newly observed $a_0(1817)$ as the scaling point of constructing the scalar meson spectroscopy

Stimulated by the newly observed $a_0(1817)$ by the BESIII Collaboration, we find a perfect Regge trajectory composed of the $a_0(980)$, $a_0(1450)$, and $a_0(1817)$, which leads us to categorize the $a_0(980)$, $a_0(1450)$, and $a_0(1817)$ into the isovector scalar meson family. This scenario is supported by their two-body Okubo-Zweig-Iizuka allowed strong decay behaviors. In this scheme, we also predict the third radial excitation of the $a_0(980)$, which is denoted as the $a_0(2115)$, accessible at future experiment as a direct test of this assignment. We find another Regge trajectory which contains three isoscalar scalar states $f_0(980)$, $X(1812)$, and $f_0(2100)$. We investigate their two-body Okubo-Zweig-Iizuka allowed strong decay patterns, which are roughly consistent with the experimental data. The $f_0(980)$, $X(1812)$, and $f_0(2100)$ can be well grouped into the isoscalar scalar meson family. We want to emphasize that these two Regge trajectories have a similar slope. In summary, the present work provides a scheme of constructing the scalar meson family based on these reported scalar states. The possibility of the $f_0(1710)$ as the candidate of the scalar glueball cannot be excluded by the observation of the $a_0(1817)$ since the $a_0(1817)$ is more suitable as the isovector partner of the $X(1812)$.

preprint2022arXiv

Normalized/Clipped SGD with Perturbation for Differentially Private Non-Convex Optimization

By ensuring differential privacy in the learning algorithms, one can rigorously mitigate the risk of large models memorizing sensitive training data. In this paper, we study two algorithms for this purpose, i.e., DP-SGD and DP-NSGD, which first clip or normalize \textit{per-sample} gradients to bound the sensitivity and then add noise to obfuscate the exact information. We analyze the convergence behavior of these two algorithms in the non-convex optimization setting with two common assumptions and achieve a rate $\mathcal{O}\left(\sqrt[4]{\frac{d\log(1/δ)}{N^2ε^2}}\right)$ of the gradient norm for a $d$-dimensional model, $N$ samples and $(ε,δ)$-DP, which improves over previous bounds under much weaker assumptions. Specifically, we introduce a regularizing factor in DP-NSGD and show that it is crucial in the convergence proof and subtly controls the bias and noise trade-off. Our proof deliberately handles the per-sample gradient clipping and normalization that are specified for the private setting. Empirically, we demonstrate that these two algorithms achieve similar best accuracy while DP-NSGD is comparatively easier to tune than DP-SGD and hence may help further save the privacy budget when accounting the tuning effort.

preprint2022arXiv

On the Phases of a Semi-Sectorial Matrix

In this paper, we extend the definition of phases of sectorial matrices to those of semi-sectorial matrices, which are possibly singular. Properties of the phases are also extended, including those of the Moore-Penrose generalized inverse, compressions and Schur complements, matrix sums and products. In particular, a majorization relation is established between the phases of the nonzero eigenvalues of $AB$ and the phases of the compressions of $A$ and $B$, which leads to a generalized matrix small phase theorem. For the matrices which are not necessarily semi-sectorial, we define their (largest and smallest) essential phases via diagonal similarity transformation. An explicit expression for the essential phases of a Laplacian matrix of a directed graph is obtained.

preprint2022arXiv

Online Competitive Influence Maximization

Online influence maximization has attracted much attention as a way to maximize influence spread through a social network while learning the values of unknown network parameters. Most previous works focus on single-item diffusion. In this paper, we introduce a new Online Competitive Influence Maximization (OCIM) problem, where two competing items (e.g., products, news stories) propagate in the same network and influence probabilities on edges are unknown. We adopt a combinatorial multi-armed bandit (CMAB) framework for OCIM, but unlike the non-competitive setting, the important monotonicity property (influence spread increases when influence probabilities on edges increase) no longer holds due to the competitive nature of propagation, which brings a significant new challenge to the problem. We provide a nontrivial proof showing that the Triggering Probability Modulated (TPM) condition for CMAB still holds in OCIM, which is instrumental for our proposed algorithms OCIM-TS and OCIM-OFU to achieve sublinear Bayesian and frequentist regret, respectively. We also design an OCIM-ETC algorithm that requires less feedback and easier offline computation, at the expense of a worse frequentist regret bound. Experimental evaluations demonstrate the effectiveness of our algorithms.

preprint2022arXiv

Online Influence Maximization under the Independent Cascade Model with Node-Level Feedback

We study the online influence maximization (OIM) problem in social networks, where the learner repeatedly chooses seed nodes to generate cascades, observes the cascade feedback, and gradually learns the best seeds that generate the largest cascade in multiple rounds. In the demand of the real world, we work with node-level feedback instead of the common edge-level feedback in the literature. The edge-level feedback reveals all edges that pass through information in a cascade, whereas the node-level feedback only reveals the activated nodes with timestamps. The node-level feedback is arguably more realistic since in practice it is relatively easy to observe who is influenced but very difficult to observe from which relationship (edge) the influence comes. Previously, there is a nearly optimal $\tilde{O}(\sqrt{T})$-regret algorithm for OIM problem under the linear threshold (LT) diffusion model with node-level feedback. It remains unknown whether the same algorithm exists for the independent cascade (IC) diffusion model. In this paper, we resolve this open problem by presenting an $\tilde{O}(\sqrt{T})$-regret algorithm for OIM problem under the IC model with node-level feedback.

preprint2022arXiv

Optimal $(0,1)$-Matrix Completion with Majorization Ordered Objectives (To the memory of Pravin Varaiya)

We propose and examine two optimal $(0,1)$-matrix completion problems with majorization ordered objectives. They elevate the seminal study by Gale and Ryser from feasibility to optimality in partial order programming (POP), referring to optimization with partially ordered objectives. We showcase their applications in electric vehicle charging, portfolio optimization, and secure data storage. Solving such integer POP (iPOP) problems is challenging because of the possible non-comparability among objective values and the integer requirements. Nevertheless, we prove the essential uniqueness of all optimal objective values and identify two particular ones for each of the two inherently symmetric iPOP problems. Furthermore, for every optimal objective value, we decompose the construction of an associated optimal~$(0,1)$-matrix into a series of sorting processes, respectively agreeing with the rule of thumb &#34;peak shaving&#34; or &#34;valley filling.&#34; We show that the resulting algorithms have linear time complexities and verify their empirical efficiency via numerical simulations compared to the standard order-preserving method for POP.

preprint2022arXiv

Quasi-periodic Accelerations of Energetic Particles during a Solar Flare

We report the observation of non-stationary Quasi-Periodic Pulsations (QPPs) in high-energy particles during the impulsive phase of an X4.8 flare on 2002 July 23 (SOL2002-07-23T00:35). The X4.8 flare was simultaneously measured by the Reuven Ramaty High Energy Solar Spectroscopic Imager, Nobeyama Radio Polarimeters, and Nobeyama Radioheliograph. The quasi-period of about 50 s, determined by the wavelet transform, is detected in the Gamma-ray line emission. Using the same method, a quasi-period of about 90 s is found in Gamma-ray continuum, hard X-ray (HXR) and radio emissions during almost the same time. Our observations suggest that the flare QPPs should be associated with energetic ions and nonthermal electrons that quasi-periodically accelerated by the repetitive magnetic reconnection. The different quasi-periods between Gamma-ray line and continuum/HXR/radio emissions indicate an apparent difference in acceleration or propagation between energetic ions and nonthermal electrons of this solar flare.

preprint2022arXiv

Remixing Functionally Graded Structures: Data-Driven Topology Optimization with Multiclass Shape Blending

To create heterogeneous, multiscale structures with unprecedented functionalities, recent topology optimization approaches design either fully aperiodic systems or functionally graded structures, which compete in terms of design freedom and efficiency. We propose to inherit the advantages of both through a data-driven framework for multiclass functionally graded structures that mixes several families, i.e., classes, of microstructure topologies to create spatially-varying designs with guaranteed feasibility. The key is a new multiclass shape blending scheme that generates smoothly graded microstructures without requiring compatible classes or connectivity and feasibility constraints. Moreover, it transforms the microscale problem into an efficient, low-dimensional one without confining the design to predefined shapes. Compliance and shape matching examples using common truss geometries and diversity-based freeform topologies demonstrate the versatility of our framework, while studies on the effect of the number and diversity of classes illustrate the effectiveness. The generality of the proposed methods supports future extensions beyond the linear applications presented.

preprint2022arXiv

REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

The recommendation system, relying on historical observational data to model the complex relationships among the users and items, has achieved great success in real-world applications. Selection bias is one of the most important issues of the existing observational data based approaches, which is actually caused by multiple types of unobserved exposure strategies (e.g. promotions and holiday effects). Though various methods have been proposed to address this problem, they are mainly relying on the implicit debiasing techniques but not explicitly modeling the unobserved exposure strategies. By explicitly Reconstructing Exposure STrategies (REST in short), we formalize the recommendation problem as the counterfactual reasoning and propose the debiased social recommendation method. In REST, we assume that the exposure of an item is controlled by the latent exposure strategies, the user, and the item. Based on the above generation process, we first provide the theoretical guarantee of our method via identification analysis. Second, we employ a variational auto-encoder to reconstruct the latent exposure strategies, with the help of the social networks and the items. Third, we devise a counterfactual reasoning based recommendation algorithm by leveraging the recovered exposure strategies. Experiments on four real-world datasets, including three published datasets and one private WeChat Official Account dataset, demonstrate significant improvements over several state-of-the-art methods.

preprint2022arXiv

Search for the elusive jet-induced diffusion wake in $Z/γ$-jets with 2D jet tomography in high-energy heavy-ion collisions

Diffusion wake is an unambiguous part of the jet-induced medium response in high-energy heavy-ion collisions that leads to a depletion of soft hadrons in the opposite direction of the jet propagation. New experimental data on $Z$-hadron correlation in Pb+Pb collisions at the Large Hadron Collider show, however, an enhancement of soft hadrons in the direction of both the $Z$ and the jet. Using a coupled linear Boltzmann transport and hydro model, we demonstrate that medium modification of partons from the initial multiple parton interaction (MPI) gives rise to a soft hadron enhancement that is uniform in azimuthal angle while jet-induced medium response and soft gluon radiation dominate the enhancement in the jet direction. After subtraction of the contributions from MPI with a mixed-event procedure, the diffusion wake becomes visible in the near-side $Z$-hadron correlation. We further employ the longitudinal and transverse gradient jet tomography for the first time to localize the initial jet production positions in $Z/γ$-jet events in which the effect of the diffusion wake is apparent in $Z/γ$-hadron correlation even without the subtraction of MPI.

preprint2022arXiv

Shape-Aware Monocular 3D Object Detection

The detection of 3D objects through a single perspective camera is a challenging issue. The anchor-free and keypoint-based models receive increasing attention recently due to their effectiveness and simplicity. However, most of these methods are vulnerable to occluded and truncated objects. In this paper, a single-stage monocular 3D object detection model is proposed. An instance-segmentation head is integrated into the model training, which allows the model to be aware of the visible shape of a target object. The detection largely avoids interference from irrelevant regions surrounding the target objects. In addition, we also reveal that the popular IoU-based evaluation metrics, which were originally designed for evaluating stereo or LiDAR-based detection methods, are insensitive to the improvement of monocular 3D object detection algorithms. A novel evaluation metric, namely average depth similarity (ADS) is proposed for the monocular 3D object detection models. Our method outperforms the baseline on both the popular and the proposed evaluation metrics while maintaining real-time efficiency.

preprint2022arXiv

Study of background from accidental coincidence signals in the PandaX-II experiment

The PandaX-II experiment employed a 580kg liquid xenon detector to search for the interactions between dark matter particles and the target xenon atoms. The accidental coincidences of isolated signals result in a dangerous background which mimic the signature of the dark matter. We performed a detailed study on the accidental coincidence background in PandaX-II, including the possible origin of the isolated signals, the background level and corresponding background suppression method. With a boosted-decision-tree algorithm, the accidental coincidence background is reduced by 70% in the dark matter signal region, thus the sensitivity of dark matter search at PandaX-II is improved.

preprint2022arXiv

Symmetry-enforced nodal lines in the band structures of vacancy-engineered graphene

We elaborate that single-layer graphene with periodic vacancies can have a band structure containing nodal lines or nodal loops, opening the possibility of graphene-based electronic or spintronic devices with novel functionalities. The principle is that by removing carbon atoms such that the lattice becomes nonsymmorphic, every two sublattices in the unit cell will map to each other under glide plane operation. This mapping yields degenerate eigenvalues for the glide plane operation, which guarantees that the energy bands must stick together pairwise at a boundary of the Brillouin zone. Moving away from the Brillouin zone boundary causes the symmetry-enforced nodal lines to split, resulting in accidental nodal lines caused by the crossings of the split bands. Moreover, the density of states at the Fermi level may be dramatically enhanced if the nodal lines crosses the Fermi level. The nodal lines occur a variety of vacancy configurations even in the presence of Rashba spin-orbit coupling. Finally, our theory also explains the nodal loops surrounding the entire Brillouin zone of a chevron-type nanoporous graphene fabricated in a recent experiment.

preprint2022arXiv

Thompson Sampling for Combinatorial Semi-Bandits

In this paper, we study the application of the Thompson sampling (TS) methodology to the stochastic combinatorial multi-armed bandit (CMAB) framework. We first analyze the standard TS algorithm for the general CMAB model when the outcome distributions of all the base arms are independent, and obtain a distribution-dependent regret bound of $O(m\log K_{\max}\log T / Δ_{\min})$, where $m$ is the number of base arms, $K_{\max}$ is the size of the largest super arm, $T$ is the time horizon, and $Δ_{\min}$ is the minimum gap between the expected reward of the optimal solution and any non-optimal solution. This regret upper bound is better than the $O(m(\log K_{\max})^2\log T / Δ_{\min})$ bound in prior works. Moreover, our novel analysis techniques can help to tighten the regret bounds of other existing UCB-based policies (e.g., ESCB), as we improve the method of counting the cumulative regret. Then we consider the matroid bandit setting (a special class of CMAB model), where we could remove the independence assumption across arms and achieve a regret upper bound that matches the lower bound. Except for the regret upper bounds, we also point out that one cannot directly replace the exact offline oracle (which takes the parameters of an offline problem instance as input and outputs the exact best action under this instance) with an approximation oracle in TS algorithm for even the classical MAB problem. Finally, we use some experiments to show the comparison between regrets of TS and other existing algorithms, the experimental results show that TS outperforms existing baselines.

preprint2022arXiv

Tighter monogamy relations for the Tsallis-q and Rényi-$α$ entanglement in multiqubit systems

Monogamy relations characterize the distributions of quantum entanglement in multipartite systems. In this work, we present some tighter monogamy relations in terms of the power of the Tsallis-q and Rényi-$α$ entanglement in multipartite systems. We show that these new monogamy relations of multipartite entanglement with tighter lower bounds than the existing ones. Furthermore, three examples are given to illustrate the tightness.

preprint2022arXiv

Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

Domain adaptation on time-series data is often encountered in the industry but received limited attention in academia. Most of the existing domain adaptation methods for time-series data borrow the ideas from the existing methods for non-time series data to extract the domain-invariant representation. However, two peculiar difficulties to time-series data have not been solved. 1) It is not a trivial task to model the domain-invariant and complex dependence among different timestamps. 2) The domain-variant information is important but how to leverage them is almost underexploited. Fortunately, the stableness of causal structures among different domains inspires us to explore the structures behind the time-series data. Based on this inspiration, we investigate the domain-invariant unweighted sparse associative structures and the domain-variant strengths of the structures. To achieve this, we propose Sparse Associative structure alignment by learning Invariance and Variance (SASA-IV in short), a model that simultaneously aligns the invariant unweighted spare associative structures and considers the variant information for time-series unsupervised domain adaptation. Technologically, we extract the domain-invariant unweighted sparse associative structures with a unidirectional alignment restriction and embed the domain-variant strengths via a well-designed autoregressive module. Experimental results not only testify that our model yields state-of-the-art performance on three real-world datasets but also provide some insightful discoveries on knowledge transfer.

preprint2022arXiv

TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed

Continuous normalizing flows (CNFs) construct invertible mappings between an arbitrary complex distribution and an isotropic Gaussian distribution using Neural Ordinary Differential Equations (neural ODEs). It has not been tractable on large datasets due to the incremental complexity of the neural ODE training. Optimal Transport theory has been applied to regularize the dynamics of the ODE to speed up training in recent works. In this paper, a temporal optimization is proposed by optimizing the evolutionary time for forward propagation of the neural ODE training. In this appoach, we optimize the network weights of the CNF alternately with evolutionary time by coordinate descent. Further with temporal regularization, stability of the evolution is ensured. This approach can be used in conjunction with the original regularization approach. We have experimentally demonstrated that the proposed approach can significantly accelerate training without sacrifying performance over baseline models.

preprint2022arXiv

Towards Efficient Visual Simplification of Computational Graphs in Deep Neural Networks

A computational graph in a deep neural network (DNN) denotes a specific data flow diagram (DFD) composed of many tensors and operators. Existing toolkits for visualizing computational graphs are not applicable when the structure is highly complicated and large-scale (e.g., BERT [1]). To address this problem, we propose leveraging a suite of visual simplification techniques, including a cycle-removing method, a module-based edge-pruning algorithm, and an isomorphic subgraph stacking strategy. We design and implement an interactive visualization system that is suitable for computational graphs with up to 10 thousand elements. Experimental results and usage scenarios demonstrate that our tool reduces 60% elements on average and hence enhances the performance for recognizing and diagnosing DNN models. Our contributions are integrated into an open-source DNN visualization toolkit, namely, MindInsight [2].

preprint2022arXiv

Two Coupled Rejection Metrics Can Tell Adversarial Examples Apart

Correctly classifying adversarial examples is an essential but challenging requirement for safely deploying machine learning models. As reported in RobustBench, even the state-of-the-art adversarially trained models struggle to exceed 67% robust test accuracy on CIFAR-10, which is far from practical. A complementary way towards robustness is to introduce a rejection option, allowing the model to not return predictions on uncertain inputs, where confidence is a commonly used certainty proxy. Along with this routine, we find that confidence and a rectified confidence (R-Con) can form two coupled rejection metrics, which could provably distinguish wrongly classified inputs from correctly classified ones. This intriguing property sheds light on using coupling strategies to better detect and reject adversarial examples. We evaluate our rectified rejection (RR) module on CIFAR-10, CIFAR-10-C, and CIFAR-100 under several attacks including adaptive ones, and demonstrate that the RR module is compatible with different adversarial training frameworks on improving robustness, with little extra computation. The code is available at https://github.com/P2333/Rectified-Rejection.

preprint2022arXiv

Unbalanced-basis-misalignment tolerant measurement-device-independent quantum key distribution

Measurement-device-independent quantum key distribution (MDIQKD) is a revolutionary protocol since it is physically immune to all attacks on the detection side. However, the protocol still keeps the strict assumptions on the source side that the four BB84-states must be perfectly prepared to ensure security. Some protocols release part of the assumptions in the encoding system to keep the practical security, but the performance would be dramatically reduced. In this work, we present a MDIQKD protocol that requires less knowledge of encoding system to combat the troublesome modulation errors and fluctuations. We have also experimentally demonstrated the protocol. The result indicates the high-performance and good security for its practical applications. Besides, its robustness and flexibility exhibit a good value for complex scenarios such as the QKD networks.

preprint2022arXiv

Using Deep Learning with Large Aggregated Datasets for COVID-19 Classification from Cough

The Covid-19 pandemic has been one of the most devastating events in recent history, claiming the lives of more than 5 million people worldwide. Even with the worldwide distribution of vaccines, there is an apparent need for affordable, reliable, and accessible screening techniques to serve parts of the World that do not have access to Western medicine. Artificial Intelligence can provide a solution utilizing cough sounds as a primary screening mode for COVID-19 diagnosis. This paper presents multiple models that have achieved relatively respectable performance on the largest evaluation dataset currently presented in academic literature. Through investigation of a self-supervised learning model (Area under the ROC curve, AUC = 0.807) and a convolutional nerual network (CNN) model (AUC = 0.802), we observe the possibility of model bias with limited datasets. Moreover, we observe that performance increases with training data size, showing the need for the worldwide collection of data to help combat the Covid-19 pandemic with non-traditional means.

preprint2022arXiv

Wireless Image Transmission Using Deep Source Channel Coding With Attention Modules

Recent research on joint source channel coding (JSCC) for wireless communications has achieved great success owing to the employment of deep learning (DL). However, the existing work on DL based JSCC usually trains the designed network to operate under a specific signal-to-noise ratio (SNR) regime, without taking into account that the SNR level during the deployment stage may differ from that during the training stage. A number of networks are required to cover the scenario with a broad range of SNRs, which is computational inefficiency (in the training stage) and requires large storage. To overcome these drawbacks our paper proposes a novel method called Attention DL based JSCC (ADJSCC) that can successfully operate with different SNR levels during transmission. This design is inspired by the resource assignment strategy in traditional JSCC, which dynamically adjusts the compression ratio in source coding and the channel coding rate according to the channel SNR. This is achieved by resorting to attention mechanisms because these are able to allocate computing resources to more critical tasks. Instead of applying the resource allocation strategy in traditional JSCC, the ADJSCC uses the channel-wise soft attention to scaling features according to SNR conditions. We compare the ADJSCC method with the state-of-the-art DL based JSCC method through extensive experiments to demonstrate its adaptability, robustness and versatility. Compared with the existing methods, the proposed method takes less storage and is more robust in the presence of channel mismatch.

preprint2021arXiv

BézierGAN: Automatic Generation of Smooth Curves from Interpretable Low-Dimensional Parameters

Many real-world objects are designed by smooth curves, especially in the domain of aerospace and ship, where aerodynamic shapes (e.g., airfoils) and hydrodynamic shapes (e.g., hulls) are designed. To facilitate the design process of those objects, we propose a deep learning based generative model that can synthesize smooth curves. The model maps a low-dimensional latent representation to a sequence of discrete points sampled from a rational Bézier curve. We demonstrate the performance of our method in completing both synthetic and real-world generative tasks. Results show that our method can generate diverse and realistic curves, while preserving consistent shape variation in the latent space, which is favorable for latent space design optimization or design space exploration.

preprint2021arXiv

BN-invariant sharpness regularizes the training model to better generalization

It is arguably believed that flatter minima can generalize better. However, it has been pointed out that the usual definitions of sharpness, which consider either the maxima or the integral of loss over a $δ$ ball of parameters around minima, cannot give consistent measurement for scale invariant neural networks, e.g., networks with batch normalization layer. In this paper, we first propose a measure of sharpness, BN-Sharpness, which gives consistent value for equivalent networks under BN. It achieves the property of scale invariance by connecting the integral diameter with the scale of parameter. Then we present a computation-efficient way to calculate the BN-sharpness approximately i.e., one dimensional integral along the &#34;sharpest&#34; direction. Furthermore, we use the BN-sharpness to regularize the training and design an algorithm to minimize the new regularized objective. Our algorithm achieves considerably better performance than vanilla SGD over various experiment settings.

preprint2021arXiv

Dark Matter Search Results from the PandaX-4T Commissioning Run

We report the first dark matter search results using the commissioning data from PandaX-4T. Using a time projection chamber with 3.7-tonne of liquid xenon target and an exposure of 0.63 tonne$\cdot$year, 1058 candidate events are identified within an approximate nuclear recoil energy window between 5 and 100 keV. No significant excess over background is observed. Our data set a stringent limit to the dark matter-nucleon spin-independent interactions, with a lowest excluded cross section (90% C.L.) of $3.8\times10^{-47} $cm$^2$ at a dark matter mass of 30 GeV/$c^2$.

preprint2021arXiv

Deep Generative Model for Efficient 3D Airfoil Parameterization and Generation

In aerodynamic shape optimization, the convergence and computational cost are greatly affected by the representation capacity and compactness of the design space. Previous research has demonstrated that using a deep generative model to parameterize two-dimensional (2D) airfoils achieves high representation capacity/compactness, which significantly benefits shape optimization. In this paper, we propose a deep generative model, Free-Form Deformation Generative Adversarial Networks (FFD-GAN), that provides an efficient parameterization for three-dimensional (3D) aerodynamic/hydrodynamic shapes like aircraft wings, turbine blades, car bodies, and hulls. The learned model maps a compact set of design variables to 3D surface points representing the shape. We ensure the surface smoothness and continuity of generated geometries by incorporating an FFD layer into the generative model. We demonstrate FFD-GAN&#39;s performance using a wing shape design example. The results show that FFD-GAN can generate realistic designs and form a reasonable parameterization. We further demonstrate FFD-GAN&#39;s high representation compactness and capacity by testing its design space coverage, the feasibility ratio of the design space, and its performance in design optimization. We demonstrate that over 94% feasibility ratio is achieved among wings randomly generated by the FFD-GAN, while FFD and B-spline only achieve less than 31%. We also show that the FFD-GAN leads to an order of magnitude faster convergence in a wing shape optimization problem, compared to the FFD and the B-spline parameterizations.

preprint2021arXiv

Establishing the first hidden-charm pentaquark with strangeness

We study the $P_{cs}(4459)^0$ recently observed by LHCb using the method of QCD sum rules. Our results support its interpretation as the $\bar D^* Ξ_c$ hadronic molecular state of either $J^P=1/2^-$ or $3/2^-$. Within the hadronic molecular picture, the three LHCb experiments observing $P_c$ and $P_{cs}$ states \cite{lhcb,Aaij:2015tga,Aaij:2019vzc} can be well understood as a whole. This strongly supports the existence of hadronic molecules, whose studies can significantly improve our understanding on the construction of the subatomic world. To verify this picture, we propose to further investigate the $P_{cs}(4459)^0$ to examine whether it can be separated into two states, and to search for the $\bar D Ξ_c$ molecular state of $J^P=1/2^-$.

preprint2021arXiv

How Does Data Augmentation Affect Privacy in Machine Learning?

It is observed in the literature that data augmentation can significantly mitigate membership inference (MI) attack. However, in this work, we challenge this observation by proposing new MI attacks to utilize the information of augmented data. MI attack is widely used to measure the model&#39;s information leakage of the training set. We establish the optimal membership inference when the model is trained with augmented data, which inspires us to formulate the MI attack as a set classification problem, i.e., classifying a set of augmented instances instead of a single data point, and design input permutation invariant features. Empirically, we demonstrate that the proposed approach universally outperforms original methods when the model is trained with data augmentation. Even further, we show that the proposed approach can achieve higher MI attack success rates on models trained with some data augmentation than the existing methods on models trained without data augmentation. Notably, we achieve a 70.1% MI attack success rate on CIFAR10 against a wide residual network while the previous best approach only attains 61.9%. This suggests the privacy risk of models trained with data augmentation could be largely underestimated.

preprint2021arXiv

Internal Calibration of the PandaX-II Detector with Radon Gaseous Sources

We have developed a low-energy electron recoil (ER) calibration method with $^{220}$Rn for the PandaX-II detector. $^{220}$Rn, emanated from natural thorium compounds, was fed into the detector through the xenon purification system. From 2017 to 2019, we performed three dedicated calibration campaigns with different radon sources. We studied the detector response to $α$, $β$, and $γ$ particles with focus on low energy ER events. During the runs in 2017 and 2018, the amount of radioactivity of $^{222}$Rn were on the order of 1\% of that of $^{220}$Rn and thorium particulate contamination was negligible, especially in 2018. We also measured the background contribution from $^{214}$Pb for the first time in PandaX-II with the help from a $^{222}$Rn injection. Calibration strategy with $^{220}$Rn and $^{222}$Rn will be implemented in the upcoming PandaX-4T experiment and can be useful for other xenon-based detectors as well.

preprint2021arXiv

Light tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$

We apply the method of QCD sum rules to study the $s q \bar s \bar q$ tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$, and extract mass of the lowest-lying state to be $2.33^{+0.19}_{-0.16}$ GeV. To construct the relevant tetraquark currents we need to explicitly add the covariant derivative operator. Our systematical analysis on their relevant interpolating currents indicates that: a) this state well decays into the $P$-wave $ρϕ/ωϕ$ channel but not into the $ρf_2(1525)/ωf_2(1525)/ϕf_2(1270)$ channels, and b) it well decays into the $K^*(892) \bar K_2^*(1430)$ channel but not into the $P$-wave $K^*(892) \bar K^*(892)$ channel.

preprint2021arXiv

Light yield and field dependence measurement in PandaX-II dual-phase xenon detector

The dual-phase xenon time projection chamber (TPC) is one of the most sensitive detector technology for dark matter direct search, where the energy deposition of incoming particle can be converted into photons and electrons through xenon excitation and ionization. The detector response to signal energy deposition varies significantly with the electric field in liquid xenon. We study the detector&#39;s light yield and its dependence on the electric field in the PandaX-II dual-phase detector containing 580~kg liquid xenon in the sensitive volume. From our measurements, the light yield at electric fields from 0~V/cm to 317~V/cm is obtained for energy depositions up to 236~keV.

preprint2021arXiv

Mechanical Cloak via Data-Driven Aperiodic Metamaterial Design

Mechanical cloaks are materials engineered to manipulate the elastic response around objects to make them indistinguishable from their homogeneous surroundings. Typically, methods based on material-parameter transformations are used to design optical, thermal and electric cloaks. However, they are not applicable in designing mechanical cloaks, since continuum-mechanics equations are not form-invariant under general coordinate transformations. As a result, existing design methods for mechanical cloaks have so far been limited to a narrow selection of voids with simple shapes. To address this challenge, we present a systematic, data-driven design approach to create mechanical cloaks composed of aperiodic metamaterials using a large pre-computed unit cell database. Our method is flexible to allow the design of cloaks with various boundary conditions, multiple loadings, different shapes and numbers of voids, and different homogeneous surroundings. It enables a concurrent optimization of both topology and properties distribution of the cloak. Compared to conventional fixed-shape solutions, this results in an overall better cloaking performance, and offers unparalleled versatility. Experimental measurements on 3D-printed structures further confirm the validity of the proposed approach. Our research illustrates the benefits of data-driven approaches in quickly responding to new design scenarios and resolving the computational challenge associated with multiscale designs of functional structures. It could be generalized to accommodate other applications that require heterogeneous property distribution, such as soft robots and implants design.

preprint2021arXiv

On-chip integrated waveguide amplifiers on Erbium-doped thin film lithium niobate on insulator

We demonstrate on-chip light amplification with integrated optical waveguide fabricated on erbium-doped thin film lithium niobate on insulator (TFLNOI) using the photolithography assisted chemo-mechanical etching (PLACE) technique. A maximum internal net gain of 18 dB in the small-signal-gain regime is measured at the peak emission wavelength of 1530 nm for a waveguide length of 3.6 cm, indicating a differential gain per unit length of 5 dB/cm. This work paves the way to the monolithic integration of diverse active and passive photonic components on the TFLNOI platform.

preprint2021arXiv

Quantum key distribution over scattering channel

Scattering of light by cloud, haze, and fog decreases the transmission efficiency of communication channels in quantum key distribution (QKD), reduces the system&#39;s practical security, and thus constrains the deployment of free-space QKD. Here, we employ the wavefront shaping technology to compensate distorted optical signals in high-loss scattering quantum channels and fulfill a polarization-encoded BB84 QKD experiment. With this quantum channel compensation technology, we achieve a typical enhancement of about 250 in transmission efficiency and improve the secure key rate from 0 to $1.85\times10^{-6}$ per sifted key. The method and its first time validation show the great potential to expand the territory of QKD systems from lossless channels to highly scattered ones and therefore enhances the deployment ability of global quantum communication network.

preprint2021arXiv

Results of Dark Matter Search using the Full PandaX-II Exposure

We report the dark matter search results obtained using the full 132 ton$\cdot$day exposure of the PandaX-II experiment, including all data from March 2016 to August 2018. No significant excess of events is identified above the expected background. Upper limits are set on the spin-independent dark matter-nucleon interactions. The lowest 90% confidence level exclusion on the spin-independent cross section is $2.2\times 10^{-46}$ cm$^2$ at a WIMP mass of 30 GeV/$c^2$.

preprint2021arXiv

Room temperature ferromagnetism of monolayer chromium telluride with perpendicular magnetic anisotropy

The realization of long-range magnetic ordering in two-dimensional (2D) systems can potentially revolutionize next-generation information technology. Here, we report the successful fabrication of crystalline Cr3Te4 monolayers with room temperature ferromagnetism. Using molecular beam epitaxy, the growth of 2D Cr3Te4 films with monolayer thickness is demonstrated at low substrate temperatures (~100C), compatible with Si CMOS technology. X-ray magnetic circular dichroism measurements reveal a Curie temperature (Tc) of ~344 K for the Cr3Te4 monolayer with an out-of-plane magnetic easy axis, which decreases to ~240 K for the thicker film (~ 7 nm) with an in-plane easy axis. The enhancement of ferromagnetic coupling and the magnetic anisotropy transition is ascribed to interfacial effects, in particular the orbital overlap at the monolayer Cr3Te4/graphite interface, supported by density-functional theory calculations. This work sheds light on the low-temperature scalable growth of 2D nonlayered materials with room temperature ferromagnetism for new magnetic and spintronic devices.

preprint2021arXiv

Security Analysis and Improvement of Source Independent Quantum Random Number Generators with Imperfect Devices

A quantum random number generator (QRNG) as a genuine source of randomness is essential in many applications, such as number simulation and cryptography. Recently, a source-independent quantum random number generator (SI-QRNG), which can generate secure random numbers with untrusted sources, has been realized. However, the measurement loopholes of the trusted but imperfect devices used in SI-QRNGs have not yet been fully explored, which will cause security problems, especially in high-speed systems. Here, we point out and evaluate the security loopholes of practical imperfect measurement devices in SI-QRNGs. We also provide corresponding countermeasures to prevent these information leakages by recalculating the conditional minimum entropy and adding a monitor. Furthermore, by taking into account the finite-size effect,we show that the influence of the afterpulse can exceed that of the finite-size effect with the large number of sampled rounds. Our protocol is simple and effective, and it promotes the security of SI-QRNG in practice as well as the compatibility with high-speed measurement devices, thus paving the way for constructing ultrafast and security-certified commercial SI-QRNG systems.

preprint2021arXiv

Toward the existence of odderon as a three-gluon bound state

Inspired by the evidence of the odderon exchange recently observed by the D0 and TOTEM Collaborations, a QCD sum rule investigation is performed to study the odderon as a three-gluon bound state. There may exist six lowest-lying three-gluon odderons with the quantum numbers $J^{PC} = 1/2/3^{\pm-}$. We systematically construct their interpolating currents, and calculate their mass spectra. To verify their existence, we propose to search for the spin-3 odderons in their $VVV$ and $VVP$ decay channels directly at LHC, with $V$ and $P$ light vector and pseudoscalar mesons respectively.

preprint2021arXiv

Two- and three-gluon glueballs of $C=+$

We study two- and three-gluon glueballs of $C=+$ using the method of QCD sum rules. We systematically construct their interpolating currents, and find that all the spin-1 currents of $C=+$ vanish. This suggests that the ``ground-state&#39;&#39; spin-1 glueballs of $C=+$ do not exist within the relativistic framework. We calculate masses of the two-gluon glueballs with $J^{PC} = 0^{\pm+}/2^{\pm+}$ and the three-gluon glueballs with $J^{PC} = 0^{\pm+}/2^{\pm+}$. We propose to search for the $J^{PC} = 0^{-+}/2^{-\pm}/3^{\pm-}$ three-gluon glueballs in their three-meson decay channels in future BESIII, GlueX, LHC, and PANDA experiments.

preprint2021arXiv

UAV-Assisted Over-the-Air Computation

Over-the-air computation (AirComp) provides a promising way to support ultrafast aggregation of distributed data. However, its performance cannot be guaranteed in long-distance transmission due to the distortion induced by the channel fading and noise. To unleash the full potential of AirComp, this paper proposes to use a low-cost unmanned aerial vehicle (UAV) acting as a mobile base station to assist AirComp systems. Specifically, due to its controllable high-mobility and high-altitude, the UAV can move sufficiently close to the sensors to enable line-of-sight transmission and adaptively adjust all the links&#39; distances, thereby enhancing the signal magnitude alignment and noise suppression. Our goal is to minimize the time-averaging mean-square error for AirComp by jointly optimizing the UAV trajectory, the scaling factor at the UAV, and the transmit power at the sensors, under constraints on the UAV&#39;s predetermined locations and flying speed, sensors&#39; average and peak power limits. However, due to the highly coupled optimization variables and time-dependent constraints, the resulting problem is non-convex and challenging. We thus propose an efficient iterative algorithm by applying the block coordinate descent and successive convex optimization techniques. Simulation results verify the convergence of the proposed algorithm and demonstrate the performance gains and robustness of the proposed design compared with benchmarks.

preprint2021arXiv

Vacancy-Engineered Flat-Band Superconductivity in Holey Graphene

A bipartite lattice with chiral symmetry is known to host zero energy flat bands if the numbers of the two sublattices are different. We demonstrate that this mechanism of producing flat bands can be realized on graphene by introducing periodic vacancies. Using first-principle calculations, we elaborate that even though the pristine graphene does not exactly preserve chiral symmetry, this mechanism applied to holey graphene still produces single or multiple bands as narrow as ~0.5eV near the Fermi surface throughout the entire Brillouin zone. Moreover, this mechanism can combine with vacancy-engineered nonsymmorphic symmetry to produce band structures with coexisting flat bands and nodal lines. A weak coupling mean-field treatment suggests the stabilization of superconductivity by these vacancy-engineered narrow bands. In addition, superconductivity occurs predominantly on the majority sublattices, with an amplitude that increases with the number of narrow bands.

preprint2020arXiv

(Locally) Differentially Private Combinatorial Semi-Bandits

In this paper, we study Combinatorial Semi-Bandits (CSB) that is an extension of classic Multi-Armed Bandits (MAB) under Differential Privacy (DP) and stronger Local Differential Privacy (LDP) setting. Since the server receives more information from users in CSB, it usually causes additional dependence on the dimension of data, which is a notorious side-effect for privacy preserving learning. However for CSB under two common smoothness assumptions \cite{kveton2015tight,chen2016combinatorial}, we show it is possible to remove this side-effect. In detail, for $B_{\infty}$-bounded smooth CSB under either $\varepsilon$-LDP or $\varepsilon$-DP, we prove the optimal regret bound is $Θ(\frac{mB^2_{\infty}\ln T } {Δε^2})$ or $\tildeΘ(\frac{mB^2_{\infty}\ln T} { Δε})$ respectively, where $T$ is time period, $Δ$ is the gap of rewards and $m$ is the number of base arms, by proposing novel algorithms and matching lower bounds. For $B_1$-bounded smooth CSB under $\varepsilon$-DP, we also prove the optimal regret bound is $\tildeΘ(\frac{mKB^2_1\ln T} {Δε})$ with both upper bound and lower bound, where $K$ is the maximum number of feedback in each round. All above results nearly match corresponding non-private optimal rates, which imply there is no additional price for (locally) differentially private CSB in above common settings.

preprint2020arXiv

$X_0(2900)$ and $X_1(2900)$: hadronic molecules or compact tetraquarks

Very recently the LHCb Collaboration reported their observation of the first two fully open-flavor tetraquark states, the $X_0(2900)$ of $J^P = 0^+$ and the $X_1(2900)$ of $J^P = 1^-$. We study their possible interpretations using the method of QCD sum rules, paying special attention to an interesting feature of this experiment that the higher resonance $X_1(2900)$ has a width significantly larger than the lower one $X_0(2900)$. Our results suggest that the $X_0(2900)$ can be interpreted as the $S$-wave $D^{*-}K^{*+}$ molecule state of $J^P = 0^+$, and the $X_1(2900)$ can be interpreted as the $P$-wave $\bar c \bar s u d$ compact tetraquark state of $J^P = 1^-$. Mass predictions of their bottom partners are also given.

preprint2020arXiv

A Generic Framework and Library for Exploration of Small Multiples through Interactive Piling

Small multiples are miniature representations of visual information used generically across many domains. Handling large numbers of small multiples imposes challenges on many analytic tasks like inspection, comparison, navigation, or annotation. To address these challenges, we developed a framework and implemented a library called Piling.js for designing interactive piling interfaces. Based on the piling metaphor, such interfaces afford flexible organization, exploration, and comparison of large numbers of small multiples by interactively aggregating visual objects into piles. Based on a systematic analysis of previous work, we present a structured design space to guide the design of visual piling interfaces. To enable designers to efficiently build their own visual piling interfaces, Piling.js provides a declarative interface to avoid having to write low-level code and implements common aspects of the design space. An accompanying GUI additionally supports the dynamic configuration of the piling interface. We demonstrate the expressiveness of Piling.js with examples from machine learning, immunofluorescence microscopy, genomics, and public health.

preprint2020arXiv

A Learning-Driven Framework with Spatial Optimization For Surgical Suture Thread Reconstruction and Autonomous Grasping Under Multiple Topologies and Environmental Noises

Surgical knot tying is one of the most fundamental and important procedures in surgery, and a high-quality knot can significantly benefit the postoperative recovery of the patient. However, a longtime operation may easily cause fatigue to surgeons, especially during the tedious wound closure task. In this paper, we present a vision-based method to automate the suture thread grasping, which is a sub-task in surgical knot tying and an intermediate step between the stitching and looping manipulations. To achieve this goal, the acquisition of a suture&#39;s three-dimensional (3D) information is critical. Towards this objective, we adopt a transfer-learning strategy first to fine-tune a pre-trained model by learning the information from large legacy surgical data and images obtained by the on-site equipment. Thus, a robust suture segmentation can be achieved regardless of inherent environment noises. We further leverage a searching strategy with termination policies for a suture&#39;s sequence inference based on the analysis of multiple topologies. Exact results of the pixel-level sequence along a suture can be obtained, and they can be further applied for a 3D shape reconstruction using our optimized shortest path approach. The grasping point considering the suturing criterion can be ultimately acquired. Experiments regarding the suture 2D segmentation and ordering sequence inference under environmental noises were extensively evaluated. Results related to the automated grasping operation were demonstrated by simulations in V-REP and by robot experiments using Universal Robot (UR) together with the da Vinci Research Kit (dVRK) adopting our learning-driven framework.

preprint2020arXiv

A real-time multi-constraints obstacle avoidance method using LiDAR

Obstacle avoidance is one of the essential and indispensable functions for autonomous mobile robots. Most of the existing solutions are typically based on single condition constraint and cannot incorporate sensor data in a real-time manner, which often fail to respond to unexpected moving obstacles in dynamic unknown environments. In this paper, a novel real-time multi-constraints obstacle avoidance method using Light Detection and Ranging(LiDAR) is proposed, which is able to, based on the latest estimation of the robot pose and environment, find the sub-goal defined by a multi-constraints function within the explored region and plan a corresponding optimal trajectory at each time step iteratively, so that the robot approaches the goal over time. Meanwhile, at each time step, the improved Ant Colony Optimization(ACO) algorithm is also used to re-plan optimal paths from the latest robot pose to the latest defined sub-goal position. While ensuring convergence, planning in this method is done by repeated local optimizations, so that the latest sensor data from LiDAR and derived environment information can be fully utilized at each step until the robot reaches the desired position. This method facilitates real-time performance, also has little requirement on memory space or computational power due to its nature, thus our method has huge potentials to benefit small low-cost autonomous platforms. The method is evaluated against several existing technologies in both simulation and real-world experiments.

preprint2020arXiv

A Simultaneous Inference Procedure to Identify Subgroups from RCTs with Survival Outcomes: Application to Analysis of AMD Progression Studies

With the uptake of targeted therapies, instead of the &#34;one-fits-all&#34; approach, modern randomized clinical trials (RCTs) often aim to develop treatments that target a subgroup of patients. Motivated by analyzing the Age-Related Eye Disease Study (AREDS) data, a large RCT to study the efficacy of nutritional supplements in delaying the progression of an eye disease, age-related macular degeneration (AMD), we develop a simultaneous inference procedure to identify and infer subgroups with differential treatment efficacy in RCTs with survival outcome. Specifically, we formulate the multiple testing problem through contrasts and construct their simultaneous confidence intervals, which control both within- and across- marker multiplicity appropriately. Realistic simulations are conducted using real genotype data to evaluate the method performance under various scenarios. The method is then applied to AREDS to assess the efficacy of antioxidants and zinc combination in delaying AMD progression. Multiple gene regions including ESRRB-VASH1 on chromosome 14 have been identified with subgroups showing differential efficacy. We further validate our findings in an independent subsequent RCT, AREDS2, by discovering consistent differential treatment responses in the targeted and non-targeted subgroups been identified from AREDS. This simultaneous inference approach provides a step forward to confidently identify and infer subgroups in modern drug development.

preprint2020arXiv

A Systematic Literature Review of Modern Software Visualization

We report on the state-of-the-art of software visualization. To ensure reproducibility, we adopted the Systematic Literature Review methodology. That is, we analyzed 1440 entries from IEEE Xplore and ACM Digital Library databases. We selected 105 relevant full papers published in 2013-2019, which we classified based on the aspect of the software system that is supported (i.e., structure, behavior, and evolution). For each paper, we extracted main dimensions that characterize software visualizations, such as software engineering tasks, roles of users, information visualization techniques, and media used to display visualizations. We provide researchers in the field an overview of the state-of-the-art in software visualization and highlight research opportunities. We also help developers to identify suitable visualizations for their particular context by matching software visualizations to development concerns and concrete details to obtain available visualization tools.

preprint2020arXiv

Absence of equilibrium edge currents in theoretical models of topological insulators

The low energy sector of 2D and 3D topological insulators (TIs) exhibits propagating edge states, which has speculated the existence of equilibrium edge currents or edge spin currents. We demonstrate that if the low energy sector of TIs is regularized in a straightforward manner into a square or cubic lattice, then the current from the edge states is in fact canceled out exactly by that from the valence bands, rendering no edge current. This result serves as a warning that for any equilibrium property of topological insulators, the contribution from the valence bands should not be overlooked. In these regularized lattice model, there is a finite edge current only if the Dirac point of the edge states is shifted away from the chemical potential, for instance by doping, impurities, edge confining potential, surface band bending, or gate voltage. The edge current in small quantum dots as a function of the gate voltage is quantized, and the edge current can flow out of the gated region up to the decay length of the edge state.

preprint2020arXiv

Adaptive Expansion Bayesian Optimization for Unbounded Global Optimization

Bayesian optimization is normally performed within fixed variable bounds. In cases like hyperparameter tuning for machine learning algorithms, setting the variable bounds is not trivial. It is hard to guarantee that any fixed bounds will include the true global optimum. We propose a Bayesian optimization approach that only needs to specify an initial search space that does not necessarily include the global optimum, and expands the search space when necessary. However, over-exploration may occur during the search space expansion. Our method can adaptively balance exploration and exploitation in an expanding space. Results on a range of synthetic test functions and an MLP hyperparameter optimization task show that the proposed method out-performs or at least as good as the current state-of-the-art methods.

preprint2020arXiv

Adaptive Power and Rate Control for Real-time Status Updating over Fading Channels

Age of Information (AoI) has attracted much attention recently due to its capability of characterizing the freshness of information. To improve information freshness over fading channels, efficient scheduling methods are highly desired for wireless transmissions. However, due to the channel instability and arrival randomness, optimizing AoI is very challenging. In this paper, we are interested in the AoI-optimal transmissions with rate-adaptive transmission schemes in a buffer-aware system. More specifically, we utilize a probabilistic scheduling method to minimize the AoI while satisfying an average power constraint. By characterizing the probabilistic scheduling policy with a Constrained Markov Decision Process (CMDP), we formulate a Linear Programming (LP) problem. Further, a low complexity algorithm is presented to obtain the optimal scheduling policy, which is proved to belong to a set of semi-threshold-based policies. Numerical results verify the reduction in computational complexity and the optimality of semi-threshold-based policy, which indicates that we can achieve well real-time service with a low calculating complexity.

preprint2020arXiv

Airfoil Design Parameterization and Optimization using Bézier Generative Adversarial Networks

Global optimization of aerodynamic shapes usually requires a large number of expensive computational fluid dynamics simulations because of the high dimensionality of the design space. One approach to combat this problem is to reduce the design space dimension by obtaining a new representation. This requires a parametric function that compactly and sufficiently describes useful variation in shapes. We propose a deep generative model, Bézier-GAN, to parameterize aerodynamic designs by learning from shape variations in an existing database. The resulted new parameterization can accelerate design optimization convergence by improving the representation compactness while maintaining sufficient representation capacity. We use the airfoil design as an example to demonstrate the idea and analyze Bézier-GAN&#39;s representation capacity and compactness. Results show that Bézier-GAN both (1) learns smooth and realistic shape representations for a wide range of airfoils and (2) empirically accelerates optimization convergence by at least two times compared to state-of-the-art parameterization methods.

preprint2020arXiv

Asynchronous Stochastic Gradient Descent with Delay Compensation

With the fast development of deep learning, it has become common to learn big neural networks using massive training data. Asynchronous Stochastic Gradient Descent (ASGD) is widely adopted to fulfill this task for its efficiency, which is, however, known to suffer from the problem of delayed gradients. That is, when a local worker adds its gradient to the global model, the global model may have been updated by other workers and this gradient becomes &#34;delayed&#34;. We propose a novel technology to compensate this delay, so as to make the optimization behavior of ASGD closer to that of sequential SGD. This is achieved by leveraging Taylor expansion of the gradient function and efficient approximation to the Hessian matrix of the loss function. We call the new algorithm Delay Compensated ASGD (DC-ASGD). We evaluated the proposed algorithm on CIFAR-10 and ImageNet datasets, and the experimental results demonstrate that DC-ASGD outperforms both synchronous SGD and asynchronous SGD, and nearly approaches the performance of sequential SGD.

preprint2020arXiv

Bubble Storytelling with Automated Animation: A Brexit Hashtag Activism Case Study

Hashtag data are common and easy to acquire. Thus, they are widely used in studies and visual data storytelling. For example, a recent story by China Central Television Europe (CCTV Europe) depicts Brexit as a hashtag movement displayed on an animated bubble chart. However, creating such a story is usually laborious and tedious, because narrators have to switch between different tools and discuss with different collaborators. To reduce the burden, we develop a prototype system to help explore the bubbles&#39; movement by automatically inserting animations connected to the storytelling of the video creators and the interaction of viewers to those videos. We demonstrate the usability of our method through both use cases and a semi-structured user study.

preprint2020arXiv

ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit

Dance and music are two highly correlated artistic forms. Synthesizing dance motions has attracted much attention recently. Most previous works conduct music-to-dance synthesis via directly music to human skeleton keypoints mapping. Meanwhile, human choreographers design dance motions from music in a two-stage manner: they firstly devise multiple choreographic dance units (CAUs), each with a series of dance motions, and then arrange the CAU sequence according to the rhythm, melody and emotion of the music. Inspired by these, we systematically study such two-stage choreography approach and construct a dataset to incorporate such choreography knowledge. Based on the constructed dataset, we design a two-stage music-to-dance synthesis framework ChoreoNet to imitate human choreography procedure. Our framework firstly devises a CAU prediction model to learn the mapping relationship between music and CAU sequences. Afterwards, we devise a spatial-temporal inpainting model to convert the CAU sequence into continuous dance motions. Experimental results demonstrate that the proposed ChoreoNet outperforms baseline methods (0.622 in terms of CAU BLEU score and 1.59 in terms of user study score).

preprint2020arXiv

Combinatorial Pure Exploration of Dueling Bandit

In this paper, we study combinatorial pure exploration for dueling bandits (CPE-DB): we have multiple candidates for multiple positions as modeled by a bipartite graph, and in each round we sample a duel of two candidates on one position and observe who wins in the duel, with the goal of finding the best candidate-position matching with high probability after multiple rounds of samples. CPE-DB is an adaptation of the original combinatorial pure exploration for multi-armed bandit (CPE-MAB) problem to the dueling bandit setting. We consider both the Borda winner and the Condorcet winner cases. For Borda winner, we establish a reduction of the problem to the original CPE-MAB setting and design PAC and exact algorithms that achieve both the sample complexity similar to that in the CPE-MAB setting (which is nearly optimal for a subclass of problems) and polynomial running time per round. For Condorcet winner, we first design a fully polynomial time approximation scheme (FPTAS) for the offline problem of finding the Condorcet winner with known winning probabilities, and then use the FPTAS as an oracle to design a novel pure exploration algorithm ${\sf CAR}$-${\sf Cond}$ with sample complexity analysis. ${\sf CAR}$-${\sf Cond}$ is the first algorithm with polynomial running time per round for identifying the Condorcet winner in CPE-DB.

preprint2020arXiv

Computation Offloading in Heterogeneous Mobile Edge Computing with Energy Harvesting

Energy harvesting aided mobile edge computing (MEC) has gained much attention for its widespread application in the computation-intensive, latency-sensitive and energy-hungry scenario. In this paper, computation offloading from multi-MD to multi-MEC-s in heterogeneous MEC systems with energy harvesting is investigated from a game theoretic perspective. The objective is to minimize the average response time of an MD that consists of communication time, waiting time and processing time. M/G/1 queueing models are established for MDs and MEC-ss. The interference among MDs, the randomness in computation task generation, harvested energy arrival, wireless channel state, queueing at the MEC-s, and the power budget constraint of an MD are taken into consideration. A noncooperative computation offloading game is formulated. We give the definition and existence analysis of the Nash equilibrium (NE). Furthermore, we reconstruct the optimization problem of an MD. A 2-step decomposition is presented and performed. Thereby, we arrive at a one-dimensional search problem and a greatly shrunken sub-problem. We can obtain the optimal solution of the sub-problem by seeking the finite solutions of its Karush-Kuhn-Tucker (KKT) conditions. Thereafter, a distributive NE-orienting iterated best-response algorithm is designed. Simulations are carried out to illustrate the convergence performance and parameter effect.

preprint2020arXiv

ConceptExplorer: Visual Analysis of Concept Driftsin Multi-source Time-series Data

Time-series data is widely studied in various scenarios, like weather forecast, stock market, customer behavior analysis. To comprehensively learn about the dynamic environments, it is necessary to comprehend features from multiple data sources. This paper proposes a novel visual analysis approach for detecting and analyzing concept drifts from multi-sourced time-series. We propose a visual detection scheme for discovering concept drifts from multiple sourced time-series based on prediction models. We design a drift level index to depict the dynamics, and a consistency judgment model to justify whether the concept drifts from various sources are consistent. Our integrated visual interface, ConceptExplorer, facilitates visual exploration, extraction, understanding, and comparison of concepts and concept drifts from multi-source time-series data. We conduct three case studies and expert interviews to verify the effectiveness of our approach.

preprint2020arXiv

Constructing Basis Path Set by Eliminating Path Dependency

The way the basis path set works in neural network remains mysterious, and the generalization of newly appeared G-SGD algorithm to more practical network is hindered. The Basis Path Set Searching problem is formulated from the perspective of graph theory, to find the basis path set in a regular complicated neural network. Our paper aims to discover the underlying cause of path dependency between two independent substructures. Algorithm DEAH is designed to solve the Basis Path Set Searching problem by eliminating such path dependency. The path subdivision chain is proposed to effectively eliminate the path dependency inside the chain and between chains. The theoretical proofs and analysis of polynomial time complexity are presented. The paper therefore provides one methodology to find the basis path set in a more general neural network, which offers theoretical and algorithmic support for the application of G-SGD algorithm in more practical scenarios.

preprint2020arXiv

Controlling a Networked SIS Model via a Single Input over Undirected Graphs

This paper formulates and studies the problem of controlling a networked SIS model using a single input in which the network structure is described by a connected undirected graph. A necessary and sufficient condition on the values of curing and infection rates for the healthy state to be exponentially stable is obtained via the analysis of signed Laplacians when the control input is the curing budget of a single agent. In the case when the healthy state is stabilizable, an explicit expression for the minimum curing budget is provided. The utility of the algorithm is demonstrated using a simulation over a network of cities in the northeastern United States.

preprint2020arXiv

Convergence of Distributed Stochastic Variance Reduced Methods without Sampling Extra Data

Stochastic variance reduced methods have gained a lot of interest recently for empirical risk minimization due to its appealing run time complexity. When the data size is large and disjointly stored on different machines, it becomes imperative to distribute the implementation of such variance reduced methods. In this paper, we consider a general framework that directly distributes popular stochastic variance reduced methods in the master/slave model, by assigning outer loops to the parameter server, and inner loops to worker machines. This framework is natural and friendly to implement, but its theoretical convergence is not well understood. We obtain a comprehensive understanding of algorithmic convergence with respect to data homogeneity by measuring the smoothness of the discrepancy between the local and global loss functions. We establish the linear convergence of distributed versions of a family of stochastic variance reduced algorithms, including those using accelerated and recursive gradient updates, for minimizing strongly convex losses. Our theory captures how the convergence of distributed algorithms behaves as the number of machines and the size of local data vary. Furthermore, we show that when the data are less balanced, regularization can be used to ensure convergence at a slower rate. We also demonstrate that our analysis can be further extended to handle nonconvex loss functions.

preprint2020arXiv

Data-Driven Topology Optimization with Multiclass Microstructures using Latent Variable Gaussian Process

The data-driven approach is emerging as a promising method for the topological design of multiscale structures with greater efficiency. However, existing data-driven methods mostly focus on a single class of microstructures without considering multiple classes to accommodate spatially varying desired properties. The key challenge is the lack of an inherent ordering or distance measure between different classes of microstructures in meeting a range of properties. To overcome this hurdle, we extend the newly developed latent-variable Gaussian process (LVGP) models to create multi-response LVGP (MR-LVGP) models for the microstructure libraries of metamaterials, taking both qualitative microstructure concepts and quantitative microstructure design variables as mixed-variable inputs. The MR-LVGP model embeds the mixed variables into a continuous design space based on their collective effects on the responses, providing substantial insights into the interplay between different geometrical classes and material parameters of microstructures. With this model, we can easily obtain a continuous and differentiable transition between different microstructure concepts that can render gradient information for multiscale topology optimization. We demonstrate its benefits through multiscale topology optimization with aperiodic microstructures. Design examples reveal that considering multiclass microstructures can lead to improved performance due to the consistent load-transfer paths for micro- and macro-structures.

preprint2020arXiv

Deep Generative Modeling for Mechanistic-based Learning and Design of Metamaterial Systems

Metamaterials are emerging as a new paradigmatic material system to render unprecedented and tailorable properties for a wide variety of engineering applications. However, the inverse design of metamaterial and its multiscale system is challenging due to high-dimensional topological design space, multiple local optima, and high computational cost. To address these hurdles, we propose a novel data-driven metamaterial design framework based on deep generative modeling. A variational autoencoder (VAE) and a regressor for property prediction are simultaneously trained on a large metamaterial database to map complex microstructures into a low-dimensional, continuous, and organized latent space. We show in this study that the latent space of VAE provides a distance metric to measure shape similarity, enable interpolation between microstructures and encode meaningful patterns of variation in geometries and properties. Based on these insights, systematic data-driven methods are proposed for the design of microstructure, graded family, and multiscale system. For microstructure design, the tuning of mechanical properties and complex manipulations of microstructures are easily achieved by simple vector operations in the latent space. The vector operation is further extended to generate metamaterial families with a controlled gradation of mechanical properties by searching on a constructed graph model. For multiscale metamaterial systems design, a diverse set of microstructures can be rapidly generated using VAE for target properties at different locations and then assembled by an efficient graph-based optimization method to ensure compatibility between adjacent microstructures. We demonstrate our framework by designing both functionally graded and heterogeneous metamaterial systems that achieve desired distortion behaviors.

preprint2020arXiv

Detection of Fermi Arcs in Weyl Semimetals through Surface Negative Refraction

One of the main features of Weyl semimetals is the existence of Fermi arc surface states at their surface, which cannot be realized in pure two-dimensional systems in the absence of many-body interactions. Due to the gapless bulk of the semimetal, it is, however, challenging to observe clear signatures from the Fermi arc surface states. Here, we propose to detect such novel surface states via perfect negative refraction that occurs between two adjacent open surfaces with properly orientated Fermi arcs. Specifically, this phenomenon visibly manifests in non-local transport measurement, where the negative refraction generates a return peak in the real-space conductance. This provides a unique signature of the Fermi arc surface states. We discuss the appearance of this peak both in inversion and time-reversal symmetric Weyl semimetals, where the latter exhibits conductance oscillations due to multiple negative refraction scattering events.

preprint2020arXiv

DRGraph: An Efficient Graph Layout Algorithm for Large-scale Graphs by Dimensionality Reduction

Efficient layout of large-scale graphs remains a challenging problem: the force-directed and dimensionality reduction-based methods suffer from high overhead for graph distance and gradient computation. In this paper, we present a new graph layout algorithm, called DRGraph, that enhances the nonlinear dimensionality reduction process with three schemes: approximating graph distances by means of a sparse distance matrix, estimating the gradient by using the negative sampling technique, and accelerating the optimization process through a multi-level layout scheme. DRGraph achieves a linear complexity for the computation and memory consumption, and scales up to large-scale graphs with millions of nodes. Experimental results and comparisons with state-of-the-art graph layout methods demonstrate that DRGraph can generate visually comparable layouts with a faster running time and a lower memory requirement.

preprint2020arXiv

Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

Visible-infrared person re-identification (VI-ReID) is a challenging and essential task in night-time intelligent surveillance systems. Except for the intra-modality variance that RGB-RGB person re-identification mainly overcomes, VI-ReID suffers from additional inter-modality variance caused by the inherent heterogeneous gap. To solve the problem, we present a carefully designed dual Gaussian-based variational auto-encoder (DG-VAE), which disentangles an identity-discriminable and an identity-ambiguous cross-modality feature subspace, following a mixture-of-Gaussians (MoG) prior and a standard Gaussian distribution prior, respectively. Disentangling cross-modality identity-discriminable features leads to more robust retrieval for VI-ReID. To achieve efficient optimization like conventional VAE, we theoretically derive two variational inference terms for the MoG prior under the supervised setting, which not only restricts the identity-discriminable subspace so that the model explicitly handles the cross-modality intra-identity variance, but also enables the MoG distribution to avoid posterior collapse. Furthermore, we propose a triplet swap reconstruction (TSR) strategy to promote the above disentangling process. Extensive experiments demonstrate that our method outperforms state-of-the-art methods on two VI-ReID datasets.

preprint2020arXiv

Effects of Structural Distortions on the Electronic Structure of T-type Transition Metal Dichalcogenides

Single-layer transition metal dichalcogenides (TMDCs) can adopt two distinct structures corresponding to different coordination of the metal atoms. TMDCs adopting the T-type structure exhibit a rich and diverse set of phenomena, including charge density waves (CDW) in a $\sqrt{13}\times\sqrt{13}$ supercell pattern in TaS$_2$ and TaSe$_2$, and a possible excitonic insulating phase in TiSe$_2$. These properties make the T-TMDCs desirable components of layered heterostructure devices. In order to predict the emergent properties of combinations of different layered materials, one needs simple and accurate models for the constituent layers which can take into account potential effects of lattice mismatch, relaxation, strain, and structural distortion. Previous studies have developed ab initio tight-binding Hamiltonians for H-type TMDCs [arXiv:1709.07510]. Here we extend this work to include T-type TMDCs. We demonstrate the capabilities of our model using three example systems: a 1-dimensional sinusoidal ripple, the 2$\times$2 CDW in TiSe$_2$, and the $\sqrt{13}\times\sqrt{13}$ CDW in TaS$_2$. Using the technique of band unfolding we compare the electronic structure of the distorted crystals to the pristine band structure and find excellent agreement with direct DFT calculations, provided the magnitude of the distortions remains in the linear regime.

preprint2020arXiv

Efficient Approximation Algorithms for Adaptive Influence Maximization

Given a social network $G$ and an integer $k$, the influence maximization (IM) problem asks for a seed set $S$ of $k$ nodes from $G$ to maximize the expected number of nodes influenced via a propagation model. The majority of the existing algorithms for the IM problem are developed only under the non-adaptive setting, i.e., where all $k$ seed nodes are selected in one batch without observing how they influence other users in real world. In this paper, we study the adaptive IM problem where the $k$ seed nodes are selected in batches of equal size $b$, such that the $i$-th batch is identified after the actual influence results of the former $i-1$ batches are observed. In this paper, we propose the first practical algorithm for the adaptive IM problem that could provide the worst-case approximation guarantee of $1-\mathrm{e}^{ρ_b(\varepsilon-1)}$, where $ρ_b=1-(1-1/b)^b$ and $\varepsilon \in (0, 1)$ is a user-specified parameter. In particular, we propose a general framework AdaptGreedy that could be instantiated by any existing non-adaptive IM algorithms with expected approximation guarantee. Our approach is based on a novel randomized policy that is applicable to the general adaptive stochastic maximization problem, which may be of independent interest. In addition, we propose a novel non-adaptive IM algorithm called EPIC which not only provides strong expected approximation guarantee, but also presents superior performance compared with the existing IM algorithms. Meanwhile, we clarify some existing misunderstandings in recent work and shed light on further study of the adaptive IM problem. We conduct experiments on real social networks to evaluate our proposed algorithms comprehensively, and the experimental results strongly corroborate the superiorities and effectiveness of our approach.

preprint2020arXiv

Efficient decoy-states for the reference-frame-independent measurement-device-independent quantum key distribution

Reference-frame-independent measurement-device-independent quantum key distribution (RFI-MDI-QKD) is a novel protocol which eliminates all possible attacks on detector side and necessity of reference-frame alignment in source sides. However, its performance may degrade notably due to statistical fluctuations, since more parameters, e.g. yields and error rates for mismatched-basis events, must be accumulated to monitor the security. In this work, we find that the original decoy-states method estimates these yields over pessimistically since it ignores the potential relations between different bases. Through processing parameters of different bases jointly, the performance of RFI-MDI-QKD is greatly improved in terms of secret key rate and achievable distance when statistical fluctuations are considered. Our results pave an avenue towards practical RFI-MDI-QKD.

preprint2020arXiv

EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation

Generating qualitative responses has always been a challenge for human-computer dialogue systems. Existing dialogue systems generally derive from either retrieval-based or generative-based approaches, both of which have their own pros and cons. Despite the natural idea of an ensemble model of the two, existing ensemble methods only focused on leveraging one approach to enhance another, we argue however that they can be further mutually enhanced with a proper training strategy. In this paper, we propose ensembleGAN, an adversarial learning framework for enhancing a retrieval-generation ensemble model in open-domain conversation scenario. It consists of a language-model-like generator, a ranker generator, and one ranker discriminator. Aiming at generating responses that approximate the ground-truth and receive high ranking scores from the discriminator, the two generators learn to generate improved highly relevant responses and competitive unobserved candidates respectively, while the discriminative ranker is trained to identify true responses from adversarial ones, thus featuring the merits of both generator counterparts. The experimental results on a large short-text conversation data demonstrate the effectiveness of the ensembleGAN by the amelioration on both human and automatic evaluation metrics.

preprint2020arXiv

Exemplar-based Layout Fine-tuning for Node-link Diagrams

We design and evaluate a novel layout fine-tuning technique for node-link diagrams that facilitates exemplar-based adjustment of a group of substructures in batching mode. The key idea is to transfer user modifications on a local substructure to other substructures in the whole graph that are topologically similar to the exemplar. We first precompute a canonical representation for each substructure with node embedding techniques and then use it for on-the-fly substructure retrieval. We design and develop a light-weight interactive system to enable intuitive adjustment, modification transfer, and visual graph exploration. We also report some results of quantitative comparisons, three case studies, and a within-participant user study.

preprint2020arXiv

Federated Learning via Intelligent Reflecting Surface

Over-the-air computation (AirComp) based federated learning (FL) is capable of achieving fast model aggregation by exploiting the waveform superposition property of multiple access channels. However, the model aggregation performance is severely limited by the unfavorable wireless propagation channels. In this paper, we propose to leverage intelligent reflecting surface (IRS) to achieve fast yet reliable model aggregation for AirComp-based FL. To optimize the learning performance, we formulate an optimization problem that jointly optimizes the device selection, the aggregation beamformer at the base station (BS), and the phase shifts at the IRS to maximize the number of devices participating in the model aggregation of each communication round under certain mean-squared-error (MSE) requirements. To tackle the formulated highly-intractable problem, we propose a two-step optimization framework. Specifically, we induce the sparsity of device selection in the first step, followed by solving a series of MSE minimization problems to find the maximum feasible device set in the second step. We then propose an alternating optimization framework, supported by the difference-of-convex-functions programming algorithm for low-rank optimization, to efficiently design the aggregation beamformers at the BS and phase shifts at the IRS. Simulation results will demonstrate that our proposed algorithm and the deployment of an IRS can achieve a lower training loss and higher FL prediction accuracy than the baseline algorithms.

preprint2020arXiv

Ferroelectricity and multiferroicity in anti-Ruddlesden-Popper structures

Combining ferroelectricity with other properties such as visible light absorption or long-range magnetic order requires the discovery of new families of ferroelectric materials. Here, through the analysis of a high-throughput database of phonon band structures, we identify a new structural family of anti-Ruddlesden-Popper phases A$_4$X$_2$O (A=Ca, Sr, Ba, Eu, X=Sb, P, As, Bi) showing ferroelectric and anti-ferroelectric behaviors. The discovered ferroelectrics belong to the new class of hyperferroelectrics which polarize even under open-circuit boundary conditions. The polar distortion involves the movement of O anions against apical A cations and is driven by geometric effects resulting from internal chemical strains. Within this new structural family, we show that Eu$_4$Sb$_2$O combines coupled ferromagnetic and ferroelectric order at the same atomic site, a very rare occurrence in materials physics.

preprint2020arXiv

Finite-key analysis for twin-field quantum key distribution based on generalized operator dominance condition

Quantum key distribution (QKD) can help two distant peers to share secret key bits, whose security is guaranteed by the law of physics. In practice, the secret key rate of a QKD protocol is always lowered with the increasing of channel distance, which severely limits the applications of QKD. Recently, twin-field (TF) QKD has been proposed and intensively studied, since it can beat the rate-distance limit and greatly increase the achievable distance of QKD. Remarkalebly, K. Maeda et. al. proposed a simple finite-key analysis for TF-QKD based on operator dominance condition. Although they showed that their method is sufficient to beat the rate-distance limit, their operator dominance condition is not general, i.e. it can be only applied in three decoy states scenarios, which implies that its key rate cannot be increased by introducing more decoy states, and also cannot reach the asymptotic bound even in case of preparing infinite decoy states and optical pulses. Here, to bridge this gap, we propose an improved finite-key analysis of TF-QKD through devising new operator dominance condition. We show that by adding the number of decoy states, the secret key rate can be furtherly improved and approach the asymptotic bound. Our theory can be directly used in TF-QKD experiment to obtain higher secret key rate. Our results can be directly used in experiments to obtain higher key rates.

preprint2020arXiv

Fractional Fourier transforms on $L^p$ and applications

This paper is devoted to the $L^p(\mathbb R)$ theory of the fractional Fourier transform (FRFT) for $1\le p < 2$. In view of the special structure of the FRFT, we study FRFT properties of $L^1$ functions, via the introduction of a suitable chirp operator. However, in the $L^1(\mathbb{R})$ setting, problems of convergence arise even when basic manipulations of functions are performed. We overcome such issues and study the FRFT inversion problem via approximation by suitable means, such as the fractional Gauss and Abel means. We also obtain the regularity of fractional convolution and results on pointwise convergence of FRFT means. Finally we discuss $L^p$ multiplier results and a Littlewood-Paley theorem associated with FRFT.

preprint2020arXiv

Fully open-flavor tetraquark states $bc\bar{q}\bar{s}$ and $sc\bar{q}\bar{b}$ with $J^{P}=0^{+},1^{+}$

We have studied the masses for fully open-flavor tetraquark states $bc\bar{q}\bar{s}$ and $sc\bar{q}\bar{b}$ with quantum numbers $J^{P}=0^{+},1^{+}$. We systematically construct all diquark-antiquark interpolating currents and calculate the two-point correlation functions and spectral densities in the framework of QCD sum rule method. Our calculations show that the masses are about $7.1-7.2$ GeV for the $bc\bar{q}\bar{s}$ tetraquark states and $7.0-7.1$ GeV for the $sc\bar{q}\bar{b}$ tetraquarks. The masses of $bc\bar{q}\bar{s}$ tetraquarks are below the thresholds of $\bar{B}_{s}D$ and $\bar{B}_{s}^{*}D$ final states for the scalar and axial-vector channels respectively. The $sc\bar{q}\bar{b}$ tetraquark states with $J^{P}=1^{+}$ lie below the $B_{c}^{+}K^{*}$ and $B_{s}^{*}D$ thresholds. Such low masses for these possible tetraquark states indicate that they can only decay via weak interaction and thus are very narrow and stable.

preprint2020arXiv

G2L-Net: Global to Local Network for Real-time 6D Pose Estimation with Embedding Vector Features

In this paper, we propose a novel real-time 6D object pose estimation framework, named G2L-Net. Our network operates on point clouds from RGB-D detection in a divide-and-conquer fashion. Specifically, our network consists of three steps. First, we extract the coarse object point cloud from the RGB-D image by 2D detection. Second, we feed the coarse object point cloud to a translation localization network to perform 3D segmentation and object translation prediction. Third, via the predicted segmentation and translation, we transfer the fine object point cloud into a local canonical coordinate, in which we train a rotation localization network to estimate initial object rotation. In the third step, we define point-wise embedding vector features to capture viewpoint-aware information. To calculate more accurate rotation, we adopt a rotation residual estimator to estimate the residual between initial rotation and ground truth, which can boost initial pose estimation performance. Our proposed G2L-Net is real-time despite the fact multiple steps are stacked via the proposed coarse-to-fine framework. Extensive experiments on two benchmark datasets show that G2L-Net achieves state-of-the-art performance in terms of both accuracy and speed.

preprint2020arXiv

Global Context Aware Convolutions for 3D Point Cloud Understanding

Recent advances in deep learning for 3D point clouds have shown great promises in scene understanding tasks thanks to the introduction of convolution operators to consume 3D point clouds directly in a neural network. Point cloud data, however, could have arbitrary rotations, especially those acquired from 3D scanning. Recent works show that it is possible to design point cloud convolutions with rotation invariance property, but such methods generally do not perform as well as translation-invariant only convolution. We found that a key reason is that compared to point coordinates, rotation-invariant features consumed by point cloud convolution are not as distinctive. To address this problem, we propose a novel convolution operator that enhances feature distinction by integrating global context information from the input point cloud to the convolution. To this end, a globally weighted local reference frame is constructed in each point neighborhood in which the local point set is decomposed into bins. Anchor points are generated in each bin to represent global shape features. A convolution can then be performed to transform the points and anchor features into final rotation-invariant features. We conduct several experiments on point cloud classification, part segmentation, shape retrieval, and normals estimation to evaluate our convolution, which achieves state-of-the-art accuracy under challenging rotations.

preprint2020arXiv

GraphFederator: Federated Visual Analysis for Multi-party Graphs

This paper presents GraphFederator, a novel approach to construct joint representations of multi-party graphs and supports privacy-preserving visual analysis of graphs. Inspired by the concept of federated learning, we reformulate the analysis of multi-party graphs into a decentralization process. The new federation framework consists of a shared module that is responsible for joint modeling and analysis, and a set of local modules that run on respective graph data. Specifically, we propose a federated graph representation model (FGRM) that is learned from encrypted characteristics of multi-party graphs in local modules. We also design multiple visualization views for joint visualization, exploration, and analysis of multi-party graphs. Experimental results with two datasets demonstrate the effectiveness of our approach.

preprint2020arXiv

Influence Maximization with Spontaneous User Adoption

We incorporate self activation into influence propagation and propose the self-activation independent cascade (SAIC) model: nodes may be self activated besides being selected as seeds, and influence propagates from both selected seeds and self activated nodes. Self activation reflects the real-world scenarios such as people naturally share product recommendations with their friends even without marketing intervention. It also leads to two new forms of optimization problems: (a) {\em preemptive influence maximization (PIM)}, which aims to find $k$ nodes that, if self-activated, can reach the most number of nodes before other self-activated nodes; and (b) {\em boosted preemptive influence maximization (BPIM)}, which aims to select $k$ seeds that are guaranteed to be activated and can reach the most number of nodes before other self-activated nodes. We propose scalable algorithms for PIM and BPIM and prove that they achieve $1-\varepsilon$ approximation for PIM and $1-1/e-\varepsilon$ approximation for BPIM, for any $\varepsilon > 0$. Through extensive tests on real-world graphs, we demonstrate that our algorithms outperform the baseline algorithms significantly for the PIM problem in solution quality, and also outperform the baselines for BPIM when self-activation behaviors are non-uniform across nodes.

preprint2020arXiv

InGaAs/InP single-photon detectors with 60% detection efficiency at 1550 nm

InGaAs/InP single-photon detectors (SPDs) are widely used for near-infrared photon counting in practical applications. Photon detection efficiency (PDE) is one of the most important parameters for SPD characterization, and therefore increasing PDE consistently plays a central role in both industrial development and academic research. Here we present the implementation of high-frequency gating InGaAs/InP SPD with a PDE as high as 60% at 1550 nm. On one hand, we optimize the structure design and device fabrication of InGaAs/InP single-photon avalanche diode with an additional dielectric-metal reflection layer to relatively increase the absorption efficiency of incident photons by ~ 20%. On the other hand, we develop a monolithic readout circuit of weak avalanche extraction to minimize the parasitic capacitance for the suppression of the afterpulsing effect. With 1.25 GHz sine wave gating and optimized gate amplitude and operation temperature, the SPD is characterized to reach a PDE of 60% with a dark count rate (DCR) of 340 kcps. For practical use, given 3 kcps DCR as a reference the PDE reaches ~ 40% PDE with an afterpulse probability of 5.5%, which can significantly improve the performance for the near-infrared SPD based applications.

preprint2020arXiv

Learning Contextualized Sentence Representations for Document-Level Neural Machine Translation

Document-level machine translation incorporates inter-sentential dependencies into the translation of a source sentence. In this paper, we propose a new framework to model cross-sentence dependencies by training neural machine translation (NMT) to predict both the target translation and surrounding sentences of a source sentence. By enforcing the NMT model to predict source context, we want the model to learn &#34;contextualized&#34; source sentence representations that capture document-level dependencies on the source side. We further propose two different methods to learn and integrate such contextualized sentence embeddings into NMT: a joint training method that jointly trains an NMT model with the source context prediction model and a pre-training & fine-tuning method that pretrains the source context prediction model on a large-scale monolingual document corpus and then fine-tunes it with the NMT model. Experiments on Chinese-English and English-German translation show that both methods can substantially improve the translation quality over a strong document-level Transformer baseline.

preprint2020arXiv

Lightweight Mask R-CNN for Long-Range Wireless Power Transfer Systems

Resonant Beam Charging (RBC) is a wireless charging technology which supports multi-watt power transfer over meter-level distance. The features of safety, mobility and simultaneous charging capability enable RBC to charge multiple mobile devices safely at the same time. To detect the devices that need to be charged, a Mask R-CNN based dection model is proposed in previous work. However, considering the constraints of the RBC system, it&#39;s not easy to apply Mask R-CNN in lightweight hardware-embedded devices because of its heavy model and huge computation. Thus, we propose a machine learning detection approach which provides a lighter and faster model based on traditional Mask R-CNN. The proposed approach makes the object detection much easier to be transplanted on mobile devices and reduce the burden of hardware computation. By adjusting the structure of the backbone and the head part of Mask R-CNN, we reduce the average detection time from $1.02\mbox{s}$ per image to $0.6132\mbox{s}$, and reduce the model size from $245\mbox{MB}$ to $47.1\mbox{MB}$. The improved model is much more suitable for the application in the RBC system.

preprint2020arXiv

Medium modification of $γ$-jet fragmentation functions in Pb+Pb collisions at LHC

Coupled linear Boltzmann transport and hydrodynamic (CoLBT-hydro) model has been developed for simultaneous simulations of jet propagation and jet-induced medium excitation in heavy-ion collisions. Within this coupled approach, the final reconstructed jets in heavy-ion collisions include not only hadrons from the hadronization of medium modified jet shower partons from the linear Boltzmann transport (LBT) but also hadrons from the freeze-out of the jet-induced medium excitation in the hydrodynamic evolution of the bulk medium. Using the CoLBT-hydro model, we study medium modification of the fragmentation functions of $γ$-triggered jets in high-energy heavy-ion collisions at the Large Hadron Collider. The CoLBT-hydro model is shown to describe the experimental data not only on the suppression of leading hadrons within the jet cone at large momentum fraction $z_γ=p_T^h/p_T^γ$ relative to the transverse momentum of the trigger photon due to parton energy loss but also the enhancement of soft hadrons at small $z_γ$ and $z_{\rm jet}=p_T^h/p_T^{\rm jet}$ due to jet-induced medium excitation. There is no suppression of the fragmentation function, however, at large momentum fraction $z_{\rm jet}$ relative to the transverse momentum of the reconstructed jet due to trigger bias and medium modification of quark to gluon jet fraction. For jets whose final transverse momenta are comparable to or larger than that of the trigger photon, the trigger bias can lead to enhancement of the jet fragmentation function at large $z_{\rm jet}$.

preprint2020arXiv

MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning

Identifying bio-signals based-sleep stages requires time-consuming and tedious labor of skilled clinicians. Deep learning approaches have been introduced in order to challenge the automatic sleep stage classification conundrum. However, the difficulties can be posed in replacing the clinicians with the automatic system due to the differences in many aspects found in individual bio-signals, causing the inconsistency in the performance of the model on every incoming individual. Thus, we aim to explore the feasibility of using a novel approach, capable of assisting the clinicians and lessening the workload. We propose the transfer learning framework, entitled MetaSleepLearner, based on Model Agnostic Meta-Learning (MAML), in order to transfer the acquired sleep staging knowledge from a large dataset to new individual subjects. The framework was demonstrated to require the labelling of only a few sleep epochs by the clinicians and allow the remainder to be handled by the system. Layer-wise Relevance Propagation (LRP) was also applied to understand the learning course of our approach. In all acquired datasets, in comparison to the conventional approach, MetaSleepLearner achieved a range of 5.4\% to 17.7\% improvement with statistical difference in the mean of both approaches. The illustration of the model interpretation after the adaptation to each subject also confirmed that the performance was directed towards reasonable learning. MetaSleepLearner outperformed the conventional approaches as a result from the fine-tuning using the recordings of both healthy subjects and patients. This is the first work that investigated a non-conventional pre-training method, MAML, resulting in a possibility for human-machine collaboration in sleep stage classification and easing the burden of the clinicians in labelling the sleep stages through only several epochs rather than an entire recording.

preprint2020arXiv

MO-PaDGAN: Generating Diverse Designs with Multivariate Performance Enhancement

Deep generative models have proven useful for automatic design synthesis and design space exploration. However, they face three challenges when applied to engineering design: 1) generated designs lack diversity, 2) it is difficult to explicitly improve all the performance measures of generated designs, and 3) existing models generally do not generate high-performance novel designs, outside the domain of the training data. To address these challenges, we propose MO-PaDGAN, which contains a new Determinantal Point Processes based loss function for probabilistic modeling of diversity and performances. Through a real-world airfoil design example, we demonstrate that MO-PaDGAN expands the existing boundary of the design space towards high-performance regions and generates new designs with high diversity and performances exceeding training data.

preprint2020arXiv

Molecular Latent Space Simulators

Small integration time steps limit molecular dynamics (MD) simulations to millisecond time scales. Markov state models (MSMs) and equation-free approaches learn low-dimensional kinetic models from MD simulation data by performing configurational or dynamical coarse-graining of the state space. The learned kinetic models enable the efficient generation of dynamical trajectories over vastly longer time scales than are accessible by MD, but the discretization of configurational space and/or absence of a means to reconstruct molecular configurations precludes the generation of continuous all-atom molecular trajectories. We propose latent space simulators (LSS) to learn kinetic models for continuous all-atom simulation trajectories by training three deep learning networks to (i) learn the slow collective variables of the molecular system, (ii) propagate the system dynamics within this slow latent space, and (iii) generatively reconstruct molecular configurations. We demonstrate the approach in an application to Trp-cage miniprotein to produce novel ultra-long synthetic folding trajectories that accurately reproduce all-atom molecular structure, thermodynamics, and kinetics at six orders of magnitude lower cost than MD. The dramatically lower cost of trajectory generation enables greatly improved sampling and greatly reduced statistical uncertainties in estimated thermodynamic averages and kinetic rates.

preprint2020arXiv

New Directions in Distributed Deep Learning: Bringing the Network at Forefront of IoT Design

In this paper, we first highlight three major challenges to large-scale adoption of deep learning at the edge: (i) Hardware-constrained IoT devices, (ii) Data security and privacy in the IoT era, and (iii) Lack of network-aware deep learning algorithms for distributed inference across multiple IoT devices. We then provide a unified view targeting three research directions that naturally emerge from the above challenges: (1) Federated learning for training deep networks, (2) Data-independent deployment of learning algorithms, and (3) Communication-aware distributed inference. We believe that the above research directions need a network-centric approach to enable the edge intelligence and, therefore, fully exploit the true potential of IoT.

preprint2020arXiv

Normal family of meromorphic mappings and Big Picard&#39;s theorem

In this paper, we prove some results in normal family of meromorphic mappings intersecting with moving hypersurfaces. As some applications, we establish some results for normal mapping and extension of holomorphic mappings. A our result is strongly extended the Montel&#39;s normal criterion in the case several variables which is due to Tu in [Proc. Amer. Math. Soc. 127,1039-1049, 1999]. Our results are also strongly extended the results of Tu-Li in [Sci. China Ser. A. 48, 355-364, 2005] and [J. Math. Anal. Appl. 342, 629-638, 2006].

preprint2020arXiv

On Spectral Properties of Signed Laplacians with Connections to Eventual Positivity

Signed graphs have appeared in a broad variety of applications, ranging from social networks to biological networks, from distributed control and computation to power systems. In this paper, we investigate spectral properties of signed Laplacians for undirected signed graphs. We find conditions on the negative weights under which a signed Laplacian is positive semidefinite via the Kron reduction and multiport network theory. For signed Laplacians that are indefinite, we characterize their inertias with the same framework. Furthermore, we build connections between signed Laplacians, generalized M-matrices, and eventually exponentially positive matrices.

preprint2020arXiv

On the Energy Self-Sustainability of IoT via Distributed Compressed Sensing

This paper advocates the use of the distributed compressed sensing (DCS) paradigm to deploy energy harvesting (EH) Internet of Thing (IoT) devices for energy self-sustainability. We consider networks with signal/energy models that capture the fact that both the collected signals and the harvested energy of different devices can exhibit correlation. We provide theoretical analysis on the performance of both the classical compressive sensing (CS) approach and the proposed distributed CS (DCS)-based approach to data acquisition for EH IoT. Moreover, we perform an in-depth comparison of the proposed DCS-based approach against the distributed source coding (DSC) system. These performance characterizations and comparisons embody the effect of various system phenomena and parameters including signal correlation, EH correlation, network size, and energy availability level. Our results unveil that, the proposed approach offers significant increase in data gathering capability with respect to the CS-based approach, and offers a substantial reduction of the mean-squared error distortion with respect to the DSC system.

preprint2020arXiv

Optimization from Structured Samples for Coverage Functions

We revisit the optimization from samples (OPS) model, which studies the problem of optimizing objective functions directly from the sample data. Previous results showed that we cannot obtain a constant approximation ratio for the maximum coverage problem using polynomially many independent samples of the form $\{S_i, f(S_i)\}_{i=1}^t$ (Balkanski et al., 2017), even if coverage functions are $(1 - ε)$-PMAC learnable using these samples (Badanidiyuru et al., 2012), which means most of the function values can be approximately learned very well with high probability. In this work, to circumvent the impossibility result of OPS, we propose a stronger model called optimization from structured samples (OPSS) for coverage functions, where the data samples encode the structural information of the functions. We show that under three general assumptions on the sample distributions, we can design efficient OPSS algorithms that achieve a constant approximation for the maximum coverage problem. We further prove a constant lower bound under these assumptions, which is tight when not considering computational efficiency. Moreover, we also show that if we remove any one of the three assumptions, OPSS for the maximum coverage problem has no constant approximation.

preprint2020arXiv

Optimized protocol for twin-field quantum key distribution

Twin-field quantum key distribution (TF-QKD) and its variant protocols are highly attractive due to the advantage of overcoming the rate-loss limit for secret key rates of point-to-point QKD protocols. For variations of TF-QKD, the key point to ensure security is switching randomly between a code mode and a test mode. Among all TF-QKD protocols, their code modes are very different, e.g. modulating continuous phases, modulating only two opposite phases, and sending or not sending signal pulses. Here we show that, by discretizing the number of global phases in the code mode, we can give a unified view on the first two types of TF-QKD protocols, and demonstrate that increasing the number of discrete phases extends the achievable distance, and as a trade-off, lowers the secret key rate at short distances due to the phase post-selection.

preprint2020arXiv

PassVizor: Toward Better Understanding of the Dynamics of Soccer Passes

In soccer, passing is the most frequent interaction between players and plays a significant role in creating scoring chances. Experts are interested in analyzing players&#39; passing behavior to learn passing tactics, i.e., how players build up an attack with passing. Various approaches have been proposed to facilitate the analysis of passing tactics. However, the dynamic changes of a team&#39;s employed tactics over a match have not been comprehensively investigated. To address the problem, we closely collaborate with domain experts and characterize requirements to analyze the dynamic changes of a team&#39;s passing tactics. To characterize the passing tactic employed for each attack, we propose a topic-based approach that provides a high-level abstraction of complex passing behaviors. Based on the model, we propose a glyph-based design to reveal the multi-variate information of passing tactics within different phases of attacks, including player identity, spatial context, and formation. We further design and develop PassVizor, a visual analytics system, to support the comprehensive analysis of passing dynamics. With the system, users can detect the changing patterns of passing tactics and examine the detailed passing process for evaluating passing tactics. We invite experts to conduct analysis with PassVizor and demonstrate the usability of the system through an expert interview.

preprint2020arXiv

Persistent currents and spin torque caused by percolated quantum spin Hall state

Motivated by recent experiments, we investigate the quantum spin Hall state in 2D topological insulator/ferromagnetic metal planar junctions by means of a tight-binding model and linear response theory. We demonstrate that whether the edge state Dirac cone is submerged into the ferromagnetic subbands and the direction of the magnetization dramatically affect (i) how the edge state percolates into the ferromagnet, and (ii) the spin-momentum locking of the edge state. Laminar flows of room temperature persistent charge and spin currents near the interface are uncovered. In addition, the current-induced spin polarization at the edge of the 2D topological insulator is found to be dramatically enhanced near the impurities. The current-induced spin polarization in the ferromagnet is mainly polarized in the out-of-plane direction ${\hat{\bf z}}$, rendering a current-induced spin torque that is predominantly field-like $\propto {\bf S}\times{\hat{\bf z}}$.

preprint2020arXiv

Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling

This paper proposes a learning algorithm to find a scheduling policy that achieves an optimal delay-power trade-off in communication systems. Reinforcement learning (RL) is used to minimize the expected latency for a given energy constraint where the environments such as traffic arrival rates or channel conditions can change over time. For this purpose, this problem is formulated as an infinite-horizon Markov Decision Process (MDP) with constraints. To handle the constrained optimization problem, we adopt the Lagrangian relaxation technique to solve it. Then, we propose a variant of Q-learning, Q-greedyUCB that combines Q-learning for \emph{average} reward algorithm and Upper Confidence Bound (UCB) policy to solve this decision-making problem. We prove that the Q-greedyUCB algorithm is convergent through mathematical analysis. Simulation results show that Q-greedyUCB finds an optimal scheduling strategy, and is more efficient than Q-learning with the $\varepsilon$-greedy and Average-payoff RL algorithm in terms of the cumulative reward (i.e., the weighted sum of delay and energy) and the convergence speed. We also show that our algorithm can reduce the regret by up to 12% compared to the Q-learning with the $\varepsilon$-greedy and Average-payoff RL algorithm.

preprint2020arXiv

Quantum key distribution with dissipative Kerr soliton generated by on-chip microresonators

Quantum key distribution (QKD) can distribute symmetric key bits between remote legitimate users with the guarantee of quantum mechanics principles. For practical applications, the compact and robust photonic components for QKD are essential, and there are increasing attention to integrate the source, detector and modulators on a photonic chip. However, the massive and parallel QKD based on wavelength multiplexing are still challenge, due to the limited coherent light sources on the chip. Here, we introduce the Kerr dissipative soliton in a microresonator, which provides the locked coherent frequency comb with 49GHz frequency spacing, for QKD. We demonstrate the parallel QKD by demulplexing the coherent comb lines form the soliton, and showing the potential of Gbps secret key rate if the hundreds of channels covering C and L bands are fully exploited. The demonstrated soliton based QKD architecture are compatible with the efforts of quantum photonic integrated circuits, which are compact, robust and low-cost, and provides a competitive platform of practical QKD chip.

preprint2020arXiv

Reconfigurable Intelligent Surface Assisted Massive MIMO with Antenna Selection

Antenna selection is capable of reducing the hardware complexity of massive multiple-input multiple-output (MIMO) networks at the cost of certain performance degradation. Reconfigurable intelligent surface (RIS) has emerged as a cost-effective technique that can enhance the spectrum-efficiency of wireless networks by reconfiguring the propagation environment. By employing RIS to compensate the performance loss due to antenna selection, in this paper we propose a new network architecture, i.e., RIS-assisted massive MIMO system with antenna selection, to enhance the system performance while enjoying a low hardware cost. This is achieved by maximizing the channel capacity via joint antenna selection and passive beamforming while taking into account the cardinality constraint of active antennas and the unit-modulus constraints of all RIS elements. However, the formulated problem turns out to be highly intractable due to the non-convex constraints and coupled optimization variables, for which an alternating optimization framework is provided, yielding antenna selection and passive beamforming subproblems. The computationally efficient submodular optimization algorithms are developed to solve the antenna selection subproblem under different channel state information assumptions. The iterative algorithms based on block coordinate descent are further proposed for the passive beamforming design by exploiting the unique problem structures. Experimental results will demonstrate the algorithmic advantages and desirable performance of the proposed algorithms for RIS- assisted massive MIMO systems with antenna selection.

preprint2020arXiv

Revisiting the Modifiable Areal Unit Problem in Deep Traffic Prediction with Visual Analytics

Deep learning methods are being increasingly used for urban traffic prediction where spatiotemporal traffic data is aggregated into sequentially organized matrices that are then fed into convolution-based residual neural networks. However, the widely known modifiable areal unit problem within such aggregation processes can lead to perturbations in the network inputs. This issue can significantly destabilize the feature embeddings and the predictions, rendering deep networks much less useful for the experts. This paper approaches this challenge by leveraging unit visualization techniques that enable the investigation of many-to-many relationships between dynamically varied multi-scalar aggregations of urban traffic data and neural network predictions. Through regular exchanges with a domain expert, we design and develop a visual analytics solution that integrates 1) a Bivariate Map equipped with an advanced bivariate colormap to simultaneously depict input traffic and prediction errors across space, 2) a Morans I Scatterplot that provides local indicators of spatial association analysis, and 3) a Multi-scale Attribution View that arranges non-linear dot plots in a tree layout to promote model analysis and comparison across scales. We evaluate our approach through a series of case studies involving a real-world dataset of Shenzhen taxi trips, and through interviews with domain experts. We observe that geographical scale variations have important impact on prediction performances, and interactive visual exploration of dynamically varying inputs and outputs benefit experts in the development of deep traffic prediction models.

preprint2020arXiv

SMAP: A Joint Dimensionality Reduction Scheme for Secure Multi-Party Visualization

Nowadays, as data becomes increasingly complex and distributed, data analyses often involve several related datasets that are stored on different servers and probably owned by different stakeholders. While there is an emerging need to provide these stakeholders with a full picture of their data under a global context, conventional visual analytical methods, such as dimensionality reduction, could expose data privacy when multi-party datasets are fused into a single site to build point-level relationships. In this paper, we reformulate the conventional t-SNE method from the single-site mode into a secure distributed infrastructure. We present a secure multi-party scheme for joint t-SNE computation, which can minimize the risk of data leakage. Aggregated visualization can be optionally employed to hide disclosure of point-level relationships. We build a prototype system based on our method, SMAP, to support the organization, computation, and exploration of secure joint embedding. We demonstrate the effectiveness of our approach with three case studies, one of which is based on the deployment of our system in real-world applications.

preprint2020arXiv

Solving Sparse Linear Inverse Problems in Communication Systems: A Deep Learning Approach With Adaptive Depth

Sparse signal recovery problems from noisy linear measurements appear in many areas of wireless communications. In recent years, deep learning (DL) based approaches have attracted interests of researchers to solve the sparse linear inverse problem by unfolding iterative algorithms as neural networks. Typically, research concerning DL assume a fixed number of network layers. However, it ignores a key character in traditional iterative algorithms, where the number of iterations required for convergence changes with varying sparsity levels. By investigating on the projected gradient descent, we unveil the drawbacks of the existing DL methods with fixed depth. Then we propose an end-to-end trainable DL architecture, which involves an extra halting score at each layer. Therefore, the proposed method learns how many layers to execute to emit an output, and the network depth is dynamically adjusted for each task in the inference phase. We conduct experiments using both synthetic data and applications including random access in massive MTC and massive MIMO channel estimation, and the results demonstrate the improved efficiency for the proposed approach.

preprint2020arXiv

The influence of the Insight-HXMT/LE time response on timing analysis

LE is the low energy telescope of Insight-HXMT. It uses swept charge devices (SCDs) to detect soft X-ray photons. The time response of LE is caused by the structure of SCDs. With theoretical analysis and Monte Carlo simulations we discuss the influence of LE time response (LTR) on the timing analysis from three aspects: the power spectral density, the pulse profile and the time lag. After the LTR, the value of power spectral density monotonously decreases with the increasing frequency. The power spectral density of a sinusoidal signal reduces by a half at frequency 536 Hz. The corresponding frequency for QPO signals is 458 Hz. The Root mean square (RMS) of QPOs holds the similar behaviour. After the LTR, the centroid frequency and full width at half maxima (FWHM) of QPOs signals do not change. The LTR reduces the RMS of pulse profiles and shifts the pulse phase. In the time domain, the LTR only reduces the peak value of the crosscorrelation function while it does not change the peak position. Thus it will not affect the result of the time lag. When considering the time lag obtained from two instruments and one among them is LE, a 1.18 ms lag is expected caused by the LTR. The time lag calculated in the frequency domain is the same as that in the time domain.

preprint2020arXiv

Weighted estimates for the bilinear maximal operator on filtered measure spaces

Assuming the bilinear reverse Holder&#39;s condition, we character weighted inequalities for the bilinear maximal operator on filtered measure spaces. We also obtain Hytonen-Perez type weighted estimates for the bilinear maximal operator. Our approaches are mainly based on the new construction of bilinear versions of principal sets and the new Carleson embedding theorem on filtered measure spaces. In particular, we find a new property of the construction and we call it the conditional sparsity of principal sets.

preprint2019arXiv

Designing Anisotropic Microstructures with Spectral Density Function

Materials&#39; microstructure strongly influences its performance and is thus a critical aspect in design of functional materials. Previous efforts on microstructure mediated design mostly assume isotropy, which is not ideal when material performance is dependent on an underlying transport phenomenon. In this article, we propose an anisotropic microstructure design strategy that leverages Spectral Density Function (SDF) for rapid reconstruction of high resolution, two phase, isotropic or anisotropic microstructures in 2D and 3D. We demonstrate that SDF microstructure representation provides an intuitive method for quantifying anisotropy through a dimensionless scalar variable termed anisotropy index. The computational efficiency and low dimensional microstructure representation enabled by our method is demonstrated through an active layer design case study for Bulk Heterojunction Organic Photovoltaic Cells (OPVCs). Results indicate that optimized design, exhibiting strong anisotropy, outperforms isotropic active layer designs. Further, we show that Cross-sectional Scanning Tunneling Microscopy and Spectroscopy (XSTM/S) is as an effective tool for characterization of anisotropic microstructures.

preprint2019arXiv

Electron-Hole Interference in an Inverted-Band Semiconductor Bilayer

Electron optics in the solid state promises new functionality in electronics through the possibility of realizing micrometer-sized interferometers, lenses, collimators and beam splitters that manipulate electrons instead of light. Until now, however, such functionality has been demonstrated exclusively in one-dimensional devices, such as in nanotubes, and in graphene-based devices operating with p-n junctions. In this work, we describe a novel mechanism for realizing electron optics in two dimensions. By studying a two-dimensional Fabry-Pérot interferometer based on a resonant cavity formed in an InAs/GaSb double quantum well using p-n junctions, we establish that electron-hole hybridization in band-inverted systems can facilitate coherent interference. With this discovery, we expand the field of electron optics to encompass materials that exhibit band inversion and hybridization, with the promise to surpass the performance of current state-of-the-art devices.

preprint2019arXiv

Elongated Nano Domains and Molecular Intermixing induced Doping in Organic Photovoltaic Active Layers with Electric Field Treatment

The effects of the electric-field-assisted annealing on the bulk heterojunction nano-morphology in the P3HT/PCBM active layer of the organic photovoltaic cells (OPVCs) are presented here. It was widely accepted that the electric-field-assisted annealing will facilitate the P3HT, the polar polymer, to be better crystalline to enhance the charge mobility, hence the improvement of the OPVC performance. The influences on the nano-morphology of the electron donor and accepter domains are not well understood. Here, using the cross-sectional scanning tunneling microscopy and spectroscopy (XSTM/S), the electric-field-assisted annealing treatment is found to influence the molecular domains to be elongated with the orientation near the direction of the external electric field. The elongation of the molecular domains is believed to facilitate the domain percolation, which causes higher charge mobility, hence the higher short-circuit current density (Jsc). On the other hand, it was also observed that the electronic properties of the P3HT-rich and PCBM-rich domains in the electric-field-assisted annealed samples showed smaller energy band gaps and smaller molecular orbital offset between the two domains, which is argued to decrease the open circuit voltage (Voc) and negatively impact the OPVC performance. Based on the X-ray diffraction (XRD) and small angle X-ray scattering (SAXS) results, the altered electronic properties are argued to be due to the molecular intermixing induced doping effects. These results point out competing factors affecting the OPVC performance with the electric-field-assisted annealing treatment.

preprint2019arXiv

Fabrication and performance of AC-coupled LGADs

Detectors that can simultaneously provide fine time and spatial resolution have attracted wide-spread interest for applications in several fields such as high-energy and nuclear physics as well as in low-energy electron detection, photon science, photonics and imaging. Low-Gain Avalanche Diodes (LGADs), being fabricated on thin silicon substrates and featuring a charge gain of up to 100, exhibit excellent timing performance. Since pads much larger than the substrate thickness are necessary to achieve a spatially uniform multiplication, a fine pad pixelation is difficult. To overcome this limitation, the AC-coupled LGAD approach was introduced. In this type of device, metal electrodes are placed over an insulator at a fine pitch, and signals are capacitively induced on these electrodes. At Brookhaven National Laboratory, we have designed and fabricated prototypes of AC-coupled LGAD sensors. The performance of small test structures with different particle beams from radioactive sources are shown.

preprint2019arXiv

Field-Effect Transistor based on Surface Negative Refraction in Weyl Nanowires

Weyl semimetals are characterized by their bulk Weyl points -- conical band touching points that carry a topological monopole charge -- and Fermi arc states that span between the Weyl points on the surface of the material. Recently, significant progress has been made towards understanding and measuring the physical properties of Weyl semimetals. Yet, potential applications remain relatively sparse. Here, we propose Weyl semimetal nanowires as field-effect transistors, dubbed WEYLFETs. Specifically, applying gradient gate voltage along the nanowire, an electrical field is generated that effectively tilts the open surfaces, thus, varying the relative orientation between Fermi arcs on different surfaces. As a result, perfect negative refraction between adjacent surfaces can occur and longitudinal conductance along the wire is suppressed. The WEYLFET offers a high on/off ratio with low power consumption. Adverse effects due to dispersive Fermi arcs and surface disorder are studied.

preprint2019arXiv

Generating quantum multi-criticality in topological insulators by periodic driving

We demonstrate that the prototypical two-dimensional Chern insulator hosts exotic quantum multi-criticality in the presence of an appropriate periodic driving: a linear Dirac-like transition coexists with a nodal loop-like transition caused by emerging symmetries. The existence of multiple universality classes and scaling laws can be unambiguously captured by a single renormalization group approach based on the stroboscopic Floquet Hamiltonian, regardless of whether the topological transition is associated with the anomalous edge modes or not.

preprint2019arXiv

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

As China&#39;s first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed as Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progresses, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results.

preprint2019arXiv

Searching for Neutrino-less Double Beta Decay of $^{136}$Xe with PandaX-II Liquid Xenon Detector

We report the Neutrino-less Double Beta Decay (NLDBD) search results from PandaX-II dual-phase liquid xenon time projection chamber. The total live time used in this analysis is 403.1 days from June 2016 to August 2018. With NLDBD-optimized event selection criteria, we obtain a fiducial mass of 219 kg of natural xenon. The accumulated xenon exposure is 242 kg$\cdot$yr, or equivalently 22.2 kg$\cdot$yr of $^{136}$Xe exposure. At the region around $^{136}$Xe decay Q-value of 2458 keV, the energy resolution of PandaX-II is 4.2%. We find no evidence of NLDBD in PandaX-II and establish a lower limit for decay half-life of 2.4 $ \times 10^{23} $ yr at the 90% confidence level, which corresponds to an effective Majorana neutrino mass $m_{ββ} < (1.3 - 3.5)$ eV. This is the first NLDBD result reported from a dual-phase xenon experiment.

preprint2019arXiv

Unifying topological phase transitions in noninteracting, interacting, and periodically driven systems

Topological phase transitions track changes in topological properties of a system and occur in real materials as well as quantum engineered systems, all of which differ greatly in terms of dimensionality, symmetries, interactions, and driving, and hence require a variety of techniques and concepts to describe their topological properties. For instance, depending on the system, topology may be accessed from single-particle Bloch wave functions, Green&#39;s functions, or many-body wave functions. We demonstrate that despite this diversity, all topological phase transitions display a universal feature: namely, a divergence of the curvature function that composes the topological invariant at the critical point. This feature can be exploited via a renormalization-group-like methodology to describe topological phase transitions. This approach serves to extend notions of correlation function, critical exponents, scaling laws and universality classes used in Landau theory to characterize topological phase transitions in a unified manner.

preprint2017arXiv

An empirical behavioural order-driven model with price limit rules

We develop an empirical behavioural order-driven (EBOD) model, which consists of an order placement process and an order cancellation process. Price limit rules are introduced in the definition of relative price. The order placement process is determined by several empirical regularities: the long memory in order directions, the long memory in relative prices, the asymmetric distribution of relative prices, and the nonlinear dependence of the average order size and its standard deviation on the relative price. Order cancellation follows a Poisson process with the arrival rate determined from real data and the cancelled order is determined according to the empirical distributions of relative price level and relative position at the same price level. All these ingredients of the model are derived based on the empirical microscopic regularities in the order flows of stocks on the Shenzhen Stock Exchange. The model is able to produce the main stylized facts in real markets. Computational experiments uncover that asymmetric setting of price limits will cause the stock price diverging exponentially when the up price limit is higher than the down price limit and vanishing vice versus. We also find that asymmetric price limits have influences on stylized facts. Our EBOD model provides a suitable computational experiment platform for academics, market participants and policy makers.

preprint2017arXiv

Proactive Caching for Energy-Efficiency in Wireless Networks: A Markov Decision Process Approach

Content caching in wireless networks provides a substantial opportunity to trade off low cost memory storage with energy consumption, yet finding the optimal causal policy with low computational complexity remains a challenge. This paper models the Joint Pushing and Caching (JPC) problem as a Markov Decision Process (MDP) and provides a solution to determine the optimal randomized policy. A novel approach to decouple the influence from buffer occupancy and user requests is proposed to turn the high-dimensional optimization problem into three low-dimensional ones. Furthermore, a non-iterative algorithm to solve one of the sub-problems is presented, exploiting a structural property we found as \textit{generalized monotonicity}, and hence significantly reduces the computational complexity. The result attains close performance in comparison with theoretical bounds from non-practical policies, while benefiting from higher time efficiency than the unadapted MDP solution.

preprint2013arXiv

Projection Design For Statistical Compressive Sensing: A Tight Frame Based Approach

In this paper, we develop a framework to design sensing matrices for compressive sensing applications that lead to good mean squared error (MSE) performance subject to sensing cost constraints. By capitalizing on the MSE of the oracle estimator, whose performance has been shown to act as a benchmark to the performance of standard sparse recovery algorithms, we use the fact that a Parseval tight frame is the closest design - in the Frobenius norm sense - to the solution of a convex relaxation of the optimization problem that relates to the minimization of the MSE of the oracle estimator with respect to the equivalent sensing matrix, subject to sensing energy constraints. Based on this result, we then propose two sensing matrix designs that exhibit two key properties: i) the designs are closed form rather than iterative; ii) the designs exhibit superior performance in relation to other designs in the literature, which is revealed by our numerical investigation in various scenarios with different sparse recovery algorithms including basis pursuit de-noise (BPDN), the Dantzig selector and orthogonal matching pursuit (OMP).