Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
40works
0followers
29topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

40 published item(s)

preprint2026arXiv

When Emotion Becomes Trigger: Emotion-style dynamic Backdoor Attack Parasitising Large Language Models

Backdoor vulnerabilities widely exist in the fine-tuning of large language models(LLMs). Most backdoor poisoning methods operate mainly at the token level and lack deeper semantic manipulation, which limits stealthiness. In addition, Prior attacks rely on a single fixed trigger to induce harmful outputs. Such static triggers are easy to detect, and clean fine-tuning can weaken the trigger-target association. Through causal validation, we observe that emotion is not directly linked to individual words, but functions as an overall stylistic factor through tone. In the representation space of LLM, emotion can be decoupled from semantics, forming distinct cluster from the original neutral text. Therefore, we consider the emotional factor as the backdoor trigger to propose a pparasitic emotion-style dynamic backdoor attack, Paraesthesia. By mixing samples with the emotional trigger into clean data and then fine-tuning the model, the model is able to generate the predefined attack response when encountering emotional inputs during the inference stage. Paraesthesia includes two the quantification and rewriting of emotional styles. We evaluate the effectiveness of our method on instruction-following generation and classification tasks. The experimental results show that Paraesthesia achieves an attack success rate of around 99\% across both task types and four different models, while maintaining the clean utility of the models.

preprint2023arXiv

The optical conductivity of the 2D $t-J$ model and the origin of electron incoherence in the high-T$_{c}$ cuprate superconductors: a variational study

Understanding the origin of electron incoherence is the first step toward a theoretical description of the non-Fermi liquid behavior of the high-T$_{c}$ cuprate superconductors. Such electron incoherence manifests itself most evidently in the non-Drude behavior of the optical response of the system and the anomalous density fluctuation behavior in the long wave length limit. The spectral weight transfer related to such dissipative response, which is absent in conventional Fermi liquid metal, has direct consequence on the dc transport property of the system in the normal state and the superfluid stiffness in the superconducting state. It is found that such electron incoherence remains significant even in the clean limit and at low temperature and thus must be attributed to the strong electron correlation effect in the cuprate superconductors. Here we study such an intrinsic effect in the 2D $t-J$ model through the variational calculation of its optical conductivity $σ(ω)$. We assume a resonating valence bond ground state as our starting point and find that a significant portion of the total optical spectral weight remains incoherent throughout the phase diagram. The optical absorption is found to extend all the way to an energy of the order of the bare band width. We find that both the total optical weight $\bar{K}$ and the integrated incoherent optical weight $I$ increase monotonically with doping, with their ratio $R_{incoh}=I/\bar{K}$ decreasing monotonically with doping. Our results indicate that the majority part of electron incoherence in the 2D $t-J$ model can be attributed to the electron fractionalization mechanism assumed in such a treatment. We also find that the Drude weight deduced from $D=\bar{K}-I$ scales linearly with hole doping, without any sign of a non-monotonic behavior in the overdoped regime.

preprint2022arXiv

Animating collider processes with Event-time-frame Format

High Energy Physics processes, such as hard scattering, parton shower, and hadronization, occur at colliders around the world, e.g., the Large Hadron Collider in Europe. The various steps are also components within corresponding Monte-Carlo simulations. They are usually considered to occur in an instant and displayed in MC simulations as intricate paths hard-coded with the HepMC format. We recently developed a framework to convert HEP event records into online 3D animations, aiming for visual Monte-Carlo studies and science popularization, where the most difficult parts are about designing an event timeline and particles' movement. As a by-product, we propose here an event-time-frame format for animation data exchanging and persistence, which is potentially helpful in other visualization works. The code is maintained at https://github.com/lyazj/hepani, and the web service is available at https://ppnp.pku.edu.cn/hepani/index.html.

preprint2022arXiv

Cross-speaker emotion disentangling and transfer for end-to-end speech synthesis

The cross-speaker emotion transfer task in text-to-speech (TTS) synthesis particularly aims to synthesize speech for a target speaker with the emotion transferred from reference speech recorded by another (source) speaker. During the emotion transfer process, the identity information of the source speaker could also affect the synthesized results, resulting in the issue of speaker leakage. This paper proposes a new method with the aim to synthesize controllable emotional expressive speech and meanwhile maintain the target speaker's identity in the cross-speaker emotion TTS task. The proposed method is a Tacotron2-based framework with emotion embedding as the conditioning variable to provide emotion information. Two emotion disentangling modules are contained in our method to 1) get speaker-irrelevant and emotion-discriminative embedding, and 2) explicitly constrain the emotion and speaker identity of synthetic speech to be that as expected. Moreover, we present an intuitive method to control the emotion strength in the synthetic speech for the target speaker. Specifically, the learned emotion embedding is adjusted with a flexible scalar value, which allows controlling the emotion strength conveyed by the embedding. Extensive experiments have been conducted on a Mandarin disjoint corpus, and the results demonstrate that the proposed method is able to synthesize reasonable emotional speech for the target speaker. Compared to the state-of-the-art reference embedding learned methods, our method gets the best performance on the cross-speaker emotion transfer task, indicating that our method achieves the new state-of-the-art performance on learning the speaker-irrelevant emotion embedding. Furthermore, the strength ranking test and pitch trajectories plots demonstrate that the proposed method can effectively control the emotion strength, leading to prosody-diverse synthetic speech.

preprint2022arXiv

Cross-speaker Emotion Transfer Based On Prosody Compensation for End-to-End Speech Synthesis

Cross-speaker emotion transfer speech synthesis aims to synthesize emotional speech for a target speaker by transferring the emotion from reference speech recorded by another (source) speaker. In this task, extracting speaker-independent emotion embedding from reference speech plays an important role. However, the emotional information conveyed by such emotion embedding tends to be weakened in the process to squeeze out the source speaker's timbre information. In response to this problem, a prosody compensation module (PCM) is proposed in this paper to compensate for the emotional information loss. Specifically, the PCM tries to obtain speaker-independent emotional information from the intermediate feature of a pre-trained ASR model. To this end, a prosody compensation encoder with global context (GC) blocks is introduced to obtain global emotional information from the ASR model's intermediate feature. Experiments demonstrate that the proposed PCM can effectively compensate the emotion embedding for the emotional information loss, and meanwhile maintain the timbre of the target speaker. Comparisons with state-of-the-art models show that our proposed method presents obvious superiority on the cross-speaker emotion transfer task.

preprint2022arXiv

Heegaard genus, degree-one maps, and amalgamation of 3-manifolds

Let $M=W\cup_T V$ be an amalgamation of two compact 3-manifolds along a torus, where $W$ is the exterior of a knot in a homology sphere. Let $N$ be the manifold obtained by replacing $W$ with a solid torus such that the boundary of a Seifert surface in $W$ is a meridian of the solid torus. This means that there is a degree-one map $f\colon M\to N$, pinching $W$ into a solid torus while fixing $V$. We prove that $g(M)\ge g(N)$, where $g(M)$ denotes the Heegaard genus. An immediate corollary is that the tunnel number of a satellite knot is at least as large as the tunnel number of its pattern knot.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Measurement of high-pressure xenon gas absorption in acrylic

Acrylic is a popular structural material in experiments requiring low background because of its radio-purity, machinability, and mechanical strength. However, its porosity may cause significant gas absorption and influence the detector stability in the long term. The interaction between acrylic and other detector materials becomes one of the key concerns in the detector design. In this paper, we carry out an experiment to measure quantitatively the absorption process of high-pressure xenon gas into acrylic. A specific setup is designed for the measurement, and systematic measurements are done to obtain a result of the absorption amount: 0.98 g xenon into 332 g of acrylic.

preprint2022arXiv

MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion

Measuring the perceptual quality of images automatically is an essential task in the area of computer vision, as degradations on image quality can exist in many processes from image acquisition, transmission to enhancing. Many Image Quality Assessment(IQA) algorithms have been designed to tackle this problem. However, it still remains un settled due to the various types of image distortions and the lack of large-scale human-rated datasets. In this paper, we propose a novel algorithm based on the Swin Transformer [31] with fused features from multiple stages, which aggregates information from both local and global features to better predict the quality. To address the issues of small-scale datasets, relative rankings of images have been taken into account together with regression loss to simultaneously optimize the model. Furthermore, effective data augmentation strategies are also used to improve the performance. In comparisons with previous works, experiments are carried out on two standard IQA datasets and a challenge dataset. The results demonstrate the effectiveness of our work. The proposed method outperforms other methods on standard datasets and ranks 2nd in the no-reference track of NTIRE 2022 Perceptual Image Quality Assessment Challenge [53]. It verifies that our method is promising in solving diverse IQA problems and thus can be used to real-word applications.

preprint2022arXiv

Phylogenetic Study of 2019-nCoV by Using Alignment Free Method (Evolutionary Bifurcation of Novel Coronavirus Mutants)

The phylogenetic tree of SARS-CoV-2 (nCov-19) viruses is reconstructed according to the similarity of genome sequences. The tree topology of Betacoronavirus is remarkably consistent with biologist's systematics. Because the tree construction contains enough information about virus mutants, it is suitable to study the evolutionary relationship between novel coronavirus mutants transmitted among humans. The emergences of 14 kinds of main mutants are studied and these strains can be classified as eight bifurcations of the phylogenetic tree. It is found that there exist three types of virus mutations, namely, the mutation among sub-branches of the same branch, the off-root mutation and the root-oriented mutation between large branches of the tree. From the point of the relation between viral mutation and host selection we found that individuals with low immunity provide a special environment for the positive natural selection of virus evolution. It gives a mechanism to explain why large mutations between two distant branches generally occur in the nCov-19 phylogenetic tree. The finding is helpful to formulate strategies to control the spread of COVID-19.

preprint2022arXiv

Progressive Multi-scale Consistent Network for Multi-class Fundus Lesion Segmentation

Effectively integrating multi-scale information is of considerable significance for the challenging multi-class segmentation of fundus lesions because different lesions vary significantly in scales and shapes. Several methods have been proposed to successfully handle the multi-scale object segmentation. However, two issues are not considered in previous studies. The first is the lack of interaction between adjacent feature levels, and this will lead to the deviation of high-level features from low-level features and the loss of detailed cues. The second is the conflict between the low-level and high-level features, this occurs because they learn different scales of features, thereby confusing the model and decreasing the accuracy of the final prediction. In this paper, we propose a progressive multi-scale consistent network (PMCNet) that integrates the proposed progressive feature fusion (PFF) block and dynamic attention block (DAB) to address the aforementioned issues. Specifically, PFF block progressively integrates multi-scale features from adjacent encoding layers, facilitating feature learning of each layer by aggregating fine-grained details and high-level semantics. As features at different scales should be consistent, DAB is designed to dynamically learn the attentive cues from the fused features at different scales, thus aiming to smooth the essential conflicts existing in multi-scale features. The two proposed PFF and DAB blocks can be integrated with the off-the-shelf backbone networks to address the two issues of multi-scale and feature inconsistency in the multi-class segmentation of fundus lesions, which will produce better feature representation in the feature space. Experimental results on three public datasets indicate that the proposed method is more effective than recent state-of-the-art methods.

preprint2022arXiv

Secure and Efficient Tunneling of MACsec for Modern Industrial Use Cases

Trends like Industry 4.0 will pose new challenges for future industrial networks. Greater interconnectedness, higher data volumes as well as new requirements for speeds as well as security will make new approaches necessary. Performanceoptimized networking techniques will be demanded to implement new use cases, like network separation and isolation, in a secure fashion. A new and highly efficient protocol, that will be vital for that purpose, is MACsec. It is a Layer 2 encryption protocol that was previously extended specifically for industrial environments. Yet, it lacks the ability to bridge local networks. Therefore, in this work, we propose a secure and efficient Layer 3 tunneling scheme for MACsec. We design and implement two approaches, that are equally secure and considerably outperform comparable state-of-the-art techniques.

preprint2022arXiv

Subspace Adversarial Training

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust. However, a serious problem of catastrophic overfitting exists, i.e., the robust accuracy against projected gradient descent (PGD) attack suddenly drops to 0% during the training. In this paper, we approach this problem from a novel perspective of optimization and firstly reveal the close link between the fast-growing gradient of each sample and overfitting, which can also be applied to understand robust overfitting in multi-step AT. To control the growth of the gradient, we propose a new AT method, Subspace Adversarial Training (Sub-AT), which constrains AT in a carefully extracted subspace. It successfully resolves both kinds of overfitting and significantly boosts the robustness. In subspace, we also allow single-step AT with larger steps and larger radius, further improving the robustness performance. As a result, we achieve state-of-the-art single-step AT performance. Without any regularization term, our single-step AT can reach over 51% robust accuracy against strong PGD-50 attack of radius 8/255 on CIFAR-10, reaching a competitive performance against standard multi-step PGD-10 AT with huge computational advantages. The code is released at https://github.com/nblt/Sub-AT.

preprint2022arXiv

Tightly Coupled Optimization-based GPS-Visual-Inertial Odometry with Online Calibration and Initialization

In this paper, we present a tightly coupled optimization-based GPS-Visual-Inertial odometry system to solve the trajectory drift of the visual-inertial odometry especially over long-term runs. Visual reprojection residuals, IMU residuals, and GPS measurement residuals are jointly minimized within a local bundle adjustment, in which we apply GPS measurements and IMU preintegration used for the IMU residuals to formulate a novel GPS residual. To improve the efficiency and robustness of the system, we propose a fast reference frames initialization method and an online calibration method for GPS-IMU extrinsic and time offset. In addition, we further test the performance and convergence of our online calibration method. Experimental results on EuRoC datasets show that our method consistently outperforms other tightly coupled and loosely coupled approaches. Meanwhile, this system has been validated on KAIST datasets, which proves that our system can work well in the case of visual or GPS failure.

preprint2021arXiv

A Benchmark of Ocular Disease Intelligent Recognition: One Shot for Multi-disease Detection

In ophthalmology, early fundus screening is an economic and effective way to prevent blindness caused by ophthalmic diseases. Clinically, due to the lack of medical resources, manual diagnosis is time-consuming and may delay the condition. With the development of deep learning, some researches on ophthalmic diseases have achieved good results, however, most of them are just based on one disease. During fundus screening, ophthalmologists usually give diagnoses of multi-disease on binocular fundus image, so we release a dataset with 8 diseases to meet the real medical scene, which contains 10,000 fundus images from both eyes of 5,000 patients. We did some benchmark experiments on it through some state-of-the-art deep neural networks. We found simply increasing the scale of network cannot bring good results for multi-disease classification, and a well-structured feature fusion method combines characteristics of multi-disease is needed. Through this work, we hope to advance the research of related fields.

preprint2021arXiv

A route to engineered high aspect-ratio silicon nanostructures through regenerative secondary mask lithography

Silicon nanostructuring imparts unique material properties including antireflectivity, antifogging, anti-icing, self-cleaning, and/or antimicrobial activity. To tune these properties however, a good control over features size and shape is essential. Here, a versatile fabrication process is presented to achieve tailored silicon nanostructures (thin/thick pillars, sharp/truncated/re-entrant cones), of pitch down to ~50 nm, and high-aspect ratio (>10). The approach relies on pre-assembled block copolymer (BCP) micelles and their direct transfer into a glass hard mask of an arbitrary thickness, now enabled by our recently reported regenerative secondary mask lithography. During this pattern transfer, not only the mask diameter can be decreased but also uniquely increased; constituting the first method to achieve such tunability without necessitating a different molecular weight BCP. Consequently, the hard mask modulation (height, diameter) advances the flexibility in attainable inter-pillar spacing, aspect ratios, and re-entrant profiles (= glass on silicon). Combined with adjusted silicon etch conditions, the morphology of nanopatterns can be highly customized. The process control and scalability enable uniform patterning of a 6-inch wafer which is verified through cross-wafer excellent antireflectivity (<5%) and water-repellency (advancing contact angle 158°; hysteresis 1°). It is envisioned the implementation of this approach to silicon nanostructuring to be far-reaching, facilitating fundamental studies and targeting applications spanning solar panels, antifogging/antibacterial surfaces, sensing, amongst many others.

preprint2021arXiv

Applications of Deep Learning in Fundus Images: A Review

The use of fundus images for the early screening of eye diseases is of great clinical importance. Due to its powerful performance, deep learning is becoming more and more popular in related applications, such as lesion segmentation, biomarkers segmentation, disease diagnosis and image synthesis. Therefore, it is very necessary to summarize the recent developments in deep learning for fundus images with a review paper. In this review, we introduce 143 application papers with a carefully designed hierarchy. Moreover, 33 publicly available datasets are presented. Summaries and analyses are provided for each task. Finally, limitations common to all tasks are revealed and possible solutions are given. We will also release and regularly update the state-of-the-art results and newly-released datasets at https://github.com/nkicsl/Fundus Review to adapt to the rapid development of this field.

preprint2021arXiv

Cooperative output feedback tracking control of stochastic linear heterogeneous multi-agent systems

We study cooperative output feedback tracking control of stochastic linear heterogeneous leader-following multi-agent systems. Each agent has a continuous-time linear heterogeneous dynamics with incompletely measurable state, and there are additive and multiplicative noises along with information exchange among agents. We propose a set of admissible distributed observation strategies for estimating the leader&#39;s and the followers&#39; states, and a set of admissible cooperative output feedback control strategies based on the certainty equivalence principle. By output regulation theory and stochastic analysis, we show that for observable leader&#39;s dynamics and stabilizable and detectable followers&#39; dynamics, if the intensity coefficient of multiplicative noises multiplied by the sum of real parts of the leader&#39; s unstable modes is less than 1/4 of the minimum non-zero eigenvalue of graph Laplacian, then there exist admissible distributed observation and cooperative control strategies to ensure mean square bounded output tracking, provided the associated output regulation equations are solvable. Finally, the effectiveness of our control strategies is demonstrated by a numerical simulation.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2020arXiv

A Deep Learning Method for Complex Human Activity Recognition Using Virtual Wearable Sensors

Sensor-based human activity recognition (HAR) is now a research hotspot in multiple application areas. With the rise of smart wearable devices equipped with inertial measurement units (IMUs), researchers begin to utilize IMU data for HAR. By employing machine learning algorithms, early IMU-based research for HAR can achieve accurate classification results on traditional classical HAR datasets, containing only simple and repetitive daily activities. However, these datasets rarely display a rich diversity of information in real-scene. In this paper, we propose a novel method based on deep learning for complex HAR in the real-scene. Specially, in the off-line training stage, the AMASS dataset, containing abundant human poses and virtual IMU data, is innovatively adopted for enhancing the variety and diversity. Moreover, a deep convolutional neural network with an unsupervised penalty is proposed to automatically extract the features of AMASS and improve the robustness. In the on-line testing stage, by leveraging advantages of the transfer learning, we obtain the final result by fine-tuning the partial neural network (optimizing the parameters in the fully-connected layers) using the real IMU data. The experimental results show that the proposed method can surprisingly converge in a few iterations and achieve an accuracy of 91.15% on a real IMU dataset, demonstrating the efficiency and effectiveness of the proposed method.

preprint2020arXiv

Attention-SLAM: A Visual Monocular SLAM Learning from Human Gaze

This paper proposes a novel simultaneous localization and mapping (SLAM) approach, namely Attention-SLAM, which simulates human navigation mode by combining a visual saliency model (SalNavNet) with traditional monocular visual SLAM. Most SLAM methods treat all the features extracted from the images as equal importance during the optimization process. However, the salient feature points in scenes have more significant influence during the human navigation process. Therefore, we first propose a visual saliency model called SalVavNet in which we introduce a correlation module and propose an adaptive Exponential Moving Average (EMA) module. These modules mitigate the center bias to enable the saliency maps generated by SalNavNet to pay more attention to the same salient object. Moreover, the saliency maps simulate the human behavior for the refinement of SLAM results. The feature points extracted from the salient regions have greater importance in optimization process. We add semantic saliency information to the Euroc dataset to generate an open-source saliency SLAM dataset. Comprehensive test results prove that Attention-SLAM outperforms benchmarks such as Direct Sparse Odometry (DSO), ORB-SLAM, and Salient DSO in terms of efficiency, accuracy, and robustness in most test cases.

preprint2020arXiv

Augmenting Neural Networks with First-order Logic

Today, the dominant paradigm for training neural networks involves minimizing task loss on a large dataset. Using world knowledge to inform a model, and yet retain the ability to perform end-to-end training remains an open question. In this paper, we present a novel framework for introducing declarative knowledge to neural network architectures in order to guide training and prediction. Our framework systematically compiles logical statements into computation graphs that augment a neural network without extra learnable parameters or manual redesign. We evaluate our modeling strategy on three tasks: machine comprehension, natural language inference, and text chunking. Our experiments show that knowledge-augmented networks can strongly improve over baselines, especially in low-data regimes.

preprint2020arXiv

Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains

Reinforcement learning algorithms have had tremendous successes in online learning settings. However, these successes have relied on low-stakes interactions between the algorithmic agent and its environment. In many settings where RL could be of use, such as health care and autonomous driving, the mistakes made by most online RL algorithms during early training come with unacceptable costs. These settings require developing reinforcement learning algorithms that can operate in the so-called batch setting, where the algorithms must learn from set of data that is fixed, finite, and generated from some (possibly unknown) policy. Evaluating policies different from the one that collected the data is called off-policy evaluation, and naturally poses counter-factual questions. In this project we show how off-policy evaluation and the estimation of treatment effects in causal inference are two approaches to the same problem, and compare recent progress in these two areas.

preprint2020arXiv

Crystalline and magnetic structures, magnetization, heat capacity and anisotropic magnetostriction effect in a yttrium-chromium oxide

We have studied a nearly stoichiometric insulating Y$_{0.97(2)}$Cr$_{0.98(2)}$O$_{3.00(2)}$ single crystal by performing measurements of magnetization, heat capacity, and neutron diffraction. Albeit that the YCrO$_3$ compound behaviors like a soft ferromagnet with a coersive force of $\sim$ 0.05 T, there exist strong antiferromagnetic (AFM) interactions between Cr$^{3+}$ spins due to a strongly negative paramagnetic Curie-Weiss temperature, i.e., -433.2(6) K. The coexistence of ferromagnetism and antiferromagnetism may indicate a canted AFM structure. The AFM phase transition occurs at $T_\textrm{N} =$ 141.5(1) K, which increases to $T_\textrm{N}$(5T) = 144.5(1) K at 5 T. Within the accuracy of the present neuron-diffraction studies, we determine a G-type AFM structure with a propagation vector \textbf{k} = (1 1 0) and Cr$^{3+}$ spin directions along the crystallographic \emph{c} axis of the orthorhombic structure with space group \emph{Pnma} below $T_\textrm{N}$. At 12 K, the refined moment size is 2.45(6) $μ_\textrm{B}$, $\sim$ 82\% of the theoretical saturation value 3 $μ_\textrm{B}$. The Cr$^{3+}$ spin interactions are probably two-dimensional Ising like within the reciprocal (1 1 0) scattering plane. Below $T_\textrm{N}$, the lattice configuration (\emph{a}, \emph{b}, \emph{c}, and \emph{V}) deviates largely downward from the Gr$\ddot{\textrm{u}}$neisen law, displaying an anisotropic magnetostriction effect and a magnetoelastic effect. Especially, the sample contraction upon cooling is enhanced below the AFM transition temperature. There is evidence to suggest that the actual crystalline symmetry of YCrO$_3$ compound is probably lower than the currently assumed one. Additionally, we compared the $t_{2\textrm{g}}$ YCrO$_3$ and the $e_\textrm{g}$ La$_{7/8}$Sr$_{1/8}$MnO$_3$ single crystals for a further understanding of the reason for the possible symmetry lowering.

preprint2020arXiv

Detecting EPR steering via two classes of local measurements

We study the Einstein-Podolsky-Rosen (EPR) steering and present steerability criteria for arbitrary qubit-qudit (qudit-qubit) systems based on mutually unbiased measurements (MUMs) and general symmetric informationally complete measurements (general SIC-POVMs). Avoiding the usual complicated steering inequalities, these criteria can be more operational than some existing criteria and implemented experimentally. Detailed examples are given to illustrate the efficiency of the criteria in both computation and experimental implementation.

preprint2020arXiv

Dynamics of planar vector fields near a non-smooth equilibrium

In this paper we contribute to qualitative and geometric analysis of planar piecewise smooth vector fields, which consist of two smooth vector fields separated by the straight line $y=0$ and sharing the origin as a non-degenerate equilibrium. In the sense of $Σ$-equivalence, we provide a sufficient condition for linearization and give phase portraits and normal forms for these linearizable vector fields. This condition is hard to be weakened because there exist vector fields which are not linearizable when this condition is not satisfied. Regarding perturbations, a necessary and sufficient condition for local $Σ$-structural stability is established when the origin is still an equilibrium of both smooth vector fields under perturbations. In the opposition to this case, we prove that for any piecewise smooth vector field studied in this paper there is a limit cycle bifurcating from the origin, and there are some piecewise smooth vector fields such that for any positive integer $m$ there is a perturbation having exactly $m$ limit cycles bifurcating from the origin. Here $m$ maybe infinity.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Ferroelastic-switching-driven colossal shear strain and piezoelectricity in a hybrid ferroelectric

Materials that can produce large controllable strains are widely used in shape memory devices, actuators and sensors. Great efforts have been made to improve the strain outputs of various material systems. Among them, ferroelastic transitions underpin giant reversible strains in electrically-driven ferro/piezoelectrics and thermally- or magneticallydriven shape memory alloys. However, large-strain ferroelastic switching in conventional ferroelectrics is very challenging while magnetic and thermal controls are not desirable for applications. Here, we demonstrate an unprecedentedly large shear strain up to 21.5 % in a hybrid ferroelectric, C6H5N(CH3)3CdCl3. The strain response is about two orders of magnitude higher than those of top-performing conventional ferroelectric polymers and oxides. It is achieved via inorganic bond switching and facilitated by the structural confinement of the large organic moieties, which prevents the undesired 180-degree polarization switching. Furthermore, Br substitution can effectively soften the bonds and result in giant shear piezoelectric coefficient (d35 ~ 4800 pm/V) in Br-rich end of the solid solution, C6H5N(CH3)3CdBr3xCl3(1-x). The superior electromechanical properties of the compounds promise their potential in lightweight and high energy density devices, and the strategy described here should inspire the development of next-generation piezoelectrics and electroactive materials based on hybrid ferroelectrics.

preprint2020arXiv

Gauge-induced Floquet topological states in photonic waveguides

Tremendous efforts have been devoted to the search for exotic topological states, which usually exist at an interface between lattices with differing topological invariants according to the bulk-edge correspondence. Here, we show a new finding of topological states localized at the interface between two gauge-shifted Floquet photonic lattices, despite the same topological order across the entire structure. The quasienergy band structures reveal that these new interface modes belong to the Floquet π modes, which are further found to enable a robust one-way propagation thanks to the flexible control of the Floquet gauge. The intriguing propagations of these π-interface modes are experimentally verified in a silicon waveguides platform at near-infrared wavelengths, which show both broad working bandwidth and high tolerance to the structural fluctuations. Our approach provides a new route for light manipulations with robust behaviours in Floquet engineering and beyond.

preprint2020arXiv

Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning

Extracting effective deep features to represent content and style information is the key to universal style transfer. Most existing algorithms use VGG19 as the feature extractor, which incurs a high computational cost and impedes real-time style transfer on high-resolution images. In this work, we propose a lightweight alternative architecture - ArtNet, which is based on GoogLeNet, and later pruned by a novel channel pruning method named Zero-channel Pruning specially designed for style transfer approaches. Besides, we propose a theoretically sound sandwich swap transform (S2) module to transfer deep features, which can create a pleasing holistic appearance and good local textures with an improved content preservation ability. By using ArtNet and S2, our method is 2.3 to 107.4 times faster than state-of-the-art approaches. The comprehensive experiments demonstrate that ArtNet can achieve universal, real-time, and high-quality style transfer on high-resolution images simultaneously, (68.03 FPS on 512 times 512 images).

preprint2020arXiv

Self-Learning with Rectification Strategy for Human Parsing

In this paper, we solve the sample shortage problem in the human parsing task. We begin with the self-learning strategy, which generates pseudo-labels for unlabeled data to retrain the model. However, directly using noisy pseudo-labels will cause error amplification and accumulation. Considering the topology structure of human body, we propose a trainable graph reasoning method that establishes internal structural connections between graph nodes to correct two typical errors in the pseudo-labels, i.e., the global structural error and the local consistency error. For the global error, we first transform category-wise features into a high-level graph model with coarse-grained structural information, and then decouple the high-level graph to reconstruct the category features. The reconstructed features have a stronger ability to represent the topology structure of the human body. Enlarging the receptive field of features can effectively reducing the local error. We first project feature pixels into a local graph model to capture pixel-wise relations in a hierarchical graph manner, then reverse the relation information back to the pixels. With the global structural and local consistency modules, these errors are rectified and confident pseudo-labels are generated for retraining. Extensive experiments on the LIP and the ATR datasets demonstrate the effectiveness of our global and local rectification modules. Our method outperforms other state-of-the-art methods in supervised human parsing tasks.

preprint2020arXiv

Structured Tuning for Semantic Role Labeling

Recent neural network-driven semantic role labeling (SRL) systems have shown impressive improvements in F1 scores. These improvements are due to expressive input representations, which, at least at the surface, are orthogonal to knowledge-rich constrained decoding mechanisms that helped linear SRL models. Introducing the benefits of structure to inform neural models presents a methodological challenge. In this paper, we present a structured tuning framework to improve models using softened constraints only at training time. Our framework leverages the expressiveness of neural networks and provides supervision with structured loss components. We start with a strong baseline (RoBERTa) to validate the impact of our approach, and show that our framework outperforms the baseline by learning to comply with declarative constraints. Additionally, our experiments with smaller training sizes show that we can achieve consistent improvements under low-resource scenarios.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

Theory of the anomalous spin dynamics of spin-$\frac{1}{2}$ triangular lattice Heisenberg antiferromagnet and its application to Ba$_3$CoSb$_2$O$_9$

Although it is well accepted that the spin-$\frac{1}{2}$ triangular lattice Heisenberg antiferromagnet(TLHAF) has a long range ordered ground state, a thorough understanding of its spin dynamics is still missing. While the linear spin wave theory(LSWT) predicts three branches of magnon mode in the magnetic Brillouin zone(MBZ), the 1/S expansion at the next order is found to break down in a large portion of the MBZ centered around the M point, leaving the fate of the magnon modes there undecided. Recent neutron scattering measurement on Ba$_3$CoSb$_2$O$_9$, an ideal realization of the spin-$\frac{1}{2}$ TLHAF, provides a surprising answer to this issue. Two, rather than three branches of magnon mode are observed around the M point, whose dispersion are strongly renormalized with respect to the LSWT prediction and exhibit pronounced roton-like minimum at the M point. This is accompanied by a strong spin fluctuation continuum at higher energy, inside which two strong and broad spectral peaks of unknown origin are observed. In this work, we propose a simple picture for these spectral anomalies by invoking the resonating valence bond(RVB) physics in the description of the ground state of the system. We find that the roton-like minimum in the magnon dispersion can be explained by the coupling between the collective spin fluctuation and the continuum of Dirac spinon excitation moving in a $π$-flux background. We also propose that the two broad peaks in the continuum can be understood respectively as the Landau damped third magnon mode and the Landau damped longitudinal mode. Such a picture can be verified by studying the polarization character of the various spectral features.

preprint2020arXiv

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

Quantization has been proven to be an effective method for reducing the computing and/or storage cost of DNNs. However, the trade-off between the quantization bitwidth and final accuracy is complex and non-convex, which makes it difficult to be optimized directly. Minimizing direct quantization loss (DQL) of the coefficient data is an effective local optimization method, but previous works often neglect the accurate control of the DQL, resulting in a higher loss of the final DNN model accuracy. In this paper, we propose a novel metric called Vector Loss. Based on this new metric, we develop a new quantization solution called VecQ, which can guarantee minimal direct quantization loss and better model accuracy. In addition, in order to speed up the proposed quantization process during model training, we accelerate the quantization process with a parameterized probability estimation method and template-based derivation calculation. We evaluate our proposed algorithm on MNIST, CIFAR, ImageNet, IMDB movie review and THUCNews text data sets with numerical DNN models. The results demonstrate that our proposed quantization solution is more accurate and effective than the state-of-the-art approaches yet with more flexible bitwidth support. Moreover, the evaluation of our quantized models on Saliency Object Detection (SOD) tasks maintains comparable feature extraction quality with up to 16$\times$ weight size reduction.

preprint2020arXiv

Vertical CVD Growth of Highly Uniform Transition Metal Dichalcogenides

Two-dimensional (2D) transition metal dichalcogenides (TMDCs) have attracted great attention due to their physical and chemical properties that make them promising in electronics and optoelectronics. Because of the difficulties in controlling concentrations of solid precursors and spatially non-uniform growth dynamics, it is challenging to grow wafer-scale 2D TMDCs with good uniformity and reproducibility so far, which significantly hinders their practical use. Here we report a vertical chemical vapor deposition (VCVD) design to grow monolayer TMDCs with a uniform density and high quality over the whole wafer, and with excellent reproducibility. The use of such VCVD design can easily control the three key growth parameters of precursor concentration, gas flow and temperature, which cannot be done in currently widely-used horizontal CVD system. Statistical results show that VCVD-grown monolayer TMDCs including MoS2 and WS2 are of high uniformity and quality on substrates over centimeter size. We also fabricated multiple van der Waals heterostructures by the one-step transfer of VCVD-grown TMDC samples, owning to its good uniformity. This work opens a way to grow 2D materials with high uniformity and reproducibility on the wafer scale, which can be used for the scalable fabrication of 2D materials and their heterostructures.

preprint2019arXiv

Periodic orbits of linear Filippov systems with a line of discontinuity

In this paper we consider periodic orbits of planar linear Filippov systems with a line of discontinuity. Unlike many publications researching only the maximum number of crossing periodic orbits, we investigate not only the number and configuration of sliding periodic orbits, but also the coexistence of sliding periodic orbits and crossing ones. Firstly, we prove that the number of sliding periodic orbits is at most 2, and give all possible configurations of one or two sliding periodic orbits. Secondly, we prove that two sliding periodic orbits coexist with at most one crossing periodic orbit, and one sliding periodic orbit can coexist with two crossing ones.

preprint2018arXiv

Collaborative Filtering with Topic and Social Latent Factors Incorporating Implicit Feedback

Recommender systems (RSs) provide an effective way of alleviating the information overload problem by selecting personalized items for different users. Latent factors based collaborative filtering (CF) has become the popular approaches for RSs due to its accuracy and scalability. Recently, online social networks and user-generated content provide diverse sources for recommendation beyond ratings. Although {\em social matrix factorization} (Social MF) and {\em topic matrix factorization} (Topic MF) successfully exploit social relations and item reviews, respectively, both of them ignore some useful information. In this paper, we investigate the effective data fusion by combining the aforementioned approaches. First, we propose a novel model {\em \mbox{MR3}} to jointly model three sources of information (i.e., ratings, item reviews, and social relations) effectively for rating prediction by aligning the latent factors and hidden topics. Second, we incorporate the implicit feedback from ratings into the proposed model to enhance its capability and to demonstrate its flexibility. We achieve more accurate rating prediction on real-life datasets over various state-of-the-art methods. Furthermore, we measure the contribution from each of the three data sources and the impact of implicit feedback from ratings, followed by the sensitivity analysis of hyperparameters. Empirical studies demonstrate the effectiveness and efficacy of our proposed model and its extension.

preprint2018arXiv

Music Sequence Prediction with Mixture Hidden Markov Models

Recommendation systems that automatically generate personalized music playlists for users have attracted tremendous attention in recent years. Nowadays, most music recommendation systems rely on item-based or user-based collaborative filtering or content-based approaches. In this paper, we propose a novel mixture hidden Markov model (HMM) for music play sequence prediction. We compare the mixture model with state-of-the-art methods and evaluate the predictions quantitatively and qualitatively on a large-scale real-world dataset in a Kaggle competition. Results show that our model significantly outperforms traditional methods as well as other competitors. We conclude by envisioning a next-generation music recommendation system that integrates our model with recent advances in deep learning, computer vision, and speech techniques, and has promising potential in both academia and industry.

preprint2018arXiv

Self-error-rejecting quantum state transmission of entangled photons for faithful quantum communication without calibrate reference frames

We propose an alignment-free two-party polarization-entanglement transmission scheme for entangled photons by using only linear-optical elements, requiring neither ancillary photons nor calibrated reference frames. The scheme is robust against both the random channel noise and the instability of reference frames, and it is subsequently extended to multi-party Greenberger-Horne-Zeilinger state transmission. Furthermore, the success probabilities for two- and multi-party entanglement transmission are, in principle, improved to unity when active polarization controllers are used. The distinct characters of a simple structure, easy to be implemented, and a high fidelity and efficiency make our protocol very useful for long-distance quantum communications and distributed quantum networks in practical applications.