Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
65works
0followers
38topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

65 published item(s)

preprint2026arXiv

Crafting Reversible SFT Behaviors in Large Language Models

Supervised fine-tuning (SFT) induces new behaviors in large language models, yet imposes no structural constraint on how these behaviors are distributed within the model. Existing behavior interpretation methods, such as circuit attribution approaches, identify sparse subnetworks correlated with SFT-induced behaviors post-hoc. However, such correlations do not imply *causal necessity*, limiting the ability to selectively control SFT-induced behaviors at inference time. We pursue an alternative by asking: can an SFT-induced behavior be deliberately compressed into a sparse, mechanistically necessary subnetwork, termed a *carrier*, while remaining controllable at inference time without weight modification? We propose (a) **Loss-Constrained Dual Descent (LCDD)**, which constructs such carriers by jointly optimizing routing masks and model weights under an explicit utility budget, and (b) **SFT-Eraser**, a soft prompt optimized via activation matching on extracted carrier channels, to reverse the SFT-induced behavior. Across safety, fixed-response, and style behaviors on multiple model families, LCDD yields sparse carriers that preserve target behaviors while enabling strong reversion when triggered by SFT-Eraser. Ablations further establish that the sparse structure is the key precondition for reversal: the same trigger optimization fails on standard SFT models, confirming that structure rather than trigger design is the operative factor. These results provide direct evidence that the learned carriers are causally necessary for the behaviors, pointing to a new direction for systematically localizing and selectively suppressing SFT-induced behaviors in deployed models.

preprint2025arXiv

Baryogenesis via the Chiral Magnetic Effect in a First-Order Electroweak Phase Transition

In this paper, we investigate the generation of the baryon asymmetry of the universe during the first-order electroweak phase transition. We first study the generation of the helical magnetic field in the framework of the standard model effective field theory with a CP-violating operator. We show that, when the chiral magnetic effect is absent, the helical magnetic field and effective chemical potential cannot generate enough baryon asymmetry when vacuum bubbles collide. We further find that the chiral magnetic effect can amplify the lepton asymmetry in the early universe during the phase transition. We present the baryon asymmetry interpretation requirement on certain parameter spaces of the phase transition and the primordial magnetic field.

preprint2023arXiv

Applications of Gorenstein projective $τ$-rigid modules

We first introduce the notion of $CM$-$τ$-tilting free algebras as the generalization of $CM$-free algebras and show the homological properties of $CM$-$τ$-tilting free algebras. Then we give a bijection between Gorenstein projective $τ$-rigid modules and certain modules by using an equivalence established by Kong and Zhang. Finally, we give a partial answer to Tachikawa's first conjecture by using Gorenstein projective $τ$-rigid modules.

preprint2023arXiv

Coexistence of zigzag antiferromagnetic order and superconductivity in compressed NiPSe3

NiPSe3 is regarded as a bandwidth-controlled Mott insulator, distinct from the widely studied Mott insulating magnetic graphene MPSe3 (M = Mn and Fe) family. By employing high-pressure synchrotron X-ray diffraction, we observe two structural transitions as a function of pressure. With the help of first-principles calculations, we discover the antiferromagnetic (AFM) moment directions of NiPSe3 switch from out-of-plane to in-plane and the honeycomb layers slide relative to each other at the first structural transition. The in-plane AFM order persists until the second structural transition, whereupon the two-dimensional (2D) structure assumes a more three-dimensional (3D) character. A bandwidth-controlled Mott insulator-metal transition (IMT) occurs between the two structural transitions at 8.0 GPa, concomitant with the emergence of superconductivity with 4.8 K. The superconductivity in NiPSe3 emerging in the 2D monoclinic phase coexists with the in-plane AFM order and continues into the 3D trigonal phase. Our electronic structure calculations reveal that the Mott IMT and superconductivity in NiPSe3 are both closely related to the enhanced Se2- 4p and Ni2+ 3d electronic hybridizations under pressure. From these results, we construct the temperature-pressure electronic phase diagram of NiPSe3, revealing rich physics and many similarities with copper oxide and iron-based superconductors.

preprint2023arXiv

Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation

In this letter, we propose a novel semi-supervised subspace clustering method, which is able to simultaneously augment the initial supervisory information and construct a discriminative affinity matrix. By representing the limited amount of supervisory information as a pairwise constraint matrix, we observe that the ideal affinity matrix for clustering shares the same low-rank structure as the ideal pairwise constraint matrix. Thus, we stack the two matrices into a 3-D tensor, where a global low-rank constraint is imposed to promote the affinity matrix construction and augment the initial pairwise constraints synchronously. Besides, we use the local geometry structure of input samples to complement the global low-rank prior to achieve better affinity matrix learning. The proposed model is formulated as a Laplacian graph regularized convex low-rank tensor representation problem, which is further solved with an alternative iterative algorithm. In addition, we propose to refine the affinity matrix with the augmented pairwise constraints. Comprehensive experimental results on eight commonly-used benchmark datasets demonstrate the superiority of our method over state-of-the-art methods. The code is publicly available at https://github.com/GuanxingLu/Subspace-Clustering.

preprint2022arXiv

A spherical harmonics method for processing anisotropic X-ray atomic pair distribution functions

A general spherical harmonics method is described for extracting anisotropic pair distribution functions (PDF) in this work. In the structural study of functional crystallized materials, the investigation of the local structures under the application of external stimuli, such as electric field and stress, is in urgent need. A well-established technique for local structure studies is PDF analysis, but the extraction of the X-ray PDF data is usually based on angular integrations of isotropic X-ray structure functions, which is no longer valid for the anisotropic responses of the materials under orientation-dependent stimuli. Therefore, we have developed an advanced spherical harmonics method to transform two-dimensional X-ray diffraction patterns into anisotropic PDF data, based on three-dimensional diffraction geometry and Fourier transform. The electrical-field-induced local structural change in the PbZr0.54Ti0.46O3 ceramics is then presented to demonstrate the method's effectiveness.

preprint2022arXiv

Adaptive Attribute and Structure Subspace Clustering Network

Deep self-expressiveness-based subspace clustering methods have demonstrated effectiveness. However, existing works only consider the attribute information to conduct the self-expressiveness, which may limit the clustering performance. In this paper, we propose a novel adaptive attribute and structure subspace clustering network (AASSC-Net) to simultaneously consider the attribute and structure information in an adaptive graph fusion manner. Specifically, we first exploit an auto-encoder to represent input data samples with latent features for the construction of an attribute matrix. We also construct a mixed signed and symmetric structure matrix to capture the local geometric structure underlying data samples. Then, we perform self-expressiveness on the constructed attribute and structure matrices to learn their affinity graphs separately. Finally, we design a novel attention-based fusion module to adaptively leverage these two affinity graphs to construct a more discriminative affinity graph. Extensive experimental results on commonly used benchmark datasets demonstrate that our AASSC-Net significantly outperforms state-of-the-art methods. In addition, we conduct comprehensive ablation studies to discuss the effectiveness of the designed modules. The code will be publicly available at https://github.com/ZhihaoPENG-CityU.

preprint2022arXiv

Attacking Black-box Recommendations via Copying Cross-domain User Profiles

Recently, recommender systems that aim to suggest personalized lists of items for users to interact with online have drawn a lot of attention. In fact, many of these state-of-the-art techniques have been deep learning based. Recent studies have shown that these deep learning models (in particular for recommendation systems) are vulnerable to attacks, such as data poisoning, which generates users to promote a selected set of items. However, more recently, defense strategies have been developed to detect these generated users with fake profiles. Thus, advanced injection attacks of creating more `realistic' user profiles to promote a set of items is still a key challenge in the domain of deep learning based recommender systems. In this work, we present our framework CopyAttack, which is a reinforcement learning based black-box attack method that harnesses real users from a source domain by copying their profiles into the target domain with the goal of promoting a subset of items. CopyAttack is constructed to both efficiently and effectively learn policy gradient networks that first select, and then further refine/craft, user profiles from the source domain to ultimately copy into the target domain. CopyAttack's goal is to maximize the hit ratio of the targeted items in the Top-$k$ recommendation list of the users in the target domain. We have conducted experiments on two real-world datasets and have empirically verified the effectiveness of our proposed framework and furthermore performed a thorough model analysis.

preprint2022arXiv

Attention-aware contrastive learning for predicting T cell receptor-antigen binding specificity

It has been verified that only a small fraction of the neoantigens presented by MHC class I molecules on the cell surface can elicit T cells. The limitation can be attributed to the binding specificity of T cell receptor (TCR) to peptide-MHC complex (pMHC). Computational prediction of T cell binding to neoantigens is an challenging and unresolved task. In this paper, we propose an attentive-mask contrastive learning model, ATMTCR, for inferring TCR-antigen binding specificity. For each input TCR sequence, we used Transformer encoder to transform it to latent representation, and then masked a proportion of residues guided by attention weights to generate its contrastive view. Pretraining on large-scale TCR CDR3 sequences, we verified that contrastive learning significantly improved the prediction performance of TCR binding to peptide-MHC complex (pMHC). Beyond the detection of important amino acids and their locations in the TCR sequence, our model can also extracted high-order semantic information underlying the TCR-antigen binding specificity. Comparison experiments were conducted on two independent datasets, our method achieved better performance than other existing algorithms. Moreover, we effectively identified important amino acids and their positional preferences through attention weights, which indicated the interpretability of our proposed model.

preprint2022arXiv

Attention-wise masked graph contrastive learning for predicting molecular property

Accurate and efficient prediction of the molecular properties of drugs is one of the fundamental problems in drug research and development. Recent advancements in representation learning have been shown to greatly improve the performance of molecular property prediction. However, due to limited labeled data, supervised learning-based molecular representation algorithms can only search limited chemical space, which results in poor generalizability. In this work, we proposed a self-supervised representation learning framework for large-scale unlabeled molecules. We developed a novel molecular graph augmentation strategy, referred to as attention-wise graph mask, to generate challenging positive sample for contrastive learning. We adopted the graph attention network (GAT) as the molecular graph encoder, and leveraged the learned attention scores as masking guidance to generate molecular augmentation graphs. By minimization of the contrastive loss between original graph and masked graph, our model can capture important molecular structure and higher-order semantic information. Extensive experiments showed that our attention-wise graph mask contrastive learning exhibit state-of-the-art performance in a couple of downstream molecular property prediction tasks.

preprint2022arXiv

Contrastive learning-based computational histopathology predict differential expression of cancer driver genes

Digital pathological analysis is run as the main examination used for cancer diagnosis. Recently, deep learning-driven feature extraction from pathology images is able to detect genetic variations and tumor environment, but few studies focus on differential gene expression in tumor cells. In this paper, we propose a self-supervised contrastive learning framework, HistCode, to infer differential gene expressions from whole slide images (WSIs). We leveraged contrastive learning on large-scale unannotated WSIs to derive slide-level histopathological feature in latent space, and then transfer it to tumor diagnosis and prediction of differentially expressed cancer driver genes. Our extensive experiments showed that our method outperformed other state-of-the-art models in tumor diagnosis tasks, and also effectively predicted differential gene expressions. Interestingly, we found the higher fold-changed genes can be more precisely predicted. To intuitively illustrate the ability to extract informative features from pathological images, we spatially visualized the WSIs colored by the attentive scores of image tiles. We found that the tumor and necrosis areas were highly consistent with the annotations of experienced pathologists. Moreover, the spatial heatmap generated by lymphocyte-specific gene expression patterns was also consistent with the manually labeled WSI.

preprint2022arXiv

Discovery of ATLAS17jrp as an Optical, X-ray and Infrared Bright TDE in a Star-forming Galaxy

We hereby report the discovery of ATLAS17jrp as an extraordinary TDE in star-forming galaxy SDSSJ162034.99+240726.5 in our recent sample of mid-infrared outbursts in nearby galaxies. Its optical/UV light curves rise to a peak luminosity $\sim1.06\times10^{44}\rm\,erg\,s^{-1}$ in about a month and then decay as $\rm t^{-5/3}$ with a roughly constant temperature around 19000~K, and the optical spectra show a blue continuum and very broad Balmer lines with FWHM$\sim$15000 km/s which gradually narrowed to 1400 km/s within 4 years, all agreeing well with other optical TDEs. A delayed and rapidly rising X-ray flare with a peak luminosity $\rm \sim 1.27\times10^{43}\,erg\,s^{-1}$ was detected at $\rm \sim$ 170 days after the optical peak. The high MIR luminosity of ATLAS17jrp ($\sim2\times10^{43} \rm\,erg\,s^{-1}$) has revealed a distinctive dusty environment with covering factor as high as $\sim0.2$, that is comparable with that of torus in active galactic nuclei but at least one order of magnitude higher than normal optical TDEs. Therefore, ATLAS17jrp turns out to be one of the rare unambiguous TDE found in star-forming galaxies and its high dust covering factor implies that the dust extinction could play an important role in the absence of optical TDEs in star-forming galaxies.

preprint2022arXiv

Exchange field enhanced upper critical field of the superconductivity in compressed antiferromagnetic EuTe2

We report high pressure studies on the C-type antiferromagnetic semiconductor EuTe2 up to 36.0 GPa. A structural transition from the I4/mcm to C2/m space group is identified at ~16 GPa. Superconductivity is discovered above ~5 GPa in both the I4/mcm and C2/m space groups. In the low-pressure phase (< 16 GPa), the antiferromagnetic transition temperature is enhanced with increasing pressure due to the enhanced magnetic exchange interactions. Magnetoresistance measurements indicate an interplay between the local moments of Eu2+ and the conduction electrons of Te 5p orbits. The upper critical field of the superconductivity is well above the Pauli limit. Across the structural transition to the high-pressure phase (> 16 GPa), EuTe2 becomes nonmagnetic and the superconducting transition temperature evolves smoothly with the upper critical field below the Pauli limit. Therefore, the high upper critical field of EuTe2 in the low-pressure phase is due to the exchange field compensation effect of the Eu magnetic order and the superconductivity in both structures may arise in the framework of the BCS theory.

preprint2022arXiv

High fidelity generation of complex optical field through scattering medium with iterative wavefront optimization

Light scattering within scattering media presents a substantial obstacle to optical transmission. A speckle pattern with random amplitude and phase distribution is observed when coherent light travels through strong scattering media. Fortunately, wavefront shaping has been successfully employed with a spatial light modulator to recover intensity targets after scattering media, such as a sharp focus point or specified two-dimensional patterns. There have, however, been few studies that attempted to separately manipulate the amplitude and phase of the focusing field. In this paper, we propose a feedback-based wavefront shaping method to generate complex optical fields through scattering medium. A reliable phase retrieval approach is introduced to provide the complex feedback information, i.e., the amplitude and phase of the focusing field. Accordingly, in order to modulate the speckle field into a desired complex structured optical field, a multi-objective genetic algorithm is used to find the best phase map. To demonstrate the proposed method&#39;s high performance, experimental tests have been carried out. High fidelity is demonstrated in the generation of diverse complex light fields, both in amplitude and phase. Our findings may facilitate the manipulation of light field through scattering medium, and are anticipated to further promote future applications such as optogenetics, vortex optical communication, and optical trapping through scattering media.

preprint2022arXiv

High-resolution Solar Image Reconstruction Based on Non-rigid Alignment

Suppressing the interference of atmospheric turbulence and obtaining observation data with a high spatial resolution is an issue to be solved urgently for ground observations. One way to solve this problem is to perform a statistical reconstruction of short-exposure speckle images. Combining the rapidity of Shift-Add and the accuracy of speckle masking, this paper proposes a novel reconstruction algorithm-NASIR (Non-rigid Alignment based Solar Image Reconstruction). NASIR reconstructs the phase of the object image at each frequency by building a computational model between geometric distortion and intensity distribution and reconstructs the modulus of the object image on the aligned speckle images by speckle interferometry. We analyzed the performance of NASIR by using the correlation coefficient, power spectrum, and coefficient of variation of intensity profile (CVoIP) in processing data obtained by the NVST (1m New Vacuum Solar Telescope). The reconstruction experiments and analysis results show that the quality of images reconstructed by NASIR is close to speckle masking when the seeing is good, while NASIR has excellent robustness when the seeing condition becomes worse. Furthermore, NASIR reconstructs the entire field of view in parallel in one go, without phase recursion and block-by-block reconstruction, so its computation time is less than half that of speckle masking. Therefore, we consider NASIR is a robust and high-quality fast reconstruction method that can serve as an effective tool for data filtering and quick look.

preprint2022arXiv

Interpretable Low-Resource Legal Decision Making

Over the past several years, legal applications of deep learning have been on the rise. However, as with other high-stakes decision making areas, the requirement for interpretability is of crucial importance. Current models utilized by legal practitioners are more of the conventional machine learning type, wherein they are inherently interpretable, yet unable to harness the performance capabilities of data-driven deep learning models. In this work, we utilize deep learning models in the area of trademark law to shed light on the issue of likelihood of confusion between trademarks. Specifically, we introduce a model-agnostic interpretable intermediate layer, a technique which proves to be effective for legal documents. Furthermore, we utilize weakly supervised learning by means of a curriculum learning strategy, effectively demonstrating the improved performance of a deep learning model. This is in contrast to the conventional models which are only able to utilize the limited number of expensive manually-annotated samples by legal experts. Although the methods presented in this work tackles the task of risk of confusion for trademarks, it is straightforward to extend them to other fields of law, or more generally, to other similar high-stakes application scenarios.

preprint2022arXiv

Learning to Detect Open Carry and Concealed Object with 77GHz Radar

Detecting harmful carried objects plays a key role in intelligent surveillance systems and has widespread applications, for example, in airport security. In this paper, we focus on the relatively unexplored area of using low-cost 77GHz mmWave radar for the carried objects detection problem. The proposed system is capable of real-time detecting three classes of objects - laptop, phone, and knife - under open carry and concealed cases where objects are hidden with clothes or bags. This capability is achieved by the initial signal processing for localization and generating range-azimuth-elevation image cubes, followed by a deep learning-based prediction network and a multi-shot post-processing module for detecting objects. Extensive experiments for validating the system performance on detecting open carry and concealed objects have been presented with a self-built radar-camera testbed and collected dataset. Additionally, the influence of different input formats, factors, and parameters on system performance is analyzed, providing an intuitive understanding of the system. This system would be the very first baseline for other future works aiming to detect carried objects using 77GHz radar.

preprint2022arXiv

Light Nuclei Production in Au+Au Collisions at $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV from the STAR experiment

Light nuclei production is expected to be sensitive to baryon density fluctuations and can be used to probe the signatures of QCD critical point and/or a first-order phase transition in heavy-ion collisions. In this proceedings, we present the spectra and yields of protons ($p$) and light nuclei ($d$, $t$, $^{3}\mathrm{He}$, $^{4}\mathrm{He}$) in Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV by the STAR experiment. Finally, it is found that the kinetic freeze-out dynamics (temperature $T_{kin}$ $vs.$ average radial flow velocity $\langleβ_{T}\rangle$) at 3 GeV extracted with the blast-wave model deviate from the trends at high energies ($\sqrt{s_{\mathrm{NN}}}$ = 7.7 - 200 GeV), indicating a different medium equation of state.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Multiplicity of non-contractible closed geodesics on Finsler compact space forms

Let $M=S^n/ Γ$ and $h$ be a nontrivial element of finite order $p$ in $π_1(M)$, where the integer $n, p\geq2$, $Γ$ is a finite abelian group which acts freely and isometrically on the $n$-sphere and therefore $M$ is diffeomorphic to a compact space form. In this paper, we prove that for every irreversible Finsler compact space form $(M,F)$ with reversibility $λ$ and flag curvature $K$ satisfying \[ \frac{4p^2}{(p+1)^2} \big(\fracλ{λ+1} \big)^2 < K \leq 1,\;\;λ< \frac{p+1}{p-1}, \] there exist at least $n-1$ non-contractible closed geodesics of class $[h]$. In addition, if the metric $F$ is bumpy and \[ (\frac{4p}{2p+1})^2 (\fracλ{λ+1})^2 < K \leq 1,\;\;λ<\frac{2p+1}{2p-1}, \] then there exist at least $2[\frac{n+1}{2}]$ non-contractible closed geodesics of class $[h]$, which is the optimal lower bound due to Katok&#39;s example. For $C^4$-generic Finsler metrics, there are infinitely many non-contractible closed geodesics of class $[h]$ on $(M, F)$ if $\frac{λ^2}{(λ+1)^2} < K \leq 1$ with $n$ being odd, or $\frac{λ^2}{(λ+1)^2}\frac{4}{(n-1)^2} < K \leq 1$ with $n$ being even.

preprint2022arXiv

Non-autoregressive Model for Full-line Code Completion

Code completion tools are frequently used by software developers to accelerate software development by suggesting the following code elements. Completing a sequence of code tokens (e.g., a full line of code) has been proved more efficient than predicting a single token at a time. To complete the code sequence, researchers are employing AutoRegressive (AR) decoders to generate tokens in a left-to-right, token-by-token fashion. Consequently, the prediction of the next token depends on all previously generated tokens, which leads to high latency in inference. To improve the efficiency and accuracy of full-line code completion, in this paper, we propose a Non-AutoRegressive (NAR) model for code completion boosted by a syntax-aware sampling strategy. Our experimental results on two widely used datasets suggest that our model outperforms both AR and NAR baselines on full-line code completion, and it is faster than the AR model with up to 9 times speed-up.

preprint2022arXiv

RAMP-CNN: A Novel Neural Network for Enhanced Automotive Radar Object Recognition

Millimeter-wave radars are being increasingly integrated into commercial vehicles to support new advanced driver-assistance systems by enabling robust and high-performance object detection, localization, as well as recognition - a key component of new environmental perception. In this paper, we propose a novel radar multiple-perspectives convolutional neural network (RAMP-CNN) that extracts the location and class of objects based on further processing of the range-velocity-angle (RVA) heatmap sequences. To bypass the complexity of 4D convolutional neural networks (NN), we propose to combine several lower-dimension NN models within our RAMP-CNN model that nonetheless approaches the performance upper-bound with lower complexity. The extensive experiments show that the proposed RAMP-CNN model achieves better average recall and average precision than prior works in all testing scenarios. Besides, the RAMP-CNN model is validated to work robustly under nighttime, which enables low-cost radars as a potential substitute for pure optical sensing under severe conditions.

preprint2022arXiv

Rank-Enhanced Low-Dimensional Convolution Set for Hyperspectral Image Denoising

This paper tackles the challenging problem of hyperspectral (HS) image denoising. Unlike existing deep learning-based methods usually adopting complicated network architectures or empirically stacking off-the-shelf modules to pursue performance improvement, we focus on the efficient and effective feature extraction manner for capturing the high-dimensional characteristics of HS images. To be specific, based on the theoretical analysis that increasing the rank of the matrix formed by the unfolded convolutional kernels can promote feature diversity, we propose rank-enhanced low-dimensional convolution set (Re-ConvSet), which separately performs 1-D convolution along the three dimensions of an HS image side-by-side, and then aggregates the resulting spatial-spectral embeddings via a learnable compression layer. Re-ConvSet not only learns the diverse spatial-spectral features of HS images, but also reduces the parameters and complexity of the network. We then incorporate Re-ConvSet into the widely-used U-Net architecture to construct an HS image denoising method. Surprisingly, we observe such a concise framework outperforms the most recent method to a large extent in terms of quantitative metrics, visual results, and efficiency. We believe our work may shed light on deep learning-based HS image processing and analysis.

preprint2022arXiv

Refinements of Franks&#39; theorem and applications in Reeb dynamics

In this article, we give two refinements of Franks&#39; theorem: For orientation and area preserving homeomorphisms of the closed or open annulus, the existence of $k$-periodic orbits ($(k,n_0)=1$) forces the existence of infinitely many periodic orbits with periods prime to $n_0$. Moreover, if $f$ is reversible, the periodic orbits above could be symmetric. Our improvements of Franks&#39; theorem can be applied to Reeb dynamics and celestial mechanics, for example, we give some precise information about the symmetries of periodic orbits found in Hofer, Wysocki and Zehnder&#39;s dichotomy theorem when the tight 3-sphere is equipped with some additional symmetries, and also the symmetries of periodic orbits on the energy level of H$\acute{e}$non-Heiles system in celestial mechanics.

preprint2022arXiv

Syntax-Aware Network for Handwritten Mathematical Expression Recognition

Handwritten mathematical expression recognition (HMER) is a challenging task that has many potential applications. Recent methods for HMER have achieved outstanding performance with an encoder-decoder architecture. However, these methods adhere to the paradigm that the prediction is made &#34;from one character to another&#34;, which inevitably yields prediction errors due to the complicated structures of mathematical expressions or crabbed handwritings. In this paper, we propose a simple and efficient method for HMER, which is the first to incorporate syntax information into an encoder-decoder network. Specifically, we present a set of grammar rules for converting the LaTeX markup sequence of each expression into a parsing tree; then, we model the markup sequence prediction as a tree traverse process with a deep neural network. In this way, the proposed method can effectively describe the syntax context of expressions, alleviating the structure prediction errors of HMER. Experiments on three benchmark datasets demonstrate that our method achieves better recognition performance than prior arts. To further validate the effectiveness of our method, we create a large-scale dataset consisting of 100k handwritten mathematical expression images acquired from ten thousand writers. The source code, new dataset, and pre-trained models of this work will be publicly available.

preprint2022arXiv

Task-Balanced Distillation for Object Detection

Mainstream object detectors are commonly constituted of two sub-tasks, including classification and regression tasks, implemented by two parallel heads. This classic design paradigm inevitably leads to inconsistent spatial distributions between classification score and localization quality (IOU). Therefore, this paper alleviates this misalignment in the view of knowledge distillation. First, we observe that the massive teacher achieves a higher proportion of harmonious predictions than the lightweight student. Based on this intriguing observation, a novel Harmony Score (HS) is devised to estimate the alignment of classification and regression qualities. HS models the relationship between two sub-tasks and is seen as prior knowledge to promote harmonious predictions for the student. Second, this spatial misalignment will result in inharmonious region selection when distilling features. To alleviate this problem, a novel Task-decoupled Feature Distillation (TFD) is proposed by flexibly balancing the contributions of classification and regression tasks. Eventually, HD and TFD constitute the proposed method, named Task-Balanced Distillation (TBD). Extensive experiments demonstrate the considerable potential and generalization of the proposed method. Specifically, when equipped with TBD, RetinaNet with ResNet-50 achieves 41.0 mAP under the COCO benchmark, outperforming the recent FGD and FRS.

preprint2022arXiv

The Co-alignment of Winged Hα Data Observed by the New Vacuum Solar Telescop

The New Vacuum Solar Telescope (NVST) has been releasing its novel winged Ha data (WHD) since April 2021, namely the Ha imaging spectroscopic data. Compared with the prior released version, the new data are further co-aligned among the off-band images and packaged into a standard solar physics community format. In this study, we illustrate the alignment algorithm used by the novel WHD, which is mainly based on the optical flow method to obtain the translation offset between the winged images. To quantitatively evaluate the alignment results of two images with different similarities, we calculate the alignment accuracies between the images of different off-band and line center, respectively. The result shows that our alignment algorithm could reach up to the accuracy of about 0.1 &#34;when the off-band of winged image is lower than 0.6 Ȧ. In addition, we introduce the final product of the WHD in detail, which can provide convenience for the solar physicists to use high-resolution Hα imaging spectroscopic data of NVST.

preprint2022arXiv

Tick-Tock: The Imminent Merger of a Supermassive Black Hole Binary

Supermassive black hole binaries (SMBHs) are a fascinating byproduct of galaxy mergers in the hierarchical universe. In the last stage of their orbital evolution, gravitational wave radiation drives the binary inspiral and produces the loudest siren awaiting to be detected by gravitational wave observatories. Periodically varying emission from active galactic nuclei has been proposed as a powerful approach to probe such systems, although none of the identified candidates are close to their final coalescence such that the observed periods stay constant in time. In this work, we report on the first system with rapid decaying periods revealed by its optical and X-ray light curves, which has decreased from about one year to one month in three years. Together with its optical hydrogen line spectroscopy, we propose that the system is an uneven mass-ratio, highly eccentric SMBH binary which will merge within three years, as predicted by the trajectory evolution model. If the interpretation is true, coordinated, multi-band electromagnetic campaign should be planned for this first binary SMBH merger event observed in human history, together with possible neutrino measurements. Gravitational wave memory from this event may also be detectable by Pulsar Timing Array with additional five-to-ten year observation.

preprint2022arXiv

Towards Robust Graph Neural Networks for Noisy Graphs with Sparse Labels

Graph Neural Networks (GNNs) have shown their great ability in modeling graph structured data. However, real-world graphs usually contain structure noises and have limited labeled nodes. The performance of GNNs would drop significantly when trained on such graphs, which hinders the adoption of GNNs on many applications. Thus, it is important to develop noise-resistant GNNs with limited labeled nodes. However, the work on this is rather limited. Therefore, we study a novel problem of developing robust GNNs on noisy graphs with limited labeled nodes. Our analysis shows that both the noisy edges and limited labeled nodes could harm the message-passing mechanism of GNNs. To mitigate these issues, we propose a novel framework which adopts the noisy edges as supervision to learn a denoised and dense graph, which can down-weight or eliminate noisy edges and facilitate message passing of GNNs to alleviate the issue of limited labeled nodes. The generated edges are further used to regularize the predictions of unlabeled nodes with label smoothness to better train GNNs. Experimental results on real-world datasets demonstrate the robustness of the proposed framework on noisy graphs with limited labeled nodes.

preprint2022arXiv

Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection

Deep neural networks (DNNs) are threatened by adversarial examples. Adversarial detection, which distinguishes adversarial images from benign images, is fundamental for robust DNN-based services. Image transformation is one of the most effective approaches to detect adversarial examples. During the last few years, a variety of image transformations have been studied and discussed to design reliable adversarial detectors. In this paper, we systematically synthesize the recent progress on adversarial detection via image transformations with a novel classification method. Then, we conduct extensive experiments to test the detection performance of image transformations against state-of-the-art adversarial attacks. Furthermore, we reveal that each individual transformation is not capable of detecting adversarial examples in a robust way, and propose a DNN-based approach referred to as \emph{AdvJudge}, which combines scores of 9 image transformations. Without knowing which individual scores are misleading or not misleading, AdvJudge can make the right judgment, and achieve a significant improvement in detection rate. Finally, we utilize an explainable AI tool to show the contribution of each image transformation to adversarial detection. Experimental results show that the contribution of image transformations to adversarial detection is significantly different, the combination of them can significantly improve the generic detection ability against state-of-the-art adversarial attacks.

preprint2022arXiv

What Makes a Good Commit Message?

A key issue in collaborative software development is communication among developers. One modality of communication is a commit message, in which developers describe the changes they make in a repository. As such, commit messages serve as an &#34;audit trail&#34; by which developers can understand how the source code of a project has changed-and why. Hence, the quality of commit messages affects the effectiveness of communication among developers. Commit messages are often of poor quality as developers lack time and motivation to craft a good message. Several automatic approaches have been proposed to generate commit messages. However, these are based on uncurated datasets including considerable proportions of poorly phrased commit messages. In this multi-method study, we first define what constitutes a &#34;good&#34; commit message, and then establish what proportion of commit messages lack information using a sample of almost 1,600 messages from five highly active open source projects. We find that an average of circa 44% of messages could be improved, suggesting the use of uncurated datasets may be a major threat when commit message generators are trained with such data. We also observe that prior work has not considered semantics of commit messages, and there is surprisingly little guidance available for writing good commit messages. To that end, we develop a taxonomy based on recurring patterns in commit messages&#39; expressions. Finally, we investigate whether &#34;good&#34; commit messages can be automatically identified; such automation could prompt developers to write better commit messages.

preprint2021arXiv

Entanglement of two quantum memories via fibers over dozens of kilometres

Quantum internet will enable a number of revolutionary applications. It relies on entanglement of remote quantum memories over long distances. Despite enormous progresses so far, the maximal physical separation achieved between two nodes is 1.3 km, and challenges for long distance remain. Here we make a significant step forward by entangling two atomic ensembles in one lab via photon transmission through metropolitan-scale fibers. We use cavity enhancement to create bright atom-photon entanglement, and harness quantum frequency conversion to shift the atomic wavelength to telecom. We realize entanglement over 22 km field-deployed fibers via two-photon interference, and entanglement over 50 km coiled fibers via single-photon interference. Our experiment can be extended to physically separated nodes with similar distance as a functional segment for atomic quantum networks, thus paving the way towards establishing atomic entanglement over many nodes and over much longer distance.

preprint2021arXiv

Extracting Concise Bug-Fixing Patches from Human-Written Patches in Version Control Systems

High-quality and large-scale repositories of real bugs and their concise patches collected from real-world applications are critical for research in software engineering community. In such a repository, each real bug is explicitly associated with its fix. Therefore, on one side, the real bugs and their fixes} may inspire novel approaches for finding, locating, and repairing software bugs; on the other side, the real bugs and their fixes are indispensable for rigorous and meaningful evaluation of approaches for software testing, fault localization, and program repair. To this end, a number of such repositories, e.g., Defects4J, have been proposed. However, such repositories are rather small because their construction involves expensive human intervention. Although bug-fixing code commits as well as associated test cases could be retrieved from version control systems automatically, existing approaches could not yet automatically extract concise bug-fixing patches from bug-fixing commits because such commits often involve bug-irrelevant changes. In this paper, we propose an automatic approach, called BugBuilder, to extracting complete and concise bug-fixing patches from human-written patches in version control systems. It excludes refactorings by detecting refactorings involved in bug-fixing commits, and reapplying detected refactorings on the faulty version. It enumerates all subsets of the remaining part and validates them on test cases. If none of the subsets has the potential to be a complete bug-fixing patch, the remaining part as a whole is taken as a complete and concise bug-fixing patch. Evaluation results on 809 real bug-fixing commits in Defects4J suggest that BugBuilder successfully generated complete and concise bug-fixing patches for forty percent of the bug-fixing commits, and its precision (99%) was even higher than human experts.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Multiplicity and stability of closed characteristics on compact convex P-cyclic symmetric hypersurfaces in ${\bf R}^{2n}$

Let $Σ$ be a compact convex hypersurface in ${\bf R}^{2n}$ which is P-cyclic symmetric, i.e., $x\in Σ$ implies $Px\inΣ$ with P being a $2n\times2n$ symplectic orthogonal matrix and satisfying $P^k=I_{2n}$, $ker(P^l-I_{2n})=0$ for $1\leq l< k$, where $n, k\geq2$. In this paper, we prove that there exist at least $n$ geometrically distinct closed characteristics on $Σ$, which solves a longstanding conjecture about the multiplicity of closed characteristics for a broad class of compact convex hypersurfaces with symmetries(cf.,Page 235 of \cite{Eke1}). Based on the proof, we further prove that if the number of geometrically distinct closed characteristics on $Σ$ is finite, then at least $2[\frac{n}{2}]$ of them are non-hyperbolic; and if the number of geometrically distinct closed characteristics on $Σ$ is exactly $n$ and $k\geq3$, then all of them are P-cyclic symmetric, where a closed characteristic $(τ, y)$ on $Σ$ is called P-cyclic symmetric if $y({\bf R})=Py({\bf R})$.

preprint2021arXiv

Quantum mechanics of fermion confined to a curved surface in Foldy-Wouthuysen representation

In Foldy-Wouthuysen representation, we deduce the effective quantum mechanics for a particle confined to a curved surface by using the thin-layer quantization scheme. We find that the spin effect caused by confined potential as the results of relativistic correction in the non-relativistic limit. Furthermore, the spin connection appeared in curved surface which depends on curvature contributes a Zeeman-like gap in the relativistic correction term. In addition, the confined potential also induces a curvature-independent energy shift, which is from the zitterbewegung effect. As an example, we apply the effective Hamiltonian to torus surface, in which we obtain expectantly the spin effects related to confined potential. Those results directly demonstrate the scaling of the uncommutation of the non-relativistic limit and the thin-layer quantization formalism

preprint2021arXiv

RODNet: Radar Object Detection Using Cross-Modal Supervision

Radar is usually more robust than the camera in severe driving scenarios, e.g., weak/strong lighting and bad weather. However, unlike RGB images captured by a camera, the semantic information from the radar signals is noticeably difficult to extract. In this paper, we propose a deep radar object detection network (RODNet), to effectively detect objects purely from the carefully processed radar frequency data in the format of range-azimuth frequency heatmaps (RAMaps). Three different 3D autoencoder based architectures are introduced to predict object confidence distribution from each snippet of the input RAMaps. The final detection results are then calculated using our post-processing method, called location-based non-maximum suppression (L-NMS). Instead of using burdensome human-labeled ground truth, we train the RODNet using the annotations generated automatically by a novel 3D localization method using a camera-radar fusion (CRF) strategy. To train and evaluate our method, we build a new dataset -- CRUW, containing synchronized videos and RAMaps in various driving scenarios. After intensive experiments, our RODNet shows favorable object detection performance without the presence of the camera.

preprint2021arXiv

Self-supervised Symmetric Nonnegative Matrix Factorization

Symmetric nonnegative matrix factorization (SNMF) has demonstrated to be a powerful method for data clustering. However, SNMF is mathematically formulated as a non-convex optimization problem, making it sensitive to the initialization of variables. Inspired by ensemble clustering that aims to seek a better clustering result from a set of clustering results, we propose self-supervised SNMF (S$^3$NMF), which is capable of boosting clustering performance progressively by taking advantage of the sensitivity to initialization characteristic of SNMF, without relying on any additional information. Specifically, we first perform SNMF repeatedly with a random nonnegative matrix for initialization each time, leading to multiple decomposed matrices. Then, we rank the quality of the resulting matrices with adaptively learned weights, from which a new similarity matrix that is expected to be more discriminative is reconstructed for SNMF again. These two steps are iterated until the stopping criterion/maximum number of iterations is achieved. We mathematically formulate S$^3$NMF as a constraint optimization problem, and provide an alternative optimization algorithm to solve it with the theoretical convergence guaranteed. Extensive experimental results on $10$ commonly used benchmark datasets demonstrate the significant advantage of our S$^3$NMF over $12$ state-of-the-art methods in terms of $5$ quantitative metrics. The source code is publicly available at https://github.com/jyh-learning/SSSNMF.

preprint2020arXiv

A Non-Linear Magnetic Field Calibration Method for Filter-Based Magnetographs by Multilayer Perceptron

For filter-based magnetographs, the linear calibration method under the weak-field assumption is usually adopted; this leads to magnetic saturation effect in the regions with strong magnetic field. This article explores a new method to overcome the above disadvantage using a multilayer perceptron network, which we call MagMLP, based on a back-propagation algorithm with one input layer, five hidden layers, and one output layer. We use the data from the \textit{Spectropolarimeter} (SP) on board \textit{Hinode} to simulate single-wavelength observations for the model training, and take into account the influence of the Doppler velocity field and the filling factor. The training results show that the linear fitting coefficient (LFC) of the transverse field reaches above 0.91, and that of the longitudinal field is above 0.98. The generalization of the models is good because the corresponding LFCs are above 0.9 for the test subsets. Compared with the linear calibration method, the MagMLP is much more effective on dealing with the magnetic saturation effect. Analyzing an active region, the results of the linear calibration present an evident magnetic saturation effect in the umbra regions; the corresponding systematic error reaches values greater than 1000 G in most areas, or even exceeds 2000 G at some pixels. However, the results of MagMLP at these locations are very close to the inversion results, and the systematic errors are basically within 300 G. In addition, we find that there are many &#34;bright spots&#34; and &#34;dark spots&#34; on the inclination angle images from the inversion results of \textit{Hinode}/SP with values of 180 and 0 degrees, respectively, where the inversion is not reliable and does not produce a good result; the MagMLP handles these points well.

preprint2020arXiv

A nonlinear solar magnetic field calibration method for the filter-based magnetograph by the residual network

The method of solar magnetic field calibration for the filter-based magnetograph is normally the linear calibration method under weak-field approximation that cannot generate the strong magnetic field region well due to the magnetic saturation effect. We try to provide a new method to carry out the nonlinear magnetic calibration with the help of neural networks to obtain more accurate magnetic fields. We employed the data from Hinode/SP to construct a training, validation and test dataset. The narrow-band Stokes I, Q, U, and V maps at one wavelength point were selected from all the 112 wavelength points observed by SP so as to simulate the single-wavelength observations of the filter-based magnetograph. We used the residual network to model the nonlinear relationship between the Stokes maps and the vector magnetic fields. After an extensive performance analysis, it is found that the trained models could infer the longitudinal magnetic flux density, the transverse magnetic flux density, and the azimuth angle from the narrow-band Stokes maps with a precision comparable to the inversion results using 112 wavelength points. Moreover, the maps that were produced are much cleaner than the inversion results. The method can effectively overcome the magnetic saturation effect and infer the strong magnetic region much better than the linear calibration method. The residual errors of test samples to standard data are mostly about 50 G for both the longitudinal and transverse magnetic flux density. The values are about 100 G with our previous method of multilayer perceptron, indicating that the new method is more accurate in magnetic calibration.

preprint2020arXiv

An Overview on Evaluating and Predicting Scholarly Article Impact

Scholarly article impact reflects the significance of academic output recognised by academic peers, and it often plays a crucial role in assessing the scientific achievements of researchers, teams, institutions and countries. It is also used for addressing various needs in the academic and scientific arena, such as recruitment decisions, promotions, and funding allocations. This article provides a comprehensive review of recent progresses related to article impact assessment and prediction. The~review starts by sharing some insight into the article impact research and outlines current research status. Some core methods and recent progress are presented to outline how article impact metrics and prediction have evolved to consider integrating multiple networks. Key techniques, including statistical analysis, machine learning, data mining and network science, are discussed. In particular, we highlight important applications of each technique in article impact research. Subsequently, we discuss the open issues and challenges of article impact research. At the same time, this review points out some important research directions, including article impact evaluation by considering Conflict of Interest, time and location information, various distributions of scholarly entities, and rising stars.

preprint2020arXiv

Angle-Resolved Thermal Emission Spectroscopy Characterization of Non-Hermitian Meta-Crystals

We establish the angle-resolved thermal emission spectroscopy (ARTES) as a new platform to characterize the intrinsic eigenmode properties of non-Hermitian systems. This method can directly map the dispersion of meta-crystals within the light cone with a high angular resolution. To illustrate its usefulness, we demonstrate the existence of bound states in the continuum (BICs) and non-Hermitian Fermi arcs in a planar corrugated meta-crystal by measuring its angle-resolved thermal emission spectra. We show that change in the thickness of the meta-crystal can induce a band inversion between a BIC and a radiative state, and a pair of exceptional points emerge when the band inversion occurs. With this approach, the band mapping of non-Hermitian photonic systems can become a relatively straightforward task.

preprint2020arXiv

Anomalous levitation and annihilation in Floquet topological insulators

Anderson localization in two-dimensional topological insulators takes place via the so-called levitation and pair annihilation process. As disorder is increased, extended bulk states carrying opposite topological invariants move towards each other in energy, reducing the size of the topological gap, eventually meeting and localizing. This results in a topologically trivial Anderson insulator. Here, we introduce the anomalous levitation and pair annihilation, a process unique to periodically-driven, or Floquet systems. Due to the periodicity of the quasienergy spectrum, we find it is possible for the topological gap to increase as a function of disorder strength. Thus, after all bulk states have localized, the system remains topologically nontrivial, forming an anomalous Floquet Anderson insulator (AFAI) phase. We show a concrete example for this process, adding disorder via onsite potential &#34;kicks&#34; to a Chern insulator model. By changing the period between kicks, we can tune which type of (conventional or anomalous) levitation-and-annihilation occurs in the system. We expect our results to be applicable to generic Floquet topological systems and to provide an accessible way to realize AFAIs experimentally, without the need for multi-step driving schemes.

preprint2020arXiv

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

In encoder-decoder neural models, multiple encoders are in general used to represent the contextual information in addition to the individual sentence. In this paper, we investigate multi-encoder approaches in documentlevel neural machine translation (NMT). Surprisingly, we find that the context encoder does not only encode the surrounding sentences but also behaves as a noise generator. This makes us rethink the real benefits of multi-encoder in context-aware translation - some of the improvements come from robust training. We compare several methods that introduce noise and/or well-tuned dropout setup into the training of these encoders. Experimental results show that noisy training plays an important role in multi-encoder-based NMT, especially when the training data is small. Also, we establish a new state-of-the-art on IWSLT Fr-En task by careful use of noise generation and dropout methods.

preprint2020arXiv

Effective dynamics for a spin-1/2 particle constrained to a space curve in an electric and magnetic field

We consider the dynamics of a spin-1/2 particle constrained to move in an arbitrary space curve with an external electric and magnetic field applied. With the aid of gauge theory, we successfully decouple the tangential and normal dynamics and derive the effective Hamiltonian. A new type of quantum potential called SU(2) Zeeman interaction appears, which is induced by the electric field and couples spin and intrinsic orbital angular momentum. Based on the Hamiltonian, we discuss the spin precession for zero intrinsic orbital angular momentum case and the energy splitting caused by the SU(2) Zeeman interaction for a helix as examples, showing the combined effect of geometry and external field. The new interaction may bring new approaches to manipulate quantum states in spintronics.

preprint2020arXiv

Effects of resonance weak decays and hadronic re-scattering on the proton number fluctuations in Au + Au collisions at $\sqrt{s_\mathrm{NN}} = 5$ GeV from JAM model

Proton number fluctuation is sensitive observable to search for the QCD critical point in heavy-ion collisions. In this paper, we studied rapidity acceptance dependence of the proton cumulants and correlation functions in most central Au+Au collisions at $\sqrt{s_\mathrm{NN}} = 5$ GeV from a microscopic hadronic transport model (JAM). At mid-rapidity, we found the effects of resonance weak decays and hadronic re-scattering on the proton cumulants and correlation functions are small, but those effects get larger when further increasing the rapidity acceptance. On the other hand, we found the baryon number conservation is a dominant background effect on the rapidity acceptance dependence of proton number fluctuations. It leads to a strong suppression of cumulants and cumulant ratios, as well as the negative proton correlation functions. We also studied those two effects on the energy dependence of cumulant ratios of net-proton distributions in most central Au+Au collisions at $\sqrt{s_\mathrm{NN}} = 5-200$ GeV from JAM model. This work can serve as a non-critical baseline for future QCD critical point search in heavy-ion collisions at high baryon density region.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Light Nuclei Production in Au+Au Collisions at $\sqrt{s_{\mathrm{NN}}}$ = 5-200 GeV from JAM model

Light nuclei production is sensitive to the baryon density fluctuations and can be used to probe the QCD phase transition in relativistic heavy-ion collisions. In this work, we studied the production of proton, deuteron, triton in central Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 5, 7.7, 11.5, 14.5, 19.6, 27, 39, 54.4, 62.4 and 200 GeV from a transport model (JAM). Based on the coalescence production of light nuclei, we calculated the energy dependence of rapidity density $dN/dy$ and particle ratios ($d/p$, $t/p$, and $t/d$). More importantly, the yield ratio $N_{t} \times N_{p} / N_{d}^{2}$, which is sensitive to the neutron density fluctuations, shows a flat energy dependence and cannot describe the non-monotonic trend observed by the STAR experiment. Based on the nucleon coalescence, this work can provide constraint and reference to search for the QCD critical point and/or first order phase transition with light nuclei production in future heavy-ion collision experiments.

preprint2020arXiv

Mid-InfraRed Outburst in Nearby Galaxies (MIRONG) I: Sample Selection and Characterization

The optical time-domain astronomy has grown rapidly in the past decade but the dynamic infrared sky is rarely explored. Aiming to construct a sample of mid-infrared outburst in nearby galaxies (MIRONG), we have conducted a systematical search of low-redshift ($z<0.35$) SDSS spectroscopic galaxies that have experienced recent MIR flares using their Wide-field Infrared Survey Explorer (WISE) light curves. A total of 137 galaxies have been selected by requiring a brightening amplitude of 0.5 magnitude in at least one WISE band with respect to their quiescent phases. Only a small faction (10.9%) has corresponding optical flares. Except for the four supernova (SNe) in our sample, the MIR luminosity of remaining sources ($L_{\rm 4.6μm}>10^{42}~\rm erg~s^{-1}$) are markedly brighter than known SNe and their physical locations are very close to the galactic center (median <0.1&#34;). Only four galaxies are radio-loud indicating that synchrotron radiation from relativistic jets could contribute MIR variability. We propose that these MIR outburst are dominated by the dust echoes of transient accretion onto supermassive black holes, such as tidal disruption events (TDEs) and turn-on (changing-look) AGNs. Moreover, the inferred peak MIR luminosity function is generally consistent with the X-ray and optical TDEs at high end albeit with large uncertainties. Our results suggest that a large population of transients have been overlooked by optical surveys, probably due to dust obscuration or intrinsically optical weakness. Thus, a search in the infrared band is crucial for us to obtain a panoramic picture of nuclear outburst. The multiwavength follow-up observations of the MIRONG sample are in progress and will be presented in a series of subsequent papers.

preprint2020arXiv

Multi-Scale One-Class Recurrent Neural Networks for Discrete Event Sequence Anomaly Detection

Discrete event sequences are ubiquitous, such as an ordered event series of process interactions in Information and Communication Technology systems. Recent years have witnessed increasing efforts in detecting anomalies with discrete-event sequences. However, it still remains an extremely difficult task due to several intrinsic challenges including data imbalance issues, the discrete property of the events, and sequential nature of the data. To address these challenges, in this paper, we propose OC4Seq, a multi-scale one-class recurrent neural network for detecting anomalies in discrete event sequences. Specifically, OC4Seq integrates the anomaly detection objective with recurrent neural networks (RNNs) to embed the discrete event sequences into latent spaces, where anomalies can be easily detected. In addition, given that an anomalous sequence could be caused by either individual events, subsequences of events, or the whole sequence, we design a multi-scale RNN framework to capture different levels of sequential patterns simultaneously. Experimental results on three benchmark datasets show that OC4Seq consistently outperforms various representative baselines by a large margin. Moreover, through both quantitative and qualitative analysis, the importance of capturing multi-scale sequential patterns for event anomaly detection is verified.

preprint2020arXiv

Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation

This paper explores the problem of multi-view spectral clustering (MVSC) based on tensor low-rank modeling. Unlike the existing methods that all adopt an off-the-shelf tensor low-rank norm without considering the special characteristics of the tensor in MVSC, we design a novel structured tensor low-rank norm tailored to MVSC. Specifically, we explicitly impose a symmetric low-rank constraint and a structured sparse low-rank constraint on the frontal and horizontal slices of the tensor to characterize the intra-view and inter-view relationships, respectively. Moreover, the two constraints could be jointly optimized to achieve mutual refinement. On the basis of the novel tensor low-rank norm, we formulate MVSC as a convex low-rank tensor recovery problem, which is then efficiently solved with an augmented Lagrange multiplier based method iteratively. Extensive experimental results on five benchmark datasets show that the proposed method outperforms state-of-the-art methods to a significant extent. Impressively, our method is able to produce perfect clustering. In addition, the parameters of our method can be easily tuned, and the proposed model is robust to different datasets, demonstrating its potential in practice.

preprint2020arXiv

Optimization of laser dynamics for active stabilization of DF--VECSELs dedicated to cesium CPT clocks

We report the implementation and performance of a double servo-loop for intensity and phase-difference active stabilization of a dual-frequency vertical external--cavity surface-emitting laser (DF-VECSEL) for coherent population trapping (CPT) of cesium atoms in the framework of compact atomic clocks. In--phase fully correlated pumping of the two laser modes is identified as the best scheme for intensity noise reduction, and an analytical model allows the optimization of the active stabilization strategy. Optical phase-locking the beat-note to a local oscillator leads to a phase noise level below -103~dBc/Hz at 100~Hz from the carrier. The laser contribution to the short-term frequency stability of the clock is predicted to be compatible with a targeted Allan deviation below $σ_y = 5\,\times 10^{-13}$ over one second.

preprint2020arXiv

Pion quasiparticles and QCD phase transitions at finite temperature and isospin density from holography

Spectra of pions, which are known as the pseudo-Goldstone bosons of spontaneous chiral symmetry breaking, as well as their relationship with chiral phase transition and pion superfluidity phase transition, have been investigated in the framework of soft-wall AdS/QCD. In chiral limit, it is proved both numerically and analytically that pions are massless Goldstone bosons even at finite temperature, which was usually considered as an assumption in soft-wall models. Above $T_c$, at which chiral condensate $\langle \bar{q}q\rangle$ vanishes, the spectra of pions and scalar mesons merge together, showing the evidence of the restored chiral symmetry in hadronic spectrum level. Extending to finite quark mass, pion masses increase with quark mass. Further, it is more interesting to observe that the pole masses of pions decrease with temperature below $T_c$, which agrees with the analysis in Phys.Rev.Lett.88(2002)202302. Meanwhile, symmetry restoration above $T_c$ could be seen in the spectra of scalar and pseudo-scalar mesons. With finite temperature and isospin chemical potential $μ_I$, it is shown that the masses of charged pions would split. The mass of positive charged pion $π^+$ decreases almost linearly to zero when $μ_I$ grows to $μ_{I}^c$, where pion condensation starts to form. This reveals the Goldstone nature of $π^+$ after pion superfluidity transition, which are closely related to the experimental observation.

preprint2020arXiv

QCD phase diagram at finite isospin chemical potential and temperature in an IR-improved soft-wall AdS/QCD model

We study the phase transition between pion condensed phase and normal phase, as well as chiral phase transition in a two flavor($\mathcal{N}_f=2$) IR- improved soft-wall AdS/QCD model at finite isospin chemical potential $μ_I$ and temperature $T$. By self-consistently solving the equations of motion, we obtain the phase diagram in the plane of $μ_I$ and $T$. The pion condensation appears together with a massless Nambu-Goldstone boson $m_{π_1}(T_c, μ_I^c)=0$, which is very likely to be a second-order phase transition with mean-field critical exponents in small $μ_I$ region. When $T=0$, the critical isospin chemical potential approximates to vacuum pion mass $μ_I^c \approx m_0$. The pion condensed phase exists in an arched area and the boundary of the chiral crossover intersects the pion condensed phase at a tri-critical point. Qualitatively, the results are in good agreement with previous studies from Lattice simulations and model calculations.

preprint2020arXiv

Sequence to Point Learning Based on Bidirectional Dilated Residual Network for Non Intrusive Load Monitoring

Non Intrusive Load Monitoring (NILM) or Energy Disaggregation (ED), seeks to save energy by decomposing corresponding appliances power reading from an aggregate power reading of the whole house. It is a single channel blind source separation problem (SCBSS) and difficult prediction problem because it is unidentifiable. Recent research shows that deep learning has become a growing popularity for NILM problem. The ability of neural networks to extract load features is closely related to its depth. However, deep neural network is difficult to train because of exploding gradient, vanishing gradient and network degradation. To solve these problems, we propose a sequence to point learning framework based on bidirectional (non-casual) dilated convolution for NILM. To be more convincing, we compare our method with the state of art method, Seq2point (Zhang) directly and compare with existing algorithms indirectly via two same datasets and metrics. Experiments based on REDD and UK-DALE data sets show that our proposed approach is far superior to existing approaches in all appliances.

preprint2020arXiv

Simulating quantum field theory in curved spacetime with quantum many-body systems

This paper proposes a new general framework to build a one-to-one correspondence between quantum field theories in static 1+1 dimensional curved spacetime and quantum many-body systems. We show that a massless scalar field in an arbitrary 2-dimensional static spacetime is always equivalent to a site-dependent bosonic hopping model, while a massless Dirac field is equivalent to a site-dependent free Hubbard model or a site-dependent isotropic XY model. A possible experimental realization for such a correspondence in trapped ions system is suggested. As applications of the analogue gravity model, we show that they can be used to simulate Hawking radiation of black hole and to study its entanglement. We also show in the analogue model that black holes are most chaotic systems and the fastest scramblers in nature. We also offer a concrete example about how to get some insights about quantum many-body systems from back hole physics.

preprint2020arXiv

Spin polarization of electrons through corrugated surface in magnetic field

Noninteracting electrons confined to a corrugated surface are investigated in magnetic field, and the associated effective Pauli equation is given analytically by the thin-layer quantization scheme. Interestingly, the Zeeman splitting gaps can be adjusted by curvature, and there is a geometric potential induced by curvature. Further, we discuss the spin-dependent transport properties for confined electrons by numerical calculation. More interestingly, we find that the spin polarization induced by curvature becomes substantial when the incident energy has small value. The results are considerable for a spin transistor with small spin current.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020)

We present a new video understanding pentathlon challenge, an open competition held in conjunction with the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020. The objective of the challenge was to explore and evaluate new methods for text-to-video retrieval-the task of searching for content within a corpus of videos using natural language queries. This report summarizes the results of the first edition of the challenge together with the findings of the participants.

preprint2020arXiv

Whole-Chain Recommendations

With the recent prevalence of Reinforcement Learning (RL), there have been tremendous interests in developing RL-based recommender systems. In practical recommendation sessions, users will sequentially access multiple scenarios, such as the entrance pages and the item detail pages, and each scenario has its specific characteristics. However, the majority of existing RL-based recommender systems focus on optimizing one strategy for all scenarios or separately optimizing each strategy, which could lead to sub-optimal overall performance. In this paper, we study the recommendation problem with multiple (consecutive) scenarios, i.e., whole-chain recommendations. We propose a multi-agent RL-based approach (DeepChain), which can capture the sequential correlation among different scenarios and jointly optimize multiple recommendation strategies. To be specific, all recommender agents (RAs) share the same memory of users&#39; historical behaviors, and they work collaboratively to maximize the overall reward of a session. Note that optimizing multiple recommendation strategies jointly faces two challenges in the existing model-free RL model - (i) it requires huge amounts of user behavior data, and (ii) the distribution of reward (users&#39; feedback) are extremely unbalanced. In this paper, we introduce model-based RL techniques to reduce the training data requirement and execute more accurate strategy updates. The experimental results based on a real e-commerce platform demonstrate the effectiveness of the proposed framework.

preprint2020arXiv

Yet Meta Learning Can Adapt Fast, It Can Also Break Easily

Meta learning algorithms have been widely applied in many tasks for efficient learning, such as few-shot image classification and fast reinforcement learning. During meta training, the meta learner develops a common learning strategy, or experience, from a variety of learning tasks. Therefore, during meta test, the meta learner can use the learned strategy to quickly adapt to new tasks even with a few training samples. However, there is still a dark side about meta learning in terms of reliability and robustness. In particular, is meta learning vulnerable to adversarial attacks? In other words, would a well-trained meta learner utilize its learned experience to build wrong or likely useless knowledge, if an adversary unnoticeably manipulates the given training set? Without the understanding of this problem, it is extremely risky to apply meta learning in safety-critical applications. Thus, in this paper, we perform the initial study about adversarial attacks on meta learning under the few-shot classification problem. In particular, we formally define key elements of adversarial attacks unique to meta learning and propose the first attacking algorithm against meta learning under various settings. We evaluate the effectiveness of the proposed attacking strategy as well as the robustness of several representative meta learning algorithms. Experimental results demonstrate that the proposed attacking strategy can easily break the meta learner and meta learning is vulnerable to adversarial attacks. The implementation of the proposed framework will be released upon the acceptance of this paper.

preprint2019arXiv

An Experimentally Verified Approach to non-Entanglement-Breaking Channel Certification

Ensuring the non-entanglement-breaking (non-EB) property of quantum channels is crucial for the effective distribution and storage of quantum states. However, a practical method for direct and accurate certification of the non-EB feature is highly desirable. Here, we propose and verify a realistic source based measurement device independent certification of non-EB channels. Our method is resilient to repercussions on the certification from experimental conditions, such as multiphotons and imperfect state preparation, and can be implemented with information incomplete set. We achieve good agreement between experimental outcomes and theoretical predictions, which is validated by the expected results of the ideal semi-quantum signaling game, and accurately certify the non-EB channels. Furthermore, our approach is highly robust to effects from noise. Therefore, the proposed approach can be expected to play a significant role in the design and evaluation of realistic quantum channels.

preprint2019arXiv

Continuum Limit Matrix Elements for the Tonks-Girardeau Ground State

The Tonks-Girardeau model is a quantum mechanical model of N impenetrable bosons in 1+1 dimensions. A Vandermonde determinant provides the exact N-particle wave function of the ground state, or equivalently the matrix elements with respect to position eigenstates. We consider the large $N$ limit of these matrix elements. We present a binning prescription which calculates the leading terms of the matrix elements in a time which is independent of $N$, and so is suitable for this limit. In this sense, it allows one to solve for the ground state of a strongly coupled continuum quantum field theory in the field eigenstate basis. As examples, we calculate the matrix elements with respect to states with uniform density and also states consisting of two regions with distinct densities.

preprint2019arXiv

Exceptional Cones in 4D Parameter Space

The notion of synthetic dimensions has expanded the realm of topological physics to four dimensional (4D) space lately. In this work, non-Hermiticity is used as a synthetic parameter in PT-symmetric photonic crystals to study the topological physics in 4D non-Hermitian synthetic parameter space. We realize a 3D exceptional hypersurface (EHS) in such 4D parameter space, and the degeneracy points emerge due to the symmetry of synthetic parameters. We further demonstrate the existence of exceptional degenerate points (EDPs) on the EHS that originates from the chirality of exceptional points (EPs), and the exceptional surface near EDPs behaves like a Dirac cone. We further show that a very narrow reflection plateau can be found near these EDPs, and their sensitivity towards the PT-symmetry breaking environmental perturbation can make these degeneracy points useful in optical sensing and many other nonlinear and quantum optical applications.