Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
91works
0followers
23topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

91 published item(s)

preprint2026arXiv

EmoMM: Benchmarking and Steering MLLM for Multimodal Emotion Recognition under Conflict and Missingness

Multimodal Emotion Recognition (MER) is critical for interpreting real-world interactions. While Multimodal Large Language Models (MLLM) have shown promise in MER, their internal decision-making mechanisms under modality conflict and missingness remain largely underexplored. In this paper, to systematically investigate these behaviors, we introduce EmoMM, a comprehensive benchmark featuring modality-aligned, conflict, and missing subsets. Through extensive evaluation, we uncover a Video Contribution Collapse (VCC) phenomenon, where MLLM marginalize video evidence due to high token redundancy and modality preferences. To address this, we propose Conflict-aware Head-level Attention Steering (CHASE), a lightweight mechanism that detects modality conflicts and performs inference-time attention steering, effectively mitigating decision bias without retraining the backbone. Experimental results demonstrate that CHASE consistently improves performance across various settings, significantly enhancing the reliability of MLLM in complex affective scenarios.

preprint2025arXiv

Adapting In-Domain Few-Shot Segmentation to New Domains without Source Domain Retraining

Cross-domain few-shot segmentation (CD-FSS) aims to segment objects of novel classes in new domains, which is often challenging due to the diverse characteristics of target domains and the limited availability of support data. Most CD-FSS methods redesign and retrain in-domain FSS models using abundant base data from the source domain, which are effective but costly to train. To address these issues, we propose adapting informative model structures of the well-trained FSS model for target domains by learning domain characteristics from few-shot labeled support samples during inference, thereby eliminating the need for source domain retraining. Specifically, we first adaptively identify domain-specific model structures by measuring parameter importance using a novel structure Fisher score in a data-dependent manner. Then, we progressively train the selected informative model structures with hierarchically constructed training samples, progressing from fewer to more support shots. The resulting Informative Structure Adaptation (ISA) method effectively addresses domain shifts and equips existing well-trained in-domain FSS models with flexible adaptation capabilities for new domains, eliminating the need to redesign or retrain CD-FSS models on base data. Extensive experiments validate the effectiveness of our method, demonstrating superior performance across multiple CD-FSS benchmarks. Codes are at https://github.com/fanq15/ISA.

preprint2025arXiv

Atomic-scale spin sensing of a 2D $d$-wave altermagnet via helical tunneling

Altermagnetism simultaneously possesses nonrelativistic spin responses and zero net magnetization, thus combining advantages of ferromagnetism and antiferromagnetism. This superiority originates from its unique dual feature, i.e., opposite-magnetic sublattices in real space and alternating spin polarization in momentum space enforced by the same crystal symmetry. Therefore, the determination of an altermagnetic order and its unique spin response inherently necessitates atomic-scale spin-resolved measurements in real and momentum spaces, an experimental milestone yet to be achieved. Here, via utilizing the helical edge (hinge) modes of a higher order topological insulator as the spin sensor, we realize spin-resolved scanning tunneling microscopy which enables us to pin down the dual-space feature of a layered $d$-wave altermagnet, KV$_2$Se$_2$O. In real space, atomic-registered mapping demonstrates the checkerboard antiferromagnetic order together with density-wave lattice modulation, and in momentum space, spin-resolved spectroscopic imaging provides a direct visualization of d-wave spin splitting of the band structure. Critically, using this new topology-guaranteed spin filter we directly reveal the unidirectional, spin-polarized quasiparticle excitations originating from the crystal symmetry-paired X and Y valleys around opposite magnetic sublattices simultaneously --the unique spin response for $d$-wave altermagnetism. Our experiments establish a solid basis for the exploration and utilization of altermagnetism in layered materials and further facilitate access to atomic-scale spin sensing and manipulating of 2D quantum materials.

preprint2025arXiv

FUSE-RSVLM: Feature Fusion Vision-Language Model for Remote Sensing

Large vision-language models (VLMs) exhibit strong performance across various tasks. However, these VLMs encounter significant challenges when applied to the remote sensing domain due to the inherent differences between remote sensing images and natural images. Existing remote sensing VLMs often fail to extract fine-grained visual features and suffer from visual forgetting during deep language processing. To address this, we introduce MF-RSVLM, a Multi-Feature Fusion Remote Sensing Vision--Language Model that effectively extracts and fuses visual features for RS understanding. MF-RSVLM learns multi-scale visual representations and combines global context with local details, improving the capture of small and complex structures in RS scenes. A recurrent visual feature injection scheme ensures the language model remains grounded in visual evidence and reduces visual forgetting during generation. Extensive experiments on diverse RS benchmarks show that MF-RSVLM achieves state-of-the-art or highly competitive performance across remote sensing classification, image captioning, and VQA tasks. Our code is publicly available at https://github.com/Yunkaidang/RSVLM.

preprint2024arXiv

Investigation of the $ΔI = 1/2$ rule and test of CP violation through the measurement of decay asymmetry parameters in $Ξ^-$ decays

Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by factors of 4 and 1.7, respectively. The ratio of decay asymmetry parameters of $Λ\to nπ^0$ to that of $Λ\to pπ^-$, $\langle α_0 \rangle/ \langle α_{Λ-} \rangle $, is determined to be $ 0.873 \pm 0.012^{+0.011}_{-0.010}$, where the first and the second uncertainties are statistical and systematic, respectively. The ratio is smaller than unity more than $5σ$, which signifies the existence of the $ΔI = 3/2$ transition in $Λ$ for the first time. Beside, we test for CP violation in $Ξ^- \to Λπ^-$ and in $Λ\to n π^{0}$ with the best precision to date.

preprint2023arXiv

Search for hidden-charm tetraquark with strangeness in $e^{+}e^{-}\rightarrow K^+ D_{s}^{*-} D^{*0}+c.c.$

We report a search for a heavier partner of the recently observed $Z_{cs}(3985)^{-}$ state, denoted as $Z_{cs}^{\prime -}$, in the process $e^{+} e^{-}\rightarrow K^{+}D_{s}^{*-}D^{* 0}+c.c.$, based on $e^+e^-$ collision data collected at the center-of-mass energies of $\sqrt{s}=4.661$, 4.682 and 4.699 GeV with the BESIII detector. The $Z_{cs}^{\prime -}$ is of interest as it is expected to be a candidate for a hidden-charm and open-strange tetraquark. A partial-reconstruction technique is used to isolate $K^+$ recoil-mass spectra, which are probed for a potential contribution from $Z_{cs}^{\prime -}\to D_{s}^{*-}D^{* 0}$ ($c.c.$). We find an excess of $Z_{cs}^{\prime -}\rightarrow D_{s}^{*-}D^{*0}$ ($c.c.$) candidates with a significance of $2.1σ$, after considering systematic uncertainties, at a mass of $(4123.5\pm0.7_\mathrm{stat.}\pm4.7_\mathrm{syst.})\ \mathrm{MeV}/c^{2}$. As the data set is limited in size, the upper limits are evaluated at the 90\% confidence level on the product of the Born cross sections ($σ^{\mathrm{Born}}$) and the branching fraction ($\mathcal{B}$) of $Z_{cs}^{\prime-}\rightarrow D_{s}^{*-}D^{* 0}$, under different assumptions of the $Z_{cs}^{\prime -}$ mass from 4.120 to 4.140 MeV and of the width from 10 to 50 MeV at the three center-of-mass energies. The upper limits of $σ^{\rm Born}\cdot\mathcal{B}$ are found to be at the level of $\mathcal{O}(1)$ pb at each energy. Larger data samples are needed to confirm the $Z_{cs}^{\prime -}$ state and clarify its nature in the coming years.

preprint2022arXiv

$Λ_{c}^{+}$ decays at BESIII

BESIII has made great progress in taking data which is the largest data samples near the $Λ_{c}^{+}\barΛ_{c}^{-}$ threshold. We have measured the branching fraction of $Λ_{c}^{+}\to nπ^{+}$ to be $(6.6\pm 1.2_{\mathrm{stat.}}\pm 0.4_{\mathrm{syst.}})\times 10^{-4}$ with the statistical significance of 7.3 $σ$ firstly using $3.9\mathrm{fb}^{-1}$ $e^{+}e^{-}$ collision collected with BESIII detector at six center-of-mass energies between 4.612 and 4.699 GeV. With the result of the branching fraction of $Λ_{c}^{+}\to pπ^{0}$ from Belle, the ratio of the branching fractions between $Λ_{c}^{+}\to nπ^{+}$ and $Λ_{c}^{+}\to pπ^{0}$ is measured to be larger than 7.2 at 90% confidence level. The branching fractions of $Λ_{c}^{+}\to Λπ^{+}$ and $Λ_{c}^{+}\to Σ^{0} π^{+}$ are measured to be $(1.31\pm 0.08_{\mathrm{stat.}}\pm 0.05_{\mathrm{syst.}})\times 10^{-2}$ and $(1.22\pm 0.08_{\mathrm{stat.}}\pm 0.07_{\mathrm{syst.}})\times 10^{-2}$, respectively, which are consistent with previous BESIII results. More results of $Λ_{c}^{+}$ decays will be published with better precision this year using $6.4\mathrm{fb}^{-1}$ $e^{+}e^{-}$ collision data samples between 4.600 and 4.946 GeV.

preprint2022arXiv

Amplitude analysis and branching fraction measurement of the decay $D_{s}^{+} \to K^+π^+π^-$

Using $6.32$ fb$^{-1}$ of $e^{+}e^{-}$ collision data collected at the center-of-mass energies between 4.178 and 4.226 GeV with the BESIII detector, we perform an amplitude analysis of the decay $D^+_s \to K^+π^+π^-$ and determine the amplitudes of the various intermediate states. The absolute branching fraction of $D^+_s\to K^+π^+π^-$ is measured to be ($6.11\pm0.18_{\rm stat.}\pm0.11_{\rm syst.})\times 10^{-3}$. The branching fractions of the dominant intermediate processes $D_{s}^{+} \to K^+ρ^0, ρ^0 \to π^+π^-$ and $D_{s}^{+} \to K^*(892)^0π^+, K^*(892)^0 \to K^+π^-$ are determined to be $(1.96\pm0.19_{\rm stat.}\pm0.23_{\rm syst.})\times 10^{-3}$ and $(1.85\pm0.12_{\rm stat.}\pm0.13_{\rm syst.})\times 10^{-3}$, respectively. The intermediate resonances $f_0(500)$, $f_0(980)$, and $f_0(1370)$ are observed for the first time in this channel.

preprint2022arXiv

Amplitude analysis and branching-fraction measurement of $D_{s}^{+} \to π^{+}π^{0}η^{\prime}$

Using data collected with the BESIII detector in $e^+e^-$ collisions at center-of-mass energies between 4.178 and 4.226 GeV and corresponding to 6.32~fb$^{-1}$ of integrated luminosity, we report the amplitude analysis and branching-fraction measurement of the $D^+_s \to π^+ π^0 η^{\prime}$ decay. We find that the dominant intermediate process is $D^+_s \toρ^+ η^{\prime}$ and the significances of other resonant and nonresonant processes are all less than $3σ$. The upper limits on the branching fractions of $S$-wave and $P$-wave nonresonant components are set to $0.10\%$ and $0.74\%$ at the $90\%$ confidence level, respectively. In addition, the branching fraction of the $D^+_s \to π^+ π^0 η^{\prime}$ decay is measured to be $(6.15\pm0.25(\rm stat.)\pm0.18(\rm syst.))\%$, which receives significant contribution only from $D_s^+\to ρ^+η^{\prime}$ according to the amplitude analysis.

preprint2022arXiv

Ask to Understand: Question Generation for Multi-hop Question Answering

Multi-hop Question Answering (QA) requires the machine to answer complex questions by finding scattering clues and reasoning from multiple documents. Graph Network (GN) and Question Decomposition (QD) are two common approaches at present. The former uses the "black-box" reasoning process to capture the potential relationship between entities and sentences, thus achieving good performance. At the same time, the latter provides a clear reasoning logical route by decomposing multi-hop questions into simple single-hop sub-questions. In this paper, we propose a novel method to complete multi-hop QA from the perspective of Question Generation (QG). Specifically, we carefully design an end-to-end QG module on the basis of a classical QA module, which could help the model understand the context by asking inherently logical sub-questions, thus inheriting interpretability from the QD-based method and showing superior performance. Experiments on the HotpotQA dataset demonstrate that the effectiveness of our proposed QG module, human evaluation further clarifies its interpretability quantitatively, and thorough analysis shows that the QG module could generate better sub-questions than QD methods in terms of fluency, consistency, and diversity.

preprint2022arXiv

Better Pseudo-label: Joint Domain-aware Label and Dual-classifier for Semi-supervised Domain Generalization

With the goal of directly generalizing trained model to unseen target domains, domain generalization (DG), a newly proposed learning paradigm, has attracted considerable attention. Previous DG models usually require a sufficient quantity of annotated samples from observed source domains during training. In this paper, we relax this requirement about full annotation and investigate semi-supervised domain generalization (SSDG) where only one source domain is fully annotated along with the other domains totally unlabeled in the training process. With the challenges of tackling the domain gap between observed source domains and predicting unseen target domains, we propose a novel deep framework via joint domain-aware labels and dual-classifier to produce high-quality pseudo-labels. Concretely, to predict accurate pseudo-labels under domain shift, a domain-aware pseudo-labeling module is developed. Also, considering inconsistent goals between generalization and pseudo-labeling: former prevents overfitting on all source domains while latter might overfit the unlabeled source domains for high accuracy, we employ a dual-classifier to independently perform pseudo-labeling and domain generalization in the training process. When accurate pseudo-labels are generated for unlabeled source domains, the domain mixup operation is applied to augment new domains between labeled and unlabeled domains, which is beneficial for boosting the generalization capability of the model. Extensive results on publicly available DG benchmark datasets show the efficacy of our proposed SSDG method.

preprint2022arXiv

BFRnet: A deep learning-based MR background field removal method for QSM of the brain containing significant pathological susceptibility sources

Introduction: Background field removal (BFR) is a critical step required for successful quantitative susceptibility mapping (QSM). However, eliminating the background field in brains containing significant susceptibility sources, such as intracranial hemorrhages, is challenging due to the relatively large scale of the field induced by these pathological susceptibility sources. Method: This study proposes a new deep learning-based method, BFRnet, to remove background field in healthy and hemorrhagic subjects. The network is built with the dual-frequency octave convolutions on the U-net architecture, trained with synthetic field maps containing significant susceptibility sources. The BFRnet method is compared with three conventional BFR methods and one previous deep learning method using simulated and in vivo brains from 4 healthy and 2 hemorrhagic subjects. Robustness against acquisition field-of-view (FOV) orientation and brain masking are also investigated. Results: For both simulation and in vivo experiments, BFRnet led to the best visually appealing results in the local field and QSM results with the minimum contrast loss and the most accurate hemorrhage susceptibility measurements among all five methods. In addition, BFRnet produced the most consistent local field and susceptibility maps between different sizes of brain masks, while conventional methods depend drastically on precise brain extraction and further brain edge erosions. It is also observed that BFRnet performed the best among all BFR methods for acquisition FOVs oblique to the main magnetic field. Conclusion: The proposed BFRnet improved the accuracy of local field reconstruction in the hemorrhagic subjects compared with conventional BFR algorithms. The BFRnet method was effective for acquisitions of titled orientations and retained whole brains without edge erosion as often required by traditional BFR methods.

preprint2022arXiv

Cross section measurements of the processes $e^+e^- \rightarrow ωπ^{0}$ and $ωη$ at center-of-mass energies between 3.773 and 4.701 GeV

The Born cross sections of the processes $e^+e^- \rightarrow ωπ^{0}$ and $e^+e^- \rightarrow ωη$ are measured at center-of-mass energies between 3.773 and 4.701 GeV using a total integrated luminosity of 22.7 fb$^{-1}$ collected with the BESIII detector operating at the BEPCII collider. A simple $s^{-n}$ dependence for the continuum process can describe the measured Born cross sections. No significant contributions from the $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, $Y(4660)$ resonances are found, which indicates relative small branching fractions for these resonances into the $ωπ^{0}$ and $ωη$ final states.

preprint2022arXiv

CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation

Many recent approaches in contrastive learning have worked to close the gap between pretraining on iconic images like ImageNet and pretraining on complex scenes like COCO. This gap exists largely because commonly used random crop augmentations obtain semantically inconsistent content in crowded scene images of diverse objects. Previous works use preprocessing pipelines to localize salient objects for improved cropping, but an end-to-end solution is still elusive. In this work, we propose a framework which accomplishes this goal via joint learning of representations and segmentation. We leverage segmentation masks to train a model with a mask-dependent contrastive loss, and use the partially trained model to bootstrap better masks. By iterating between these two components, we ground the contrastive updates in segmentation information, and simultaneously improve segmentation throughout pretraining. Experiments show our representations transfer robustly to downstream tasks in classification, detection and segmentation.

preprint2022arXiv

Electric Field Influenced Coordinate Jump of the Guiding Center and Magnetotransport

We derived an electrical current formula in the presence of a strong out of plane magnetic field and an in plane electric field, and within two dimensional disordered systems. This current is originated from the guiding center coordinate jump. At a strong magnetic field regime, the current can be pictured as the migration of the center coordinates. During the electron impurity scattering, the guiding centers suddenly shift their coordinates. Because of the electric field, the coordinate shift accumulatively contributes to a longitudinal current. During the scattering, the value of cyclotron radius changes, which compensates for the change of the electric potential energy during the coordinate jump. The diversion of cyclotron radius is the classical manifestation of electric field dependent broadening and shifting of the Landau levels.

preprint2022arXiv

EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer

Most existing methods view makeup transfer as transferring color distributions of different facial regions and ignore details such as eye shadows and blushes. Besides, they only achieve controllable transfer within predefined fixed regions. This paper emphasizes the transfer of makeup details and steps towards more flexible controls. To this end, we propose Exquisite and locally editable GAN for makeup transfer (EleGANt). It encodes facial attributes into pyramidal feature maps to preserves high-frequency information. It uses attention to extract makeup features from the reference and adapt them to the source face, and we introduce a novel Sow-Attention Module that applies attention within shifted overlapped windows to reduce the computational cost. Moreover, EleGANt is the first to achieve customized local editing within arbitrary areas by corresponding editing on the feature maps. Extensive experiments demonstrate that EleGANt generates realistic makeup faces with exquisite details and achieves state-of-the-art performance. The code is available at https://github.com/Chenyu-Yang-2000/EleGANt.

preprint2022arXiv

Feature-based Style Randomization for Domain Generalization

As a recent noticeable topic, domain generalization (DG) aims to first learn a generic model on multiple source domains and then directly generalize to an arbitrary unseen target domain without any additional adaption. In previous DG models, by generating virtual data to supplement observed source domains, the data augmentation based methods have shown its effectiveness. To simulate the possible unseen domains, most of them enrich the diversity of original data via image-level style transformation. However, we argue that the potential styles are hard to be exhaustively illustrated and fully augmented due to the limited referred styles, leading the diversity could not be always guaranteed. Unlike image-level augmentation, we in this paper develop a simple yet effective feature-based style randomization module to achieve feature-level augmentation, which can produce random styles via integrating random noise into the original style. Compared with existing image-level augmentation, our feature-level augmentation favors a more goal-oriented and sample-diverse way. Furthermore, to sufficiently explore the efficacy of the proposed module, we design a novel progressive training strategy to enable all parameters of the network to be fully trained. Extensive experiments on three standard benchmark datasets, i.e., PACS, VLCS and Office-Home, highlight the superiority of our method compared to the state-of-the-art methods.

preprint2022arXiv

Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming

Across applications spanning supervised classification and sequential control, deep learning has been reported to find "shortcut" solutions that fail catastrophically under minor changes in the data distribution. In this paper, we show empirically that DNNs can be coaxed to avoid poor shortcuts by providing an additional "priming" feature computed from key input features, usually a coarse output estimate. Priming relies on approximate domain knowledge of these task-relevant key input features, which is often easy to obtain in practical settings. For example, one might prioritize recent frames over past frames in a video input for visual imitation learning, or salient foreground over background pixels for image classification. On NICO image classification, MuJoCo continuous control, and CARLA autonomous driving, our priming strategy works significantly better than several popular state-of-the-art approaches for feature selection and data augmentation. We connect these empirical findings to recent theoretical results on DNN optimization, and argue theoretically that priming distracts the optimizer away from poor shortcuts by creating better, simpler shortcuts.

preprint2022arXiv

First Observation of the Semileptonic Decay $Λ_c^+\rightarrow pK^- e^+ν_e$

Using $4.5~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data samples collected at the center-of-mass energies ranging from 4.600~GeV to 4.699~GeV with the BESIII detector at the BEPCII collider, a first study of the semileptonic decays $Λ_c^+\rightarrow pK^-e^+ν_e$, $Λ_c^+\rightarrow Λ(1520) e^+ν_e$ and $Λ_c^+\rightarrow Λ(1405) e^+ν_e$ is performed. The $Λ_c^+\rightarrow pK^-e^+ν_e$ decay is observed with a significance of $8.2σ$ and the branching fraction is measured to be $\mathcal{B}(Λ_c^+\rightarrow pK^- e^+ν_e)=(0.88\pm0.17_{\rm stat.}\pm0.07_{\rm syst.})\times 10^{-3}$. We also report evidence of $Λ_c^+\rightarrow Λ(1520)e^+ν_e$ and $Λ_c^+\rightarrow Λ(1405)e^+ν_e$ with significances of $3.3σ$ and $3.2σ$, respectively, and measure $\mathcal B(Λ^+_c\rightarrow Λ(1520)e^+ν_e)=(1.02\pm0.52_{\rm stat.}\pm0.11_{\rm syst.})\times10^{-3}$ and $\mathcal B(Λ^+_c\rightarrow Λ(1405)[\rightarrow pK^-]e^+ν_e)=(0.42\pm0.19_{\rm stat.}\pm0.04_{\rm syst.})\times10^{-3}$. Combining these with the inclusive semileptonic $Λ_c^+$ branching fraction measured by BESIII, the relative fraction is determined to be $[\mathcal{B}(Λ_c^+\rightarrow pK^-e^+ν_e)/\mathcal{B}(Λ_c^+\rightarrow X e^+ν_e)]=(2.1\pm0.4_{\rm stat.}\pm0.2_{\rm syst.})\%$, which provides a clear confirmation that semileptonic $Λ_c^+$ decays are not saturated by the $Λ\ell^+ν_{\ell}$ final state.

preprint2022arXiv

Giant second-order nonlinearity in twisted bilayer graphene

In the second-order response regime, the Hall voltage can be nonzero without breaking the time-reversal symmetry, as long as the system is noncentrosymmetric. There are multiple mechanisms with different scaling rules that contribute to the nonlinear Hall effect (NLHE). The intrinsic contribution is closely related to the Berry curvature dipole and has been extensively investigated recently. The study of the extrinsic contribution, however, is scarce, although it can enter the NLHE even in the leading order. Here, we report a giant nonlinear transport response in TBG, in which the intrinsic mechanism is forbidden. The magnitude and direction of the second-order nonlinearity can be effectively tuned by the gate voltage. The peak value of the second-order Hall conductivity close to the full filling of the moiré band reaches 8.76 $μmSV^{-1}$, four-order larger than those detected in $WTe_2$. The observed giant second-order nonlinearity can be understood from the collaboration of the asymmetric scattering of electrons off the static (Coulomb impurities) and dynamic disorders (phonons) in noncentrosymmetric crystals. It is mainly determined by the skew-scattering contribution from impurities at 1.7 K. The skew-scattering from phonons has a much larger coupling coefficient as suggested by the scaling results, and becomes as important as the impurity contribution as the temperature rises. Our observations demonstrate the potential of TBG in studying nonlinear response and possible rectification applications.

preprint2022arXiv

Instant tissue field and magnetic susceptibility mapping from MR raw phase using Laplacian enabled deep neural networks

Quantitative susceptibility mapping (QSM) is a valuable MRI post-processing technique that quantifies the magnetic susceptibility of body tissue from phase data. However, the traditional QSM reconstruction pipeline involves multiple non-trivial steps, including phase unwrapping, background field removal, and dipole inversion. These intermediate steps not only increase the reconstruction time but amplify noise and errors. This study develops a large-stencil Laplacian preprocessed deep learning-based neural network for near instant quantitative field and susceptibility mapping (i.e., iQFM and iQSM) from raw MR phase data. The proposed iQFM and iQSM methods were compared with established reconstruction pipelines on simulated and in vivo datasets. In addition, experiments on patients with intracranial hemorrhage and multiple sclerosis were also performed to test the generalization of the novel neural networks. The proposed iQFM and iQSM methods yielded comparable results to multi-step methods in healthy subjects while dramatically improving reconstruction accuracies on intracranial hemorrhages with large susceptibilities. The reconstruction time was also substantially shortened from minutes using multi-step methods to only 30 milliseconds using the trained iQFM and iQSM neural networks.

preprint2022arXiv

Keeping Minimal Experience to Achieve Efficient Interpretable Policy Distillation

Although deep reinforcement learning has become a universal solution for complex control tasks, its real-world applicability is still limited because lacking security guarantees for policies. To address this problem, we propose Boundary Characterization via the Minimum Experience Retention (BCMER), an end-to-end Interpretable Policy Distillation (IPD) framework. Unlike previous IPD approaches, BCMER distinguishes the importance of experiences and keeps a minimal but critical experience pool with almost no loss of policy similarity. Specifically, the proposed BCMER contains two basic steps. Firstly, we propose a novel multidimensional hyperspheres intersection (MHI) approach to divide experience points into boundary points and internal points, and reserve the crucial boundary points. Secondly, we develop a nearest-neighbor-based model to generate robust and interpretable decision rules based on the boundary points. Extensive experiments show that the proposed BCMER is able to reduce the amount of experience to 1.4%~19.1% (when the count of the naive experiences is 10k) and maintain high IPD performance. In general, the proposed BCMER is more suitable for the experience storage limited regime because it discovers the critical experience and eliminates redundant experience.

preprint2022arXiv

LibFewShot: A Comprehensive Library for Few-shot Learning

Few-shot learning, especially few-shot image classification, has received increasing attention and witnessed significant advances in recent years. Some recent studies implicitly show that many generic techniques or ``tricks'', such as data augmentation, pre-training, knowledge distillation, and self-supervision, may greatly boost the performance of a few-shot learning method. Moreover, different works may employ different software platforms, backbone architectures and input image sizes, making fair comparisons difficult and practitioners struggle with reproducibility. To address these situations, we propose a comprehensive library for few-shot learning (LibFewShot) by re-implementing eighteen state-of-the-art few-shot learning methods in a unified framework with the same single codebase in PyTorch. Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmarks with various backbone architectures to evaluate common pitfalls and effects of different training tricks. In addition, with respect to the recent doubts on the necessity of meta- or episodic-training mechanism, our evaluation results confirm that such a mechanism is still necessary especially when combined with pre-training. We hope our work can not only lower the barriers for beginners to enter the area of few-shot learning but also elucidate the effects of nontrivial tricks to facilitate intrinsic research on few-shot learning. The source code is available from https://github.com/RL-VIG/LibFewShot.

preprint2022arXiv

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using any human data. Although such agents can be obtained through self-play training, they can suffer significantly from distributional shift when paired with unencountered partners, such as humans. To mitigate this distributional shift, we propose Maximum Entropy Population-based training (MEP). In MEP, agents in the population are trained with our derived Population Entropy bonus to promote both pairwise diversity between agents and individual diversity of agents themselves, and a common best agent is trained by paring with agents in this diversified population via prioritized sampling. The prioritization is dynamically adjusted based on the training progress. We demonstrate the effectiveness of our method MEP, with comparison to Self-Play PPO (SP), Population-Based Training (PBT), Trajectory Diversity (TrajeDi), and Fictitious Co-Play (FCP) in the Overcooked game environment, with partners being human proxy models and real humans. A supplementary video showing experimental results is available at https://youtu.be/Xh-FKD0AAKE.

preprint2022arXiv

Measurement of $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ cross section and observation of a resonant structure

Based on $e^{+}e^{-}$ collision data collected by the BESIII detector at the BEPCII collider at center-of-mass energies from 2.000 to 3.080 GeV, a partial-wave analysis is performed for the process $e^{+}e^{-} \to K^{+}K^{-}π^{0}$. The Born cross section of the process $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ and its subprocesses $e^{+}e^{-} \to ϕπ^{0}$, $K^{*}(892)K$ and $K^{*}_{2}(1430)K$ are measured. The results for $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ and $ϕπ^{0}$ are consistent with the BaBar measurements and with improved precision. By analyzing the cross section, of the subprocesses $e^{+}e^{-} \to$ $K^{*}(892)K$ and $K^{*}_{2}(1430)K$, a structure with mass $M_R$ = (2208 $\pm$ 19 $\pm$ 24) MeV/$c^{2}$ and width $Γ_R$ = (168 $\pm$ 24 $\pm$ 39) MeV is observed with a combined statistical significance of 7.6$σ$. The measured resonance parameters suggest it can be identified as the $ϕ(2170)$, thus the results provide valuable input to understand the internal nature of this state.

preprint2022arXiv

Measurement of $Λ$ baryon polarization in $e^+e^-\rightarrowΛ\barΛ$ at $\sqrt{s} = 3.773$ GeV

Using a data sample of $ψ(3770)$ events collected with the BESIII detector at BEPCII corresponding to an integrated luminosity of 2.9 fb$^{-1}$, we report a measurement of $Λ$ spin polarization in $e^+e^-\rightarrowΛ\barΛ$ at $\sqrt{s} = 3.773$ GeV. The significance of polarization is found to be 2$σ$ including the systematic uncertainty, which implies a zero phase between the transition amplitudes of the $Λ\barΛ$ helicity states. This phase can be interpreted in terms of psionic form factors, and is determined to be $ΔΦ^Ψ$ = $Φ^Ψ_{E} - Φ^Ψ_{M}$ = $(71^{+66}_{-46}$ $\pm$ 5)$^{\circ}$. Similarly, the ratio between the form factors is found to be $R^ψ$ = $|G^Ψ_{E}/G^Ψ_{M}|$ = $0.48^{+0.12}_{-0.07}$ $\pm$ 0.04. The first uncertainties are statistical and the second systematic.

preprint2022arXiv

Measurement of the branching fraction and decay asymmetry of $Λ\to nγ$

The radiative hyperon decay $Λ\to nγ$ is studied using $(10087\pm44)\times 10^6$ $J/ψ$ events collected with the BESIII detector operating at BEPCII. The absolute branching fraction of the decay $Λ\to nγ$ is determined with a significance of 5.6$σ$ to be $[0.832\pm0.038(\rm stat.)\pm0.054(\rm syst.)]\times10^{-3}$, which lies significantly below the current PDG value. By analyzing the joint angular distribution of the decay products, the first determination of the decay asymmetry $α_γ$ is reported with a value of $-0.16\pm0.10(\rm stat.)\pm0.05(\rm syst.)$.

preprint2022arXiv

Measurement of the branching fraction for $ψ(3686)\to ωK^0_SK^0_S$

Analyzing $(448.1\pm2.9)\times10^6$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the $ψ(3686)\to ωK_{S}^{0}K_{S}^{0}$ decay is observed for the first time. The branching fraction for this decay is determined to be $\mathcal{B}_{ψ(3686)\to ωK_{S}^{0}K^{0}_{S}}$=$(7.04\pm0.39\pm0.36)$$\times10^{-5}$, where the first uncertainty is statistical and the second is systematic.

preprint2022arXiv

Measurement of the branching fraction of the doubly Cabibbo-suppressed decay $D^0\to K^+π^-π^0$ and search for $D^0\to K^+π^-π^0π^0$

Using $2.93\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773\,GeV with the BESIII detector, we present a measurement of the branching fraction of the doubly Cabibbo-suppressed (DCS) decay $D^0\to K^+π^-π^0$ and a search for the DCS decay $D^0\to K^+π^-π^0π^0$. The branching fraction of $D^0\to K^+π^-π^0$ is determined to be $[3.13^{+0.60}_{-0.56}({\rm stat}) \pm 0.09({\rm syst})] \times 10^{-4}$. No signal is observed for $D^0\to K^+π^-π^0π^0$ and an upper limit of $3.6 \times 10^{-4}$ is set on the branching fraction at the 90\% C.L. We combine these results with the world-average branching fractions of their counterpart Cabibbo-favored decays to determine the ratios of the doubly Cabibbo-suppressed over the Cabibbo-favored branching fractions, ${\mathcal B}(D^0\to K^+π^-π^0)/{\mathcal B}(D^0\to K^-π^+π^0)=(0.22\pm 0.04)\%$~and ${\mathcal B}(D^0\to K^+π^-π^0π^0)/{\mathcal B}(D^0\to K^-π^+π^0π^0)<0.40\%$ at the 90\% C.L., which correspond to $(0.75\pm 0.14)\tan^{4} θ_C$~and $1.37\times \tan^{4} θ_C$, respectively, where $θ_C$ is the Cabibbo angle.

preprint2022arXiv

Measurement of the Cross Section for $e^{+}e^{-}\to$ hadrons at Energies from 2.2324 to 3.6710 GeV

Based on electron-positron collision data collected with the BESIII detector operating at the Beijing Electron Positron Collider II storage rings, the value of $R\equivσ(e^{+}e^{-}\to$hadrons)/$σ(e^{+}e^{-}\toμ^{+}μ^{-})$ is measured at 14 center-of-mass energies from 2.2324 to 3.6710 GeV. The resulting uncertainties are less than $3.0\%$, and are dominated by systematic uncertainties.

preprint2022arXiv

Measurement of the cross section of $e^{+}e^{-}\toηπ^{+}π^{-}$ at center-of-mass energies from 3.872 GeV to 4.700 GeV

Using data samples with an integrated luminosity of 19 fb$^{-1}$ at twenty-eight center-of-mass energies from 3.872 GeV to 4.700 GeV collected with the BESIII detector at the BEPCII electron--positron collider, the process $e^{+}e^{-}\toηπ^{+}π^{-}$ and the intermediate process $e^{+}e^{-}\toηρ^{0}$ are studied for the first time. The Born cross sections are measured. No significant resonance structure is observed in the cross section lineshape.

preprint2022arXiv

Measurement of the total and leptonic decay widths of the $J/ψ$ resonance with an energy scan method at BESIII

Using $e^+e^-$ annihilation data sets collected with the BESIII detector, we measure the cross sections of the processes $e^+e^- \to e^+e^-$ and $e^+e^- \to μ^+μ^-$ at fifteen center-of-mass energy points in the vicinity of the $J/ψ$ resonance. By a simultaneous fit to the measured, center-of-mass energy dependent cross sections of the two processes, the combined quantities $Γ_{ee} Γ_{ee} / Γ_{\rm tot}$ and $Γ_{ee} Γ_{μμ} / Γ_{\rm tot}$ are determined to be ($0.346 \pm 0.009$) and ($0.335 \pm 0.006$) keV, respectively, where $Γ_{ee}$, $Γ_{μμ}$, and $Γ_{\rm tot}$ are the electronic, muonic, and total decay widths of the $J/ψ$ resonance, respectively. Using the resultant $Γ_{ee} Γ_{μμ} / Γ_{\rm tot}$ and $Γ_{ee} Γ_{ee} / Γ_{\rm tot}$, the ratio $Γ_{ee} / Γ_{μμ}$ is calculated to be $1.031 \pm 0.015$, which is consistent with the expectation of lepton universality within about two standard deviations. Assuming lepton universality and using the branching fraction of the $J/ψ$ leptonic decay measured by BESIII in 2013, $Γ_{\rm tot}$ and $Γ_{ll}$ are determined to be ($93.0 \pm 2.1$) and ($5.56 \pm 0.11$) keV, respectively, where $Γ_{ll}$ is the average leptonic decay width of the $J/ψ$ resonance.

preprint2022arXiv

Measurements of Absolute Branching Fractions of $D^0\to K_L^0ϕ$, $K_L^0η$, $K_L^0ω$, and $K_L^0η^{\prime}$

We report the first measurements of the absolute branching fractions of $D^0\to K_L^0ϕ$, $D^0\to K_L^0η$, $D^0\to K_L^0ω$, and $D^0\to K_L^0η^{\prime}$, obtained by analyzing $2.93\,\rm fb^{-1}$ of $e^+e^-$ collision data taken at a center-of-mass energy of 3.773 GeV with the BESIII detector. Taking the world averages of the branching fractions of $D^0\to K_S^0ϕ$, $D^0\to K_S^0η$, $D^0\to K_S^0ω$, and $D^0\to K_S^0η^{\prime}$, the $K_S^0$-$K_L^0$ asymmetry $\mathcal{R}(D^0)$ in these decay modes are obtained. The CP asymmetries in these decays are also determined. No significant $CP$ violation is observed.

preprint2022arXiv

Measurements of the absolute branching fractions of hadronic $D$-meson decays involving kaons and pions

By analyzing an electron-positron collision data sample corresponding to an integrated luminosity of $2.93\,\rm fb^{-1}$ taken at the center-of-mass energy of 3.773 GeV with the BESIII detector, we obtain for the first time the absolute branching fractions for seven $D^0$ and $D^+$ hadronic decay modes and search for the hadronic decay $D^0\to K^0_S K^0_Sπ^0$ with much improved sensitivity. The results are ${\mathcal B}(D^0\to K^0_Sπ^0π^0π^0 )=( 7.64\pm 0.30\pm 0.29)\times 10^{-3}$, ${\mathcal B}(D^0\to K^-π^+π^0π^0π^0 )=( 9.54\pm 0.30\pm 0.31)\times 10^{-3}$, ${\mathcal B}(D^0\to K^0_Sπ^+π^-π^0π^0)=(12.66\pm 0.45\pm 0.43)\times 10^{-3}$, ${\mathcal B}(D^+\to K^0_Sπ^+π^0π^0 )=(29.04\pm 0.62\pm 0.87)\times 10^{-3}$, ${\mathcal B}(D^+\to K^0_Sπ^+π^+π^-π^0)=(15.28\pm 0.57\pm 0.60)\times 10^{-3}$, ${\mathcal B}(D^+\to K^0_Sπ^+π^0π^0π^0)=( 5.54\pm 0.44\pm 0.32)\times 10^{-3}$, ${\mathcal B}(D^+\to K^-π^+π^+π^0π^0 )=( 4.95\pm 0.26\pm 0.19)\times 10^{-3}$, ${\mathcal B}({D^0\to K^0_S K^0_Sπ^0}) < 1.57 \times 10^{-4}$ at the 90\% confidence level. Here the first uncertainties are statistical and the second ones systematic. The newly studied decays greatly enrich the knowledge of the $D\to \bar Kπππ$ and $D\to \bar Kππππ$ hadronic decays, and open a bridge to access more two-body hadronic $D$ decays containing scalar, vector, axial and tensor mesons in the charm sector.

preprint2022arXiv

MVDG: A Unified Multi-view Framework for Domain Generalization

To generalize the model trained in source domains to unseen target domains, domain generalization (DG) has recently attracted lots of attention. Since target domains can not be involved in training, overfitting source domains is inevitable. As a popular regularization technique, the meta-learning training scheme has shown its ability to resist overfitting. However, in the training stage, current meta-learning-based methods utilize only one task along a single optimization trajectory, which might produce a biased and noisy optimization direction. Beyond the training stage, overfitting could also cause unstable prediction in the test stage. In this paper, we propose a novel multi-view DG framework to effectively reduce the overfitting in both the training and test stage. Specifically, in the training stage, we develop a multi-view regularized meta-learning algorithm that employs multiple optimization trajectories to produce a suitable optimization direction for model updating. We also theoretically show that the generalization bound could be reduced by increasing the number of tasks in each trajectory. In the test stage, we utilize multiple augmented images to yield a multi-view prediction to alleviate unstable prediction, which significantly promotes model reliability. Extensive experiments on three benchmark datasets validate that our method can find a flat minimum to enhance generalization and outperform several state-of-the-art approaches.

preprint2022arXiv

Observation of $a_0(1710)^+ \to K_S^0K^+$ in study of the $D_s^+\to K_S^0K^+π^0$ decay

Using $e^+e^-$ annihilation data corresponding to an integrated luminosity of 6.32 fb$^{-1}$ collected at center-of-mass energies between 4.178 GeV and 4.226 GeV with the BESIII detector, we perform the first amplitude analysis of the decay $D_s^+\to K_S^0K^+π^0$ and determine the relative branching fractions and phases for intermediate processes. We observe the $a_0(1710)^+$, the isovector partner of the $f_0(1710)$ and $f_0(1770)$ mesons, in its decay to $K_S^0K^+$ for the first time. In addition, we measure the ratio $\frac{\mathcal{B}(D_{s}^{+} \to \bar{K}^{*}(892)^{0}K^{+})}{\mathcal{B}(D_{s}^{+} \to \bar{K}^{0}K^{*}(892)^{+})}$ to be $2.35^{+0.42}_{-0.23\text{stat.}}\pm 0.10_{\rm syst.}$. Finally, we provide a precision measurement of the absolute branching fraction $\mathcal{B}(D_s^+\to K_S^0K^+π^0) = (1.46\pm 0.06_{\text{stat.}}\pm 0.05_{\text{syst.}})\%$.

preprint2022arXiv

Observation of $η_c(2S) \to 3(π^+π^-)$ and measurements of $χ_{cJ} \to 3(π^+π^-)$ in $ψ(3686)$ radiative transitions

The hadronic decay $η_c(2S) \to 3(π^+π^-)$ is observed with a statistical significance of 9.3 standard deviations using $(448.1\pm2.9)\times10^6$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider. The measured mass and width of $η_c(2S)$ are $(3643.4 \pm 2.3 (\rm stat.) \pm 4.4 (\rm syst.))$ MeV/$c^2$ and $(19.8 \pm 3.9 (\rm stat.) \pm 3.1 (\rm syst.))$ MeV, respectively, which are consistent with the world average values within two standard deviations. The product branching fraction $\mathcal{B}[ψ(3686)\to γη_c(2S)]\times\mathcal{B}[η_c(2S)\to3(π^+π^-)]$ is measured to be $(9.2 \pm 1.0 (\rm stat.) \pm 0.9 (\rm syst.))\times10^{-6}$. Using $\mathcal{B}[ψ(3686)\to γη_c(2S)]=(7.0^{+3.4}_{-2.5})\times10^{-4}$, we obtain $\mathcal{B}[η_c(2S) \to 3(π^+π^-)] = (1.31 \pm 0.15 (\rm stat.) \pm 0.13 (\rm syst.)(^{+0.64}_{-0.47}) (\rm extr))\times10^{-2}$, where the third uncertainty is from $\mathcal{B}[ψ(3686) \to γη_c(2S)]$. We also measure the $χ_{cJ} \to 3(π^+π^-)$ ($J=0, 1, 2$) decays via $ψ(3686) \to γχ_{cJ}$ transitions. The branching fractions are $\mathcal{B}[χ_{c0} \to 3(π^+π^-)] = (2.080\pm0.006 (\rm stat.)\pm0.068 (\rm syst.))\times10^{-2}$, $\mathcal{B}[χ_{c1} \to 3(π^+π^-)] = (1.092\pm0.004 (\rm stat.)\pm0.035 (\rm syst.))\times10^{-2}$, and $\mathcal{B}[χ_{c2} \to 3(π^+π^-)] = (1.565\pm0.005 (\rm stat.)\pm0.048 (\rm syst.))\times10^{-2}$.

preprint2022arXiv

Observation of resonance structures in $e^+e^-\to π^+π^-ψ_2(3823)$ and mass measurement of $ψ_2(3823)$

Using a data sample corresponding to an integrated luminosity of 11.3 $\rm fb^{-1}$ collected at center-of-mass energies from $4.23$ to $4.70$ GeV with the BESIII detector, we measure the product of the $e^+e^-\to π^+π^-ψ_2(3823)$ cross section and the branching fraction $\mathcal{B}[ψ_2(3823)\to γχ_{c1}]$. For the first time, resonance structure is observed in the cross section line shape of $e^+e^-\to π^+π^-ψ_2(3823)$ with significances exceeding $5σ$. A fit to data with two coherent Breit-Wigner resonances modeling the $\sqrt{s}$-dependent cross section yields $M(R_1)=4406.9\pm 17.2\pm 4.5$ MeV/$c^2$, $Γ(R_1)=128.1\pm 37.2\pm 2.3$ MeV, and $M(R_2)=4647.9\pm 8.6\pm 0.8$ MeV/$c^2$, $Γ(R_2)=33.1\pm 18.6\pm 4.1$ MeV. Though weakly disfavored by the data, a single resonance with $M(R)=4417.5\pm26.2\pm3.5$ MeV/$c^2$, $Γ(R)=245\pm48\pm13$ MeV is also possible to interpret data. This observation deepens our understanding of the nature of the vector charmoniumlike states. The mass of the $ψ_2(3823)$ state is measured as $(3823.12\pm 0.43\pm 0.13)$ MeV/$c^2$, which is the most precise measurement to date.

preprint2022arXiv

Observation of the double Dalitz decay $η&#39;\to e^+e^-e^+e^-$

Based on $(10087 \pm 44)\times10^6$ $J/ψ$ events collected with the BESIII detector at BEPCII, the double Dalitz decay $η&#39;\to e^+e^-e^+e^-$ is observed for the first time via the $J/ψ\toγη&#39;$ decay process. The significance is found to be 5.7$σ$ with systematic uncertainties taken into consideration. Its branching fraction is determined to be $\mathcal{B}(η&#39;\to e^+ e^- e^+ e^-) =(4.5\pm1.0(\mathrm{stat.})\pm0.5(\mathrm{sys.})) \times 10^{-6}$.

preprint2022arXiv

Observation of the electromagnetic Dalitz decay $D^{\ast 0}\to D^{0}e^{+}e^{-}$

Based on 3.19 fb$^{-1}$ of $e^+e^-$ collision data accumulated at the center-of-mass energy 4.178 GeV with the BESIII detector operating at the BEPCII collider, the electromagnetic Dalitz decay $D^{\ast 0}\to D^{0}e^{+}e^{-}$ is observed for the first time with a statistical significance of $13.2σ$. The ratio of the branching fraction of $D^{\ast 0}\to D^{0}e^{+}e^{-}$ to that of $D^{\ast 0}\to D^{0} γ$ is measured to be $(11.08\pm0.76\pm0.49)\times 10^{-3}$. By using the world average value of the branching fraction of $D^{\ast 0}\to D^{0} γ$, the branching fraction of $D^{\ast 0}\to D^{0}e^{+}e^{-}$ is determined to be $(3.91\pm0.27\pm0.17\pm0.10)\times 10^{-3}$, where the first uncertainty is statistical, the second systematic and the third external branching fractions.

preprint2022arXiv

Observation of the Singly Cabibbo-Suppressed Decay $Λ_{c}^{+} \to nπ^{+}$

The singly Cabibbo-suppressed decay $Λ_{c}^{+} \to nπ^{+}$ is observed for the first time with a statistical significance of $7.3σ$ by using 3.9 $\mathrm{fb}^{-1}$ of $e^{+}e^{-}$ collision data collected at center-of-mass energies between 4.612 and 4.699 GeV with the BESIII detector at BEPCII. The branching fraction of $Λ_{c}^{+} \to nπ^{+}$ is measured to be $(6.6\pm1.2_{\rm stat}\pm0.4_{\rm syst})\times 10^{-4}$. By taking the upper limit of branching fractions of $Λ_{c}^{+} \to pπ^0$ from the Belle experiment, the ratio of branching fractions between $Λ_{c}^{+} \to nπ^{+}$ and $Λ_{c}^{+} \to pπ^0$ is calculated to be larger than 7.2 at the 90% confidence level, which disagrees with the current predictions of available phenomenological models. In addition, the branching fractions of the Cabibbo-favored decays $Λ_{c}^{+} \to Λπ^{+}$ and $Λ_{c}^{+} \to Σ^{0}π^{+}$ are measured to be $(1.31\pm0.08_{\rm stat}\pm0.05_{\rm syst})\times 10^{-2}$ and $(1.22\pm0.08_{\rm stat}\pm0.07_{\rm syst})\times 10^{-2}$, respectively, which are consistent with previous results.

preprint2022arXiv

Online Attentive Kernel-Based Temporal Difference Learning

With rising uncertainty in the real world, online Reinforcement Learning (RL) has been receiving increasing attention due to its fast learning capability and improving data efficiency. However, online RL often suffers from complex Value Function Approximation (VFA) and catastrophic interference, creating difficulty for the deep neural network to be applied to an online RL algorithm in a fully online setting. Therefore, a simpler and more adaptive approach is introduced to evaluate value function with the kernel-based model. Sparse representations are superior at handling interference, indicating that competitive sparse representations should be learnable, non-prior, non-truncated and explicit when compared with current sparse representation methods. Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA. In this paper, we propose an Online Attentive Kernel-Based Temporal Difference (OAKTD) algorithm using two-timescale optimization and provide convergence analysis of our proposed algorithm. Experimental evaluations showed that OAKTD outperformed several Online Kernel-based Temporal Difference (OKTD) learning algorithms in addition to the Temporal Difference (TD) learning algorithm with Tile Coding on public Mountain Car, Acrobot, CartPole and Puddle World tasks.

preprint2022arXiv

Partial wave analysis of $J/ψ\to γη^{\prime} η^{\prime}$

Using a sample of $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector, a partial wave analysis of $J/ψ\toγη^{\prime}η^{\prime}$ is performed. The masses and widths of the observed resonances and their branching fractions are reported. The main contribution is from $J/ψ\rightarrowγf_0(2020)$ with $f_0(2020)\rightarrowη^{\prime}η^{\prime}$, which is found with a significance of greater than 25$σ$. The product branching fraction ${\cal B}\left(J/ψ\rightarrowγf_0(2020)\right)\cdot{\cal B}\left(f_0(2020)\rightarrowη^{\prime}η^{\prime}\right)$ is measured to be $(2.63\pm0.06({\rm stat.})^{+0.31}_{-0.46}({\rm syst.}))\times10^{-4}$.

preprint2022arXiv

Phylogenetic Study of 2019-nCoV by Using Alignment Free Method (Evolutionary Bifurcation of Novel Coronavirus Mutants)

The phylogenetic tree of SARS-CoV-2 (nCov-19) viruses is reconstructed according to the similarity of genome sequences. The tree topology of Betacoronavirus is remarkably consistent with biologist&#39;s systematics. Because the tree construction contains enough information about virus mutants, it is suitable to study the evolutionary relationship between novel coronavirus mutants transmitted among humans. The emergences of 14 kinds of main mutants are studied and these strains can be classified as eight bifurcations of the phylogenetic tree. It is found that there exist three types of virus mutations, namely, the mutation among sub-branches of the same branch, the off-root mutation and the root-oriented mutation between large branches of the tree. From the point of the relation between viral mutation and host selection we found that individuals with low immunity provide a special environment for the positive natural selection of virus evolution. It gives a mechanism to explain why large mutations between two distant branches generally occur in the nCov-19 phylogenetic tree. The finding is helpful to formulate strategies to control the spread of COVID-19.

preprint2022arXiv

Playing Lottery Tickets in Style Transfer Models

Style transfer has achieved great success and attracted a wide range of attention from both academic and industrial communities due to its flexible application scenarios. However, the dependence on a pretty large VGG-based autoencoder leads to existing style transfer models having high parameter complexities, which limits their applications on resource-constrained devices. Compared with many other tasks, the compression of style transfer models has been less explored. Recently, the lottery ticket hypothesis (LTH) has shown great potential in finding extremely sparse matching subnetworks which can achieve on par or even better performance than the original full networks when trained in isolation. In this work, we for the first time perform an empirical study to verify whether such trainable matching subnetworks also exist in style transfer models. Specifically, we take two most popular style transfer models, i.e., AdaIN and SANet, as the main testbeds, which represent global and local transformation based style transfer methods respectively. We carry out extensive experiments and comprehensive analysis, and draw the following conclusions. (1) Compared with fixing the VGG encoder, style transfer models can benefit more from training the whole network together. (2) Using iterative magnitude pruning, we find the matching subnetworks at 89.2% sparsity in AdaIN and 73.7% sparsity in SANet, which demonstrates that style transfer models can play lottery tickets too. (3) The feature transformation module should also be pruned to obtain a much sparser model without affecting the existence and quality of the matching subnetworks. (4) Besides AdaIN and SANet, other models such as LST, MANet, AdaAttN and MCCNet can also play lottery tickets, which shows that LTH can be generalized to various style transfer models.

preprint2022arXiv

Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction

Imitation learning is a widely used policy learning method that enables intelligent agents to acquire complex skills from expert demonstrations. The input to the imitation learning algorithm is usually composed of both the current observation and historical observations since the most recent observation might not contain enough information. This is especially the case with image observations, where a single image only includes one view of the scene, and it suffers from a lack of motion information and object occlusions. In theory, providing multiple observations to the imitation learning agent will lead to better performance. However, surprisingly people find that sometimes imitation from observation histories performs worse than imitation from the most recent observation. In this paper, we explain this phenomenon from the information flow within the neural network perspective. We also propose a novel imitation learning neural network architecture that does not suffer from this issue by design. Furthermore, our method scales to high-dimensional image observations. Finally, we benchmark our approach on two widely used simulators, CARLA and MuJoCo, and it successfully alleviates the copycat problem and surpasses the existing solutions.

preprint2022arXiv

Search for $X(3872)\toπ^0χ_{c0}$ and $X(3872)\toππχ_{c0}$ at BESIII

Using 9.9 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at center-of-mass energies between 4.15 and 4.30 GeV, we search for the processes $e^+e^-\toγX(3872)$ with $X(3872)\rightarrowπ^0χ_{c0}$ and $X(3872)\rightarrowππχ_{c0}$. Depending on the fitting model, the statistical significance for $X(3872)\toπ^0χ_{c0}$ ranges from 1.3$σ$ to 2.8$σ$. We set upper limits (at 90\% C.L.) of $\frac{\mathcal{B}(X(3872)\rightarrowπ^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<3.6$, $\frac{\mathcal{B}(X(3872)\rightarrowπ^+π^-χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<0.68$, and $\frac{\mathcal{B}(X(3872)\rightarrowπ^0π^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<1.7$. Combined with the BESIII measurement of $X(3872)\toπ^0χ_{c1}$, we also set an upper limit of $\frac{\mathcal{B}(X(3872)\rightarrowπ^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^0χ_{c1})}<4.4$.

preprint2022arXiv

Search for baryon and lepton number violating decays $D^{0}\to \bar{p}e^{+}$ and $D^{0}\to pe^{-}$

Using an electron-positron collision data sample corresponding to an integrated luminosity of 2.93~fb$^{-1}$ collected with the BESIII detector at a center-of-mass energy of 3.773 GeV, we search for the baryon and lepton number violating decays $D^{0}\to \bar{p}e^{+}$ and $D^{0}\to pe^{-}$. No obvious signals are found with the current statistics. The upper limits on the branching fractions for $D^{0}\to \bar{p}e^{+}$ and $D^{0}\to pe^{-}$ are set to be $1.2\times 10^{-6}$ and $2.2\times 10^{-6}$ at 90\% confidence level, respectively.

preprint2022arXiv

Search for baryon and lepton number violation decay $D^{\pm}\to n(\bar{n})e^{\pm}$

Using a data set of electron-positron collisions corresponding to an integrated luminosity of ${\rm 2.93~fb^{-1}}$ taken with the BESIII detector at a center-of-mass energy of 3.773 GeV, a search for the baryon ($B$) and lepton ($L$) number violating decays $D^{\pm}\to n(\bar{n})e^{\pm}$ is performed. No signal is observed and the upper limits on the branching fractions at the $90\%$ confidence level are set to be $1.43\times10^{-5}$ for the decays $D^{+(-)}\to \bar{n}(n)e^{+(-)}$ with $Δ|B-L|=0$, and $2.91\times10^{-5}$ for the decays $D^{+(-)}\to n(\bar{n})e^{+(-)}$ with $Δ|B-L|=2$ , where $Δ|B-L|$ denotes the change in the difference between baryon and lepton numbers.

preprint2022arXiv

Search for invisible decays of the $Λ$ baryon

A search for invisible decays of the $Λ$ baryon is carried out in the process $J/ψ\toΛ\barΛ$ based on $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector located at the BEPCII storage ring. No signals are found for the invisible decays of $Λ$ baryon, and the upper limit of the branching fraction is determined to be $7.4 \times 10^{-5}$ at the 90% confidence level. This is the first search for invisible decays of baryons; such searches will play an important role in constraining dark sector models related to the baryon asymmetry.

preprint2022arXiv

Search for the decay $D^{0} \to π^{0} ν\barν$

We present the first experimental search for the rare charm decay $D^{0} \to π^{0} ν\barν$. It is based on an $e^+e^-$ collision sample consisting of $10.6\times10^{6}$ pairs of $D^0\bar{D}^0$ mesons collected by the BESIII detector at $\sqrt{s}$=3.773 GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. A data-driven method is used to ensure the reliability of the background modeling. No significant $D^{0} \to π^{0} ν\barν$ signal is observed in data and an upper limit of the branching fraction is set to be $2.1\times 10^{-4}$ at the 90$\%$ confidence level. This is the first experimental constraint on charmed-hadron decays into dineutrino final states.

preprint2022arXiv

Semantic-Aware Fine-Grained Correspondence

Establishing visual correspondence across images is a challenging and essential task. Recently, an influx of self-supervised methods have been proposed to better learn representations for visual correspondence. However, we find that these methods often fail to leverage semantic information and over-rely on the matching of low-level features. In contrast, human vision is capable of distinguishing between distinct objects as a pretext to tracking. Inspired by this paradigm, we propose to learn semantic-aware fine-grained correspondence. Firstly, we demonstrate that semantic correspondence is implicitly available through a rich set of image-level self-supervised methods. We further design a pixel-level self-supervised learning objective which specifically targets fine-grained correspondence. For downstream tasks, we fuse these two kinds of complementary correspondence representations together, demonstrating that they boost performance synergistically. Our method surpasses previous state-of-the-art self-supervised methods using convolutional networks on a variety of visual correspondence tasks, including video object segmentation, human pose tracking, and human part tracking.

preprint2022arXiv

ST++: Make Self-training Work Better for Semi-supervised Semantic Segmentation

Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage unlabeled data. In this work, we first construct a strong baseline of self-training (namely ST) for semi-supervised semantic segmentation via injecting strong data augmentations (SDA) on unlabeled images to alleviate overfitting noisy labels as well as decouple similar predictions between the teacher and student. With this simple mechanism, our ST outperforms all existing methods without any bells and whistles, e.g., iterative re-training. Inspired by the impressive results, we thoroughly investigate the SDA and provide some empirical analysis. Nevertheless, incorrect pseudo labels are still prone to accumulate and degrade the performance. To this end, we further propose an advanced self-training framework (namely ST++), that performs selective re-training via prioritizing reliable unlabeled images based on holistic prediction-level stability. Concretely, several model checkpoints are saved in the first stage supervised training, and the discrepancy of their predictions on the unlabeled image serves as a measurement for reliability. Our image-level selection offers holistic contextual information for learning. We demonstrate that it is more suitable for segmentation than common pixel-wise selection. As a result, ST++ further boosts the performance of our ST. Code is available at https://github.com/LiheYoung/ST-PlusPlus.

preprint2022arXiv

Switchable and unswitchable bulk photovoltaic effect in two-dimensional interlayer-sliding ferroelectrics

Spontaneous polarization and bulk photovoltaic effect (BPVE) are two concomitant physical properties in ferroelectric materials. The flipping of ferroelectric order usually accompanies with the switching of BPVE as both of them are reversed under the inversion symmetry. In this study, we report the distinctive BPVE characters in two-dimensional (2D) interlayer sliding ferroelectric materials featuring unswitchable in-plane BPVE (light-induced photocurrent in the xy plane) and switchable out-of-plane BPVE (light-induced polarization along the z-direction). Symmetry analysis within abstract bilayer crystal model and first-principles calculations validate these BPVE properties. It is because the positive and negative ferroelectric states caused by interlayer sliding are related by mirror symmetry which cannot flip all the BPVE tensor elements. This finding extends the understanding of the relationship between ferroelectricity and BPVE. On one hand, the switchable out-of-plane BPVE can be used to design switchable photoelectric devices. On the other hand, the in-plane BPVE is robust against the ferroelectric flipping, and the unswitchable character is beneficial to construct larger-scale photoelectric devices.

preprint2022arXiv

TCM-SD: A Benchmark for Probing Syndrome Differentiation via Natural Language Processing

Traditional Chinese Medicine (TCM) is a natural, safe, and effective therapy that has spread and been applied worldwide. The unique TCM diagnosis and treatment system requires a comprehensive analysis of a patient&#39;s symptoms hidden in the clinical record written in free text. Prior studies have shown that this system can be informationized and intelligentized with the aid of artificial intelligence (AI) technology, such as natural language processing (NLP). However, existing datasets are not of sufficient quality nor quantity to support the further development of data-driven AI technology in TCM. Therefore, in this paper, we focus on the core task of the TCM diagnosis and treatment system -- syndrome differentiation (SD) -- and we introduce the first public large-scale dataset for SD, called TCM-SD. Our dataset contains 54,152 real-world clinical records covering 148 syndromes. Furthermore, we collect a large-scale unlabelled textual corpus in the field of TCM and propose a domain-specific pre-trained language model, called ZY-BERT. We conducted experiments using deep neural networks to establish a strong performance baseline, reveal various challenges in SD, and prove the potential of domain-specific pre-trained language model. Our study and analysis reveal opportunities for incorporating computer science and linguistics knowledge to explore the empirical validity of TCM theories.

preprint2022arXiv

The cold gas and dust properties of red star-forming galaxies

We study the cold gas and dust properties for a sample of red star forming galaxies called &#34;red misfits.&#34; We collect single-dish CO observations and HI observations from representative samples of low-redshift galaxies, as well as our own JCMT CO observations of red misfits. We also obtain SCUBA-2 850 um observations for a subset of these galaxies. With these data we compare the molecular gas, total cold gas, and dust properties of red misfits against those of their blue counterparts (&#34;blue actives&#34;) taking non-detections into account using a survival analysis technique. We compare these properties at fixed position in the log SFR-log M* plane, as well as versus offset from the star-forming main sequence. Compared to blue actives, red misfits have slightly longer molecular gas depletion times, similar total gas depletion times, significantly lower molecular- and total-gas mass fractions, lower dust-to-stellar mass ratios, similar dust-to-gas ratios, and a significantly flatter slope in the $\log M_\mathrm{mol}$-$\log M_\star$ plane. Our results suggest that red misfits as a population are likely quenching due to a shortage in gas supply.

preprint2022arXiv

Towards Explainable Evaluation Metrics for Natural Language Generation

Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics (such as BERTScore or MoverScore) are based on black-box language models such as BERT or XLM-R. They often achieve strong correlations with human judgments, but recent research indicates that the lower-quality classical metrics remain dominant, one of the potential reasons being that their decision processes are transparent. To foster more widespread acceptance of the novel high-quality metrics, explainability thus becomes crucial. In this concept paper, we identify key properties and propose key goals of explainable machine translation evaluation metrics. We also provide a synthesizing overview over recent approaches for explainable machine translation metrics and discuss how they relate to those goals and properties. Further, we conduct own novel experiments, which (among others) find that current adversarial NLP techniques are unsuitable for automatically identifying limitations of high-quality black-box evaluation metrics, as they are not meaning-preserving. Finally, we provide a vision of future approaches to explainable evaluation metrics and their evaluation. We hope that our work can help catalyze and guide future research on explainable evaluation metrics and, mediately, also contribute to better and more transparent text generation systems.

preprint2022arXiv

Transformers in Medical Image Analysis: A Review

Transformers have dominated the field of natural language processing, and recently impacted the computer vision area. In the field of medical image analysis, Transformers have also been successfully applied to full-stack clinical applications, including image synthesis/reconstruction, registration, segmentation, detection, and diagnosis. Our paper aims to promote awareness and application of Transformers in the field of medical image analysis. Specifically, we first overview the core concepts of the attention mechanism built into Transformers and other basic components. Second, we review various Transformer architectures tailored for medical image applications and discuss their limitations. Within this review, we investigate key challenges revolving around the use of Transformers in different learning paradigms, improving the model efficiency, and their coupling with other techniques. We hope this review can give a comprehensive picture of Transformers to the readers in the field of medical image analysis.

preprint2022arXiv

Unifying Cross-lingual Summarization and Machine Translation with Compression Rate

Cross-Lingual Summarization (CLS) is a task that extracts important information from a source document and summarizes it into a summary in another language. It is a challenging task that requires a system to understand, summarize, and translate at the same time, making it highly related to Monolingual Summarization (MS) and Machine Translation (MT). In practice, the training resources for Machine Translation are far more than that for cross-lingual and monolingual summarization. Thus incorporating the Machine Translation corpus into CLS would be beneficial for its performance. However, the present work only leverages a simple multi-task framework to bring Machine Translation in, lacking deeper exploration. In this paper, we propose a novel task, Cross-lingual Summarization with Compression rate (CSC), to benefit Cross-Lingual Summarization by large-scale Machine Translation corpus. Through introducing compression rate, the information ratio between the source and the target text, we regard the MT task as a special CLS task with a compression rate of 100%. Hence they can be trained as a unified task, sharing knowledge more effectively. However, a huge gap exists between the MT task and the CLS task, where samples with compression rates between 30% and 90% are extremely rare. Hence, to bridge these two tasks smoothly, we propose an effective data augmentation method to produce document-summary pairs with different compression rates. The proposed method not only improves the performance of the CLS task, but also provides controllability to generate summaries in desired lengths. Experiments demonstrate that our method outperforms various strong baselines in three cross-lingual summarization datasets. We released our code and data at https://github.com/ybai-nlp/CLS_CR.

preprint2021arXiv

A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

How to fast and accurately assess the severity level of COVID-19 is an essential problem, when millions of people are suffering from the pandemic around the world. Currently, the chest CT is regarded as a popular and informative imaging tool for COVID-19 diagnosis. However, we observe that there are two issues -- weak annotation and insufficient data that may obstruct automatic COVID-19 severity assessment with CT images. To address these challenges, we propose a novel three-component method, i.e., 1) a deep multiple instance learning component with instance-level attention to jointly classify the bag and also weigh the instances, 2) a bag-level data augmentation component to generate virtual bags by reorganizing high confidential instances, and 3) a self-supervised pretext component to aid the learning process. We have systematically evaluated our method on the CT images of 229 COVID-19 cases, including 50 severe and 179 non-severe cases. Our method could obtain an average accuracy of 95.8%, with 93.6% sensitivity and 96.4% specificity, which outperformed previous works.

preprint2021arXiv

Cross section measurements of the $e^+e^-\to D^{*+}D^{*-}$ and $e^+e^-\to D^{*+}D^{-}$ processes at center-of-mass energies from 4.085 to 4.600 GeV

The Born cross sections of the $e^+e^-\to D^{*+}D^{*-}$ and $e^+e^-\to D^{*+}D^{-}$ processes are measured using $e^+e^-$ collision data collected with the BESIII experiment at center-of-mass energies from 4.085 to 4.600 GeV, corresponding to an integrated luminosity of $15.7~{\rm fb}^{-1}$. The results are consistent with and more precise than the previous measurements by the Belle, Babar and CLEO collaborations. The measurements are essential for understanding the nature of vector charmonium and charmonium-like states.

preprint2021arXiv

Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions

Accurate image segmentation plays a crucial role in medical image analysis, yet it faces great challenges of various shapes, diverse sizes, and blurry boundaries. To address these difficulties, square kernel-based encoder-decoder architecture has been proposed and widely used, but its performance remains still unsatisfactory. To further cope with these challenges, we present a novel double-branch encoder architecture. Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels in the double-branch encoder, so features learned by the two branches can be expected to complement each other. 2) Considering that spatial attention can help models to better focus on the target region in a large-sized image, we develop an attention loss to further emphasize the segmentation on small-sized targets. Together, the above two schemes give rise to a novel double-branch encoder segmentation framework for medical image segmentation, namely Crosslink-Net. The experiments validate the effectiveness of our model on four datasets. The code is released at https://github.com/Qianyu1226/Crosslink-Net.

preprint2021arXiv

Deep Symmetric Adaptation Network for Cross-modality Medical Image Segmentation

Unsupervised domain adaptation (UDA) methods have shown their promising performance in the cross-modality medical image segmentation tasks. These typical methods usually utilize a translation network to transform images from the source domain to target domain or train the pixel-level classifier merely using translated source images and original target images. However, when there exists a large domain shift between source and target domains, we argue that this asymmetric structure could not fully eliminate the domain gap. In this paper, we present a novel deep symmetric architecture of UDA for medical image segmentation, which consists of a segmentation sub-network, and two symmetric source and target domain translation sub-networks. To be specific, based on two translation sub-networks, we introduce a bidirectional alignment scheme via a shared encoder and private decoders to simultaneously align features 1) from source to target domain and 2) from target to source domain, which helps effectively mitigate the discrepancy between domains. Furthermore, for the segmentation sub-network, we train a pixel-level classifier using not only original target images and translated source images, but also original source images and translated target images, which helps sufficiently leverage the semantic information from the images with different styles. Extensive experiments demonstrate that our method has remarkable advantages compared to the state-of-the-art methods in both cross-modality Cardiac and BraTS segmentation tasks.

preprint2021arXiv

ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation

Due to its robust and precise distance measurements, LiDAR plays an important role in scene understanding for autonomous driving. Training deep neural networks (DNNs) on LiDAR data requires large-scale point-wise annotations, which are time-consuming and expensive to obtain. Instead, simulation-to-real domain adaptation (SRDA) trains a DNN using unlimited synthetic data with automatically generated labels and transfers the learned model to real scenarios. Existing SRDA methods for LiDAR point cloud segmentation mainly employ a multi-stage pipeline and focus on feature-level alignment. They require prior knowledge of real-world statistics and ignore the pixel-level dropout noise gap and the spatial feature gap between different domains. In this paper, we propose a novel end-to-end framework, named ePointDA, to address the above issues. Specifically, ePointDA consists of three modules: self-supervised dropout noise rendering, statistics-invariant and spatially-adaptive feature alignment, and transferable segmentation learning. The joint optimization enables ePointDA to bridge the domain shift at the pixel-level by explicitly rendering dropout noise for synthetic LiDAR and at the feature-level by spatially aligning the features between different domains, without requiring the real-world statistics. Extensive experiments adapting from synthetic GTA-LiDAR to real KITTI and SemanticKITTI demonstrate the superiority of ePointDA for LiDAR point cloud segmentation.

preprint2021arXiv

Intrinsic nonlinear Hall effect in antiferromagnetic tetragonal CuMnAs

Detecting the orientation of the Néel vector is a major research topic in antiferromagnetic spintronics. Here we recognize the intrinsic nonlinear Hall effect, which is independent of the relaxation time, as a prominent contribution to the time-reversal-odd second order conductivity and can be used to detect the flipping of the Néel vector. In contrast, the Berry-curvature-dipole-induced nonlinear Hall effect depends linear on relaxation time and is time-reversal-even. We study the intrinsic nonlinear Hall effect in an antiferromagnetic metal: tetragonal CuMnAs, and show that its nonlinear Hall conductivity can reach the order of mA/V$^2$. The dependence on the chemical potential of such nonlinear Hall conductivity can be qualitatively explained by a tilted massive Dirac model. Moreover, we demonstrate its strong temperature dependence and briefly discuss its competition with the second order Drude conductivity. Finally, a complete survey of magnetic point groups are presented, providing guidelines for finding more antiferromagnetic materials with the intrinsic nonlinear Hall effect.

preprint2021arXiv

MetricUNet: Synergistic Image- and Voxel-Level Learning for Precise CT Prostate Segmentation via Online Sampling

Fully convolutional networks (FCNs), including UNet and VNet, are widely-used network architectures for semantic segmentation in recent studies. However, conventional FCN is typically trained by the cross-entropy or Dice loss, which only calculates the error between predictions and ground-truth labels for pixels individually. This often results in non-smooth neighborhoods in the predicted segmentation. To address this problem, we propose a two-stage framework, with the first stage to quickly localize the prostate region and the second stage to precisely segment the prostate by a multi-task UNet architecture. We introduce a novel online metric learning module through voxel-wise sampling in the multi-task network. Therefore, the proposed network has a dual-branch architecture that tackles two tasks: 1) a segmentation sub-network aiming to generate the prostate segmentation, and 2) a voxel-metric learning sub-network aiming to improve the quality of the learned feature space supervised by a metric loss. Specifically, the voxel-metric learning sub-network samples tuples (including triplets and pairs) in voxel-level through the intermediate feature maps. Unlike conventional deep metric learning methods that generate triplets or pairs in image-level before the training phase, our proposed voxel-wise tuples are sampled in an online manner and operated in an end-to-end fashion via multi-task learning. To evaluate the proposed method, we implement extensive experiments on a real CT image dataset consisting of 339 patients. The ablation studies show that our method can effectively learn more representative voxel-level features compared with the conventional learning methods with cross-entropy or Dice loss. And the comparisons show that the proposed method outperforms the state-of-the-art methods by a reasonable margin.

preprint2021arXiv

Reinforcement Learning with Latent Flow

Temporal information is essential to learning effective policies with Reinforcement Learning (RL). However, current state-of-the-art RL algorithms either assume that such information is given as part of the state space or, when learning from pixels, use the simple heuristic of frame-stacking to implicitly capture temporal information present in the image observations. This heuristic is in contrast to the current paradigm in video classification architectures, which utilize explicit encodings of temporal information through methods such as optical flow and two-stream architectures to achieve state-of-the-art performance. Inspired by leading video classification architectures, we introduce the Flow of Latents for Reinforcement Learning (Flare), a network architecture for RL that explicitly encodes temporal information through latent vector differences. We show that Flare (i) recovers optimal performance in state-based RL without explicit access to the state velocity, solely with positional state information, (ii) achieves state-of-the-art performance on pixel-based challenging continuous control tasks within the DeepMind control benchmark suite, namely quadruped walk, hopper hop, finger turn hard, pendulum swing, and walker run, and is the most sample efficient model-free pixel-based RL algorithm, outperforming the prior model-free state-of-the-art by 1.9X and 1.5X on the 500k and 1M step benchmarks, respectively, and (iv), when augmented over rainbow DQN, outperforms this state-of-the-art level baseline on 5 of 8 challenging Atari games at 100M time step benchmark.

preprint2020arXiv

A coupled cluster framework for electrons and phonons

We describe a coupled cluster framework for coupled systems of electrons and phonons. Neutral and charged excitations are accessed via the equation-of-motion version of the theory. Benchmarks on the Hubbard-Holstein model allow us to assess the strengths and weaknesses of different coupled cluster approximations which generally perform well for weak to moderate coupling. Finally, we report progress towards an implementation for {\it ab initio} calculations on solids, and present some preliminary results on finite-size models of diamond. We also report the implementation of electron-phonon coupling matrix elements from crystalline Gaussian type orbitals (cGTO) within the PySCF program package.

preprint2020arXiv

Automatic Data Augmentation via Deep Reinforcement Learning for Effective Kidney Tumor Segmentation

Conventional data augmentation realized by performing simple pre-processing operations (\eg, rotation, crop, \etc) has been validated for its advantage in enhancing the performance for medical image segmentation. However, the data generated by these conventional augmentation methods are random and sometimes harmful to the subsequent segmentation. In this paper, we developed a novel automatic learning-based data augmentation method for medical image segmentation which models the augmentation task as a trial-and-error procedure using deep reinforcement learning (DRL). In our method, we innovatively combine the data augmentation module and the subsequent segmentation module in an end-to-end training manner with a consistent loss. Specifically, the best sequential combination of different basic operations is automatically learned by directly maximizing the performance improvement (\ie, Dice ratio) on the available validation set. We extensively evaluated our method on CT kidney tumor segmentation which validated the promising results of our method.

preprint2020arXiv

Crossbar-Net: A Novel Convolutional Network for Kidney Tumor Segmentation in CT Images

Due to the irregular motion, similar appearance and diverse shape, accurate segmentation of kidney tumor in CT images is a difficult and challenging task. To this end, we present a novel automatic segmentation method, termed as Crossbar-Net, with the goal of accurate segmenting the kidney tumors. Firstly, considering that the traditional learning-based segmentation methods normally employ either whole images or squared patches as the training samples, we innovatively sample the orthogonal non-squared patches (namely crossbar patches), to fully cover the whole kidney tumors in either horizontal or vertical directions. These sampled crossbar patches could not only represent the detailed local information of kidney tumor as the traditional patches, but also describe the global appearance from either horizontal or vertical direction using contextual information. Secondly, with the obtained crossbar patches, we trained a convolutional neural network with two sub-models (i.e., horizontal sub-model and vertical sub-model) in a cascaded manner, to integrate the segmentation results from two directions (i.e., horizontal and vertical). This cascaded training strategy could effectively guarantee the consistency between sub-models, by feeding each other with the most difficult samples, for a better segmentation. In the experiment, we evaluate our method on a real CT kidney tumor dataset, collected from 94 different patients including 3,500 images. Compared with the state-of-the-art segmentation methods, the results demonstrate the superior results of our method on dice ratio score, true positive fraction, centroid distance and Hausdorff distance. Moreover, we have extended our crossbar-net to a different task: cardiac segmentation, showing the promising results for the better generalization.

preprint2020arXiv

Crossover-Net: Leveraging the Vertical-Horizontal Crossover Relation for Robust Segmentation

Robust segmentation for non-elongated tissues in medical images is hard to realize due to the large variation of the shape, size, and appearance of these tissues in different patients. In this paper, we present an end-to-end trainable deep segmentation model termed Crossover-Net for robust segmentation in medical images. Our proposed model is inspired by an insightful observation: during segmentation, the representation from the horizontal and vertical directions can provide different local appearance and orthogonality context information, which helps enhance the discrimination between different tissues by simultaneously learning from these two directions. Specifically, by converting the segmentation task to a pixel/voxel-wise prediction problem, firstly, we originally propose a cross-shaped patch, namely crossover-patch, which consists of a pair of (orthogonal and overlapped) vertical and horizontal patches, to capture the orthogonal vertical and horizontal relation. Then, we develop the Crossover-Net to learn the vertical-horizontal crossover relation captured by our crossover-patches. To achieve this goal, for learning the representation on a typical crossover-patch, we design a novel loss function to (1) impose the consistency on the overlap region of the vertical and horizontal patches and (2) preserve the diversity on their non-overlap regions. We have extensively evaluated our method on CT kidney tumor, MR cardiac, and X-ray breast mass segmentation tasks. Promising results are achieved according to our extensive evaluation and comparison with the state-of-the-art segmentation models.

preprint2020arXiv

Deep Learning on Knowledge Graph for Recommender System: A Survey

Recent advances in research have demonstrated the effectiveness of knowledge graphs (KG) in providing valuable external knowledge to improve recommendation systems (RS). A knowledge graph is capable of encoding high-order relations that connect two objects with one or multiple related attributes. With the help of the emerging Graph Neural Networks (GNN), it is possible to extract both object characteristics and relations from KG, which is an essential factor for successful recommendations. In this paper, we provide a comprehensive survey of the GNN-based knowledge-aware deep recommender systems. Specifically, we discuss the state-of-the-art frameworks with a focus on their core component, i.e., the graph embedding module, and how they address practical recommendation issues such as scalability, cold-start and so on. We further summarize the commonly-used benchmark datasets, evaluation metrics as well as open-source codes. Finally, we conclude the survey and propose potential research directions in this rapidly growing field.

preprint2020arXiv

Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification

The history of deep learning has shown that human-designed problem-specific networks can greatly improve the classification performance of general neural models. In most practical cases, however, choosing the optimal architecture for a given task remains a challenging problem. Recent architecture-search methods are able to automatically build neural models with strong performance but fail to fully appreciate the interaction between neural architecture and weights. This work investigates the problem of disentangling the role of the neural structure and its edge weights, by showing that well-trained architectures may not need any link-specific fine-tuning of the weights. We compare the performance of such weight-free networks (in our case these are binary networks with {0, 1}-valued weights) with random, weight-agnostic, pruned and standard fully connected networks. To find the optimal weight-agnostic network, we use a novel and computationally efficient method that translates the hard architecture-search problem into a feasible optimization problem.More specifically, we look at the optimal task-specific architectures as the optimal configuration of binary networks with {0, 1}-valued weights, which can be found through an approximate gradient descent strategy. Theoretical convergence guarantees of the proposed algorithm are obtained by bounding the error in the gradient approximation and its practical performance is evaluated on two real-world data sets. For measuring the structural similarities between different architectures, we use a novel spectral approach that allows us to underline the intrinsic differences between real-valued networks and weight-free architectures.

preprint2020arXiv

Diversity Helps: Unsupervised Few-shot Learning via Distribution Shift-based Data Augmentation

Few-shot learning aims to learn a new concept when only a few training examples are available, which has been extensively explored in recent years. However, most of the current works heavily rely on a large-scale labeled auxiliary set to train their models in an episodic-training paradigm. Such a kind of supervised setting basically limits the widespread use of few-shot learning algorithms. Instead, in this paper, we develop a novel framework called Unsupervised Few-shot Learning via Distribution Shift-based Data Augmentation (ULDA), which pays attention to the distribution diversity inside each constructed pretext few-shot task when using data augmentation. Importantly, we highlight the value and importance of the distribution diversity in the augmentation-based pretext few-shot tasks, which can effectively alleviate the overfitting problem and make the few-shot model learn more robust feature representations. In ULDA, we systemically investigate the effects of different augmentation techniques and propose to strengthen the distribution diversity (or difference) between the query set and support set in each few-shot task, by augmenting these two sets diversely (i.e., distribution shifting). In this way, even incorporated with simple augmentation techniques (e.g., random crop, color jittering, or rotation), our ULDA can produce a significant improvement. In the experiments, few-shot models learned by ULDA can achieve superior generalization performance and obtain state-of-the-art results in a variety of established few-shot learning tasks on Omniglot and miniImageNet. The source code is available in https://github.com/WonderSeven/ULDA.

preprint2020arXiv

GreyReID: A Two-stream Deep Framework with RGB-grey Information for Person Re-identification

In this paper, we observe that most false positive images (i.e., different identities with query images) in the top ranking list usually have the similar color information with the query image in person re-identification (Re-ID). Meanwhile, when we use the greyscale images generated from RGB images to conduct the person Re-ID task, some hard query images can obtain better performance compared with using RGB images. Therefore, RGB and greyscale images seem to be complementary to each other for person Re-ID. In this paper, we aim to utilize both RGB and greyscale images to improve the person Re-ID performance. To this end, we propose a novel two-stream deep neural network with RGB-grey information, which can effectively fuse RGB and greyscale feature representations to enhance the generalization ability of Re-ID. Firstly, we convert RGB images to greyscale images in each training batch. Based on these RGB and greyscale images, we train the RGB and greyscale branches, respectively. Secondly, to build up connections between RGB and greyscale branches, we merge the RGB and greyscale branches into a new joint branch. Finally, we concatenate the features of all three branches as the final feature representation for Re-ID. Moreover, in the training process, we adopt the joint learning scheme to simultaneously train each branch by the independent loss function, which can enhance the generalization ability of each branch. Besides, a global loss function is utilized to further fine-tune the final concatenated feature. The extensive experiments on multiple benchmark datasets fully show that the proposed method can outperform the state-of-the-art person Re-ID methods. Furthermore, using greyscale images can indeed improve the person Re-ID performance.

preprint2020arXiv

HF-UNet: Learning Hierarchically Inter-Task Relevance in Multi-Task U-Net for Accurate Prostate Segmentation

Accurate segmentation of the prostate is a key step in external beam radiation therapy treatments. In this paper, we tackle the challenging task of prostate segmentation in CT images by a two-stage network with 1) the first stage to fast localize, and 2) the second stage to accurately segment the prostate. To precisely segment the prostate in the second stage, we formulate prostate segmentation into a multi-task learning framework, which includes a main task to segment the prostate, and an auxiliary task to delineate the prostate boundary. Here, the second task is applied to provide additional guidance of unclear prostate boundary in CT images. Besides, the conventional multi-task deep networks typically share most of the parameters (i.e., feature representations) across all tasks, which may limit their data fitting ability, as the specificities of different tasks are inevitably ignored. By contrast, we solve them by a hierarchically-fused U-Net structure, namely HF-UNet. The HF-UNet has two complementary branches for two tasks, with the novel proposed attention-based task consistency learning block to communicate at each level between the two decoding branches. Therefore, HF-UNet endows the ability to learn hierarchically the shared representations for different tasks, and preserve the specificities of learned representations for different tasks simultaneously. We did extensive evaluations of the proposed method on a large planning CT image dataset, including images acquired from 339 patients. The experimental results show HF-UNet outperforms the conventional multi-task network architectures and the state-of-the-art methods.

preprint2020arXiv

Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and Summarisation

For many NLP applications, such as question answering and summarisation, the goal is to select the best solution from a large space of candidates to meet a particular user&#39;s needs. To address the lack of user-specific training data, we propose an interactive text ranking approach that actively selects pairs of candidates, from which the user selects the best. Unlike previous strategies, which attempt to learn a ranking across the whole candidate space, our method employs Bayesian optimisation to focus the user&#39;s labelling effort on high quality candidates and integrates prior knowledge in a Bayesian manner to cope better with small data scenarios. We apply our method to community question answering (cQA) and extractive summarisation, finding that it significantly outperforms existing interactive approaches. We also show that the ranking function learned by our method is an effective reward function for reinforcement learning, which improves the state of the art for interactive summarisation.

preprint2020arXiv

Interactive Text-to-Speech System via Joint Style Analysis

While modern TTS technologies have made significant advancements in audio quality, there is still a lack of behavior naturalness compared to conversing with people. We propose a style-embedded TTS system that generates styled responses based on the speech query style. To achieve this, the system includes a style extraction model that extracts a style embedding from the speech query, which is then used by the TTS to produce a matching response. We faced two main challenges: 1) only a small portion of the TTS training dataset has style labels, which is needed to train a multi-style TTS that respects different style embeddings during inference. 2) The TTS system and the style extraction model have disjoint training datasets. We need consistent style labels across these two datasets so that the TTS can learn to respect the labels produced by the style extraction model during inference. To solve these, we adopted a semi-supervised approach that uses the style extraction model to create style labels for the TTS dataset and applied transfer learning to learn the style embedding jointly. Our experiment results show user preference for the styled TTS responses and demonstrate the style-embedded TTS system&#39;s capability of mimicking the speech query style.

preprint2020arXiv

Learning-based Computer-aided Prescription Model for Parkinson&#39;s Disease: A Data-driven Perspective

In this paper, we study a novel problem: &#34;automatic prescription recommendation for PD patients.&#34; To realize this goal, we first build a dataset by collecting 1) symptoms of PD patients, and 2) their prescription drug provided by neurologists. Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug. Finally, for the new coming patients, we could recommend (predict) suitable prescription drug on their observed symptoms by our prescription model. From the methodology part, our proposed model, namely Prescription viA Learning lAtent Symptoms (PALAS), could recommend prescription using the multi-modality representation of the data. In PALAS, a latent symptom space is learned to better model the relationship between symptoms and prescription drug, as there is a large semantic gap between them. Moreover, we present an efficient alternating optimization method for PALAS. We evaluated our method using the data collected from 136 PD patients at Nanjing Brain Hospital, which can be regarded as a large dataset in PD research community. The experimental results demonstrate the effectiveness and clinical potential of our method in this recommendation task, if compared with other competing methods.

preprint2020arXiv

Linear magnetoresistance induced by intra-scattering semiclassics of Bloch electrons

The weak field magnetoresistance has seen a revived interest due to the distinct role played by the momentum-space Berry curvature of Bloch electrons. While most previous studies in this regard focus on the inter-scattering motion of semiclassical Bloch electrons in electromagnetic fields, the intra-scattering effects of the semiclassical dynamics augmented by the Berry curvature, magnetic moment and shift vector on the magnetoresistance have been largely overlooked. Here we uncover that these intra-scattering effects, which are neglected in the field-independent relaxation time approximation to the Boltzmann collision integral, can be as important as the inter-scattering ones. Concrete calculations on the two dimensional gapped Dirac model show that the sign of the negative linear magnetoresistance given by the Berry curvature alone is reversed when one considers the magnetic moment and shift vector.

preprint2020arXiv

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal. In the natural world, intelligent organisms learn from internal drives, bypassing the need for external signals, which is beneficial for a wide range of tasks. Motivated by this observation, we propose to formulate an intrinsic objective as the mutual information between the goal states and the controllable states. This objective encourages the agent to take control of its environment. Subsequently, we derive a surrogate objective of the proposed reward function, which can be optimized efficiently. Lastly, we evaluate the developed framework in different robotic manipulation and navigation tasks and demonstrate the efficacy of our approach. A video showing experimental results is available at https://youtu.be/CT4CKMWBYz0

preprint2020arXiv

NAS-FCOS: Fast Neural Architecture Search for Object Detection

The success of deep neural networks relies on significant architecture engineering. Recently neural architecture search (NAS) has emerged as a promise to greatly reduce manual effort in network design by automatically searching for optimal architectures, although typically such algorithms need an excessive amount of computational resources, e.g., a few thousand GPU-days. To date, on challenging vision tasks such as object detection, NAS, especially fast versions of NAS, is less studied. Here we propose to search for the decoder structure of object detectors with search efficiency being taken into consideration. To be more specific, we aim to efficiently search for the feature pyramid network (FPN) as well as the prediction head of a simple anchor-free object detector, namely FCOS, using a tailored reinforcement learning paradigm. With carefully designed search space, search algorithms and strategies for evaluating network quality, we are able to efficiently search a top-performing detection architecture within 4 days using 8 V100 GPUs. The discovered architecture surpasses state-of-the-art object detection models (such as Faster R-CNN, RetinaNet and FCOS) by 1.5 to 3.5 points in AP on the COCO dataset, with comparable computation complexity and memory footprint, demonstrating the efficacy of the proposed NAS for object detection.

preprint2020arXiv

Observational features of exoplanetary synchrotron radio bursts

Magnetic fields of exoplanets are important in shielding the planets from cosmic rays and interplanetary plasma. Due to the interaction with the electrons from their host stars, the exoplanetary magnetospheres are predicted to have both cyclotron and synchrotron radio emissions, of which neither has been definitely identified in observations yet. As the coherent cyclotron emission has been extensively studied in literatures, here we focus on the planetary synchrotron radiation with bursty behaviors (i.e., radio flares) caused by the outbreaks of energetic electron ejections from the host star. Two key parameters of the bursty synchrotron emissions, namely the flux density and burst rate, and two key features namely the burst light curve and frequency shift, are predicted for star - hot Jupiter systems. The planetary orbital phase - burst rate relation is also considered as the signature of star-planet interactions (SPI). As examples, previous X-ray and radio observations of two well studied candidate systems, HD 189733 and V830 τ, are adopted to predict their specific burst rates and fluxes of bursty synchrotron emissions for further observational confirmations. The detectability of such emissions by current and upcoming radio telescopes shows that we are at the dawn of discoveries.

preprint2020arXiv

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Evaluation of cross-lingual encoders is usually performed either via zero-shot cross-lingual transfer in supervised downstream tasks or via unsupervised cross-lingual textual similarity. In this paper, we concern ourselves with reference-free machine translation (MT) evaluation where we directly compare source texts to (sometimes low-quality) system translations, which represents a natural adversarial setup for multilingual encoders. Reference-free evaluation holds the promise of web-scale comparison of MT systems. We systematically investigate a range of metrics based on state-of-the-art cross-lingual semantic representations obtained with pretrained M-BERT and LASER. We find that they perform poorly as semantic encoders for reference-free MT evaluation and identify their two key limitations, namely, (a) a semantic mismatch between representations of mutual translations and, more prominently, (b) the inability to punish &#34;translationese&#34;, i.e., low-quality literal translations. We propose two partial remedies: (1) post-hoc re-alignment of the vector spaces and (2) coupling of semantic-similarity based metrics with target-side language modeling. In segment-level MT evaluation, our best metric surpasses reference-based BLEU by 5.7 correlation points.

preprint2020arXiv

Progressive Cross-camera Soft-label Learning for Semi-supervised Person Re-identification

In this paper, we focus on the semi-supervised person re-identification (Re-ID) case, which only has the intra-camera (within-camera) labels but not inter-camera (cross-camera) labels. In real-world applications, these intra-camera labels can be readily captured by tracking algorithms or few manual annotations, when compared with cross-camera labels. In this case, it is very difficult to explore the relationships between cross-camera persons in the training stage due to the lack of cross-camera label information. To deal with this issue, we propose a novel Progressive Cross-camera Soft-label Learning (PCSL) framework for the semi-supervised person Re-ID task, which can generate cross-camera soft-labels and utilize them to optimize the network. Concretely, we calculate an affinity matrix based on person-level features and adapt them to produce the similarities between cross-camera persons (i.e., cross-camera soft-labels). To exploit these soft-labels to train the network, we investigate the weighted cross-entropy loss and the weighted triplet loss from the classification and discrimination perspectives, respectively. Particularly, the proposed framework alternately generates progressive cross-camera soft-labels and gradually improves feature representations in the whole learning course. Extensive experiments on five large-scale benchmark datasets show that PCSL significantly outperforms the state-of-the-art unsupervised methods that employ labeled source domains or the images generated by the GAN-based models. Furthermore, the proposed method even has a competitive performance with respect to deep supervised Re-ID methods.

preprint2020arXiv

Singular points of polarizations in the momentum space of photonic crystal slabs

Bound states in the continuum (BICs), circularly polarized states (C points) and degenerate states are all of three types of singular points of polarization in the momentum space. For photonic crystal slabs (PhCSs) with linearly polarized far fields, BICs were found to be the centers of polarization vortices and attracted more attention in the previous studies. Here, we theoretically demonstrate that the far fields of PhCSs can exhibit remarkably diverse polarizations due to the robust existences of C points in the continuum. Only a pair of C points with identical handedness and opposite topological charge can be annihilated together. Continuously fine tuning of the structure parameters of PhCSs without breaking their symmetry, a pair of C points with identical topological charge and opposite handedness are able to merge into a BIC, then the BIC splits into C points again. Interestingly, a Dirac-degenerate BIC with one half of topological charge is observed when two pairs of C points with identical topological charge from the upper and lower band, respectively, simultaneously merge at the Dirac-degenerate point. The law of topological charge conservation is verified to play an important role in the evolutions and interconversions between different types of polarization singularities. Our findings might shed light on the origin of singular points of polarization, could open a gateway towards the applications of them in the generation and manipulation of vector beams.

preprint2020arXiv

SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization

We study unsupervised multi-document summarization evaluation metrics, which require neither human-written reference summaries nor human annotations (e.g. preferences, ratings, etc.). We propose SUPERT, which rates the quality of a summary by measuring its semantic similarity with a pseudo reference summary, i.e. selected salient sentences from the source documents, using contextualized embeddings and soft token alignment techniques. Compared to the state-of-the-art unsupervised evaluation metrics, SUPERT correlates better with human ratings by 18-39%. Furthermore, we use SUPERT as rewards to guide a neural-based reinforcement learning summarizer, yielding favorable performance compared to the state-of-the-art unsupervised summarizers. All source code is available at https://github.com/yg211/acl20-ref-free-eval.

preprint2020arXiv

Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

Understanding chest CT imaging of the coronavirus disease 2019 (COVID-19) will help detect infections early and assess the disease progression. Especially, automated severity assessment of COVID-19 in CT images plays an essential role in identifying cases that are in great need of intensive clinical care. However, it is often challenging to accurately assess the severity of this disease in CT images, due to variable infection regions in the lungs, similar imaging biomarkers, and large inter-case variations. To this end, we propose a synergistic learning framework for automated severity assessment of COVID-19 in 3D CT images, by jointly performing lung lobe segmentation and multi-instance classification. Considering that only a few infection regions in a CT image are related to the severity assessment, we first represent each input image by a bag that contains a set of 2D image patches (with each cropped from a specific slice). A multi-task multi-instance deep network (called M$^2$UNet) is then developed to assess the severity of COVID-19 patients and also segment the lung lobe simultaneously. Our M$^2$UNet consists of a patch-level encoder, a segmentation sub-network for lung lobe segmentation, and a classification sub-network for severity assessment (with a unique hierarchical multi-instance learning strategy). Here, the context information provided by segmentation can be implicitly employed to improve the performance of severity assessment. Extensive experiments were performed on a real COVID-19 CT image dataset consisting of 666 chest CT images, with results suggesting the effectiveness of our proposed method compared to several state-of-the-art methods.

preprint2020arXiv

Thermodynamics of Energy Magnetization

We construct the thermodynamics of energy magnetization in the presence of gravitomagnetic field. We show that the free energy must be modified to account for the modification of the energy current operator in the presence of a confining potential. The explicit expression of the energy magnetization is derived for a periodic system, and the Streda formula for the thermal Hall conductivity is rigorously established. We demonstrate our theory of the energy magnetization and the Streda formula in a Chern insulator.

preprint2020arXiv

Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition

Caricature attributes provide distinctive facial features to help research in Psychology and Neuroscience. However, unlike the facial photo attribute datasets that have a quantity of annotated images, the annotations of caricature attributes are rare. To facility the research in attribute learning of caricatures, we propose a caricature attribute dataset, namely WebCariA. Moreover, to utilize models that trained by face attributes, we propose a novel unsupervised domain adaptation framework for cross-modality (i.e., photos to caricatures) attribute recognition, with an integrated inter- and intra-domain consistency learning scheme. Specifically, the inter-domain consistency learning scheme consisting an image-to-image translator to first fill the domain gap between photos and caricatures by generating intermediate image samples, and a label consistency learning module to align their semantic information. The intra-domain consistency learning scheme integrates the common feature consistency learning module with a novel attribute-aware attention-consistency learning module for a more efficient alignment. We did an extensive ablation study to show the effectiveness of the proposed method. And the proposed method also outperforms the state-of-the-art methods by a margin. The implementation of the proposed method is available at https://github.com/KeleiHe/DAAN.

preprint2020arXiv

xQSM: Quantitative Susceptibility Mapping with Octave Convolutional and Noise Regularized Neural Networks

Quantitative susceptibility mapping (QSM) is a valuable magnetic resonance imaging (MRI) contrast mechanism that has demonstrated broad clinical applications. However, the image reconstruction of QSM is challenging due to its ill-posed dipole inversion process. In this study, a new deep learning method for QSM reconstruction, namely xQSM, was designed by introducing modified state-of-the-art octave convolutional layers into the U-net backbone. The xQSM method was compared with recentlyproposed U-net-based and conventional regularizationbased methods, using peak signal to noise ratio (PSNR), structural similarity (SSIM), and region-of-interest measurements. The results from a numerical phantom, a simulated human brain, four in vivo healthy human subjects, a multiple sclerosis patient, a glioblastoma patient, as well as a healthy mouse brain showed that the xQSM led to suppressed artifacts than the conventional methods, and enhanced susceptibility contrast, particularly in the ironrich deep grey matter region, than the original U-net, consistently. The xQSM method also substantially shortened the reconstruction time from minutes using conventional iterative methods to only a few seconds.