Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
94works
0followers
48topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

94 published item(s)

preprint2026arXiv

Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework

Large Language Model (LLM) personalization aims to align model behaviors with individual user preferences. Existing methods often focus on isolated user histories, neglecting the essential role of inter-user differences. We propose C-BPO, a framework that personalizes LLMs via preference-calibrated binary signals. By treating target user data as positive feedback and other users' data as an auxiliary set of implicit negative signals, C-BPO captures distinct inter-user differences. To mitigate the preference overlap issue, where shared task knowledge is erroneously penalized, we derive an objective grounded in Positive-Unlabeled (PU) learning theory. This approach purifies negative signals by subtracting ``positive bias'', ensuring alignment with unique idiosyncrasies without compromising general helpfulness. Empirical experiments across various personalization tasks and backbone LLMs show C-BPO consistently outperforms baselines, demonstrating the efficacy of preference-calibrated binary signals in modeling inter-user differences.

preprint2026arXiv

PPU-Bench:Real World Benchmark for Personalized Partial Unlearning in Vision Language Models

Multimodal Large Language Models (MLLMs) may memorize sensitive cross-modal information during pretraining. However, existing MLLM unlearning benchmarks rely on synthetic knowledge injection or complete subject-level deletion, which fail to capture realistic, personalized deletion requests that require fine-grained factual control. In this paper, we introduce PPU-Bench, a real-world and fine-tuning-free benchmark for personalized partial unlearning in MLLMs. PPU-Bench contains 24K multimodal and unimodal samples derived from pre-existing knowledge of 500 public figures under three progressively challenging settings: Complete, Selective, and Personalized unlearning. The benchmark evaluates whether methods can remove target knowledge while preserving non-target facts, model utility, and cross-modal consistency. Extensive experiments show that Complete Unlearning often suppresses visual identity rather than factual knowledge, while Selective and Personalized Unlearning expose significant forget--retain trade-offs and challenges in intra-subject factual boundaries. Robustness analysis under cross-image and prompt-based attacks reveals distinct vulnerabilities across different unlearning settings. Motivated by these findings, we propose Boundary-Aware Optimization (BAO), which explicitly models intra-subject forget-retain boundaries. Experimental results on two representative methods demonstrate that BAO can effectively enforce intra-subject factual boundaries.

preprint2026arXiv

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs

While recent self-training approaches have reduced reliance on human-labeled data for aligning LLMs, they still face critical limitations: (i) sensitivity to synthetic data quality, leading to instability and bias amplification in iterative training; (ii) ineffective optimization due to a diminishing gap between positive and negative responses over successive training iterations. In this paper, we propose Team-based self-Play with dual Adaptive Weighting (TPAW), a novel self-play algorithm designed to improve alignment in a fully self-supervised setting. TPAW adopts a team-based framework in which the current policy model both collaborates with and competes against historical checkpoints, promoting more stable and efficient optimization. To further enhance learning, we design two adaptive weighting mechanisms: (i) a response reweighting scheme that adjusts the importance of target responses, and (ii) a player weighting strategy that dynamically modulates each team member's contribution during training. Initialized from a SFT model, TPAW iteratively refines alignment without requiring additional human supervision. Experimental results demonstrate that TPAW consistently outperforms existing baselines across various base models and LLM benchmarks. Our code is publicly available at https://github.com/lab-klc/TPAW.

preprint2025arXiv

Physical properties and first-principles calculations of an altermagnet candidate Cs$_{1-δ}$V$_2$Te$_2$O

We report the crystal growth, structure, physical properties, and first-principles calculations of a vanadium-based oxytelluride Cs$_{1-δ}$V$_2$Te$_2$O. The material possesses two-dimensional V$_2$O square nets sandwiched by tellurium layers, with local crystallographic symmetry satisfying the spin symmetry for a $d$-wave altermagnet. An antiferromagnetic transition at 293 K is unambiguously evidenced from the measurements of magnetic susceptibility and specific heat. In addition, a secondary transition at $\sim$70 K is also observed, possibly associated with a Lifshitz transition. The first-principles calculations indicate robust Néel-type collinear antiferromagnetism in the V$_2$O plane. Consequently, spin splittings show up in momentum space, in relation with the real-space mirror/rotation symmetry. Interestingly, the V-$d_{yz}/d_{xz}$ electrons, which primarily contribute the quasi-one-dimensional Fermi surface, turns out to be fully orbital- and spin-polarized, akin to the case of a half metal. Our work lays a solid foundation on the potential applications utilizing altermagnetic properties in vanadium-based oxychalcogenides.

preprint2024arXiv

Distinguishing the Topological Charge of Vortex Beam via Fourier Back Plane Imaging with Chiral Gammadion Structure

In recent years, research on the interaction between Orbital Angular Momentum (OAM) and matter has seen a continuous influx of investigations. OAM possesses distinct properties, such as additional degrees of freedom, vortex characteristics, and topological properties, which expand its applications in optical communication, optical sensing, and optical force. Through experiments involving the interaction of a chiral metal swastika structure with a SAM-OAM beam generated by a q-plate, we have observed a phenomenon does not present in pure SAM beams. Fourier back focal plane (FBP) imaging under SAM beam excitation easily identifies the chirality and geometric properties of the structure. When the SAM-OAM beam excites the structure, FBP not only identifies its chirality and geometric properties but also distinguishes different OAM topological charges and signs, as well as the degree of elliptic polarization. The stokes parametric FBP imaging reveals asymmetric polarization distribution resulting from the interaction between a vortex beam and the chiral structure. Moreover, it clearly reflects the conversion process of SAM to OAM. The experimental results align well with simulation results. These findings hold valuable insights for the advancement of optical information storage and communication using OAM, opening up new possibilities for further exploration in this field.

preprint2023arXiv

Audio2Gestures: Generating Diverse Gestures from Audio

People may perform diverse gestures affected by various mental and physical factors when speaking the same sentences. This inherent one-to-many relationship makes co-speech gesture generation from audio particularly challenging. Conventional CNNs/RNNs assume one-to-one mapping, and thus tend to predict the average of all possible target motions, easily resulting in plain/boring motions during inference. So we propose to explicitly model the one-to-many audio-to-motion mapping by splitting the cross-modal latent code into shared code and motion-specific code. The shared code is expected to be responsible for the motion component that is more correlated to the audio while the motion-specific code is expected to capture diverse motion information that is more independent of the audio. However, splitting the latent code into two parts poses extra training difficulties. Several crucial training losses/strategies, including relaxed motion loss, bicycle constraint, and diversity loss, are designed to better train the VAE. Experiments on both 3D and 2D motion datasets verify that our method generates more realistic and diverse motions than previous state-of-the-art methods, quantitatively and qualitatively. Besides, our formulation is compatible with discrete cosine transformation (DCT) modeling and other popular backbones (\textit{i.e.} RNN, Transformer). As for motion losses and quantitative motion evaluation, we find structured losses/metrics (\textit{e.g.} STFT) that consider temporal and/or spatial context complement the most commonly used point-wise losses (\textit{e.g.} PCK), resulting in better motion dynamics and more nuanced motion details. Finally, we demonstrate that our method can be readily used to generate motion sequences with user-specified motion clips on the timeline.

preprint2023arXiv

Quantifying intra-tumoral genetic heterogeneity of glioblastoma toward precision medicine using MRI and a data-inclusive machine learning algorithm

Glioblastoma (GBM) is one of the most aggressive and lethal human cancers. Intra-tumoral genetic heterogeneity poses a significant challenge for treatment. Biopsy is invasive, which motivates the development of non-invasive, MRI-based machine learning (ML) models to quantify intra-tumoral genetic heterogeneity for each patient. This capability holds great promise for enabling better therapeutic selection to improve patient outcomes. We proposed a novel Weakly Supervised Ordinal Support Vector Machine (WSO-SVM) to predict regional genetic alteration status within each GBM tumor using MRI. WSO-SVM was applied to a unique dataset of 318 image-localized biopsies with spatially matched multiparametric MRI from 74 GBM patients. The model was trained to predict the regional genetic alteration of three GBM driver genes (EGFR, PDGFRA, and PTEN) based on features extracted from the corresponding region of five MRI contrast images. For comparison, a variety of existing ML algorithms were also applied. The classification accuracy of each gene was compared between the different algorithms. The SHapley Additive exPlanations (SHAP) method was further applied to compute contribution scores of different contrast images. Finally, the trained WSO-SVM was used to generate prediction maps within the tumoral area of each patient to help visualize the intra-tumoral genetic heterogeneity. This study demonstrated the feasibility of using MRI and WSO-SVM to enable non-invasive prediction of intra-tumoral regional genetic alteration for each GBM patient, which can inform future adaptive therapies for individualized oncology.

preprint2022arXiv

A Blind All-sky Search for Star Clusters in Gaia EDR3: 886 Clusters within 1.2 kpc of the Sun

Although previous searches for star clusters have been very successful, many clusters are likely still omitted, especially at high Galactic latitude regions. In this work, based on the astrometry of Gaia EDR3, we searched nearby (parallax > 0.8 mas) all-sky regions, obtaining 886 star clusters, of which 270 candidates have not been cataloged before. At the same time, we have presented the physical parameters of the clusters by fitting theoretical isochrones to their optical magnitudes. More halo members and expanding structures in many star clusters were also found. Most of the new objects are young clusters that are less than 100 million years old. Our work greatly increased the sample size and physical parameters of star clusters in the solar neighborhood, in particular, 46 clusters are newly found with |b| > 20 deg, which represents an increase of nearly three fold of cluster numbers at high Galactic latitude regions. The cluster parameters and member stars are available at CDS via https://cdsarc.u-strasbg.fr/ftp/vizier.submit//hezh22b/, and the cluster figure sets are available via https://doi.org/10.12149/101133.

preprint2022arXiv

A Dual-Channel Framework for Sarcasm Recognition by Detecting Sentiment Conflict

Sarcasm employs ambivalence, where one says something positive but actually means negative, and vice versa. The essence of sarcasm, which is also a sufficient and necessary condition, is the conflict between literal and implied sentiments expressed in one sentence. However, it is difficult to recognize such sentiment conflict because the sentiments are mixed or even implicit. As a result, the recognition of sophisticated and obscure sentiment brings in a great challenge to sarcasm detection. In this paper, we propose a Dual-Channel Framework by modeling both literal and implied sentiments separately. Based on this dual-channel framework, we design the Dual-Channel Network~(DC-Net) to recognize sentiment conflict. Experiments on political debates (i.e. IAC-V1 and IAC-V2) and Twitter datasets show that our proposed DC-Net achieves state-of-the-art performance on sarcasm recognition. Our code is released to support research.

preprint2022arXiv

A Knowledge Distillation-Based Backdoor Attack in Federated Learning

Federated Learning (FL) is a novel framework of decentralized machine learning. Due to the decentralized feature of FL, it is vulnerable to adversarial attacks in the training procedure, e.g. , backdoor attacks. A backdoor attack aims to inject a backdoor into the machine learning model such that the model will make arbitrarily incorrect behavior on the test sample with some specific backdoor trigger. Even though a range of backdoor attack methods of FL has been introduced, there are also methods defending against them. Many of the defending methods utilize the abnormal characteristics of the models with backdoor or the difference between the models with backdoor and the regular models. To bypass these defenses, we need to reduce the difference and the abnormal characteristics. We find a source of such abnormality is that backdoor attack would directly flip the label of data when poisoning the data. However, current studies of the backdoor attack in FL are not mainly focus on reducing the difference between the models with backdoor and the regular models. In this paper, we propose Adversarial Knowledge Distillation(ADVKD), a method combine knowledge distillation with backdoor attack in FL. With knowledge distillation, we can reduce the abnormal characteristics in model result from the label flipping, thus the model can bypass the defenses. Compared to current methods, we show that ADVKD can not only reach a higher attack success rate, but also successfully bypass the defenses when other methods fails. To further explore the performance of ADVKD, we test how the parameters affect the performance of ADVKD under different scenarios. According to the experiment result, we summarize how to adjust the parameter for better performance under different scenarios. We also use several methods to visualize the effect of different attack and explain the effectiveness of ADVKD.

preprint2022arXiv

A Multi-Behavior Planning Framework for Robot Guide

The guiding task of a mobile robot requires not only human-aware navigation, but also appropriate yet timely interaction for active instruction. State-of-the-art tour-guide models limit their socially-aware consideration to adapting to users' motion, ignoring the interactive behavior planning to fulfill the communicative demands. We propose a multi-behavior planning framework based on Monte Carlo Tree Search to better assist users to understand confusing scene contexts, select proper paths and timely arrive at the destination. To provide proactive guidance, we construct a sampling-based probability model of human motion to consider the interrelated effects between robots and humans. We validate our method both in simulation and real-world experiments along with performance comparison with state-of-the-art models.

preprint2022arXiv

A physical perturbation based study on the prediction of free-fall disks with chaotic modes in the water

We report a phenomenon that physical perturbations sometimes can benefit the certainty of a free-fall motion with chaotic modes, albeit, as commonly believed, they can ruin it. We statistically compare those factors that may lead to uncertainty, by which we find that the growth of the standard deviation of the landing locations is directly determined by the physical perturbations. A significant yardstick is defined in the meantime. This temporal criterion is of big relevance to the replicability of such problems experimentally, although they are inherently chaotic. Our hypothesis is verified by experiments from other literature. This outcome also provides a practical strategy to evaluate the credible prediction time by estimating the disturbances from physical parameters as a priori.

preprint2022arXiv

Absorbance Enhancement of Monolayer MoS$_2$ in a Perfect Absorbing System

We reveal numerically and experimentally that dielectric resonance can enhance the absorbance and emission of monolayer MoS$_2$. By quantifying the absorbance of the Si disk resonators and the monolayer MoS$_2$ separately, a model taking into account of absorbance as well as quantum efficiency modifications by the dielectric disk resonators successfully explains the observed emission enhancement under the normal light incidence. It is demonstrated that the experimentally observed emission enhancement at different pump wavelength results from the absorbance enhancement, which compensates the emission quenching by the disk resonators. In order to further maximize the absorbance value of monolayer MoS$_2$, a perfect absorbing structure is proposed. By placing a Au mirror beneath the Si nanodisks, the incident electromagnetic power is fully absorbed by the hybrid monolayer MoS$_2$-disk system. It is demonstrated that the electromagnetic power is re-distributed within the hybrid structure and 53\% of the total power is absorbed by the monolayer MoS$_2$ at the perfect absorbing wavelength.

preprint2022arXiv

Adversarial Attacks and Defense Methods for Power Quality Recognition

Vulnerability of various machine learning methods to adversarial examples has been recently explored in the literature. Power systems which use these vulnerable methods face a huge threat against adversarial examples. To this end, we first propose a signal-specific method and a universal signal-agnostic method to attack power systems using generated adversarial examples. Black-box attacks based on transferable characteristics and the above two methods are also proposed and evaluated. We then adopt adversarial training to defend systems against adversarial attacks. Experimental analyses demonstrate that our signal-specific attack method provides less perturbation compared to the FGSM (Fast Gradient Sign Method), and our signal-agnostic attack method can generate perturbations fooling most natural signals with high probability. What's more, the attack method based on the universal signal-agnostic algorithm has a higher transfer rate of black-box attacks than the attack method based on the signal-specific algorithm. In addition, the results show that the proposed adversarial training improves robustness of power systems to adversarial examples.

preprint2022arXiv

An Efficient Coarse-to-Fine Facet-Aware Unsupervised Summarization Framework based on Semantic Blocks

Unsupervised summarization methods have achieved remarkable results by incorporating representations from pre-trained language models. However, existing methods fail to consider efficiency and effectiveness at the same time when the input document is extremely long. To tackle this problem, in this paper, we proposed an efficient Coarse-to-Fine Facet-Aware Ranking (C2F-FAR) framework for unsupervised long document summarization, which is based on the semantic block. The semantic block refers to continuous sentences in the document that describe the same facet. Specifically, we address this problem by converting the one-step ranking method into the hierarchical multi-granularity two-stage ranking. In the coarse-level stage, we propose a new segment algorithm to split the document into facet-aware semantic blocks and then filter insignificant blocks. In the fine-level stage, we select salient sentences in each block and then extract the final summary from selected sentences. We evaluate our framework on four long document summarization datasets: Gov-Report, BillSum, arXiv, and PubMed. Our C2F-FAR can achieve new state-of-the-art unsupervised summarization results on Gov-Report and BillSum. In addition, our method speeds up 4-28 times more than previous methods.\footnote{\url{https://github.com/xnliang98/c2f-far}}

preprint2022arXiv

An inverse random source problem for the Helium production-diffusion equation driven by a fractional Brownian motion

In this paper, we consider the prediction of the helium concentrations as function of a spatially variable source term perturbed by fractional Brownian motion. For the direct problem, we show that it is well-posed and has a unique mild solution under some conditions. For the inverse problem, the uniqueness and the instability are given. In the meanwhile, we determine the statistical properties of the source from the expectation and covariance of the final-time data u(r,T). Finally, numerical implements are given to verify the effectiveness of the proposed reconstruction.

preprint2022arXiv

Analyzing the Intensity of Complaints on Social Media

Complaining is a speech act that expresses a negative inconsistency between reality and human expectations. While prior studies mostly focus on identifying the existence or the type of complaints, in this work, we present the first study in computational linguistics of measuring the intensity of complaints from text. Analyzing complaints from such perspective is particularly useful, as complaints of certain degrees may cause severe consequences for companies or organizations. We create the first Chinese dataset containing 3,103 posts about complaints from Weibo, a popular Chinese social media platform. These posts are then annotated with complaints intensity scores using Best-Worst Scaling (BWS) method. We show that complaints intensity can be accurately estimated by computational models with the best mean square error achieving 0.11. Furthermore, we conduct a comprehensive linguistic analysis around complaints, including the connections between complaints and sentiment, and a cross-lingual comparison for complaints expressions used by Chinese and English speakers. We finally show that our complaints intensity scores can be incorporated for better estimating the popularity of posts on social media.

preprint2022arXiv

Attention-based Multimodal Feature Representation Model for Micro-video Recommendation

In recommender systems, models mostly use a combination of embedding layers and multilayer feedforward neural networks. The high-dimensional sparse original features are downscaled in the embedding layer and then fed into the fully connected network to obtain prediction results. However, the above methods have a rather obvious problem, that is, the features directly input are treated as independent individuals, and in fact there are internal correlations between features and features, and even different features have different importance in the recommendation. In this regard, this paper adopts a self-attentive mechanism to mine the internal correlations between features as well as their relative importance. In recent years, as a special form of attention mechanism, self-attention mechanism is favored by many researchers. The self-attentive mechanism captures the internal correlation of data or features by learning itself, thus reducing the dependence on external sources. Therefore, this paper adopts a multi-headed self-attentive mechanism to mine the internal correlations between features and thus learn the internal representation of features. At the same time, considering the rich information often hidden between features, the new feature representation obtained by crossover between the two is likely to imply the new description of the user likes the item. However, not all crossover features are meaningful, i.e., there is a problem of limited expression of feature combinations. Therefore, this paper adopts an attention-based approach to learn the external cross-representation of features.

preprint2022arXiv

Continuous Sign Language Recognition via Temporal Super-Resolution Network

Aiming at the problem that the spatial-temporal hierarchical continuous sign language recognition model based on deep learning has a large amount of computation, which limits the real-time application of the model, this paper proposes a temporal super-resolution network(TSRNet). The data is reconstructed into a dense feature sequence to reduce the overall model computation while keeping the final recognition accuracy loss to a minimum. The continuous sign language recognition model(CSLR) via TSRNet mainly consists of three parts: frame-level feature extraction, time series feature extraction and TSRNet, where TSRNet is located between frame-level feature extraction and time-series feature extraction, which mainly includes two branches: detail descriptor and rough descriptor. The sparse frame-level features are fused through the features obtained by the two designed branches as the reconstructed dense frame-level feature sequence, and the connectionist temporal classification(CTC) loss is used for training and optimization after the time-series feature extraction part. To better recover semantic-level information, the overall model is trained with the self-generating adversarial training method proposed in this paper to reduce the model error rate. The training method regards the TSRNet as the generator, and the frame-level processing part and the temporal processing part as the discriminator. In addition, in order to unify the evaluation criteria of model accuracy loss under different benchmarks, this paper proposes word error rate deviation(WERD), which takes the error rate between the estimated word error rate (WER) and the reference WER obtained by the reconstructed frame-level feature sequence and the complete original frame-level feature sequence as the WERD. Experiments on two large-scale sign language datasets demonstrate the effectiveness of the proposed model.

preprint2022arXiv

Critiquing-based Modeling of Subjective Preferences

Applications designed for entertainment and other non-instrumental purposes are challenging to optimize because the relationships between system parameters and user experience can be unclear. Ideally, we would crowdsource these design questions, but existing approaches are geared towards evaluation or ranking discrete choices and not for optimizing over continuous parameter spaces. In addition, users are accustomed to informally expressing opinions about experiences as critiques (e.g. it's too cold, too spicy, too big), rather than giving precise feedback as an optimization algorithm would require. Unfortunately, it can be difficult to analyze qualitative feedback, especially in the context of quantitative modeling. In this article, we present collective criticism, a critiquing-based approach for modeling relationships between system parameters and subjective preferences. We transform critiques, such as "it was too easy/too challenging", into censored intervals and analyze them using interval regression. Collective criticism has several advantages over other approaches: "too much/too little"-style feedback is intuitive for users and allows us to build predictive models for the optimal parameterization of the variables being critiqued. We present two studies where we model: (i) aesthetic preferences for images generated with neural style transfer, and (ii) users' experiences of challenge in the video game Tetris. These studies demonstrate the flexibility of our approach, and show that it produces robust results that are straightforward to interpret and inline with users' stated preferences.

preprint2022arXiv

Cross-domain Trajectory Prediction with CTP-Net

Most pedestrian trajectory prediction methods rely on a huge amount of trajectories annotation, which is time-consuming and expensive. Moreover, a well-trained model may not effectively generalize to a new scenario captured by another camera. Therefore, it is desirable to adapt the model trained on an annotated source domain to the target domain. To achieve domain adaptation for trajectory prediction, we propose a Cross-domain Trajectory Prediction Network (CTP-Net). In this framework, encoders are used in both domains to encode the observed trajectories, then their features are aligned by a cross-domain feature discriminator. Further, considering the consistency between the observed and the predicted trajectories, a target domain offset discriminator is utilized to adversarially regularize the future trajectory predictions to be in line with the observed trajectories. Extensive experiments demonstrate the effectiveness of our method on domain adaptation for pedestrian trajectory prediction.

preprint2022arXiv

Decoupled IoU Regression for Object Detection

Non-maximum suppression (NMS) is widely used in object detection pipelines for removing duplicated bounding boxes. The inconsistency between the confidence for NMS and the real localization confidence seriously affects detection performance. Prior works propose to predict Intersection-over-Union (IoU) between bounding boxes and corresponding ground-truths to improve NMS, while accurately predicting IoU is still a challenging problem. We argue that the complex definition of IoU and feature misalignment make it difficult to predict IoU accurately. In this paper, we propose a novel Decoupled IoU Regression (DIR) model to handle these problems. The proposed DIR decouples the traditional localization confidence metric IoU into two new metrics, Purity and Integrity. Purity reflects the proportion of the object area in the detected bounding box, and Integrity refers to the completeness of the detected object area. Separately predicting Purity and Integrity can divide the complex mapping between the bounding box and its IoU into two clearer mappings and model them independently. In addition, a simple but effective feature realignment approach is also introduced to make the IoU regressor work in a hindsight manner, which can make the target mapping more stable. The proposed DIR can be conveniently integrated with existing two-stage detectors and significantly improve their performance. Through a simple implementation of DIR with HTC, we obtain 51.3% AP on MS COCO benchmark, which outperforms previous methods and achieves state-of-the-art.

preprint2022arXiv

Designer magnetic topological graphene nanoribbons

The interplay of magnetism and topology lies at the heart of condensed matter physics, which offers great opportunities to design intrinsic magnetic topological materials hosting a variety of exotic topological quantum states including the quantum anomalous Hall effect (QAHE), axion insulator state, and Majorana bound states. Extending this concept to one-dimension (1D) systems offers additional rich quantum spin physics with great promise for molecular-scale spintronics. Despite recent progress in the discovery of symmetry-protected topological quantum phases in 1D graphene nanoribbons (GNRs), the rational design and realization of magnetic topological GNRs (MT-GNRs) represents a grand challenge, as one must tackle multiple dimensions of complexity including time-reversal symmetry (TRS), spatial symmetry (width, edge, end geometry) and many-electron correlations. Here, we devised a new route involving the real- and reciprocal-space descriptions by unifying the chemists and physicists perspectives, for the design of such MT-GNRs with non-trivial electronic topology and robust magnetic terminal. Classic Clar's rule offers a conceptually qualitative real-space picture to predict the transition from closed-shell to open-shell with terminal magnetism, and band gap reopening with possible non-trivial electronic topology in a series of wave-like GNRs, which are further verified by first principle calculations of band-structure topology in a momentum-space. With the advance of on-surface synthesis and careful design of molecular precursors, we have fabricated these MT-GNRs with observation of topological edge bands, whose terminal pi-magnetism can be directly captured using a single-nickelocene spin sensor. Moreover, the transition from strong anti-ferromagnetic to weak coupling (paramagnetism-like) between terminal spins can be controlled by tuning the length of MT-GNRs.

preprint2022arXiv

Discovery of extended structure around open cluster COIN-Gaia 13 based on Gaia EDR3

COIN-Gaia 13 is a newly discovered open cluster revealed by Gaia DR2 data. It is a nearby open cluster with a distance of about 513 pc. Combined with the five-dimensional astrometric data of Gaia EDR3 with higher accuracy, we use the membership assignment algorithm (pyUPMASK) to determine the membership of COIN-Gaia 13 in a large extended spatial region. The cluster has found 478 candidate members. After obtaining reliable cluster members, we further study its basic properties and spatial distribution. Our results show that there is an obvious extended structure of the cluster in the X-Y plane. This elongated structure is distributed along the spiral arm, and the whole length is about 270 pc. The cluster age is 250 Myr, the total mass is about 439 M$_\odot$, and the tidal radius of the cluster is about 11 pc. Since more than half of the member stars (352 stars) are located outside twice the tidal radius, it is suspected that this cluster is undergoing the dynamic dissolution process. Furthermore, the spatial distribution and kinematic analysis indicate that the extended structure in COIN-Gaia 13 is more likely to be caused by the differential rotation of the Galaxy.

preprint2022arXiv

Doctor Recommendation in Online Health Forums via Expertise Learning

Huge volumes of patient queries are daily generated on online health forums, rendering manual doctor allocation a labor-intensive task. To better help patients, this paper studies a novel task of doctor recommendation to enable automatic pairing of a patient to a doctor with relevant expertise. While most prior work in recommendation focuses on modeling target users from their past behavior, we can only rely on the limited words in a query to infer a patient's needs for privacy reasons. For doctor modeling, we study the joint effects of their profiles and previous dialogues with other patients and explore their interactions via self-learning. The learned doctor embeddings are further employed to estimate their capabilities of handling a patient query with a multi-head attention mechanism. For experiments, a large-scale dataset is collected from Chunyu Yisheng, a Chinese online health forum, where our model exhibits the state-of-the-art results, outperforming baselines only consider profiles and past dialogues to characterize a doctor.

preprint2022arXiv

Domain Invariant Model with Graph Convolutional Network for Mammogram Classification

Due to its safety-critical property, the image-based diagnosis is desired to achieve robustness on out-of-distribution (OOD) samples. A natural way towards this goal is capturing only clinically disease-related features, which is composed of macroscopic attributes (e.g., margins, shapes) and microscopic image-based features (e.g., textures) of lesion-related areas. However, such disease-related features are often interweaved with data-dependent (but disease irrelevant) biases during learning, disabling the OOD generalization. To resolve this problem, we propose a novel framework, namely Domain Invariant Model with Graph Convolutional Network (DIM-GCN), which only exploits invariant disease-related features from multiple domains. Specifically, we first propose a Bayesian network, which explicitly decomposes the latent variables into disease-related and other disease-irrelevant parts that are provable to be disentangled from each other. Guided by this, we reformulate the objective function based on Variational Auto-Encoder, in which the encoder in each domain has two branches: the domain-independent and -dependent ones, which respectively encode disease-related and -irrelevant features. To better capture the macroscopic features, we leverage the observed clinical attributes as a goal for reconstruction, via Graph Convolutional Network (GCN). Finally, we only implement the disease-related features for prediction. The effectiveness and utility of our method are demonstrated by the superior OOD generalization performance over others on mammogram benign/malignant diagnosis.

preprint2022arXiv

Estimating accurate reddening values of LAMOST M dwarfs

M dwarfs are the dominating type of stars in the solar neighbourhood. They serve as excellent tracers for the study of the distribution and properties of the nearby interstellar dust. In this work, we aim to obtain high accuracy reddening values of M dwarf stars from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Data Release 8 (DR8). Combining the LAMOST spectra with the high-quality optical photometry from the Gaia Early Data Release 3 (Gaia EDR3), we have estimated the reddening values $E(G_{\rm BP}-G_{\rm RP})$ of 641,426 M dwarfs with the machine-learning algorithm Random Forest regression. The typical reddening uncertainty is only 0.03 mag in $E(G_{\rm BP}-G_{\rm RP})$. We have obtained the reddening coefficient $R_{(G_{\rm BP}-G_{\rm RP})}$, which is a function of the stellar intrinsic colour $(G_{\rm BP}-G_{\rm RP})_0$ and reddening value $E(B-V)$. The values of $E(B-V)$ are also provided for the individual stars in our catalogue. Our resultant high accuracy reddening values of M dwarfs, combined with the Gaia parallaxes, will be very powerful to map the fine structures of the dust in the solar neighbourhood.

preprint2022arXiv

Fermi Observations of GRB 220426A: a burst similar to GRB 090902B

We report on a very bright, long-duration gamma-ray burst (GRB), GRB~220426A, observed by \emph{Fermi} satellite. GRB~220426A with total duration of $T_{90}=6$~s is composed with two main pulses and some sub-peaks. The spectral analysis of this burst with Band function reveals that both the time-integrated and the time-resolved spectra are very narrow with high $α\gtrsim 0.2$ and low $β\lesssim -3.1$. It is strong reminiscent of GRB~090902B, a special GRB with identification of the photospheric emission. Then, we perform the spectral analysis of this burst based on a non-dissipated photospheric emission, which can be well modelled as the multicolor-blackbody with a cutoff power-law distribution of the thermal temperature. The spectral fittings reveal that the photospheric emission can well describe the radiation spectrum of this burst. We conclude that this burst would be a second burst in the class of GRB~090902B observed by \emph{Fermi} satellite. We also discuss the physics of photosphere and the origin of the high-energy component in GRB~220426A .

preprint2022arXiv

Global Classical Solutions to the Full Compressible Navier-Stokes System in 3D Exterior Domains

The full compressible Navier-Stokes system (FNS) describing the motion of a viscous, compressible, heat-conductive, and Newtonian polytropic fluid in a three-dimensional (3D) exterior domain is studied. For the initial-boundary-value problem with the slip boundary conditions on the velocity and the Neumann one on the temperature, it is shown that there exists a unique global classical solutions with the initial data which are of small energy but possibly large oscillations. In particular, both the density and temperature are allowed to vanish initially. This is the first result about classical solutions of FNS system in 3D exterior domain.

preprint2022arXiv

Global Existence of Classical Solutions to Full Compressible Navier-Stokes System with Large Oscillations and Vacuum in 3D Bounded Domains

The full compressible Navier-Stokes system describing the motion of a viscous, compressible, heat-conductive, and Newtonian polytropic fluid is studied in a three-dimensional simply connected bounded domain with smooth boundary having a finite number of two-dimensional connected components. For the initial-boundary-value problem with slip boundary conditions on the velocity and Neumann boundary one on the temperature, the global existence of classical and weak solutions which are of small energy but possibly large oscillations is established. In particular, both the density and temperature are allowed to vanish initially. Finally, the exponential stability of the density, velocity, and temperature is also obtained. Moreover, it is shown that for the classical solutions, the oscillation of the density will grow unboundedly in the long run with an exponential rate provided vacuum appears (even at a point) initially. This is the first result concerning the global existence of classical solutions to the full compressible Navier-Stokes equations with vacuum in general three-dimensional bounded smooth domains.

preprint2022arXiv

Hubble Space Telescope Observations of Active Asteroid P/2020 O1 (Lemmon-PANSTARRS)

We present Hubble Space Telescope observations of active asteroid P/2020 O1 taken to examine its development for a year after perihelion. We find that the mass loss peaks <~1 kg/s in 2020 August and then declines to nearly zero over four months. Long-duration mass loss (~180 days) is consistent with a sublimation origin, indicating that this object is likely an ice-bearing main-belt comet. Equilibrium sublimation of water ice from an area as small as 1580 m^2 can supply the observed mass loss. Time-series photometry shows tentative evidence for extremely rapid rotation (double-peaked period < 2 hr) of the small nucleus (effective radius ~420 m). Ejection velocities of 0.1 mm particles are comparable to the 0.3 m/s gravitational escape speed from the nucleus, while larger particles are ejected at speeds less than the escape velocity. These properties are consistent with the sublimation of near-surface ice aided by centripetal acceleration. If water ice sublimation is confirmed, P/2020 O1 would be the icy asteroid with the smallest semimajor axis (highest temperature), setting new bounds on the distribution of ice in the asteroid belt.

preprint2022arXiv

Many-body exciton and inter-valley correlations in heavily electron-doped WSe$_2$ monolayers

In monolayer transition-metal dichalcogenide semiconductors, many-body correlations can manifest in optical spectra when photoexcited electron-hole pairs (excitons) are introduced into a 2D Fermi sea of mobile carriers. At low carrier densities, the formation of positively and negatively charged excitons ($X^\pm$) is well documented. However, in WSe$_2$ monolayers, an additional absorption resonance, often called $X^{-\prime}$, emerges at high electron density. Its origin is not understood. Here we investigate the $X^{-\prime}$ state via polarized absorption spectroscopy of electrostatically-gated WSe$_2$ monolayers in high magnetic fields to 60~T. Field-induced filling and emptying of the lowest optically-active Landau level in the $K&#39;$ valley causes repeated quenching of the corresponding optical absorption. Surprisingly, however, these quenchings are accompanied by absorption changes to higher-lying Landau levels in both $K&#39;$ and $K$ valleys, which are unoccupied. These results cannot be reconciled within a single-particle picture, and demonstrate the many-body nature and inter-valley correlations of the $X^{-\prime}$ quasiparticle state.

preprint2022arXiv

Mobility Support for Millimeter Wave Communications: Opportunities and Challenges

Millimeter-wave (mmWave) communication technology offers a potential and promising solution to support 5G and B5G wireless networks in dynamic scenarios and applications. However, mobility introduces many challenges as well as opportunities to mmWave applications. To address these problems, we conduct a survey of the opportunities and technologies to support mmWave communications in mobile scenarios. Firstly, we summarize the mobile scenarios where mmWave communications are exploited, including indoor wireless local area network (WLAN) or wireless personal area network (WPAN), cellular access, vehicle-to-everything (V2X), high speed train (HST), unmanned aerial vehicle (UAV), and the new space-air-ground-sea communication scenarios. Then, to address users&#39; mobility impact on the system performance in different application scenarios, we introduce several representative mobility models in mmWave systems, including human mobility, vehicular mobility, high speed train mobility and ship mobility. Next we survey the key challenges and existing solutions to mmWave applications, such as channel modeling, channel estimation, anti-blockage, and capacity improvement. Lastly, we discuss the open issues concerning mobility-aware mmWave communications that deserve further investigation. In particular, we highlight future heterogeneous mobile networks, dynamic resource management, artificial intelligence (AI) for mobility and integration of geographical information, deployment of large intelligent surface and reconfigurable antenna technology, and finally, the evolution to Terahertz (THz) communications.

preprint2022arXiv

Multi-scale temporal network for continuous sign language recognition

Continuous Sign Language Recognition (CSLR) is a challenging research task due to the lack of accurate annotation on the temporal sequence of sign language data. The recent popular usage is a hybrid model based on &#34;CNN + RNN&#34; for CSLR. However, when extracting temporal features in these works, most of the methods using a fixed temporal receptive field and cannot extract the temporal features well for each sign language word. In order to obtain more accurate temporal features, this paper proposes a multi-scale temporal network (MSTNet). The network mainly consists of three parts. The Resnet and two fully connected (FC) layers constitute the frame-wise feature extraction part. The time-wise feature extraction part performs temporal feature learning by first extracting temporal receptive field features of different scales using the proposed multi-scale temporal block (MST-block) to improve the temporal modeling capability, and then further encoding the temporal features of different scales by the transformers module to obtain more accurate temporal features. Finally, the proposed multi-level Connectionist Temporal Classification (CTC) loss part is used for training to obtain recognition results. The multi-level CTC loss enables better learning and updating of the shallow network parameters in CNN, and the method has no parameter increase and can be flexibly embedded in other models. Experimental results on two publicly available datasets demonstrate that our method can effectively extract sign language features in an end-to-end manner without any prior knowledge, improving the accuracy of CSLR and achieving competitive results.

preprint2022arXiv

On genuine entanglement for tripartite systems

We investigate the genuine entanglement in tripartite systems based on partial transposition and the norm of correlation tensors of the density matrices. We first derive an analytical sufficient criterion to detect genuine entanglement of tripartite qubit quantum states combining with the partial transposition of the density matrices. Then we use the norm of correlation tensors to study genuine entanglement for tripartite qudit quantum states and obtain a genuine entanglement criterion by constructing certain matrices. With detailed examples our results are seen to be able to detect more genuine tripartite entangled states than previous studies.

preprint2022arXiv

p-orbital disclination states in non-Euclidean geometries

Disclinations are ubiquitous lattice defects existing in almost all crystalline materials. In two-dimensional nanomaterials, disclinations lead to the warping and deformation of the hosting material, yielding non-Euclidean geometries. However, such geometries have never been investigated experimentally in the context of topological phenomena. Here, by creating the physical realization of disclinations in conical and saddle-shaped acoustic systems, we demonstrate that disclinations can lead to topologically protected bound modes in non-Euclidean surfaces. In the designed honeycomb sonic crystal for p-orbital acoustic waves, non-Euclidean geometry interplay with the p-orbital physics and the band topology, showing intriguing emergent features as confirmed by consistent experiments and simulations. Our study opens a pathway towards topological phenomena in non-Euclidean geometries that may inspire future studies on, e.g., electrons and phonons in nanomaterials with curved surfaces.

preprint2022arXiv

Pasta phases in neutron stars under strong magnetic fields

In the present work, we consider nuclear matter in the innermost crust of neutron stars under the presence of a strong magnetic field within the framework of a relativistic mean-field description. Two models with a different slope of the symmetry energy are considered in order to discuss the density-dependence of the equation of state on the crust structure. The non-homogeneous matter in $β$-equilibrium is described within the coexisting phases method, and the effect of including the anomalous magnetic moment is discussed. Five different geometries for the pasta structures are considered. It is shown that strong magnetic fields cause an extension of the inner crust of the neutron stars, with the occurrence of a series of disconnected non-homogeneous matter regions above the one existing for a null magnetic field. Moreover, we observed that in these disconnected regions, for some values of the magnetic field, all five different cluster geometrical shapes occur, and the gas density is close to the cluster density. Also, the pressure at the neutron star crust-core transition much larger than the pressure obtained for a zero magnetic field. Another noticeable effect of the presence of strong magnetic fields is the increase of the proton fraction, favoring the appearance of protons in the gas background.

preprint2022arXiv

Performance evaluation of baseline-dependent averaging based onfull-scale SKA1-LOW simulation

The Square Kilometre Array (SKA) is the largest radio interferometer under construction in the world. Its immense amount of visibility data poses a considerable challenge to the subsequent processing by the science data processor (SDP). Baseline dependent averaging (BDA), which reduces the amount of visibility data based on the baseline distribution of the radio interferometer, has become a focus of SKA SDP development. This paper developed and implemented a full-featured BDA module based on Radio Astronomy Simulation, Calibration and Imaging Library (RASCIL). Simulated observations were then performed with RASCIL based on a full-scale SKA1-LOW configuration. The performance of the BDA was systematically investigated and evaluated based on the simulated data. The experimental results presented that the amount of visibility data is reduced by about 50\% to 85\% for different time intervals ($Δt_{max}$). In addition, different $Δt_{max}$ have a significant effect on the imaging quality. The smaller the $Δt_{max}$, the smaller the degradation of the imaging quality.

preprint2022arXiv

Quantum control and quantum speed limits in supersymmetric potentials

Supersymmetry allows one to build a hierarchy of Hamiltonians that share the same spectral properties and which are pairwise connected through common superpotentials. The iso-spectral properties of these Hamiltonians imply that the dynamics and therefore control of different eigenstates are connected through supersymmetric intertwining relations. In this work we explore how this enables one to study general dynamics, shortcuts to adiabaticity (STA) and quantum speed limits for distinct states of different supersymmetric partner potentials by using the infinite box as an example.

preprint2022arXiv

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Online and offline handwritten Chinese text recognition (HTCR) has been studied for decades. Early methods adopted oversegmentation-based strategies but suffered from low speed, insufficient accuracy, and high cost of character segmentation annotations. Recently, segmentation-free methods based on connectionist temporal classification (CTC) and attention mechanism, have dominated the field of HCTR. However, people actually read text character by character, especially for ideograms such as Chinese. This raises the question: are segmentation-free strategies really the best solution to HCTR? To explore this issue, we propose a new segmentation-based method for recognizing handwritten Chinese text that is implemented using a simple yet efficient fully convolutional network. A novel weakly supervised learning method is proposed to enable the network to be trained using only transcript annotations; thus, the expensive character segmentation annotations required by previous segmentation-based methods can be avoided. Owing to the lack of context modeling in fully convolutional networks, we propose a contextual regularization method to integrate contextual information into the network during the training stage, which can further improve the recognition performance. Extensive experiments conducted on four widely used benchmarks, namely CASIA-HWDB, CASIA-OLHWDB, ICDAR2013, and SCUT-HCCDoc, show that our method significantly surpasses existing methods on both online and offline HCTR, and exhibits a considerably higher inference speed than CTC/attention-based approaches.

preprint2022arXiv

Rethinking Attention-Model Explainability through Faithfulness Violation Test

Attention mechanisms are dominating the explainability of deep models. They produce probability distributions over the input, which are widely deemed as feature-importance indicators. However, in this paper, we find one critical limitation in attention explanations: weakness in identifying the polarity of feature impact. This would be somehow misleading -- features with higher attention weights may not faithfully contribute to model predictions; instead, they can impose suppression effects. With this finding, we reflect on the explainability of current attention-based techniques, such as Attentio$\odot$Gradient and LRP-based attention explanations. We first propose an actionable diagnostic methodology (henceforth faithfulness violation test) to measure the consistency between explanation weights and the impact polarity. Through the extensive experiments, we then show that most tested explanation methods are unexpectedly hindered by the faithfulness violation issue, especially the raw attention. Empirical analyses on the factors affecting violation issues further provide useful observations for adopting explanation methods in attention models.

preprint2022arXiv

Saliency in Augmented Reality

With the rapid development of multimedia technology, Augmented Reality (AR) has become a promising next-generation mobile platform. The primary theory underlying AR is human visual confusion, which allows users to perceive the real-world scenes and augmented contents (virtual-world scenes) simultaneously by superimposing them together. To achieve good Quality of Experience (QoE), it is important to understand the interaction between two scenarios, and harmoniously display AR contents. However, studies on how this superimposition will influence the human visual attention are lacking. Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR. Specifically, we first construct a Saliency in AR Dataset (SARD), which contains 450 BG images, 450 AR images, as well as 1350 superimposed images generated by superimposing BG and AR images in pair with three mixing levels. A large-scale eye-tracking experiment among 60 subjects is conducted to collect eye movement data. To better predict the saliency in AR, we propose a vector quantized saliency prediction method and generalize it for AR saliency prediction. For comparison, three benchmark methods are proposed and evaluated together with our proposed method on our SARD. Experimental results demonstrate the superiority of our proposed method on both of the common saliency prediction problem and the AR saliency prediction problem over benchmark methods. Our dataset and code are available at: https://github.com/DuanHuiyu/ARSaliency.

preprint2022arXiv

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Whiting 1 is a faint and young globular cluster in the halo of the Milky Way, and was suggested to have originated in the Sagittarius spherical dwarf galaxy (Sgr dSph). In this paper, we use the deep DESI Legacy Imaging Surveys to explore tentative spatial connection between Whiting 1 and the Sgr dSph. We redetermine the fundamental parameters of Whiting 1 and use the best-fitting isochrone (age $τ$=6.5 Gyr, metalicity Z=0.005 and $\rm d_{\odot}$=26.9 kpc) to construct a theoretical matched filter for the extra-tidal features searching. Without any smooth technique to the matched filter density map, we detect a round-shape feature with possible leading and trailing tails on either side of the cluster. This raw image is not totally new compared to old discoveries, but confirms that no more large-scale features can be detected under a depth of r<=22.5 mag. In our results, the whole feature stretches 0.1-0.2 degree along the orbit of Whiting 1, which gives a much larger area than the cluster core. The tails on both sides of the cluster align along the orbital direction of the Sgr dSph as well as the cluster itself, which implies that these debris are probably stripped remnants of Whiting 1 by the Milky Way.

preprint2022arXiv

Single-stage Rotate Object Detector via Two Points with Solar Corona Heatmap

Oriented object detection is a crucial task in computer vision. Current top-down oriented detection methods usually directly detect entire objects, and not only neglecting the authentic direction of targets, but also do not fully utilise the key semantic information, which causes a decrease in detection accuracy. In this study, we developed a single-stage rotating object detector via two points with a solar corona heatmap (ROTP) to detect oriented objects. The ROTP predicts parts of the object and then aggregates them to form a whole image. Herein, we meticulously represent an object in a random direction using the vertex, centre point with width, and height. Specifically, we regress two heatmaps that characterise the relative location of each object, which enhances the accuracy of locating objects and avoids deviations caused by angle predictions. To rectify the central misjudgement of the Gaussian heatmap on high-aspect ratio targets, we designed a solar corona heatmap generation method to improve the perception difference between the central and non-central samples. Additionally, we predicted the vertex relative to the direction of the centre point to connect two key points that belong to the same goal. Experiments on the HRSC 2016, UCASAOD, and DOTA datasets show that our ROTP achieves the most advanced performance with a simpler modelling and less manual intervention.

preprint2022arXiv

SPTS: Single-Point Text Spotting

Existing scene text spotting (i.e., end-to-end text detection and recognition) methods rely on costly bounding box annotations (e.g., text-line, word-level, or character-level bounding boxes). For the first time, we demonstrate that training scene text spotting models can be achieved with an extremely low-cost annotation of a single-point for each instance. We propose an end-to-end scene text spotting method that tackles scene text spotting as a sequence prediction task. Given an image as input, we formulate the desired detection and recognition results as a sequence of discrete tokens and use an auto-regressive Transformer to predict the sequence. The proposed method is simple yet effective, which can achieve state-of-the-art results on widely used benchmarks. Most significantly, we show that the performance is not very sensitive to the positions of the point annotation, meaning that it can be much easier to be annotated or even be automatically generated than the bounding box that requires precise positions. We believe that such a pioneer attempt indicates a significant opportunity for scene text spotting applications of a much larger scale than previously possible. The code is available at https://github.com/shannanyinxiang/SPTS.

preprint2022arXiv

Successful New-entry Prediction for Multi-Party Online Conversations via Latent Topics and Discourse Modeling

With the increasing popularity of social media, online interpersonal communication now plays an essential role in people&#39;s everyday information exchange. Whether and how a newcomer can better engage in the community has attracted great interest due to its application in many scenarios. Although some prior works that explore early socialization have obtained salient achievements, they are focusing on sociological surveys based on the small group. To help individuals get through the early socialization period and engage well in online conversations, we study a novel task to foresee whether a newcomer&#39;s message will be responded to by other participants in a multi-party conversation (henceforth Successful New-entry Prediction). The task would be an important part of the research in online assistants and social media. To further investigate the key factors indicating such engagement success, we employ an unsupervised neural network, Variational Auto-Encoder (VAE), to examine the topic content and discourse behavior from newcomer&#39;s chatting history and conversation&#39;s ongoing context. Furthermore, two large-scale datasets, from Reddit and Twitter, are collected to support further research on new-entries. Extensive experiments on both Twitter and Reddit datasets show that our model significantly outperforms all the baselines and popular neural models. Additional explainable and visual analyses on new-entry behavior shed light on how to better join in others&#39; discussions.

preprint2022arXiv

Testing the seesaw mechanisms via displaced right-handed neutrinos from a light scalar at the HL-LHC

We investigate the pair production of right-handed neutrinos from the decay of a light $B-L$ scalar in the $U(1)_{B-L}$ model. The $B-L$ scalar mixes to the SM Higgs, and the physical scalar is required to be lighter than the observed Higgs. The produced right-handed neutrinos are predicted to be long-lived according to the type-I seesaw mechanism, and yield potentially distinct signatures such as displaced vertex and time-delayed leptons at the CMS/ATLAS/LHCb, as well as signatures at the far detectors including the CODEX-b, FACET, FASER, MoEDAL-MAPP and MATHUSLA. We analyze the sensitivity reach at the HL-LHC for the right-handed neutrinos with masses of 2.5 $\sim$ 30 GeV, showing that the active-sterile mixing to muons can be probed to $V_{μN} \sim 10^{-5}$ at the CMS/ATLAS/LHCb using the displaced vertex searches, and one magnitude lower at the MATHUSLA/CMS using time-delayed leptons searches, reaching the parameter space interesting for type-I seesaw mechanisms.

preprint2022arXiv

Topological equivalence canonical forms for linear multivariable systems without control

In this paper, we discuss the classification problem for linear time-invariant multivariable systems without control. It turns out that the observability and stability are invariant for topological equivalent systems. Abstract results concerning system decomposition according to eigenvalues and observability are obtained. Finally, as concrete examples, the topological equivalence canonical forms for a three dimensional system equipped with a scalar observation are presented explicitly.

preprint2022arXiv

TVShowGuess: Character Comprehension in Stories as Speaker Guessing

We propose a new task for assessing machines&#39; skills of understanding fictional characters in narrative stories. The task, TVShowGuess, builds on the scripts of TV series and takes the form of guessing the anonymous main characters based on the backgrounds of the scenes and the dialogues. Our human study supports that this form of task covers comprehension of multiple types of character persona, including understanding characters&#39; personalities, facts and memories of personal experience, which are well aligned with the psychological and literary theories about the theory of mind (ToM) of human beings on understanding fictional characters during reading. We further propose new model architectures to support the contextualized encoding of long scene texts. Experiments show that our proposed approaches significantly outperform baselines, yet still largely lag behind the (nearly perfect) human performance. Our work serves as a first step toward the goal of narrative character comprehension.

preprint2021arXiv

Accurate predictions of chaotic motion of a free fall disk

It is important to know the accurate trajectory of a free fall object in fluid (such as a spacecraft), whose motion might be chaotic in many cases. However, it is impossible to accurately predict its chaotic trajectory in a long enough duration by traditional numerical algorithms in double precision. In this paper, we give the accurate predictions of the same problem by a new strategy, namely the Clean Numerical Simulation (CNS). Without loss of generality, a free fall disk in water is considered, whose motion is governed by the Andersen-Pesavento-Wang model. We illustrate that convergent and reliable trajectories of a chaotic free fall disk in a long enough interval of time can be obtained by means of the CNS, but different traditional algorithms in double precision give disparate trajectories. Besides, unlike the traditional algorithms in double precision, the CNS can predict the accurate posture of the free fall disk near the vicinity of the bifurcation point of some physical parameters in a long duration. Therefore, the CNS can provide reliable prediction of chaotic systems in a long enough interval of time.

preprint2021arXiv

Global Existence of Strong and Weak Solutions to 2D Compressible Navier-Stokes System in Bounded Domains with Large Data and Vacuum

We study the barotropic compressible Navier-Stokes system where the shear viscosity is a positive constant and the bulk one proportional to a power of the density with the power bigger than one and a third. The system is subject to the Navier-slip boundary conditions in a general two-dimensional bounded simply connected domain. For initial density allowed to vanish, we establish the global existence of strong and weak solutions without any restrictions on the size of initial value. To get over the difficulties brought by boundary, on the one hand, we apply Riemann mapping theorem and the pull back Green&#39;s function method to get a pointwise representation of the effective viscous flux. On the other hand, observing that the orthogonality is preserved under conformal mapping due to its preservation on the angle, we use the slip boundary conditions to reduce the integral representation to the desired commutator form whose singularities can be cancelled out by using the estimates on the spatial gradient of the velocity.

preprint2021arXiv

Hydrogen molecule spectrum by many-body GW and Bethe-Salpeter equation

We check the ab initio GW approximation and Bethe-Salpeter equation (BSE) many-body methodology against the exact solution benchmark of the hydrogen molecule H$_2$ ground state and excitation spectrum, and in comparison with the configuration interaction (CI) and time-dependent Hartree-Fock methods. The comparison is made on all the states we could unambiguously identify from the excitonic wave functions&#39; symmetry. At the equilibrium distance $R = 1.4 \, a_0$, the GW+BSE energy levels are in good agreement with the exact results, with an accuracy of 0.1~0.2 eV. GW+BSE potential-energy curves are also in good agreement with the CI and the exact result up to $2.3 \, a_0$. The solution no longer exists beyond $3.0 \, a_0$ for triplets ($4.3 \, a_0$ for singlets) due to instability of the ground state. We tried to improve the GW reference ground state by a renormalized random-phase approximation (r-RPA), but this did not solve the problem.

preprint2021arXiv

Mimicing the Kane-Mele type spin orbit interaction by spin-flexual phonon coupling in graphene devices

On the efforts of enhancing the spin orbit interaction (SOI) of graphene for seeking the dissipationless quantum spin Hall devices, unique Kane-Mele type SOI and high mobility samples are desired. However, common external decoration often introduces extrinsic Rashba-type SOI and simultaneous impurity scattering. Here we show, by the EDTA-Dy molecule decorating, the Kane-Mele type SOI is mimicked with even improved carrier mobility. It is evidenced by the suppressed weak localization at equal carrier densities and simultaneous Elliot-Yafet spin relaxation. The extracted spin scattering time is monotonically dependent on the carrier elastic scattering time, where the Elliot-Yafet plot gives the interaction strength of 3.3 meV. Improved quantum Hall plateaus can be even seen after the external operation. This is attributed to the spin-flexural phonon coupling induced by the enhanced graphene ripples, as revealed by the in-plane magnetotransport measurement.

preprint2021arXiv

Minority Oversampling for Imbalanced Time Series Classification

Many important real-world applications involve time-series data with skewed distribution. Compared to conventional imbalance learning problems, the classification of imbalanced time-series data is more challenging due to high dimensionality and high inter-variable correlation. This paper proposes a structure preserving Oversampling method to combat the High-dimensional Imbalanced Time-series classification (OHIT). OHIT first leverages a density-ratio based shared nearest neighbor clustering algorithm to capture the modes of minority class in high-dimensional space. It then for each mode applies the shrinkage technique of large-dimensional covariance matrix to obtain accurate and reliable covariance structure. Finally, OHIT generates the structure-preserving synthetic samples based on multivariate Gaussian distribution by using the estimated covariance matrices. Experimental results on several publicly available time-series datasets (including unimodal and multimodal) demonstrate the superiority of OHIT against the state-of-the-art oversampling algorithms in terms of F1, G-mean, and AUC.

preprint2021arXiv

Quantum Gaussian process regression

In this paper, a quantum algorithm based on gaussian process regression model is proposed. The proposed quantum algorithm consists of three sub-algorithms. One is the first quantum subalgorithm to efficiently generate mean predictor. The improved HHL algorithm is proposed to obtain the sign of outcomes. Therefore, the terrible situation that results is ambiguous in terms of original HHL algorithm is avoided, which makes whole algorithm more clear and exact. The other is to product covariance predictor with same method. Thirdly, the squared exponential covariance matrices are prepared that annihilation operator and generation operator are simulated by the unitary linear decomposition Hamiltonian simulation and kernel function vectors is generated with blocking coding techniques on covariance matrices. In addition, it is shown that the proposed quantum gaussian process regression algorithm can achieve quadratic faster over the classical counterpart.

preprint2021arXiv

Spatial Curvature and Large Scale Lorentz Violation

The tension between the Hubble constant obtained from the local measurements and from cosmic microwave background (CMB) measurements motivated us to consider the cosmological model beyond $Λ$CDM one. We investigate the cosmology in the large scale Lorentz violation model with non-vanishing spatial curvature. The degeneracy among spatial curvature, cosmological constant and cosmological contortion distribution makes the model viable in describing the known observation date. We get some constraints on the spatial curvature by the comparison of the relation between measured distance modulus and red-shift with the predicted one, the evolution of matter density over time and the evolution of effective cosmological constant. The performance of large scale Lorentz violation model with non-vanishing spatial curvature under these constrains is discussed.

preprint2020arXiv

A Feshbach engine in the Thomas-Fermi regime

Bose-Einstein condensates can be used to produce work by tuning the strength of the interparticle interactions with the help of Feshbach resonances. In inhomogeneous potentials, these interaction ramps change the volume of the trapped gas allowing one to create a thermodynamic cycle known as the Feshbach engine. However, in order to obtain a large power output, the engine strokes must be performed on a short timescale, which is in contrast with the fact that the efficiency of the engine is reduced by irreversible work if the strokes are done in a non-adiabatic fashion. Here we investigate how such an engine can be run in the Thomas-Fermi regime and present a shortcut to adiabaticity that minimizes the irreversible work and allows for efficient engine operation.

preprint2020arXiv

A Resource Allocation and Coordinated Transmission Scheme for Large Cellular Networks

With the increasing number of user equipment (UE) and data demands, denser access points (APs) are being employed. Resource allocation problems have been extensively researched with interference treated as noise. It is well understood that the overall spectral efficiency can be significantly improved if multiple terminals coordinate their transmissions either coherently or non-coherently.The focus of this paper is to study how to select pairs of APs for coordination and allocate resources accordingly. An optimization problem is formulated to maximize the network utility function by optimizing AP paring, spectrum allocation, user association, and power management in a flexible manner. A scalable and efficient algorithm is proposed based on iterative scheme pursuit and fractional programming. Numerical results demonstrate substantial gains in the network utility due to coordinated transmission in a network of 128 APs and 384 UE.

preprint2020arXiv

An Attention Based Neural Network for Jet Tagging

Convolutional neural networks are basic structures using jet images as input for the jet tagging problems. However, what they have learned during the training process is always difficult to understand just through feature maps. Inspired by the attention mechanism popular in machine learning fields, we propose a novel attention-based neural network (ABNN) to get insight of this problem. The ABNN combines a jet image with average jet images from the signal and the background to generate attention maps which show clearly the relevant importance according to the different origination of jets. Compared with networks in the similar architecture, this network achieves better performance, which indicates the potential of attention mechanism to use in other works.

preprint2020arXiv

Charge detection in an array of CMOS quantum dots

The recent development of arrays of quantum dots in semiconductor nanostructures highlights the progress of quantum devices toward large scale. However, how to realize such arrays on a scalable platform such as silicon is still an open question. One of the main challenge resides in the detection of charges within the array. It is a prerequisite functionality to initialize a desired charge state and readout spins through spin-to-charge conversion mechanisms. In this paper, we use two methods based on either a single-lead charge detector, or a reprogrammable single electron transistor. Thanks to these methods, we study the charge dynamics and sensitivity by performing single shot detection of the charge. Finally, we can probe the charge stability at any node of a linear array and assess the Coulomb disorder in the structure. We find an electrochemical potential fluctuation induced by charge noise comparable to that reported in other silicon quantum dots.

preprint2020arXiv

Constructive and Destructive Interference of Kerker-type Scattering in an Ultra-thin Silicon Huygens Metasurface

High refractive index dielectric nanoparticles have provided a new platform for exotic light manipulation through the interference of multipole modes. The Kerker effect is one example of a Huygens source design. Rather than exploiting interference between the electric dipole and magnetic dipole, as in many conventional Huygens source designs, we explore Kerker-type suppressed backward scattering mediated by the dominant electric dipole, toroidal dipole and magnetic quadrupole. These modes are provided by a designed and fabricated CMOS compatible ultra-thin Silicon nanodisk metasurface with a suppressed magnetic dipole contribution, and verified through multipole decomposition. The non-trivial substrate effect is considered using a semi-analytical transfer matrix model. The model successfully predicts the observed reflection dip. By applying a general criterion for constructive and destructive interference, it is shown that while constructive interference occurs between the electric and toroidal dipole contributions, the experimentally observed suppressed backward Kerker-type scattering arises from the destructive interference between backward scattered contributions due to the total electric dipole and the magnetic quadrupole. Our study paves the way towards new types of Huygens sources or metasurface design, such as for peculiar transverse Kerker scattering.

preprint2020arXiv

Continuous-Time Analysis of the Bitcoin and Prism Backbone Protocols

Bitcoin is a peer-to-peer payment system proposed by Nakamoto in 2008. Based on the Nakamoto consensus, Bagaria, Kannan, Tse, Fanti, and Viswanath proposed the Prism protocol in 2018 and showed that it achieves near-optimal blockchain throughput while maintaining a similar level of security as bitcoin. Previous probabilistic security guarantees for the bitcoin and Prism backbone protocols were either established under a simplified discrete-time model or expressed in terms of exponential order results. This paper presents a streamlined and strengthened analysis under a more realistic continuous-time model. A fully rigorous model for blockchains is developed with no restrictions on adversarial miners except for an upper bound on their aggregate mining rate. The only assumption on the peer-to-peer network is that all block propagation delays are upper bounded by a constant. A new notion of &#34;t-credible blockchains&#34; is introduced, which, together with some carefully defined &#34;typical&#34; events concerning block production over time intervals, is crucial to establish probabilisitic security guarantees in continuous time. A blockchain growth theorem, a blockchain quality theorem, and a common prefix theorem are established with explicit probability bounds. Moreover, under a certain typical event which occurs with probability close to $1$, a valid transaction that is deep enough in one credible blockchain is shown to be permanent in the sense that it must be found in} in all future credible blockchains.

preprint2020arXiv

Double spin asymmetry in dihadron production in SIDIS off the longitudinally polarized nucleon target

In this paper we study the double longitudinal spin asymmetry of dihadron production in semi-inclusive deep inelastic scattering (SIDIS). We calculate a unknown twist-3 dihadron fragmentation function $\widetilde{D}^\sphericalangle$ within a spectator model which has been used successfully in describing the dihadron production in both the unpolarized and the single polarized processes. The collinear picture, in which the transverse momentum of the final state hadron pair is integrated out, has been considered. The $\cosϕ_R$ azimuthal asymmetry arises from the coupling $e_L H_1^\sphericalangle$ and the coupling $g_1 \widetilde{D}^\sphericalangle$ is studied. We estimate the $\cosϕ_R$ asymmetry at the kinematics of COMPASS and compare with the data. The prediction at the future Electron Ion Collider (EIC) has also been presented.

preprint2020arXiv

Effects of coherence on quantum speed limits and shortcuts to adiabaticity in many-particle systems

We discuss the effects of many-body coherence on the speed of evolution of ultracold atomic gases and the relation to quantum speed limits. Our approach is focused on two related systems, spinless fermions and the bosonic Tonks-Girardeau gas, which possess equivalent density dynamics but very different coherence properties. To illustrate the effect of the coherence on the dynamics we consider squeezing an anharmonic potential which confines the particles and find that the speed of the evolution exhibits subtle, but fundamental differences between the two systems. Furthermore, we explore the difference in the driven dynamics by implementing a shortcut to adiabaticity designed to reduce spurious excitations. We show that collisions between the strongly interacting bosons can lead to changes in the coherence which results in different evolution speeds and therefore different fidelities of the final states.

preprint2020arXiv

Efficient implementation of immersed boundary-lattice Boltzmann method for massive particle-laden flows Part I: Serial computing

Immersed boundary-lattice Boltzmann method (IB-LBM) has been widely used for simulation of particle-laden flows recently. However, it was limited to small-scale simulations with no more than O(103) particles. Here, we expand IB-LBM for massive particle-laden flows with more than O(104) particles by two sequential works. First is the Part I: serial computing on a single CPU core and following the Part II: parallel computing on many CPU cores. In this Part I paper, a highly efficient and localized implementation of IB-LBM is proposed for serial computing. We optimize in three main aspects: swap algorithm for incompressible LBM, local grid-to-point algorithm for IBM and improved grid search algorithm for particle pair short-range interaction. In addition, symmetry algorithm is proposed for the half-calculation of LB collision and external force term. The computational performance on a single CPU core is analyzed. Different scales of two dimensional (2D) and three-dimensional (3D) particles settling in closed cavities are used for testing. The solid volume fraction is varied from 0 to 0.40. Simulation results demonstrate that all calculation parts are dramatically decreased by the improved algorithm. For the particle-free flows, the Mega Lattice Site Update per Second (MLUPS) can be achieved up to 36 (2D) and 12 (3D) using the improved algorithm. For the particle-laden flows, MLUPS can be achieved no lower than 15 (2D) and 7 (3D) in the simulations of dense flows. At last, we discuss the potential of the new algorithms for the high-performance computation of the large-scale systems of particle-laden flows with MPI parallel technique.

preprint2020arXiv

Fairness Constraints in Semi-supervised Learning

Fairness in machine learning has received considerable attention. However, most studies on fair learning focus on either supervised learning or unsupervised learning. Very few consider semi-supervised settings. Yet, in reality, most machine learning tasks rely on large datasets that contain both labeled and unlabeled data. One of key issues with fair learning is the balance between fairness and accuracy. Previous studies arguing that increasing the size of the training set can have a better trade-off. We believe that increasing the training set with unlabeled data may achieve the similar result. Hence, we develop a framework for fair semi-supervised learning, which is formulated as an optimization problem. This includes classifier loss to optimize accuracy, label propagation loss to optimize unlabled data prediction, and fairness constraints over labeled and unlabeled data to optimize the fairness level. The framework is conducted in logistic regression and support vector machines under the fairness metrics of disparate impact and disparate mistreatment. We theoretically analyze the source of discrimination in semi-supervised learning via bias, variance and noise decomposition. Extensive experiments show that our method is able to achieve fair semi-supervised learning, and reach a better trade-off between accuracy and fairness than fair supervised learning.

preprint2020arXiv

GPM: A Generic Probabilistic Model to Recover Annotator&#39;s Behavior and Ground Truth Labeling

In the big data era, data labeling can be obtained through crowdsourcing. Nevertheless, the obtained labels are generally noisy, unreliable or even adversarial. In this paper, we propose a probabilistic graphical annotation model to infer the underlying ground truth and annotator&#39;s behavior. To accommodate both discrete and continuous application scenarios (e.g., classifying scenes vs. rating videos on a Likert scale), the underlying ground truth is considered following a distribution rather than a single value. In this way, the reliable but potentially divergent opinions from &#34;good&#34; annotators can be recovered. The proposed model is able to identify whether an annotator has worked diligently towards the task during the labeling procedure, which could be used for further selection of qualified annotators. Our model has been tested on both simulated data and real-world data, where it always shows superior performance than the other state-of-the-art models in terms of accuracy and robustness.

preprint2020arXiv

GRB 140423A: A Case of Stellar Wind to Interstellar Medium Transition in the Afterglow

We present very early ground-based optical follow-up observations of GRB~140423A, which was discovered by \emph{Swift}/BAT and by {\it Fermi}/GBM. Its broadband afterglow was monitored by {\it Swift}/XRT and ground-based optical telescopes from $T_0+$70.96~s to 4.8~d after the {\it Swift}/BAT trigger. This is one more case of prompt optical emission observation. The temporal and spectral joint fit of the multiwavelength light curves of GRB 140423A reveals that achromatic behavior is consistent with the external shock model including a transition from a stellar wind to the interstellar medium (ISM) and energy injection. In terms of the optical light curves, there is an onset bump in the early afterglow with a rising index $α_{\rm O,I} = -0.59 \pm 0.04$ (peaking at $t_{\rm peak}-T_0 \approx 206$~s). It then decays with a steep index $α_{\rm O,II} = 1.78 \pm 0.03$, and shows a steeper to flatter &#34;transition&#34; with $α_{\rm O,III} = 1.13 \pm 0.03$ at around $T_0 + 5000$~s. The observed X-ray afterglow reflects an achromatic behavior, as does the optical light curve. There is no obvious evolution of the spectral energy distribution between the X-ray and optical afterglow, with an average value of the photon index $Γ\approx 1.95$. This &#34;transition&#34; is consistent with an external shock model having the circumburst medium transition from a wind to the ISM, by introducing a long-lasting energy injection with a Lorentz factor stratification of the ejecta. The best parameters from Monte Carlo Markov Chain fitting are $E_{\rm K,iso} \approx 2.14\times10^{55}$ erg, $Γ_0 \approx 162$, $ε_e \approx 0.02$, $ε_B \approx 1.7\times10^{-6}$, $A_\ast \approx 1.0$, $R_t \approx 4.1\times10^{17}$ cm, $n \approx 11.0 \rm\ cm^{-3}$, $L_0 \approx 3.1\times10^{52} \rm\ erg\ s^{-1}$, $k \approx 1.98$, $s \approx 1.54$, and $θ_j > 0.3$ rad.

preprint2020arXiv

Ground-state correlation energy of beryllium dimer by the Bethe-Salpeter equation

Since the &#39;30s the interatomic potential of the beryllium dimer Be$_2$ has been both an experimental and a theoretical challenge. Calculating the ground-state correlation energy of Be$_2$ along its dissociation path is a difficult problem for theory. We present ab initio many-body perturbation theory calculations of the Be$_2$ interatomic potential using the GW approximation and the Bethe-Salpeter equation (BSE). The ground-state correlation energy is calculated by the trace formula with checks against the adiabatic-connection fluctuation-dissipation theorem formula. We show that inclusion of GW corrections already improves the energy even at the level of the random-phase approximation. At the level of the BSE on top of the GW approximation, our calculation is in surprising agreement with the most accurate theories and with experiment. It even reproduces an experimentally observed flattening of the interatomic potential due to a delicate correlations balance from a competition between covalent and van der Waals bonding.

preprint2020arXiv

High-order numerical methods for the Riesz space fractional advection-dispersion equations

In this paper, we propose high-order numerical methods for the Riesz space fractional advection-dispersion equations (RSFADE) on a {f}inite domain. The RSFADE is obtained from the standard advection-dispersion equation by replacing the first-order and second-order space derivative with the Riesz fractional derivatives of order $α\in(0,1)$ and $β\in(1,2]$, respectively. Firstly, we utilize the weighted and shifted Grünwald difference operators to approximate the Riesz fractional derivative and present the {f}inite difference method for the RSFADE. Specifically, we discuss the Crank-Nicolson scheme and solve it in matrix form. Secondly, we prove that the scheme is unconditionally stable and convergent with the accuracy of $\mathcal {O}(τ^2+h^2)$. Thirdly, we use the Richardson extrapolation method (REM) to improve the convergence order which can be $\mathcal {O}(τ^4+h^4)$. Finally, some numerical examples are given to show the effectiveness of the numerical method, and the results are excellent with the theoretical analysis.

preprint2020arXiv

Hole-phonon interactions in quantum dots: Effects of phonon confinement and encapsulation materials on spin-orbit qubits

Spin-phonon interactions are one of the mechanisms limiting the lifetime of spin qubits made in semiconductor quantum dots. At variance with other mechanisms such as charge noise, phonons are intrinsic to the device and can hardly be mitigated. They set, therefore fundamental limits to the relaxation time of the qubits. Here we introduce a general framework for the calculation of the spin (and charge) transition rates induced by bulk (3D) and strongly confined 1D or 2D phonons. We discuss the particular case of hole spin-orbit qubits described by the 6 bands kp model. We next apply this theory to a hole qubit in a silicon-on-insulator device. We show that spin relaxation in this device is dominated by a band mixing term that couples the holes to transverse acoustic phonons through the valence band deformation potential d, and optimize the bias point and magnetic field orientation to maximize the number of Rabi oscillations Q that can be achieved within on relaxation time T1. Despite the strong spin-orbit coupling in the valence band, the phonon-limited Q can reach a few tens of thousands. We next explore the effects of phonon confinement in 1D and 2D structures, and the impact of the encapsulation materials on the relaxation rates. We show that the spin lifetimes can depend on the structure of the device over micrometer-long length scales and that they improve when the materials around the qubit get harder. Phonon engineering in semiconductor qubits may therefore become relevant once the extrinsic sources of relaxation have been reduced.

preprint2020arXiv

Hubble Space Telescope Search for Activity in High Perihelion Objects

Solar system objects with perihelia beyond the orbit of Jupiter ($q >$ 5 AU) are too cold for water ice to generate an appreciable coma via sublimation. Despite this, numerous high perihelion objects (HPOs) including many comets and recently escaped Kuiper belt objects (``Centaurs&#39;&#39;) are observed to be active out at least to the orbit of Saturn ($q \sim$ 10 AU). Peak equilibrium temperatures at 10 AU ($\sim$125 K), while far too low to sublimate water ice, are sufficient to sublimate super-volatiles such as CO and CO$_2$ ice. Temperatures at 10 AU are also high enough to trigger the rapid crystallization of exposed amorphous ice, thus constituting another possible driver of distant activity. While supervolatile ices can sublimate strongly (as $r_H^{-2}$) to at least Kuiper belt (30 AU) distances, crystallization is an exponential function of temperature that cannot be sustained much beyond $\sim$10 AU. The heliocentric dependence of the activity thus suggests an observational test. If activity in high perihelion objects is triggered by crystallization, then no examples of activity should be found with perihelia $q >>$ 10 AU. If, on the other hand, activity is due to free sublimation of exposed supervolatile ices, or another cause, then distant activity might be detected. We obtained sensitive, high resolution Hubble Space Telescope observations of HPOs to search for activity beyond the crystallization zone. No examples of activity were detected in 53 objects with $q >$ 15 AU, consistent with the crystallization trigger hypothesis. However, sensitivity limits are such that we cannot reject the alternative hypothesis that mass loss is driven by the sublimation of supervolatile ices. We also searched for binary companions in our sample, finding none and setting an empirical 3$σ$ limit to the binary fraction of $<8$\%.

preprint2020arXiv

Large Batch Optimization for Deep Learning: Training BERT in 76 minutes

Training large deep neural networks on massive datasets is computationally very challenging. There has been recent surge in interest in using large batch stochastic optimization methods to tackle this issue. The most prominent algorithm in this line of research is LARS, which by employing layerwise adaptive learning rates trains ResNet on ImageNet in a few minutes. However, LARS performs poorly for attention models like BERT, indicating that its performance gains are not consistent across tasks. In this paper, we first study a principled layerwise adaptation strategy to accelerate training of deep neural networks using large mini-batches. Using this strategy, we develop a new layerwise adaptive large batch optimization technique called LAMB; we then provide convergence analysis of LAMB as well as LARS, showing convergence to a stationary point in general nonconvex settings. Our empirical results demonstrate the superior performance of LAMB across various tasks such as BERT and ResNet-50 training with very little hyperparameter tuning. In particular, for BERT training, our optimizer enables use of very large batch sizes of 32868 without any degradation of performance. By increasing the batch size to the memory limit of a TPUv3 Pod, BERT training time can be reduced from 3 days to just 76 minutes (Table 1). The LAMB implementation is available at https://github.com/tensorflow/addons/blob/master/tensorflow_addons/optimizers/lamb.py

preprint2020arXiv

LSOTB-TIR:A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark

In this paper, we present a Large-Scale and high-diversity general Thermal InfraRed (TIR) Object Tracking Benchmark, called LSOTBTIR, which consists of an evaluation dataset and a training dataset with a total of 1,400 TIR sequences and more than 600K frames. We annotate the bounding box of objects in every frame of all sequences and generate over 730K bounding boxes in total. To the best of our knowledge, LSOTB-TIR is the largest and most diverse TIR object tracking benchmark to date. To evaluate a tracker on different attributes, we define 4 scenario attributes and 12 challenge attributes in the evaluation dataset. By releasing LSOTB-TIR, we encourage the community to develop deep learning based TIR trackers and evaluate them fairly and comprehensively. We evaluate and analyze more than 30 trackers on LSOTB-TIR to provide a series of baselines, and the results show that deep trackers achieve promising performance. Furthermore, we re-train several representative deep trackers on LSOTB-TIR, and their results demonstrate that the proposed training dataset significantly improves the performance of deep TIR trackers. Codes and dataset are available at https://github.com/QiaoLiuHit/LSOTB-TIR.

preprint2020arXiv

Molecular dynamics simulation for cross-linking processes and material properties of epoxy resins with the first principle calculation combined with global reaction route mapping algorithms

Herein, epoxy resin is cured by coupling quantum chemical (QC) calculations with molecular dynamics (MD) simulations that enable parameter-free prediction of material characteristics. A polymer network is formed by the reaction between base resin and curing agent. The reaction uses activation energy and heat of formation data obtained by first-principle calculations coupled with global reaction route mapping (GRRM) algorithms. Density, glass transition temperature, Young&#39;s modulus, and curing conversion is used to validate the procedure. Experimental and simulation results indicate that base resin with multi-functional reaction groups increases glass-transition temperature and Young&#39;s modulus because of cross-linked formations at the molecular scale.

preprint2020arXiv

Pose-Assisted Multi-Camera Collaboration for Active Object Tracking

Active Object Tracking (AOT) is crucial to many visionbased applications, e.g., mobile robot, intelligent surveillance. However, there are a number of challenges when deploying active tracking in complex scenarios, e.g., target is frequently occluded by obstacles. In this paper, we extend the single-camera AOT to a multi-camera setting, where cameras tracking a target in a collaborative fashion. To achieve effective collaboration among cameras, we propose a novel Pose-Assisted Multi-Camera Collaboration System, which enables a camera to cooperate with the others by sharing camera poses for active object tracking. In the system, each camera is equipped with two controllers and a switcher: The vision-based controller tracks targets based on observed images. The pose-based controller moves the camera in accordance to the poses of the other cameras. At each step, the switcher decides which action to take from the two controllers according to the visibility of the target. The experimental results demonstrate that our system outperforms all the baselines and is capable of generalizing to unseen environments. The code and demo videos are available on our website https://sites.google.com/view/pose-assistedcollaboration.

preprint2020arXiv

Secure Metric Learning via Differential Pairwise Privacy

Distance Metric Learning (DML) has drawn much attention over the last two decades. A number of previous works have shown that it performs well in measuring the similarities of individuals given a set of correctly labeled pairwise data by domain experts. These important and precisely-labeled pairwise data are often highly sensitive in real world (e.g., patients similarity). This paper studies, for the first time, how pairwise information can be leaked to attackers during distance metric learning, and develops differential pairwise privacy (DPP), generalizing the definition of standard differential privacy, for secure metric learning. Unlike traditional differential privacy which only applies to independent samples, thus cannot be used for pairwise data, DPP successfully deals with this problem by reformulating the worst case. Specifically, given the pairwise data, we reveal all the involved correlations among pairs in the constructed undirected graph. DPP is then formalized that defines what kind of DML algorithm is private to preserve pairwise data. After that, a case study employing the contrastive loss is exhibited to clarify the details of implementing a DPP-DML algorithm. Particularly, the sensitivity reduction technique is proposed to enhance the utility of the output distance metric. Experiments both on a toy dataset and benchmarks demonstrate that the proposed scheme achieves pairwise data privacy without compromising the output performance much (Accuracy declines less than 0.01 throughout all benchmark datasets when the privacy budget is set at 4).

preprint2020arXiv

Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

Recently, there has been growing interest in developing learning-based methods to detect and utilize salient semi-global or global structures, such as junctions, lines, planes, cuboids, smooth surfaces, and all types of symmetries, for 3D scene modeling and understanding. However, the ground truth annotations are often obtained via human labor, which is particularly challenging and inefficient for such tasks due to the large number of 3D structure instances (e.g., line segments) and other factors such as viewpoints and occlusions. In this paper, we present a new synthetic dataset, Structured3D, with the aim of providing large-scale photo-realistic images with rich 3D structure annotations for a wide spectrum of structured 3D modeling tasks. We take advantage of the availability of professional interior designs and automatically extract 3D structures from them. We generate high-quality images with an industry-leading rendering engine. We use our synthetic dataset in combination with real images to train deep networks for room layout estimation and demonstrate improved performance on benchmark datasets.

preprint2020arXiv

Superconducting proximity effect in a transparent van der Waals superconductor-metal junction

We report on Andreev reflections at clean NbSe2-bilayer graphene junctions. The high transparency of the junction, which manifests as a large conductance enhancement of up to 1.8, enables us to see clear evidence of a proximity-induced superconducting gap in bilayer graphene and two Andreev reflections through a vertical NbSe2-graphene and a lateral graphene-graphene junction respectively. Quantum transport simulations capture the complexity of the experimental data and illuminate the impact of various microscopic parameters on the transmission of the junction. Our work establishes the practice and understanding of an all-van-der-Waals, high-performance superconducting junction. The realization of a highly transparent proximized graphene-graphene junction opens up possibilities to engineer emergent quantum phenomena.

preprint2020arXiv

Travelling wave solutions of the density-suppressed motility model

In this paper, we study the traveling wave solutions to the density-suppressed motility model describing the ``self-trapping&#39;&#39; mechanism that induces spatio-temporal pattern formations observed in the experiment. We establish the existence of traveling wavefronts with a minimal wave speed and discuss the selection of wave profiles supplemented with numerical simulations illustrating the wave patterns which are well consistent with experimental observations.

preprint2020arXiv

UFTR: A Unified Framework for Ticket Routing

Corporations today face increasing demands for the timely and effective delivery of customer service. This creates the need for a robust and accurate automated solution to what is formally known as the ticket routing problem. This task is to match each unresolved service incident, or &#34;ticket&#34;, to the right group of service experts. Existing studies divide the task into two independent subproblems - initial group assignment and inter-group transfer. However, our study addresses both subproblems jointly using an end-to-end modeling approach. We first performed a preliminary analysis of half a million archived tickets to uncover relevant features. Then, we devised the UFTR, a Unified Framework for Ticket Routing using four types of features (derived from tickets, groups, and their interactions). In our experiments, we implemented two ranking models with the UFTR. Our models outperform baselines on three routing metrics. Furthermore, a post-hoc analysis reveals that this superior performance can largely be attributed to the features that capture the associations between ticket assignment and group assignment. In short, our results demonstrate that the UFTR is a superior solution to the ticket routing problem because it takes into account previously unexploited interrelationships between the group assignment and group transfer problems.

preprint2020arXiv

Unusual Intralayer Ferromagnetism Between S = 5/2 ions in MnBi$_2$Te$_4$: Role of Empty Bi $p$ States

The layered magnetic topological insulator MnBi$_2$Te$_4$ is a promising platform to realize the quantum anomalous Hall effect because its layers possess intrinsic ferromagnetism. However, it is not well understood why the high-spin $d^5$ magnetic ions Mn$^{2+}$ forming the Mn-Te-Mn spin exchange paths prefer ferromagnetic (FM) coupling, contrary to the prediction of the Goodenough-Kanamori rule that a TM-L-TM spin exchange, where TM and L are a transition-metal magnetic cation and a main group ligand, respectively, is antiferromagnetic (AFM) even when the bond angle of the exchange path is 90$^{\circ}$. Using density functional theory (DFT) calculations, we show that the presence of Bi$^{3+}$ ions is essential for the FM coupling in MnBi$_2$Te$_4$. Then, using a tight-binding model Hamiltonian, we find that high-spin $d^5$ ions (S = 5/2) in TM-L-TM spin exchange paths prefer FM coupling if the empty p-orbitals of a nonmagnetic cation M (e.g., Bi$^{3+}$ ion) hybridize strongly with those of the bridging ligand L, but AFM coupling otherwise.

preprint2020arXiv

What Changed Your Mind: The Roles of Dynamic Topics and Discourse in Argumentation Process

In our world with full of uncertainty, debates and argumentation contribute to the progress of science and society. Despite of the increasing attention to characterize human arguments, most progress made so far focus on the debate outcome, largely ignoring the dynamic patterns in argumentation processes. This paper presents a study that automatically analyzes the key factors in argument persuasiveness, beyond simply predicting who will persuade whom. Specifically, we propose a novel neural model that is able to dynamically track the changes of latent topics and discourse in argumentative conversations, allowing the investigation of their roles in influencing the outcomes of persuasion. Extensive experiments have been conducted on argumentative conversations on both social media and supreme court. The results show that our model outperforms state-of-the-art models in identifying persuasive arguments via explicitly exploring dynamic factors of topic and discourse. We further analyze the effects of topics and discourse on persuasiveness, and find that they are both useful - topics provide concrete evidence while superior discourse styles may bias participants, especially in social media arguments. In addition, we draw some findings from our empirical results, which will help people better engage in future persuasive conversations.

preprint2019arXiv

An improved bound for strong unitary uncertainty relations with refined sequence

We derive the lower bound of uncertainty relations of two unitary operators for a class of states based on the geometric-arithmetic inequality and Cauchy-Schwarz inequality. Furthermore, we propose a set of uncertainty relations for three unitary operators. Compared to the known bound introduced in Phys.Rev.A.100,022116(2019), the unitary uncertainty relations bound with our method is tighter, to a certain extent. Meanwhile, some examples are given in the paper to illustrate our conclusions.

preprint2019arXiv

Coupled density-spin Bose-Einstein condensates dynamics and collapse in systems with quintic nonlinearity

We investigate the effects of spin-orbit coupling and Zeeman splitting on the coupled density-spin dynamics and collapse of the Bose-Einstein condensate driven by the quintic self-attraction in the same- and cross-spin channels. The characteristic feature of the collapse is the decrease in the width as given by the participation ratio of the density rather than by the expectation values of the coordinate. Qualitative arguments and numerical simulations reveal the existence of a critical spin-orbit coupling strength which either prohibits or leads to the collapse, and its dependence on other parameters, such as the condensates norm, spin-dependent nonlinear coupling, and the Zeeman splitting. The entire nonlinear dynamics critically depend on the initial spin sate.

preprint2019arXiv

Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition

Existing deep learning methods for action recognition in videos require a large number of labeled videos for training, which is labor-intensive and time-consuming. For the same action, the knowledge learned from different media types, e.g., videos and images, may be related and complementary. However, due to the domain shifts and heterogeneous feature representations between videos and images, the performance of classifiers trained on images may be dramatically degraded when directly deployed to videos. In this paper, we propose a novel method, named Deep Image-to-Video Adaptation and Fusion Networks (DIVAFN), to enhance action recognition in videos by transferring knowledge from images using video keyframes as a bridge. The DIVAFN is a unified deep learning model, which integrates domain-invariant representations learning and cross-modal feature fusion into a unified optimization framework. Specifically, we design an efficient cross-modal similarities metric to reduce the modality shift among images, keyframes and videos. Then, we adopt an autoencoder architecture, whose hidden layer is constrained to be the semantic representations of the action class names. In this way, when the autoencoder is adopted to project the learned features from different domains to the same space, more compact, informative and discriminative representations can be obtained. Finally, the concatenation of the learned semantic feature representations from these three autoencoders are used to train the classifier for action recognition in videos. Comprehensive experiments on four real-world datasets show that our method outperforms some state-of-the-art domain adaptation and action recognition methods.

preprint2019arXiv

Gaussian Process Regression and Conditional Polynomial Chaos for Parameter Estimation

We present a new approach for constructing a data-driven surrogate model and using it for Bayesian parameter estimation in partial differential equation (PDE) models. We first use parameter observations and Gaussian Process regression to condition the Karhunen-Loéve (KL) expansion of the unknown space-dependent parameters and then build the conditional generalized Polynomial Chaos (gPC) surrogate model of the PDE states. Next, we estimate the unknown parameters by computing coefficients in the KL expansion minimizing the square difference between the gPC predictions and measurements of the states using the Markov Chain Monte Carlo method. Our approach addresses two major challenges in the Bayesian parameter estimation. First, it reduces dimensionality of the parameter space and replaces expensive direct solutions of PDEs with the conditional gPC surrogates. Second, the estimated parameter field exactly matches the parameter measurements. In addition, we show that the conditional gPC surrogate can be used to estimate the states variance, which, in turn, can be used to guide data acquisition. We demonstrate that our approach improves its accuracy with application to one- and two-dimensional Darcy equation with (unknown) space-dependent hydraulic conductivity. We also discuss the effect of hydraulic conductivity and head locations on the accuracy of the hydraulic conductivity estimations.

preprint2019arXiv

Improving solution accuracy and convergence for stochastic physics parameterizations with colored noise

Stochastic parameterizations are used in numerical weather prediction and climate modeling to help capture the uncertainty in the simulations and improve their statistical properties. Convergence issues can arise when time integration methods originally developed for deterministic differential equations are applied naively to stochastic problems. (Hodyss et al 2013, 2014) demonstrated that a correction term to various deterministic numerical schemes, known in stochastic analysis as the Itô correction, can help improve solution accuracy and ensure convergence to the physically relevant solution without substantial computational overhead. The usual formulation of the Itô correction is valid only when the stochasticity is represented by {\it white} noise. In this study, a generalized formulation of the Itô correction is derived for noises of any color. The formulation is applied to a test problem described by an advection-diffusion equation forced with a spectrum of fast processes. We present numerical results for cases with both constant and spatially varying advection velocities to show that, for the same time step sizes, the introduction of the generalized Itô correction helps to substantially reduce time integration error and significantly improve the convergence rate of the numerical solutions when the forcing term in the governing equation is rough (fast varying); alternatively, for the same target accuracy, the generalized Itô correction allows for the use of significantly longer time steps and hence helps to reduce the computational cost of the numerical simulation.

preprint2019arXiv

Model reduction for a power grid model

We apply model reduction techniques to the DeMarco power grid model. The DeMarco model, when augmented by an appropriate line failure mechanism, can be used to study cascade failures. Here we examine the DeMarco model without the line failure mechanism and we investigate how to construct reduced order models for subsets of the state variables. We show that due to the oscillating nature of the solutions and the absence of timescale separation between resolved and unresolved variables, the construction of accurate reduced models becomes highly non-trivial since one has to account for long memory effects. In addition, we show that a reduced model which includes even a short memory is drastically better than a memoryless model.

preprint2019arXiv

Robust Cubature Kalman Filter for Dynamic State Estimation of Synchronous Machines under Unknown Measurement Noise Statistics

Kalman-type filtering techniques including cubature Kalman filter (CKF) does not work well in non-Gaussian environments, especially in the presence of outliers. To solve this problem, Huber&#39;s M-estimation based robust CKF (RCKF) is proposed for synchronous machines by combining the Huber&#39;s M-estimation theory with the classical CKF, which is capable of coping with the deterioration in performance and discretization of tracking curves when measurement noise statistics deviatefrom the prior noise statistics. The proposed RCKF algorithm has good adaptability to unknown measurement noise statistics characteristics including non-Gaussian measurement noise and outliers. The simulation results on the WSCC 3-machine 9-bus system and New England 16-machine 68-bus system verify the effectiveness of the proposed method and its advantage over the classical CKF.

preprint2019arXiv

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.

preprint2017arXiv

On the Cauchy Problem of 3D Nonhomogeneous Navier-Stokes Equations with Density-Dependent Viscosity and Vacuum

We consider the global existence and large-time asymptotic behavior of strong solutions to the Cauchy problem of the three-dimensional nonhomogeneous incompressible Navier-Stokes equations with density-dependent viscosity and vacuum. We establish some key a priori exponential decay-in-time rates of the strong solutions. Then after using these estimates, we also obtain the global existence of strong solutions in the whole three-dimensional space, provided that the initial velocity is suitably small in the $\dot H^β$-norm for some $β\in(1/2,1].$ Note that this result is proved without any smallness conditions on the initial density. Moreover, the density can contain vacuum states and even have compact support initially.

preprint2013arXiv

Global Well-Posedness and Large Time Asymptotic Behavior of Classical Solutions to the Compressible Navier-Stokes Equations with Vacuum

This paper concerns the global well-posedness and large time asymptotic behavior of strong and classical solutions to the Cauchy problem of the Navier-Stokes equations for viscous compressible barotropic flows in two or three spatial dimensions with vacuum as far field density. For strong and classical solutions, some a priori decay with rates (in large time) for both the pressure and the spatial gradient of the velocity field are obtained provided that the initial total energy is suitably {small.} Moreover, by using these key decay rates and some analysis on the expansion rates of the essential support of the density, we establish the global existence and uniqueness of classical solutions (which may be of possibly large oscillations) in two spatial dimensions, provided the smooth initial data are of small total energy. In addition, the initial density can even have compact support. This, in particular, yields the global regularity and uniqueness of the re-normalized weak solutions of Lions-Feireisl to the two-dimensional compressible barotropic flows for all adiabatic number $γ>1$ provided that the initial total energy is small.