Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
124works
0followers
55topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

124 published item(s)

preprint2026arXiv

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

As training scales grow, collective communication libraries (CCL) increasingly face anomalies arising from complex interactions among hardware, software, and environmental factors. These anomalies typically manifest as slow/hang communication, the most frequent and time-consuming category to diagnose. However, traditional diagnostic methods remain inaccurate and inefficient, frequently requiring hours or even days for root cause analysis. To address this, we propose CCL-D, a high-precision diagnostic system designed to detect and locate slow/hang anomalies in large-scale distributed training. CCL-D integrates a rank-level real-time probe with an intelligent decision analyzer. The probe measures cross-layer anomaly metrics using a lightweight distributed tracing framework to monitor communication traffic. The analyzer performs automated anomaly detection and root-cause location, precisely identifying the faulty GPU rank. Deployed on a 4,000-GPU cluster over one year, CCL-D achieved near-complete coverage of known slow/hang anomalies and pinpointed affected ranks within 6 minutes-substantially outperforming existing solutions.

preprint2025arXiv

CogRec: A Cognitive Recommender Agent Fusing Large Language Models and Soar for Explainable Recommendation

Large Language Models (LLMs) have demonstrated a remarkable capacity in understanding user preferences for recommendation systems. However, they are constrained by several critical challenges, including their inherent "Black-Box" characteristics, susceptibility to knowledge hallucination, and limited online learning capacity. These factors compromise their trustworthiness and adaptability. Conversely, cognitive architectures such as Soar offer structured and interpretable reasoning processes, yet their knowledge acquisition is notoriously laborious. To address these complementary challenges, we propose a novel cognitive recommender agent called CogRec which synergizes the strengths of LLMs with the Soar cognitive architecture. CogRec leverages Soar as its core symbolic reasoning engine and leverages an LLM for knowledge initialization to populate its working memory with production rules. The agent operates on a Perception-Cognition-Action(PCA) cycle. Upon encountering an impasse, it dynamically queries the LLM to obtain a reasoned solution. This solution is subsequently transformed into a new symbolic production rule via Soar's chunking mechanism, thereby enabling robust online learning. This learning paradigm allows the agent to continuously evolve its knowledge base and furnish highly interpretable rationales for its recommendations. Extensive evaluations conducted on three public datasets demonstrate that CogRec demonstrates significant advantages in recommendation accuracy, explainability, and its efficacy in addressing the long-tail problem.

preprint2024arXiv

Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing

Face recognition systems have raised concerns due to their vulnerability to different presentation attacks, and system security has become an increasingly critical concern. Although many face anti-spoofing (FAS) methods perform well in intra-dataset scenarios, their generalization remains a challenge. To address this issue, some methods adopt domain adversarial training (DAT) to extract domain-invariant features. However, the competition between the encoder and the domain discriminator can cause the network to be difficult to train and converge. In this paper, we propose a domain adversarial attack (DAA) method to mitigate the training instability problem by adding perturbations to the input images, which makes them indistinguishable across domains and enables domain alignment. Moreover, since models trained on limited data and types of attacks cannot generalize well to unknown attacks, we propose a dual perceptual and generative knowledge distillation framework for face anti-spoofing that utilizes pre-trained face-related models containing rich face priors. Specifically, we adopt two different face-related models as teachers to transfer knowledge to the target student model. The pre-trained teacher models are not from the task of face anti-spoofing but from perceptual and generative tasks, respectively, which implicitly augment the data. By combining both DAA and dual-teacher knowledge distillation, we develop a dual teacher knowledge distillation with domain alignment framework (DTDA) for face anti-spoofing. The advantage of our proposed method has been verified through extensive ablation studies and comparison with state-of-the-art methods on public datasets across multiple protocols.

preprint2024arXiv

Research on Multilingual Natural Scene Text Detection Algorithm

Natural scene text detection is a significant challenge in computer vision, with tremendous potential applications in multilingual, diverse, and complex text scenarios. We propose a multilingual text detection model to address the issues of low accuracy and high difficulty in detecting multilingual text in natural scenes. In response to the challenges posed by multilingual text images with multiple character sets and various font styles, we introduce the SFM Swin Transformer feature extraction network to enhance the model's robustness in detecting characters and fonts across different languages. Dealing with the considerable variation in text scales and complex arrangements in natural scene text images, we present the AS-HRFPN feature fusion network by incorporating an Adaptive Spatial Feature Fusion module and a Spatial Pyramid Pooling module. The feature fusion network improvements enhance the model's ability to detect text sizes and orientations. Addressing diverse backgrounds and font variations in multilingual scene text images is a challenge for existing methods. Limited local receptive fields hinder detection performance. To overcome this, we propose a Global Semantic Segmentation Branch, extracting and preserving global features for more effective text detection, aligning with the need for comprehensive information. In this study, we collected and built a real-world multilingual natural scene text image dataset and conducted comprehensive experiments and analyses. The experimental results demonstrate that the proposed algorithm achieves an F-measure of 85.02\%, which is 4.71\% higher than the baseline model. We also conducted extensive cross-dataset validation on MSRA-TD500, ICDAR2017MLT, and ICDAR2015 datasets to verify the generality of our approach. The code and dataset can be found at https://github.com/wangmelon/CEMLT.

preprint2024arXiv

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Correspondence pruning aims to find correct matches (inliers) from an initial set of putative correspondences, which is a fundamental task for many applications. The process of finding is challenging, given the varying inlier ratios between scenes/image pairs due to significant visual differences. However, the performance of the existing methods is usually limited by the problem of lacking visual cues (\eg texture, illumination, structure) of scenes. In this paper, we propose a Visual-Spatial Fusion Transformer (VSFormer) to identify inliers and recover camera poses accurately. Firstly, we obtain highly abstract visual cues of a scene with the cross attention between local features of two-view images. Then, we model these visual cues and correspondences by a joint visual-spatial fusion module, simultaneously embedding visual cues into correspondences for pruning. Additionally, to mine the consistency of correspondences, we also design a novel module that combines the KNN-based graph and the transformer, effectively capturing both local and global contexts. Extensive experiments have demonstrated that the proposed VSFormer outperforms state-of-the-art methods on outdoor and indoor benchmarks. Our code is provided at the following repository: https://github.com/sugar-fly/VSFormer.

preprint2023arXiv

A Multi-Scale Framework for Out-of-Distribution Detection in Dermoscopic Images

The automatic detection of skin diseases via dermoscopic images can improve the efficiency in diagnosis and help doctors make more accurate judgments. However, conventional skin disease recognition systems may produce high confidence for out-of-distribution (OOD) data, which may become a major security vulnerability in practical applications. In this paper, we propose a multi-scale detection framework to detect out-of-distribution skin disease image data to ensure the robustness of the system. Our framework extracts features from different layers of the neural network. In the early layers, rectified activation is used to make the output features closer to the well-behaved distribution, and then an one-class SVM is trained to detect OOD data; in the penultimate layer, an adapted Gram matrix is used to calculate the features after rectified activation, and finally the layer with the best performance is chosen to compute a normality score. Experiments show that the proposed framework achieves superior performance when compared with other state-of-the-art methods in the task of skin disease recognition.

preprint2023arXiv

Robust Remote Sensing Scene Classification with Multi-View Voting and Entropy Ranking

Deep convolutional neural networks have been widely used in scene classification of remotely sensed images. In this work, we propose a robust learning method for the task that is secure against partially incorrect categorization of images. Specifically, we remove and correct errors in the labels progressively by iterative multi-view voting and entropy ranking. At each time step, we first divide the training data into disjoint parts for separate training and voting. The unanimity in the voting reveals the correctness of the labels, so that we can train a strong model with only the images with unanimous votes. In addition, we adopt entropy as an effective measure for prediction uncertainty, in order to partially recover labeling errors by ranking and selection. We empirically demonstrate the superiority of the proposed method on the WHU-RS19 dataset and the AID dataset.

preprint2023arXiv

Spatio-Temporal Context Modeling for Road Obstacle Detection

Road obstacle detection is an important problem for vehicle driving safety. In this paper, we aim to obtain robust road obstacle detection based on spatio-temporal context modeling. Firstly, a data-driven spatial context model of the driving scene is constructed with the layouts of the training data. Then, obstacles in the input image are detected via the state-of-the-art object detection algorithms, and the results are combined with the generated scene layout. In addition, to further improve the performance and robustness, temporal information in the image sequence is taken into consideration, and the optical flow is obtained in the vicinity of the detected objects to track the obstacles across neighboring frames. Qualitative and quantitative experiments were conducted on the Small Obstacle Detection (SOD) dataset and the Lost and Found dataset. The results indicate that our method with spatio-temporal context modeling is superior to existing methods for road obstacle detection.

preprint2023arXiv

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion

Text-to-speech (TTS) and voice conversion (VC) are two different tasks both aiming at generating high quality speaking voice according to different input modality. Due to their similarity, this paper proposes UnifySpeech, which brings TTS and VC into a unified framework for the first time. The model is based on the assumption that speech can be decoupled into three independent components: content information, speaker information, prosody information. Both TTS and VC can be regarded as mining these three parts of information from the input and completing the reconstruction of speech. For TTS, the speech content information is derived from the text, while in VC it's derived from the source speech, so all the remaining units are shared except for the speech content extraction module in the two tasks. We applied vector quantization and domain constrain to bridge the gap between the content domains of TTS and VC. Objective and subjective evaluation shows that by combining the two task, TTS obtains better speaker modeling ability while VC gets hold of impressive speech content decoupling capability.

preprint2022arXiv

A Long Short-term Memory Based Recurrent Neural Network for Interventional MRI Reconstruction

Interventional magnetic resonance imaging (i-MRI) for surgical guidance could help visualize the interventional process such as deep brain stimulation (DBS), improving the surgery performance and patient outcome. Different from retrospective reconstruction in conventional dynamic imaging, i-MRI for DBS has to acquire and reconstruct the interventional images sequentially online. Here we proposed a convolutional long short-term memory (Conv-LSTM) based recurrent neural network (RNN), or ConvLR, to reconstruct interventional images with golden-angle radial sampling. By using an initializer and Conv-LSTM blocks, the priors from the pre-operative reference image and intra-operative frames were exploited for reconstructing the current frame. Data consistency for radial sampling was implemented by a soft-projection method. To improve the reconstruction accuracy, an adversarial learning strategy was adopted. A set of interventional images based on the pre-operative and post-operative MR images were simulated for algorithm validation. Results showed with only 10 radial spokes, ConvLR provided the best performance compared with state-of-the-art methods, giving an acceleration up to 40 folds. The proposed algorithm has the potential to achieve real-time i-MRI for DBS and can be used for general purpose MR-guided intervention.

preprint2022arXiv

A new family of quantum synchronizable codes from negacyclic codes

Quantum synchronizable codes are kinds of quantum error-correcting codes that can not only correct the effects of quantum noise on qubits but also the misalignment in block synchronization. In this paper, a new method for construct quantum synchronizable codes from negacyclic codes are proposed, where the length of these negacyclic codes are $p$ and $pq$. Through this method, the quantum synchronizable code possesses optimal or almost optimal error-correcting capability towards bits errors and phase errors, since the negacyclic codes we used are optimal or almost optimal. Moreover, this paper contributes to construct two classes quantum synchronizable codes, whose synchronization capabilities can reach the upper limit under certain conditions.

preprint2022arXiv

A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

To support the application scenarios where high-resolution (HR) images are urgently needed, various single image super-resolution (SISR) algorithms are developed. However, SISR is an ill-posed inverse problem, which may bring artifacts like texture shift, blur, etc. to the reconstructed images, thus it is necessary to evaluate the quality of super-resolution images (SRIs). Note that most existing image quality assessment (IQA) methods were developed for synthetically distorted images, which may not work for SRIs since their distortions are more diverse and complicated. Therefore, in this paper, we propose a no-reference deep-learning image quality assessment method based on frequency maps because the artifacts caused by SISR algorithms are quite sensitive to frequency information. Specifically, we first obtain the high-frequency map (HM) and low-frequency map (LM) of SRI by using Sobel operator and piecewise smooth image approximation. Then, a two-stream network is employed to extract the quality-aware features of both frequency maps. Finally, the features are regressed into a single quality value using fully connected layers. The experimental results show that our method outperforms all compared IQA models on the selected three super-resolution quality assessment (SRQA) databases.

preprint2022arXiv

An extension of Thomassen's result on choosability

Thomassen proved that all planar graphs are $5$-choosable. Škrekovski strengthened the result by showing that all $K_{5}$-minor-free graphs are $5$-choosable. Dvořák and Postle pointed out that all planar graphs are DP-$5$-colorable. In this note, we first improve these results by showing that every $K_{5}$-minor-free or $K_{3, 3}$-minor-free graph is DP-$5$-colorable. In the final section, we further improve these results under the term strictly $f$-degenerate transversal.

preprint2022arXiv

An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio

Many effective attempts have been made for fake audio detection. However, they can only provide detection results but no countermeasures to curb this harm. For many related practical applications, what model or algorithm generated the fake audio also is needed. Therefore, We propose a new problem for detecting vocoder fingerprints of fake audio. Experiments are conducted on the datasets synthesized by eight state-of-the-art vocoders. We have preliminarily explored the features and model architectures. The t-SNE visualization shows that different vocoders generate distinct vocoder fingerprints.

preprint2022arXiv

An STDP-Based Supervised Learning Algorithm for Spiking Neural Networks

Compared with rate-based artificial neural networks, Spiking Neural Networks (SNN) provide a more biological plausible model for the brain. But how they perform supervised learning remains elusive. Inspired by recent works of Bengio et al., we propose a supervised learning algorithm based on Spike-Timing Dependent Plasticity (STDP) for a hierarchical SNN consisting of Leaky Integrate-and-fire (LIF) neurons. A time window is designed for the presynaptic neuron and only the spikes in this window take part in the STDP updating process. The model is trained on the MNIST dataset. The classification accuracy approach that of a Multilayer Perceptron (MLP) with similar architecture trained by the standard back-propagation algorithm.

preprint2022arXiv

Black hole thermodynamics is extensive with variable Newton constant

Inspired by the recent studies on the thermodynamics of AdS black holes in the restricted phase space formalism, we propose a similar formalism for the thermodynamics of non-AdS black holes with variable Newton constant. It is shown that, by introducing the new variables $N,μ$, where $N$ is proportional to the inverse Newton constant and $μ$ its conjugate variable, referred to as the chemical potential, the black hole thermodynamics can be formulated in a form which is consistent with the standard extensive thermodynamics for open macroscopic systems, with the first law and the Euler relation hold simultaneously. This formalism has profound implications, in particular, the mass is a homogeneous function of the first order in the extensive variables and the intensive variables are zeroth order homogeneous functions. The chemical potential is shown to be closely related to the Euclidean action evaluated at the black hole configuration.

preprint2022arXiv

Blind Surveillance Image Quality Assessment via Deep Neural Network Combined with the Visual Saliency

The intelligent video surveillance system (IVSS) can automatically analyze the content of the surveillance image (SI) and reduce the burden of the manual labour. However, the SIs may suffer quality degradations in the procedure of acquisition, compression, and transmission, which makes IVSS hard to understand the content of SIs. In this paper, we first conduct an example experiment (i.e. the face detection task) to demonstrate that the quality of the SIs has a crucial impact on the performance of the IVSS, and then propose a saliency-based deep neural network for the blind quality assessment of the SIs, which helps IVSS to filter the low-quality SIs and improve the detection and recognition performance. Specifically, we first compute the saliency map of the SI to select the most salient local region since the salient regions usually contain rich semantic information for machine vision and thus have a great impact on the overall quality of the SIs. Next, the convolutional neural network (CNN) is adopted to extract quality-aware features for the whole image and local region, which are then mapped into the global and local quality scores through the fully connected (FC) network respectively. Finally, the overall quality score is computed as the weighted sum of the global and local quality scores. Experimental results on the SI quality database (SIQD) show that the proposed method outperforms all compared state-of-the-art BIQA methods.

preprint2022arXiv

CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing

The text-based speech editor allows the editing of speech through intuitive cutting, copying, and pasting operations to speed up the process of editing speech. However, the major drawback of current systems is that edited speech often sounds unnatural due to cut-copy-paste operation. In addition, it is not obvious how to synthesize records according to a new word not appearing in the transcript. This paper proposes a novel end-to-end text-based speech editing method called context-aware mask prediction network (CampNet). The model can simulate the text-based speech editing process by randomly masking part of speech and then predicting the masked region by sensing the speech context. It can solve unnatural prosody in the edited region and synthesize the speech corresponding to the unseen words in the transcript. Secondly, for the possible operation of text-based speech editing, we design three text-based operations based on CampNet: deletion, insertion, and replacement. These operations can cover various situations of speech editing. Thirdly, to synthesize the speech corresponding to long text in insertion and replacement operations, a word-level autoregressive generation method is proposed. Fourthly, we propose a speaker adaptation method using only one sentence for CampNet and explore the ability of few-shot learning based on CampNet, which provides a new idea for speech forgery tasks. The subjective and objective experiments on VCTK and LibriTTS datasets show that the speech editing results based on CampNet are better than TTS technology, manual editing, and VoCo method. We also conduct detailed ablation experiments to explore the effect of the CampNet structure on its performance. Finally, the experiment shows that speaker adaptation with only one sentence can further improve the naturalness of speech. Examples of generated speech can be found at https://hairuo55.github.io/CampNet.

preprint2022arXiv

Convergence Rates in Uniform Ergodicity by Hitting Times and $L^2$-exponential Convergence Rates

Generally the convergence rate in exponential ergodicity $λ$ is an upper bound for the convergence rate $κ$ in uniform ergodicity for a Markov process, that is $λ\geqslantκ$. In this paper, we prove that $κ\geqslant \inf \{lambda,1/M_H\}$, where $M_H$ is a uniform bound on the moment of the hitting time to a "compact" set $H$. In the case where $M_H$ can be made arbitrarily small for $H$ large enough, we obtain that $λ=κ$. The general results are applied to Markov chains, diffusion processes and solutions to SDEs driven by symmetric stable processes.

preprint2022arXiv

Core localized alpha-channeling via low frequency Alfven mode generation in reversed shear scenarios

A novel channel for fuel ions heating in tokamak core plasma is proposed and analyzed using nonlinear gyrokinetic theory. The channel is achieved via spontaneous decay of reversed shear Alfven eigenmode (RSAE) into low frequency Alfven modes (LFAM), which then heat fuel ions via collisionless ion Landau damping. The conditions for RSAE spontaneous decay are investigated, and the saturation level and the consequent fuel ion heating rate are also derived. The channel is expected to be crucial for future reactors operating under reversed shear configurations, where fusion alpha particles are generated in the tokamak core where the magnetic shear is typically reversed, and there is a dense RSAE spectrum due to the small alpha particle characteristic dimensionless orbits.

preprint2022arXiv

COVID-19 Hospitalizations Forecasts Using Internet Search Data

As the COVID-19 spread over the globe and new variants of COVID-19 keep occurring, reliable real-time forecasts of COVID-19 hospitalizations are critical for public health decision on medical resources allocations such as ICU beds, ventilators, and personnel to prepare for the surge of COVID-19 pandemics. Inspired by the strong association between public search behavior and hospitalization admission, we extended previously-proposed influenza tracking model, ARGO (AutoRegression with GOogle search data), to predict future 2-week national and state-level COVID-19 new hospital admissions. Leveraging the COVID-19 related time series information and Google search data, our method is able to robustly capture new COVID-19 variants' surges, and self-correct at both national and state level. Based on our retrospective out-of-sample evaluation over 12-month comparison period, our method achieves on average 15\% error reduction over the best alternative models collected from COVID-19 forecast hub. Overall, we showed that our method is flexible, self-correcting, robust, accurate, and interpretable, making it a potentially powerful tool to assist health-care officials and decision making for the current and future infectious disease outbreak.

preprint2022arXiv

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition

The MultiCoNER shared task aims at detecting semantically ambiguous and complex named entities in short and low-context settings for multiple languages. The lack of contexts makes the recognition of ambiguous named entities challenging. To alleviate this issue, our team DAMO-NLP proposes a knowledge-based system, where we build a multilingual knowledge base based on Wikipedia to provide related context information to the named entity recognition (NER) model. Given an input sentence, our system effectively retrieves related contexts from the knowledge base. The original input sentences are then augmented with such context information, allowing significantly better contextualized token representations to be captured. Our system wins 10 out of 13 tracks in the MultiCoNER shared task.

preprint2022arXiv

Deep Neural Network for Blind Visual Quality Assessment of 4K Content

The 4K content can deliver a more immersive visual experience to consumers due to the huge improvement of spatial resolution. However, existing blind image quality assessment (BIQA) methods are not suitable for the original and upscaled 4K contents due to the expanded resolution and specific distortions. In this paper, we propose a deep learning-based BIQA model for 4K content, which on one hand can recognize true and pseudo 4K content and on the other hand can evaluate their perceptual visual quality. Considering the characteristic that high spatial resolution can represent more abundant high-frequency information, we first propose a Grey-level Co-occurrence Matrix (GLCM) based texture complexity measure to select three representative image patches from a 4K image, which can reduce the computational complexity and is proven to be very effective for the overall quality prediction through experiments. Then we extract different kinds of visual features from the intermediate layers of the convolutional neural network (CNN) and integrate them into the quality-aware feature representation. Finally, two multilayer perception (MLP) networks are utilized to map the quality-aware features into the class probability and the quality score for each patch respectively. The overall quality index is obtained through the average pooling of patch results. The proposed model is trained through the multi-task learning manner and we introduce an uncertainty principle to balance the losses of the classification and regression tasks. The experimental results show that the proposed model outperforms all compared BIQA metrics on four 4K content quality assessment databases.

preprint2022arXiv

Deep Probabilistic Graph Matching

Most previous learning-based graph matching algorithms solve the \textit{quadratic assignment problem} (QAP) by dropping one or more of the matching constraints and adopting a relaxed assignment solver to obtain sub-optimal correspondences. Such relaxation may actually weaken the original graph matching problem, and in turn hurt the matching performance. In this paper we propose a deep learning-based graph matching framework that works for the original QAP without compromising on the matching constraints. In particular, we design an affinity-assignment prediction network to jointly learn the pairwise affinity and estimate the node assignments, and we then develop a differentiable solver inspired by the probabilistic perspective of the pairwise affinities. Aiming to obtain better matching results, the probabilistic solver refines the estimated assignments in an iterative manner to impose both discrete and one-to-one matching constraints. The proposed method is evaluated on three popularly tested benchmarks (Pascal VOC, Willow Object and SPair-71k), and it outperforms all previous state-of-the-arts on all benchmarks.

preprint2022arXiv

Dense Gas and Star Formation in Nearby Infrared Bright Galaxies: APEX survey of HCN and HCO+ J=2-1

Both Galactic and extragalactic studies on star formation suggest that stars form directly from dense molecular gas. To trace such high volume density gas, HCN and HCO+ J=1-0 have been widely used for their high dipole moments, relatively high abundances, and often being the strongest lines after CO. However, HCN and HCO+ J=1-0 emission could be arguably dominated by the gas components at low volume densities. HCN J=2-1 and HCO+ J=2-1, with more suitable critical densities and excitation requirements, would trace typical dense gas closely related to star formation. Here we report new observations of HCN J=2-1 and HCO+ J=2-1 towards 17 nearby infrared-bright galaxies with the APEX 12-m telescope. The correlation slopes between luminosities of HCN J=2-1, and HCO+ J=2-1 and total infrared emission are 1.03 +- 0.05 and 1.00 +- 0.05, respectively. The correlations of their surface densities, normalised with the area of radio/sub-millimeter continuum, show even tighter relations (Slopes: 0.99 +- 0.03 and 1.02 +- 0.03). The eight AGN-dominated galaxies show no significant difference from the eleven star-formation dominated galaxies in above relations. The average HCN/HCO+ ratios are 1.15 +- 0.26 and 0.98 +- 0.42 for AGN-dominated and star-formation dominated galaxies, respectively, without obvious dependencies on infrared luminosity, dust temperature, or infrared pumping. The Magellanic Clouds roughly follow the same correlations, expanding to eight orders of magnitude. On the other hand, ultra-luminous infrared galaxies with active galactic nucleus (AGN) systematically lay above the correlations, indicating potential biases introduced by AGNs.

preprint2022arXiv

End-to-end video instance segmentation via spatial-temporal graph neural networks

Video instance segmentation is a challenging task that extends image instance segmentation to the video domain. Existing methods either rely only on single-frame information for the detection and segmentation subproblems or handle tracking as a separate post-processing step, which limit their capability to fully leverage and share useful spatial-temporal information for all the subproblems. In this paper, we propose a novel graph-neural-network (GNN) based method to handle the aforementioned limitation. Specifically, graph nodes representing instance features are used for detection and segmentation while graph edges representing instance relations are used for tracking. Both inter and intra-frame information is effectively propagated and shared via graph updates and all the subproblems (i.e. detection, segmentation and tracking) are jointly optimized in an unified framework. The performance of our method shows great improvement on the YoutubeVIS validation dataset compared to existing methods and achieves 35.2% AP with a ResNet-50 backbone, operating at 22 FPS. Code is available at http://github.com/lucaswithai/visgraph.git .

preprint2022arXiv

Floquet engineering Hz-Level Rabi Spectra in Shallow Optical Lattice Clock

Quantum metrology with ultra-high precision usually requires atoms prepared in an ultra-stable environment with well-defined quantum states. Thus, in optical lattice clock systems deep lattice potentials are used to trap ultra-cold atoms. However, decoherence, induced by Raman scattering and higher order light shifts, can significantly be reduced if atomic clocks are realized in shallow optical lattices. On the other hand, in such lattices, tunneling among different sites can cause additional dephasing and strongly broadening of the Rabi spectrum. Here, in our experiment, we periodically drive a shallow $^{87}$Sr optical lattice clock. Counter intuitively, shaking the system can deform the wide broad spectral line into a sharp peak with 5.4Hz line-width. With careful comparison between the theory and experiment, we demonstrate that the Rabi frequency and the Bloch bands can be tuned, simultaneously and independently. Our work not only provides a different idea for quantum metrology, such as building shallow optical lattice clock in outer space, but also paves the way for quantum simulation of new phases of matter by engineering exotic spin orbit couplings.

preprint2022arXiv

FRIH: Fine-grained Region-aware Image Harmonization

Image harmonization aims to generate a more realistic appearance of foreground and background for a composite image. Existing methods perform the same harmonization process for the whole foreground. However, the implanted foreground always contains different appearance patterns. All the existing solutions ignore the difference of each color block and losing some specific details. Therefore, we propose a novel global-local two stages framework for Fine-grained Region-aware Image Harmonization (FRIH), which is trained end-to-end. In the first stage, the whole input foreground mask is used to make a global coarse-grained harmonization. In the second stage, we adaptively cluster the input foreground mask into several submasks by the corresponding pixel RGB values in the composite image. Each submask and the coarsely adjusted image are concatenated respectively and fed into a lightweight cascaded module, adjusting the global harmonization performance according to the region-aware local feature. Moreover, we further designed a fusion prediction module by fusing features from all the cascaded decoder layers together to generate the final result, which could utilize the different degrees of harmonization results comprehensively. Without bells and whistles, our FRIH algorithm achieves the best performance on iHarmony4 dataset (PSNR is 38.19 dB) with a lightweight model. The parameters for our model are only 11.98 M, far below the existing methods.

preprint2022arXiv

Geometric Synthesis: A Free lunch for Large-scale Palmprint Recognition Model Pretraining

Palmprints are private and stable information for biometric recognition. In the deep learning era, the development of palmprint recognition is limited by the lack of sufficient training data. In this paper, by observing that palmar creases are the key information to deep-learning-based palmprint recognition, we propose to synthesize training data by manipulating palmar creases. Concretely, we introduce an intuitive geometric model which represents palmar creases with parameterized Bézier curves. By randomly sampling Bézier parameters, we can synthesize massive training samples of diverse identities, which enables us to pretrain large-scale palmprint recognition models. Experimental results demonstrate that such synthetically pretrained models have a very strong generalization ability: they can be efficiently transferred to real datasets, leading to significant performance improvements on palmprint recognition. For example, under the open-set protocol, our method improves the strong ArcFace baseline by more than 10\% in terms of TAR@1e-6. And under the closed-set protocol, our method reduces the equal error rate (EER) by an order of magnitude.

preprint2022arXiv

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Scaling language models with more data, compute and parameters has driven significant progress in natural language processing. For example, thanks to scaling, GPT-3 was able to achieve strong results on in-context learning tasks. However, training these large dense models requires significant amounts of computing resources. In this paper, we propose and develop a family of language models named GLaM (Generalist Language Model), which uses a sparsely activated mixture-of-experts architecture to scale the model capacity while also incurring substantially less training cost compared to dense variants. The largest GLaM has 1.2 trillion parameters, which is approximately 7x larger than GPT-3. It consumes only 1/3 of the energy used to train GPT-3 and requires half of the computation flops for inference, while still achieving better overall zero-shot and one-shot performance across 29 NLP tasks.

preprint2022arXiv

GLAN: A Graph-based Linear Assignment Network

Differentiable solvers for the linear assignment problem (LAP) have attracted much research attention in recent years, which are usually embedded into learning frameworks as components. However, previous algorithms, with or without learning strategies, usually suffer from the degradation of the optimality with the increment of the problem size. In this paper, we propose a learnable linear assignment solver based on deep graph networks. Specifically, we first transform the cost matrix to a bipartite graph and convert the assignment task to the problem of selecting reliable edges from the constructed graph. Subsequently, a deep graph network is developed to aggregate and update the features of nodes and edges. Finally, the network predicts a label for each edge that indicates the assignment relationship. The experimental results on a synthetic dataset reveal that our method outperforms state-of-the-art baselines and achieves consistently high accuracy with the increment of the problem size. Furthermore, we also embed the proposed solver, in comparison with state-of-the-art baseline solvers, into a popular multi-object tracking (MOT) framework to train the tracker in an end-to-end manner. The experimental results on MOT benchmarks illustrate that the proposed LAP solver improves the tracker by the largest margin.

preprint2022arXiv

Isolation mechanisms for high-speed packet-processing pipelines

Data-plane programmability is now mainstream. As we find more use cases, deployments need to be able to run multiple packet-processing modules in a single device. These are likely to be developed by independent teams, either within the same organization or from multiple organizations. Therefore, we need isolation mechanisms to ensure that modules on the same device do not interfere with each other. This paper presents Menshen, an extension of the Reconfigurable Match Tables (RMT) pipeline that enforces isolation between different packet-processing modules. Menshen is comprised of a set of lightweight hardware primitives and an extension to the open source P4-16 reference compiler that act in conjunction to meet this goal. We have prototyped Menshen on two FPGA platforms (NetFPGA and Corundum). We show that our design provides isolation, and allows new modules to be loaded without impacting the ones already running. Finally, we demonstrate that feasibility of implementing Menshen on ASICs by using the FreePDK45nm technology library and the Synopsys DC synthesis software, showing that our design meets timing at a 1GHz clock frequency and needs approximately 6% additional chip area. We have open sourced the code for Menshen's hardware and software at https://isolation.quest/.

preprint2022arXiv

Low star-formation activity and low gas content of quiescent galaxies at $z=$ 3.5-4.0 constrained with ALMA

The discovery in deep near-infrared surveys of a population of massive quiescent galaxies at $z>3$ has given rise to the question of how they came to be quenched so early in the history of the Universe. Measuring their molecular gas properties can distinguish between physical processes where they stop forming stars due to a lack of fuel versus those where star-formation efficiency is reduced and the gas is retained. We conducted Atacama Large Millimeter/sub-millimeter Array (ALMA) observations of four quiescent galaxies at $z=$ 3.5-4.0 found by the Fourstar Galaxy Evolution Survey (ZFOURGE) and a serendipitous optically dark galaxy at $z=3.71$. We aim to investigate the presence of dust-obscured star-formation and their gas content by observing the dust continuum emission at Band-7 and the atomic carbon [C I]($^3P_1$-$^3P_0$) line at 492.16 GHz. Among the four quiescent galaxies, only one source is detected in the dust continuum at $λ_{\rm obs} = 870 {\rm μm}$. The sub-mm observations confirm their passive nature, and all of them are located more than four times below the main sequence of star-forming galaxies at $z=3.7$. None of the targets are detected in [C I], constraining their gas mass fractions to be $<$ 20%. These gas mass fractions are more than three times lower than the scaling relation for star-forming galaxies at $z=3.7$. These results support scenarios where massive galaxies at $z=$ 3.5-4.0 quench by consuming/expelling all the gas rather than by reducing the efficiency of the conversion of their gas into stars.

preprint2022arXiv

Multiple Instance Neural Networks Based on Sparse Attention for Cancer Detection using T-cell Receptor Sequences

Early detection of cancers has been much explored due to its paramount importance in biomedical fields. Among different types of data used to answer this biological question, studies based on T cell receptors (TCRs) are under recent spotlight due to the growing appreciation of the roles of the host immunity system in tumor biology. However, the one-to-many correspondence between a patient and multiple TCR sequences hinders researchers from simply adopting classical statistical/machine learning methods. There were recent attempts to model this type of data in the context of multiple instance learning (MIL). Despite the novel application of MIL to cancer detection using TCR sequences and the demonstrated adequate performance in several tumor types, there is still room for improvement, especially for certain cancer types. Furthermore, explainable neural network models are not fully investigated for this application. In this article, we propose multiple instance neural networks based on sparse attention (MINN-SA) to enhance the performance in cancer detection and explainability. The sparse attention structure drops out uninformative instances in each bag, achieving both interpretability and better predictive performance in combination with the skip connection. Our experiments show that MINN-SA yields the highest area under the ROC curve (AUC) scores on average measured across 10 different types of cancers, compared to existing MIL approaches. Moreover, we observe from the estimated attentions that MINN-SA can identify the TCRs that are specific for tumor antigens in the same T cell repertoire.

preprint2022arXiv

NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation

The traditional vocoders have the advantages of high synthesis efficiency, strong interpretability, and speech editability, while the neural vocoders have the advantage of high synthesis quality. To combine the advantages of two vocoders, inspired by the traditional deterministic plus stochastic model, this paper proposes a novel neural vocoder named NeuralDPS which can retain high speech quality and acquire high synthesis efficiency and noise controllability. Firstly, this framework contains four modules: a deterministic source module, a stochastic source module, a neural V/UV decision module and a neural filter module. The input required by the vocoder is just the spectral parameter, which avoids the error caused by estimating additional parameters, such as F0. Secondly, to solve the problem that different frequency bands may have different proportions of deterministic components and stochastic components, a multiband excitation strategy is used to generate a more accurate excitation signal and reduce the neural filter&#39;s burden. Thirdly, a method to control noise components of speech is proposed. In this way, the signal-to-noise ratio (SNR) of speech can be adjusted easily. Objective and subjective experimental results show that our proposed NeuralDPS vocoder can obtain similar performance with the WaveNet and it generates waveforms at least 280 times faster than the WaveNet vocoder. It is also 28% faster than WaveGAN&#39;s synthesis efficiency on a single CPU core. We have also verified through experiments that this method can effectively control the noise components in the predicted speech and adjust the SNR of speech. Examples of generated speech can be found at https://hairuo55.github.io/NeuralDPS.

preprint2022arXiv

No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

To improve the viewer&#39;s Quality of Experience (QoE) and optimize computer graphics applications, 3D model quality assessment (3D-QA) has become an important task in the multimedia area. Point cloud and mesh are the two most widely used digital representation formats of 3D models, the visual quality of which is quite sensitive to lossy operations like simplification and compression. Therefore, many related studies such as point cloud quality assessment (PCQA) and mesh quality assessment (MQA) have been carried out to measure the visual quality degradations of 3D models. However, a large part of previous studies utilize full-reference (FR) metrics, which indicates they can not predict the quality level with the absence of the reference 3D model. Furthermore, few 3D-QA metrics consider color information, which significantly restricts their effectiveness and scope of application. In this paper, we propose a no-reference (NR) quality assessment metric for colored 3D models represented by both point cloud and mesh. First, we project the 3D models from 3D space into quality-related geometry and color feature domains. Then, the 3D natural scene statistics (3D-NSS) and entropy are utilized to extract quality-aware features. Finally, machine learning is employed to regress the quality-aware features into visual quality scores. Our method is validated on the colored point cloud quality assessment database (SJTU-PCQA), the Waterloo point cloud assessment database (WPC), and the colored mesh quality assessment database (CMDM). The experimental results show that the proposed method outperforms most compared NR 3D-QA metrics with competitive computational resources and greatly reduces the performance gap with the state-of-the-art FR 3D-QA metrics. The code of the proposed model is publicly available now to facilitate further research.

preprint2022arXiv

Nonlinear Stability of MHD Contact Discontinuities with Surface Tension

We consider the motion of two inviscid, compressible, and electrically conducting fluids separated by an interface across which there is no fluid flow in the presence of surface tension. The magnetic field is supposed to be nowhere tangential to the interface. This leads to the characteristic free boundary problem for contact discontinuities with surface tension in three-dimensional ideal compressible magnetohydrodynamics (MHD). We prove the nonlinear structural stability of MHD contact discontinuities with surface tension in Sobolev spaces by a modified Nash--Moser iteration scheme. The main ingredient of our proof is deriving the resolution and tame estimate of the linearized problem in usual Sobolev spaces of sufficiently large regularity. In particular, for solving the linearized problem, we introduce a suitable regularization that preserves the transport-type structure for the linearized entropy and divergence of the magnetic field.

preprint2022arXiv

Nuclear phase retrieval spectroscopy using resonant x-ray scattering

Light-matter interaction is exploited in spectroscopic techniques to access information about molecular, atomic or nuclear constituents of the sample of interest. While scattered light carries both amplitude and phase information of the electromagnetic field, most of the time the latter is lost in intensity measurements. However, often the phase information is paramount to reconstruct the desired information of the target, as it is well known from coherent x-ray imaging. Here we introduce a new phase retrieval algorithm which allows us to reconstruct the field phase information from two-dimensional time- and energy-resolved spectra. We apply this method to the particular case of x-ray scattering off Mössbauer nuclei at a synchrotron radiation source. Knowledge of the phase allows also for an excellent reconstruction of the energy spectra from experimental data, which could not be achieved with this resolution otherwise. Our approach provides an efficient novel data analysis tool which will benefit x-ray quantum optics and Mössbauer spectroscopy with synchrotron radiation alike.

preprint2022arXiv

Planar graphs without normally adjacent short cycles

Let $\mathscr{G}$ be the class of plane graphs without triangles normally adjacent to $8^{-}$-cycles, without $4$-cycles normally adjacent to $6^{-}$-cycles, and without normally adjacent $5$-cycles. In this paper, it is shown that every graph in $\mathscr{G}$ is $3$-choosable. Instead of proving this result, we directly prove a stronger result in the form of ``weakly&#39;&#39; DP-$3$-coloring. The main theorem improves the results in [J. Combin. Theory Ser. B 129 (2018) 38--54; European J. Combin. 82 (2019) 102995]. Consequently, every planar graph without $4$-, $6$-, $8$-cycles is $3$-choosable, and every planar graph without $4$-, $5$-, $7$-, $8$-cycles is $3$-choosable. In the third section, using almost the same technique, we prove that the vertex set of every graph in $\mathscr{G}$ can be partitioned into an independent set and a set that induces a forest, which strengthens the result in [Discrete Appl. Math. 284 (2020) 626--630]. In the final section, tightness is discussed.

preprint2022arXiv

PnP-DETR: Towards Efficient Visual Analysis with Transformers

Recently, DETR pioneered the solution of vision tasks with transformers, it directly translates the image feature map into the object detection result. Though effective, translating the full feature map can be costly due to redundant computation on some area like the background. In this work, we encapsulate the idea of reducing spatial redundancy into a novel poll and pool (PnP) sampling module, with which we build an end-to-end PnP-DETR architecture that adaptively allocates its computation spatially to be more efficient. Concretely, the PnP module abstracts the image feature map into fine foreground object feature vectors and a small number of coarse background contextual feature vectors. The transformer models information interaction within the fine-coarse feature space and translates the features into the detection result. Moreover, the PnP-augmented model can instantly achieve various desired trade-offs between performance and computation with a single model by varying the sampled feature length, without requiring to train multiple models as existing methods. Thus it offers greater flexibility for deployment in diverse scenarios with varying computation constraint. We further validate the generalizability of the PnP module on panoptic segmentation and the recent transformer-based image recognition model ViT and show consistent efficiency gain. We believe our method makes a step for efficient visual analysis with transformers, wherein spatial redundancy is commonly observed. Code will be available at \url{https://github.com/twangnh/pnp-detr}.

preprint2022arXiv

PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision

Existing self-supervised 3D human pose estimation schemes have largely relied on weak supervisions like consistency loss to guide the learning, which, inevitably, leads to inferior results in real-world scenarios with unseen poses. In this paper, we propose a novel self-supervised approach that allows us to explicitly generate 2D-3D pose pairs for augmenting supervision, through a self-enhancing dual-loop learning framework. This is made possible via introducing a reinforcement-learning-based imitator, which is learned jointly with a pose estimator alongside a pose hallucinator; the three components form two loops during the training process, complementing and strengthening one another. Specifically, the pose estimator transforms an input 2D pose sequence to a low-fidelity 3D output, which is then enhanced by the imitator that enforces physical constraints. The refined 3D poses are subsequently fed to the hallucinator for producing even more diverse data, which are, in turn, strengthened by the imitator and further utilized to train the pose estimator. Such a co-evolution scheme, in practice, enables training a pose estimator on self-generated motion data without relying on any given 3D data. Extensive experiments across various benchmarks demonstrate that our approach yields encouraging results significantly outperforming the state of the art and, in some cases, even on par with results of fully-supervised methods. Notably, it achieves 89.1% 3D PCK on MPI-INF-3DHP under self-supervised cross-dataset evaluation setup, improving upon the previous best self-supervised methods by 8.6%. Code can be found at: https://github.com/Garfield-kh/PoseTriplet

preprint2022arXiv

Quantum phase diagram for two species hardcore bosons in one-dimensional optical lattices with the resonantly driven Rabi frequency

We propose an experimental realization of the time-periodically modulated Rabi frequency and suggest density-dependent hoppings of two species hardcore bosons in a one-dimensional optical lattice. Distinct from the previous work [Phys. Rev. Research {\bf 2}, 013275 (2020)], we study effects in the first resonance region. In the effective Hamiltonian, the intra-species hopping occurs only if the density discrepancy of the other species on these sites is zero, while the inter-species one is allowed once the relevant density discrepancy becomes nonzero. At integer-$1$ filling, the quantum phase diagram of the effective Hamiltonian is determined by the perturbation analysis together with numerical calculations. We find that in the limit of dominant $J_{1}$, the system becomes a double-degenerate dimerized state, with spontaneously breaking the translation symmetry. The interplay of $J_{0}$, $J_{1}$ and the fixed ${\bar U}=1$ leads to three BKT transition lines and a tricritical BKT point. Exact transition lines are obtained by the level spectroscopic technique. Besides, general physical properties, including the charge gap, neutral gap, superfluid density and dimerization strength, are investigated as well.

preprint2022arXiv

Single UHD Image Dehazing via Interpretable Pyramid Network

Currently, most single image dehazing models cannot run an ultra-high-resolution (UHD) image with a single GPU shader in real-time. To address the problem, we introduce the principle of infinite approximation of Taylor&#39;s theorem with the Laplace pyramid pattern to build a model which is capable of handling 4K hazy images in real-time. The N branch networks of the pyramid network correspond to the N constraint terms in Taylor&#39;s theorem. Low-order polynomials reconstruct the low-frequency information of the image (e.g. color, illumination). High-order polynomials regress the high-frequency information of the image (e.g. texture). In addition, we propose a Tucker reconstruction-based regularization term that acts on each branch network of the pyramid model. It further constrains the generation of anomalous signals in the feature space. Extensive experimental results demonstrate that our approach can not only run 4K images with haze in real-time on a single GPU (80FPS) but also has unparalleled interpretability. The developed method achieves state-of-the-art (SOTA) performance on two benchmarks (O/I-HAZE) and our updated 4KID dataset while providing the reliable groundwork for subsequent optimization schemes.

preprint2022arXiv

Starbursts with suppressed velocity dispersion revealed in a forming cluster at z=2.51

One of the most prominent features of galaxy clusters is the presence of a dominant population of massive ellipticals in their cores. Stellar archaeology suggests that these gigantic beasts assembled most of their stars in the early Universe via starbursts. However, the role of dense environments and their detailed physical mechanisms in triggering starburst activities remain unknown. Here we report spatially resolved Atacama Large Millimeter/submillimeter Array (ALMA) observations of the CO $J= 3-2$ emission line, with a resolution of about 2.5 kiloparsecs, toward a forming galaxy cluster core with starburst galaxies at $z=2.51$. In contrast to starburst galaxies in the field often associated with galaxy mergers or highly turbulent gaseous disks, our observations show that the two starbursts in the cluster exhibit dynamically cold (rotation-dominated) gas-rich disks. Their gas disks have extremely low velocity dispersion ($σ_{\mathrm{0}} \sim 20-30$ km s$^{-1}$), which is three times lower than their field counterparts at similar redshifts. The high gas fraction and suppressed velocity dispersion yield gravitationally unstable gas disks, which enables highly efficient star formation. The suppressed velocity dispersion, likely induced by the accretion of corotating and coplanar cold gas, might serve as an essential avenue to trigger starbursts in massive halos at high redshifts.

preprint2022arXiv

Sub-percentage measure of distances to redshift of 0.1 by a new cosmic ruler

Distance-redshift diagrams probe expansion history of the Universe. We show that the stellar mass-binding energy (massE) relation of galaxies proposed in our previous study offers a new distance ruler at cosmic scales. By using elliptical galaxies in the main galaxy sample of the Sloan Digital Sky Survey Data Release 7, we construct a distance-redshift diagram over the redshift range from 0.05 to 0.2 with the massE ruler. The best-fit dark energy density is 0.675+-0.079 for flat Lambda-CDM, consistent with those by other probes. At the median redshift of 0.11, the median distance is estimated to have a fractional error of 0.34%, much lower than those by supernova (SN) Ia and baryonic acoustic oscillation (BAO) and even exceeding their future capability at this redshift. The above low-z measurement is useful for probing dark energy that dominates at the late Universe. For a flat dark energy equation of state model (flat wCDM), the massE alone constrains w to an error that is only a factor of 2.2, 1.7 and 1.3 times larger than those by BAO, SN Ia, and cosmic microwave background (CMB), respectively.

preprint2022arXiv

Subjective Quality Assessment for Images Generated by Computer Graphics

With the development of rendering techniques, computer graphics generated images (CGIs) have been widely used in practical application scenarios such as architecture design, video games, simulators, movies, etc. Different from natural scene images (NSIs), the distortions of CGIs are usually caused by poor rending settings and limited computation resources. What&#39;s more, some CGIs may also suffer from compression distortions in transmission systems like cloud gaming and stream media. However, limited work has been put forward to tackle the problem of computer graphics generated images&#39; quality assessment (CG-IQA). Therefore, in this paper, we establish a large-scale subjective CG-IQA database to deal with the challenge of CG-IQA tasks. We collect 25,454 in-the-wild CGIs through previous databases and personal collection. After data cleaning, we carefully select 1,200 CGIs to conduct the subjective experiment. Several popular no-reference image quality assessment (NR-IQA) methods are tested on our database. The experimental results show that the handcrafted-based methods achieve low correlation with subjective judgment and deep learning based methods obtain relatively better performance, which demonstrates that the current NR-IQA models are not suitable for CG-IQA tasks and more effective models are urgently needed.

preprint2022arXiv

Superconductivity in the Uniform Electron Gas: Irrelevance of Kohn-Luttinger Mechanism

We study the Cooper instability in jellium model in the controlled regime of small to intermediate values of the Coulomb parameter $r_s \leq 2$. We confirm that superconductivity naturally emerges from purely repulsive interactions described by the Kukkonen-Overhauser vertex function. By employing the implicit renormalization approach we reveal that even in the small-$r_s$ limit, the dominant mechanism behind Cooper instability is based on dynamic screening of the Coulomb interaction--accurately captured by the random phase approximation, whereas the Kohn-Luttinger contribution is negligibly small and, thus, not relevant.

preprint2022arXiv

The Black Hole-Galaxy Connection: Interplay between Feedback, Obscuration, and Host Galaxy Substructure

There is growing evidence for physical influence between supermassive black holes and their host galaxies. We present a case study of nearby galaxy NGC 7582, for which we find evidence that galactic substructure plays an important role in affecting the collimation of ionized outflows as well as contributing to the heavy active galactic nucleus (AGN) obscuration. This result contrasts with a simple, small-scale AGN torus model, according to which AGN wind collimation may take place inside the torus itself, at subparsec scale. Using 3D spectroscopy with the MUSE instrument, we probe the kinematics of the stellar and ionized gas components as well as the ionization state of the gas from a combination of emission line ratios. We report for the first time a kinematically distinct core (KDC) in NGC 7582, on a scale of ~600pc. This KDC coincides spatially with dust lanes and starbursting complexes previously observed. We interpret it as a circumnuclear ring of stars and dusty, gas-rich material. We obtain a clear view of the outflowing cones over kpc scales, and demonstrate that they are predominantly photoionized by the central engine. We detect the back cone (behind the galaxy), and confirm previous results of a large nuclear obscuration of both the stellar continuum and HII regions. While we tentatively associate the presence of the KDC to a large-scale bar and/or a minor galaxy merger, we stress the importance of gaining a better understanding of the role of galaxy substructure in controlling the fueling, feedback and obscuration of AGN.

preprint2022arXiv

The Development and Prospect of Code Clone

The application of code clone technology accelerates code search, improves code reuse efficiency, and assists in software quality assessment and code vulnerability detection. However, the application of code clones also introduces software quality issues and increases the cost of software maintenance. As an important research field in software engineering, code clone has been extensively explored and studied by researchers, and related studies on various sub-research fields have emerged, including code clone detection, code clone evolution, code clone analysis, etc. However, there lacks a comprehensive exploration of the entire field of code clone, as well as an analysis of the trend of each sub-research field. This paper collects related work of code clones in the past ten years. In summary, the contributions of this paper mainly include: (1) summarize and classify the sub-research fields of code clone, and explore the relative popularity and relation of these sub-research fields; (2) analyze the overall research trend of code clone and each sub-research field; (3) compare and analyze the difference between academy and industry regarding code clone research; (4) construct a network of researchers, and excavate the major contributors in code clone research field; (5) The list of popular conferences and journals was statistically analyzed. The popular research directions in the future include clone visualization, clone management, etc. For the clone detection technique, researchers can optimize the scalability and execution efficiency of the method, targeting particular clone detection tasks and contextual environments, or apply the technology to other related research fields continuously.

preprint2022arXiv

The molecular gas resolved by ALMA in the low-metallicity dwarf merging galaxy Haro 11

The physical mechanisms for starburst or quenching in less massive ($M_* < 10^{10} M_{\odot}$) galaxies are unclear. The merger is one of the inescapable processes referred to as both starburst and quenching in massive galaxies. However, the effects of the merger on star formation in dwarf galaxies and their evolution results are still uncertain. We aim to explore how to trigger and quench star formation in dwarf galaxies by studying the metal-poor gas-rich dwarf mergers based on the multi-band observations at a spatial resolution of $\sim$ 460 pc. We use the archival data of ALMA (band 3, 8) and VLT/MUSE to map CO($J=$1-0), [CI]($^3$P$_1 - ^3$P$_0$), and H$α$ emission in one of the most extreme starburst merging dwarf galaxies, Haro 11. We find the molecular gas is assembled around the central two star-forming regions. The molecular/ionized gas and stellar components show complex kinematics, indicating that the gas is probably at a combined stage of collision of clouds and feedback from star formation. The peak location and distribution of [CI](1-0) strongly resemble the CO(1-0) emission, meaning that it might trace the same molecular gas as CO in such a dwarf merger starburst galaxy. The enhancement of line ratios ($\sim 0.5$) of [CI]/CO around knot C is probably generated by the dissociation of CO molecules by cosmic rays and far-ultraviolet photons. Globally, Haro 11 and its star-forming regions share similar SFEs as the high-$z$ starburst galaxies or the clumps in nearby (U)LIRGs. Given the high SFE, sSFR, small stellar mass, low metallicity, and deficient HI gas, Haro 11 could be an analog of high-$z$ dwarf starburst and the potential progenitor of the nearby less massive elliptical galaxies. The significantly smaller turbulent pressure and viral parameter will probably trigger the intense starbursts. We also predict that it will quench at $M_* < 8.5 \times 10^9 M_{\odot}$.

preprint2022arXiv

The Value of Information in Stopping Problems

We consider stopping problems in which a decision maker (DM) faces an unknown state of nature and decides sequentially whether to stop and take an irreversible action; pay a fee and obtain additional information; or wait without acquiring information. We discuss the value and quality of information. The former is the maximal discounted expected revenue the DM can generate. We show that among all history-dependent fee schemes, the upfront scheme (as opposed, for instance, to pay-for-use) is optimal: it generates the highest possible value of information. The effects on the optimal strategy of obtaining information from a more accurate source and of having a higher discount factor are distinct, as far as expected stopping time and its distribution are concerned. However, these factors have a similar effect in that they both enlarge the set of cases in which the optimal strategy prescribes waiting.

preprint2022arXiv

Theoretical and experimental study on Noise Equivalent Power of X-ray semiconductor ultra-fast response material based on the rad-optic effect

Semiconductor material based on the rad-optic effect enables ultra-fast detection of X-rays and plays an important role in fusion diagnostics. Obtaining the accurate noise equivalent power (NEP) of the semiconductor ultrafast response material is the key to detecting X-rays. In this paper, the refractive index change mechanism of the semiconductor under X-ray irradiation was analyzed, and the quantitative relationship between the diffraction efficiency and the X-ray photon energy was established through the LT-AlGaAs diffraction imaging experiments. The impulse responses of LT-AlGaAs under 1 KeV-10 KeV X-ray radiation were calculated, revealing the variation of NEP density with radiated photon energy. In the case of bombarding the Al target to generate 1.5 KeV X-rays, the imaging experiments of LT-AlGaAs were performed. The diffraction image of LT-AlGaAs has a linear relationship with the radiation intensity, and the NEP density of LT-AlGaAs reaches 4.80*105W/cm2. This study has reference significance for the development of ultra-fast X-ray imaging systems based on the rad-optic effect.

preprint2022arXiv

Topological and Algebraic Structures of Atanassov&#39;s Intuitionistic Fuzzy-Values Space

We prove that the space of intuitionistic fuzzy values (IFVs) with a linear order based on a score function and an accuracy function has the same algebraic structure as the one induced by a linear order based on a similarity function and an accuracy function. By introducing a new operator for IFVs via the linear order based on a score function and an accuracy function, we show that such an operator is a strong negation on IFVs. Moreover, we observe that the space of IFVs is a complete lattice and a Kleene algebra with the new operator. We also demonstrate that the topological space of IFVs with the order topology induced by the above two linear orders is not separable and metrizable but compact and connected. From some new perspectives,our results partially answer three open problems posed by Atanassov [Intuitionistic Fuzzy Sets: Theory and Applications, Springer, 1999] and [On Intuitionistic Fuzzy Sets Theory, Springer, 2012]. Furthermore, we construct an isomorphism between the spaces of IFVs and q-rung orthopedic fuzzy values (q-ROFVs) under the corresponding linear orders. To this end, we introduce the concept of admissible similarity measures with particular orders for IFSs, extending the existing definition of the similarity measure for IFSs, and construct an admissible similarity measure with a linear order based on a score function and an accuracy function, which is effectively applied to a pattern recognition problem about the classification of building materials.

preprint2022arXiv

Uncertainty-based Network for Few-shot Image Classification

The transductive inference is an effective technique in the few-shot learning task, where query sets update prototypes to improve themselves. However, these methods optimize the model by considering only the classification scores of the query instances as confidence while ignoring the uncertainty of these classification scores. In this paper, we propose a novel method called Uncertainty-Based Network, which models the uncertainty of classification results with the help of mutual information. Specifically, we first data augment and classify the query instance and calculate the mutual information of these classification scores. Then, mutual information is used as uncertainty to assign weights to classification scores, and the iterative update strategy based on classification scores and uncertainties assigns the optimal weights to query instances in prototype optimization. Extensive results on four benchmarks show that Uncertainty-Based Network achieves comparable performance in classification accuracy compared to state-of-the-art method.

preprint2022arXiv

Uniqueness, symmetry and convergence of positive ground state solutions of the Choquard type equation on a ball

This paper is concerned with the qualitative properties of the positive ground state solutions to the nonlocal Choquard type equation on a ball $B_R$. First, we prove the radial symmetry of the positive ground state solutions by using Talenti&#39;s inequality. Next we develop Newton&#39;s Theorem and then resort to the contraction mapping principle to establish the uniqueness of the positive ground state solutions. Finally, by constructing cut-off functions and applying energy comparison method, we show the convergence of the positive ground state solutions as $R\to \infty$. Our results generalize and improve the existing ones in the literature.

preprint2022arXiv

Using the Optical--NIR Spectral Energy Distributions To Search for the Evidence of Dust Formation of 66 Supernovae

In this paper, we searched for the dust formation evidence of 66 supernovae (SNe) by using the blackbody model and the blackbody plus dust {emission} model to fit their early$-$time optical$-$near infrared (NIR) spectral energy distributions (SEDs). We find that, while the blackbody model can fit most SEDs of the SNe in our sample, the model cannot fit the SEDs of some SNe, in which the SEDs of 2 SNe (SNe~2010bq and 2012ca) show NIR excesses which can be attributed to the emission from the heated dust. We use blackbody plus dust emission model to fit the SEDs showing NIR excesses, finding that both graphite and silicate dust models can fit the SEDs, and the graphite model get reasonable temperatures or better fits. Assuming that the dust is graphite, the best-fitting temperatures (masses) of the dust of the SNe~2010bq and 2012ca are $\sim 1300-1800$ K ($\sim 0.1-3.4 \times 10^{-4}$ M$_\odot$) and $\sim 600-1000$ K ($\sim 0.6-7.5 \times 10^{-3}$ M$_\odot$), respectively. We compare the vaporization radii and the blackbody radii of the dust shells of the 2 SNe with the upper limits of the ejecta radii of the SNe at the first epochs, and demonstrate that the NIR excesses of the SEDs of the 2 SNe might be caused by the pre-existing dust.

preprint2022arXiv

Variable degeneracy of graphs with restricted structures

Bernshteyn and Lee defined a new notion, weak degeneracy, which is slightly weaker than the ordinary degeneracy. It is proved that strictly $f$-degenerate transversal is a common generalization of list coloring, $L$-forested-coloring and DP-coloring. In this paper, we consider three classes of graphs, including planar graphs without any configuration in Fig. 2, toroidal graphs without any configuration in Fig. 5, and planar graphs without intersecting $5$-cycles. We give structural results for each class of graphs, and prove each structure is reducible for weakly $3$-degenerate and the existence of strictly $f$-degenerate transversals. As consequences, these three classes of graphs are weakly $3$-degenerate, and have a strictly $f$-degenerate transversal. Then these three classes of graph have DP-paint number at most four, and have list vertex arboricity at most two. This greatly improve all the results in [2-4, 11-13, 16-18, 22, 25, 32, 34]. Furthermore, the first and the third classes of graphs have Alon-Tarsi number at most four.

preprint2021arXiv

A census of optically dark massive galaxies in the early Universe from magnification by lensing galaxy clusters

We present ALMA 870um and JCMT SCUBA2 850um dust continuum observations of a sample of optically dark and strongly lensed galaxies in the cluster fields. The ALMA and SCUBA2 observations reach a median rms of about 0.11 mJy and 0.44 mJy, respectively, with the latter close to the confusion limit of the data at 850um. This represents one of the most sensitive searches for dust emission in optically dark galaxies. We detect the dust emission in 12 out of 15 galaxies at >3.8 sigma, corresponding to a detection rate of 80 per cent. Thanks to the gravitational lensing, our observations reach a deeper limiting flux than previous surveys in blank fields by a factor of 3. We estimate delensed infrared luminosities in the range log(LIR)=11.5-12.7 Lsun, which correspond to dust-obscured star formation rates (SFRs) of 30 to 520 Msun per year. Stellar population fits to the optical-to-NIR photometric data yield a median redshift z=4.26 and de-lensed stellar mass log(Mstar)=10.78 Msun. They contribute a lensing-corrected star-formation rate density at least an order of magnitude higher than that of equivalently massive UV-selected galaxies at z>3. The results suggest that there is a missing population of massive star-forming galaxies in the early Universe, which may dominate the SFR density at the massive end. Five optically dark galaxies are located within r<50 arcsec in one cluster field, representing a potential overdensity structure that has a physical origin at a confidence level >99.974% from Poisson statistics. Follow-up spectroscopic observations with ALMA and JWST are crucial to confirm whether it is associated with a protocluster at similar redshifts.

preprint2021arXiv

A duality of scaffolds for translation association schemes

Scaffolds are certain tensors arising in the study of association schemes, and have been (implicitly) understood diagrammatically as digraphs with distinguished &#34;root&#34; nodes and with matrix edge weights, often taken from Bose-Mesner algebras. In this paper, we first present a slight modification of Martin&#39;s conjecture (2021) concerning a duality of scaffolds whose digraphs are embedded in a closed disk in the plane with root nodes all lying on the boundary circle, and then show that this modified conjecture holds true if we restrict ourselves to the class of translation association schemes, i.e., those association schemes that admit abelian regular automorphism groups.

preprint2021arXiv

A Universal Model for Cross Modality Mapping by Relational Reasoning

With the aim of matching a pair of instances from two different modalities, cross modality mapping has attracted growing attention in the computer vision community. Existing methods usually formulate the mapping function as the similarity measure between the pair of instance features, which are embedded to a common space. However, we observe that the relationships among the instances within a single modality (intra relations) and those between the pair of heterogeneous instances (inter relations) are insufficiently explored in previous approaches. Motivated by this, we redefine the mapping function with relational reasoning via graph modeling, and further propose a GCN-based Relational Reasoning Network (RR-Net) in which inter and intra relations are efficiently computed to universally resolve the cross modality mapping problem. Concretely, we first construct two kinds of graph, i.e., Intra Graph and Inter Graph, to respectively model intra relations and inter relations. Then RR-Net updates all the node features and edge features in an iterative manner for learning intra and inter relations simultaneously. Last, RR-Net outputs the probabilities over the edges which link a pair of heterogeneous instances to estimate the mapping results. Extensive experiments on three example tasks, i.e., image classification, social recommendation and sound recognition, clearly demonstrate the superiority and universality of our proposed model.

preprint2021arXiv

Attention Models for Point Clouds in Deep Learning: A Survey

Recently, the advancement of 3D point clouds in deep learning has attracted intensive research in different application domains such as computer vision and robotic tasks. However, creating feature representation of robust, discriminative from unordered and irregular point clouds is challenging. In this paper, our ultimate goal is to provide a comprehensive overview of the point clouds feature representation which uses attention models. More than 75+ key contributions in the recent three years are summarized in this survey, including the 3D objective detection, 3D semantic segmentation, 3D pose estimation, point clouds completion etc. We provide a detailed characterization (1) the role of attention mechanisms, (2) the usability of attention models into different tasks, (3) the development trend of key technology.

preprint2021arXiv

Camera-aware Style Separation and Contrastive Learning for Unsupervised Person Re-identification

Unsupervised person re-identification (ReID) is a challenging task without data annotation to guide discriminative learning. Existing methods attempt to solve this problem by clustering extracted embeddings to generate pseudo labels. However, most methods ignore the intra-class gap caused by camera style variance, and some methods are relatively complex and indirect although they try to solve the negative impact of the camera style on feature distribution. To solve this problem, we propose a camera-aware style separation and contrastive learning method (CA-UReID), which directly separates camera styles in the feature space with the designed camera-aware attention module. It can explicitly divide the learnable feature into camera-specific and camera-agnostic parts, reducing the influence of different cameras. Moreover, to further narrow the gap across cameras, we design a camera-aware contrastive center loss to learn more discriminative embedding for each identity. Extensive experiments demonstrate the superiority of our method over the state-of-the-art methods on the unsupervised person ReID task.

preprint2021arXiv

DP-3-coloring of planar graphs without certain cycles

DP-coloring is a generalization of list coloring, which was introduced by Dvořák and Postle [J. Combin. Theory Ser. B 129 (2018) 38--54]. Zhang [Inform. Process. Lett. 113 (9) (2013) 354--356] showed that every planar graph with neither adjacent triangles nor 5-, 6-, 9-cycles is 3-choosable. Liu et al. [Discrete Math. 342 (2019) 178--189] showed that every planar graph without 4-, 5-, 6- and 9-cycles is DP-3-colorable. In this paper, we show that every planar graph with neither adjacent triangles nor 5-, 6-, 9-cycles is DP-3-colorable, which generalizes these results. Yu et al. gave three Bordeaux-type results by showing that (i) every planar graph with the distance of triangles at least three and no 4-, 5-cycles is DP-3-colorable; (ii) every planar graph with the distance of triangles at least two and no 4-, 5-, 6-cycles is DP-3-colorable; (iii) every planar graph with the distance of triangles at least two and no 5-, 6-, 7-cycles is DP-3-colorable. We also give two Bordeaux-type results in the last section: (i) every plane graph with neither 5-, 6-, 8-cycles nor triangles at distance less than two is DP-3-colorable; (ii) every plane graph with neither 4-, 5-, 7-cycles nor triangles at distance less than two is DP-3-colorable.

preprint2021arXiv

DP-4-coloring of planar graphs with some restrictions on cycles

DP-coloring was introduced by Dvořák and Postle as a generalization of list coloring. It was originally used to solve a longstanding conjecture by Borodin, stating that every planar graph without cycles of lengths 4 to 8 is 3-choosable. In this paper, we give three sufficient conditions for a planar graph to be DP-4-colorable. Actually all the results (Theorem 1.3, 1.4 and 1.7) are stated in the ``color extendability&#39;&#39; form, and uniformly proved by vertex identification and discharging method.

preprint2021arXiv

Magnetic Interactions Between Radical Pairs in Chiral Graphene Nanoribbons

Magnetic graphene nanoribbons (GNRs) have become promising candidates for future applications, including quantum technologies. Here, we characterize magnetic states hosted by chiral graphene nanoribbons (chGNRs). The substitution of a hydrogen atom at the chGNR edge by a ketone group effectively adds one p_z electron to the π-electron network, thus producing an unpaired π radical. A closely related scenario occurs for regular ketone-functionalized chGNRs in which one oxygen atom is missing. Two such radical states can interact via exchange coupling and we study those interactions as a function of their relative position, which includes a remarkable dependence on the chirality, as well as on the nature of the surrounding GNR, i.e., with or without ketone functionalization. In addition, we determine the parameters whereby this type of systems with oxygen heteroatoms can be adequately described within the widely used mean-field Hubbard model. Altogether, we provide new insights to both theoretically model and devise GNR-based nanostructures with tunable magnetic properties.

preprint2021arXiv

Mobility-Aware Seamless Handover with MPTCP in Software-Defined HetNets

In this paper, the problem of vertical handover in software-defined network (SDN) based heterogeneous networks (HetNets) is studied. In the studied model, HetNets are required to offer diverse services for mobile users. Using an SDN controller, HetNets have the capability of managing users&#39; access and mobility issues but still have the problems of ping-pong effect and service interruption during vertical handover. To solve these problems, a mobility-aware seamless handover method based on multipath transmission control protocol (MPTCP) is proposed. The proposed handover method is executed in the controller of the software-defined HetNets (SDHetNets) and consists of three steps: location prediction, network selection, and handover execution. In particular, the method first predicts the user&#39;s location in the next moment with an echo state network (ESN). Given the predicted location, the SDHetNet controller can determine the candidate network set for the handover to pre-allocate network wireless resources. Second, the target network is selected through fuzzy analytic hierarchical process (FAHP) algorithm, jointly considering user preferences, service requirements, network attributes, and user mobility patterns. Then, seamless handover is realized through the proposed MPTCP-based handover mechanism. Simulations using real-world user trajectory data from Korea Advanced Institute of Science & Technology show that the proposed method can reduce the handover times by 10.85% to 29.12% compared with traditional methods. The proposed method also maintains at least one MPTCP subflow connected during the handover process and achieves a seamless handover.

preprint2021arXiv

Rapid Robust Principal Component Analysis: CUR Accelerated Inexact Low Rank Estimation

Robust principal component analysis (RPCA) is a widely used tool for dimension reduction. In this work, we propose a novel non-convex algorithm, coined Iterated Robust CUR (IRCUR), for solving RPCA problems, which dramatically improves the computational efficiency in comparison with the existing algorithms. IRCUR achieves this acceleration by employing CUR decomposition when updating the low rank component, which allows us to obtain an accurate low rank approximation via only three small submatrices. Consequently, IRCUR is able to process only the small submatrices and avoid expensive computing on the full matrix through the entire algorithm. Numerical experiments establish the computational advantage of IRCUR over the state-of-art algorithms on both synthetic and real-world datasets.

preprint2021arXiv

Revisiting Knowledge Distillation via Label Smoothing Regularization

Knowledge Distillation (KD) aims to distill the knowledge of a cumbersome teacher model into a lightweight student model. Its success is generally attributed to the privileged information on similarities among categories provided by the teacher model, and in this sense, only strong teacher models are deployed to teach weaker students in practice. In this work, we challenge this common belief by following experimental observations: 1) beyond the acknowledgment that the teacher can improve the student, the student can also enhance the teacher significantly by reversing the KD procedure; 2) a poorly-trained teacher with much lower accuracy than the student can still improve the latter significantly. To explain these observations, we provide a theoretical analysis of the relationships between KD and label smoothing regularization. We prove that 1) KD is a type of learned label smoothing regularization and 2) label smoothing regularization provides a virtual teacher model for KD. From these results, we argue that the success of KD is not fully due to the similarity information between categories from teachers, but also to the regularization of soft targets, which is equally or even more important. Based on these analyses, we further propose a novel Teacher-free Knowledge Distillation (Tf-KD) framework, where a student model learns from itself or manuallydesigned regularization distribution. The Tf-KD achieves comparable performance with normal KD from a superior teacher, which is well applied when a stronger teacher model is unavailable. Meanwhile, Tf-KD is generic and can be directly deployed for training deep neural networks. Without any extra computation cost, Tf-KD achieves up to 0.65\% improvement on ImageNet over well-established baseline models, which is superior to label smoothing regularization.

preprint2021arXiv

You Recommend, I Buy: How and Why People Engage in Instant Messaging Based Social Commerce

As an emerging business phenomenon especially in China, instant messaging (IM) based social commerce is growing increasingly popular, attracting hundreds of millions of users and is becoming one important way where people make everyday purchases. Such platforms embed shopping experiences within IM apps, e.g., WeChat, WhatsApp, where real-world friends post and recommend products from the platforms in IM group chats and quite often form lasting recommending/buying relationships. How and why do users engage in IM based social commerce? Do such platforms create novel experiences that are distinct from prior commerce? And do these platforms bring changes to user social lives and relationships? To shed light on these questions, we launched a qualitative study where we carried out semi-structured interviews on 12 instant messaging based social commerce users in China. We showed that IM based social commerce: 1) enables more reachable, cost-reducing, and immersive user shopping experience, 2) shapes user decision-making process in shopping through pre-existing social relationship, mutual trust, shared identity, and community norm, and 3) creates novel social interactions, which can contribute to new tie formation while maintaining existing social relationships. We demonstrate that all these unique aspects link closely to the characteristics of IM platforms, as well as the coupling of user social and economic lives under such business model. Our study provides important research and design implications for social commerce, and decentralized, trusted socio-technical systems in general.

preprint2020arXiv

A Multi-Level Approach to Waste Object Segmentation

We address the problem of localizing waste objects from a color image and an optional depth image, which is a key perception component for robotic interaction with such objects. Specifically, our method integrates the intensity and depth information at multiple levels of spatial granularity. Firstly, a scene-level deep network produces an initial coarse segmentation, based on which we select a few potential object regions to zoom in and perform fine segmentation. The results of the above steps are further integrated into a densely connected conditional random field that learns to respect the appearance, depth, and spatial affinities with pixel-level accuracy. In addition, we create a new RGBD waste object segmentation dataset, MJU-Waste, that is made public to facilitate future research in this area. The efficacy of our method is validated on both MJU-Waste and the Trash Annotation in Context (TACO) dataset.

preprint2020arXiv

A multiple attributes image quality database for smartphone camera photo quality assessment

Smartphone is the superstar product in digital device market and the quality of smartphone camera photos (SCPs) is becoming one of the dominant considerations when consumers purchase smartphones. How to evaluate the quality of smartphone cameras and the taken photos is urgent issue to be solved. To bridge the gap between academic research accomplishment and industrial needs, in this paper, we establish a new Smartphone Camera Photo Quality Database (SCPQD2020) including 1800 images with 120 scenes taken by 15 smartphones. Exposure, color, noise and texture which are four dominant factors influencing the quality of SCP are evaluated in the subjective study, respectively. Ten popular no-reference (NR) image quality assessment (IQA) algorithms are tested and analyzed on our database. Experimental results demonstrate that the current objective models are not suitable for SCPs, and quality metrics having high correlation with human visual perception are highly needed.

preprint2020arXiv

A system to test 2D optoelectronic devices in high vacuum

The exploration of electronic and optoelectronic properties of two-dimensional (2D) materials has become one of the most attractive line of research since the isolation of graphene. Such &#39;all-surface materials&#39; present a strong sensitivity to environmental conditions and thus characterization of the devices based on these materials usually requires measurement systems operating in high-vacuum. However, conventional optoelectronic probe-station testing systems are are not compatible with high vacuum operation and vacuum-compatible versions are rather expensive. Here, we present a high-vacuum system specifically designed to test electronic and optoelectronic devices based on 2D materials. This system can be implemented with low budget and it is mostly based on the assembly of commercially available standard vacuum and optic components. Despite the simplicity of this system we demonstrate full capabilities to characterize optoelectronic devices in a broad range of wavelengths with fast pumping/venting speed and possibility of modulating the device temperature (room temperature to ~150deg).

preprint2020arXiv

A Tamm Plasmon-Porous GaN Distributed Bragg Reflector Cavity

This paper reports on design, measurement and optimisation of a Tamm plasmon metal-DBR cavity for use in the green part of the visible spectrum. It uses an optimised silver layer thickness and a porous DBR created using a novel electro-chemical etching technique. This device has applications in low cost lasers, photodetectors and photoconductive switches for the visible wavelength range.

preprint2020arXiv

ALMA Deep Field in SSA22: A near-infrared-dark submillimeter galaxy at z=4.0

Deep surveys with Atacama Large Millimeter Array (ALMA) have uncovered a population of dusty star-forming galaxies which are faint or even undetected at optical to near-infrared wavelengths. Their faintness at short wavelengths makes detailed characterization of the population challenging. Here we present a spectroscopic redshift identification and characterization of one of such near-infrared-dark galaxy discovered by an ALMA deep survey. Detection of [CI](1-0) and CO(4-3) emission lines determines the precise redshift of the galaxy, ADF22.A2, to be z=3.9913+/-0.0008. On the basis of multi-wavelength analysis, ADF22.A2 is found to be a massive, star-forming galaxy with stellar mass Mstar = $1.1_{-0.6}^{+1.3}$ x 10^{11} Msun and SFR = $430_{-150}^{+230}$ Msun/yr. The molecular gas mass is derived to be M ($H_2$) = 5.9 +/- 1.5x10^{10} Msun, indicating a gas fraction of ~35%, and the ratios of $L_{\rm [CI](1-0)}/L_{\rm IR}$ and $L_{\rm [CI](1-0)}/L_{\rm CO(4-3)}$ suggests that the nature of the interstellar medium in ADF22.A2 is in accordance with those of other bright submillimeter galaxies. The properties of ADF22.A2, including redshift, star-formation rate, stellar mass, and depletion time scale (tau~0.1-0.2 Gyr), also suggest that ADF22.A2 has the characteristics expected for the progenitors of quiescent galaxies at z>3. Our results demonstrate the power of ALMA contiguous mapping and line scan to obtain an unbiased view of galaxy formation in the early Universe.

preprint2020arXiv

An inexpensive system for the deterministic transfer of 2D materials

The development of systems for the deterministic transfer of two-dimensional (2D) materials have undoubtedly contributed to a great advance in the 2D materials research. In fact, they have made it possible to fabricate van der Waals heterostructures and 2D materials-based devices with complex architectures. Nonetheless, as far as we know, the amount of papers in the literature providing enough details to reproduce these systems by other research groups is very scarce in the literature. Moreover, these systems typically require the use of expensive optical and mechanical components hampering their applicability in research groups with low budget. Here we demonstrate how a deterministic placement system for 2D materials set up with full capabilities can be implemented under 900 Eur which can be easily implemented in low budget labs and educational labs.

preprint2020arXiv

An unnoticed side effect of electric vehicles

We illustrate that the electrification of our transport system might impose unnecessary extra congestion and delay for daily commuting passengers. By modelling travel behaviors of these passengers, it is found that more of them tend to depart at a narrower peak-hour time window. The occurrence of this shift is mainly caused by (1) the energy consumption of electric vehicles (EVs) is much lower than that of traditional vehicles and (2) the energy consumption of EVs is less sensitive to congestion than that of traditional vehicles. We further examine the role of congestion toll in minimizing the extra congestion and delay.

preprint2020arXiv

Automatic low-bit hybrid quantization of neural networks through meta learning

Model quantization is a widely used technique to compress and accelerate deep neural network (DNN) inference, especially when deploying to edge or IoT devices with limited computation capacity and power consumption budget. The uniform bit width quantization across all the layers is usually sub-optimal and the exploration of hybrid quantization for different layers is vital for efficient deep compression. In this paper, we employ the meta learning method to automatically realize low-bit hybrid quantization of neural networks. A MetaQuantNet, together with a Quantization function, are trained to generate the quantized weights for the target DNN. Then, we apply a genetic algorithm to search the best hybrid quantization policy that meets compression constraints. With the best searched quantization policy, we subsequently retrain or finetune to further improve the performance of the quantized target network. Extensive experiments demonstrate the performance of searched hybrid quantization scheme surpass that of uniform bitwidth counterpart. Compared to the existing reinforcement learning (RL) based hybrid quantization search approach that relies on tedious explorations, our meta learning approach is more efficient and effective for any compression requirements since the MetaQuantNet only needs be trained once.

preprint2020arXiv

Central Similarity Quantization for Efficient Image and Video Retrieval

Existing data-dependent hashing methods usually learn hash functions from pairwise or triplet data relationships, which only capture the data similarity locally, and often suffer from low learning efficiency and low collision rate. In this work, we propose a new \emph{global} similarity metric, termed as \emph{central similarity}, with which the hash codes of similar data pairs are encouraged to approach a common center and those for dissimilar pairs to converge to different centers, to improve hash learning efficiency and retrieval accuracy. We principally formulate the computation of the proposed central similarity metric by introducing a new concept, i.e., \emph{hash center} that refers to a set of data points scattered in the Hamming space with a sufficient mutual distance between each other. We then provide an efficient method to construct well separated hash centers by leveraging the Hadamard matrix and Bernoulli distributions. Finally, we propose the Central Similarity Quantization (CSQ) that optimizes the central similarity between data points w.r.t.\ their hash centers instead of optimizing the local similarity. CSQ is generic and applicable to both image and video hashing scenarios. Extensive experiments on large-scale image and video retrieval tasks demonstrate that CSQ can generate cohesive hash codes for similar data pairs and dispersed hash codes for dissimilar pairs, achieving a noticeable boost in retrieval performance, i.e. 3\%-20\% in mAP over the previous state-of-the-arts. The code is at: \url{https://github.com/yuanli2333/Hadamard-Matrix-for-hashing}

preprint2020arXiv

Classification Calibration for Long-tail Instance Segmentation

Remarkable progress has been made in object instance detection and segmentation in recent years. However, existing state-of-the-art methods are mostly evaluated with fairly balanced and class-limited benchmarks, such as Microsoft COCO dataset [8]. In this report, we investigate the performance drop phenomenon of state-of-the-art two-stage instance segmentation models when processing extreme long-tail training data based on the LVIS [5] dataset, and find a major cause is the inaccurate classification of object proposals. Based on this observation, we propose to calibrate the prediction of classification head to improve recognition performance for the tail classes. Without much additional cost and modification of the detection model architecture, our calibration method improves the performance of the baseline by a large margin on the tail classes. Codes will be available. Importantly, after the submission, we find significant improvement can be further achieved by modifying the calibration head, which we will update later.

preprint2020arXiv

CTM: Collaborative Temporal Modeling for Action Recognition

With the rapid development of digital multimedia, video understanding has become an important field. For action recognition, temporal dimension plays an important role, and this is quite different from image recognition. In order to learn powerful feature of videos, we propose a Collaborative Temporal Modeling (CTM) block (Figure 1) to learn temporal information for action recognition. Besides a parameter-free identity shortcut, as a separate temporal modeling block, CTM includes two collaborative paths: a spatial-aware temporal modeling path, which we propose the Temporal-Channel Convolution Module (TCCM) with unshared parameters for each spatial position (H*W) to build, and a spatial-unaware temporal modeling path. CTM blocks can seamlessly be inserted into many popular networks to generate CTM Networks and bring the capability of learning temporal information to 2D CNN backbone networks, which only capture spatial information. Experiments on several popular action recognition datasets demonstrate that CTM blocks bring the performance improvements on 2D CNN baselines, and our method achieves the competitive results against the state-of-the-art methods. Code will be made publicly available.

preprint2020arXiv

Deep Learning based OTDOA Positioning for NB-IoT Communication Systems

Positioning is becoming a key component in many Internet of Things (IoT) applications. The main challenges and limitations are the narrow bandwidth, low power and low cost which reduces the accuracy of the time of arrival (TOA) estimation. In this paper, we consider the positioning scenario of Narrowband IoT (NB-IoT) that can benefit from observed time difference of arrival (OTDOA). By applying the deep learning based technique, we explore the generalization and feature extraction abilities of neural networks to tackle the aforementioned challenges. As demonstrated in the numerical experiments, the proposed algorithm can be used in different inter-site distance situations and results in a 15% and 50% positioning accuracy improvement compared with Gauss-Newton method in line-of-sight (LOS) scenario and non-line-of-sight (NLOS) scenario respectively.

preprint2020arXiv

Double Circulant Self-dual Codes on Sextic Cyclotomy

This paper contributes to construct double circulant self-dual codes by sextic cyclotomy. Generator matrixes of a family of pure double circulant codes and a family of double circulant codes with boundary are formed from sextic cyclotomic classes. These codes are proved to be self-dual on certain conditions. Moreover, experiments show that some of them are optimal self-dual codes or best than ever codes over GF}(2) and GF}(4).

preprint2020arXiv

Dynamics of reversed shear Alfvén eigenmode and energetic particles during current ramp-up

Hybrid MHD-gyrokinetic code simulations are used to investigate the dynamics of frequency sweeping reversed shear Alfvén eigenmode (RSAE) strongly driven by energetic particles (EPs) during plasma current ramp-up in a conventional tokamak configuration. A series of weakly reversed shear equilibria representing time slices of long timescale MHD equilibrium evolution is considered, where the self-consistent RSAE-EP resonant interactions on the short timescale are analyzed in detail. Both linear and nonlinear RSAE dynamics are shown to be subject to the non-perturbative effect of EPs by maximizing wave-EP power transfer. In linear stage, EPs induce evident mode structure and frequency shifts; meanwhile, RSAE saturates by radial decoupling with resonant EPs due to weak magnetic shear, and gives rise to global EP convective transport and non-adiabatic frequency chirping. The spatiotemporal scales of phase space wave-EP interactions are characterized by the perpendicular wavelength and wave-particle trapping time. The simulations provide insights into general as well as specific features of RSAE spectra and EP transport from experimental observations, and illustrate the fundamental physics of wave-EP resonant interaction with the interplay of magnetic geometry, plasma non-uniformity and non-perturbative EPs.

preprint2020arXiv

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

In the past few decades, to reduce the risk of X-ray in computed tomography (CT), low-dose CT image denoising has attracted extensive attention from researchers, which has become an important research issue in the field of medical images. In recent years, with the rapid development of deep learning technology, many algorithms have emerged to apply convolutional neural networks to this task, achieving promising results. However, there are still some problems such as low denoising efficiency, over-smoothed result, etc. In this paper, we propose the Edge enhancement based Densely connected Convolutional Neural Network (EDCNN). In our network, we design an edge enhancement module using the proposed novel trainable Sobel convolution. Based on this module, we construct a model with dense connections to fuse the extracted edge information and realize end-to-end image denoising. Besides, when training the model, we introduce a compound loss that combines MSE loss and multi-scales perceptual loss to solve the over-smoothed problem and attain a marked improvement in image quality after denoising. Compared with the existing low-dose CT image denoising algorithms, our proposed model has a better performance in preserving details and suppressing noise.

preprint2020arXiv

Electron confinement by laser-driven azimuthal magnetic fields during direct laser acceleration

A laser-driven azimuthal plasma magnetic field is known to facilitate electron energy gain from the irradiating laser pulse. The enhancement is due to changes in the orientation between the laser electric field and electron velocity caused by magnetic field deflections. Transverse electron confinement is critical for realizing this concept experimentally. We find that the phase velocity of the laser pulse has a profound impact on the transverse size of electron trajectories. The transverse size remains constant below a threshold energy that depends on the degree of the superluminosity and it increases with the electron energy above the threshold. This increase can cause electron losses in tightly focused laser pulses. We show using 3D particle-in-cell simulations that the electron energy gain can be significantly increased by increasing laser power at fixed intensity due to the increased electron confinement. This finding makes a strong case for designing experiments at multi-PW laser facilities.

preprint2020arXiv

Ferromagnetic Gyroscopes for Tests of Fundamental Physics

A ferromagnetic gyroscope (FG) is a ferromagnet whose angular momentum is dominated by electron spin polarization and that will precess under the action of an external torque, such as that due to a magnetic field. Here we model and analyze FG dynamics and sensitivity, focusing on practical schemes for experimental realization. In the case of a freely floating FG, we model the transition from dynamics dominated by libration in relatively high externally applied magnetic fields, to those dominated by precession at relatively low applied fields. Measurement of the libration frequency enables in situ measurement of the magnetic field and a technique to reduce the field below the threshold for which precession dominates the FG dynamics. We note that evidence of gyroscopic behavior is present even at magnetic fields much larger than the threshold field below which precession dominates. We also model the dynamics of an FG levitated above a type-I superconductor via the Meissner effect, and find that for FGs with dimensions larger than about 100 nm the observed precession frequency is reduced compared to that of a freely floating FG. This is akin to negative feedback that arises from the distortion of the field from the FG by the superconductor. Finally we assess the sensitivity of an FG levitated above a type-I superconductor to exotic spin-dependent interactions under practical experimental conditions, demonstrating the potential of FGs for tests of fundamental physics.

preprint2020arXiv

Finding Action Tubes with a Sparse-to-Dense Framework

The task of spatial-temporal action detection has attracted increasing attention among researchers. Existing dominant methods solve this problem by relying on short-term information and dense serial-wise detection on each individual frames or clips. Despite their effectiveness, these methods showed inadequate use of long-term information and are prone to inefficiency. In this paper, we propose for the first time, an efficient framework that generates action tube proposals from video streams with a single forward pass in a sparse-to-dense manner. There are two key characteristics in this framework: (1) Both long-term and short-term sampled information are explicitly utilized in our spatiotemporal network, (2) A new dynamic feature sampling module (DTS) is designed to effectively approximate the tube output while keeping the system tractable. We evaluate the efficacy of our model on the UCF101-24, JHMDB-21 and UCFSports benchmark datasets, achieving promising results that are competitive to state-of-the-art methods. The proposed sparse-to-dense strategy rendered our framework about 7.6 times more efficient than the nearest competitor.

preprint2020arXiv

Giant piezoresistive effect and strong band gap tunability in ultrathin InSe upon biaxial strain

The ultrathin nature and dangling bonds free surface of two-dimensional (2D) semiconductors allow for significant modifications of their band gap through strain engineering. Here, thin InSe photodetector devices are biaxially stretched, finding, a strong band gap tunability upon strain. The applied biaxial strain is controlled through the substrate expansion upon temperature increase and the effective strain transfer from the substrate to the thin InSe is confirmed by Raman spectroscopy. The band gap change upon biaxial strain is determined through photoluminescence measurements, finding a gauge factor of up to ~200 meV/%. We further characterize the effect of biaxial strain on the electrical properties of the InSe devices. In the dark state, a large increase of the current is observed upon applied strain which gives a piezoresistive gauge factor value of ~450-1000, ~5-12 times larger than that of other 2D materials and of state-of-the-art silicon strain gauges. Moreover, the biaxial strain tuning of the InSe band gap also translates in a strain-induced redshift of the spectral response of our InSe photodetectors with ΔEcut-off ~173 meV at a rate of ~360 meV/% of strain, indicating a strong strain tunability of the spectral bandwidth of the photodetectors.

preprint2020arXiv

Gravity Probe Spin: Prospects for measuring general-relativistic precession of intrinsic spin using a ferromagnetic gyroscope

An experimental test at the intersection of quantum physics and general relativity is proposed: measurement of relativistic frame dragging and geodetic precession using intrinsic spin of electrons. The behavior of intrinsic spin in spacetime dragged and warped by a massive rotating body is an experimentally open question, hence the results of such a measurement could have important theoretical consequences. Such a measurement is possible by using mm-scale ferromagnetic gyroscopes in orbit around the Earth. Under conditions where the rotational angular momentum of a ferromagnet is sufficiently small, a ferromagnet&#39;s angular momentum is dominated by atomic electron spins and is predicted to exhibit macroscopic gyroscopic behavior. If such a ferromagnetic gyroscope is sufficiently isolated from the environment, rapid averaging of quantum uncertainty via the spin-lattice interaction enables readout of the ferromagnetic gyroscope dynamics with sufficient sensitivity to measure both the Lense-Thirring (frame dragging) and de Sitter (geodetic precession) effects due to the Earth.

preprint2020arXiv

Growth, charge and thermal transport of flowered graphene

We report on the structural and transport properties of the smallest dislocation loop in graphene, known as a flower defect. First, by means of advanced experimental imaging techniques, we deduce how flower defects are formed during recrystallization of chemical vapor deposited graphene. We propose that the flower defects arise from a bulge type mechanism in which the flower domains are the grains left over by dynamic recrystallisation. Next, in order to evaluate the use of such defects as possible building blocks for all-graphene electronics, we combine multiscale modeling tools to investigate the structure and the electron and phonon transport properties of large monolayer graphene samples with a random distribution of flower defects. For large enough flower densities, we find that electron transport is strongly suppressed while, surprisingly, hole transport remains almost unaffected. These results suggest possible applications of flowered graphene for electron energy filtering. For the same defect densities, phonon transport is reduced by orders of magnitude as elastic scattering by defects becomes dominant. Heat transport by flexural phonons, key in graphene, is largely suppressed even for very low concentrations.

preprint2020arXiv

High Accurate Time-of-Arrival Estimation with Fine-Grained Feature Generation for Internet-of-Things Applications

Conventional schemes often require extra reference signals or more complicated algorithms to improve the time-of-arrival (TOA) estimation accuracy. However, in this letter, we propose to generate fine-grained features from the full band and resource block (RB) based reference signals, and calculate the cross-correlations accordingly to improve the observation resolution as well as the TOA estimation results. Using the spectrogram-like cross-correlation feature map, we apply the machine learning technology with decoupled feature extraction and fitting to understand the variations in the time and frequency domains and project the features directly into TOA results. Through numerical examples, we show that the proposed high accurate TOA estimation with fine-grained feature generation can achieve at least 51% root mean square error (RMSE) improvement in the static propagation environments and 38 ns median TOA estimation errors for multipath fading environments, which is equivalently 36% and 25% improvement if compared with the existing MUSIC and ESPRIT algorithms, respectively.

preprint2020arXiv

InSe Schottky diodes based on van der Waals contacts

Two-dimensional semiconductors are excellent candidates for next-generation electronics and optoelec-tronics thanks to their electrical properties and strong light-matter interaction. To fabricate devices with optimal electrical properties, it is crucial to have both high-quality semiconducting crystals and ideal con-tacts at metal-semiconductor interfaces. Thanks to the mechanical exfoliation of van der Waals crystals, atomically-thin high-quality single-crystals can easily be obtained in a laboratory. However, conventional metal deposition techniques can introduce chemical disorder and metal-induced mid-gap states that induce Fermi level pinning and can degrade the metal-semiconductor interfaces, resulting in poorly performing devices. In this article, we explore the electrical contact characteristics of Au-InSe and graphite-InSe van der Waals contacts, obtained by stacking mechanically exfoliated InSe flakes onto pre-patterned Au or graphite electrodes without the need of lithography or metal deposition. The high quality of the metal-semiconductor interfaces obtained by van der Waals contact allows to fabricate high-quality Schottky di-odes based on the Au-InSe Schottky barrier. Our experimental observation indicates that the contact barrier at the graphite-InSe interface is negligible due to the similar electron affinity of InSe and graphite, while the Au-InSe interfaces are dominated by a large Schottky barrier.

preprint2020arXiv

InSe: a two-dimensional semiconductor with superior flexibility

Two-dimensional Indium Selenide (InSe) has attracted extensive attention recently due to its record-high charge carrier mobility and photoresponsivity in the fields of electronics and optoelectronics. Nevertheless, the mechanical properties of this material in the ultra-thin regime have not been investigated yet. Here, we present our efforts to determine the Young&#39;s modulus of thin InSe (~1-2 layers to ~40 layers) flakes experimentally by using buckling-based methodology. We find that the Young&#39;s modulus has a value of 23.1 +- 5.2 GPa, one of the lowest values reported up to date for crystalline two-dimensional materials. This superior flexibility can be very attractive for different applications, such as strain engineering and flexible electronics.

preprint2020arXiv

iqiyi Submission to ActivityNet Challenge 2019 Kinetics-700 challenge: Hierarchical Group-wise Attention

In this report, the method for the iqiyi submission to the task of ActivityNet 2019 Kinetics-700 challenge is described. Three models are involved in the model ensemble stage: TSN, HG-NL and StNet. We propose the hierarchical group-wise non-local (HG-NL) module for frame-level features aggregation for video classification. The standard non-local (NL) module is effective in aggregating frame-level features on the task of video classification but presents low parameters efficiency and high computational cost. The HG-NL method involves a hierarchical group-wise structure and generates multiple attention maps to enhance performance. Basing on this hierarchical group-wise structure, the proposed method has competitive accuracy, fewer parameters and smaller computational cost than the standard NL. For the task of ActivityNet 2019 Kinetics-700 challenge, after model ensemble, we finally obtain an averaged top-1 and top-5 error percentage 28.444% on the test set.

preprint2020arXiv

Lyapunov-type Conditions for Non-strong Ergodicity of Markov Processes

We present Lyapunov-type conditions for non-strong ergodicity of Markov processes. Some concrete models are discussed including diffusion processes on Riemannian manifolds and Ornstein-Uhlenbeck processes driven by symmetric $α$-stable processes. For SDE driven by $α$-stable process ($α\in (0,2]$) with polynomial drift, the strong ergodicity or not is independent on $α$.

preprint2020arXiv

Making Robots Draw A Vivid Portrait In Two Minutes

Significant progress has been made with artistic robots. However, existing robots fail to produce high-quality portraits in a short time. In this work, we present a drawing robot, which can automatically transfer a facial picture to a vivid portrait, and then draw it on paper within two minutes averagely. At the heart of our system is a novel portrait synthesis algorithm based on deep learning. Innovatively, we employ a self-consistency loss, which makes the algorithm capable of generating continuous and smooth brush-strokes. Besides, we propose a componential sparsity constraint to reduce the number of brush-strokes over insignificant areas. We also implement a local sketch synthesis algorithm, and several pre- and post-processing techniques to deal with the background and details. The portrait produced by our algorithm successfully captures individual characteristics by using a sparse set of continuous brush-strokes. Finally, the portrait is converted to a sequence of trajectories and reproduced by a 3-degree-of-freedom robotic arm. The whole portrait drawing robotic system is named AiSketcher. Extensive experiments show that AiSketcher can produce considerably high-quality sketches for a wide range of pictures, including faces in-the-wild and universal images of arbitrary content. To our best knowledge, AiSketcher is the first portrait drawing robot that uses neural style transfer techniques. AiSketcher has attended a quite number of exhibitions and shown remarkable performance under diverse circumstances.

preprint2020arXiv

Non-thermal vibrations in biased molecular junctions

We study vibrational statistics in current-carrying model molecular junctions using master equation approach. Especially, we concentrate on the validity of using an effective temperature $T_{\rm eff}$ to characterize the nonequilibrium steady state of a vibrational mode. We identify cases where a single $T_{\rm eff}$ can not fully describe one vibrational state. In such cases, the probability distribution among different vibrational states does not follow the Boltzmann type. Consequently, the actual entropy (free energy) of the vibrational mode is lower (higher) than the corresponding thermal value given by $T_{\rm eff}$, indicating extra work can be extracted from these states. Our results will be useful for the study of non-thermal vibrational state in thermodynamics of nanoscale systems, and its usage in nanoscale heat engines.

preprint2020arXiv

Numerical analysis of two Galerkin discretizations with graded temporal grids for fractional evolution equations

Two numerical methods with graded temporal grids are analyzed for fractional evolution equations. One is a low-order discontinuous Galerkin (DG) discretization in the case of fractional order $0<α<1$, and the other one is a low-order Petrov Galerkin (PG) discretization in the case of fractional order $1<α<2$. By a new duality technique, pointwise-in-time error estimates of first-order and $ (3-α) $-order temporal accuracies are respectively derived for DG and PG, under reasonable regularity assumptions on the initial value. Numerical experiments are performed to verify the theoretical results.

preprint2020arXiv

Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

Solving long-tail large vocabulary object detection with deep learning based models is a challenging and demanding task, which is however under-explored.In this work, we provide the first systematic analysis on the underperformance of state-of-the-art models in front of long-tail distribution. We find existing detection methods are unable to model few-shot classes when the dataset is extremely skewed, which can result in classifier imbalance in terms of parameter magnitude. Directly adapting long-tail classification models to detection frameworks can not solve this problem due to the intrinsic difference between detection and classification.In this work, we propose a novel balanced group softmax (BAGS) module for balancing the classifiers within the detection frameworks through group-wise training. It implicitly modulates the training process for the head and tail classes and ensures they are both sufficiently trained, without requiring any extra sampling for the instances from the tail classes.Extensive experiments on the very recent long-tail large vocabulary object recognition benchmark LVIS show that our proposed BAGS significantly improves the performance of detectors with various backbones and frameworks on both object detection and instance segmentation. It beats all state-of-the-art methods transferred from long-tail image classification and establishes new state-of-the-art.Code is available at https://github.com/FishYuLi/BalancedGroupSoftmax.

preprint2020arXiv

Quantum phase transition of the Bose-Hubbard model on cubic lattice with anisotropic hopping

In quantum many-body system, dimensionality plays a critical role on type of the quantum phase transition. In order to study the quantum system during dimensional crossover, we studied the Bose-Hubbard model on cubic lattice with anisotropic hopping by using the high order symbolic strong coupling expansion method. The analytic series expanded boundaries between the Mott-insulator and superfluid phase up to eighth order are calculated. The critical exponents are extracted by Padé re-summation method, which clearly shows the dimensional crossover behavior. Meanwhile, the critical points at commensurate filling can also be obtained, and they match well with the prediction of renormalization group theory. The scaling of the gap energy and whole phase diagram are given at last, and they can be taken as the benchmark for experiment and numerical simulations in the future study.

preprint2020arXiv

SCUBA-2 Ultra Deep Imaging Eao Survey (Studies) III: Multi-wavelength properties, luminosity functions and preliminary source catalog of 450-$μ$m-selected galaxies

We construct a SCUBA-2 450-$μ$m map in the COSMOS field that covers an area of 300 arcmin$^{2}$ and reaches a 1$σ$ noise level of 0.65 mJy in the deepest region. We extract 256 sources detected at 450 $μ$m with signal-to-noise ratio $>$ 4.0 and analyze the physical properties of their multi-wavelength counterparts. We find that most of the sources are at $z\lesssim3$, with a median of $z = 1.79^{+0.03}_{-0.15}$. About $35^{+32}_{-25}$% of our sources are classified as starburst galaxies based on their total star-formation rates (SFRs) and stellar masses ($M_{\ast}$). By fitting the far-infrared spectral energy distributions, we find that our 450-$μ$m-selected sample has a wide range of dust temperatures (20 K $ \lesssim T_{\rm d} \lesssim$ 60 K), with a median of ${T}_{\rm d} = 38.3^{+0.4}_{-0.9}$ K. We do not find a redshift evolution in dust temperature for sources with $L_{\rm IR}$ > $10^{12}$ $\rm L_\odot$ at $z<3$. However, we find a moderate correlation where dust temperature increases with the deviation from the SFR-$M_{\ast}$ relation. The increase in dust temperature also correlates with optical morphology, which is consistent with merger-triggered starbursts in sub-millimeter galaxies. Our galaxies do not show the tight IRX-$β_{\rm UV}$ correlation that has been observed in the local Universe. We construct the infrared luminosity functions of our 450-$μ$m sources and measure their comoving SFR densities. The contribution of the $L_{\rm IR}$ > $10^{12}$ $\rm L_\odot$ population to the SFR density rises dramatically from $z$ = 0 to 2 ($\propto$ ($1+z$)$^{3.9\pm1.1}$) and dominates the total SFR density at $z \gtrsim 2$.

preprint2020arXiv

Structure-Level Knowledge Distillation For Multilingual Sequence Labeling

Multilingual sequence labeling is a task of predicting label sequences using a single unified model for multiple languages. Compared with relying on multiple monolingual models, using a multilingual model has the benefit of a smaller model size, easier in online serving, and generalizability to low-resource languages. However, current multilingual models still underperform individual monolingual models significantly due to model capacity limitations. In this paper, we propose to reduce the gap between monolingual models and the unified multilingual model by distilling the structural knowledge of several monolingual models (teachers) to the unified multilingual model (student). We propose two novel KD methods based on structure-level information: (1) approximately minimizes the distance between the student&#39;s and the teachers&#39; structure level probability distributions, (2) aggregates the structure-level knowledge to local distributions and minimizes the distance between two local probability distributions. Our experiments on 4 multilingual tasks with 25 datasets show that our approaches outperform several strong baselines and have stronger zero-shot generalizability than both the baseline model and teacher models.

preprint2020arXiv

T-square resistivity without Umklapp scattering in dilute metallic Bi$_2$O$_2$Se

The electrical resistivity of Fermi liquids (FLs) displays a quadratic temperature ($T$) dependence because of electron-electron (e-e) scattering. For such collisions to decay the charge current, there are two known mechanisms: inter-band scattering (identified by Baber) and Umklapp events. However, dilute metallic strontium titanate (STO) was found to display $T^2$ resistivity in absence of either of these two mechanisms. The presence of soft phonons and their possible role as scattering centers raised the suspicion that $T$-square resistivity in STO is not due to e-e scattering. Here, we present the case of Bi$_2$O$_2$Se, a layered semiconductor with hard phonons, which becomes a dilute metal with a small single-component Fermi surface upon doping. It displays $T$-square resistivity well below the degeneracy temperature where neither Umklapp nor interband scattering is conceivable. We observe a universal scaling between the prefactor of $T^2$ resistivity and the Fermi energy, which is an extension of the Kadowaki-Woods plot to dilute metals. Our results imply the absence of a satisfactory theoretical basis for the ubiquity of e-e driven $T$-square resistivity in Fermi liquids.

preprint2020arXiv

The role of traps in the photocurrent generation mechanism in thin In-Se photodetectors

Due to the excellent electrical transport properties and optoelectronic performance, thin indium selenide (InSe) has recently attracted attention in the field of 2D semiconducting materials. However, the mechanism behind the photocurrent generation in thin InSe photodetectors remains elusive. Here, we present a set of experiments aimed at explaining the strong scattering in the photoresponsivity values reported in the literature for thin InSe photodetectors. By performing optoelectronic measurements on thin InSe-based photodetectors operated under different environmental conditions we find that the photoresponsivity, the response time and the photocurrent power dependency are strongly correlated in this material. This observation indicates that the photogating effect plays an imporant role for thin InSe flakes, and it is the dominant mechanism in the ultra-high photoresponsivity of pristine InSe devices. In addition, when exposing the pristine InSe photodetectors to the ambient environment we observe a fast and irreversible change in the photoresponse, with a decrease in the photoresponsivity accompanied by an increase of the operating speed. We attribute this photodetector performance change (upon atmospheric exposure) to the decrease in the density of the traps present in InSe, due to the passivation of selenium vacancies by atmospheric oxygen species. This passivation is accompanied by a downward shift of the InSe Fermi level and by a decrease of the Fermi level pinning, which leads to an increase of the Schottky barrier between Au and InSe. Our study reveals the important role of traps induced by defects in tailoring the properties of devices based on 2D materials and offers a controllable route to design and functionalize thin InSe photodetectors to realize devices with either ultrahigh photoresposivity or fast operation speed.

preprint2020arXiv

Thickness identification of thin InSe by optical microscopy methods

Indium selenide (InSe), as a novel van der Waals layered semiconductor, has attracted a large research interest thanks to its excellent optical and electrical properties in the ultra-thin limit. Here, we discuss four different optical methods to quantitatively identify the thickness of thin InSe flakes on various substrates, such as SiO2/Si or transparent polymeric substrates. In the case of thin InSe deposited on a transparent substrate, the transmittance of the flake in the blue region of the visible spectrum can be used to estimate the thickness. For InSe supported by SiO2/Si, the thickness of the flakes can be estimated either by assessing their apparent colors or accurately analyzed using a Fresnel-law based fitting model of the optical contrast spectra. Finally, we also studied the thickness dependency of the InSe photoluminescence emission energy, which provides an additional tool to estimate the InSe thickness and it works both for InSe deposited on SiO2/Si and on a transparent polymeric substrate.

preprint2020arXiv

Variational Prototype Replays for Continual Learning

Continual learning refers to the ability to acquire and transfer knowledge without catastrophically forgetting what was previously learned. In this work, we consider \emph{few-shot} continual learning in classification tasks, and we propose a novel method, Variational Prototype Replays, that efficiently consolidates and recalls previous knowledge to avoid catastrophic forgetting. In each classification task, our method learns a set of variational prototypes with their means and variances, where embedding of the samples from the same class can be represented in a prototypical distribution and class-representative prototypes are separated apart. To alleviate catastrophic forgetting, our method replays one sample per class from previous tasks, and correspondingly matches newly predicted embeddings to their nearest class-representative prototypes stored from previous tasks. Compared with recent continual learning approaches, our method can readily adapt to new tasks with more classes without requiring the addition of new units. Furthermore, our method is more memory efficient since only class-representative prototypes with their means and variances, as well as only one sample per class from previous tasks need to be stored. Without tampering with the performance on initial tasks, our method learns novel concepts given a few training examples of each class in new tasks.

preprint2019arXiv

Asymptotic properties of the plane shear thickening fluids with bounded energy integral

In this note we investigate the asymptotic behavior of plane shear thickening fluids around a bounded obstacle. Different from the Navier-Stokes case considered by Gilbarg-Weinberger in \cite{GW1978}, where the good structure of the vorticity can be exploited and weighted energy estimates can be applied, we have to overcome the nonlinear term of high order. The decay estimates of the velocity was obtained by combining Point-wise Behavior Theorem in \cite{Galdi} and Brezis-Gallouet inequality in \cite{BG1980} together, which is independent of interest.

preprint2019arXiv

Floquet-Induced Superfluidity with Periodically Modulated Interactions of Two-Species Hardcore Bosons in a One-dimensional Optical Lattice

We consider two species of hard-core bosons with density dependent hopping in a one-dimensional optical lattice, for which we propose experimental realizations using time-periodic driving. The quantum phase diagram for half-integer filling is determined by combining different advanced numerical simulations with analytic calculations. We find that a reduction of the density-dependent hopping induces a Mott-insulator to superfluid transition. For negative hopping a previously unknown state is found, where one species induces a gauge phase of the other species, which leads to a superfluid phase of gauge-paired particles. The corresponding experimental signatures are discussed.

preprint2019arXiv

Investigation of spin orbit torque driven dynamics in ferromagnetic heterostructures

We use time-resolved (TR) measurements based on the polar magneto-optical Kerr effect (MOKE) to study the magnetization dynamics excited by spin orbit torques in Py (Permalloy)/Pt and Ta/CoFeB bilayers. The analysis reveals that the field-like (FL) spin orbit torque (SOT) dominates the amplitude of the first oscillation cycle of the magnetization precession and the damping-like (DL) torque determines the final steady-state magnetization. In our bilayer samples, we have extracted the effective fields, hFL and hDL, of the two SOTs from the time-resolved magnetization oscillation spectrum. The extracted values are in good agreement with those extracted from time-integrated DCMOKE measurements, suggesting that the SOTs do not change at high frequencies. We also find that the amplitude ratio of the first oscillation to steady state is linearly proportional to the ratio hFL/hDL. The first oscillation amplitude is inversely proportional to, whereas the steady state value is independent of, the applied external field along the current direction.

preprint2019arXiv

Quantum-to-classical correspondence in two-dimensional Heisenberg models

The quantum-to-classical correspondence (QCC) in spin models is a puzzling phenomenon where the static susceptibility of a quantum system agrees with its classical-system counterpart, at a different corresponding temperature, within the systematic error at a sub-percent level. We employ the bold diagrammatic Monte Carlo method to explore the universality of QCC by considering three different two-dimensional spin-1/2 Heisenberg models. In particular, we reveal the existence of QCC in two-parametric models.

preprint2018arXiv

Minor stars in plane graphs with minimum degree five

The weight of a subgraph $H$ in $G$ is the sum of the degrees in $G$ of vertices of $H$. The {\em height} of a subgraph $H$ in $G$ is the maximum degree of vertices of $H$ in $G$. A star in a given graph is minor if its center has degree at most five in the given graph. Lebesgue (1940) gave an approximate description of minor $5$-stars in the class of normal plane maps with minimum degree five. In this paper, we give two descriptions of minor $5$-stars in plane graphs with minimum degree five. By these descriptions, we can extend several results and give some new results on the weight and height for some special plane graphs with minimum degree five.

preprint2018arXiv

Nonlinear Stability of Relativistic Vortex Sheets in Three-Dimensional Minkowski Spacetime

We are concerned with the nonlinear stability of vortex sheets for the relativistic Euler equations in three-dimensional Minkowski spacetime. This is a nonlinear hyperbolic problem with a characteristic free boundary. In this paper, we introduce a new symmetrization by choosing appropriate functions as primary unknowns. A necessary and sufficient condition for the weakly linear stability of relativistic vortex sheets is obtained by analyzing the roots of the Lopatinski\uı determinant associated to the constant coefficient linearized problem. Under this stability condition, we show that the variable coefficient linearized problem obeys an energy estimate with a loss of derivatives. The construction of certain weight functions plays a crucial role in absorbing error terms caused by microlocalization. Based on the weakly linear stability result, we establish the existence and nonlinear stability of relativistic vortex sheets under small initial perturbations by a Nash--Moser iteration scheme.

preprint2018arXiv

Total coloring of 1-toroidal graphs of maximum degree at least 11 and no adjacent triangles

A {\em total coloring} of a graph $G$ is an assignment of colors to the vertices and the edges of $G$ such that every pair of adjacent/incident elements receive distinct colors. The {\em total chromatic number} of a graph $G$, denoted by $\chiup&#39;&#39;(G)$, is the minimum number of colors in a total coloring of $G$. The well-known Total Coloring Conjecture (TCC) says that every graph with maximum degree $Δ$ admits a total coloring with at most $Δ+ 2$ colors. A graph is {\em $1$-toroidal} if it can be drawn in torus such that every edge crosses at most one other edge. In this paper, we investigate the total coloring of $1$-toroidal graphs, and prove that the TCC holds for the $1$-toroidal graphs with maximum degree at least~$11$ and some restrictions on the triangles. Consequently, if $G$ is a $1$-toroidal graph with maximum degree $Δ$ at least~$11$ and without adjacent triangles, then $G$ admits a total coloring with at most $Δ+ 2$ colors.

preprint2017arXiv

Star 5-edge-colorings of subcubic multigraphs

The star chromatic index of a multigraph $G$, denoted $χ&#39;_{s}(G)$, is the minimum number of colors needed to properly color the edges of $G$ such that no path or cycle of length four is bi-colored. A multigraph $G$ is star $k$-edge-colorable if $χ&#39;_{s}(G)\le k$. Dvořák, Mohar and Šámal [Star chromatic index, J Graph Theory 72 (2013), 313--326] proved that every subcubic multigraph is star $7$-edge-colorable, and conjectured that every subcubic multigraph should be star $6$-edge-colorable. Kerdjoudj, Kostochka and Raspaud considered the list version of this problem for simple graphs and proved that every subcubic graph with maximum average degree less than $7/3$ is star list-$5$-edge-colorable. It is known that a graph with maximum average degree $14/5$ is not necessarily star $5$-edge-colorable. In this paper, we prove that every subcubic multigraph with maximum average degree less than $12/5$ is star $5$-edge-colorable.

preprint2017arXiv

Symmetric flows for compressible heat-conducting fluids with temperature dependent viscosity coefficients

We consider the Navier--Stokes equations for compressible heat-conducting ideal polytropic gases in a bounded annular domain when the viscosity and thermal conductivity coefficients are general smooth functions of temperature. A global-in-time, spherically or cylindrically symmetric, classical solution to the initial boundary value problem is shown to exist uniquely and converge exponentially to the constant state as the time tends to infinity under certain assumptions on the initial data and the adiabatic exponent $γ$. The initial data can be large if $γ$ is sufficiently close to 1. These results are of Nishida--Smoller type and extend the work [Liu et al., SIAM J. Math. Anal. 46 (2014), 2185--2228] restricted to the one-dimensional flows.

preprint2016arXiv

Light subgraphs in graphs with average degree at most four

A graph $H$ is said to be {\em light} in a family $\mathfrak{G}$ of graphs if at least one member of $\mathfrak{G}$ contains a copy of $H$ and there exists an integer $λ(H, \mathfrak{G})$ such that each member $G$ of $\mathfrak{G}$ with a copy of $H$ also has a copy $K$ of $H$ such that $°_{G}(v) \leq λ(H, \mathfrak{G})$ for all $v \in V(K)$. In this paper, we study the light graphs in the class of graphs with small average degree, including the plane graphs with some restrictions on girth.

preprint2016arXiv

One-Dimensional Compressible Heat-Conducting Gas with Temperature-Dependent Viscosity

We consider the one-dimensional compressible Navier--Stokes system for a viscous and heat-conducting ideal polytropic gas when the viscosity $μ$ and the heat conductivity $κ$ depend on the specific volume $v$ and the temperature $θ$ and are both proportional to $h(v)θ^α$ for certain non-degenerate smooth function $h$. We prove the existence and uniqueness of a global-in-time non-vacuum solution to its Cauchy problem under certain assumptions on the parameter $α$ and initial data, which imply that the initial data can be large if $|α|$ is sufficiently small. Our result appears to be the first global existence result for general adiabatic exponent and large initial data when the viscosity coefficient depends on both the density and the temperature.

preprint2015arXiv

Asymptotic stability of wave patterns to compressible viscous and heat-conducting gases in the half space

We study the large-time behavior of solutions to the compressible Navier-Stokes equations for a viscous and heat-conducting ideal polytropic gas in the one-dimensional half-space. A rarefaction wave and its superposition with a non-degenerate stationary solution are shown to be asymptotically stable for the outflow problem with large initial perturbation and general adiabatic exponent.

preprint2015arXiv

Stability of stationary solutions to the outflow problem for full compressible Navier-Stokes equations with large initial perturbation

We investigate the large-time behavior of solutions to an outflow problem of the full compressible Navier-Stokes equations in the half line. The non-degenerate stationary solution is shown to be asymptotically stable under large initial perturbation with no restriction on the adiabatic exponent $γ$, provided that the boundary strength is sufficiently small. The proofs are based on the standard energy method and the crucial step is to obtain positive lower and upper bounds of the density and the temperature uniformly in time and space.

preprint2015arXiv

Strong chromatic index of k-degenerate graphs

A {\em strong edge coloring} of a graph $G$ is a proper edge coloring in which every color class is an induced matching. The {\em strong chromatic index} $\chiup_{s}&#39;(G)$ of a graph $G$ is the minimum number of colors in a strong edge coloring of $G$. In this note, we improve a result by D{\k e}bski \etal [Strong chromatic index of sparse graphs, arXiv:1301.1992v1] and show that the strong chromatic index of a $k$-degenerate graph $G$ is at most $(4k-2) \cdot Δ(G) - 2k^{2} + 1$. As a direct consequence, the strong chromatic index of a $2$-degenerate graph $G$ is at most $6Δ(G) - 7$, which improves the upper bound $10Δ(G) - 10$ by Chang and Narayanan [Strong chromatic index of 2-degenerate graphs, J. Graph Theory 73 (2013) (2) 119--126]. For a special subclass of $2$-degenerate graphs, we obtain a better upper bound, namely if $G$ is a graph such that all of its $3^{+}$-vertices induce a forest, then $\chiup_{s}&#39;(G) \leq 4 Δ(G) -3$; as a corollary, every minimally $2$-connected graph $G$ has strong chromatic index at most $4 Δ(G) - 3$. Moreover, all the results in this note are best possible in some sense.

preprint2014arXiv

Acyclic edge coloring of graphs

An {\em acyclic edge coloring} of a graph $G$ is a proper edge coloring such that the subgraph induced by any two color classes is a linear forest (an acyclic graph with maximum degree at most two). The {\em acyclic chromatic index} $\chiup_{a}&#39;(G)$ of a graph $G$ is the least number of colors needed in an acyclic edge coloring of $G$. Fiamčík (1978) conjectured that $\chiup_{a}&#39;(G) \leq Δ(G) + 2$, where $Δ(G)$ is the maximum degree of $G$. This conjecture is well known as Acyclic Edge Coloring Conjecture (AECC). A graph $G$ with maximum degree at most $κ$ is {\em $κ$-deletion-minimal} if $\chiup_{a}&#39;(G) > κ$ and $\chiup_{a}&#39;(H) \leq κ$ for every proper subgraph $H$ of $G$. The purpose of this paper is to provide many structural lemmas on $κ$-deletion-minimal graphs. By using the structural lemmas, we firstly prove that AECC is true for the graphs with maximum average degree less than four (\autoref{NMAD4}). We secondly prove that AECC is true for the planar graphs without triangles adjacent to cycles of length at most four, with an additional condition that every $5$-cycle has at most three edges contained in triangles (\autoref{NoAdjacent}), from which we can conclude some known results as corollaries. We thirdly prove that every planar graph $G$ without intersecting triangles satisfies $\chiup_{a}&#39;(G) \leq Δ(G) + 3$ (\autoref{NoIntersect}). Finally, we consider one extreme case and prove it: if $G$ is a graph with $Δ(G) \geq 3$ and all the $3^{+}$-vertices are independent, then $\chiup_{a}&#39;(G) = Δ(G)$. We hope the structural lemmas will shed some light on the acyclic edge coloring problems.

preprint2014arXiv

Inflow Problem for the One-dimensional Compressible Navier-Stokes Equations under Large Initial Perturbation

This paper is concerned with the inflow problem for the one-dimensional compressible Navier-Stokes equations. For such a problem, Matsumura and Nishihara showed in [A. Matsumura and K. Nishihara, Large-time behaviors of solutions to an inflow problem in the half space for a one-dimensional system of compressible viscous gas. Comm. Math. Phys. 222 (2001), 449-474] that there exists boundary layer solution to the inflow problem and both the boundary layer solution, the rarefaction wave, and the superposition of boundary layer solution and rarefaction wave are nonlinear stable under small initial perturbation. The main purpose of this paper is to show that similar stability results for the boundary layer solution and the supersonic rarefaction wave still hold for a class of large initial perturbation which can allow the initial density to have large oscillation. The proofs are given by an elementary energy method and the key point is to deduce the desired lower and upper bounds on the density function.

preprint2006arXiv

Factor-Critical Property in 3-Dominating-Critical Graphs

A vertex subset $S$ of a graph $G$ is a dominating set if every vertex of $G$ either belongs to $S$ or is adjacent to a vertex of $S$. The cardinality of a smallest dominating set is called the dominating number of $G$ and is denoted by $γ(G)$. A graph $G$ is said to be $γ$- vertex-critical if $γ(G-v)< γ(G)$, for every vertex $v$ in $G$. Let $G$ be a 2-connected $K_{1,5}$-free 3-vertex-critical graph. For any vertex $v \in V(G)$, we show that $G-v$ has a perfect matching (except two graphs), which is a conjecture posed by Ananchuen and Plummer.