Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
45works
0followers
32topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

45 published item(s)

preprint2026arXiv

Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment

While Graph Foundation Models (GFMs) have achieved remarkable success in homogeneous graphs, extending them to multi-domain heterogeneous graphs (MDHGs) remains a formidable challenge due to cross-type feature shifts and intra-domain relation gaps. Existing global feature alignment methods (PCA or SVD) enforce a shared feature space blindly, which distorts type-specific semantics and disrupts original topologies, inevitably leading to "Type Collapse" and "Relation Confusion". To address these fundamental limitations, we propose Decoupled relation Subspace Alignment (DRSA), a novel, plug-and-play relation-driven alignment framework. DRSA fundamentally shifts the paradigm by decoupling feature semantics from relation structures. Specifically, it introduces a dual-relation subspace projection mechanism to coordinate cross-type interactions within a shared low-rank relation subspace explicitly. Furthermore, a feature-structure decoupled representation is designed to decompose aligned features into a semantic projection component and a structural residual term, adaptively absorbing intra-domain variations. Optimized via a stable alternating minimization strategy based on Block Coordinate Descent, DRSA constructs a well-calibrated, structure-aware latent space. Extensive experiments on multiple real-world benchmark datasets demonstrate that DRSA can be seamlessly integrated as a universal preprocessing module, significantly and consistently enhancing the cross-domain and few-shot knowledge transfer capabilities of state-of-the-art GFMs. The code is available at: https://github.com/zhengziyu77/DSRA.

preprint2024arXiv

Sharp Hardy inequalities involving distance functions from submanifolds of Riemannian manifolds

We establish various Hardy inequalities involving the distance function from submanifolds of Riemannian manifolds, where the natural weights are expressed in terms of bounds of the mean curvature of the submanifold and sectional/Ricci curvature of the ambient Riemannian manifold. Our approach is based on subtle Heintze-Karcher-type Laplace comparisons of the distance function and on a D'Ambrosio-Dipierro-type weak divergence formula for suitable vector fields, providing Barbatis-Filippas-Tertikas-type Hardy inequalities in the curved setting. Under very mild assumptions, we also establish the sharpness and non-existence of extremal functions within the Hardy inequalities and - depending on the geometry of the ambient manifold - their extensibility to various function spaces. Several examples are provided by showing the applicability of our approach; in particular, well-known Hardy inequalities appear as limit cases of our new inequalities.

preprint2023arXiv

1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection Track

OOD-CV challenge is an out-of-distribution generalization task. To solve this problem in object detection track, we propose a simple yet effective Generalize-then-Adapt (G&A) framework, which is composed of a two-stage domain generalization part and a one-stage domain adaptation part. The domain generalization part is implemented by a Supervised Model Pretraining stage using source data for model warm-up and a Weakly Semi-Supervised Model Pretraining stage using both source data with box-level label and auxiliary data (ImageNet-1K) with image-level label for performance boosting. The domain adaptation part is implemented as a Source-Free Domain Adaptation paradigm, which only uses the pre-trained model and the unlabeled target data to further optimize in a self-supervised training manner. The proposed G&A framework help us achieve the first place on the object detection leaderboard of the OOD-CV challenge. Code will be released in https://github.com/hikvision-research/OOD-CV.

preprint2023arXiv

PTA-Det: Point Transformer Associating Point cloud and Image for 3D Object Detection

In autonomous driving, 3D object detection based on multi-modal data has become an indispensable approach when facing complex environments around the vehicle. During multi-modal detection, LiDAR and camera are simultaneously applied for capturing and modeling. However, due to the intrinsic discrepancies between the LiDAR point and camera image, the fusion of the data for object detection encounters a series of problems. Most multi-modal detection methods perform even worse than LiDAR-only methods. In this investigation, we propose a method named PTA-Det to improve the performance of multi-modal detection. Accompanied by PTA-Det, a Pseudo Point Cloud Generation Network is proposed, which can convert image information including texture and semantic features by pseudo points. Thereafter, through a transformer-based Point Fusion Transition (PFT) module, the features of LiDAR points and pseudo points from image can be deeply fused under a unified point-based representation. The combination of these modules can conquer the major obstacle in feature fusion across modalities and realizes a complementary and discriminative representation for proposal generation. Extensive experiments on the KITTI dataset show the PTA-Det achieves a competitive result and support its effectiveness.

preprint2023arXiv

Transition routes of electrokinetic flow in a divergent microchannel with bending walls

Electrokinetic flow can be generated as a highly coupled phenomenon among velocity field, electric conductivity field and electric field. It can exhibit different responses to AC electric fields in different frequency regimes, according to different instability/receptivity mechanisms. In this investigation, by both flow visualization and single-point laser-induced fluorescence (LIF) method, the response of AC electrokinetic flow and the transition routes towards chaos and turbulence have been experimentally investigated. It is found, when the AC frequency $f_f<30$ Hz, the interface responds at both the neutral frequency of the basic flow and the AC frequency. However, when $f_f>=30$ Hz, the interface responds only at the neutral frequency of the basic flow. Both periodic doubling and subcritical bifurcations have been observed in the transition of AC electrokinetic flow. We hope the current investigation can promote our current understanding on the ultrafast transition process of electrokinetic flow from laminar state to turbulence.

preprint2023arXiv

Voxelized 3D Feature Aggregation for Multiview Detection

Multi-view detection incorporates multiple camera views to alleviate occlusion in crowded scenes, where the state-of-the-art approaches adopt homography transformations to project multi-view features to the ground plane. However, we find that these 2D transformations do not take into account the object&#39;s height, and with this neglection features along the vertical direction of same object are likely not projected onto the same ground plane point, leading to impure ground-plane features. To solve this problem, we propose VFA, voxelized 3D feature aggregation, for feature transformation and aggregation in multi-view detection. Specifically, we voxelize the 3D space, project the voxels onto each camera view, and associate 2D features with these projected voxels. This allows us to identify and then aggregate 2D features along the same vertical line, alleviating projection distortions to a large extent. Additionally, because different kinds of objects (human vs. cattle) have different shapes on the ground plane, we introduce the oriented Gaussian encoding to match such shapes, leading to increased accuracy and efficiency. We perform experiments on multiview 2D detection and multiview 3D detection problems. Results on four datasets (including a newly introduced MultiviewC dataset) show that our system is very competitive compared with the state-of-the-art approaches. %Our code and data will be open-sourced.Code and MultiviewC are released at https://github.com/Robert-Mar/VFA.

preprint2022arXiv

Bi-CLKT: Bi-Graph Contrastive Learning based Knowledge Tracing

The goal of Knowledge Tracing (KT) is to estimate how well students have mastered a concept based on their historical learning of related exercises. The benefit of knowledge tracing is that students&#39; learning plans can be better organised and adjusted, and interventions can be made when necessary. With the recent rise of deep learning, Deep Knowledge Tracing (DKT) has utilised Recurrent Neural Networks (RNNs) to accomplish this task with some success. Other works have attempted to introduce Graph Neural Networks (GNNs) and redefine the task accordingly to achieve significant improvements. However, these efforts suffer from at least one of the following drawbacks: 1) they pay too much attention to details of the nodes rather than to high-level semantic information; 2) they struggle to effectively establish spatial associations and complex structures of the nodes; and 3) they represent either concepts or exercises only, without integrating them. Inspired by recent advances in self-supervised learning, we propose a Bi-Graph Contrastive Learning based Knowledge Tracing (Bi-CLKT) to address these limitations. Specifically, we design a two-layer contrastive learning scheme based on an &#34;exercise-to-exercise&#34; (E2E) relational subgraph. It involves node-level contrastive learning of subgraphs to obtain discriminative representations of exercises, and graph-level contrastive learning to obtain discriminative representations of concepts. Moreover, we designed a joint contrastive loss to obtain better representations and hence better prediction performance. Also, we explored two different variants, using RNN and memory-augmented neural networks as the prediction layer for comparison to obtain better representations of exercises and concepts respectively. Extensive experiments on four real-world datasets show that the proposed Bi-CLKT and its variants outperform other baseline models.

preprint2022arXiv

Entropy Induced Pruning Framework for Convolutional Neural Networks

Structured pruning techniques have achieved great compression performance on convolutional neural networks for image classification task. However, the majority of existing methods are weight-oriented, and their pruning results may be unsatisfactory when the original model is trained poorly. That is, a fully-trained model is required to provide useful weight information. This may be time-consuming, and the pruning results are sensitive to the updating process of model parameters. In this paper, we propose a metric named Average Filter Information Entropy (AFIE) to measure the importance of each filter. It is calculated by three major steps, i.e., low-rank decomposition of the &#34;input-output&#34; matrix of each convolutional layer, normalization of the obtained eigenvalues, and calculation of filter importance based on information entropy. By leveraging the proposed AFIE, the proposed framework is able to yield a stable importance evaluation of each filter no matter whether the original model is trained fully. We implement our AFIE based on AlexNet, VGG-16, and ResNet-50, and test them on MNIST, CIFAR-10, and ImageNet, respectively. The experimental results are encouraging. We surprisingly observe that for our methods, even when the original model is only trained with one epoch, the importance evaluation of each filter keeps identical to the results when the model is fully-trained. This indicates that the proposed pruning strategy can perform effectively at the beginning stage of the training process for the original model.

preprint2022arXiv

Investigating the environmental dependence of ultralight scalar dark matter with atom interferometers

We study the environmental dependence of ultralight scalar dark matter (DM) with linear interactions to the standard model particles. The solution to the DM field turns out to be a sum of the cosmic harmonic oscillation term and the local exponential fluctuation term. The amplitude of the first term depends on the local DM density and the mass of the DM field. The second term is induced by the local distribution of matter, such as the Earth. And it depends not only on the mass of the Earth, but also the density of the Earth. Then, we compute the phase shift induced by the DM field in atom interferometers (AIs), through solving the trajectories of atoms. Especially, the AI signal for the violation of weak equivalence principle (WEP) caused by the DM field is calculated. Depending on the values of the DM coupling parameters, contributions to the WEP violation from the first and second terms of the DM field can be either comparable or one larger than the other. Finally, we give some constraints to DM coupling parameters using results from the terrestrial atomic WEP tests.

preprint2022arXiv

Learning Quality-aware Dynamic Memory for Video Object Segmentation

Recently, several spatial-temporal memory-based methods have verified that storing intermediate frames and their masks as memory are helpful to segment target objects in videos. However, they mainly focus on better matching between the current frame and the memory frames without explicitly paying attention to the quality of the memory. Therefore, frames with poor segmentation masks are prone to be memorized, which leads to a segmentation mask error accumulation problem and further affect the segmentation performance. In addition, the linear increase of memory frames with the growth of frame number also limits the ability of the models to handle long videos. To this end, we propose a Quality-aware Dynamic Memory Network (QDMN) to evaluate the segmentation quality of each frame, allowing the memory bank to selectively store accurately segmented frames to prevent the error accumulation problem. Then, we combine the segmentation quality with temporal consistency to dynamically update the memory bank to improve the practicability of the models. Without any bells and whistles, our QDMN achieves new state-of-the-art performance on both DAVIS and YouTube-VOS benchmarks. Moreover, extensive experiments demonstrate that the proposed Quality Assessment Module (QAM) can be applied to memory-based methods as generic plugins and significantly improves performance. Our source code is available at https://github.com/workforai/QDMN.

preprint2022arXiv

Mixing and flow transition in an optimized electrokinetic turbulent micromixer

Micromixer is a key element in lab on a chip for broad applications in the analysis and measurement of chemistry and engineering. Previous investigations reported electrokinetic (EK) turbulence could be realized in a Y-type micromixer with a cross-sectional dimension of 100 $μ$m order. Although the ultrafast turbulent mixing can be generated at a bulk flow Reynolds number of O(1), the micromixer has not been optimized. In this investigation, we systematically investigated the influence of electric field intensity, AC frequency, electric conductivity ratio, and channel width at the entrance on the mixing effect and transition electric Rayleigh number in the &#34;Y&#34; type electrokinetic micromixer. It is found the optimal mixing is realized in a 350 $μ$m wide micromixer, under 100 kHz and 1.14*10^5 V/m AC electric field, with an electric conductivity ratio of 1:3000. Under the conditions, a maximum degree of mixedness of 0.93 can be achieved at 84 $μ$m from the entrance and 100 ms. A further investigation of the critical electric field and the critical electric Rayleigh number indicates the most unstable condition of EK flow instability is inconsistent with that of the optimal mixing in EK turbulence. To predict the evolution of EK flow under high $Ra_{e}$, it is necessary to apply a computational turbulence model, instead of linear instability analysis.

preprint2022arXiv

SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification

Pruning techniques are used comprehensively to compress convolutional neural networks (CNNs) on image classification. However, the majority of pruning methods require a well pre-trained model to provide useful supporting parameters, such as C1-norm, BatchNorm value and gradient information, which may lead to inconsistency of filter evaluation if the parameters of the pre-trained model are not well optimized. Therefore, we propose a sensitiveness based method to evaluate the importance of each layer from the perspective of inference accuracy by adding extra damage for the original model. Because the performance of the accuracy is determined by the distribution of parameters across all layers rather than individual parameter, the sensitiveness based method will be robust to update of parameters. Namely, we can obtain similar importance evaluation of each convolutional layer between the imperfect-trained and fully trained models. For VGG-16 on CIFAR-10, even when the original model is only trained with 50 epochs, we can get same evaluation of layer importance as the results when the model is trained fully. Then we will remove filters proportional from each layer by the quantified sensitiveness. Our sensitiveness based pruning framework is verified efficiently on VGG-16, a customized Conv-4 and ResNet-18 with CIFAR-10, MNIST and CIFAR-100, respectively.

preprint2022arXiv

SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language Models

Knowledge graph completion (KGC) aims to reason over known facts and infer the missing links. Text-based methods such as KGBERT (Yao et al., 2019) learn entity representations from natural language descriptions, and have the potential for inductive KGC. However, the performance of text-based methods still largely lag behind graph embedding-based methods like TransE (Bordes et al., 2013) and RotatE (Sun et al., 2019b). In this paper, we identify that the key issue is efficient contrastive learning. To improve the learning efficiency, we introduce three types of negatives: in-batch negatives, pre-batch negatives, and self-negatives which act as a simple form of hard negatives. Combined with InfoNCE loss, our proposed model SimKGC can substantially outperform embedding-based methods on several benchmark datasets. In terms of mean reciprocal rank (MRR), we advance the state-of-the-art by +19% on WN18RR, +6.8% on the Wikidata5M transductive setting, and +22% on the Wikidata5M inductive setting. Thorough analyses are conducted to gain insights into each component. Our code is available at https://github.com/intfloat/SimKGC .

preprint2022arXiv

Towards Explainable Evaluation Metrics for Natural Language Generation

Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics (such as BERTScore or MoverScore) are based on black-box language models such as BERT or XLM-R. They often achieve strong correlations with human judgments, but recent research indicates that the lower-quality classical metrics remain dominant, one of the potential reasons being that their decision processes are transparent. To foster more widespread acceptance of the novel high-quality metrics, explainability thus becomes crucial. In this concept paper, we identify key properties and propose key goals of explainable machine translation evaluation metrics. We also provide a synthesizing overview over recent approaches for explainable machine translation metrics and discuss how they relate to those goals and properties. Further, we conduct own novel experiments, which (among others) find that current adversarial NLP techniques are unsuitable for automatically identifying limitations of high-quality black-box evaluation metrics, as they are not meaning-preserving. Finally, we provide a vision of future approaches to explainable evaluation metrics and their evaluation. We hope that our work can help catalyze and guide future research on explainable evaluation metrics and, mediately, also contribute to better and more transparent text generation systems.

preprint2022arXiv

Ultralight scalar dark matter detection with ZAIGA

ZAIGA is a proposed underground long-baseline atom interferometer (AI) facility, aiming for experimental research on gravitation and related problems. In this paper, we study the possibility of detecting the ultralight scalar dark matter (DM) with ZAIGA. According to a popular scalar DM model, the DM field contains a background oscillation term and a local exponential fluctuation term. In order to calculate the proposed constraints on DM coupling parameters, we need to first compute the DM signals in ZAIGA. For the case of two AIs vertically separated by 300 meters, the DM-induced differential phase consists of three contributions, coming from the DM-induced changes in atomic internal energy levels, atomic masses and the gravitational acceleration. For the case of two AIs horizontally separated by several kilometers, the signal comes from the DM-induced changes in atomic internal energy levels. With the current and future technical parameters of ZAIGA, we then obtain the proposed constraints on five DM coupling parameters. It turns out that our proposed constraints could be several orders of magnitude better than the ones set by the MICROSCOPE space mission.

preprint2022arXiv

Uniformly $S$-Noetherian rings

Let $R$ be a ring and $S$ a multiplicative subset of $R$. Then $R$ is called a uniformly $S$-Noetherian ($u$-$S$-Noetherian for abbreviation) ring provided there exists an element $s\in S$ such that for any ideal $I$ of $R$, $sI \subseteq K$ for some finitely generated sub-ideal $K$ of $I$. We give the Eakin-Nagata-Formanek Theorem for $u$-$S$-Noetherian rings. Besides, the $u$-$S$-Noetherian properties on several ring constructions are given. The notion of $u$-$S$-injective modules is also introduced and studied. Finally, we obtain the Cartan-Eilenberg-Bass Theorem for uniformly $S$-Noetherian rings.

preprint2022arXiv

What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code

Recently, many pre-trained language models for source code have been proposed to model the context of code and serve as a basis for downstream code intelligence tasks such as code completion, code search, and code summarization. These models leverage masked pre-training and Transformer and have achieved promising results. However, currently there is still little progress regarding interpretability of existing pre-trained code models. It is not clear why these models work and what feature correlations they can capture. In this paper, we conduct a thorough structural analysis aiming to provide an interpretation of pre-trained language models for source code (e.g., CodeBERT, and GraphCodeBERT) from three distinctive perspectives: (1) attention analysis, (2) probing on the word embedding, and (3) syntax tree induction. Through comprehensive analysis, this paper reveals several insightful findings that may inspire future studies: (1) Attention aligns strongly with the syntax structure of code. (2) Pre-training language models of code can preserve the syntax structure of code in the intermediate representations of each Transformer layer. (3) The pre-trained models of code have the ability of inducing syntax trees of code. Theses findings suggest that it may be helpful to incorporate the syntax structure of code into the process of pre-training for better code representations.

preprint2021arXiv

A novel multiple instance learning framework for COVID-19 severity assessment via data augmentation and self-supervised learning

How to fast and accurately assess the severity level of COVID-19 is an essential problem, when millions of people are suffering from the pandemic around the world. Currently, the chest CT is regarded as a popular and informative imaging tool for COVID-19 diagnosis. However, we observe that there are two issues -- weak annotation and insufficient data that may obstruct automatic COVID-19 severity assessment with CT images. To address these challenges, we propose a novel three-component method, i.e., 1) a deep multiple instance learning component with instance-level attention to jointly classify the bag and also weigh the instances, 2) a bag-level data augmentation component to generate virtual bags by reorganizing high confidential instances, and 3) a self-supervised pretext component to aid the learning process. We have systematically evaluated our method on the CT images of 229 COVID-19 cases, including 50 severe and 179 non-severe cases. Our method could obtain an average accuracy of 95.8%, with 93.6% sensitivity and 96.4% specificity, which outperformed previous works.

preprint2021arXiv

A Two-Stage Wavelet Decomposition Method for Instantaneous Power Quality Indices Estimation Considering Interharmonics and Transient Disturbances

As the complexity increases in modern power systems, power quality analysis considering interharmonics has become a challenging and important task. This paper proposes a novel decomposition and estimation method for instantaneous power quality indices (PQIs) monitoring in single-phase and three-phase systems with interharmonics and transient disturbances. To separate the interharmonic components, a set of new scaling filter and wavelet filter with narrow transition bands are designed for the undecimated wavelet packet transform (UWPT). Further, a two-stage decomposition method for multi-tone voltage and current signals is proposed. The Hilbert transform (HT) is applied to calculate the instantaneous amplitude and phase of each frequency component, which accordingly allows the monitoring of different PQI parameters. Numerical tests are conducted to check the performance of the proposed method. The test results show that compared to other conventional approaches, instantaneous PQIs estimated by the proposed method present significant advances for tracking transitory changes in power systems, and could be considered as a helpful tool for high-accuracy PQ detections.

preprint2021arXiv

Contracting over persistent information

We consider a dynamic moral hazard problem between a principal and an agent, where the sole instrument the principal has to incentivize the agent is the disclosure of information. The principal aims at maximizing the (discounted) number of times the agent chooses a particular action, e.g., to work hard. We show that there exists an optimal contract, where the principal stops disclosing information as soon as its most preferred action is a static best reply for the agent or else continues disclosing information until the agent perfectly learns the principal&#39;s private information. If the agent perfectly learns the state, he learns it in finite time with probability one; the more patient the agent, the later he learns it.

preprint2021arXiv

Resolution of the paradox of the diamagnetic effect on the Kibble coil

Employing very simple electro-mechanical principles known from classical physics, the Kibble balance establishes a very precise and absolute link between quantum electrical standards and macroscopic mass or force measurements. The success of the Kibble balance, in both determining fundamental constants ($h$, $N_A$, $e$) and realizing a quasi-quantum mass in the 2019 newly revised International System of Units, relies on the perfection of Maxwell&#39;s equations and the symmetry they describe between Lorentz&#39;s force and Faraday&#39;s induction, a principle and a symmetry stunningly demonstrated in the weighing and velocity modes of Kibble balances to within $1\times10^{-8}$, with nothing but imperfect wires and magnets. However, recent advances in the understanding of the current effect in Kibble balances reveal a troubling paradox. A diamagnetic effect, a force that does not cancel between mass-on and mass-off measurement, is challenging balance maker&#39;s assumptions of symmetry at levels that are almost two orders of magnitude larger than the reported uncertainties. The diamagnetic effect, if it exists, shows up in weighing mode without a readily apparent reciprocal effect in the velocity mode, begging questions about systematic errors at the very foundation of the new measurement system. The hypothetical force is caused by the coil current changing the magnetic field, producing an unaccounted force that is systematically modulated with the weighing current. Here we show that this diamagnetic force exists, but the additional force does not change the equivalence between weighing and velocity measurements. We reveal the unexpected way that symmetry is preserved and show that for typical materials and geometries the total relative effect on the measurement is $\approx 1\times10^{-9}$.

preprint2021arXiv

Sharp Hardy inequalities via Riemannian submanifolds

This paper is devoted to Hardy inequalities concerning distance functions from submanifolds of arbitrary codimensions in the Riemannian setting. On a Riemannian manifold with non-negative curvature, we establish several sharp weighted Hardy inequalities in the cases when the submanifold is compact as well as non-compact. In particular, these inequalities remain valid even if the ambient manifold is compact, in which case we find an optimal space of smooth functions to study Hardy inequalities. Further examples are also provided. Our results complement in several aspects those obtained recently in the Euclidean and Riemannian settings.

preprint2021arXiv

Structural Interventions in Networks

Two types of interventions are commonly implemented in networks: characteristic intervention, which influences individuals&#39; intrinsic incentives, and structural intervention, which targets the social links among individuals. In this paper we provide a general framework to evaluate the distinct equilibrium effects of both types of interventions. We identify a hidden equivalence between a structural intervention and an endogenously determined characteristic intervention. Compared with existing approaches in the literature, the perspective from such an equivalence provides several advantages in the analysis of interventions that target network structure. We present a wide range of applications of our theory, including identifying the most wanted criminal(s) in delinquent networks and targeting the key connector for isolated communities.

preprint2021arXiv

The Observation of Ferroelastic and Ferrielectric Domains in AgNbO3 Single Crystal

Compared to AgNbO3 based ceramics, the experimental investigations on the single crystalline AgNbO3, especially the ground state and ferroic domain structures, are not on the same level. Here in this work, based on successfully synthesized AgNbO3 single crystal using flux method, we observed the coexistence of ferroelastic and ferrielectric domain structures by a combination study of polarized light microscopy and piezoresponse force microscope, this finding may provide a new aspect for studying AgNbO3. The result also suggests a weak electromechanical response from the ferrielectric phase of AgNbO3 which is also supported by the transmission electron microscope characterization. Our results reveal that the AgNbO3 single crystal is in a polar ferrielectric phase at room temperature, clarifying its ground state which is controversial from the AgNbO3 ceramic materials.

preprint2020arXiv

A deep learning approach for virtual monochromatic spectral CT imaging with a standard single energy CT scanner

Purpose/Objectives: To develop and assess a strategy of using deep learning (DL) to generate virtual monochromatic CT (VMCT) images from a single-energy CT (SECT) scan. Materials/Methods: The proposed data-driven VMCT imaging consists of two steps: (i) using a supervised DL model trained with a large number of 100 kV and 140 kV dual-energy CT (DECT) image pairs to produce the corresponding high-energy CT image from a low-energy image; and (ii) reconstructing VMCT images with energy ranging from 40 to 150 keV. To evaluate the performance of the method, we retrospectively studied 6,767 abdominal DECT images. The VMCT images reconstructed using both DL-derived DECT (DL-DECT) images and the images from DECT scanner were compared quantitatively. Paired-sample t-tests were used for statistical analysis to show the consistency and precision of calculated HU values. Results: Excellent agreement was found between the DL-DECT and the ground truth DECT images (p values ranged from 0.50 to 0.95). Noise reduction up to 68% (from 163 HU to 51 HU) was achieved for DL-based VMCT imaging as compared to that obtained by using the standard DECT. For the DL-based VMCT, the maximum iodine contrast-to-noise ratio (CNR) for each patient (ranging from 15.1 to 16.6) was achieved at 40 keV. In addition to the enormous benefit of VMCT acquisition with merely a SECT image, an improvement of CNR as high as 55% (from 10.7 to 16.6) was attained with the proposed approach. Conclusions: This study demonstrates that high-quality VMCT images can be obtained with only a SECT scan.

preprint2020arXiv

Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects

Causal mediation analysis aims to characterize an exposure&#39;s effect on an outcome and quantify the indirect effect that acts through a given mediator or a group of mediators of interest. With the increasing availability of measurements on a large number of potential mediators, like the epigenome or the microbiome, new statistical methods are needed to simultaneously accommodate high-dimensional mediators while directly target penalization of the natural indirect effect (NIE) for active mediator identification. Here, we develop two novel prior models for identification of active mediators in high-dimensional mediation analysis through penalizing NIEs in a Bayesian paradigm. Both methods specify a joint prior distribution on the exposure-mediator effect and mediator-outcome effect with either (a) a four-component Gaussian mixture prior or (b) a product threshold Gaussian prior. By jointly modeling the two parameters that contribute to the NIE, the proposed methods enable penalization on their product in a targeted way. Resultant inference can take into account the four-component composite structure underlying the NIE. We show through simulations that the proposed methods improve both selection and estimation accuracy compared to other competing methods. We applied our methods for an in-depth analysis of two ongoing epidemiologic studies: the Multi-Ethnic Study of Atherosclerosis (MESA) and the LIFECODES birth cohort. The identified active mediators in both studies reveal important biological pathways for understanding disease mechanisms.

preprint2020arXiv

Conductive Domain Walls in Non-Oxide Ferroelectrics Sn2P2S6

The conductive domain wall (CDW) is extensively investigated in ferroelectrics, which can be considered as a quasi-two-dimensional reconfigurable conducting channel embedded into an insulating material. Therefore, it is highly important for the application of ferroelectric nanoelectronics. Hitherto, most CDW investigations are restricted in oxides, and limited work has been reported in non-oxides to the contrary. Here, by successfully synthesizing the non-oxide ferroelectric Sn2P2S6 single crystal, we observed and confirmed the domain wall conductivity by using different scanning probe techniques which origins from the nature of inclined domain walls. Moreover, the domains separated by CDW also exhibit distinguishable electrical conductivity due to the interfacial polarization charge with opposite signs. The result provides a novel platform for understanding electrical conductivity behavior of the domains and domain walls in non-oxide ferroelectrics.

preprint2020arXiv

COVID-19 Chest CT Image Segmentation -- A Deep Convolutional Neural Network Solution

A novel coronavirus disease 2019 (COVID-19) was detected and has spread rapidly across various countries around the world since the end of the year 2019, Computed Tomography (CT) images have been used as a crucial alternative to the time-consuming RT-PCR test. However, pure manual segmentation of CT images faces a serious challenge with the increase of suspected cases, resulting in urgent requirements for accurate and automatic segmentation of COVID-19 infections. Unfortunately, since the imaging characteristics of the COVID-19 infection are diverse and similar to the backgrounds, existing medical image segmentation methods cannot achieve satisfactory performance. In this work, we try to establish a new deep convolutional neural network tailored for segmenting the chest CT images with COVID-19 infections. We firstly maintain a large and new chest CT image dataset consisting of 165,667 annotated chest CT images from 861 patients with confirmed COVID-19. Inspired by the observation that the boundary of the infected lung can be enhanced by adjusting the global intensity, in the proposed deep CNN, we introduce a feature variation block which adaptively adjusts the global properties of the features for segmenting COVID-19 infection. The proposed FV block can enhance the capability of feature representation effectively and adaptively for diverse cases. We fuse features at different scales by proposing Progressive Atrous Spatial Pyramid Pooling to handle the sophisticated infection areas with diverse appearance and shapes. We conducted experiments on the data collected in China and Germany and show that the proposed deep CNN can produce impressive performance effectively.

preprint2020arXiv

Dual-energy CT imaging from single-energy CT data with material decomposition convolutional neural network

Dual-energy computed tomography (DECT) is of great significance for clinical practice due to its huge potential to provide material-specific information. However, DECT scanners are usually more expensive than standard single-energy CT (SECT) scanners and thus are less accessible to undeveloped regions. In this paper, we show that the energy-domain correlation and anatomical consistency between standard DECT images can be harnessed by a deep learning model to provide high-performance DECT imaging from fully-sampled low-energy data together with single-view high-energy data, which can be obtained by using a scout-view high-energy image. We demonstrate the feasibility of the approach with contrast-enhanced DECT scans from 5,753 slices of images of twenty-two patients and show its superior performance on DECT applications. The deep learning-based approach could be useful to further significantly reduce the radiation dose of current premium DECT scanners and has the potential to simplify the hardware of DECT imaging systems and to enable DECT imaging using standard SECT scanners.

preprint2020arXiv

From $μ_0$ to $e$: A Survey of Major Impacts for Electrical Measurements in Recent SI Revision

A milestone revision of the International System of Units (SI) was made at the 26th General Conference on Weights and Measures that four of the seven SI base units, i.e. kilogram, ampere, kelvin, and mole, are redefined by fundamental physical constants of nature. The SI base unit founding the electrical measurement activities, i.e. ampere, is defined by fixing the numerical value of the elementary charge to $e=1.602\,176\,634\times10^{-19}$C. For electrical measurement, several major adjustments, mostly positive, are involved in this SI revision. In this paper, the main impacts of the new SI for electrical measurement activities are surveyed under the new framework.

preprint2020arXiv

Long distance measurement using single soliton microcomb

Dispersive interferometry (DPI) takes a major interest in optical frequency comb (OFC) based long distance laser-based light detection and ranging (LIDAR) for the merits of strong anti-interference ability and long coherent length. However, the mismatch between the repetition rate of OFC and the resolution of optical spectrum acquisition system induces a large dead-zone which is a major obstacle for practical applications. Here, a new DPI LIDAR on the strength of high-repetition-rate soliton microcomb is demonstrated, which reaches a minimum Allan deviation of 27 nm for an outdoor 1179 m ranging experiment. The proposed scheme approaches a compact, high-accuracy, and none-dead-zone long distance ranging system, opening up new opportunities for emerging applications of frontier scientific researches and advanced manufacturing.

preprint2020arXiv

Noise control and utility: from regulatory network to spatial patterning

Stochasticity (or noise) at cellular and molecular levels has been observed extensively as a universal feature for living systems. However, how living systems deal with noise while performing desirable biological functions remains a major mystery. Regulatory network configurations, such as their topology and timescale, are shown to be critical in attenuating noise, and noise is also found to facilitate cell fate decision. Here we review major recent findings on noise attenuation through regulatory control, the benefit of noise via noise-induced cellular plasticity during developmental patterning, and summarize key principles underlying noise control.

preprint2020arXiv

Noise2Context: Context-assisted Learning 3D Thin-layer Low Dose CT Without Clean Data

Computed tomography (CT) has played a vital role in medical diagnosis, assessment, and therapy planning, etc. In clinical practice, concerns about the increase of X-ray radiation exposure attract more and more attention. To lower the X-ray radiation, low-dose CT is often used in certain scenarios, while it will induce the degradation of CT image quality. In this paper, we proposed a training method that trained denoising neural networks without any paired clean data. we trained the denoising neural network to map one noise LDCT image to its two adjacent LDCT images in a singe 3D thin-layer low-dose CT scanning, simultaneously In other words, with some latent assumptions, we proposed an unsupervised loss function with the integration of the similarity between adjacent CT slices in 3D thin-layer lowdose CT to train the denoising neural network in an unsupervised manner. For 3D thin-slice CT scanning, the proposed virtual supervised loss function was equivalent to a supervised loss function with paired noisy and clean samples when the noise in the different slices from a single scan was uncorrelated and zero-mean. Further experiments on Mayo LDCT dataset and a realistic pig head were carried out and demonstrated superior performance over existing unsupervised methods.

preprint2020arXiv

On a Curvature Flow in a Band Domain with Unbounded Boundary Slopes

We consider an anisotropic curvature flow $V= A(\mathbf{n})H + B(\mathbf{n})$ in a band domain $Ω:=[-1,1]\times R$, where $\mathbf{n}$, $V$ and $H$ denote the unit normal vector, normal velocity and curvature, respectively, of a graphic curve $Γ_t$. We consider the case when $A>0>B$ and the curve $Γ_t$ contacts $\partial_\pm Ω$ with slopes equaling to $\pm 1$ times of its height (which are unbounded when the solution moves to infinity). First, we present the global well-posedness and then, under some symmetric assumptions on $A$ and $B$, we show the uniform interior gradient estimates for the solution. Based on these estimates, we prove that $Γ_t$ converges as $t\to \infty$ in $C^{2,1}_{\text{loc}} ((-1,1)\times R)$ topology to a cup-like traveling wave with {\it infinite} derivatives on the boundaries.

preprint2020arXiv

On Manually Reverse Engineering Communication Protocols of Linux Based IoT Systems

IoT security and privacy has raised grave concerns. Efforts have been made to design tools to identify and understand vulnerabilities of IoT systems. Most of the existing protocol security analysis techniques rely on a well understanding of the underlying communication protocols. In this paper, we systematically present the first manual reverse engineering framework for discovering communication protocols of embedded Linux based IoT systems. We have successfully applied our framework to reverse engineer a number of IoT systems. As an example, we present a detailed use of the framework reverse-engineering the WeMo smart plug communication protocol by extracting the firmware from the flash, performing static and dynamic analysis of the firmware and analyzing network traffic. The discovered protocol exposes severe design flaws that allow attackers to control or deny the service of victim plugs. Our manual reverse engineering framework is generic and can be applied to both read-only and writable Embedded Linux filesystems.

preprint2020arXiv

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Evaluation of cross-lingual encoders is usually performed either via zero-shot cross-lingual transfer in supervised downstream tasks or via unsupervised cross-lingual textual similarity. In this paper, we concern ourselves with reference-free machine translation (MT) evaluation where we directly compare source texts to (sometimes low-quality) system translations, which represents a natural adversarial setup for multilingual encoders. Reference-free evaluation holds the promise of web-scale comparison of MT systems. We systematically investigate a range of metrics based on state-of-the-art cross-lingual semantic representations obtained with pretrained M-BERT and LASER. We find that they perform poorly as semantic encoders for reference-free MT evaluation and identify their two key limitations, namely, (a) a semantic mismatch between representations of mutual translations and, more prominently, (b) the inability to punish &#34;translationese&#34;, i.e., low-quality literal translations. We propose two partial remedies: (1) post-hoc re-alignment of the vector spaces and (2) coupling of semantic-similarity based metrics with target-side language modeling. In segment-level MT evaluation, our best metric surpasses reference-based BLEU by 5.7 correlation points.

preprint2020arXiv

Quantum key distribution with dissipative Kerr soliton generated by on-chip microresonators

Quantum key distribution (QKD) can distribute symmetric key bits between remote legitimate users with the guarantee of quantum mechanics principles. For practical applications, the compact and robust photonic components for QKD are essential, and there are increasing attention to integrate the source, detector and modulators on a photonic chip. However, the massive and parallel QKD based on wavelength multiplexing are still challenge, due to the limited coherent light sources on the chip. Here, we introduce the Kerr dissipative soliton in a microresonator, which provides the locked coherent frequency comb with 49GHz frequency spacing, for QKD. We demonstrate the parallel QKD by demulplexing the coherent comb lines form the soliton, and showing the potential of Gbps secret key rate if the hundreds of channels covering C and L bands are fully exploited. The demonstrated soliton based QKD architecture are compatible with the efforts of quantum photonic integrated circuits, which are compact, robust and low-cost, and provides a competitive platform of practical QKD chip.

preprint2020arXiv

Sapphire: Automatic Configuration Recommendation for Distributed Storage Systems

Modern distributed storage systems come with aplethora of configurable parameters that controlmodule behavior and affect system performance. Default settings provided by developers are often suboptimal for specific user cases. Tuning parameters can provide significant performance gains but is a difficult task requiring profound experience and expertise, due to the immense number of configurable parameters, complex inner dependencies and non-linearsystem behaviors. To overcome these difficulties, we propose an automatic simulation-based approach, Sapphire, to recommend optimal configurations by leveraging machine learning and black-box optimization techniques. We evaluate Sapphire on Ceph. Results show that Sapphire significantly boosts Ceph performance to 2.2x compared to the default configuration.

preprint2020arXiv

Severity Assessment of Coronavirus Disease 2019 (COVID-19) Using Quantitative Features from Chest CT Images

Background: Chest computed tomography (CT) is recognized as an important tool for COVID-19 severity assessment. As the number of affected patients increase rapidly, manual severity assessment becomes a labor-intensive task, and may lead to delayed treatment. Purpose: Using machine learning method to realize automatic severity assessment (non-severe or severe) of COVID-19 based on chest CT images, and to explore the severity-related features from the resulting assessment model. Materials and Method: Chest CT images of 176 patients (age 45.3$\pm$16.5 years, 96 male and 80 female) with confirmed COVID-19 are used, from which 63 quantitative features, e.g., the infection volume/ratio of the whole lung and the volume of ground-glass opacity (GGO) regions, are calculated. A random forest (RF) model is trained to assess the severity (non-severe or severe) based on quantitative features. Importance of each quantitative feature, which reflects the correlation to the severity of COVID-19, is calculated from the RF model. Results: Using three-fold cross validation, the RF model shows promising results, i.e., 0.933 of true positive rate, 0.745 of true negative rate, 0.875 of accuracy, and 0.91 of area under receiver operating characteristic curve (AUC). The resulting importance of quantitative features shows that the volume and its ratio (with respect to the whole lung volume) of ground glass opacity (GGO) regions are highly related to the severity of COVID-19, and the quantitative features calculated from the right lung are more related to the severity assessment than those of the left lung. Conclusion: The RF based model can achieve automatic severity assessment (non-severe or severe) of COVID-19 infection, and the performance is promising. Several quantitative features, which have the potential to reflect the severity of COVID-19, were revealed.

preprint2020arXiv

Sharp uncertainty principles on general Finsler manifolds

The paper is devoted to sharp uncertainty principles (Heisenberg-Pauli-Weyl, Caffarelli-Kohn-Nirenberg and Hardy inequalities) on forward complete Finsler manifolds endowed with an arbitrary measure. Under mild assumptions, the existence of extremals corresponding to the sharp constants in the Heisenberg-Pauli-Weyl and Caffarelli-Kohn-Nirenberg inequalities fully characterizes the nature of the Finsler manifold in terms of three non-Riemannian quantities, namely, its reversibility and the vanishing of the flag curvature and $S$-curvature induced by the measure, respectively. It turns out in particular that the Busemann-Hausdorff measure is the optimal one in the study of sharp uncertainty principles on Finsler manifolds. The optimality of our results are supported by Randers-type Finslerian examples originating from the Zermelo navigation problem.

preprint2020arXiv

Single-photon scattering and bound states in an atom-waveguide system with two or multiple coupling points

In this paper, we investigate the single-photon scattering and bound states in a one-dimensional coupled-resonator waveguide which couples to a single artificial giant atom with two or more coupling points. When the atom couples to the waveguide via two resonators, the single-photon reflection rate is characterized by either Breit-Wigner or Fano line shapes. When the atom couples to the waveguide via multiple resonators, we numerically show how the destructive interference effect leads to a complete single-photon reflection. We also find a phase transition phenomena for the multi-resonator coupling case, which reveals that the upper bound state only exists when the atom-waveguide coupling strength is above a critical value.

preprint2020arXiv

SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization

We study unsupervised multi-document summarization evaluation metrics, which require neither human-written reference summaries nor human annotations (e.g. preferences, ratings, etc.). We propose SUPERT, which rates the quality of a summary by measuring its semantic similarity with a pseudo reference summary, i.e. selected salient sentences from the source documents, using contextualized embeddings and soft token alignment techniques. Compared to the state-of-the-art unsupervised evaluation metrics, SUPERT correlates better with human ratings by 18-39%. Furthermore, we use SUPERT as rewards to guide a neural-based reinforcement learning summarizer, yielding favorable performance compared to the state-of-the-art unsupervised summarizers. All source code is available at https://github.com/yg211/acl20-ref-free-eval.

preprint2020arXiv

Synergistic Learning of Lung Lobe Segmentation and Hierarchical Multi-Instance Classification for Automated Severity Assessment of COVID-19 in CT Images

Understanding chest CT imaging of the coronavirus disease 2019 (COVID-19) will help detect infections early and assess the disease progression. Especially, automated severity assessment of COVID-19 in CT images plays an essential role in identifying cases that are in great need of intensive clinical care. However, it is often challenging to accurately assess the severity of this disease in CT images, due to variable infection regions in the lungs, similar imaging biomarkers, and large inter-case variations. To this end, we propose a synergistic learning framework for automated severity assessment of COVID-19 in 3D CT images, by jointly performing lung lobe segmentation and multi-instance classification. Considering that only a few infection regions in a CT image are related to the severity assessment, we first represent each input image by a bag that contains a set of 2D image patches (with each cropped from a specific slice). A multi-task multi-instance deep network (called M$^2$UNet) is then developed to assess the severity of COVID-19 patients and also segment the lung lobe simultaneously. Our M$^2$UNet consists of a patch-level encoder, a segmentation sub-network for lung lobe segmentation, and a classification sub-network for severity assessment (with a unique hierarchical multi-instance learning strategy). Here, the context information provided by segmentation can be implicitly employed to improve the performance of severity assessment. Extensive experiments were performed on a real COVID-19 CT image dataset consisting of 666 chest CT images, with results suggesting the effectiveness of our proposed method compared to several state-of-the-art methods.

preprint2019arXiv

Raman laser from an optical resonator with a grafted single molecule monolayer

Raman-based technologies have enabled many ground-breaking scientific discoveries related to surface science, single molecule chemistry and biology. For example, researchers have identified surface bound molecules by their Raman vibrational modes and demonstrated polarization-dependent Raman gain. However, a surface constrained Raman laser has yet to be demonstrated because of the challenges associated with achieving a sufficiently high photon population located at a surface to transition from spontaneous to stimulated Raman scattering. Here, advances in surface chemistry and in integrated photonics are combined to demonstrate lasing based on surface stimulated Raman scattering (SSRS). By creating an oriented, constrained Si-O-Si monolayer on the surface of integrated silica optical microresonators, the requisite conditions for SSRS are achieved with low threshold powers (200microW). The expected polarization-dependence of the SSRS due to the orientation of the Si-O-Si bond is observed. Due to the ordered monolayer, the Raman lasing efficiency is improved from ~5% to over 40%.

preprint2019arXiv

Tensor train-Karhunen-Loève expansion for continuous-indexed random fields using higher-order cumulant functions

The goals of this work are two-fold: firstly, to propose a new theoretical framework for representing random fields on a large class of multidimensional geometrical domain in the tensor train format; secondly, to develop a new algorithm framework for accurately computing the modes and the second and third-order cumulant tensors within moderate time. The core of the new theoretical framework is the tensor train decomposition of cumulant functions. This decomposition is accurately computed with a novel rank-revealing algorithm. Compared with existing Galerkin-type and collocation-type methods, the proposed computational procedure totally removes the need of selecting the basis functions or collocation points and the quadrature points, which not only greatly enhances adaptivity, but also avoids solving large-scale eigenvalue problems. Moreover, by computing with third-order cumulant functions, the new theoretical and algorithm frameworks show great potential for representing general non-Gaussian non-homogeneous random fields. Three numerical examples, including a three-dimensional random field discretization problem, illustrate the efficiency and accuracy of the proposed algorithm framework.