Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
155works
0followers
54topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

155 published item(s)

preprint2026arXiv

Asymmetric Invertible Threat: Learning Reversible Privacy Defense for Face Recognition

Face Recognition systems are widely deployed in real-world applications, but they also raise privacy concerns due to unauthorized collection and misuse of facial data. Existing adversarial privacy protection methods rely on input-space perturbations to obfuscate identity information, yet their protection can degrade when adversaries learn restoration or purification mappings that partially invert the transformation. We study this setting as an asymmetric adversarial attack, in which reverse manipulation becomes feasible because existing defense paradigms do not control reversibility. To address this problem, we propose Asymmetric Reversible Face Protection (ARFP), a restoration-aware extension of personalized face cloaking that integrates privacy protection, keyed recovery, and tamper indication in a single framework. ARFP consists of three components: Key-Conditioned Manifold Binding, which ties the protection transformation to a user-provided key; Adversarial Restoration-Aware Training, which introduces a surrogate restoration adversary during training to improve robustness against evaluated inverse purification attacks; and Authorized Reversible Restoration, which supports recovery with the correct key while providing nonce-based tamper indication. Extensive experiments under the threat models considered in this work show that ARFP improves resistance to the evaluated restoration attacks while preserving authorized recovery utility. These results provide empirical evidence of key-sensitive recovery behavior and tamper awareness in the tested settings.

preprint2025arXiv

Many-electron characterizations of higher-charge superconductors

The theoretical understanding of conventional superconductivity as the phonon-assisted formation and condensation of two-electron Cooper pairs is a significant triumph in condensed matter physics. Here, we propose many-electron characterizations of higher-charge superconductivity with Cooper pairs consisting of more than two electrons, by implementing translation symmetrization on parent pair-density-wave-ordered states. In particular, we demonstrate many-electron constructions with vanishing charge-2e sectors, but with sharp signatures in charge-4e or charge-6e expectation values instead. Such characterizations are consistent with previous phenomenology of vestigial order and Ginzburg-Landau theory, yet, instead of point-group-symmetry presumptions, we show that momentum conservation is both vital and sufficient. Our study thus offers a novel, general, and microscopic route to understand and characterize higher-charge superconductivity, including nontrivial experimental signatures such as fractional magnetic flux and period in interferometry, as well as localized Cooper pairs at lattice topological defects.

preprint2025arXiv

Observation of the $γ$-ray Emission from W43 with LHAASO

In this paper, we report the detection of the very-high-energy (VHE, $ 100{\rm\ GeV} < E < 100{\rm\ TeV} $) and ultra-high-energy (UHE, $E > 100\rm\ TeV$) $γ$-ray emissions from the direction of the young star-forming region W43, observed by the Large High Altitude Air Shower Observation (LHAASO). The extended $γ$-ray source was detected with a significance of ${\sim}16\,σ$ by KM2A and ${\sim}17\,σ$ by WCDA, respectively. The angular extension of this $γ$-ray source is about 0.5 degrees, corresponding to a physical size of about 50 pc. We discuss the origin of the $γ$-ray emission and possible cosmic ray acceleration in the W43 region using multi-wavelength data. Our findings suggest that W43 is likely another young star cluster capable of accelerating cosmic rays (CRs) to at least several hundred TeV.

preprint2025arXiv

Study of Ultra-High-Energy Gamma-Ray Source 1LHAASO J0056+6346u and Its Possible Origins

We report a dedicated study of the newly discovered extended UHE $γ$-ray source 1LHAASO J0056+6346u. Analyzing 979 days of LHAASO-WCDA data and 1389 days of LHAASO-KM2A data, we observed a significant excess of $γ$-ray events with both WCDA and KM2A. Assuming a point power-law source with a fixed spectral index, the significance maps reveal excesses of ${\sim}12.65\,σ$, ${\sim}22.18\,σ$, and ${\sim}10.24\,σ$ in the energy ranges of 1--25 TeV, 25--100 TeV, and $> 100$ TeV, respectively. We use a 3D likelihood algorithm to derive the morphological and spectral parameters, and the source is detected with significances of $12.65\,σ$ by WCDA and $25.27\,σ$ by KM2A. The best-fit positions derived from WCDA and KM2A data are (R.A. = $13.96^\circ\pm0.09^\circ$, Decl. = $63.92^\circ\pm0.05^\circ$) and (R.A. = $14.00^\circ\pm0.05^\circ$, Decl. = $63.79^\circ\pm0.02^\circ$), respectively. The angular size ($r_{39}$) of 1LHAASO J0056+6346u is $0.34^\circ\pm0.04^\circ$ at 1--25 TeV and $0.24^\circ\pm0.02^\circ$ at $> 25$ TeV. The differential flux of this UHE $γ$-ray source can be described by an exponential cutoff power-law function: $(2.67\pm0.25) \times 10^{-15} (E/20\,\text{TeV})^{-1.97\pm0.10} e^{-E/(55.1\pm7.2)\,\text{TeV}} \,\text{TeV}^{-1}\,\text{cm}^{-2}\,\text{s}^{-1}$. To explore potential sources of $γ$-ray emission, we investigated the gas distribution around 1LHAASO J0056+6346u. 1LHAASO J0056+6346u is likely to be a TeV PWN powered by an unknown pulsar, which would naturally explain both its spatial and spectral properties. Another explanation is that this UHE $γ$-ray source might be associated with gas content illuminated by a nearby CR accelerator, possibly the SNR candidate G124.0+1.4.

preprint2025arXiv

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

Black holes (BH), one of the most intriguing objects in the universe, can manifest themselves through electromagnetic radiation initiated by the accretion flow. Some stellar-mass BHs drive relativistic jets when accreting matter from their companion stars, forming microquasars. Non-thermal emission from the radio to tera-electronvolt (TeV) gamma-ray band has been observed from microquasars, indicating the acceleration of relativistic particles. Here we report detection of four microquasars (SS 433, V4641 Sgr, GRS 1915+105, MAXI J1820+070) of spectrum extending to the ultrahigh-energy (UHE; photon energy $E>100$ TeV) band and one microquasar (Cygnus X-1) of spectrum approaching 100 TeV, using the Large High Altitude Air Shower Observatory (LHAASO). Notably, the total emission associated with SS 433 cannot be interpreted with a single leptonic component. In the UHE band, its emission is in spatial coincidence with a giant atomic cloud, which is consistent with a hadronic origin. An elongated source is discovered from V4641 Sgr with the spectrum continuing up to 800 TeV. The detection of UHE gamma rays demonstrates that accreting BHs and their environments can operate as extremely efficient accelerators of particles out of 1 peta-electronvolt (PeV), suggesting microquasars to be important contributors to Galactic cosmic rays especially around the `knee&#39; region.

preprint2024arXiv

Aircraft Landing Time Prediction with Deep Learning on Trajectory Images

Aircraft landing time (ALT) prediction is crucial for air traffic management, especially for arrival aircraft sequencing on the runway. In this study, a trajectory image-based deep learning method is proposed to predict ALTs for the aircraft entering the research airspace that covers the Terminal Maneuvering Area (TMA). Specifically, the trajectories of all airborne arrival aircraft within the temporal capture window are used to generate an image with the target aircraft trajectory labeled as red and all background aircraft trajectory labeled as blue. The trajectory images contain various information, including the aircraft position, speed, heading, relative distances, and arrival traffic flows. It enables us to use state-of-the-art deep convolution neural networks for ALT modeling. We also use real-time runway usage obtained from the trajectory data and the external information such as aircraft types and weather conditions as additional inputs. Moreover, a convolution neural network (CNN) based module is designed for automatic holding-related featurizing, which takes the trajectory images, the leading aircraft holding status, and their time and speed gap at the research airspace boundary as its inputs. Its output is further fed into the final end-to-end ALT prediction. The proposed ALT prediction approach is applied to Singapore Changi Airport (ICAO Code: WSSS) using one-month Automatic Dependent Surveillance-Broadcast (ADS-B) data from November 1 to November 30, 2022. Experimental results show that by integrating the holding featurization, we can reduce the mean absolute error (MAE) from 82.23 seconds to 43.96 seconds, and achieve an average accuracy of 96.1\%, with 79.4\% of the predictions errors being less than 60 seconds.

preprint2024arXiv

Vertex degree sums for perfect matchings in 3-uniform hypergraphs

Let $n \equiv 0\, (\, \text{mod } 3\,)$ and $H_{n, n/3}^2$ be the 3-graph of order $n$, whose vertex set is partitioned into two sets $S$ and $T$ of size $\frac{1}{3}n+1$ and $\frac{2}{3}n -1$, respectively, and whose edge set consists of all triples with at least $2$ vertices in $T$. Suppose that $n$ is sufficiently large and $H$ is a 3-uniform hypergraph of order $n$ with no isolated vertex. Zhang and Lu [Discrete Math. 341 (2018), 748--758] conjectured that if $deg(u)+deg(v) > 2(\binom{n-1}{2}-\binom{2n/3}{2})$ for any two vertices $u$ and $v$ that are contained in some edge of $H$, then $H$ contains a perfect matching or $H$ is a subgraph of $H_{n,n/3}^2$. We construct a counter-example to the conjecture. Furthermore, for all $γ>0$ and let $n \in 3 \mathbb{Z}$ be sufficiently large, we prove that if $deg(u)+deg(v) > (3/5+γ)n^2$ for any two vertices $u$ and $v$ that are contained in some edge of $H$, then $H$ contains a perfect matching or $H$ is a subgraph of $H_{n,n/3}^2$. This implies a result of Zhang, Zhao and Lu [Electron. J. Combin. 25 (3), 2018].

preprint2023arXiv

A Survey on Automated Driving System Testing: Landscapes and Trends

Automated Driving Systems (ADS) have made great achievements in recent years thanks to the efforts from both academia and industry. A typical ADS is composed of multiple modules, including sensing, perception, planning, and control, which brings together the latest advances in different domains. Despite these achievements, safety assurance of ADS is of great significance, since unsafe behavior of ADS can bring catastrophic consequences. Testing has been recognized as an important system validation approach that aims to expose unsafe system behavior; however, in the context of ADS, it is extremely challenging to devise effective testing techniques, due to the high complexity and multidisciplinarity of the systems. There has been great much literature that focuses on the testing of ADS, and a number of surveys have also emerged to summarize the technical advances. Most of the surveys focus on the system-level testing performed within software simulators, and they thereby ignore the distinct features of different modules. In this paper, we provide a comprehensive survey on the existing ADS testing literature, which takes into account both module-level and system-level testing. Specifically, we make the following contributions: (1) we survey the module-level testing techniques for ADS and highlight the technical differences affected by the features of different modules; (2) we also survey the system-level testing techniques, with focuses on the empirical studies that summarize the issues occurring in system development or deployment, the problems due to the collaborations between different modules, and the gap between ADS testing in simulators and the real world; (3) we identify the challenges and opportunities in ADS testing, which pave the path to the future research in this field.

preprint2023arXiv

Mottness in two-dimensional van der Waals Nb$_3$X$_8$ monolayers (X=Cl, Br, and I)

We investigate strong electron-electron correlation effects on 2-dimensional van der Waals materials Nb$_3$X$_8$ (X=Cl, Br, I). We find that the monolayers Nb$_3$X$_8$ are ideal systems close to the strong correlation limit. They can be described by a half-filled single band Hubbard model in which the ratio between the Hubbard, U, and the bandwidth, W, U/W $\approx$ 5 $\sim$ 10. Both Mott and magnetic transitions of the material are calculated by the slave boson mean field theory. Doping the Mott state, a $d_{x^2-y^2}+id_{xy}$ superconducting pairing instability is found. We also construct a tunable bilayer Hubbard system for two sliding Nb$_3$X$_8$ layers. The bilayer system displays a crossover between the band insulator and Mott insulator.

preprint2023arXiv

On the stable Auslander-Reiten components of certain monomorphism categories

Let $Λ$ be an Artin algebra and let $\rm{Gprj}\mbox{-}Λ$ denote the class of all finitely generated Gorenstein projective $Λ$-modules. In this paper, we study the components of the stable Auslander-Reiten quiver of a certain subcategory of the monomorphism category $\mathcal{S}({\rm Gprj}\mbox{-}Λ)$ containing boundary vertices. We describe the shape of such components. It is shown that certain components are linked to the orbits of an auto-equivalence on the stable category $\underline{\rm{Gprj}}\mbox{-}Λ$. In particular, for the finite components, we show that under certain mild conditions their cardinalities are divisible by $3$. We see that this three-periodicity phenomenon reoccurs several times in the paper.

preprint2022arXiv

$B_c \to J/ψ$ helicity form factors and the $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$ decays

In this paper, we calculate the $B_c\to J/ψ$ helicity form factors (HFFs) up to twist-4 accuracy by using the light-cone sum rules (LCSR) approach. After extrapolating those HFFs to the physically allowable $q^2$ region, we investigate the $B^+_c$-meson two-body decays and semi-leptonic decays $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$ with $P/V$ stands for light pseudoscalar/vector meson, respectively. The branching fractions can be derived by using the CKM matrix element and the $B_c$ lifetime from the Particle Data Group, and we obtain ${\cal B}(B_c^+ \to J/ψπ^+)=(0.136^{+0.002}_{-0.002})\%$, ${\cal B}(B_c^+ \to J/ψK^+)=(0.010^{+0.000}_{-0.000})\%$, ${\cal B}(B_c^+ \to J/ψρ^+) =(0.768^{+0.029}_{-0.033})\%$, ${\cal B}(B_c^+ \to J/ψK^{\ast +})=(0.043^{+0.001}_{-0.001})\%$, ${\cal B}(B_c^+ \to J/ψμ^+ν_μ)=(2.802^{+0.526}_{-0.675})\%$ and ${\cal B}(B_c^+ \to J/ψτ^+ν_τ)=(0.559^{+0.131}_{-0.170})\%$. We then obtain ${\cal R}_{π^+/μ^+ν_μ} = 0.048^{+ 0.009}_{-0.012}$ and ${\cal R}_{K^+ / π^+} = 0.075^{+0.005}_{-0.005}$, which agree with the LHCb measured value within $1σ$-error. We also obtain ${\cal R}_{J/ψ}=0.199^{+ 0.060}_{-0.077}$, which like other theoretical predictions, is consistent with the LHCb measured value within $2σ$-error. Those imply that the HFFs under the LCSR approach are also applicable to the $B^+_c$ meson two-body decays and semi-leptonic decays $B_c^+ \to J/ψ+(P, V, \ell^+ ν_\ell)$, and the HFFs obtained by using LCSR in a new way implies that there may be new physics in the $B_c\to J/ψ\ell^+ ν_\ell$ semi-leptonic decays.

preprint2022arXiv

A microstructure estimation Transformer inspired by sparse representation for diffusion MRI

Diffusion magnetic resonance imaging (dMRI) is an important tool in characterizing tissue microstructure based on biophysical models, which are complex and highly non-linear. Resolving microstructures with optimization techniques is prone to estimation errors and requires dense sampling in the q-space. Deep learning based approaches have been proposed to overcome these limitations. Motivated by the superior performance of the Transformer, in this work, we present a learning-based framework based on Transformer, namely, a Microstructure Estimation Transformer with Sparse Coding (METSC) for dMRI-based microstructure estimation with downsampled q-space data. To take advantage of the Transformer while addressing its limitation in large training data requirements, we explicitly introduce an inductive bias - model bias into the Transformer using a sparse coding technique to facilitate the training process. Thus, the METSC is composed with three stages, an embedding stage, a sparse representation stage, and a mapping stage. The embedding stage is a Transformer-based structure that encodes the signal to ensure the voxel is represented effectively. In the sparse representation stage, a dictionary is constructed by solving a sparse reconstruction problem that unfolds the Iterative Hard Thresholding (IHT) process. The mapping stage is essentially a decoder that computes the microstructural parameters from the output of the second stage, based on the weighted sum of normalized dictionary coefficients where the weights are also learned. We tested our framework on two dMRI models with downsampled q-space data, including the intravoxel incoherent motion (IVIM) model and the neurite orientation dispersion and density imaging (NODDI) model. The proposed method achieved up to 11.25 folds of acceleration in scan time and outperformed the other state-of-the-art learning-based methods.

preprint2022arXiv

A non-invasive fault location method for modular multilevel converters under light load conditions

This paper proposes a non-invasive fault location method for modular multilevel converters (MMC) considering light load conditions. The prior-art fault location methods of the MMC are often developed and verified under full load conditions. However, it is revealed that the faulty arm current will be suppressed to be unipolar when the open-circuit fault happens on the submodule switch under light load. This leads to the capacitor voltage of the healthy and faulty submodules rising or falling with the same variations, increasing the difficulty of fault location. The proposed approach of injecting the second-order circulating current will rebuild the bipolar arm current of the MMC and enlarge the capacitor voltage deviations between the healthy and faulty SMs. As a result, the fault location time is significantly shortened. The simulations are carried out to validate the effectiveness of the proposed approach, showing that the fault location time is reduced to 1/6 compared with the condition without second-order circulating current injection.

preprint2022arXiv

Adaptive Testing for Connected and Automated Vehicles with Sparse Control Variates in Overtaking Scenarios

Testing and evaluation is a critical step in the development and deployment of connected and automated vehicles (CAVs). Due to the black-box property and various types of CAVs, how to test and evaluate CAVs adaptively remains a major challenge. Many approaches have been proposed to adaptively generate testing scenarios during the testing process. However, most existing approaches cannot be applied to complex scenarios, where the variables needed to define such scenarios are high dimensional. Towards filling this gap, the adaptive testing with sparse control variates method is proposed in this paper. Instead of adaptively generating testing scenarios, our approach evaluates CAVs&#39; performances by adaptively utilizing the testing results. Specifically, each testing result is adjusted using multiple linear regression techniques based on control variates. As the regression coefficients can be adaptively optimized for the CAV under test, using the adjusted results can reduce the estimation variance, compared with using the testing results directly. To overcome the high dimensionality challenge, sparse control variates are utilized only for the critical variables of testing scenarios. To validate the proposed method, the high-dimensional overtaking scenarios are investigated, and the results demonstrate that our approach can further accelerate the evaluation process by about 30 times.

preprint2022arXiv

Augmented Transfer Regression Learning with Semi-non-parametric Nuisance Models

In contemporary statistical learning, covariate shift correction plays an important role in transfer learning when distribution of the testing data is shifted from the training data. Importance weighting, as a natural and principle strategy to adjust for covariate shift, has been commonly used in the field of transfer learning. However, this strategy is not robust to model misspecification or excessive estimation error. In this paper, we propose an augmented transfer regression learning (ATReL) approach that introduces an imputation model for the targeted response, and uses it to augment the importance weighting equation. With novel semi-non-parametric constructions and calibrated moment estimating equations for the two nuisance models, our ATReL method is less prone to (i) the curse of dimensionality compared to nonparametric approaches, and (ii) model mis-specification than parametric approaches. We show that our ATReL estimator is root-n-consistent when at least one nuisance model is correctly specified, estimation for the parametric part of the nuisance models achieves parametric rate, and the nonparametric components are rate doubly robust. Simulation studies demonstrate that our method is more robust and efficient than existing parametric and fully nonparametric (machine learning) estimators under various configurations. We also examine the utility of our method through a real example about transfer learning of phenotyping algorithm for rheumatoid arthritis across different time windows. Finally, we propose ways to enhance the intrinsic efficiency of our estimator and to incorporate modern machine learning methods with our proposed framework.

preprint2022arXiv

Clinical Prompt Learning with Frozen Language Models

Prompt learning is a new paradigm in the Natural Language Processing (NLP) field which has shown impressive performance on a number of natural language tasks with common benchmarking text datasets in full, few-shot, and zero-shot train-evaluation setups. Recently, it has even been observed that large but frozen pre-trained language models (PLMs) with prompt learning outperform smaller but fine-tuned models. However, as with many recent NLP trends, the performance of even the largest PLMs such as GPT-3 do not perform well on specialized domains (e.g. medical text), and the common practice to achieve State of the Art (SoTA) results still consists of pre-training and fine-tuning the PLMs on downstream tasks. The reliance on fine-tuning large PLMs is problematic in clinical settings where data is often held in non-GPU environments, and more resource efficient methods of training specialized domain models is crucial. We investigated the viability of prompt learning on clinically meaningful decision tasks and directly compared with more traditional fine-tuning methods. Results are partially in line with the prompt learning literature, with prompt learning able to match or improve on traditional fine-tuning with substantially fewer trainable parameters and requiring less training data. We argue that prompt learning therefore provides lower computational resource costs applicable to clinical settings, that can serve as an alternative to fine-tuning ever increasing in size PLMs. Complementary code to reproduce experiments presented in this work can be found at: https://github.com/NtaylorOX/Public_Clinical_Prompt.

preprint2022arXiv

Constraining the baryon loading factor of AGN jets: implication from the gamma-ray emission of the Coma cluster

High-energy cosmic rays (CRs) can be accelerated in the relativistic jets of Active Galactic Nuclei (AGNs) powered by supermassive black holes. The baryon loading efficiency onto relativistic CR baryons from the accreting black holes is poorly constrained by observations so far. In this paper, we suggest that the $γ$-ray emission of galaxy clusters can be used to study the baryon loading factor of AGN jets, since CRs injected by AGN jets are completely confined in the galaxy clusters and sufficiently interact with intra-cluster medium via hadronic process, producing diffuse $γ$-rays. We study the propagation of CRs in the galaxy clusters and calculate the radial distribution of the gamma-rays in the galaxy cluster with different injection rates from AGNs. By comparison with the $γ$-ray flux and upper limits of the Coma cluster measured by $Fermi$-LAT and VERITAS, we find the upper limit of the average baryon loading factor {(defined as the efficiency with which the gravitational energy is converted into relativistic particles)} to be $η_{p, \mathrm{grav}} < 0.1$. The upper limit is much lower than that required to account for diffuse neutrino flux in the conventional blazar models.

preprint2022arXiv

Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training Models

Vision-Language Pre-training (VLP) models have achieved state-of-the-art performance in numerous cross-modal tasks. Since they are optimized to capture the statistical properties of intra- and inter-modality, there remains risk to learn social biases presented in the data as well. In this work, we (1) introduce a counterfactual-based bias measurement \emph{CounterBias} to quantify the social bias in VLP models by comparing the [MASK]ed prediction probabilities of factual and counterfactual samples; (2) construct a novel VL-Bias dataset including 24K image-text pairs for measuring gender bias in VLP models, from which we observed that significant gender bias is prevalent in VLP models; and (3) propose a VLP debiasing method \emph{FairVLP} to minimize the difference in the [MASK]ed prediction probabilities between factual and counterfactual image-text pairs for VLP debiasing. Although CounterBias and FairVLP focus on social bias, they are generalizable to serve as tools and provide new insights to probe and regularize more knowledge in VLP models.

preprint2022arXiv

Deep Multi-task Network for Delay Estimation and Echo Cancellation

Echo path delay (or ref-delay) estimation is a big challenge in acoustic echo cancellation. Different devices may introduce various ref-delay in practice. Ref-delay inconsistency slows down the convergence of adaptive filters, and also degrades the performance of deep learning models due to &#39;unseen&#39; ref-delays in the training set. In this paper, a multi-task network is proposed to address both ref-delay estimation and echo cancellation tasks. The proposed architecture consists of two convolutional recurrent networks (CRNNs) to estimate the echo and enhanced signals separately, as well as a fully-connected (FC) network to estimate the echo path delay. Echo signal is first predicted, and then is combined with reference signal together for delay estimation. At the end, delay compensated reference and microphone signals are used to predict the enhanced target signal. Experimental results suggest that the proposed method makes reliable delay estimation and outperforms the existing state-of-the-art solutions in inconsistent echo path delay scenarios, in terms of echo return loss enhancement (ERLE) and perceptual evaluation of speech quality (PESQ). Furthermore, a data augmentation method is studied to evaluate the model performance on different portion of synthetical data with artificially introduced ref-delay.

preprint2022arXiv

Depth Completion using Geometry-Aware Embedding

Exploiting internal spatial geometric constraints of sparse LiDARs is beneficial to depth completion, however, has been not explored well. This paper proposes an efficient method to learn geometry-aware embedding, which encodes the local and global geometric structure information from 3D points, e.g., scene layout, object&#39;s sizes and shapes, to guide dense depth estimation. Specifically, we utilize the dynamic graph representation to model generalized geometric relationship from irregular point clouds in a flexible and efficient manner. Further, we joint this embedding and corresponded RGB appearance information to infer missing depths of the scene with well structure-preserved details. The key to our method is to integrate implicit 3D geometric representation into a 2D learning architecture, which leads to a better trade-off between the performance and efficiency. Extensive experiments demonstrate that the proposed method outperforms previous works and could reconstruct fine depths with crisp boundaries in regions that are over-smoothed by them. The ablation study gives more insights into our method that could achieve significant gains with a simple design, while having better generalization capability and stability. The code is available at https://github.com/Wenchao-Du/GAENet.

preprint2022arXiv

Design Challenges for a Multi-Perspective Search Engine

Many users turn to document retrieval systems (e.g. search engines) to seek answers to controversial questions. Answering such user queries usually require identifying responses within web documents, and aggregating the responses based on their different perspectives. Classical document retrieval systems fall short at delivering a set of direct and diverse responses to the users. Naturally, identifying such responses within a document is a natural language understanding task. In this paper, we examine the challenges of synthesizing such language understanding objectives with document retrieval, and study a new perspective-oriented document retrieval paradigm. We discuss and assess the inherent natural language understanding challenges in order to achieve the goal. Following the design challenges and principles, we demonstrate and evaluate a practical prototype pipeline system. We use the prototype system to conduct a user survey in order to assess the utility of our paradigm, as well as understanding the user information needs for controversial queries.

preprint2022arXiv

Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network

With the growing popularity of smartphones, capturing high-quality images is of vital importance to smartphones. The cameras of smartphones have small apertures and small sensor cells, which lead to the noisy images in low light environment. Denoising based on a burst of multiple frames generally outperforms single frame denoising but with the larger compututional cost. In this paper, we propose an efficient yet effective burst denoising system. We adopt a three-stage design: noise prior integration, multi-frame alignment and multi-frame denoising. First, we integrate noise prior by pre-processing raw signals into a variance-stabilization space, which allows using a small-scale network to achieve competitive performance. Second, we observe that it is essential to adopt an explicit alignment for burst denoising, but it is not necessary to integrate a learning-based method to perform multi-frame alignment. Instead, we resort to a conventional and efficient alignment method and combine it with our multi-frame denoising network. At last, we propose a denoising strategy that processes multiple frames sequentially. Sequential denoising avoids filtering a large number of frames by decomposing multiple frames denoising into several efficient sub-network denoising. As for each sub-network, we propose an efficient multi-frequency denoising network to remove noise of different frequencies. Our three-stage design is efficient and shows strong performance on burst denoising. Experiments on synthetic and real raw datasets demonstrate that our method outperforms state-of-the-art methods, with less computational cost. Furthermore, the low complexity and high-quality performance make deployment on smartphones possible.

preprint2022arXiv

Explanation-based Counterfactual Retraining(XCR): A Calibration Method for Black-box Models

With the rapid development of eXplainable Artificial Intelligence (XAI), a long line of past work has shown concerns about the Out-of-Distribution (OOD) problem in perturbation-based post-hoc XAI models and explanations are socially misaligned. We explore the limitations of post-hoc explanation methods that use approximators to mimic the behavior of black-box models. Then we propose eXplanation-based Counterfactual Retraining (XCR), which extracts feature importance fastly. XCR applies the explanations generated by the XAI model as counterfactual input to retrain the black-box model to address OOD and social misalignment problems. Evaluation of popular image datasets shows that XCR can improve model performance when only retaining 12.5% of the most crucial features without changing the black-box model structure. Furthermore, the evaluation of the benchmark of corruption datasets shows that the XCR is very helpful for improving model robustness and positively impacts the calibration of OOD problems. Even though not calibrated in the validation set like some OOD calibration methods, the corrupted data metric outperforms existing methods. Our method also beats current OOD calibration methods on the OOD calibration metric if calibration on the validation set is applied.

preprint2022arXiv

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation

Occlusion poses a great threat to monocular multi-person 3D human pose estimation due to large variability in terms of the shape, appearance, and position of occluders. While existing methods try to handle occlusion with pose priors/constraints, data augmentation, or implicit reasoning, they still fail to generalize to unseen poses or occlusion cases and may make large mistakes when multiple people are present. Inspired by the remarkable ability of humans to infer occluded joints from visible cues, we develop a method to explicitly model this process that significantly improves bottom-up multi-person human pose estimation with or without occlusions. First, we split the task into two subtasks: visible keypoints detection and occluded keypoints reasoning, and propose a Deeply Supervised Encoder Distillation (DSED) network to solve the second one. To train our model, we propose a Skeleton-guided human Shape Fitting (SSF) approach to generate pseudo occlusion labels on the existing datasets, enabling explicit occlusion reasoning. Experiments show that explicitly learning from occlusions improves human pose estimation. In addition, exploiting feature-level information of visible joints allows us to reason about occluded joints more accurately. Our method outperforms both the state-of-the-art top-down and bottom-up methods on several benchmarks.

preprint2022arXiv

Fine-Tuning BERT for Automatic ADME Semantic Labeling in FDA Drug Labeling to Enhance Product-Specific Guidance Assessment

Product-specific guidances (PSGs) recommended by the United States Food and Drug Administration (FDA) are instrumental to promote and guide generic drug product development. To assess a PSG, the FDA assessor needs to take extensive time and effort to manually retrieve supportive drug information of absorption, distribution, metabolism, and excretion (ADME) from the reference listed drug labeling. In this work, we leveraged the state-of-the-art pre-trained language models to automatically label the ADME paragraphs in the pharmacokinetics section from the FDA-approved drug labeling to facilitate PSG assessment. We applied a transfer learning approach by fine-tuning the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model to develop a novel application of ADME semantic labeling, which can automatically retrieve ADME paragraphs from drug labeling instead of manual work. We demonstrated that fine-tuning the pre-trained BERT model can outperform the conventional machine learning techniques, achieving up to 11.6% absolute F1 improvement. To our knowledge, we were the first to successfully apply BERT to solve the ADME semantic labeling task. We further assessed the relative contribution of pre-training and fine-tuning to the overall performance of the BERT model in the ADME semantic labeling task using a series of analysis methods such as attention similarity and layer-based ablations. Our analysis revealed that the information learned via fine-tuning is focused on task-specific knowledge in the top layers of the BERT, whereas the benefit from the pre-trained BERT model is from the bottom layers.

preprint2022arXiv

Flexible and efficient Bayesian pharmacometrics modeling using Stan and Torsten, Part I

Stan is an open-source probabilistic programing language, primarily designed to do Bayesian data analysis. Its main inference algorithm is an adaptive Hamiltonian Monte Carlo sampler, supported by state of the art gradient computation. Stan&#39;s strengths include efficient computation, an expressive language which offers a great deal of flexibility, and numerous diagnostics that allow modelers to check whether the inference is reliable. Torsten extends Stan with a suite of functions that facilitate the specification of pharmacokinetic and pharmacodynamic models, and makes it straightforward to specify a clinical event schedule. Part I of this tutorial demonstrates how to build, fit, and criticize standard pharmacokinetic and pharmacodynamic models using Stan and Torsten.

preprint2022arXiv

Gradual-impulsive control for continuous-time Markov decision processes with total undiscounted costs and constraints: linear programming approach via a reduction method

We consider the constrained optimal control problem for the gradual-impulsive CTMDP model with the performance criteria being the expected total undiscounted costs (from the running cost and the cost from each time an impulse being applied). The discounted model is covered as a special case. We justify fully a reduction method, and close an open issue in the previous literature. The reduction method induces an equivalent but simpler standard CTMDP model with gradual control only, based on which, we establish effectively, under rather natural conditions, a linear programming approach for solving the concerned constrained optimal control problem.

preprint2022arXiv

Hands-on Wireless Sensing with Wi-Fi: A Tutorial

With the rapid development of wireless communication technology, wireless access points (AP) and internet of things (IoT) devices have been widely deployed in our surroundings. Various types of wireless signals (e.g., Wi-Fi, LoRa, LTE) are filling out our living and working spaces. Previous researches reveal the fact that radio waves are modulated by the spatial structure during the propagation process (e.g., reflection, diffraction, and scattering) and superimposed on the receiver. This observation allows us to reconstruct the surrounding environment based on received wireless signals, called &#34;wireless sensing&#34;. Wireless sensing is an emerging technology that enables a wide range of applications, such as gesture recognition for human-computer interaction, vital signs monitoring for health care, and intrusion detection for security management. Compared with other sensing paradigms, such as vision-based and IMU-based sensing, wireless sensing solutions have unique advantages such as high coverage, pervasiveness, low cost, and robustness under adverse light and texture scenarios. Besides, wireless sensing solutions are generally lightweight in terms of both computation overhead and device size. This tutorial takes Wi-Fi sensing as an example. It introduces both the theoretical principles and the code implementation of data collection, signal processing, features extraction, and model design. In addition, this tutorial highlights state-of-the-art deep learning models (e.g., CNN, RNN, and adversarial learning models) and their applications in wireless sensing systems. We hope this tutorial will help people in other research fields to break into wireless sensing research and learn more about its theories, designs, and implementation skills, promoting prosperity in the wireless sensing research field.

preprint2022arXiv

HiGNN: Hierarchical Informative Graph Neural Networks for Molecular Property Prediction Equipped with Feature-Wise Attention

Elucidating and accurately predicting the druggability and bioactivities of molecules plays a pivotal role in drug design and discovery and remains an open challenge. Recently, graph neural networks (GNN) have made remarkable advancements in graph-based molecular property prediction. However, current graph-based deep learning methods neglect the hierarchical information of molecules and the relationships between feature channels. In this study, we propose a well-designed hierarchical informative graph neural networks framework (termed HiGNN) for predicting molecular property by utilizing a co-representation learning of molecular graphs and chemically synthesizable BRICS fragments. Furthermore, a plug-and-play feature-wise attention block is first designed in HiGNN architecture to adaptively recalibrate atomic features after the message passing phase. Extensive experiments demonstrate that HiGNN achieves state-of-the-art predictive performance on many challenging drug discovery-associated benchmark datasets. In addition, we devise a molecule-fragment similarity mechanism to comprehensively investigate the interpretability of HiGNN model at the subgraph level, indicating that HiGNN as a powerful deep learning tool can help chemists and pharmacists identify the key components of molecules for designing better molecules with desired properties or functions. The source code is publicly available at https://github.com/idruglab/hignn.

preprint2022arXiv

HyperCI: A Higher Order Collective Influence Measure for Hypernetwork Dismantling

The connectivity of networked systems is often dependent on a small portion of critical nodes. Network dismantling studies the strategy to identify a subset of nodes the removal of which will maximally destroy the connectivity of a network and fragment it into disconnected components. However, conventional network dismantling approaches focus on simple network which models only pairwise interaction between nodes while groupwise interactions among arbitrary number of nodes are ubiquitous in networked systems like integrated circuits. Groupwise interactions modeled by hypernetwork introduce higher order connectivity patterns, which limits the application of conventional network dismantling methods on hypernetwork. In this brief, we propose HyperCI, a higher order collective influence measure for hypernetwork dismantling. It considers the node co-occurrence characteristics and higher order influence ability both introduced by hyperedges in hypernetwork. We evaluate the effectiveness of our proposed HyperCI on six real world hypernetworks including integrated circuits and citation networks and the results indicate our proposed HyperCI outperforms baseline network dismantling methods for both simple network and hypernetwork.

preprint2022arXiv

IDR: Self-Supervised Image Denoising via Iterative Data Refinement

The lack of large-scale noisy-clean image pairs restricts supervised denoising methods&#39; deployment in actual applications. While existing unsupervised methods are able to learn image denoising without ground-truth clean images, they either show poor performance or work under impractical settings (e.g., paired noisy images). In this paper, we present a practical unsupervised image denoising method to achieve state-of-the-art denoising performance. Our method only requires single noisy images and a noise model, which is easily accessible in practical raw image denoising. It performs two steps iteratively: (1) Constructing a noisier-noisy dataset with random noise from the noise model; (2) training a model on the noisier-noisy dataset and using the trained model to refine noisy images to obtain the targets used in the next round. We further approximate our full iterative method with a fast algorithm for more efficient training while keeping its original high performance. Experiments on real-world, synthetic, and correlated noise show that our proposed unsupervised denoising approach has superior performances over existing unsupervised methods and competitive performance with supervised methods. In addition, we argue that existing denoising datasets are of low quality and contain only a small number of scenes. To evaluate raw image denoising performance in real-world applications, we build a high-quality raw image dataset SenseNoise-500 that contains 500 real-life scenes. The dataset can serve as a strong benchmark for better evaluating raw image denoising. Code and dataset will be released at https://github.com/zhangyi-3/IDR

preprint2022arXiv

Impact of Loss Model Selection on Power Semiconductor Lifetime Prediction in Electric Vehicles

Power loss estimation is an indispensable procedure to conduct lifetime prediction for power semiconductor device. The previous studies successfully perform steady-state power loss estimation for different applications, but which may be limited for the electric vehicles (EVs) with high dynamics. Based on two EV standard driving cycle profiles, this paper gives a comparative study of power loss estimation models with two different time resolutions, i.e., the output period average and the switching period average. The correspondingly estimated power losses, thermal profiles, and lifetime clearly pointed out that the widely applied power loss model with the output period average is limited for EV applications, in particular for the highly dynamic driving cycle. The difference in the predicted lifetime can be up to 300 times due to the unreasonable choice the loss model, which calls for the industry attention on the differences of the EVs and the importance of loss model selection in lifetime prediction.

preprint2022arXiv

Label Semantic Aware Pre-training for Few-shot Text Classification

In text classification tasks, useful information is encoded in the label names. Label semantic aware systems have leveraged this information for improved text classification performance during fine-tuning and prediction. However, use of label-semantics during pre-training has not been extensively explored. We therefore propose Label Semantic Aware Pre-training (LSAP) to improve the generalization and data efficiency of text classification systems. LSAP incorporates label semantics into pre-trained generative models (T5 in our case) by performing secondary pre-training on labeled sentences from a variety of domains. As domain-general pre-training requires large amounts of data, we develop a filtering and labeling pipeline to automatically create sentence-label pairs from unlabeled text. We perform experiments on intent (ATIS, Snips, TOPv2) and topic classification (AG News, Yahoo! Answers). LSAP obtains significant accuracy improvements over state-of-the-art models for few-shot text classification while maintaining performance comparable to state of the art in high-resource settings.

preprint2022arXiv

Learning Degradation Representations for Image Deblurring

In various learning-based image restoration tasks, such as image denoising and image super-resolution, the degradation representations were widely used to model the degradation process and handle complicated degradation patterns. However, they are less explored in learning-based image deblurring as blur kernel estimation cannot perform well in real-world challenging cases. We argue that it is particularly necessary for image deblurring to model degradation representations since blurry patterns typically show much larger variations than noisy patterns or high-frequency textures.In this paper, we propose a framework to learn spatially adaptive degradation representations of blurry images. A novel joint image reblurring and deblurring learning process is presented to improve the expressiveness of degradation representations. To make learned degradation representations effective in reblurring and deblurring, we propose a Multi-Scale Degradation Injection Network (MSDI-Net) to integrate them into the neural networks. With the integration, MSDI-Net can handle various and complicated blurry patterns adaptively. Experiments on the GoPro and RealBlur datasets demonstrate that our proposed deblurring framework with the learned degradation representations outperforms state-of-the-art methods with appealing improvements. The code is released at https://github.com/dasongli1/Learning_degradation.

preprint2022arXiv

Mamba: a systematic software solution for beamline experiments at HEPS

To cater for the diverse experiment requirements at the High Energy Photon Source (HEPS) with often limited human resources, Bluesky is chosen as the basis for our software framework, Mamba. In our attempt to address Bluesky&#39;s lack of integrated GUIs, command injection with feedback is chosen as the main way for the GUIs to cooperate with the CLI; a RPC service is provided, which also covers functionalities unsuitable for command injection, as well as pushing of status updates. In order to fully support high-frequency applications like fly scans, Bluesky&#39;s support for asynchronous control is being improved; to support high-throughput experiments, Mamba Data Worker (MDW) is being developed to cover the complexity in asynchronous online data processing for these experiments. To systematically simplify the specification of metadata, scan parameters and data-processing graphs for each type of experiments, an experiment parameter generator (EPG) will be developed; experiment-specific modules to automate preparation steps will also be made. The integration of off-the-shelf code in Mamba for domain-specific needs is under investigation, and Mamba GUI Studio (MGS) is being developed to simplify the implementation and integration of GUIs.

preprint2022arXiv

Multi-armed Bandits for Link Configuration in Millimeter-wave Networks

Establishing and maintaining millimeter-wave (mmWave) links is challenging due to the changing environment and the high sensibility of mmWave signal to user mobility and channel conditions. MmWave link configuration problems often involve a search for optimal system parameter under environmental uncertainties, from a finite set of alternatives that are supported by the system hardware and protocol. For example, beam sweeping aims at identifying the optimal beam(s) for data transmission from a discrete codebook. Selecting parameters such as the beam sweeping period and the beamwidth are crucial to achieving high overall system throughput. In this article, we motivate the use of the multi-armed bandit (MAB) framework to intelligently search out the optimal configuration when establishing the mmWave links. MAB is a reinforcement learning framework that guides a decision-maker to sequentially select one action from a set of actions. As an example, we show that within the MAB framework, the optimal beam sweeping period, beamwidth, and beam directions could be dynamically learned with sample-computational-efficient bandit algorithms. We conclude by highlighting some future research directions on enhancing mmWave link configuration design with MAB.

preprint2022arXiv

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Pre-trained language models have been recently shown to benefit task-oriented dialogue (TOD) systems. Despite their success, existing methods often formulate this task as a cascaded generation problem which can lead to error accumulation across different sub-tasks and greater data annotation overhead. In this study, we present PPTOD, a unified plug-and-play model for task-oriented dialogue. In addition, we introduce a new dialogue multi-task pre-training strategy that allows the model to learn the primary TOD task completion skills from heterogeneous dialog corpora. We extensively test our model on three benchmark TOD tasks, including end-to-end dialogue modelling, dialogue state tracking, and intent classification. Experimental results show that PPTOD achieves new state of the art on all evaluated tasks in both high-resource and low-resource scenarios. Furthermore, comparisons against previous SOTA methods show that the responses generated by PPTOD are more factually correct and semantically coherent as judged by human annotators.

preprint2022arXiv

NFANet: A Novel Method for Weakly Supervised Water Extraction from High-Resolution Remote Sensing Imagery

The use of deep learning for water extraction requires precise pixel-level labels. However, it is very difficult to label high-resolution remote sensing images at the pixel level. Therefore, we study how to utilize point labels to extract water bodies and propose a novel method called the neighbor feature aggregation network (NFANet). Compared with pixellevel labels, point labels are much easier to obtain, but they will lose much information. In this paper, we take advantage of the similarity between the adjacent pixels of a local water-body, and propose a neighbor sampler to resample remote sensing images. Then, the sampled images are sent to the network for feature aggregation. In addition, we use an improved recursive training algorithm to further improve the extraction accuracy, making the water boundary more natural. Furthermore, our method utilizes neighboring features instead of global or local features to learn more representative features. The experimental results show that the proposed NFANet method not only outperforms other studied weakly supervised approaches, but also obtains similar results as the state-of-the-art ones.

preprint2022arXiv

On Predicting Generalization using GANs

Research on generalization bounds for deep networks seeks to give ways to predict test error using just the training dataset and the network parameters. While generalization bounds can give many insights about architecture design, training algorithms, etc., what they do not currently do is yield good predictions for actual test error. A recently introduced Predicting Generalization in Deep Learning competition~\citep{jiang2020neurips} aims to encourage discovery of methods to better predict test error. The current paper investigates a simple idea: can test error be predicted using {\em synthetic data,} produced using a Generative Adversarial Network (GAN) that was trained on the same training dataset? Upon investigating several GAN models and architectures, we find that this turns out to be the case. In fact, using GANs pre-trained on standard datasets, the test error can be predicted without requiring any additional hyper-parameter tuning. This result is surprising because GANs have well-known limitations (e.g. mode collapse) and are known to not learn the data distribution accurately. Yet the generated samples are good enough to substitute for test data. Several additional experiments are presented to explore reasons why GANs do well at this task. In addition to a new approach for predicting generalization, the counter-intuitive phenomena presented in our work may also call for a better understanding of GANs&#39; strengths and limitations.

preprint2022arXiv

On some combinatorial sequences associated to invariant theory

We study the enumerative and analytic properties of some sequences constructed using tensor invariant theory. The octant sequences are constructed from the exceptional Lie group $G_2$ and the quadrant sequences from the special linear group $SL(3)$. In each case we show that the corresponding sequences are related by binomial transforms. The first three octant sequences and the first four quadrant sequences are listed in the On-Line Encyclopedia of Integer Sequences (OEIS). These sequences all have interpretations as enumerating two-dimensional lattice walks but for the octant sequences the boundary conditions are unconventional. These sequences are all P-recursive and we give the corresponding recurrence relations. In all cases the associated differential operators are of third order and have the remarkable property that they can be solved to give closed formulae for the ordinary generating functions in terms of classical Gaussian hypergeometric functions. Moreover, we show that the octant sequences and the quadrant sequences are related by the branching rules for the inclusion of $SL(3)$ in $G_2$.

preprint2022arXiv

On the quantisation and anomalies of antisymmetric tensor-spinors

We perform the quantisation of antisymmetric tensor-spinors (fermionic $p$-forms) $ψ^α_{μ_1 \dots μ_p}$ using the Batalin-Vilkovisky field-antifield formalism. Just as for the gravitino ($p=1$), an extra propagating Nielsen-Kallosh ghost appears in quadratic gauges containing a differential operator. The appearance of this `third ghost&#39; is described within the BV formalism for arbitrary reducible gauge theories. We then use the resulting spectrum of ghosts and the Atiyah-Singer index theorem to compute gravitational anomalies.

preprint2022arXiv

Optimal Dynamic Orchestration in NDN-based Computing Networks

Named Data Networking (NDN) offers promising advantages in deploying next-generation service applications over distributed computing networks. We consider the problem of dynamic orchestration over a NDN-based computing network, in which nodes can be equipped with communication, computation, and data producing resources. Given a set of services with function-chaining structures, we address the design of distributed online algorithm that controls each node to make adaptive decisions on flowing service requests, committing function implementations, and/or producing data. We design a Service Discovery Assisted Dynamic Orchestration (SDADO) algorithm that reduces the end-to-end (E2E) delay of delivering the services, while providing optimal throughput performance. The proposed algorithm hybrids queuing-based flexibility and topology-based discipline, where the topological information is not pre-available but obtained through our proposed service discovery mechanism. We provide throughput-optimality analysis for SDADO, and then provide numerical results that confirm our analysis and demonstrates reduced round-trip E2E delay.

preprint2022arXiv

PKUSEG: A Toolkit for Multi-Domain Chinese Word Segmentation

Chinese word segmentation (CWS) is a fundamental step of Chinese natural language processing. In this paper, we build a new toolkit, named PKUSEG, for multi-domain word segmentation. Unlike existing single-model toolkits, PKUSEG targets multi-domain word segmentation and provides separate models for different domains, such as web, medicine, and tourism. Besides, due to the lack of labeled data in many domains, we propose a domain adaptation paradigm to introduce cross-domain semantic knowledge via a translation system. Through this method, we generate synthetic data using a large amount of unlabeled data in the target domain and then obtain a word segmentation model for the target domain. We also further refine the performance of the default model with the help of synthetic data. Experiments show that PKUSEG achieves high performance on multiple domains. The new toolkit also supports POS tagging and model training to adapt to various application scenarios. The toolkit is now freely and publicly available for the usage of research and industry.

preprint2022arXiv

Probing Electronic States in Monolayer Semiconductors through Static and Transient Third-Harmonic Spectroscopy

Electronic states and their dynamics are of critical importance for electronic and optoelectronic applications. Here, we probe various relevant electronic states in monolayer MoS2, such as multiple excitonic Rydberg states and free-particle energy bands, with a high relative contrast of up to >200 via broadband (from ~1.79 to 3.10 eV) static third-harmonic spectroscopy, which is further supported by theoretical calculations. Moreover, we introduce transient third-harmonic spectroscopy to demonstrate that third-harmonic generation can be all-optically modulated with a modulation depth exceeding ~94% at ~2.18 eV, providing direct evidence of dominant carrier relaxation processes, associated with carrier-exciton and carrier-phonon interactions. Our results indicate that static and transient third-harmonic spectroscopies are not only promising techniques for the characterization of monolayer semiconductors and their heterostructures, but also a potential platform for disruptive photonic and optoelectronic applications, including all-optical modulation and imaging.

preprint2022arXiv

Research on Multi-Objective Planning of Electric Vehicle Charging Stations Considering the Condition of Urban Traffic Network

As an important supporting facility for electric vehicles, the reasonable planning and layout of charging stations are of great significance to the development of electric vehicles. However, the planning and layout of charging stations is affected by various complex factors such as policy economy, charging demand, user charging comfort, and road traffic conditions. How to weigh various factors to construct a reasonable model of charging station location and capacity has become a major difficulty in the field of electric vehicle charging facility planning. Firstly, this paper constructs the location and capacity optimization model of the charging station with the goal of maximizing the revenue of operators and minimizing the user&#39;s charging additional cost. At the same time, the road time-consuming index is introduced to quantify the impact of road congestion on the user&#39;s charging additional cost, so as to effectively improve the user&#39;s satisfaction during charging. Then, aiming at the charging station planning model, a non-dominated sorting genetic algorithm with an elite strategy (NSGA-II) based on chaos initialization and arithmetic crossover operator is proposed. Finally, taking the Haidian District of Beijing as the simulation object, the results show that compared with the situation of urban traffic networks not considered, the model proposed in this paper significantly reduces the cost of lost time of users by 11.4% and the total additional cost of users&#39; charging by 7.6%. It not only ensures the economy of the system, but also effectively improves the charging satisfaction of users, which further verifies the feasibility and effectiveness of the model, and can provide a reference for the planning and layout of charging stations in the future.

preprint2022arXiv

Robust Speaker Extraction Network Based on Iterative Refined Adaptation

Speaker extraction aims to extract target speech signal from a multi-talker environment with interference speakers and surrounding noise, given the target speaker&#39;s reference information. Most speaker extraction systems achieve satisfactory performance on the premise that the test speakers have been encountered during training time. Such systems suffer from performance degradation given unseen target speakers and/or mismatched reference voiceprint information. In this paper we propose a novel strategy named Iterative Refined Adaptation (IRA) to improve the robustness and generalization capability of speaker extraction systems in the aforementioned scenarios. Given an initial speaker embedding encoded by an auxiliary network, the extraction network can obtain a latent representation of the target speaker, which is fed back to the auxiliary network to get a refined embedding to provide more accurate guidance for the extraction network. Experiments on WSJ0-2mix-extr and WHAM! dataset confirm the superior performance of the proposed method over the network without IRA in terms of SI-SDR and PESQ improvement.

preprint2022arXiv

SALIENCE: An Unsupervised User Adaptation Model for Multiple Wearable Sensors Based Human Activity Recognition

Unsupervised user adaptation aligns the feature distributions of the data from training users and the new user, so a well-trained wearable human activity recognition (WHAR) model can be well adapted to the new user. With the development of wearable sensors, multiple wearable sensors based WHAR is gaining more and more attention. In order to address the challenge that the transferabilities of different sensors are different, we propose SALIENCE (unsupervised user adaptation model for multiple wearable sensors based human activity recognition) model. It aligns the data of each sensor separately to achieve local alignment, while uniformly aligning the data of all sensors to ensure global alignment. In addition, an attention mechanism is proposed to focus the activity classifier of SALIENCE on the sensors with strong feature discrimination and well distribution alignment. Experiments are conducted on two public WHAR datasets, and the experimental results show that our model can yield a competitive performance.

preprint2022arXiv

Semantic Parsing in Task-Oriented Dialog with Recursive Insertion-based Encoder

We introduce a Recursive INsertion-based Encoder (RINE), a novel approach for semantic parsing in task-oriented dialog. Our model consists of an encoder network that incrementally builds the semantic parse tree by predicting the non-terminal label and its positions in the linearized tree. At the generation time, the model constructs the semantic parse tree by recursively inserting the predicted non-terminal labels at the predicted positions until termination. RINE achieves state-of-the-art exact match accuracy on low- and high-resource versions of the conversational semantic parsing benchmark TOP (Gupta et al., 2018; Chen et al., 2020), outperforming strong sequence-to-sequence models and transition-based parsers. We also show that our model design is applicable to nested named entity recognition task, where it performs on par with state-of-the-art approach designed for that task. Finally, we demonstrate that our approach is 2-3.5 times faster than the sequence-to-sequence model at inference time.

preprint2022arXiv

SemanticAxis: Exploring Multi-attribute Data by Semantics Construction and Ranking Analysis

Mining the distribution of features and sorting items by combined attributes are two common tasks in exploring and understanding multi-attribute (or multivariate) data. Up to now, few have pointed out the possibility of merging these two tasks into a united exploration context and the potential benefits of doing so. In this paper, we present SemanticAxis, a technique that achieves this goal by enabling analysts to build a semantic vector in two-dimensional space interactively. Essentially, the semantic vector is a linear combination of the original attributes. It can be used to represent and explain abstract concepts implied in local (outliers, clusters) or global (general pattern) features of reduced space, as well as serving as a ranking metric for its defined concepts. In order to validate the significance of combining the above two tasks in multi-attribute data analysis, we design and implement a visual analysis system, in which several interactive components cooperate with SemanticAxis seamlessly and expand its capacity to handle complex scenarios. We prove the effectiveness of our system and the SemanticAxis technique via two practical cases.

preprint2022arXiv

Simulating COVID19 Transmission From Observed Movement: An Agent-Based Model of Classroom Dispersion

Current models of COVID-19 transmission predict infection from reported or assumed interactions. Here we leverage high-resolution observations of interaction to simulate infectious processes. Ultra-Wide Radio Frequency Identification (RFID) systems were employed to track the real-time physical movements and directional orientation of children and their teachers in 4 preschool classes over a total of 34 observations. An agent-based transmission model combined observed interaction patterns (individual distance and orientation) with CDC-published risk guidelines to estimate the transmission impact of an infected patient zero attending class on the proportion of overall infections, the average transmission rate, and the time lag to the appearance of symptomatic individuals. These metrics highlighted the prophylactic role of decreased classroom density and teacher vaccinations. Reduction of classroom density to half capacity was associated with an 18.2% drop in overall infection proportion while teacher vaccination receipt was associated with a 25.3%drop. Simulation results of classroom transmission dynamics may inform public policy in the face of COVID-19 and similar infectious threats.

preprint2022arXiv

SOUL-Net: A Sparse and Low-Rank Unrolling Network for Spectral CT Image Reconstruction

Spectral computed tomography (CT) is an emerging technology, that generates a multienergy attenuation map for the interior of an object and extends the traditional image volume into a 4D form. Compared with traditional CT based on energy-integrating detectors, spectral CT can make full use of spectral information, resulting in high resolution and providing accurate material quantification. Numerous model-based iterative reconstruction methods have been proposed for spectral CT reconstruction. However, these methods usually suffer from difficulties such as laborious parameter selection and expensive computational costs. In addition, due to the image similarity of different energy bins, spectral CT usually implies a strong low-rank prior, which has been widely adopted in current iterative reconstruction models. Singular value thresholding (SVT) is an effective algorithm to solve the low-rank constrained model. However, the SVT method requires manual selection of thresholds, which may lead to suboptimal results. To relieve these problems, in this paper, we propose a Sparse and lOw-rank UnroLling Network for spectral CT image reconstruction (SOUL-Net), that learns the parameters and thresholds in a data-driven manner. Furthermore, a Taylor expansion-based neural network backpropagation method is introduced to improve the numerical stability. The qualitative and quantitative results demonstrate that the proposed method outperforms several representative state-of-the-art algorithms in terms of detail preservation and artifact reduction.

preprint2022arXiv

Stepping beyond your comfort zone: Diffusion-based network analytics for knowledge trajectory recommendation

Interest in tracing the research interests of scientific researchers is rising, and particularly that of predicting a researcher&#39;s knowledge trajectories beyond their current foci into potential inter-/cross-/multi-disciplinary interactions. Hence, in this study, we present a method of diffusion-based network analytics for knowledge trajectory recommendation. The method begins by constructing a heterogeneous bibliometric network consisting of a co-topic layer and a co-authorship layer. A novel link prediction approach with a diffusion strategy is then used to reflect real-world academic activity, such as knowledge sharing between co-authors or diffusing between similar research topics. This strategy differentiates the interactions occurring between homogeneous and heterogeneous nodes and weights the strengths of these interactions. Two sets of experiments - one with a local dataset and another with a global dataset - demonstrate that the proposed method is prior to selected baselines. In addition, to further examine the reliability of our method, we conducted a case study on recommending knowledge trajectories of selected information scientists and their research groups. The results demonstrate the empirical insights our method yields for individual researchers, communities, and research institutions in the information science discipline.

preprint2022arXiv

Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition

Multi-modal Multi-label Emotion Recognition (MMER) aims to identify various human emotions from heterogeneous visual, audio and text modalities. Previous methods mainly focus on projecting multiple modalities into a common latent space and learning an identical representation for all labels, which neglects the diversity of each modality and fails to capture richer semantic information for each label from different perspectives. Besides, associated relationships of modalities and labels have not been fully exploited. In this paper, we propose versaTile multi-modAl learning for multI-labeL emOtion Recognition (TAILOR), aiming to refine multi-modal representations and enhance discriminative capacity of each label. Specifically, we design an adversarial multi-modal refinement module to sufficiently explore the commonality among different modalities and strengthen the diversity of each modality. To further exploit label-modal dependence, we devise a BERT-like cross-modal encoder to gradually fuse private and common modality representations in a granularity descent way, as well as a label-guided decoder to adaptively generate a tailored representation for each label with the guidance of label semantics. In addition, we conduct experiments on the benchmark MMER dataset CMU-MOSEI in both aligned and unaligned settings, which demonstrate the superiority of TAILOR over the state-of-the-arts. Code is available at https://github.com/kniter1/TAILOR.

preprint2022arXiv

Team Power Dynamics and Team Impact: New Perspectives on Scientific Collaboration using Career Age as a Proxy for Team Power

Power dynamics influence every aspect of scientific collaboration. Team power dynamics can be measured by team power level and team power hierarchy. Team power level is conceptualized as the average level of the possession of resources, expertise, or decision-making authorities of a team. Team power hierarchy represents the vertical differences of the possessions of resources in a team. In Science of Science, few studies have looked at scientific collaboration from the perspective of team power dynamics. This research examines how team power dynamics affect team impact to fill the research gap. In this research, all co-authors of one publication are treated as one team. Team power level and team power hierarchy of one team are measured by the mean and Gini index of career age of co-authors in this team. Team impact is quantified by citations of a paper authored by this team. By analyzing over 7.7 million teams from Science (e.g., Computer Science, Physics), Social Sciences (e.g., Sociology, Library & Information Science), and Arts & Humanities (e.g., Art), we find that flat team structure is associated with higher team impact, especially when teams have high team power level. These findings have been repeated in all five disciplines except Art, and are consistent in various types of teams from Computer Science including teams from industry or academia, teams with different gender groups, teams with geographical contrast, and teams with distinct size.

preprint2022arXiv

The Large High Altitude Air Shower Observatory (LHAASO) Science Book (2021 Edition)

Since the science white paper of the Large High Altitude Air Shower Observatory (LHAASO) published on arXiv in 2019 [e-Print: 1905.02773 (astro-ph.HE)], LHAASO has completed the transition from a project to an operational gamma-ray astronomical observatory LHAASO is a new generation multi-component facility located in Daocheng, Sichuan province of China, at an altitude of 4410 meters. It aims at measuring with unprecedented sensitivity the spectrum, composition, and anisotropy of cosmic rays in the energy range between 10$^{12}$ and 10$^{18}$~eV, and acting simultaneously as a wide aperture (one stereoradiant) continuously operating gamma-ray telescope in the energy range between 10$^{11}$ and $10^{15}$~eV with the designed sensitivity of 1.3\% of the Crab Unit (CU) above 100 TeV. LHAASO&#39;s capability of measuring simultaneously different shower components (electrons, muons, and Cherenkov/fluorescence light), will allow it to investigate the origin, acceleration, and propagation of CR through measurement of the energy spectrum, elemental composition, and anisotropy with unprecedented resolution. The remarkable sensitivity of LHAASO will play a key role in CR physics and gamma-ray astronomy for a general and comprehensive exploration of the high energy universe and will allow important studies of fundamental physics (such as indirect dark matter search, Lorentz invariance violation, quantum gravity) and solar and heliospheric physics. The LHAASO Collaboration organized an editorial working group and finished all editorial work of this science book, to summarize the instrumental features and outline the prospects of scientific researches with the LHAASO experiment.

preprint2022arXiv

The ratio $\mathcal{R}(D_s)$ for $B_s \to D_s \ellν_\ell$ by using the QCD light-cone sum rules within the framework of heavy quark effective field theory

In the paper, we study the $B_s\to D_s$ transition form factors by using the light-cone sum rules within the framework of heavy quark effective field theory. We adopt a chiral current correlation function to do the calculation, the resultant transition form factors $f_+^{B_s\to D_s}(q^2)$ and $f_0^{B_s\to D_s}(q^2)$ are dominated by the contribution of $D_s$-meson leading-twist distribution amplitude, while the contributions from less certain $D_s$-meson twist-3 distribution amplitudes are greatly suppressed. At the largest recoil point, we obtain $f_{+,0}^{B_s \to D_s}(0)=0.533^{+0.112}_{-0.094}$. By further extrapolating the transition form factors into all the physically allowable $q^2$ region with the help of the $z$-series parametrization approach, we calculate the branching fractions $\mathcal{B}(B_s \to D_s \ell^\prime ν_{\ell^\prime})$ with $(\ell^\prime= e,μ)$ and $\mathcal{B}(B_s \to D_s τν_τ)$, which gives $\mathcal{R}(D_s)=0.334\pm 0.017$.

preprint2022arXiv

Tunable Polarization-Multiplexed Achromatic Dielectric Metalens

Tunable metasurfaces provide a compact and efficient strategy for optical components that require active wavefront shaping. Varifocal metalens is one of the most important applications. However, the existing tunable metalens rarely serves broadband wavelengths restricting their applications in broadband imaging and color display due to chromatic aberration. Herein, we demonstrate an electrically tunable polarization-multiplexed varifocal achromatic dielectric metalens integrated with twisted nematic liquid crystals (TNLCs) in the visible region. The phase profiles at different wavelengths under two orthogonal polarization channels are customized by the particle swarm optimization algorithm and optimally matched with the metaunits database to achieve polarization-multiplexed dispersion manipulation including achromatic performance. By combining the broadband linear polarization conversion ability of TNLC, the tunability of varifocal achromatic metalens is realized by applying different voltages. Further, the electrically tunable customized dispersion-manipulated metalens and switchable color metaholograms are demonstrated. The proposed devices will accelerate the application of metasurfaces in broadband zoom imaging, AR/VR displays, and spectral detection.

preprint2022arXiv

Two-Stage Fine-Tuning: A Novel Strategy for Learning Class-Imbalanced Data

Classification on long-tailed distributed data is a challenging problem, which suffers from serious class-imbalance and hence poor performance on tail classes with only a few samples. Owing to this paucity of samples, learning on the tail classes is especially challenging for the fine-tuning when transferring a pretrained model to a downstream task. In this work, we present a simple modification of standard fine-tuning to cope with these challenges. Specifically, we propose a two-stage fine-tuning: we first fine-tune the final layer of the pretrained model with class-balanced reweighting loss, and then we perform the standard fine-tuning. Our modification has several benefits: (1) it leverages pretrained representations by only fine-tuning a small portion of the model parameters while keeping the rest untouched; (2) it allows the model to learn an initial representation of the specific task; and importantly (3) it protects the learning of tail classes from being at a disadvantage during the model updating. We conduct extensive experiments on synthetic datasets of both two-class and multi-class tasks of text classification as well as a real-world application to ADME (i.e., absorption, distribution, metabolism, and excretion) semantic labeling. The experimental results show that the proposed two-stage fine-tuning outperforms both fine-tuning with conventional loss and fine-tuning with a reweighting loss on the above datasets.

preprint2022arXiv

Ultra-wideband Antireflection Assisted by Continuously Varying Temporal Medium

We demonstrate that reflectionless propagation of electromagnetic waves between two different materials can be achieved by designing an intermediate temporal medium, which can work in an ultra-wide frequency band. Such a temporal medium is designed with consideration of a multi-stage variation of the material&#39; s permittivity in the time domain. The multi-stage temporal permittivity is formed by a cascaded quarter-wave temporal coating, which is an extension of the antireflection temporal coating by Pacheco-Peña et al [[1] Optica 7, 323 (2020)]. The strategy to render ultra-wideband antireflection temporal medium is discussed analytically and verified numerically. In-depth analysis shows that the multi-stage design of the temporal media implies a continuously temporal variation of the material&#39; s constitutive parameters, thus an ultra-wideband antireflection temporal medium is reasonably obtained. As an illustrative example for application, the proposed temporal medium is adopted to realize impedance matching between a dielectric slab and free space, which validates our new findings.

preprint2022arXiv

Vision-based Uneven BEV Representation Learning with Polar Rasterization and Surface Estimation

In this work, we propose PolarBEV for vision-based uneven BEV representation learning. To adapt to the foreshortening effect of camera imaging, we rasterize the BEV space both angularly and radially, and introduce polar embedding decomposition to model the associations among polar grids. Polar grids are rearranged to an array-like regular representation for efficient processing. Besides, to determine the 2D-to-3D correspondence, we iteratively update the BEV surface based on a hypothetical plane, and adopt height-based feature transformation. PolarBEV keeps real-time inference speed on a single 2080Ti GPU, and outperforms other methods for both BEV semantic segmentation and BEV instance segmentation. Thorough ablations are presented to validate the design. The code will be released at \url{https://github.com/SuperZ-Liu/PolarBEV}.

preprint2022arXiv

Weighted infinitesimal bialgebras

As an algebraic meaning of the nonhomogenous associative Yang-Baxter equation, weighted infinitesimal bialgebras play an important role in mathematics and mathematical physics. In this paper, we introduce the concept of weighted infinitesimal Hopf modules and show that any module carries a natural structure of weighted infinitesimal unitary Hopf module over a weighted quasitriangular infinitesimal unitary bialgebra. We decorate planar rooted forests in a new way, and prove that the space of rooted forests, together with a coproduct and a family of grafting operations, is the free $Ω$-cocycle infinitesimal unitary bialgebra of weight zero on a set. A combinatorial description of the coproduct is given. As applications, we obtain the initial object in the category of cocycle infinitesimal unitary bialgebras on undecorated planar rooted forests, which is the object studied in the (noncommutative) Connes-Kreimer Hopf algebra. Finally, we derive two pre-Lie algebras from an arbitrary weighted infinitesimal bialgebra and weighted commutative infinitesimal bialgebra, respectively. The second construction generalizes the Gel&#39;fand-Dorfman Theorem on Novikov algebras.

preprint2021arXiv

A mass-, kinetic energy- and helicity-conserving mimetic dual-field discretization for three-dimensional incompressible Navier-Stokes equations, part I: Periodic domains

We introduce a mimetic dual-field discretization which conserves mass, kinetic energy and helicity for three-dimensional incompressible Navier-Stokes equations. The discretization makes use of a conservative dual-field mixed weak formulation where two evolution equations of velocity are employed and dual representations of the solution are sought for each variable. A temporal discretization, which staggers the evolution equations and handles the nonlinearity such that the resulting discrete algebraic systems are linear and decoupled, is constructed. The spatial discretization is mimetic in the sense that the finite dimensional function spaces form a discrete de Rham complex. Conservation of mass, kinetic energy and helicity in the absence of dissipative terms is proven at the discrete level. Proper dissipation rates of kinetic energy and helicity in the viscous case is also proven. Numerical tests supporting the method are provided.

preprint2021arXiv

An Intelligent Multi-Speed Advisory System using Improved Whale Optimisation Algorithm

An intelligent speed advisory system can be used to recommend speed for vehicles travelling in a given road network in cities. In this paper, we extend our previous work where a distributed speed advisory system has been devised to recommend an optimal consensus speed for a fleet of Internal Combustion Engine Vehicles (ICEVs) in a highway scenario. In particular, we propose a novel optimisation framework where the exact format of each vehicle&#39;s cost function can be implicit, and our algorithm can be used to recommend multiple consensus speeds for vehicles travelling on different lanes in an urban highway scenario. Our studies show that the proposed scheme based on an improved whale optimisation algorithm can effectively reduce CO2 emission generated from ICEVs while providing different recommended speed options for groups of vehicles.

preprint2021arXiv

ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems

In this paper, a multilingual end-to-end framework, called as ATCSpeechNet, is proposed to tackle the issue of translating communication speech into human-readable text in air traffic control (ATC) systems. In the proposed framework, we focus on integrating the multilingual automatic speech recognition (ASR) into one model, in which an end-to-end paradigm is developed to convert speech waveform into text directly, without any feature engineering or lexicon. In order to make up for the deficiency of the handcrafted feature engineering caused by ATC challenges, a speech representation learning (SRL) network is proposed to capture robust and discriminative speech representations from the raw wave. The self-supervised training strategy is adopted to optimize the SRL network from unlabeled data, and further to predict the speech features, i.e., wave-to-feature. An end-to-end architecture is improved to complete the ASR task, in which a grapheme-based modeling unit is applied to address the multilingual ASR issue. Facing the problem of small transcribed samples in the ATC domain, an unsupervised approach with mask prediction is applied to pre-train the backbone network of the ASR model on unlabeled data by a feature-to-feature process. Finally, by integrating the SRL with ASR, an end-to-end multilingual ASR framework is formulated in a supervised manner, which is able to translate the raw wave into text in one model, i.e., wave-to-text. Experimental results on the ATCSpeech corpus demonstrate that the proposed approach achieves a high performance with a very small labeled corpus and less resource consumption, only 4.20% label error rate on the 58-hour transcribed corpus. Compared to the baseline model, the proposed approach obtains over 100% relative performance improvement which can be further enhanced with the increasing of the size of the transcribed samples.

preprint2021arXiv

Atomic line defects and topological superconductivity in unconventional superconductors

Topological superconductors (TSCs) are correlated quantum states with simultaneous off-diagonal long-range order and nontrivial topological invariants. They produce gapless or zero energy boundary excitations, including Majorana zero modes and chiral Majorana edge states with topologically protected phase coherence essential for fault-tolerant quantum computing. Candidate TSCs are very rare in nature. Here, we propose a novel route toward emergent quasi-one-dimensional (1D) TSCs in naturally embedded quantum structures such as atomic line defects in unconventional spin-singlet $s$-wave and $d$-wave superconductors. We show that inversion symmetry breaking and charge transfer due to the missing atoms lead to the occupation of incipient impurity bands and mixed parity spin singlet and triplet Cooper pairing of neighboring electrons traversing the line defect. Nontrivial topological invariants arise and occupy a large part of the parameter space, including the time reversal symmetry breaking Zeeman coupling due to applied magnetic field or defect-induced magnetism, creating TSCs in different topological classes with robust Majorana zero modes at both ends of the line defect. Beyond providing a novel mechanism for the recent discovery of zero-energy bound states at both ends of an atomic line defect in monolayer Fe(Te,Se) superconductors, the findings pave the way for new material realizations of the simplest and most robust 1D TSCs using embedded quantum structures in unconventional superconductors with large pairing energy gaps and high transition temperatures.

preprint2021arXiv

Cross sections for the reactions $e^+e^-\rightarrow K^+K^-π^+π^-(π^0)$, $K^+K^-K^+K^-(π^0)$, $π^+π^-π^+π^-(π^0)$, $p\bar{p}π^+π^-(π^0)$ in the energy region between 3.773 and 4.600 GeV

Using the data samples collected in the energy range from 3.773 to 4.600 GeV with the BESIII detector at the BEPCII collider, we measure the dressed cross sections as a function of center-of-mass energy for $e^+e^-\rightarrow K^+K^-π^+π^-(π^0)$, $K^+K^-K^+K^-(π^0)$, $π^+π^-π^+π^-(π^0)$, and $p\bar{p}π^+π^-(π^0)$. The cross sections for $e^+e^-\rightarrow K^+K^-K^+K^-π^0$, $p\bar{p}π^+π^-(π^0)$ are the first measurements. Cross sections for the other five channels are much more precise than previous results in this energy region. We also search for charmonium and charmonium-like resonances, such as the $Y(4230)$, decaying into the same final states. We find evidence of the $ψ(4040)$ decaying to $π^+π^-π^+π^-π^0$ with a statistical significance of $3.6σ$. Upper limits are provided for other decays since no clear signals are observed.

preprint2021arXiv

DU-GAN: Generative Adversarial Networks with Dual-Domain U-Net Based Discriminators for Low-Dose CT Denoising

LDCT has drawn major attention in the medical imaging field due to the potential health risks of CT-associated X-ray radiation to patients. Reducing the radiation dose, however, decreases the quality of the reconstructed images, which consequently compromises the diagnostic performance. Various deep learning techniques have been introduced to improve the image quality of LDCT images through denoising. GANs-based denoising methods usually leverage an additional classification network, i.e. discriminator, to learn the most discriminate difference between the denoised and normal-dose images and, hence, regularize the denoising model accordingly; it often focuses either on the global structure or local details. To better regularize the LDCT denoising model, this paper proposes a novel method, termed DU-GAN, which leverages U-Net based discriminators in the GANs framework to learn both global and local difference between the denoised and normal-dose images in both image and gradient domains. The merit of such a U-Net based discriminator is that it can not only provide the per-pixel feedback to the denoising network through the outputs of the U-Net but also focus on the global structure in a semantic level through the middle layer of the U-Net. In addition to the adversarial training in the image domain, we also apply another U-Net based discriminator in the image gradient domain to alleviate the artifacts caused by photon starvation and enhance the edge of the denoised CT images. Furthermore, the CutMix technique enables the per-pixel outputs of the U-Net based discriminator to provide radiologists with a confidence map to visualize the uncertainty of the denoised results, facilitating the LDCT-based screening and diagnosis. Extensive experiments on the simulated and real-world datasets demonstrate superior performance over recently published methods both qualitatively and quantitatively.

preprint2021arXiv

Dual camera snapshot hyperspectral imaging system via physics informed learning

We consider using the system&#39;s optical imaging process with convolutional neural networks (CNNs) to solve the snapshot hyperspectral imaging reconstruction problem, which uses a dual-camera system to capture the three-dimensional hyperspectral images (HSIs) in a compressed way. Various methods using CNNs have been developed in recent years to reconstruct HSIs, but most of the supervised deep learning methods aimed to fit a brute-force mapping relationship between the captured compressed image and standard HSIs. Thus, the learned mapping would be invalid when the observation data deviate from the training data. Especially, we usually don&#39;t have ground truth in real-life scenarios. In this paper, we present a self-supervised dual-camera equipment with an untrained physics-informed CNNs framework. Extensive simulation and experimental results show that our method without training can be adapted to a wide imaging environment with good performance. Furthermore, compared with the training-based methods, our system can be constantly fine-tuned and self-improved in real-life scenarios.

preprint2021arXiv

Emergent channel over a pair of pockets in strong density waves

Different channels over which electrons scatter between parts of the Fermi surface are the key to various electronic quantum matters, such as superconductivity and density waves. We consider an effective model in higher dimensions where each of the two pockets in the original model maps to (the Landau levels of) two Dirac fermions. We discover an emergent channel when two Dirac fermions from different pairs annihilate, where the presence of a strong density wave is essential. We support our analysis with numerical calculations on model examples in the vicinity of ferromagnetic and antiferromagnetic orders. We also discuss interesting consequences on electron interaction channels that beyond-mean-field fluctuations may induce.

preprint2021arXiv

Exploring Lexical Irregularities in Hypothesis-Only Models of Natural Language Inference

Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE) is the task of predicting the entailment relation between a pair of sentences (premise and hypothesis). This task has been described as a valuable testing ground for the development of semantic representations, and is a key component in natural language understanding evaluation benchmarks. Models that understand entailment should encode both, the premise and the hypothesis. However, experiments by Poliak et al. revealed a strong preference of these models towards patterns observed only in the hypothesis, based on a 10 dataset comparison. Their results indicated the existence of statistical irregularities present in the hypothesis that bias the model into performing competitively with the state of the art. While recast datasets provide large scale generation of NLI instances due to minimal human intervention, the papers that generate them do not provide fine-grained analysis of the potential statistical patterns that can bias NLI models. In this work, we analyze hypothesis-only models trained on one of the recast datasets provided in Poliak et al. for word-level patterns. Our results indicate the existence of potential lexical biases that could contribute to inflating the model performance.

preprint2021arXiv

Gapless excitations inside the fully gapped kagome superconductors AV$_3$Sb$_5$

The superconducting gap structures in the transition-metal-based kagome metal AV$_3$Sb$_5$ (A=K,Rb,Cs), the first family of quasi-two-dimensional kagome superconductors, remain elusive as there is strong experimental evidence for both nodal and nodaless gap structures. Here we show that the dichotomy can be resolved because of the coexistence of time-reversal symmetry breaking with a conventional fully gapped superconductivity. The symmetry protects the edge states which arise on the domains of the lattice symmetry breaking order to remain gapless in proximity to a conventional pairing. We demonstrate this result in a four-band tight-binding model using the V $d_{X^2-Y^2}$-like and the in-plane Sb $p_z$-like Wannier functions that can faithfully capture the main feature of the materials near the Fermi level.

preprint2021arXiv

Measurement of cross-section for $e^+e^-\toΞ^-\barΞ^+$ near threshold at BESIII

The Born cross-sections and effective form factors for process $e^+e^-\toΞ^-\barΞ^+$ are measured at eight center-of-mass energies between 2.644 and 3.080 GeV, using a total integrated luminosity of 363.9 pb$^{-1}$ $e^+e^-$ collision data collected with the BESIII detector at BEPCII. After performing a fit to the Born cross-section of $e^+e^-\toΞ^-\barΞ^+$, no significant threshold effect is observed.

preprint2021arXiv

Measurement of the $e^{+}e^{-}\toΣ^{0}\barΣ^{0}$ cross sections at center-of-mass energies from $2.3864$ to $3.0200$ GeV

The Born cross sections of $e^{+}e^{-}\to Σ^{0}\barΣ^{0}$ are measured at center-of-mass energies from $2.3864$ to $3.0200$ GeV using data samples with an integrated luminosity of $328.5$ pb$^{-1}$ collected with the BESIII detector operating at the BEPCII collider. The analysis makes use of a novel reconstruction method for energies near production threshold, while a single-tag method is employed at other center-of-mass energies. The measured cross sections are consistent with earlier results from BaBar, with a substantially improved precision. The cross-section lineshape can be well described by a perturbative QCD-driven energy function. In addition, the effective form factors of the $Σ^{0}$ baryon are determined. The results provide precise experimental input for testing various theoretical predictions.

preprint2021arXiv

Measurements of $e^+e^-\rightarrow η_{\rm c}π^+ π^-π^0$, $η_{\rm c}π^+ π^-$ and $η_{\rm c}π^0γ$ at $\sqrt{s}$ from 4.18 to 4.60\,GeV, and search for a $Z_{\rm c}$ state close to the $D\bar{D}$ threshold decaying to $η_{\rm c}π$ at $\sqrt{s}$ = 4.23 GeV

We study $η_{\rm c}$ production at center-of-mass energies $\sqrt{s}$ from 4.18 to 4.60 GeV in $e^+e^-$ annihilation data collected with the BESIII detector operating at the BEPCII storage ring, corresponding to 7.3 fb$^{-1}$ of integrated luminosity. We measure the cross sections of the three different exclusive reactions $e^+e^-\rightarrow η_{\rm c}π^+ π^-π^0$, $e^+e^- \rightarrow η_{\rm c}π^+ π^-$, and $e^+e^- \rightarrow η_{\rm c}π^0γ$. We find significant $η_{\rm c}$ production in $e^+e^-\rightarrow η_{\rm c}π^+ π^-π^0$ at $\sqrt{s}$ of 4.23 GeV and 4.26 GeV and observe a significant energy-dependent Born cross section that we measure to be consistent with the production via the intermediate $Y(4260)$ resonance. In addition, we perform a search for a charmonium-like $Z_{\rm c}$ state close to the $D\bar{D}$ threshold that decays to $η_{\rm c}π$, involving ground state charmonium, and observe no signal. Corresponding upper limits on the cross section of $η_{\rm c}$ and $Z_{\rm c}$ production are provided, where the yields are not found to be significant.

preprint2021arXiv

Model independent determination of the spin of the $Ω^{-}$ and its polarization alignment in $ψ(3686)\rightarrowΩ^{-}\barΩ^{+}$

We present an analysis of the process $ψ(3686) \to Ω^- \barΩ^+$ ($Ω^-\to K^-Λ$, $\barΩ^+\to K^+\barΛ$, $Λ\to pπ^-$, $\barΛ\to \bar{p}π^+$) based on a data set of $448\times 10^6$ $ψ(3686)$ decays collected with the BESIII detector at the BEPCII electron-positron collider. The helicity amplitudes for the process $ψ(3686) \to Ω^- \barΩ^+$ and the decay parameters of the subsequent decay $Ω^-\to K^-Λ$ $(\barΩ^+\to K^+\barΛ)$ are measured for the first time by a fit to the angular distribution of the complete decay chain. The branching fraction of $ψ(3686) \to Ω^- \barΩ^+$ is measured to be $(5.82\pm 0.12\pm 0.24)\times 10^{-5}$, with an improved precision compared to previous measurements.

preprint2021arXiv

Prospect of Detecting X-Ray Halos Around Middle-Aged Pulsars with eROSITA

The detection of extended TeV $γ$-ray emission (dubbed &#34;TeV halos&#34;) from Geminga and Monogem pulsars by HAWC collaboration implies that the halo-like morphologies around middle-aged pulsars may be common. The $γ$-rays above 10 TeV are thought to arise from inverse Compton (IC) scattering of relativistic electrons/positrons in the pulsar halos off cosmic microwave background photons. In the meanwhile, these electrons and positrons can produce X-ray synchrotron emission in the interstellar magnetic field, resulting in a diffuse emission in the X-ray band (namely X-ray halos). Here, we study the prospect of detecting X-ray halos with eROSITA from 10 middle-aged pulsars with characteristic age larger than tens of thousands of years in the ATNF pulsar catalog. Assuming a benchmark value (i.e., $B = 3 \rm \, μG$) for the magnetic field, most of the X-ray halos are found to be bright enough to be detectable by eROSITA in the energy range of 0.5-2 keV during its four-year all-sky survey. Among these pulsar halos, three are supposed to produce X-ray fluxes above the eROSITA sensitivity of the first all-sky survey. Given the good angular resolution and the large field of view, eROSITA is expected to be able to measure the spatial distribution of the X-ray halos from sub-pc scale up to tens of pc scale. The intensity profiles of the X-ray halos are very useful to constrain the magnetic field and the energy-dependence of the diffusion coefficient in the pulsar halos.

preprint2021arXiv

Rip van Winkle&#39;s Razor: A Simple Estimate of Overfit to Test Data

Traditional statistics forbids use of test data (a.k.a. holdout data) during training. Dwork et al. 2015 pointed out that current practices in machine learning, whereby researchers build upon each other&#39;s models, copying hyperparameters and even computer code -- amounts to implicitly training on the test set. Thus error rate on test data may not reflect the true population error. This observation initiated {\em adaptive data analysis}, which provides evaluation mechanisms with guaranteed upper bounds on this difference. With statistical query (i.e. test accuracy) feedbacks, the best upper bound is fairly pessimistic: the deviation can hit a practically vacuous value if the number of models tested is quadratic in the size of the test set. In this work, we present a simple new estimate, {\em Rip van Winkle&#39;s Razor}. It relies upon a new notion of \textquotedblleft information content\textquotedblright\ of a model: the amount of information that would have to be provided to an expert referee who is intimately familiar with the field and relevant science/math, and who has been just been woken up after falling asleep at the moment of the creation of the test data (like \textquotedblleft Rip van Winkle\textquotedblright\ of the famous fairy tale). This notion of information content is used to provide an estimate of the above deviation which is shown to be non-vacuous in many modern settings.

preprint2021arXiv

Search for the $X(2370)$ and observation of $η_{c}\toηηη^\prime$ in $J/ψ\toγηηη^{\prime}$

Using a sample of $1.31\times10^{9} ~J/ψ$ events collected with the BESIII detector, we perform a study of $J/ψ\toγηηη^{\prime}$ to search for the $X(2370)$ and $η_{c}$ in the $ηηη^{\prime}$ invariant mass distribution. No significant signal for the $X(2370)$ is observed, and we set an upper limit for the product branching fraction of ${\cal B}(J/ψ\toγX(2370)\cdot{\cal B}(X(2370)\toηηη^{\prime}) < 9.2\times10^{-6}$ at the 90% confidence level. A clear $η_{c}$ signal is observed for the first time, yielding a product branching fraction of ${\cal B}(J/ψ\to γη_{c})\cdot{\cal B}(η_{c}\to ηηη^{\prime}) = (4.86\pm0.62~({\rm stat.})\pm0.45~({\rm sys.}))\times10^{-5}$.

preprint2021arXiv

Tactical Decision Making for Emergency Vehicles Based on A Combinational Learning Method

Increasing the response time of emergency vehicles(EVs) could lead to an immeasurable loss of property and life. On this account, tactical decision making for EVs&#39; microscopic control remains an indispensable issue to be improved. In this paper, a rule-based avoiding strategy(AS) is devised, that CVs in the prioritized zone ahead of EV should accelerate or change their lane to avoid it. Besides, a novel DQN method with speed-adaptive compact state space (SC-DQN) is put forward to fit in EVs&#39; high-speed feature and generalize in various road topologies. Afterward, the execution of AS feedback to the input of SC-DQN so that they joint organically as a combinational method. The following approach reveals that DRL could complement rule-based avoiding strategy in generalization, and on the contrary, the rule-based avoiding strategy could complement DRL in stability, and their combination could lead to less response time, lower collision rate and smoother trajectory.

preprint2021arXiv

Vibrational Spectroscopic Detection of Single Virus by a Confocal Interferometric Mid-Infrared Photothermal Microscope

We report a confocal interferometric mid-infrared photothermal (MIP) microscope and its application to label-free detection of biological nanoparticles down to single virus level. We apply the interferometric scattering principle to detect the weak photothermal effect induced by infrared absorption in a viral particle. We validated this method with two kinds of viruses, namely the vesicular stomatitis virus (VSV) and poxvirus. The single virus spectra generated by MIP microscopy show high consistency in the same group, with dominant peaks contributed by the amide I and amide II vibrations. The ratio of these two peaks are significantly different between VSV and poxvirus, highlighting the potential of using MIP microscopy for label-free differentiation of viral particles. This all-optical chemical imaging method opens a new way for spectroscopic detection of biological nanoparticles in a label-free manner and may facilitate in predicting and controlling outbreaks of emerging virus strains.

preprint2021arXiv

Weak phases and CP-symmetry tests in sequential decays of entangled double-strange baryons

Using a sample of $1.31\times10^9$ $J/ψ$ events collected with the BESIII detector at the electron-positron collider BEPCII, we analyse the full $J/ψ\to$ $Ξ^-\overlineΞ^+$, $Ξ^-\to Λπ^-$, $Λ\to pπ^-$, $\overlineΞ^+\to\overlineΛπ^+$, $\overlineΛ\to\overline{p}π^+$ decay chain. A new method, exploiting the fact that the $Ξ^-\overlineΞ^+$ pair is entangled and sequentially decaying, and where the complete decay chains are reconstructed, is applied for the first time. This enables precision measurements of the decay parameters for the $Ξ^-\toΛπ^-$ decay ($α_Ξ$, $ϕ_Ξ$) as well as the $\overlineΞ^+\to\overlineΛπ^+$ decay ($\overlineα_Ξ$, $\overlineϕ_Ξ$). From the decay parameters, two independent CP tests were performed, quantified by the observables $A_{\rm CP}^Ξ$ and $Δϕ_Ξ$. Our results, $A_{\rm CP}^Ξ$ = $(6.0\pm13.4\pm5.6)\times10^{-3}$ and $Δϕ_Ξ= (-4.8\pm13.7\pm2.9)\times10^{-3}~{\rm rad}$, are consistent with CP symmetry. Furthermore, our method enables a separation of strong and weak $Ξ\toΛπ$ decay amplitudes. This results in the first direct measurement of the weak phase difference for any baryon decay. The result is found to be $(ξ_{P} - ξ_{S}) = (1.2\pm3.4\pm0.8)\times10^{-2}$ rad and is one of the most precise tests of CP symmetry for strange baryons. The strong phase difference is measured to be $(δ_P - δ_S) = (-4.0\pm3.3\pm1.7)\times10^{-2}$ rad. In addition, we provide an independent measurement of the recently debated $Λ$ decay parameter, $α_Λ = 0.757 \pm 0.011 \pm 0.008 $. The $Λ\overlineΛ$ asymmetry is measured to be $A_{\rm CP}^Λ = (-3.7\pm11.7\pm9.0)\times10^{-3}$.

preprint2020arXiv

$Σ^{+}$ and $\barΣ^-$ polarization in the $J/ψ$ and $ψ(3686)$ decays

From $1310.6\times10^{6}$ $J/ψ$ and $448.1\times10^{6}$ $ψ(3686)$ events collected with the BESIII experiment, we report the first observation of $Σ^{+}$ and $\barΣ^{-}$ spin polarization in $e^+e^-\rightarrow J/ψ(ψ(3686)) \rightarrow Σ^{+} \barΣ^{-}$ decays. The relative phases of the form factors $ΔΦ$ have been measured to be $(-15.5\pm0.7\pm0.5)^{\circ}$ and $(21.7\pm4.0\pm0.8)^{\circ}$ with $J/ψ$ and $ψ(3686)$ data, respectively. The non-zero value of $ΔΦ$ allows for a direct and simultaneous measurement of the decay asymmetry parameters of $Σ^{+}\rightarrow p π^{0}~(α_0 = -0.998\pm0.037\pm0.009)$ and $\barΣ^{-}\rightarrow \bar{p} π^{0}~(\barα_0 = 0.990\pm0.037\pm0.011)$, the latter value being determined for the first time. The average decay asymmetry, $(α_{0} - \barα_{0})/2$, is calculated to be $-0.994\pm0.004\pm0.002$. The CP asymmetry $A_{\rm CP,Σ} = (α_0 + \barα_0)/(α_0 - \barα_0) = -0.004\pm0.037\pm0.010$ is extracted for the first time, and is found to be consistent with CP conservation.

preprint2020arXiv

A Fixation-based 360° Benchmark Dataset for Salient Object Detection

Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications. However, another issue within the field of visual saliency, salient object detection (SOD), has been seldom explored in 360° (or omnidirectional) images due to the lack of datasets representative of real scenes with pixel-level annotations. Toward this end, we collect 107 equirectangular panoramas with challenging scenes and multiple object classes. Based on the consistency between FP and explicit saliency judgements, we further manually annotate 1,165 salient objects over the collected images with precise masks under the guidance of real human eye fixation maps. Six state-of-the-art SOD models are then benchmarked on the proposed fixation-based 360° image dataset (F-360iSOD), by applying a multiple cubic projection-based fine-tuning method. Experimental results show a limitation of the current methods when used for SOD in panoramic images, which indicates the proposed dataset is challenging. Key issues for 360° SOD is also discussed. The proposed dataset is available at https://github.com/PanoAsh/F-360iSOD.

preprint2020arXiv

A Sample Complexity Separation between Non-Convex and Convex Meta-Learning

One popular trend in meta-learning is to learn from many training tasks a common initialization for a gradient-based method that can be used to solve a new task with few samples. The theory of meta-learning is still in its early stages, with several recent learning-theoretic analyses of methods such as Reptile [Nichol et al., 2018] being for convex models. This work shows that convex-case analysis might be insufficient to understand the success of meta-learning, and that even for non-convex models it is important to look inside the optimization black-box, specifically at properties of the optimization trajectory. We construct a simple meta-learning instance that captures the problem of one-dimensional subspace learning. For the convex formulation of linear regression on this instance, we show that the new task sample complexity of any initialization-based meta-learning algorithm is $Ω(d)$, where $d$ is the input dimension. In contrast, for the non-convex formulation of a two layer linear network on the same instance, we show that both Reptile and multi-task representation learning can have new task sample complexity of $\mathcal{O}(1)$, demonstrating a separation from convex meta-learning. Crucially, analyses of the training dynamics of these methods reveal that they can meta-learn the correct subspace onto which the data should be projected.

preprint2020arXiv

A useful technique for piecewise deterministic Markov decision processes

This paper presents with justifications a technique that is useful for the study of piecewise deterministic Markov decision processes (PDMDPs) with general policies and unbounded transition intensities. This technique produces an auxiliary PDMDP from the original one. As to be discussed and claified, the auxiliary PDMDP possesses certain desired properties, which may not be possessed by the original PDMDP. Moreover, the performance measure of any policy in the original PDMDP can be replicated by the auxiliary PDMDP for a large class of performance criteria. As an application, we apply this technique to risk-sensitive PDMDPs with total cost criteria.

preprint2020arXiv

Analysis of adaptive two-grid finite element algorithms for linear and nonlinear problems

This paper proposes some efficient and accurate adaptive two-grid (ATG) finite element algorithms for linear and nonlinear partial differential equations (PDEs). The main idea of these algorithms is to utilize the solutions on the $k$-th level adaptive meshes to find the solutions on the $(k+1)$-th level adaptive meshes which are constructed by performing adaptive element bisections on the $k$-th level adaptive meshes. These algorithms transform non-symmetric positive definite (non-SPD) PDEs (resp., nonlinear PDEs) into symmetric positive definite (SPD) PDEs (resp., linear PDEs). The proposed algorithms are both accurate and efficient due to the following advantages: they do not need to solve the non-symmetric or nonlinear systems; the degrees of freedom (d.o.f.) are very small; they are easily implemented; the interpolation errors are very small. Next, this paper constructs residue-type {\em a posteriori} error estimators, which are shown to be reliable and efficient. The key ingredient in proving the efficiency is to establish an upper bound of the oscillation terms, which may not be higher-order terms (h.o.t.) due to the low regularity of the numerical solution. Furthermore, the convergence of the algorithms is proved when bisection is used for the mesh refinements. Finally, numerical experiments are provided to verify the accuracy and efficiency of the ATG finite element algorithms, compared to regular adaptive finite element algorithms and two-grid finite element algorithms [27].

preprint2020arXiv

Analysis of the decay $D^0\rightarrow K_{S}^{0} K^{+} K^{-}$

Using a data sample of $2.93~fb^{-1}$ of $e^+e^-$ collisions collected at $\sqrt{s}=3.773 GeV$ in the BESIII experiment, we perform an analysis of the decay $D^0\rightarrow K_{S}^{0} K^{+} K^{-}$. The Dalitz plot is analyzed using $1856\pm 45$ flavor-tagged signal decays. We find that the Dalitz plot is well described by a set of six resonances: $a_0(980)^0$, $a_0(980)^+$, $ϕ(1020)$, $a_2(1320)^+$, $a_2(1320)^-$ and $a_0(1450)^-$. Their magnitudes, phases and fit fractions are determined as well as the coupling of $a_0(980)$ to $K\bar{K}$, $g_{K\bar{K}}=3.77\pm 0.24\text{(stat.)}\pm0.35\text{(sys.)} GeV$. The branching fraction of the decay $D^0\rightarrow K_{S}^{0} K^{+} K^{-}$ is measured using $11660\pm 118$ untagged signal decays to be $(4.51\pm 0.05\text{(stat.)}\pm 0.16\text{(sys.)})10^{-3}$. Both measurements are limited by their systematic uncertainties.

preprint2020arXiv

Atomic line defects and zero-energy end states in monolayer Fe(Te,Se) high-temperature superconductors

Majorana zero-energy bound states (ZEBSs) have been proposed to exist at the ends of one-dimensional Rashba nanowires proximity-coupled to an s-wave superconductor in an external magnetic field induced Zeeman field. Such hybrid structures have been a central platform in the search for non-Abelian Majorana zero modes (MZMs) toward fault-tolerant topological quantum computing. Here we report the discovery of ZEBSs simultaneously appearing at each end of a one-dimensional atomic line defect in monolayer iron-based high-temperature superconductor FeTe0.5Se0.5 films grown on SrTiO3(001) substrates. The spectroscopic properties of the ZEBSs, including the temperature and tunneling barrier dependences, as well as their fusion induced by coupling on line defects of different lengths are found to be robust and consistent with those of the MZMs. These observations suggest a realization of topological Shockley defects at the ends of an atomic line defect in a two-dimensional s-wave superconductor that can host a Kramers pair of MZMs protected by time-reversal symmetry along the chain. Our findings reveal an unprecedented class of topological line defect excitations in two-dimensional superconductor FeTe0.5Se0.5 monolayer films and offer an advantageous platform for generating topological zero-energy excitations at higher operating temperatures, in a single material, and under zero external magnetic field.

preprint2020arXiv

Computation of the Expected Euler Characteristic for the Largest Eigenvalue of a Real Non-central Wishart Matrix

We give an approximate formula for the distribution of the largest eigenvalue of real Wishart matrices by the expected Euler characteristic method for the general dimension. The formula is expressed in terms of a definite integral with parameters. We derive a differential equation satisfied by the integral for the $2 \times 2$ matrix case and perform a numerical analysis of it.

preprint2020arXiv

Correlated insulating phases of twisted bilayer graphene at commensurate filling fractions: a Hartree-Fock study

Motivated by the recently observed insulating states in twisted bilayer graphene, we study the nature of the correlated insulating phases of the twisted bilayer graphene at commensurate filling fractions. We use the continuum model and project the Coulomb interaction onto the flat bands to study the ground states by using a Hartree-Fock approximation. In the absence of the hexagonal boron nitride substrate, the ground states are the intervalley coherence states at charge neutrality (filling $ν$ = 0, or four electrons per moiré cell) and at $ν$ = -1/4 and -1/2 (three and two electrons per cell, respectively) and the $C_2\mathcal{T}$ symmetry-broken state at $ν$= -3/4 (one electron per cell). The hexagonal boron nitride substrate drives the ground states at all $ν$ into $C_2\mathcal{T}$ symmetry broken-states. Our results provide good reference points for further study of the rich correlated physics in the twisted bilayer graphene.

preprint2020arXiv

Cross section measurement of $e^+e^- \rightarrow η&#39;J/ψ$ from $\sqrt{s} = 4.178$ to $4.600$ GeV

The cross section of the process $e^+e^- \rightarrow η&#39;J/ψ$ is measured at center-of-mass energies from $\sqrt{s} =$ 4.178 to 4.600 GeV using data samples corresponding to a total integrated luminosity of 11 fb$^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring. The dependence of the cross section on $\sqrt{s}$ shows an enhancement around $4.2$ GeV. While the shape of the cross section cannot be fully explained with a single $ψ(4160)$ or $ψ(4260)$ state, a coherent sum of the two states does provide a reasonable description of the data.

preprint2020arXiv

Deep Multi-Task Learning via Generalized Tensor Trace Norm

The trace norm is widely used in multi-task learning as it can discover low-rank structures among tasks in terms of model parameters. Nowadays, with the emerging of big datasets and the popularity of deep learning techniques, tensor trace norms have been used for deep multi-task models. However, existing tensor trace norms cannot discover all the low-rank structures and they require users to manually determine the importance of their components. To solve those two issues together, in this paper, we propose a Generalized Tensor Trace Norm (GTTN). The GTTN is defined as a convex combination of matrix trace norms of all possible tensor flattenings and hence it can discover all the possible low-rank structures. In the induced objective function, we will learn combination coefficients in the GTTN to automatically determine the importance. Experiments on real-world datasets demonstrate the effectiveness of the proposed GTTN.

preprint2020arXiv

Democratizing the Edge: A Pervasive Edge Computing Framework

The needs of emerging applications, such as augmented and virtual reality, federated machine learning, and autonomous driving, have motivated edge computing--the push of computation capabilities to the edge. Various edge computing architectures have emerged, including multi-access edge computing and edge-cloud, all with the premise of reducing communication latency and augmenting privacy. However, these architectures rely on static and pre-deployed infrastructure, falling short in harnessing the abundant resources at the network&#39;s edge. In this paper, we discuss the design of Pervasive Edge Computing (PEC)--a democratized edge computing framework, which enables end-user devices (e.g., smartphones, IoT devices, and vehicles) to dynamically participate in a large-scale computing ecosystem. Our vision of the democratized edge involves the real-time composition of services using available edge resources like data, software, and compute-hardware from multiple stakeholders. We discuss how the novel Named-Data Networking architecture can facilitate service deployment, discovery, invocation, and migration. We also discuss the economic models critical to the adoption of PEC and the outstanding challenges for its full realization.

preprint2020arXiv

Detecting Nematic Order in STM/STS Data with Artificial Intelligence

Detecting the subtle yet phase defining features in Scanning Tunneling Microscopy and Spectroscopy data remains an important challenge in quantum materials. We meet the challenge of detecting nematic order from local density of states data with supervised machine learning and artificial neural networks for the difficult scenario without sharp features such as visible lattice Bragg peaks or Friedel oscillation signatures in the Fourier transform spectrum. We train the artificial neural networks to classify simulated data of isotropic and anisotropic two-dimensional metals in the presence of disorder. The supervised machine learning succeeds only with at least one hidden layer in the ANN architecture, demonstrating it is a higher level of complexity than nematic order detected from Bragg peaks which requires just two neurons. We apply the finalized ANN to experimental STM data on CaFe2As2, and it predicts nematic symmetry breaking with 99% confidence (probability 0.99), in agreement with previous analysis. Our results suggest ANNs could be a useful tool for the detection of nematic order in STM data and a variety of other forms of symmetry breaking.

preprint2020arXiv

Detecting Problem Statements in Peer Assessments

Effective peer assessment requires students to be attentive to the deficiencies in the work they rate. Thus, their reviews should identify problems. But what ways are there to check that they do? We attempt to automate the process of deciding whether a review comment detects a problem. We use over 18,000 review comments that were labeled by the reviewees as either detecting or not detecting a problem with the work. We deploy several traditional machine-learning models, as well as neural-network models using GloVe and BERT embeddings. We find that the best performer is the Hierarchical Attention Network classifier, followed by the Bidirectional Gated Recurrent Units (GRU) Attention and Capsule model with scores of 93.1% and 90.5% respectively. The best non-neural network model was the support vector machine with a score of 89.71%. This is followed by the Stochastic Gradient Descent model and the Logistic Regression model with 89.70% and 88.98%.

preprint2020arXiv

Determination of strong-phase parameters in $D\rightarrow K^0_{S,L}π^+π^-$

We report the most precise measurements to date of the strong-phase parameters between $D^0$ and $\bar{D}^0$ decays to $K^0_{S,L}π^+π^-$ using a sample of 2.93 fb$^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector at the BEPCII collider. Our results provide the key inputs for a binned model-independent determination of the Cabibbo-Kobayashi-Maskawa angle $γ/ϕ_3$ with $B$ decays. Using our results, the decay model sensitivity to the $γ/ϕ_3$ measurement is expected to be between 0.7$^{\circ}$ and 1.2$^{\circ}$, approximately a factor of three smaller than that achievable with previous measurements. The improved precision of this work ensures that measurements of $γ/ϕ_3$ will not be limited by knowledge of strong phases for the next decade. Furthermore, our results provide critical input for other flavor-physics investigations, including charm mixing, other measurements of $CP$ violation, and the measurement of strong-phase parameters for other $D$-decay modes.

preprint2020arXiv

Discovery of oscillations above 200 keV in a black hole X-ray binary with Insight-HXMT

Low-frequency quasi-periodic oscillations (LFQPOs) are commonly found in black hole X-ray binaries, and their origin is still under debate. The properties of LFQPOs at high energies (above 30 keV) are closely related to the nature of the accretion flow in the innermost regions, and thus play a crucial role in critically testing various theoretical models. The Hard X-ray Modulation Telescope (Insight-HXMT) is capable of detecting emissions above 30 keV, and is therefore an ideal instrument to do so. Here we report the discovery of LFQPOs above 200 keV in the new black hole MAXI J1820+070 in the X-ray hard state, which allows us to understand the behaviours of LFQPOs at hundreds of kiloelectronvolts. The phase lag of the LFQPO is constant around zero below 30 keV, and becomes a soft lag (that is, the high-energy photons arrive first) above 30 keV. The soft lag gradually increases with energy and reaches ~0.9s in the 150-200 keV band. The detection at energies above 200 keV, the large soft lag and the energy-related behaviors of the LFQPO pose a great challenge for most currently existing models, but suggest that the LFQPO probably originates from the precession of a small-scale jet.

preprint2020arXiv

Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections

Summarizing data samples by quantitative measures has a long history, with descriptive statistics being a case in point. However, as natural language processing methods flourish, there are still insufficient characteristic metrics to describe a collection of texts in terms of the words, sentences, or paragraphs they comprise. In this work, we propose metrics of diversity, density, and homogeneity that quantitatively measure the dispersion, sparsity, and uniformity of a text collection. We conduct a series of simulations to verify that each metric holds desired properties and resonates with human intuitions. Experiments on real-world datasets demonstrate that the proposed characteristic metrics are highly correlated with text classification performance of a renowned model, BERT, which could inspire future applications.

preprint2020arXiv

Erratum to &#34;Measurement of the $e^+e^-\toπ^+π^-$ cross section between 600 and 900 MeV using initial state radiation&#34;

In Phys. Lett. B 753, 629-638 (2016) [arXiv:1507.08188] the BESIII collaboration published a cross section measurement of the process $e^+e^-\to π^+ π^-$ in the energy range between 600 and 900 MeV. In this erratum we report a corrected evaluation of the statistical errors in terms of a fully propagated covariance matrix. The correction also yields a reduced statistical uncertainty for the hadronic vacuum polarization contribution to the anomalous magnetic moment of the muon, which now reads as $a_μ^{ππ\mathrm{, LO}}(600 - 900\,\mathrm{MeV}) = (368.2 \pm 1.5_{\rm stat} \pm 3.3_{\rm syst})\times 10^{-10}$. The central values of the cross section measurement and of $a_μ^{ππ\mathrm{, LO}}$, as well as the systematic uncertainties remain unchanged.

preprint2020arXiv

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

Mode connectivity is a surprising phenomenon in the loss landscape of deep nets. Optima -- at least those discovered by gradient-based optimization -- turn out to be connected by simple paths on which the loss function is almost constant. Often, these paths can be chosen to be piece-wise linear, with as few as two segments. We give mathematical explanations for this phenomenon, assuming generic properties (such as dropout stability and noise stability) of well-trained deep nets, which have previously been identified as part of understanding the generalization properties of deep nets. Our explanation holds for realistic multilayer nets, and experiments are presented to verify the theory.

preprint2020arXiv

Extension of causal decomposition in the mutual complex dynamic process

Causal decomposition depicts a cause-effect relationship that is not based on the concept of prediction, but based on the phase dependence of time series. It has been validated in both stochastic and deterministic systems and is now anticipated for its application in the complex dynamic process. Here, we present an extension of causal decomposition in the mutual complex dynamic process: cause and effect of time series are inherited in the decomposition of intrinsic components in a similar time scale. Furthermore, we illustrate comparative studies with predominate methods used in neuroscience, and show the applicability of the method particularly to physiological time series in brain-muscle interactions, implying the potential to the causality analysis in the complex physiological process.

preprint2020arXiv

First Measurements of $χ_{cJ}\rightarrow Σ^{-} \barΣ^{+} (J = 0, 1, 2)$ Decays

We measured the branching fractions of the decays $χ_{cJ}\toΣ^{-}\barΣ^{+}$ for the first time using the final states $n\bar{n}π^{+}π^{-}$. The data sample exploited here is $448.1\times10^{6}$ $ψ(3686)$ events collected with BESIII. We find $\mathcal{B}(χ_{cJ}\rightarrowΣ^{-}\barΣ^{+}) = (51.3\pm2.4\pm4.1)\times10^{-5},\, (5.7\pm1.4\pm0.6)\times10^{-5},\, \rm{and}~ (4.4\pm1.7\pm0.5)\times10^{-5}$, for $J=0,1,2$, respectively, where the first uncertainties are statistical and the second systematic.

preprint2020arXiv

Fractionalized Excitations Revealed by Entanglement Entropy

Fractionalized excitations develop in many unusual many-body states such as quantum spin liquids, disordered phases that cannot be described using any local order parameter. Because these exotic excitations correspond to emergent degrees of freedom, how to probe them and establish their existence is a long-standing challenge. We present a general procedure to reveal the fractionalized excitations using real-space entanglement entropy in critical spin liquids that are particularly relevant to experiments. Moreover, we show how to use the entanglement entropy to construct the corresponding spinon Fermi surface. Our work defines a new pathway to establish and characterize exotic excitations in novel quantum phases of matter.

preprint2020arXiv

Future Physics Programme of BESIII

There has recently been a dramatic renewal of interest in the subjects of hadron spectroscopy and charm physics. This renaissance has been driven in part by the discovery of a plethora of charmonium-like $XYZ$ states at BESIII and $B$ factories, and the observation of an intriguing proton-antiproton threshold enhancement and the possibly related $X(1835)$ meson state at BESIII, as well as the threshold measurements of charm mesons and charm baryons. We present a detailed survey of the important topics in tau-charm physics and hadron physics that can be further explored at BESIII over the remaining lifetime of BEPCII operation. This survey will help in the optimization of the data-taking plan over the coming years, and provides physics motivation for the possible upgrade of BEPCII to higher luminosity.

preprint2020arXiv

Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling On-the-Fly for Moving Object Detection

Moving object detection is critical for automated video analysis in many vision-related tasks, such as surveillance tracking, video compression coding, etc. Robust Principal Component Analysis (RPCA), as one of the most popular moving object modelling methods, aims to separate the temporally varying (i.e., moving) foreground objects from the static background in video, assuming the background frames to be low-rank while the foreground to be spatially sparse. Classic RPCA imposes sparsity of the foreground component using l1-norm, and minimizes the modeling error via 2-norm. We show that such assumptions can be too restrictive in practice, which limits the effectiveness of the classic RPCA, especially when processing videos with dynamic background, camera jitter, camouflaged moving object, etc. In this paper, we propose a novel RPCA-based model, called Hyper RPCA, to detect moving objects on the fly. Different from classic RPCA, the proposed Hyper RPCA jointly applies the maximum correntropy criterion (MCC) for the modeling error, and Laplacian scale mixture (LSM) model for foreground objects. Extensive experiments have been conducted, and the results demonstrate that the proposed Hyper RPCA has competitive performance for foreground detection to the state-of-the-art algorithms on several well-known benchmark datasets.

preprint2020arXiv

Inclusive charged and neutral particle multiplicity distributions in $χ_{cJ}$ and $J/ψ$ decays

Using a sample of 106 million $ψ(3686)$ decays, $ψ(3686) \to γχ_{cJ} (J = 0, 1, 2)$ and $ψ(3686) \to γχ_{cJ}, χ_{cJ} \to γJ/ψ$ $(J = 1, 2)$ events are utilized to study inclusive $χ_{cJ} \to$ anything, $χ_{cJ} \to$ hadrons, and $J/ψ\to$ anything distributions, including distributions of the number of charged tracks, electromagnetic calorimeter showers, and $π^0$s, and to compare them with distributions obtained from the BESIII Monte Carlo simulation. Information from each Monte Carlo simulated decay event is used to construct matrices connecting the detected distributions to the input predetection &#34;produced&#34; distributions. Assuming these matrices also apply to data, they are used to predict the analogous produced distributions of the decay events. Using these, the charged particle multiplicities are compared with results from MARK I. Further, comparison of the distributions of the number of photons in data with those in Monte Carlo simulation indicates that G-parity conservation should be taken into consideration in the simulation.

preprint2020arXiv

Intriguing effects of underlying star topology in Schelling&#39;s model with blocks

We explore the intriguing effects of underlying star topological structure in the framework of Schelling&#39;s segregation model with blocks. The significant consequences exerted by the star topology are both theoretically analysed and numerically simulated with and without introducing a fraction of altruistic agents, respectively. The collective utility of the model with pure egoists alone can be optimized and the optimum stationary state is achieved with the underlying star topology of blocks. More surprisingly, once a proportion of altruists are introduced, the average utility gradually decreases as altruists&#39; fraction increases. This presents a sharp contrast to the results in Schelling&#39;s model with lattice topology of blocks. Furthermore, an adding-link mechanism is introduced to bridge the gap between the lattice and the star topologies, and extend our analysis to more general scenarios. A novel scaling law of the average utility function are found for star topology of blocks.

preprint2020arXiv

Is the epidemic spread related to GDP? Visualizing the distribution of COVID-19 in Chinese Mainland

In December 2019, COVID-19 were detected in Wuhan City, Hubei Province of China. SARS-CoV-2 rapidly spread to the whole Chinese mainland with the people during the Chinese Spring Festival Travel Rush. As of 19 February 2020, 74576 confirmed cases of COVID-19 had been reported in Chinese Mainland. What kind of cities have more confirmed cases, and is there any relationship between GDP and confirmed cases? In this study, we explored the relationship between the confirmed cases of COVID-19 and GDP at the prefectural-level, found a positive correlation between them. This finding warns high GDP areas should pay more prevention and control efforts when an epidemic outbreak, as they have greater risks than other areas nearby.

preprint2020arXiv

Learning to Classify Intents and Slot Labels Given a Handful of Examples

Intent classification (IC) and slot filling (SF) are core components in most goal-oriented dialogue systems. Current IC/SF models perform poorly when the number of training examples per class is small. We propose a new few-shot learning task, few-shot IC/SF, to study and improve the performance of IC and SF models on classes not seen at training time in ultra low resource scenarios. We establish a few-shot IC/SF benchmark by defining few-shot splits for three public IC/SF datasets, ATIS, TOP, and Snips. We show that two popular few-shot learning algorithms, model agnostic meta learning (MAML) and prototypical networks, outperform a fine-tuning baseline on this benchmark. Prototypical networks achieves significant gains in IC performance on the ATIS and TOP datasets, while both prototypical networks and MAML outperform the baseline with respect to SF on all three datasets. In addition, we demonstrate that joint training as well as the use of pre-trained language models, ELMo and BERT in our case, are complementary to these few-shot learning methods and yield further gains.

preprint2020arXiv

MAGIC: Manifold and Graph Integrative Convolutional Network for Low-Dose CT Reconstruction

Low-dose computed tomography (LDCT) scans, which can effectively alleviate the radiation problem, will degrade the imaging quality. In this paper, we propose a novel LDCT reconstruction network that unrolls the iterative scheme and performs in both image and manifold spaces. Because patch manifolds of medical images have low-dimensional structures, we can build graphs from the manifolds. Then, we simultaneously leverage the spatial convolution to extract the local pixel-level features from the images and incorporate the graph convolution to analyze the nonlocal topological features in manifold space. The experiments show that our proposed method outperforms both the quantitative and qualitative aspects of state-of-the-art methods. In addition, aided by a projection loss component, our proposed method also demonstrates superior performance for semi-supervised learning. The network can remove most noise while maintaining the details of only 10% (40 slices) of the training data labeled.

preprint2020arXiv

MD-Recon-Net: A Parallel Dual-Domain Convolutional Neural Network for Compressed Sensing MRI

Compressed sensing magnetic resonance imaging (CS-MRI) is a theoretical framework that can accurately reconstruct images from undersampled k-space data with a much lower sampling rate than the one set by the classical Nyquist-Shannon sampling theorem. Therefore, CS-MRI can efficiently accelerate acquisition time and relieve the psychological burden on patients while maintaining high imaging quality. The problems with traditional CS-MRI reconstruction are solved by iterative numerical solvers, which usually suffer from expensive computational cost and the lack of accurate handcrafted priori. In this paper, inspired by deep learning&#39;s (DL&#39;s) fast inference and excellent end-to-end performance, we propose a novel cascaded convolutional neural network called MD-Recon-Net to facilitate fast and accurate MRI reconstruction. Especially, different from existing DL-based methods, which operate on single domain data or both domains in a certain order, our proposed MD-Recon-Net contains two parallel and interactive branches that simultaneously perform on k-space and spatial-domain data, exploring the latent relationship between k-space and the spatial domain. The simulated experimental results show that the proposed method not only achieves competitive visual effects to several state-of-the-art methods, but also outperforms other DL-based methods in terms of model scale and computational cost.

preprint2020arXiv

Measurement of {\boldmath $J/ψ\toΞ(1530)^{-}\barΞ^{+}$} and evidence for the radiative decay {\boldmath $Ξ(1530)^{-}\toγΞ^-$}

The SU(3)-flavor violating decay $J/ψ\toΞ(1530)^{-}\barΞ^{+}+c.c.$ is studied using $(1310.6\pm7.0)\times 10^{6} ~J/ψ$ events collected with the BESIII detector at BEPCII and the branching fraction is measured to be ${\cal{B}}(J/ψ\toΞ(1530)^{-}\barΞ^{+}+c.c.)=(3.17\pm0.02_{\rm stat.}\pm0.08_{\rm syst.})\times10^{-4}$. This is consistent with previous measurements with an improved precision. The angular parameter for this decay is measured for the first time and is found to be $α=-0.21\pm0.04_{\rm stat.}\pm0.06_{\rm syst.}$. In addition, we report evidence for the radiative decay $Ξ(1530)^{-}\toγΞ^- $ with a significance of 3.9$σ$, including the systematic uncertainties. The 90\% confidence level upper limit on the branching fraction is determined to be $\mathcal{B}(Ξ(1530)^{-}\toγΞ^- )\leq3.7$\%.

preprint2020arXiv

Measurement of proton electromagnetic form factors in $e^+e^- \to p\bar{p}$ in the energy region 2.00-3.08 GeV

The process of $e^+e^- \rightarrow p\bar{p}$ is studied at 22 center-of-mass energy points ($\sqrt{s}$) from 2.00 to 3.08 GeV, exploiting 688.5~pb$^{-1}$ of data collected with the BESIII detector operating at the BEPCII collider. The Born cross section~($σ_{p\bar{p}}$) of $e^+e^- \rightarrow p\bar{p}$ is measured with the energy-scan technique and it is found to be consistent with previously published data, but with much improved accuracy. In addition, the electromagnetic form-factor ratio ($|G_{E}/G_{M}|$) and the value of the effective ($|G_{\rm{eff}}|$), electric ($|G_E|$) and magnetic ($|G_M|$) form factors are measured by studying the helicity angle of the proton at 16 center-of-mass energy points. $|G_{E}/G_{M}|$ and $|G_M|$ are determined with high accuracy, providing uncertainties comparable to data in the space-like region, and $|G_E|$ is measured for the first time. We reach unprecedented accuracy, and precision results in the time-like region provide information to improve our understanding of the proton inner structure and to test theoretical models which depend on non-perturbative Quantum Chromodynamics.

preprint2020arXiv

Measurement of Singly Cabibbo-Suppressed Decays $D \to ωππ$

Using 2.93 fb$^{-1}$ of $e^{+}e^{-}$ collision data taken at a center-of-mass energy of 3.773 GeV by the BESIII detector at the BEPCII, we measure the branching fractions of the singly Cabibbo-suppressed decays $D \to ωππ$ to be $\mathcal{B}(D^0 \to ωπ^+π^-) = (1.33 \pm 0.16 \pm 0.12)\times 10^{-3}$ and $\mathcal{B}(D^+ \to ωπ^+π^0) =(3.87 \pm 0.83 \pm 0.25)\times 10^{-3}$, where the first uncertainties are statistical and the second ones systematic. The statistical significances are $12.9σ$ and $7.7 σ$, respectively. The precision of $\mathcal{B}(D^0 \to ωπ^+π^-)$ is improved by a factor of 2.1 over the CLEO measurement, and $\mathcal{B}(D^+ \to ωπ^+π^0)$ is measured for the first time. No significant signal of $\mathcal{B}(D^0 \to ωπ^0π^0)$ is observed, and the upper limit on the branching fraction is $\mathcal{B}(D^0 \to ωπ^0π^0) < 1.10 \times 10^{-3}$ at the $90\%$ confidence level. The branching fractions of $D\to ηππ$ are also measured and consistent with existing results.

preprint2020arXiv

Measurement of the Born Cross Sections for $e^+e^-\to D_s^+ D_{s1}(2460)^- +c.c.$ and $e^+e^-\to D_s^{\ast +} D_{s1}(2460)^- +c.c.$

The processes $e^+e^-\to D_s^+ D_{s1}(2460)^- +c.c.$ and $e^+e^-\to D_s^{\ast +} D_{s1}(2460)^- +c.c.$ are studied for the first time using data samples collected with the BESIII detector at the BEPCII collider. The Born cross sections of $e^+e^-\to D_s^+ D_{s1}(2460)^- +c.c.$ at nine center-of-mass energies between 4.467\,GeV and 4.600\,GeV and those of $e^+e^-\to D_s^{\ast +} D_{s1}(2460)^- +c.c.$ at ${\sqrt s}=$ 4.590\,GeV and 4.600\,GeV are measured. No obvious charmonium or charmonium-like structure is seen in the measured cross sections.

preprint2020arXiv

Measurement of the cross section for $e^{+}e^{-}\rightarrowΞ^{-}\barΞ^{+}$ and observation of an excited $Ξ$ baryon

Using a total of 11.0 fb$^{-1}$ of $e^{+}e^{-}$ collision data with center-of-mass energies between 4.009 GeV and 4.6 GeV and collected with the BESIII detector at BEPCII, we measure fifteen exclusive cross sections and effective form factors for the process $e^{+}e^{-}\rightarrowΞ^{-}\barΞ^{+}$ by means of a single baryon-tag method. After performing a fit to the dressed cross section of $e^{+}e^{-}\rightarrowΞ^{-}\barΞ^{+}$, no significant $ψ(4230)$ or $ψ(4260)$ resonance is observed in the $Ξ^{-}\barΞ^{+}$ final states, and upper limits at the 90\% confidence level on $Γ_{ee}\mathcal{B}$ for the processes $ψ(4230)$/$ψ(4260)\rightarrowΞ^{-}\barΞ^{+}$ are determined. In addition, an excited $Ξ$ baryon at 1820 MeV/$c^{2}$ is observed with a statistical significance of 6.2 $\sim$ 6.5$σ$ by including the systematic uncertainty, and the mass and width are measured to be $M = (1825.5 \pm 4.7 \pm 4.7)$~MeV/$c^{2}$ and $Γ= (17.0 \pm 15.0 \pm 7.9)$~MeV, which confirms the existence of the $J^{P}=\frac{3}{2}^{-}$ state $Ξ(1820)$.

preprint2020arXiv

Model-independent determination of the relative strong-phase difference between $D^0$ and $\bar{D}^0\rightarrow K^0_{S,L}π^+π^-$ and its impact on the measurement of the CKM angle $γ/ϕ_3$

Crucial inputs for a variety of $CP$-violation studies can be determined through the analysis of pairs of quantum-entangled neutral $D$ mesons, which are produced in the decay of the $ψ(3770)$ resonance. The relative strong-phase parameters between $D^0$ and $\bar{D}^0$ in the decays $D^0\rightarrow K^0_{S,L}π^+π^-$ are studied using 2.93~${\rm fb}^{-1}$ of $e^+e^-$ annihilation data delivered by the BEPCII collider and collected by the BESIII detector at a center-of-mass energy of 3.773 GeV. Results are presented in regions of the phase space of the decay. These are the most precise measurements to date of the strong-phase parameters in $D \to K_{S,L}^0π^+π^-$ decays. Using these parameters, the associated uncertainty on the Cabibbo-Kobayashi-Maskawa angle $γ/ϕ_3$ is expected to be between $0.7^\circ$ and $1.2^\circ$, for an analysis using the decay $B^{\pm}\rightarrow DK^{\pm}$, $D\rightarrow K^0_Sπ^+π^-$, where $D$ represents a superposition of $D^0$ and $\bar{D^0}$ states. This is a factor of three smaller than that achievable with previous measurements. Furthermore, these results provide valuable input for charm-mixing studies, other measurements of $CP$ violation, and the measurement of strong-phase parameters for other $D$-decay modes.

preprint2020arXiv

Neural Network Solver for Small Quantum Clusters

Machine learning approaches have recently been applied to the study of various problems in physics. Most of the studies are focused on interpreting the data generated by conventional numerical methods or an existing database. An interesting question is whether it is possible to use a machine learning approach, in particular a neural network, for solving the many-body problem. In this paper, we present a solver for interacting quantum problem for small clusters based on the neural network. We study the small quantum cluster which mimics the single impurity Anderson model. We demonstrate that the neural network based solver provides quantitatively accurate results for the spectral function as compared to the exact diagonalization method. This opens the possibility of utilizing the neural network approach as an impurity solver for other many body numerical approaches, such as dynamical mean field theory.

preprint2020arXiv

Noise-Powered Disentangled Representation for Unsupervised Speckle Reduction of Optical Coherence Tomography Images

Due to its noninvasive character, optical coherence tomography (OCT) has become a popular diagnostic method in clinical settings. However, the low-coherence interferometric imaging procedure is inevitably contaminated by heavy speckle noise, which impairs both visual quality and diagnosis of various ocular diseases. Although deep learning has been applied for image denoising and achieved promising results, the lack of well-registered clean and noisy image pairs makes it impractical for supervised learning-based approaches to achieve satisfactory OCT image denoising results. In this paper, we propose an unsupervised OCT image speckle reduction algorithm that does not rely on well-registered image pairs. Specifically, by employing the ideas of disentangled representation and generative adversarial network, the proposed method first disentangles the noisy image into content and noise spaces by corresponding encoders. Then, the generator is used to predict the denoised OCT image with the extracted content features. In addition, the noise patches cropped from the noisy image are utilized to facilitate more accurate disentanglement. Extensive experiments have been conducted, and the results suggest that our proposed method is superior to the classic methods and demonstrates competitive performance to several recently proposed learning-based approaches in both quantitative and qualitative aspects.

preprint2020arXiv

Objects detection for remote sensing images based on polar coordinates

Arbitrary-oriented object detection is an important task in the field of remote sensing object detection. Existing studies have shown that the polar coordinate system has obvious advantages in dealing with the problem of rotating object modeling, that is, using fewer parameters to achieve more accurate rotating object detection. However, present state-of-the-art detectors based on deep learning are all modeled in Cartesian coordinates. In this article, we introduce the polar coordinate system to the deep learning detector for the first time, and propose an anchor free Polar Remote Sensing Object Detector (P-RSDet), which can achieve competitive detection accuracy via uses simpler object representation model and less regression parameters. In P-RSDet method, arbitrary-oriented object detection can be achieved by predicting the center point and regressing one polar radius and two polar angles. Besides, in order to express the geometric constraint relationship between the polar radius and the polar angle, a Polar Ring Area Loss function is proposed to improve the prediction accuracy of the corner position. Experiments on DOTA, UCAS-AOD and NWPU VHR-10 datasets show that our P-RSDet achieves state-of-the-art performances with simpler model and less regression parameters.

preprint2020arXiv

Observation of a resonant structure in $e^{+}e^{-} \to ωη$ and another in $e^{+}e^{-} \to ωπ^{0}$ at center-of-mass energies between 2.00 and 3.08 GeV

Born cross sections for the processes $e^+e^- \to ωη$ and $e^+e^- \to ωπ^{0}$ have been determined for center-of-mass energies between 2.00 and 3.08 GeV with the BESIII detector at the BEPCII collider. The results obtained in this work are consistent with previous measurements but with improved precision. Two resonant structures are observed. In the $e^{+}e^{-} \to ωη$ cross sections, a resonance with a mass of $(2179 \pm 21 \pm 3)\text{MeV}/c^2$ and a width of $(89 \pm 28 \pm 5)\text{MeV}$ is observed with a significance of 6.1$σ$. Its properties are consistent with the $ϕ(2170)$. In the $e^{+}e^{-} \toωπ^{0}$ cross sections, a resonance denoted $Y(2040)$ is observed with a significance of more than 10$σ$. Its mass and width are determined to be $(2034 \pm 13 \pm 9)\text{MeV}/c^2$ and $(234 \pm 30 \pm 25)\text{MeV}$, respectively, where the first uncertainties are statistical and the second ones are systematic.

preprint2020arXiv

Observation of a structure in $e^+e^- \to ϕη^{\prime}$ at $\sqrt{s}$ from 2.05 to 3.08 GeV

The process $e^{+}e^{-} \to ϕη^{\prime}$ has been studied for the first time in detail using data sample collected with the BESIII detector at the BEPCII collider at center of mass energies from 2.05 to 3.08 GeV. A resonance with quantum numbers $J^{PC}=1^{--}$ is observed with mass $M$ = (2177.5 $\pm$ 4.8 (stat) $\pm$ 19.5 (syst)) MeV/${ \it{c}^{\mathrm{2}}}$ and width $Γ$ = (149.0 $\pm$ 15.6 (stat) $\pm$ 8.9 (syst)) MeV with a statistical significance larger than 10$σ$. The observed structure could be identified with the $ϕ(2170)$, then the ratio of partial width between the $ϕη^{\prime}$ by BESIII and $ϕη$ by BABAR is ($\mathcal{B}^{R}_{ϕη}Γ^{R}_{ee})/{(\mathcal{B}^{R}_{ϕη^{\prime}}Γ^{R}_{ee})}$ = 0.23 $\pm$ 0.10 (stat) $\pm$ 0.18 (syst), which is smaller than the prediction of the $s\bar{s}g$ hybrid models by several orders of magnitude.

preprint2020arXiv

Observation of the $Y(4220)$ and $Y(4360)$ in the process $e^{+}e^{-} \to ηJ/ψ$

The cross sections of the process $e^{+}e^{-} \to ηJ/ψ$ at center-of-mass energies ($\sqrt{s}$) between 3.81 and 4.60 GeV are measured with high precision by using data samples collected with the BESIII detector operating at the BEPCII storage ring. Three structures are observed by analyzing the lineshape of the measured cross sections, and a maximum-likelihood fit including three resonances is performed by assuming the lowest lying structure is the $ψ(4040)$. For the other resonances, we obtain masses of $(4218.7 \pm 4.0 \pm 2.5)$ and $(4380.4 \pm 14.2 \pm 1.8)$ MeV/c$^{2}$ with corresponding widths of $(82.5 \pm 5.9 \pm 0.5)$ and $(147.0 \pm 63.0 \pm 25.8)$ MeV, respectively, where the first uncertainties are statistical and the second ones systematic. The measured resonant parameters are consistent with those of the $Y(4220)$ and $Y(4360)$ from pr evious measurements of different final states. For the first time, we observe the decays of the $Y(4220)$ and $Y(4360)$ into $ηJ/ψ$ final states.

preprint2020arXiv

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality

Adversarial training is a popular method to give neural nets robustness against adversarial perturbations. In practice adversarial training leads to low robust training loss. However, a rigorous explanation for why this happens under natural conditions is still missing. Recently a convergence theory for standard (non-adversarial) supervised training was developed by various groups for {\em very overparametrized} nets. It is unclear how to extend these results to adversarial training because of the min-max objective. Recently, a first step towards this direction was made by Gao et al. using tools from online learning, but they require the width of the net to be \emph{exponential} in input dimension $d$, and with an unnatural activation function. Our work proves convergence to low robust training loss for \emph{polynomial} width instead of exponential, under natural assumptions and with the ReLU activation. Key element of our proof is showing that ReLU networks near initialization can approximate the step function, which may be of independent interest.

preprint2020arXiv

Parallel Data Augmentation for Formality Style Transfer

The main barrier to progress in the task of Formality Style Transfer is the inadequacy of training data. In this paper, we study how to augment parallel data and propose novel and simple data augmentation methods for this task to obtain useful sentence pairs with easily accessible models and systems. Experiments demonstrate that our augmented parallel data largely helps improve formality style transfer when it is used to pre-train the model, leading to the state-of-the-art results in the GYAFC benchmark dataset.

preprint2020arXiv

Partial wave analysis of $ψ(3686)\rightarrow K^{+}K^{-}η$

Using a sample of $(448.1\pm2.9)\times10^6$ $ψ(3686)$ events collected with the BESIII detector, we perform the first partial wave analysis of $ψ(3686)\rightarrow K^+K^-η$. In addition to the well established states, $ϕ(1020)$, $ϕ(1680)$, and $K_3^*(1780)$, contributions from $X(1750)$, $ρ(2150)$, $ρ_3(2250)$, and $K^*_2(1980)$ are also observed. The $X(1750)$ state is determined to be a $1^{--}$ resonance. The simultaneous observation of the $ϕ(1680)$ and $X(1750)$ indicates that the $X(1750)$, with previous observations in photoproduction, is distinct from the $ϕ(1680)$. The masses, widths, branching fractions of $ψ(3686)\rightarrow K^+K^-η$ and the intermediate resonances are also measured.

preprint2020arXiv

Robust Platoon Control in Mixed Traffic Flow Based on Tube Model Predictive Control

The design of cooperative adaptive cruise control is critical in mixed traffic flow, where connected and automated vehicles (CAVs) and human-driven vehicles (HDVs) coexist. Compared with pure CAVs, the major challenge is how to handle the prediction uncertainty of HDVs, which can cause significant state deviation of CAVs from planned trajectories. In most existing studies, model predictive control (MPC) is utilized to replan CAVs&#39; trajectories to mitigate the deviation at each time step. However, as the replan process is usually conducted by solving an optimization problem with information through inter-vehicular communication, MPC methods suffer from heavy computational and communicational burdens. To address this limitation, a robust platoon control framework is proposed based on tube MPC in this paper. The prediction uncertainty is dynamically mitigated by the feedback control and restricted inside a set with a high probability. When the uncertainty exceeds the set or additional external disturbance emerges, the feedforward control is triggered to plan a ``tube&#39;&#39; (a sequence of the set), which can bound CAVs&#39; actual trajectories. As the replan process is usually not required, the proposed method is much more efficient regarding computation and communication, compared with the MPC method. Comprehensive simulations are provided to validate the effectiveness of the proposed framework.

preprint2020arXiv

Search for baryon and lepton number violating decays $D^+\to\barΛ(\barΣ^0)e^+$ and $D^+\toΛ(Σ^0)e^+$

Using a 2.93 fb$^{-1}$ data sample of electron-positron collisions taken with the BESIII detector at a center-of-mass energy of 3.773 GeV, which corresponds to $(8296\pm31\pm64)\times10^3 D^+D^-$ pairs, we search for the baryon and lepton number violating decays $D^+\to\barΛ(\barΣ^0)e^+$ and $D^+\toΛ(Σ^0)e^+$. No obvious signals are found with the current statistics and upper limits on the branching fractions of these four decays are set at the level of $10^{-6}$ at 90% confidence level.

preprint2020arXiv

Search for New Hadronic Decays of $h_c$ and Observation of $h_c\rightarrow K^{+}K^{-}π^{+}π^{-}π^{0}$

Ten hadronic final states of the $h_c$ decays are investigated via the process $ψ(3686)\rightarrow π^0 h_c$, using a data sample of $(448.1 \pm 2.9) \times 10^6$ $ψ(3686)$ events collected with the BESIII detector. The decay channel $h_c\rightarrow K^{+}K^{-}π^{+}π^{-}π^{0}$ is observed for the first time with a significance of $6.0 σ$. The corresponding branching fraction is determined to be $\mathcal{B}(h_c\rightarrow K^{+}K^{-}π^{+}π^{-}π^{0}) =(3.3 \pm 0.6 \pm 0.6)\times 10^{-3}$ (the first uncertainty is statistical and the second systematical). Evidence for the decays $h_c\rightarrow π^{+} π^{-} π^{0} η$ and $h_c\rightarrow K^{0}_{S}K^{\pm}π^{\mp}π^{+}π^{-}$ is found with a significance of $3.6 σ$ and $3.8 σ$, respectively. The corresponding branching fractions (and upper limits) are obtained to be $\mathcal{B}(h_c\rightarrow π^{+} π^{-} π^{0} η) =(7.2 \pm 1.8 \pm 1.3)\times 10^{-3}$ $(< 1.8 \times 10^{-2})$ and $\mathcal{B}(h_c\rightarrow K^{0}_{S}K^{\pm}π^{\mp}π^{+}π^{-}) =(2.8 \pm 0.9 \pm 0.5)\times 10^{-3}$ $(<4.7\times 10^{-3})$. Upper limits on the branching fractions for the final states $h_c \rightarrow K^{+}K^{-}π^{0}$, $K^{+}K^{-}η$, $K^{+}K^{-}π^{+}π^{-}η$, $2(K^{+}K^{-})π^{0}$, $K^{+}K^{-}π^{0}η$, $K^{0}_{S}K^{\pm}π^{\mp}$, and $p\bar{p}π^{0}π^{0}$ are determined at a confidence level of 90\%.

preprint2020arXiv

Search for the decay $J/ψ\toγ+ \rm {invisible}$

We search for $J/ψ$ radiative decays into a weakly interacting neutral particle, namely an invisible particle, using the $J/ψ$ produced through the process $ψ(3686)\toπ^+π^-J/ψ$ in a data sample of $(448.1\pm2.9)\times 10^6$ $ψ(3686)$ decays collected by the BESIII detector at BEPCII. No significant signal is observed. Using a modified frequentist method, upper limits on the branching fractions are set under different assumptions of invisible particle masses up to 1.2 $\mathrm{\ Ge\kern -0.1em V}/c^2$. The upper limit corresponding to an invisible particle with zero mass is 7.0$\times 10^{-7}$ at the 90\% confidence level.

preprint2020arXiv

Search for the semileptonic decay $D^{0(+)}\to b_1(1235)^{-(0)} e^+ν_e$

Using $2.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy $\sqrt{s}=3.773$ GeV with the BESIII detector operating at the BEPCII collider, we search for the semileptonic $D^{0(+)}$ decays into a $b_1(1235)^{-(0)}$ axial-vector meson for the first time. No significant signal is observed for either charge combination. The upper limits on the product branching fractions are ${\mathcal B}_{D^0\to b_1(1235)^- e^+ν_e}\cdot {\mathcal B}_{b_1(1235)^-\to ωπ^-}<1.12\times 10^{-4}$ and ${\mathcal B}_{D^+\to b_1(1235)^0 e^+ν_e}\cdot {\mathcal B}_{b_1(1235)^0\to ωπ^0}<1.75\times 10^{-4}$ at the 90\% confidence level.

preprint2020arXiv

Simultaneous state and parameter estimation: the role of sensitivity analysis

State and parameter estimation is essential for process monitoring and control. Observability plays an important role in both state and parameter estimation. In simultaneous state and parameter estimation, the parameters are often augmented as extra states of the original system. When the augmented system is observable, various existing state estimation approaches may be used to estimate the states and parameters simultaneously. However, when the augmented system is not observable, how we should proceed to maximally extract the information contained in the measured outputs is not clear. This paper concerns about simultaneous state and parameter estimation when the augmented system is not fully observable. Specifically, we first show how sensitivity analysis is related to observability of a dynamical system, and then illustrate how it may be used to select variables for simultaneous estimation. We also propose a moving horizon state estimation (MHE) design that can use the variable selection results in a natural way. Extensive simulations are carried out to show the efficiency of the proposed approach.

preprint2020arXiv

Spin coating TPB film on acrylics and measurement of its wavelength shifting efficiency

Scintillation light from liquid noble gas in a neutrino or dark matter experiment lies typically within the vacuum ultraviolet (VUV) region and might be strongly absorbed by surrounding materials such as light guides or photomultiplier. Tetraphenyl butadiene (TPB) is a fluorescent material and acts as a wavelength shifter (WLS) which can turn the UV light to the visible light around a peak wavelength of 425 nm, enabling the light signals to be detected easily for physics study. Compared with a traditional TPB coating method using vapor deposition, we propose an alternative technique with a spin coating procedure in order to facilitate the development of neutrino and dark matter detectors. This article introduces how to fabricate the TPB film on acrylics using the spin coating method, reports measurement of sample film thickness and roughness, shows the reemission spectrum, and quantifies the wavelength shifting efficiency (WLSE).

preprint2020arXiv

Study of $e^{+}e^{-} \to D^{+} D^{-} π^{+} π^{-} $ at center-of-mass energies from 4.36 to 4.60 GeV

We report a study of the $e^{+}e^{-} \to D^{+} D^{-} π^{+} π^{-}$ process using $e^{+}e^{-}$ collision data samples with an integrated luminosity of $2.5\,\rm{fb}^{-1}$ at center-of-mass energies from 4.36 to $4.60 \rm{GeV}$, collected with the BESIII detector at the BEPCII storage ring. The $D_{1}(2420)^+$ is observed in the $D^{+} π^{+} π^{-}$ mass spectrum. The mass and width of the $D_{1}(2420)^+$ are measured to be $(2427.2\pm 1.0_{\rm stat.}\pm 1.2_{\rm syst.}) \rm{MeV}/c^2$ and $(23.2\pm 2.3_{\rm stat.} \pm2.3_{\rm syst.}) \rm{MeV}$, respectively. The first errors are statistical and the second ones are systematic. In addition, the Born cross sections of the $e^{+}e^{-} \to D_{1}(2420)^+D^- + c.c. \to D^{+} D^{-} π^{+} π^{-}$ and $e^{+}e^{-} \to ψ(3770) π^{+} π^{-} \to D^{+} D^{-} π^{+} π^{-}$ processes are measured as a function of the center-of-mass energy.

preprint2020arXiv

Study of BESIII Trigger Efficiencies with the 2018 $J/ψ$ Data

Using a dedicated data sample taken in 2018 on the $J/ψ$ peak, we perform a detailed study of the trigger efficiencies of the BESIII detector. The efficiencies are determined from three representative physics processes, namely Bhabha-scattering, dimuon production and generic hadronic events with charged particles. The combined efficiency of all active triggers approaches $100\%$ in most cases with uncertainties small enough as not to affect most physics analyses.

preprint2020arXiv

Study of open-charm decays and radiative transitions of the X(3872)

The processes $X(3872)\to D^{*0}\bar{D^{0}}+c.c.,~γJ/ψ,~γψ(2S),$ and $γD^{+}D^{-}$ are searched for in a $9.0~\rm fb^{-1}$ data sample collected at center-of-mass energies between $4.178$ and $4.278$ GeV with the BESIII detector. We observe $X(3872)\to D^{*0}\bar{D^{0}}+c.c.$ and find evidence for $X(3872)\toγJ/ψ$ with statistical significances of $7.4σ$ and $3.5σ$, respectively. No evident signals for $X(3872)\toγψ(2S)$ and $γD^{+}D^{-}$ are found, and upper limit on the relative branching ratio $R_{γψ} \equiv\frac{\mathcal{B}(X(3872)\toγψ(2S))}{\mathcal{B}(X(3872)\toγJ/ψ)}<0.59$ is set at 90$\%$ confidence level. Measurements of branching ratios relative to decay $X(3872)\toπ^+π^- J/ψ$ are also reported for decays $X(3872)\to D^{*0}\bar{D^{0}}+c.c., ~γψ(2S),~γJ/ψ$, $γD^{+}D^{-}$, as well as the non-$D^{*0}\bar{D}^{0}$ three-body decays $π^0 D^{0}\bar{D}^{0}$ and $γD^{0}\bar{D}^{0}$.

preprint2020arXiv

Switches between accretion structures during flares in 4U 1901+03

We report on our analysis of the 2019 outburst of the X-ray accreting pulsar 4U 1901+03 observed with Insight-HXMT and NICER. Both spectra and pulse profiles evolve significantly in the decaying phase of the outburst. Dozens of flares are observed throughout the outburst. They are more frequent and brighter at the outburst peak. We find that the flares, which have a duration from tens to hundreds of seconds, are generally brighter than the persistent emission by a factor of $\sim$ 1.5. The pulse profile shape during the flares can be significantly different than that of the persistent emission. In particular, a phase shift is clearly observed in many cases. We interpret these findings as direct evidence of changes of the pulsed beam pattern, due to transitions between the sub- and super-critical accretion regimes on a short time scale. We also observe that at comparable luminosities the flares&#39; pulse profiles are rather similar to those of the persistent emission. This indicates that the accretion on the polar cap of the neutron star is mainly determined by the luminosity, i.e., the mass accretion rate.

preprint2020arXiv

Synthesize then Compare: Detecting Failures and Anomalies for Semantic Segmentation

The ability to detect failures and anomalies are fundamental requirements for building reliable systems for computer vision applications, especially safety-critical applications of semantic segmentation, such as autonomous driving and medical image analysis. In this paper, we systematically study failure and anomaly detection for semantic segmentation and propose a unified framework, consisting of two modules, to address these two related problems. The first module is an image synthesis module, which generates a synthesized image from a segmentation layout map, and the second is a comparison module, which computes the difference between the synthesized image and the input image. We validate our framework on three challenging datasets and improve the state-of-the-arts by large margins, \emph{i.e.}, 6% AUPR-Error on Cityscapes, 7% Pearson correlation on pancreatic tumor segmentation in MSD and 20% AUPR on StreetHazards anomaly segmentation.

preprint2020arXiv

The weak decay $B_c$ to $Z(3930)$ and $X(4160)$ by Bethe-Salpeter method

Considering $Z(3930)$ and $X(4160)$ as $χ_{c2}(2P)$ and $χ_{c2}(3P)$ states, the semileptonic and nonleptonic of $B_c$ decays to $Z(3930)$ and $X(4160)$ are studied by the improved Bethe-Salpeter(B-S) Method. The form factors of decay are calculated through the overlap integrals of the meson wave functions in the whole accessible kinematical range. The influence of relativistic corrections are considered in the exclusive decays. Branching ratios of $B_c$ weak decays to $Z(3930)$ and $X(4160)$ are predicted. Some of the branching ratios are: $Br(B_c^+\to Z(3930)e^+ν_e)$$=(3.03^{+0.09}_{-0.16})\times 10^{-4}$ and $Br(B_c^+\to X(4160)e^+ν_e)$$=(3.55^{+0.83}_{-0.35})\times 10^{-6}$. These results may provide useful information to discover $Z(3930)$ and $X(4160)$ and the necessary information for the phenomenological study of $B_c$ physics.

preprint2020arXiv

Towards Accuracy-Fairness Paradox: Adversarial Example-based Data Augmentation for Visual Debiasing

Machine learning fairness concerns about the biases towards certain protected or sensitive group of people when addressing the target tasks. This paper studies the debiasing problem in the context of image classification tasks. Our data analysis on facial attribute recognition demonstrates (1) the attribution of model bias from imbalanced training data distribution and (2) the potential of adversarial examples in balancing data distribution. We are thus motivated to employ adversarial example to augment the training data for visual debiasing. Specifically, to ensure the adversarial generalization as well as cross-task transferability, we propose to couple the operations of target task classifier training, bias task classifier training, and adversarial example generation. The generated adversarial examples supplement the target task training dataset via balancing the distribution over bias variables in an online fashion. Results on simulated and real-world debiasing experiments demonstrate the effectiveness of the proposed solution in simultaneously improving model accuracy and fairness. Preliminary experiment on few-shot learning further shows the potential of adversarial attack-based pseudo sample generation as alternative solution to make up for the training data lackage.

preprint2020arXiv

Vehicle Re-Identification Based on Complementary Features

In this work, we present our solution to the vehicle re-identification (vehicle Re-ID) track in AI City Challenge 2020 (AIC2020). The purpose of vehicle Re-ID is to retrieve the same vehicle appeared across multiple cameras, and it could make a great contribution to the Intelligent Traffic System(ITS) and smart city. Due to the vehicle&#39;s orientation, lighting and inter-class similarity, it is difficult to achieve robust and discriminative representation feature. For the vehicle Re-ID track in AIC2020, our method is to fuse features extracted from different networks in order to take advantages of these networks and achieve complementary features. For each single model, several methods such as multi-loss, filter grafting, semi-supervised are used to increase the representation ability as better as possible. Top performance in City-Scale Multi-Camera Vehicle Re-Identification demonstrated the advantage of our methods, and we got 5-th place in the vehicle Re-ID track of AIC2020. The codes are available at https://github.com/gggcy/AIC2020_ReID.

preprint2019arXiv

Dynamic 3-D measurement based on fringe-to-fringe transformation using deep learning

Fringe projection profilometry (FPP) has become increasingly important in dynamic 3-D shape measurement. In FPP, it is necessary to retrieve the phase of the measured object before shape profiling. However, traditional phase retrieval techniques often require a large number of fringes, which may generate motion-induced error for dynamic objects. In this paper, a novel phase retrieval technique based on deep learning is proposed, which uses an end-to-end deep convolution neural network to transform a single or two fringes into the phase retrieval required fringes. When the object&#39;s surface is located in a restricted depth, the presented network only requires a single fringe as the input, which otherwise requires two fringes in an unrestricted depth. The proposed phase retrieval technique is first theoretically analyzed, and then numerically and experimentally verified on its applicability for dynamic 3-D measurement.

preprint2019arXiv

Homogeneous Online Transfer Learning with Online Distribution Discrepancy Minimization

Transfer learning has been demonstrated to be successful and essential in diverse applications, which transfers knowledge from related but different source domains to the target domain. Online transfer learning(OTL) is a more challenging problem where the target data arrive in an online manner. Most OTL methods combine source classifier and target classifier directly by assigning a weight to each classifier, and adjust the weights constantly. However, these methods pay little attention to reducing the distribution discrepancy between domains. In this paper, we propose a novel online transfer learning method which seeks to find a new feature representation, so that the marginal distribution and conditional distribution discrepancy can be online reduced simultaneously. We focus on online transfer learning with multiple source domains and use the Hedge strategy to leverage knowledge from source domains. We analyze the theoretical properties of the proposed algorithm and provide an upper mistake bound. Comprehensive experiments on two real-world datasets show that our method outperforms state-of-the-art methods by a large margin.

preprint2019arXiv

Matching Rota-Baxter algebras, matching dendriform algebras and matching pre-Lie algebras

We introduce the notion of a matching Rota-Baxter algebra motivated by the recent work on multiple pre-Lie algebras arising from the study of algebraic renormalization of regularity structures~[10,18]. This notion is also related to iterated integrals with multiple kernels and solutions of the associative polarized Yang-Baxter equation. Generalizing the natural connection of Rota-Baxter algebras with dendriform algebras to matching Rota-Baxter algebras , we obtain the notion of matching dendriform algebras. As in the classical case of one operation, matching Rota-Baxter algebras and matching dendriform algebras are related to matching pre-Lie algebras which coincide with the aforementioned multiple pre-Lie algebras. More general notions and results on matching tridendriform algebras and matching PostLie algebras are also obtained.

preprint2019arXiv

Observation of the decays $χ_{cJ} \to ϕϕη$

Using a data sample of $(448.1\pm2.9)\times10^{6}$ $ψ(3686)$ decays collected by the BESIII detector at the Beijing Electron Positron Collider (BEPCII), we observe the decays $χ_{cJ}\to ϕϕη~(J=0,~1,~2)$, where the $χ_{cJ}$ are produced via the radiative processes $ψ(3686)\toγχ_{cJ}$. The branching fractions are measured to be $\mathcal B(χ_{c0}\toϕϕη)=(8.41\pm0.74\pm0.62)\times10^{-4}$, $\mathcal B(χ_{c1}\toϕϕη)=(2.96\pm0.43\pm0.22)\times 10^{-4}$, and $\mathcal B(χ_{c2} \to ϕϕη)=(5.33\pm0.52\pm0.39) \times 10^{-4}$, where the first uncertainties are statistical and the second are systematic. We also search for intermediate states in the $ϕϕ$ or $ηϕ$ combinations, but no significant structure is seen due to the limited statistics.

preprint2019arXiv

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

As China&#39;s first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed as Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progresses, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results.

preprint2019arXiv

Search for the rare decay $η&#39;\rightarrowπ^{0}π^{0}π^{0}π^{0}$ at BESIII

Based on a sample of 1.31 billion $J/ψ$ events collected with the BESIII detector, we perform a search for the rare decay $η&#39;\rightarrow 4π^{0}$ via $J/ψ\rightarrowγη&#39;$. No significant $η&#39;$ signal is observed in the invariant mass spectrum of 4$π^{0}$. With a Bayesian approach, the upper limit on the branching fraction of $η&#39;\rightarrow 4π^{0}$ is determined to be $\mathcal{B}(η&#39;\rightarrow 4π^{0})$ $< 4.94\times10^{-5}$ at the 90\% confidence level, which is a factor of six smaller than the previous experimental limit.

preprint2019arXiv

Short-Term Temporal Convolutional Networks for Dynamic Hand Gesture Recognition

The purpose of gesture recognition is to recognize meaningful movements of human bodies, and gesture recognition is an important issue in computer vision. In this paper, we present a multimodal gesture recognition method based on 3D densely convolutional networks (3D-DenseNets) and improved temporal convolutional networks (TCNs). The key idea of our approach is to find a compact and effective representation of spatial and temporal features, which orderly and separately divide task of gesture video analysis into two parts: spatial analysis and temporal analysis. In spatial analysis, we adopt 3D-DenseNets to learn short-term spatio-temporal features effectively. Subsequently, in temporal analysis, we use TCNs to extract temporal features and employ improved Squeeze-and-Excitation Networks (SENets) to strengthen the representational power of temporal features from each TCNs&#39; layers. The method has been evaluated on the VIVA and the NVIDIA Gesture Dynamic Hand Gesture Datasets. Our approach obtains very competitive performance on VIVA benchmarks with the classification accuracies of 91.54%, and achieve state-of-the art performance with 86.37% accuracy on NVIDIA benchmark.

preprint2019arXiv

Weighted infinitesimal unitary bialgebras, pre-Lie, matrix algebras and polynomial algebras

Motivated by the classical comatrix coalgebra, we introduce the concept of a Newtonian comatrix coalgebra. We construct an infinitesimal unitary bialgebra on a matrix algebra and a weighted infinitesimal unitary bialgebra on a non-commutative polynomial algebra, via two constructions of suitable coproducts. As a consequence, a Newtonian comatrix coalgebra is established. Furthermore, an infinitesimal unitary Hopf algebra, under the view of Aguiar, is constructed on a matrix algebra. By investigating the relationship between weighted infinitesimal bialgebras and pre-Lie algebras, we erect respectively a pre-Lie algebraic structure and further a new Lie algebraic structure on matrix algebras. Finally, a pre-Lie algebraic structure and a Lie algebraic structure on non-commutative polynomial algebras are also given.

preprint2018arXiv

Weighted infinitesimal unitary bialgebras on matrix algebras and weighted associative Yang-Baxter equations

We equip a matrix algebra with a weighted infinitesimal unitary bialgebraic structure, via a construction of a suitable coproduct. Furthermore, an infinitesimal unitary Hopf algebra, under the view of Aguiar, is constructed on a matrix algebra. By exploring the relationship between weighted infinitesimal bialgebras and pre-Lie algebras, we construct a pre-Lie algebraic structure and then a new Lie algebraic structure on a matrix algebra. We also introduce the weighted associative Yang-Baxter equations (AYBEs) and obtain the relationship between solutions of weighted AYBEs and weighted infinitesimal unitary bialgebras. We give a bijection between the solutions of the associative Yang-Baxter equation of weight $λ$ and Rota-Baxter operators of weight $-λ$ on matrix algebras. As a consequence, weighted quasitriangular infinitesimal unitary bialgebras are constructed, which generalize the results studied by Aguiar. Finally, We show that any weighted quasitriangular infinitesimal unitary bialgebra can be made into a dendriform algebra.

preprint2017arXiv

Dynamic contact angle hysteresis in liquid bridges

This work presents a combined experimental and theoretical study of dynamic contact angle hysteresis using liquid bridges under cyclic compression and stretching between two identical plates. Under various loading rates, contact angle hysteresis for three different liquids was measured by examination of advancing and receding angles in liquid bridges, and the capillary forces were recorded. It is found that, for a given liquid, the hysteretic phenomenon of the contact angle is more pronounced at higher loading rates. By unifying the behaviour of the three liquids, power-law correlations were proposed to describe the relationship between the dynamic contact angle and the capillary number for advancing and receding cases. It is found that the exponents of obtained power-law correlations differ from those derived through earlier methods (e.g., capillary rise), due to the different kinematics of the triple-line. The various hysteretic loops of capillary force in liquid bridges under varied cyclic loading rates were also observed, which can be captured quantitatively by the prediction of our developed model incorporating the dynamic contact angle hysteresis. These results illustrate the importance of varying triple-line geometries during dynamic wetting and dewetting processes, and warrant an improved modelling approach for higher level phenomena involving these processes, e.g., multiphase flow in porous media and liquid transfer between surfaces with moving contact lines.

preprint2016arXiv

Observation of charge density wave order in 1D mirror twin boundaries of single-layer MoSe2

Properties of two-dimensional transition metal dichalcogenides are highly sensitive to the presence of defects in the crystal structure. A detailed understanding of defect structure may lead to control of material properties through defect engineering. Here we provide direct evidence for the existence of isolated, one-dimensional charge density waves at mirror twin boundaries in single-layer MoSe2. Our low-temperature scanning tunneling microscopy/spectroscopy measurements reveal a substantial bandgap of 60 - 140 meV opening at the Fermi level in the otherwise one dimensional metallic structure. We find an energy-dependent periodic modulation in the density of states along the mirror twin boundary, with a wavelength of approximately three lattice constants. The modulations in the density of states above and below the Fermi level are spatially out of phase, consistent with charge density wave order. In addition to the electronic characterization, we determine the atomic structure and bonding configuration of the one-dimensional mirror twin boundary by means of high-resolution non-contact atomic force microscopy. Density functional theory calculations reproduce both the gap opening and the modulations of the density of states.