Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
71works
0followers
37topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

71 published item(s)

preprint2026arXiv

Continuous Latent Diffusion Language Model

Large language models have achieved remarkable success under the autoregressive paradigm, yet high-quality text generation need not be tied to a fixed left-to-right order. Existing alternatives still struggle to jointly achieve generation efficiency, scalable representation learning, and effective global semantic modeling. We propose Cola DLM, a hierarchical latent diffusion language model that frames text generation through hierarchical information decomposition. Cola DLM first learns a stable text-to-latent mapping with a Text VAE, then models a global semantic prior in continuous latent space with a block-causal DiT, and finally generates text through conditional decoding. From a unified Markov-path perspective, its diffusion process performs latent prior transport rather than token-level observation recovery, thereby separating global semantic organization from local textual realization. This design yields a more flexible non-autoregressive inductive bias, supports semantic compression and prior fitting in continuous space, and naturally extends to other continuous modalities. Through experiments spanning 4 research questions, 8 benchmarks, strictly matched ~2B-parameter autoregressive and LLaDA baselines, and scaling curves up to about 2000 EFLOPs, we identify an effective overall configuration of Cola DLM and verify its strong scaling behavior for text generation. Taken together, the results establish hierarchical continuous latent prior modeling as a principled alternative to strictly token-level language modeling, where generation quality and scaling behavior may better reflect model capability than likelihood, while also suggesting a concrete path toward unified modeling across discrete text and continuous modalities.

preprint2024arXiv

Investigation of the $ΔI = 1/2$ rule and test of CP violation through the measurement of decay asymmetry parameters in $Ξ^-$ decays

Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by factors of 4 and 1.7, respectively. The ratio of decay asymmetry parameters of $Λ\to nπ^0$ to that of $Λ\to pπ^-$, $\langle α_0 \rangle/ \langle α_{Λ-} \rangle $, is determined to be $ 0.873 \pm 0.012^{+0.011}_{-0.010}$, where the first and the second uncertainties are statistical and systematic, respectively. The ratio is smaller than unity more than $5σ$, which signifies the existence of the $ΔI = 3/2$ transition in $Λ$ for the first time. Beside, we test for CP violation in $Ξ^- \to Λπ^-$ and in $Λ\to n π^{0}$ with the best precision to date.

preprint2023arXiv

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

We present MicroCinema, a straightforward yet effective framework for high-quality and coherent text-to-video generation. Unlike existing approaches that align text prompts with video directly, MicroCinema introduces a Divide-and-Conquer strategy which divides the text-to-video into a two-stage process: text-to-image generation and image\&text-to-video generation. This strategy offers two significant advantages. a) It allows us to take full advantage of the recent advances in text-to-image models, such as Stable Diffusion, Midjourney, and DALLE, to generate photorealistic and highly detailed images. b) Leveraging the generated image, the model can allocate less focus to fine-grained appearance details, prioritizing the efficient learning of motion dynamics. To implement this strategy effectively, we introduce two core designs. First, we propose the Appearance Injection Network, enhancing the preservation of the appearance of the given image. Second, we introduce the Appearance Noise Prior, a novel mechanism aimed at maintaining the capabilities of pre-trained 2D diffusion models. These design elements empower MicroCinema to generate high-quality videos with precise motion, guided by the provided text prompts. Extensive experiments demonstrate the superiority of the proposed framework. Concretely, MicroCinema achieves SOTA zero-shot FVD of 342.86 on UCF-101 and 377.40 on MSR-VTT. See https://wangyanhui666.github.io/MicroCinema.github.io/ for video samples.

preprint2023arXiv

Search for hidden-charm tetraquark with strangeness in $e^{+}e^{-}\rightarrow K^+ D_{s}^{*-} D^{*0}+c.c.$

We report a search for a heavier partner of the recently observed $Z_{cs}(3985)^{-}$ state, denoted as $Z_{cs}^{\prime -}$, in the process $e^{+} e^{-}\rightarrow K^{+}D_{s}^{*-}D^{* 0}+c.c.$, based on $e^+e^-$ collision data collected at the center-of-mass energies of $\sqrt{s}=4.661$, 4.682 and 4.699 GeV with the BESIII detector. The $Z_{cs}^{\prime -}$ is of interest as it is expected to be a candidate for a hidden-charm and open-strange tetraquark. A partial-reconstruction technique is used to isolate $K^+$ recoil-mass spectra, which are probed for a potential contribution from $Z_{cs}^{\prime -}\to D_{s}^{*-}D^{* 0}$ ($c.c.$). We find an excess of $Z_{cs}^{\prime -}\rightarrow D_{s}^{*-}D^{*0}$ ($c.c.$) candidates with a significance of $2.1σ$, after considering systematic uncertainties, at a mass of $(4123.5\pm0.7_\mathrm{stat.}\pm4.7_\mathrm{syst.})\ \mathrm{MeV}/c^{2}$. As the data set is limited in size, the upper limits are evaluated at the 90\% confidence level on the product of the Born cross sections ($σ^{\mathrm{Born}}$) and the branching fraction ($\mathcal{B}$) of $Z_{cs}^{\prime-}\rightarrow D_{s}^{*-}D^{* 0}$, under different assumptions of the $Z_{cs}^{\prime -}$ mass from 4.120 to 4.140 MeV and of the width from 10 to 50 MeV at the three center-of-mass energies. The upper limits of $σ^{\rm Born}\cdot\mathcal{B}$ are found to be at the level of $\mathcal{O}(1)$ pb at each energy. Larger data samples are needed to confirm the $Z_{cs}^{\prime -}$ state and clarify its nature in the coming years.

preprint2022arXiv

3D ToF LiDAR in Mobile Robotics: A Review

In the past ten years, the use of 3D Time-of-Flight (ToF) LiDARs in mobile robotics has grown rapidly. Based on our accumulation of relevant research, this article systematically reviews and analyzes the use 3D ToF LiDARs in research and industrial applications. The former includes object detection, robot localization, long-term autonomy, LiDAR data processing under adverse weather conditions, and sensor fusion. The latter encompasses service robots, assisted and autonomous driving, and recent applications performed in response to public health crises. We hope that our efforts can effectively provide readers with relevant references and promote the deployment of existing mature technologies in real-world systems.

preprint2022arXiv

A Flexible Diffusion Model

Diffusion (score-based) generative models have been widely used for modeling various types of complex data, including images, audios, and point clouds. Recently, the deep connection between forward-backward stochastic differential equations (SDEs) and diffusion-based models has been revealed, and several new variants of SDEs are proposed (e.g., sub-VP, critically-damped Langevin) along this line. Despite the empirical success of the hand-crafted fixed forward SDEs, a great quantity of proper forward SDEs remain unexplored. In this work, we propose a general framework for parameterizing the diffusion model, especially the spatial part of the forward SDE. An abstract formalism is introduced with theoretical guarantees, and its connection with previous diffusion models is leveraged. We demonstrate the theoretical advantage of our method from an optimization perspective. Numerical experiments on synthetic datasets, MINIST and CIFAR10 are also presented to validate the effectiveness of our framework.

preprint2022arXiv

A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization

The distributed nonconvex optimization problem of minimizing a global cost function formed by a sum of $n$ local cost functions by using local information exchange is considered. This problem is an important component of many machine learning techniques with data parallelism, such as deep learning and federated learning. We propose a distributed primal--dual stochastic gradient descent (SGD) algorithm, suitable for arbitrarily connected communication networks and any smooth (possibly nonconvex) cost functions. We show that the proposed algorithm achieves the linear speedup convergence rate $\mathcal{O}(1/\sqrt{nT})$ for general nonconvex cost functions and the linear speedup convergence rate $\mathcal{O}(1/(nT))$ when the global cost function satisfies the Polyak--Łojasiewicz (P--Ł) condition, where $T$ is the total number of iterations. We also show that the output of the proposed algorithm with constant parameters linearly converges to a neighborhood of a global optimum. We demonstrate through numerical experiments the efficiency of our algorithm in comparison with the baseline centralized SGD and recently proposed distributed SGD algorithms.

preprint2022arXiv

Amplitude analysis and branching fraction measurement of the decay $D_{s}^{+} \to K^+π^+π^-$

Using $6.32$ fb$^{-1}$ of $e^{+}e^{-}$ collision data collected at the center-of-mass energies between 4.178 and 4.226 GeV with the BESIII detector, we perform an amplitude analysis of the decay $D^+_s \to K^+π^+π^-$ and determine the amplitudes of the various intermediate states. The absolute branching fraction of $D^+_s\to K^+π^+π^-$ is measured to be ($6.11\pm0.18_{\rm stat.}\pm0.11_{\rm syst.})\times 10^{-3}$. The branching fractions of the dominant intermediate processes $D_{s}^{+} \to K^+ρ^0, ρ^0 \to π^+π^-$ and $D_{s}^{+} \to K^*(892)^0π^+, K^*(892)^0 \to K^+π^-$ are determined to be $(1.96\pm0.19_{\rm stat.}\pm0.23_{\rm syst.})\times 10^{-3}$ and $(1.85\pm0.12_{\rm stat.}\pm0.13_{\rm syst.})\times 10^{-3}$, respectively. The intermediate resonances $f_0(500)$, $f_0(980)$, and $f_0(1370)$ are observed for the first time in this channel.

preprint2022arXiv

Amplitude analysis and branching-fraction measurement of $D_{s}^{+} \to π^{+}π^{0}η^{\prime}$

Using data collected with the BESIII detector in $e^+e^-$ collisions at center-of-mass energies between 4.178 and 4.226 GeV and corresponding to 6.32~fb$^{-1}$ of integrated luminosity, we report the amplitude analysis and branching-fraction measurement of the $D^+_s \to π^+ π^0 η^{\prime}$ decay. We find that the dominant intermediate process is $D^+_s \toρ^+ η^{\prime}$ and the significances of other resonant and nonresonant processes are all less than $3σ$. The upper limits on the branching fractions of $S$-wave and $P$-wave nonresonant components are set to $0.10\%$ and $0.74\%$ at the $90\%$ confidence level, respectively. In addition, the branching fraction of the $D^+_s \to π^+ π^0 η^{\prime}$ decay is measured to be $(6.15\pm0.25(\rm stat.)\pm0.18(\rm syst.))\%$, which receives significant contribution only from $D_s^+\to ρ^+η^{\prime}$ according to the amplitude analysis.

preprint2022arXiv

Beyond a Video Frame Interpolator: A Space Decoupled Learning Approach to Continuous Image Transition

Video frame interpolation (VFI) aims to improve the temporal resolution of a video sequence. Most of the existing deep learning based VFI methods adopt off-the-shelf optical flow algorithms to estimate the bidirectional flows and interpolate the missing frames accordingly. Though having achieved a great success, these methods require much human experience to tune the bidirectional flows and often generate unpleasant results when the estimated flows are not accurate. In this work, we rethink the VFI problem and formulate it as a continuous image transition (CIT) task, whose key issue is to transition an image from one space to another space continuously. More specifically, we learn to implicitly decouple the images into a translatable flow space and a non-translatable feature space. The former depicts the translatable states between the given images, while the later aims to reconstruct the intermediate features that cannot be directly translated. In this way, we can easily perform image interpolation in the flow space and intermediate image synthesis in the feature space, obtaining a CIT model. The proposed space decoupled learning (SDL) approach is simple to implement, while it provides an effective framework to a variety of CIT problems beyond VFI, such as style transfer and image morphing. Our extensive experiments on a variety of CIT tasks demonstrate the superiority of SDL to existing methods. The source code and models can be found at \url{https://github.com/yangxy/SDL}.

preprint2022arXiv

Compact Token Representations with Contextual Quantization for Efficient Document Re-ranking

Transformer based re-ranking models can achieve high search relevance through context-aware soft matching of query tokens with document tokens. To alleviate runtime complexity of such inference, previous work has adopted a late interaction architecture with pre-computed contextual token representations at the cost of a large online storage. This paper proposes contextual quantization of token embeddings by decoupling document-specific and document-independent ranking contributions during codebook-based compression. This allows effective online decompression and embedding composition for better search relevance. This paper presents an evaluation of the above compact token representation model in terms of relevance and space efficiency.

preprint2022arXiv

Composite Re-Ranking for Efficient Document Search with BERT

Although considerable efforts have been devoted to transformer-based ranking models for document search, the relevance-efficiency tradeoff remains a critical problem for ad-hoc ranking. To overcome this challenge, this paper presents BECR (BERT-based Composite Re-Ranking), a composite re-ranking scheme that combines deep contextual token interactions and traditional lexical term-matching features. In particular, BECR exploits a token encoding mechanism to decompose the query representations into pre-computable uni-grams and skip-n-grams. By applying token encoding on top of a dual-encoder architecture, BECR separates the attentions between a query and a document while capturing the contextual semantics of a query. In contrast to previous approaches, this framework does not perform expensive BERT computations during online inference. Thus, it is significantly faster, yet still able to achieve high competitiveness in ad-hoc ranking relevance. Finally, an in-depth comparison between BECR and other start-of-the-art neural ranking baselines is described using the TREC datasets, thereby further demonstrating the enhanced relevance and efficiency of BECR.

preprint2022arXiv

Dual Skipping Guidance for Document Retrieval with Learned Sparse Representations

This paper proposes a dual skipping guidance scheme with hybrid scoring to accelerate document retrieval that uses learned sparse representations while still delivering a good relevance. This scheme uses both lexical BM25 and learned neural term weights to bound and compose the rank score of a candidate document separately for skipping and final ranking, and maintains two top-k thresholds during inverted index traversal. This paper evaluates time efficiency and ranking relevance of the proposed scheme in searching MS MARCO TREC datasets.

preprint2022arXiv

First Observation of the Semileptonic Decay $Λ_c^+\rightarrow pK^- e^+ν_e$

Using $4.5~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data samples collected at the center-of-mass energies ranging from 4.600~GeV to 4.699~GeV with the BESIII detector at the BEPCII collider, a first study of the semileptonic decays $Λ_c^+\rightarrow pK^-e^+ν_e$, $Λ_c^+\rightarrow Λ(1520) e^+ν_e$ and $Λ_c^+\rightarrow Λ(1405) e^+ν_e$ is performed. The $Λ_c^+\rightarrow pK^-e^+ν_e$ decay is observed with a significance of $8.2σ$ and the branching fraction is measured to be $\mathcal{B}(Λ_c^+\rightarrow pK^- e^+ν_e)=(0.88\pm0.17_{\rm stat.}\pm0.07_{\rm syst.})\times 10^{-3}$. We also report evidence of $Λ_c^+\rightarrow Λ(1520)e^+ν_e$ and $Λ_c^+\rightarrow Λ(1405)e^+ν_e$ with significances of $3.3σ$ and $3.2σ$, respectively, and measure $\mathcal B(Λ^+_c\rightarrow Λ(1520)e^+ν_e)=(1.02\pm0.52_{\rm stat.}\pm0.11_{\rm syst.})\times10^{-3}$ and $\mathcal B(Λ^+_c\rightarrow Λ(1405)[\rightarrow pK^-]e^+ν_e)=(0.42\pm0.19_{\rm stat.}\pm0.04_{\rm syst.})\times10^{-3}$. Combining these with the inclusive semileptonic $Λ_c^+$ branching fraction measured by BESIII, the relative fraction is determined to be $[\mathcal{B}(Λ_c^+\rightarrow pK^-e^+ν_e)/\mathcal{B}(Λ_c^+\rightarrow X e^+ν_e)]=(2.1\pm0.4_{\rm stat.}\pm0.2_{\rm syst.})\%$, which provides a clear confirmation that semileptonic $Λ_c^+$ decays are not saturated by the $Λ\ell^+ν_{\ell}$ final state.

preprint2022arXiv

High-Order Leader-Follower Tracking Control under Limited Information Availability

Limited information availability represents a fundamental challenge for control of multi-agent systems, since an agent often lacks sensing capabilities to measure certain states of its own and can exchange data only with its neighbors. The challenge becomes even greater when agents are governed by high-order dynamics. The present work is motivated to conduct control design for linear and nonlinear high-order leader-follower multi-agent systems in a context where only the first state of an agent is measured. To address this open challenge, we develop novel distributed observers to enable followers to reconstruct unmeasured or unknown quantities about themselves and the leader and on such a basis, build observer-based tracking control approaches. We analyze the convergence properties of the proposed approaches and validate their performance through simulation.

preprint2022arXiv

Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View

From the intuitive notion of disentanglement, the image variations corresponding to different factors should be distinct from each other, and the disentangled representation should reflect those variations with separate dimensions. To discover the factors and learn disentangled representation, previous methods typically leverage an extra regularization term when learning to generate realistic images. However, the term usually results in a trade-off between disentanglement and generation quality. For the generative models pretrained without any disentanglement term, the generated images show semantically meaningful variations when traversing along different directions in the latent space. Based on this observation, we argue that it is possible to mitigate the trade-off by $(i)$ leveraging the pretrained generative models with high generation quality, $(ii)$ focusing on discovering the traversal directions as factors for disentangled representation learning. To achieve this, we propose Disentaglement via Contrast (DisCo) as a framework to model the variations based on the target disentangled representations, and contrast the variations to jointly discover disentangled directions and learn disentangled representations. DisCo achieves the state-of-the-art disentangled representation learning and distinct direction discovering, given pretrained non-disentangled generative models including GAN, VAE, and Flow. Source code is at https://github.com/xrenaa/DisCo.

preprint2022arXiv

Low Gain Avalanche Detectors with Good Time Resolution Developed by IHEP and IME for ATLAS HGTD project

This paper shows the simulation and test results of 50um thick Low Gain Avalanche Detectors (LGAD) sensors designed by the Institute of High Energy Physics (IHEP) and fabricated by the Institute of Microelectronics of the Chinese Academy of Sciences (IME). Three wafers have been produced with four different gain layer implant doses each. Different production processes, including variation in the n++ layer implant energy and carbon co-implantation were used. Test results show that the IHEP-IME sensors with the higher dose of gain layer have lower breakdown voltages and higher gain layer voltages from capacitance-voltage properties, which are consistent with the TCAD simulation. Beta test results show that the time resolution of IHEP-IME sensors is better than 35ps when operated at high voltage and the collected charges of IHEP-IME sensors are larger than 15fC before irradiation, which fulfill the required specifications of sensors before irradiations for the ATLAS HGTD project.

preprint2022arXiv

Measurement of $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ cross section and observation of a resonant structure

Based on $e^{+}e^{-}$ collision data collected by the BESIII detector at the BEPCII collider at center-of-mass energies from 2.000 to 3.080 GeV, a partial-wave analysis is performed for the process $e^{+}e^{-} \to K^{+}K^{-}π^{0}$. The Born cross section of the process $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ and its subprocesses $e^{+}e^{-} \to ϕπ^{0}$, $K^{*}(892)K$ and $K^{*}_{2}(1430)K$ are measured. The results for $e^{+}e^{-} \to K^{+}K^{-}π^{0}$ and $ϕπ^{0}$ are consistent with the BaBar measurements and with improved precision. By analyzing the cross section, of the subprocesses $e^{+}e^{-} \to$ $K^{*}(892)K$ and $K^{*}_{2}(1430)K$, a structure with mass $M_R$ = (2208 $\pm$ 19 $\pm$ 24) MeV/$c^{2}$ and width $Γ_R$ = (168 $\pm$ 24 $\pm$ 39) MeV is observed with a combined statistical significance of 7.6$σ$. The measured resonance parameters suggest it can be identified as the $ϕ(2170)$, thus the results provide valuable input to understand the internal nature of this state.

preprint2022arXiv

Measurement of the branching fraction and decay asymmetry of $Λ\to nγ$

The radiative hyperon decay $Λ\to nγ$ is studied using $(10087\pm44)\times 10^6$ $J/ψ$ events collected with the BESIII detector operating at BEPCII. The absolute branching fraction of the decay $Λ\to nγ$ is determined with a significance of 5.6$σ$ to be $[0.832\pm0.038(\rm stat.)\pm0.054(\rm syst.)]\times10^{-3}$, which lies significantly below the current PDG value. By analyzing the joint angular distribution of the decay products, the first determination of the decay asymmetry $α_γ$ is reported with a value of $-0.16\pm0.10(\rm stat.)\pm0.05(\rm syst.)$.

preprint2022arXiv

Measurement of the Cross Section for $e^{+}e^{-}\to$ hadrons at Energies from 2.2324 to 3.6710 GeV

Based on electron-positron collision data collected with the BESIII detector operating at the Beijing Electron Positron Collider II storage rings, the value of $R\equivσ(e^{+}e^{-}\to$hadrons)/$σ(e^{+}e^{-}\toμ^{+}μ^{-})$ is measured at 14 center-of-mass energies from 2.2324 to 3.6710 GeV. The resulting uncertainties are less than $3.0\%$, and are dominated by systematic uncertainties.

preprint2022arXiv

Measurement of the cross section of $e^{+}e^{-}\toηπ^{+}π^{-}$ at center-of-mass energies from 3.872 GeV to 4.700 GeV

Using data samples with an integrated luminosity of 19 fb$^{-1}$ at twenty-eight center-of-mass energies from 3.872 GeV to 4.700 GeV collected with the BESIII detector at the BEPCII electron--positron collider, the process $e^{+}e^{-}\toηπ^{+}π^{-}$ and the intermediate process $e^{+}e^{-}\toηρ^{0}$ are studied for the first time. The Born cross sections are measured. No significant resonance structure is observed in the cross section lineshape.

preprint2022arXiv

Measurement of the total and leptonic decay widths of the $J/ψ$ resonance with an energy scan method at BESIII

Using $e^+e^-$ annihilation data sets collected with the BESIII detector, we measure the cross sections of the processes $e^+e^- \to e^+e^-$ and $e^+e^- \to μ^+μ^-$ at fifteen center-of-mass energy points in the vicinity of the $J/ψ$ resonance. By a simultaneous fit to the measured, center-of-mass energy dependent cross sections of the two processes, the combined quantities $Γ_{ee} Γ_{ee} / Γ_{\rm tot}$ and $Γ_{ee} Γ_{μμ} / Γ_{\rm tot}$ are determined to be ($0.346 \pm 0.009$) and ($0.335 \pm 0.006$) keV, respectively, where $Γ_{ee}$, $Γ_{μμ}$, and $Γ_{\rm tot}$ are the electronic, muonic, and total decay widths of the $J/ψ$ resonance, respectively. Using the resultant $Γ_{ee} Γ_{μμ} / Γ_{\rm tot}$ and $Γ_{ee} Γ_{ee} / Γ_{\rm tot}$, the ratio $Γ_{ee} / Γ_{μμ}$ is calculated to be $1.031 \pm 0.015$, which is consistent with the expectation of lepton universality within about two standard deviations. Assuming lepton universality and using the branching fraction of the $J/ψ$ leptonic decay measured by BESIII in 2013, $Γ_{\rm tot}$ and $Γ_{ll}$ are determined to be ($93.0 \pm 2.1$) and ($5.56 \pm 0.11$) keV, respectively, where $Γ_{ll}$ is the average leptonic decay width of the $J/ψ$ resonance.

preprint2022arXiv

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00dB on DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks including the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The rank of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered. And the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all of the five metrics mentioned in the description of the challenge including runtime, parameter count, FLOPs, activations, and memory consumption were considered. Similar to sub-track one, the rankings of five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution.

preprint2022arXiv

Observation of $η_c(2S) \to 3(π^+π^-)$ and measurements of $χ_{cJ} \to 3(π^+π^-)$ in $ψ(3686)$ radiative transitions

The hadronic decay $η_c(2S) \to 3(π^+π^-)$ is observed with a statistical significance of 9.3 standard deviations using $(448.1\pm2.9)\times10^6$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider. The measured mass and width of $η_c(2S)$ are $(3643.4 \pm 2.3 (\rm stat.) \pm 4.4 (\rm syst.))$ MeV/$c^2$ and $(19.8 \pm 3.9 (\rm stat.) \pm 3.1 (\rm syst.))$ MeV, respectively, which are consistent with the world average values within two standard deviations. The product branching fraction $\mathcal{B}[ψ(3686)\to γη_c(2S)]\times\mathcal{B}[η_c(2S)\to3(π^+π^-)]$ is measured to be $(9.2 \pm 1.0 (\rm stat.) \pm 0.9 (\rm syst.))\times10^{-6}$. Using $\mathcal{B}[ψ(3686)\to γη_c(2S)]=(7.0^{+3.4}_{-2.5})\times10^{-4}$, we obtain $\mathcal{B}[η_c(2S) \to 3(π^+π^-)] = (1.31 \pm 0.15 (\rm stat.) \pm 0.13 (\rm syst.)(^{+0.64}_{-0.47}) (\rm extr))\times10^{-2}$, where the third uncertainty is from $\mathcal{B}[ψ(3686) \to γη_c(2S)]$. We also measure the $χ_{cJ} \to 3(π^+π^-)$ ($J=0, 1, 2$) decays via $ψ(3686) \to γχ_{cJ}$ transitions. The branching fractions are $\mathcal{B}[χ_{c0} \to 3(π^+π^-)] = (2.080\pm0.006 (\rm stat.)\pm0.068 (\rm syst.))\times10^{-2}$, $\mathcal{B}[χ_{c1} \to 3(π^+π^-)] = (1.092\pm0.004 (\rm stat.)\pm0.035 (\rm syst.))\times10^{-2}$, and $\mathcal{B}[χ_{c2} \to 3(π^+π^-)] = (1.565\pm0.005 (\rm stat.)\pm0.048 (\rm syst.))\times10^{-2}$.

preprint2022arXiv

Observation of the double Dalitz decay $η'\to e^+e^-e^+e^-$

Based on $(10087 \pm 44)\times10^6$ $J/ψ$ events collected with the BESIII detector at BEPCII, the double Dalitz decay $η'\to e^+e^-e^+e^-$ is observed for the first time via the $J/ψ\toγη'$ decay process. The significance is found to be 5.7$σ$ with systematic uncertainties taken into consideration. Its branching fraction is determined to be $\mathcal{B}(η'\to e^+ e^- e^+ e^-) =(4.5\pm1.0(\mathrm{stat.})\pm0.5(\mathrm{sys.})) \times 10^{-6}$.

preprint2022arXiv

Quench dynamics of Bose-Einstein condensates in boxlike traps

We investigate the nonequilibrium dynamics of two-dimensional Bose-Einstein condensates in boxlike traps with power-law potential boundaries by quenching the interatomic interactions. For both concave and convex potentials, we show that ring dark solitons can be excited during the quench dynamics. The modulation strength of the quench and the steepness of the boundary are two main factors affecting the evolution of the system. Five dynamic regimes are identified concerning the number of ring dark solitons excited in the condensate. For the situation without ring dark soliton excitations, the condensate undergoes damped radius oscillation. As far as the appearance of ring dark solitons, interesting structures arise from their decay. For the concave potential, the excitation patterns show a nested structure of vortex-antivortex pairs. For the convex potential, on the other hand, the dynamic excitation patterns display richer structures that have multiple transport behaviors.

preprint2022arXiv

Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors

A key challenge of real-world image super-resolution (SR) is to recover the missing details in low-resolution (LR) images with complex unknown degradations (e.g., downsampling, noise and compression). Most previous works restore such missing details in the image space. To cope with the high diversity of natural images, they either rely on the unstable GANs that are difficult to train and prone to artifacts, or resort to explicit references from high-resolution (HR) images that are usually unavailable. In this work, we propose Feature Matching SR (FeMaSR), which restores realistic HR images in a much more compact feature space. Unlike image-space methods, our FeMaSR restores HR images by matching distorted LR image {\it features} to their distortion-free HR counterparts in our pretrained HR priors, and decoding the matched features to obtain realistic HR images. Specifically, our HR priors contain a discrete feature codebook and its associated decoder, which are pretrained on HR images with a Vector Quantized Generative Adversarial Network (VQGAN). Notably, we incorporate a novel semantic regularization in VQGAN to improve the quality of reconstructed images. For the feature matching, we first extract LR features with an LR encoder consisting of several Swin Transformer blocks and then follow a simple nearest neighbour strategy to match them with the pretrained codebook. In particular, we equip the LR encoder with residual shortcut connections to the decoder, which is critical to the optimization of feature matching loss and also helps to complement the possible feature matching errors. Experimental results show that our approach produces more realistic HR images than previous methods. Codes are released at \url{https://github.com/chaofengc/FeMaSR}.

preprint2022arXiv

Search for $X(3872)\toπ^0χ_{c0}$ and $X(3872)\toππχ_{c0}$ at BESIII

Using 9.9 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at center-of-mass energies between 4.15 and 4.30 GeV, we search for the processes $e^+e^-\toγX(3872)$ with $X(3872)\rightarrowπ^0χ_{c0}$ and $X(3872)\rightarrowππχ_{c0}$. Depending on the fitting model, the statistical significance for $X(3872)\toπ^0χ_{c0}$ ranges from 1.3$σ$ to 2.8$σ$. We set upper limits (at 90\% C.L.) of $\frac{\mathcal{B}(X(3872)\rightarrowπ^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<3.6$, $\frac{\mathcal{B}(X(3872)\rightarrowπ^+π^-χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<0.68$, and $\frac{\mathcal{B}(X(3872)\rightarrowπ^0π^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^+π^-J/ψ)}<1.7$. Combined with the BESIII measurement of $X(3872)\toπ^0χ_{c1}$, we also set an upper limit of $\frac{\mathcal{B}(X(3872)\rightarrowπ^0χ_{c0})}{\mathcal{B}(X(3872)\toπ^0χ_{c1})}<4.4$.

preprint2022arXiv

Search for baryon and lepton number violation decay $D^{\pm}\to n(\bar{n})e^{\pm}$

Using a data set of electron-positron collisions corresponding to an integrated luminosity of ${\rm 2.93~fb^{-1}}$ taken with the BESIII detector at a center-of-mass energy of 3.773 GeV, a search for the baryon ($B$) and lepton ($L$) number violating decays $D^{\pm}\to n(\bar{n})e^{\pm}$ is performed. No signal is observed and the upper limits on the branching fractions at the $90\%$ confidence level are set to be $1.43\times10^{-5}$ for the decays $D^{+(-)}\to \bar{n}(n)e^{+(-)}$ with $Δ|B-L|=0$, and $2.91\times10^{-5}$ for the decays $D^{+(-)}\to n(\bar{n})e^{+(-)}$ with $Δ|B-L|=2$ , where $Δ|B-L|$ denotes the change in the difference between baryon and lepton numbers.

preprint2022arXiv

Simulation of the 4H-SiC Low Gain Avalanche Diode

Silicon Carbide device (4H-SiC) has potential radiation hardness, high saturated carrier velocity and low temperature sensitivity theoretically. The Silicon Low Gain Avalanche Diode (LGAD) has been verified to have excellent time performance. Therefore, the 4H-SiC LGAD is introduced in this work for application to detect the Minimum Ionization Particles (MIPs). We provide guidance to determine the thickness and doping level of the gain layer after an analytical analysis. The gain layer thickness $d_{gain}=0.5~μm$ is adopted in our design. We design two different types of 4H-SiC LGAD which have two types electric field, and the corresponding leakage current, capacitance and gain are simulated by TCAD tools. Through analysis of the simulation results, the advantages and disadvantages are discussed for two types of 4H-SiC LGAD.

preprint2022arXiv

Space-borne atom interferometric gravitational wave detections. Part II. Dark sirens and finding the one

In this paper, we investigate the potential of dark sirens by the space-borne atom interferometric gravitational-wave detectors to probe the Hubble constant. In the mid-frequency band, the sources live a long time. The motion of a detector around the Sun as well as in Earth orbit would induce large Doppler and reorientation effects, providing a precise angular resolution. Such precise localization for the GW sources makes it possible to observe the dark sirens with only one potential host galaxy, which are dubbed &#34;golden dark sirens&#34;. We construct the catalogs of golden dark sirens and estimate that there are around 79 and 35 golden dark sirens of binary neutron stars (BNS) and binary black holes (BBH) that would be pass the detection threshold of AEDGE in 5 years. Our results show that with 5, 10, and all 79 golden dark BNS tracked by AEDGE one can constrain $H_0$ at 5.5\%, 4.1\%, and 1.8\% precision levels. With 5, 10, and all 35 golden dark BBH one can constrain $H_0$ at 2.2\%, 1.8\%, and 1.5\% precision levels, respectively. It suggests that only 5-10 golden dark BBH by AEDGE are sufficient to arbitrate the current tension between local and high-$z$ measurements of $H_0$.

preprint2022arXiv

Test-time Batch Normalization

Deep neural networks often suffer the data distribution shift between training and testing, and the batch statistics are observed to reflect the shift. In this paper, targeting of alleviating distribution shift in test time, we revisit the batch normalization (BN) in the training process and reveals two key insights benefiting test-time optimization: $(i)$ preserving the same gradient backpropagation form as training, and $(ii)$ using dataset-level statistics for robust optimization and inference. Based on the two insights, we propose a novel test-time BN layer design, GpreBN, which is optimized during testing by minimizing Entropy loss. We verify the effectiveness of our method on two typical settings with distribution shift, i.e., domain generalization and robustness tasks. Our GpreBN significantly improves the test-time performance and achieves the state of the art results.

preprint2022arXiv

Thermodynamic limit and boundary energy of the spin-1 Heisenberg chain with non-diagonal boundary fields

The thermodynamic limit and boundary energy of the isotropic spin-1 Heisenberg chain with non-diagonal boundary fields are studied. The finite size scaling properties of the inhomogeneous term in the $ T-Q $ relation at the ground state are calculated by the density matrix renormalization group. Based on our findings, the boundary energy of the system in the thermodynamic limit can be obtained from Bethe ansatz equations of a related model with parallel boundary fields. These results can be generalized to the $SU(2)$ symmetric high spin Heisenberg model directly.

preprint2022arXiv

Time Resolution of the 4H-SiC PIN Detector

We address the determination of the time resolution for the $\rm 100~μm$ 4H-SiC PIN detectors fabricated by Nanjing University (NJU). The time response to $\rm β$ particles from a $\rm ^{90}$Sr source is investigated for the detection of the minimum ionizing particles (MIPs). We study the influence of different reverse voltages, which correspond to different carrier velocities and device sizes, and how this correlates with the detector capacitance. We determine a time resolution $\rm (94\pm1)~ps$ for $\rm 100~μm$ 4H-SiC PIN detector. A fast simulation software, termed RASER (RAdiation SEmiconductoR), is developed, and validated by comparing the waveform obtained from simulated and measured data. The simulated time resolution is $\rm (73\pm 1)~ps$ after considering the intrinsic leading contributions of the detector to time resolution.

preprint2022arXiv

Timing performance simulation for 3D 4H-SiC detector

To meet high radiation challenge for detectors in future high-energy physics, a novel 3D 4H-SiC detector was investigated. SiC detectors could potentially operate in radiation harsh and room temperature environment because of its high thermal conductivity and high atomic displacement threshold energy. 3D structure, which decouples thickness and distance between electrodes, further improves timing performance and radiation hardness of the detector. We developed a simulation software - RASER (RAdiation SEmiconductoR) to simulate the time resolution of planar and 3D 4H-SiC detectors with different parameters and structures, and the reliability of the software is verified by comparing time resolution results of simulation with data. The rough time resolution of 3D 4H-SiC detector was estimated, and the simulation parameters could be used as guideline to 3D 4H-SiC detector design and optimization.

preprint2022arXiv

Towards Building A Group-based Unsupervised Representation Disentanglement Framework

Disentangled representation learning is one of the major goals of deep learning, and is a key step for achieving explainable and generalizable models. A well-defined theoretical guarantee still lacks for the VAE-based unsupervised methods, which are a set of popular methods to achieve unsupervised disentanglement. The Group Theory based definition of representation disentanglement mathematically connects the data transformations to the representations using the formalism of group. In this paper, built on the group-based definition and inspired by the n-th dihedral group, we first propose a theoretical framework towards achieving unsupervised representation disentanglement. We then propose a model, based on existing VAE-based methods, to tackle the unsupervised learning problem of the framework. In the theoretical framework, we prove three sufficient conditions on model, group structure, and data respectively in an effort to achieve, in an unsupervised way, disentangled representation per group-based definition. With the first two of the conditions satisfied and a necessary condition derived for the third one, we offer additional constraints, from the perspective of the group-based definition, for the existing VAE-based models. Experimentally, we train 1800 models covering the most prominent VAE-based methods on five datasets to verify the effectiveness of our theoretical framework. Compared to the original VAE-based methods, these Groupified VAEs consistently achieve better mean performance with smaller variances.

preprint2021arXiv

A Driving Behavior Recognition Model with Bi-LSTM and Multi-Scale CNN

In autonomous driving, perceiving the driving behaviors of surrounding agents is important for the ego-vehicle to make a reasonable decision. In this paper, we propose a neural network model based on trajectories information for driving behavior recognition. Unlike existing trajectory-based methods that recognize the driving behavior using the hand-crafted features or directly encoding the trajectory, our model involves a Multi-Scale Convolutional Neural Network (MSCNN) module to automatically extract the high-level features which are supposed to encode the rich spatial and temporal information. Given a trajectory sequence of an agent as the input, firstly, the Bi-directional Long Short Term Memory (Bi-LSTM) module and the MSCNN module respectively process the input, generating two features, and then the two features are fused to classify the behavior of the agent. We evaluate the proposed model on the public BLVD dataset, achieving a satisfying performance.

preprint2021arXiv

A Risk-Sensitive Task Offloading Strategy for Edge Computing in Industrial Internet of Things

Edge computing has become one of the key enablers for ultra-reliable and low-latency communications in the industrial Internet of Things in the fifth generation communication systems, and is also a promising technology in the future sixth generation communication systems. In this work, we consider the application of edge computing to smart factories for mission-critical task offloading through wireless links. In such scenarios, although high end-to-end delays from the generation to completion of tasks happen with low probability, they may incur severe casualties and property loss, and should be seriously treated. Inspired by the risk management theory widely used in finance, we adopt the Conditional Value at Risk to capture the tail of the delay distribution. An upper bound of the Conditional Value at Risk is derived through analysis of the queues both at the devices and the edge computing servers. We aim to find out the optimal offloading policy taking into consideration both the average and the worst case delay performance of the system. Given that the formulated optimization problem is a non-convex mixed integer non-linear programming problem, a decomposition into sub-problems is performed and a two-stage heuristic algorithm is proposed. Simulation results validate our analysis and indicate that the proposed algorithm can reduce the risk in both the queuing and end-to-end delay.

preprint2021arXiv

Acoustically Manipulating Internal Structure of Disk-in-Sphere Endoskeletal Droplets

Manipulation of micro/nano particles has been well studied and demonstrated by optical, electromagnetic, and acoustic approaches, or their combinations. Manipulation of internal structure of droplet/particle is rarely explored and remains challenging due to its complicated nature. Here we demonstrated the manipulation of internal structure of disk-in-sphere endoskeletal droplets using acoustic wave for the first time. We developed a model to investigate the physical mechanisms behind this novel phenomenon. Theoretical analysis of the acoustic interactions indicated that these assembly dynamics arise from a balance of the primary and secondary radiation forces. Additionally, the disk orientation was found to change with acoustic driving frequency, which allowed on-demand, reversible adjusting disk orientations with respect to the substrate. This novel dynamic behavior leads to unique reversible arrangements of the endoskeletal droplets and their internal architecture, which may provide a new avenue for directed assembly of novel hierarchical colloidal architectures and intracellular organelles or intra-organoid structures.

preprint2021arXiv

Active Anomaly Detection with Switching Cost

The problem of detecting a single anomalous process among multiple independent processes is considered. Under a constraint on the number of processes that can be probed simultaneously, the decision maker should decide which processes to probe at each time and when to terminate the probing. Compared with previous work considering only the observation costs, the switching costs of switchings across processes also need to be taken into account in many practical scenarios. The objective is an active inference strategy that minimizes the Bayesian risk taking into account of the sample complexity, switching cost, as well as detection errors. Based on the framework of sequential design of experiments, we propose a low-complexity, low-switching deterministic policy for two scenarios where the total switching cost is negligible and the total switching cost is comparable to the total observation cost. We show that the proposed algorithm is asymptotically optimal in the former scenario and is order optimal in the latter scenario. Simulation results demonstrate strong performance in the finite regime for both scenarios.

preprint2021arXiv

Cross section measurements of the $e^+e^-\to D^{*+}D^{*-}$ and $e^+e^-\to D^{*+}D^{-}$ processes at center-of-mass energies from 4.085 to 4.600 GeV

The Born cross sections of the $e^+e^-\to D^{*+}D^{*-}$ and $e^+e^-\to D^{*+}D^{-}$ processes are measured using $e^+e^-$ collision data collected with the BESIII experiment at center-of-mass energies from 4.085 to 4.600 GeV, corresponding to an integrated luminosity of $15.7~{\rm fb}^{-1}$. The results are consistent with and more precise than the previous measurements by the Belle, Babar and CLEO collaborations. The measurements are essential for understanding the nature of vector charmonium and charmonium-like states.

preprint2021arXiv

Exact solutions of the $C_n$ quantum spin chain

We study the exact solutions of quantum integrable model associated with the $C_n$ Lie algebra, with either a periodic or an open one with off-diagonal boundary reflections, by generalizing the nested off-diagonal Bethe ansatz method. Taking the $C_3$ as an example we demonstrate how the generalized method works. We give the fusion structures of the model and provide a way to close fusion processes. Based on the resulted operator product identities among fused transfer matrices and some necessary additional constraints such as asymptotic behaviors and relations at some special points, we obtain the eigenvalues of transfer matrices and parameterize them as homogeneous $T-Q$ relations in the periodic case or inhomogeneous ones in the open case. We also give the exact solutions of the $C_n$ model with an off-diagonal open boundary condition. The method and results in this paper can be generalized to other high rank integrable models associated with other Lie algebras.

preprint2021arXiv

g-SiC6 Monolayer: A New Graphene-like Dirac Cone Material with a High Fermi Velocity

Two-dimensional (2D) materials with Dirac cones have been intrigued by many unique properties, i.e., the effective masses of carriers close to zero and Fermi velocity of ultrahigh, which yields a great possibility in high-performance electronic devices. In this work, using first-principles calculations, we have predicted a new Dirac cone material of silicon carbide with the new stoichiometries, named g-SiC6 monolayer, which is composed of sp2 hybridized with a graphene-like structure. The detailed calculations have revealed that g-SiC6 has outstanding dynamical, thermal, and mechanical stabilities, and the mechanical and electronic properties are still isotropic. Of great interest is that the Fermi velocity of g-SiC6 monolayer is the highest in silicon carbide Dirac materials until now. The Dirac cone of the g-SiC6 is controllable by an in-plane uniaxial strain and shear strain, which is promised to realize a direct application in electronics and optoelectronics. Moreover, we found that new stoichiometries AB6 (A, B = C, Si, and Ge) compounds with the similar SiC6 monolayer structure are both dynamics stable and possess Dirac cones, and their Fermi velocity was also calculated in this paper. Given the outstanding properties of those new types of silicon carbide monolayer, which is a promising 2D material for further exploring the potential applications.

preprint2021arXiv

Maximizing Marginal Fairness for Dynamic Learning to Rank

Rankings, especially those in search and recommendation systems, often determine how people access information and how information is exposed to people. Therefore, how to balance the relevance and fairness of information exposure is considered as one of the key problems for modern IR systems. As conventional ranking frameworks that myopically sorts documents with their relevance will inevitably introduce unfair result exposure, recent studies on ranking fairness mostly focus on dynamic ranking paradigms where result rankings can be adapted in real-time to support fairness in groups (i.e., races, genders, etc.). Existing studies on fairness in dynamic learning to rank, however, often achieve the overall fairness of document exposure in ranked lists by significantly sacrificing the performance of result relevance and fairness on the top results. To address this problem, we propose a fair and unbiased ranking method named Maximal Marginal Fairness (MMF). The algorithm integrates unbiased estimators for both relevance and merit-based fairness while providing an explicit controller that balances the selection of documents to maximize the marginal relevance and fairness in top-k results. Theoretical and empirical analysis shows that, with small compromises on long list fairness, our method achieves superior efficiency and effectiveness comparing to the state-of-the-art algorithms in both relevance and fairness for top-k rankings.

preprint2021arXiv

Multiplicity and asymptotics of standing waves for the energy critical half-wave

In this paper, we consider the multiplicity and asymptotics of standing waves with prescribed mass $\int_{{\mathbb{R}^N}} {{u}^2}=a^2$ to the energy critical half-wave \begin{equation}\label{eqA0.1} \sqrt{-Δ}u=λu+μ|u|^{q-2} u+|u|^{2^*-2}u,\ \ u\in H^{1/2}(\R^N), \end{equation} where $N\!\geq\! 2$, $a\!>\!0$, $q \!\in\!\big(2,2+\frac{2}{N}\big)$, $2^*\!=\!\frac{2N}{N-1}$ and $λ\!\in\!\R$ appears as a Lagrange multiplier. We show that \eqref{eqA0.1} admits a ground state $u_a$ and an excited state $v_a$, which are characterised by a local minimizer and a mountain-pass critical point of the corresponding energy functional. Several asymptotic properties of $\{u_a\}$, $\{v_a\}$ are obtained and it is worth pointing out that we get a precise description of $\{u_a\}$ as $a\!\to\! 0^+$ without needing any uniqueness condition on the related limit problem. The main contribution of this paper is to extend the main results in J. Bellazzini et al. [Math. Ann. 371 (2018), 707-740] from energy subcritical to energy critical case. Furthermore, these results can be extended to the general fractional nonlinear Schrödinger equation with Sobolev critical exponent, which generalize the work of H. J. Luo-Z. T. Zhang [Calc. Var. Partial Differ. Equ. 59 (2020)] from energy subcritical to energy critical case.

preprint2021arXiv

Performance Analysis for Correlated AoI and Energy Efficiency in Heterogeneous CR-IoT System

We consider a cognitive radio based Internet of Things (CR-IoT) system where the secondary IoT device (SD) accesses the licensed channel during the transmission vacancies of the primary IoT device (PD). We focus on the impact of the IoT devices&#39; heterogeneous traffic pattern on the energy efficiency and on the age of information (AoI) performance of the SD. We first derive closed-form expressions of the energy efficiency and the average AoI, and subsequently explore their convexity and monotonicity to the transmit power. Following these characterizations, an optimal transmit power optimization algorithm (TPOA) is proposed for the SD to maximize the energy efficiency while maintaining the average AoI under a predefined threshold. Numerical results verify the different preferences of the SD toward different PD traffic patterns, and provides insights into the tradeoff between the energy efficiency and the average AoI.

preprint2021arXiv

SME: ReRAM-based Sparse-Multiplication-Engine to Squeeze-Out Bit Sparsity of Neural Network

Resistive Random-Access-Memory (ReRAM) crossbar is a promising technique for deep neural network (DNN) accelerators, thanks to its in-memory and in-situ analog computing abilities for Vector-Matrix Multiplication-and-Accumulations (VMMs). However, it is challenging for crossbar architecture to exploit the sparsity in the DNN. It inevitably causes complex and costly control to exploit fine-grained sparsity due to the limitation of tightly-coupled crossbar structure. As the countermeasure, we developed a novel ReRAM-based DNN accelerator, named Sparse-Multiplication-Engine (SME), based on a hardware and software co-design framework. First, we orchestrate the bit-sparse pattern to increase the density of bit-sparsity based on existing quantization methods. Second, we propose a novel weigh mapping mechanism to slice the bits of a weight across the crossbars and splice the activation results in peripheral circuits. This mechanism can decouple the tightly-coupled crossbar structure and cumulate the sparsity in the crossbar. Finally, a superior squeeze-out scheme empties the crossbars mapped with highly-sparse non-zeros from the previous two steps. We design the SME architecture and discuss its use for other quantization methods and different ReRAM cell technologies. Compared with prior state-of-the-art designs, the SME shrinks the use of crossbars up to 8.7x and 2.1x using Resent-50 and MobileNet-v2, respectively, with less than 0.3% accuracy drop on ImageNet.

preprint2021arXiv

Surface energy of the one-dimensional supersymmetric $t-J$ model with general integrable boundary terms in the antiferromagnetic sector

In this paper, we study the surface energy of the one-dimensional supersymmetric $t-J$ model with unparallel boundary magnetic fields, which is a typical $U(1)$-symmetry broken quantum integrable strongly correlated electron system. It is shown that at the ground state, the contribution of inhomogeneous term in the Bethe ansatz solution of eigenvalues of transfer matrix satisfies the finite size scaling law $L^β$ where $β<0$. Based on it, the physical quantities of the system in the thermodynamic limit are calculated. We obtain the patterns of Bethe roots and the analytical expressions of density of states, ground state energy and surface energy. We also find that there exist the stable boundary bound states if the boundary fields satisfy some constraints.

preprint2021arXiv

Zeroth-Order Algorithms for Stochastic Distributed Nonconvex Optimization

In this paper, we consider a stochastic distributed nonconvex optimization problem with the cost function being distributed over $n$ agents having access only to zeroth-order (ZO) information of the cost. This problem has various machine learning applications. As a solution, we propose two distributed ZO algorithms, in which at each iteration each agent samples the local stochastic ZO oracle at two points with a time-varying smoothing parameter. We show that the proposed algorithms achieve the linear speedup convergence rate $\mathcal{O}(\sqrt{p/(nT)})$ for smooth cost functions under the state-dependent variance assumptions which are more general than the commonly used bounded variance and Lipschitz assumptions, and $\mathcal{O}(p/(nT))$ convergence rate when the global cost function additionally satisfies the Polyak--Łojasiewicz (P--Ł) condition in addition, where $p$ and $T$ are the dimension of the decision variable and the total number of iterations, respectively. To the best of our knowledge, this is the first linear speedup result for distributed ZO algorithms, which enables systematic processing performance improvements by adding more agents. We also show that the proposed algorithms converge linearly under the relative bounded second moment assumptions and the P--Ł condition. We demonstrate through numerical experiments the efficiency of our algorithms on generating adversarial examples from deep neural networks in comparison with baseline and recently proposed centralized and distributed ZO algorithms.

preprint2020arXiv

A new measure of thermal micro-behavior for the AdS black hole

Inspired by the hypothesis of the black hole molecule, with the help of the Hawking temperature, entropy and the thermodynamics curvature of the black hole, we propose a new measure of the relation between the interaction and the thermal motion of molecules of the AdS black hole as a preliminary and coarse-grained description. The measure enables us to introduce a dimensionless ratio to characterize this relation and show that there is indeed competition between the interaction among black hole molecules and their thermal motion. For the charged AdS black hole, below the critical dimensionless pressure, there are three transitions between the interaction state and the thermal motion state. While above the critical dimensionless pressure, there is only one transition. For the Schwarzschild-AdS black hole and five-dimensional Gauss-Bonnet AdS black hole, there is always a transition between the interaction state and the thermal motion state.

preprint2020arXiv

Analysis of Multivariate Scoring Functions for Automatic Unbiased Learning to Rank

Leveraging biased click data for optimizing learning to rank systems has been a popular approach in information retrieval. Because click data is often noisy and biased, a variety of methods have been proposed to construct unbiased learning to rank (ULTR) algorithms for the learning of unbiased ranking models. Among them, automatic unbiased learning to rank (AutoULTR) algorithms that jointly learn user bias models (i.e., propensity models) with unbiased rankers have received a lot of attention due to their superior performance and low deployment cost in practice. Despite their differences in theories and algorithm design, existing studies on ULTR usually use uni-variate ranking functions to score each document or result independently. On the other hand, recent advances in context-aware learning-to-rank models have shown that multivariate scoring functions, which read multiple documents together and predict their ranking scores jointly, are more powerful than uni-variate ranking functions in ranking tasks with human-annotated relevance labels. Whether such superior performance would hold in ULTR with noisy data, however, is mostly unknown. In this paper, we investigate existing multivariate scoring functions and AutoULTR algorithms in theory and prove that permutation invariance is a crucial factor that determines whether a context-aware learning-to-rank model could be applied to existing AutoULTR framework. Our experiments with synthetic clicks on two large-scale benchmark datasets show that AutoULTR models with permutation-invariant multivariate scoring functions significantly outperform those with uni-variate scoring functions and permutation-variant multivariate scoring functions.

preprint2020arXiv

Coherence of an extended central spin model with a coupled spin bath

We study an extended central spin model with an isotropic nearest-neighbour spin-exchange interaction among the bath spins. The system is controllable by external magnetic fields applied on the central spin and the bath, respectively. We construct a basis set of the Hilbert space and express the Hamiltonian of the extended model into a series of $2\times2$ block matrices to obtain the exact solution of the model successfully. Therefrom, the coherence and the spin polarization of the central spin are investigated. We find that if the couplings among the bath spins are antiferromagnetic, the central spin has good coherence and polarization at low temperatures. Moreover, the decoherence is greatly suppressed at the critical point where the strengthes of the central and the bath magnetic field take the same value. A dephasing phenomenon is identified when the initial state of the central spin is unpolarized.

preprint2020arXiv

Commonsense Evidence Generation and Injection in Reading Comprehension

Human tackle reading comprehension not only based on the given context itself but often rely on the commonsense beyond. To empower the machine with commonsense reasoning, in this paper, we propose a Commonsense Evidence Generation and Injection framework in reading comprehension, named CEGI. The framework injects two kinds of auxiliary commonsense evidence into comprehensive reading to equip the machine with the ability of rational thinking. Specifically, we build two evidence generators: the first generator aims to generate textual evidence via a language model; the other generator aims to extract factual evidence (automatically aligned text-triples) from a commonsense knowledge graph after graph completion. Those evidences incorporate contextual commonsense and serve as the additional inputs to the model. Thereafter, we propose a deep contextual encoder to extract semantic relationships among the paragraph, question, option, and evidence. Finally, we employ a capsule network to extract different linguistic units (word and phrase) from the relations, and dynamically predict the optimal option based on the extracted units. Experiments on the CosmosQA dataset demonstrate that the proposed CEGI model outperforms the current state-of-the-art approaches and achieves the accuracy (83.6%) on the leaderboard.

preprint2020arXiv

Multiplicity, asymptotics and stability of standing waves for nonlinear Schrödinger equation with rotation

In this article, we study the multiplicity, asymptotics and stability of standing waves with prescribed mass $c>0$ for nonlinear Schrödinger equation with rotation in the mass-supercritical regime arising in Bose-Einstein condensation. Under suitable restriction on the rotation frequency, by searching critical points of the corresponding energy functional on the mass-sphere, we obtain a local minimizer $u_c$ and a mountain pass solution $\hat{u}_c$. %under suitable assumptions on the related parameters. Furthermore, we show that $u_c$ is a ground state for small mass $c>0$ and describe a mass collapse behavior of the minimizers as $c\to 0$, while $\hat{u}_c$ is an excited state. Finally, we prove that the standing wave associated with $u_c$ is stable. Notice that the pioneering works \cite{aMsC,shYZ} imply that finite time blow-up of solutions to this model occurred in the mass-supercritical setting, therefore, we in the present paper obtain a new stability result. The main contribution of this paper is to extend the main results in \cite{JeSp,gYlW} concerning the same model from mass-subcritical and mass-critical regimes to mass-supercritical regime, where the physically most relevant case is covered.

preprint2020arXiv

Multiplier Hopf coquasigroups with faithful integrals

Let $A$ be a multiplier Hopf coquasigroup. If the faithful integrals exist, then they are unique up to scalar. Furthermore, if $A$ is of discrete type, then its integral duality $\widehat{A}$ is a Hopf quasigroup, and the biduality $\widehat{\widehat{A}}$ is isomorphic to the original $A$ as multiplier Hopf coquasigroups. This biduality theorem also holds for a class of Hopf quasigroups with faithful integrals.

preprint2020arXiv

New probe of gravity: strongly lensed gravitational wave multi-messenger approach

Strong gravitational lensing by galaxies provides us with a unique opportunity to understand the nature of gravity on galactic and extra-galactic scales. In this paper, we propose a new multimessenger approach using data from both gravitational wave (GW) and the corresponding electromagnetic (EM) counterpart to infer the constraint of the modified gravity (MG) theory denoted by the scale dependent phenomenological parameter. To demonstrate the robustness of this approach, we calculate the time-delay predictions by choosing various values of the phenomenological parameters and then compare them with that from the general relativity (GR). For the third generation ground-based GW observatory, with one typical strongly lensed GW+EM event, and assuming that the dominated error from the stellar velocity dispersions is 5\%, the GW time-delay data can distinguish an 18\% MG effect on a scale of tens of kiloparsecs with a $68\%$ confidence level. Assuming GR and a Singular Isothermal Sphere mass model, there exists a simplified consistency relationship between time-delay and imaging data. This relationship does not require for the velocity dispersion measurement, and hence can avoid major uncertainties. By using this relationship, the multimessenger approach is able to distinguish an $8\%$ MG effect. Our results show that the GW multimessenger approach can play an important role in revealing the nature of gravity on the galactic and extra-galactic scales.

preprint2020arXiv

Security of Medical Cyber-physical Systems: An Empirical Study on Imaging Devices

Recent years have witnessed a boom of connected medical devices, which brings security issues in the meantime. Medical imaging devices, an essential part of medical cyber-physical systems, play a vital role in modern hospitals and are often life-critical. However, security and privacy issues in these medical cyber-physical systems are sometimes ignored. In this paper, we perform an empirical study on imaging devices to analyse the security of medical cyber-physical systems. To be precise, we design a threat model and propose prospective attack techniques for medical imaging devices. To tackle potential cyber threats, we introduce protection mechanisms, evaluate the effectiveness and efficiency of protection mechanisms as well as its interplay with attack techniques. To scoring security, we design a hierarchical system that provides actionable suggestions for imaging devices in different scenarios. We investigate 15 devices from 9 manufacturers to demonstrate empirical comprehension and real-world security issues.

preprint2020arXiv

Sparsity-Aware SSAF Algorithm with Individual Weighting Factors for Acoustic Echo Cancellation

In this paper, we propose and analyze the sparsity-aware sign subband adaptive filtering with individual weighting factors (S-IWF-SSAF) algorithm, and consider its application in acoustic echo cancellation (AEC). Furthermore, we design a joint optimization scheme of the step-size and the sparsity penalty parameter to enhance the S-IWF-SSAF performance in terms of convergence rate and steady-state error. A theoretical analysis shows that the S-IWF-SSAF algorithm outperforms the previous sign subband adaptive filtering with individual weighting factors (IWF-SSAF) algorithm in sparse scenarios. In particular, compared with the existing analysis on the IWF-SSAF algorithm, the proposed analysis does not require the assumptions of large number of subbands, long adaptive filter, and paraunitary analysis filter bank, and matches well the simulated results. Simulations in both system identification and AEC situations have demonstrated our theoretical analysis and the effectiveness of the proposed algorithms.

preprint2020arXiv

Stem-leaf segmentation and phenotypic trait extraction of maize shoots from three-dimensional point cloud

Nowadays, there are many approaches to acquire three-dimensional (3D) point clouds of maize plants. However, automatic stem-leaf segmentation of maize shoots from three-dimensional (3D) point clouds remains challenging, especially for new emerging leaves that are very close and wrapped together during the seedling stage. To address this issue, we propose an automatic segmentation method consisting of three main steps: skeleton extraction, coarse segmentation based on the skeleton, fine segmentation based on stem-leaf classification. The segmentation method was tested on 30 maize seedlings and compared with manually obtained ground truth. The mean precision, mean recall, mean micro F1 score and mean over accuracy of our segmentation algorithm were 0.964, 0.966, 0.963 and 0.969. Using the segmentation results, two applications were also developed in this paper, including phenotypic trait extraction and skeleton optimization. Six phenotypic parameters can be accurately and automatically measured, including plant height, crown diameter, stem height and diameter, leaf width and length. Furthermore, the values of R2 for the six phenotypic traits were all above 0.94. The results indicated that the proposed algorithm could automatically and precisely segment not only the fully expanded leaves, but also the new leaves wrapped together and close together. The proposed approach may play an important role in further maize research and applications, such as genotype-to-phenotype study, geometric reconstruction and dynamic growth animation. We released the source code and test data at the web site https://github.com/syau-miao/seg4maize.git

preprint2020arXiv

TexSmart: A Text Understanding System for Fine-Grained NER and Enhanced Semantic Analysis

This technique report introduces TexSmart, a text understanding system that supports fine-grained named entity recognition (NER) and enhanced semantic analysis functionalities. Compared to most previous publicly available text understanding systems and tools, TexSmart holds some unique features. First, the NER function of TexSmart supports over 1,000 entity types, while most other public tools typically support several to (at most) dozens of entity types. Second, TexSmart introduces new semantic analysis functions like semantic expansion and deep semantic representation, that are absent in most previous systems. Third, a spectrum of algorithms (from very fast algorithms to those that are relatively slow but more accurate) are implemented for one function in TexSmart, to fulfill the requirements of different academic and industrial applications. The adoption of unsupervised or weakly-supervised algorithms is especially emphasized, with the goal of easily updating our models to include fresh data with less human annotation efforts. The main contents of this report include major functions of TexSmart, algorithms for achieving these functions, how to use the TexSmart toolkit and Web APIs, and evaluation results of some key algorithms.

preprint2020arXiv

The first simultaneous measurement of Hubble constant and post-Newtonian parameter from Time-Delay Strong Lensing

Strong gravitational lensing has been a powerful probe of cosmological models and gravity. To date, constraints in either domain have been obtained separately. We propose a new methodology through which the cosmological model, specifically the Hubble constant, and post-Newtonian parameter can be simultaneously constrained. Using the time-delay cosmography from strong lensing combined with the stellar kinematics of the deflector lens, we demonstrate the Hubble constant and post-Newtonian parameter are incorporated in two distance ratios which reflect the lensing mass and dynamical mass, respectively. Through the reanalysis of the four publicly released lenses distance posteriors from the H0LiCOW collaboration, the simultaneous constraints of Hubble constant and post-Newtonian parameter are obtained. Our results suggests no deviation from the General Relativity, $γ_{\texttt{PPN}}=0.87^{+0.19}_{-0.17}$ with a Hubble constant favors the local Universe value, $H_0=73.65^{+1.95}_{-2.26}$ km s$^{-1}$ Mpc$^{-1}$. Finally, we forecast the robustness of gravity tests by using the time-delay strong lensing for constraints we expect in the next few years. We find that the joint constraint from 40 lenses are able to reach the order of $7.7\%$ for the post-Newtonian parameter and $1.4\%$ for Hubble constant.

preprint2020arXiv

Topological transition from superuid vortex rings to isolated knots and links

Knots and links are fundamental topological objects play a key role in both classical and quantum fluids. In this research, we propose a novel scheme to generate torus vortex knots and links through the reconnections of vortex rings perturbed by Kelvin waves in trapped Bose-Einstein condensates. We observe a new phenomenon in a confined superfluid system in which the transfer of helicity between knots/links and coils can occur in both directions with different pathways. The pathways of topology transition can be controlled through designing specific initial states. The generation of a knot or link can be achieved by setting the parity of the Kelvin wave number. The stability of knots/links can be improved greatly with tunable parameters, including the ideal relative angle and the minimal distance between the initial vortex rings.

preprint2020arXiv

Traffic Agent Trajectory Prediction Using Social Convolution and Attention Mechanism

The trajectory prediction is significant for the decision-making of autonomous driving vehicles. In this paper, we propose a model to predict the trajectories of target agents around an autonomous vehicle. The main idea of our method is considering the history trajectories of the target agent and the influence of surrounding agents on the target agent. To this end, we encode the target agent history trajectories as an attention mask and construct a social map to encode the interactive relationship between the target agent and its surrounding agents. Given a trajectory sequence, the LSTM networks are firstly utilized to extract the features for all agents, based on which the attention mask and social map are formed. Then, the attention mask and social map are fused to get the fusion feature map, which is processed by the social convolution to obtain a fusion feature representation. Finally, this fusion feature is taken as the input of a variable-length LSTM to predict the trajectory of the target agent. We note that the variable-length LSTM enables our model to handle the case that the number of agents in the sensing scope is highly dynamic in traffic scenes. To verify the effectiveness of our method, we widely compare with several methods on a public dataset, achieving a 20% error decrease. In addition, the model satisfies the real-time requirement with the 32 fps.

preprint2020arXiv

Turbulent cascade induced persistent current of cold atomic superfluids

Dissipating of disorder quantum vortices in an annular two-dimensional Bose-Einstein condensate can form a macroscopic persistent flow of atoms. We propose a protocol to create persistent flow with high winding number based on a double concentric ring-shaped configuration. We find that a sudden geometric quench of the trap from single ring-shape into double concentric ring-shape will enhance the circulation flow in the outer ring-shaped region of the trap when the initial state of the condensate is with randomly distributed vortices of the same charge. The circulation flows that we created are with high stability and good uniformity free from topological excitations. Our study is promising for new atomtronic designing, and is also helpful for quantitatively understanding quantum tunneling and interacting quantum systems driven far from equilibrium.

preprint2019arXiv

Collective oscillations of a Bose-Einstein condensate induced by a vortex ring

We study the collective oscillations of three-dimensional Bose-Einstein condensates (BECs) excited by a vortex ring. We identify independent, integrated, and stationary modes of the center-of-mass oscillation of the condensate with respect to the vortex ring movement. We show that the oscillation amplitude {of the center-of-mass of the condensate} depends strongly on the initial radius of the vortex ring, the inter-atomic interaction, and the aspect ration of the trap, while the oscillation frequency is fixed and equal to the frequency of the harmonic trap in the direction of the ring movement. However, when applying Kelvin wave perturbations on the vortex ring, the center-of-mass oscillation of the BEC is changed nontrivially with respect to the perturbation modes, the long-scale perturbation strength as well as the wave number of the perturbations. The parity of the wave number of the Kelvin perturbations plays important role on the mode of the center-of-mass oscillation of the condensate.

preprint2019arXiv

Controllable splitting dynamics of a doubly quantized vortex in a rotating ring-shaped condensate

We study the dynamics of a doubly quantized vortex (DQV), created by releasing a ring-shaped Bose-Einstein condensate with quantized circulation into harmonic potential traps. It is shown that a DQV can be generated and exists stably in the middle of the ring-shaped condensate with the initial circulation $s = 2$ after released into the rotationally symmetric trap potential. For an asymmetric trap with a small degree of anisotropy the DQV initially splits into two singly quantized vortices and revives again but eventually evolves into two unit vortices due to the dynamic instability. For the degree of anisotropy above a critical value, the DQV is extremely unstably and decays rapidly into two singlet vortices. The geometry-dependent lifetime of the DQV and vortex-induced excitations are also discussed intensively.

preprint2019arXiv

Correlation functions of the XXZ spin chain with the twisted boundary condition

The scalar products, form factors and correlation functions of the XXZ spin chain with twisted (or antiperiodic) boundary condition are obtained based on the inhomogeneous $T-Q$ relation and the Bethe states constructed via the off-diagonal Bethe Ansatz. It is shown that the scalar product of two off-shell Bethe states, the form factors and the two-point correlation functions can be expressed as the summation of certain determinants. The corresponding homogeneous limits are studied. The results are also checked by the numerical calculations.

preprint2019arXiv

Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition

Existing deep learning methods for action recognition in videos require a large number of labeled videos for training, which is labor-intensive and time-consuming. For the same action, the knowledge learned from different media types, e.g., videos and images, may be related and complementary. However, due to the domain shifts and heterogeneous feature representations between videos and images, the performance of classifiers trained on images may be dramatically degraded when directly deployed to videos. In this paper, we propose a novel method, named Deep Image-to-Video Adaptation and Fusion Networks (DIVAFN), to enhance action recognition in videos by transferring knowledge from images using video keyframes as a bridge. The DIVAFN is a unified deep learning model, which integrates domain-invariant representations learning and cross-modal feature fusion into a unified optimization framework. Specifically, we design an efficient cross-modal similarities metric to reduce the modality shift among images, keyframes and videos. Then, we adopt an autoencoder architecture, whose hidden layer is constrained to be the semantic representations of the action class names. In this way, when the autoencoder is adopted to project the learned features from different domains to the same space, more compact, informative and discriminative representations can be obtained. Finally, the concatenation of the learned semantic feature representations from these three autoencoders are used to train the classifier for action recognition in videos. Comprehensive experiments on four real-world datasets show that our method outperforms some state-of-the-art domain adaptation and action recognition methods.

preprint2019arXiv

Gaussian processes reconstruction of modified gravitational wave propagation

Recent work has shown that modified gravitational wave (GW) propagation can be a powerful probe of dark energy and modified gravity, specific to GW observations. We use the technique of Gaussian processes, that allows the reconstruction of a function from the data without assuming any parametrization, to measurements of the GW luminosity distance from simulated joint GW-GRB detections, combined with measurements of the electromagnetic luminosity distance by simulated DES data. For the GW events we consider both a second-generation LIGO/Virgo/Kagra (HVLKI) network, and a third-generation detector such as the Einstein Telescope. We find that the HVLKI network at target sensitivity, with $O(15)$ neutron star binaries with electromagnetic counterpart, could already detect deviations from GR at a level predicted by some modified gravity models, and a third-generation detector such as ET would have a remarkable discovery potential. We discuss the complementarity of the Gaussian processes technique to the $(Ξ_0,n)$ parametrization of modified GW propagation.

preprint2019arXiv

Multi-Grained Named Entity Recognition

This paper presents a novel framework, MGNER, for Multi-Grained Named Entity Recognition where multiple entities or entity mentions in a sentence could be non-overlapping or totally nested. Different from traditional approaches regarding NER as a sequential labeling task and annotate entities consecutively, MGNER detects and recognizes entities on multiple granularities: it is able to recognize named entities without explicitly assuming non-overlapping or totally nested structures. MGNER consists of a Detector that examines all possible word segments and a Classifier that categorizes entities. In addition, contextual information and a self-attention mechanism are utilized throughout the framework to improve the NER performance. Experimental results show that MGNER outperforms current state-of-the-art baselines up to 4.4% in terms of the F1 score among nested/non-overlapping NER tasks.