Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
43works
0followers
29topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

43 published item(s)

preprint2026arXiv

Dual-Backend Multibeam Position Switching Targeted SETI Observations toward Nearby Active Planet-Hosting Systems with FAST

The Five-hundred-meter Aperture Spherical Telescope (FAST), the world's largest single-dish radio telescope, lists the search for extraterrestrial intelligence (SETI) as one of its key scientific objectives. In this work, we present a targeted SETI observation for 7 nearby active stars utilizing the FAST L-band multibeam receiver, employing a observational strategy that combines position switching with multibeam tracking to balance on-source integration time with the accuracy of the beam response. Using both pulsar and SETI backends, we perform a comprehensive search for narrowband drifting signals with Doppler drift rates within diversified drift rate ranges and channel-width periodic signal with periods between 0.12 and 100 s and duty cycles between 10% and 50%. No credible radio technosignatures were detected from any of the target systems. Based on this null result, we place constraints on the presence of transmitters at a 95% confidence level, ruling out narrowband transmitters with with EIRP above $3.98\times10^8 \,\mathrm{W}$ and periodic transmitter with EIRP above $1.80\times10^{10} \,\mathrm{W}$,respectively, within the observation band.

preprint2022arXiv

A Support Vector Machine Based Cure Rate Model For Interval Censored Data

The mixture cure rate model is the most commonly used cure rate model in the literature. In the context of mixture cure rate model, the standard approach to model the effect of covariates on the cured or uncured probability is to use a logistic function. This readily implies that the boundary classifying the cured and uncured subjects is linear. In this paper, we propose a new mixture cure rate model based on interval censored data that uses the support vector machine (SVM) to model the effect of covariates on the uncured or the cured probability (i.e., on the incidence part of the model). Our proposed model inherits the features of the SVM and provides flexibility to capture classification boundaries that are non-linear and more complex. Furthermore, the new model can be used to model the effect of covariates on the incidence part when the dimension of covariates is high. The latency part is modeled by a proportional hazards structure. We develop an estimation procedure based on the expectation maximization (EM) algorithm to estimate the cured/uncured probability and the latency model parameters. Our simulation study results show that the proposed model performs better in capturing complex classification boundaries when compared to the existing logistic regression based mixture cure rate model. We also show that our model's ability to capture complex classification boundaries improve the estimation results corresponding to the latency parameters. For illustrative purpose, we present our analysis by applying the proposed methodology to an interval censored data on smoking cessation.

preprint2022arXiv

Discrete Probabilistic Inverse Optimal Transport

Optimal transport (OT) formalizes the problem of finding an optimal coupling between probability measures given a cost matrix. The inverse problem of inferring the cost given a coupling is Inverse Optimal Transport (IOT). IOT is less well understood than OT. We formalize and systematically analyze the properties of IOT using tools from the study of entropy-regularized OT. Theoretical contributions include characterization of the manifold of cross-ratio equivalent costs, the implications of model priors, and derivation of an MCMC sampler. Empirical contributions include visualizations of cross-ratio equivalent effect on basic examples and simulations validating theoretical results.

preprint2022arXiv

Distribution Calibration for Out-of-Domain Detection with Bayesian Approximation

Out-of-Domain (OOD) detection is a key component in a task-oriented dialog system, which aims to identify whether a query falls outside the predefined supported intent set. Previous softmax-based detection algorithms are proved to be overconfident for OOD samples. In this paper, we analyze overconfident OOD comes from distribution uncertainty due to the mismatch between the training and test distributions, which makes the model can't confidently make predictions thus probably causing abnormal softmax scores. We propose a Bayesian OOD detection framework to calibrate distribution uncertainty using Monte-Carlo Dropout. Our method is flexible and easily pluggable into existing softmax-based baselines and gains 33.33\% OOD F1 improvements with increasing only 0.41\% inference time compared to MSP. Further analyses show the effectiveness of Bayesian learning for OOD detection.

preprint2022arXiv

Evolution of beliefs in social networks

Evolution of beliefs of a society are a product of interactions between people (horizontal transmission) in the society over generations (vertical transmission). Researchers have studied both horizontal and vertical transmission separately. Extending prior work, we propose a new theoretical framework which allows application of tools from Markov chain theory to the analysis of belief evolution via horizontal and vertical transmission. We analyze three cases: static network, randomly changing network, and homophily-based dynamic network. Whereas the former two assume network structure is independent of beliefs, the latter assumes that people tend to communicate with those who have similar beliefs. We prove under general conditions that both static and randomly changing networks converge to a single set of beliefs among all individuals along with the rate of convergence. We prove that homophily-based network structures do not in general converge to a single set of beliefs shared by all and prove lower bounds on the number of different limiting beliefs as a function of initial beliefs. We conclude by discussing implications for prior theories and directions for future work.

preprint2022arXiv

FAST observations of an extremely active episode of FRB 20201124A: II. Energy Distribution

We report the properties of more than 800 bursts detected from the repeating fast radio burst (FRB) source FRB 20201124A with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) during an extremely active episode on UTC September 25-28, 2021 in a series of four papers. In this second paper of the series, we mainly focus on the energy distribution of the detected bursts. The event rate initially increased exponentially but the source activity stopped within 24 hours after the 4th day. The detection of 542 bursts in one hour during the fourth day marked the highest event rate detected from one single FRB source so far. The bursts have complex structures in the time-frequency space. We find a double-peak distribution of the waiting time, which can be modeled with two log-normal functions peaking at 51.22 ms and 10.05 s, respectively. Compared with the emission from a previous active episode of the source detected with FAST, the second distribution peak time is smaller, suggesting that this peak is defined by the activity level of the source. We calculate the isotropic energy of the bursts using both a partial bandwidth and a full bandwidth and find that the energy distribution is not significantly changed. We find that an exponentially connected broken-power-law function can fit the cumulative burst energy distribution well, with the lower and higher-energy indices being $-1.22\pm0.01$ and $-4.27\pm0.23$, respectively. Assuming a radio radiative efficiency of $η_r = 10^{-4}$, the total isotropic energy of the bursts released during the four days when the source was active is already $3.9\times10^{46}$ erg, exceeding $\sim 23\%$ of the available magnetar dipolar magnetic energy. This challenges the magnetar models invoking an inefficient radio emission (e.g. synchrotron maser models).

preprint2022arXiv

FAST observations of an extremely active episode of FRB 20201124A: III. Polarimetry

As the third paper in the multiple-part series, we report the statistical properties of radio bursts detected from the repeating fast radio burst (FRB) source FRB 20201124A with the Five-hundred-meter Aperture Spherical radio telescope (FAST) during an extremely active episode between the 25th and the 28th of September 2021 (UT). We focus on the polarisation properties of 536 bright bursts with $\mathrm{S/N}>50$. We found that the Faraday rotation measures (RMs) monotonically dropped from $-579 \ {\rm rad \ m^{-2}}$ to $-605 \ {\rm rad \ m^{-2}}$ in the 4-day window. The RM values were compatible with the values ($-300$ to $-900\ {\rm rad \ m^{-2}}$ ) reported 4 month ago (Xu et al. 2022). However, the RM evolution rate in the current observation window was at least an order of magnitude smaller than the one ($\sim 500\ {\rm rad \ m^{-2}\, day^{-1}}$) previously reported during the rapid RM-variation phase, but is still higher than the one ($\le 1\ {\rm rad \ m^{-2} day^{-1}}$ ) during the later RM no-evolution phase. The bursts of FRB 20201124A were highly polarised with the total degree of polarisation (circular plus linear) greater than 90% for more than 90\% of all bursts. The distribution of linear polarisation position angles (PAs), degree of linear polarisation ($L/I$), and degree of circular polarisation ($V/I$) can be characterised with unimodal distribution functions. During the observation window, the distributions became wider with time, i.e. with larger scatter, but the centroids of the distribution functions remained nearly constant. For individual bursts, significant PA variations (confidence level 5-$σ$) were observed in 33% of all bursts. The polarisation of single pulses seems to follow certain complex trajectories on the Poincaré sphere, which may shed light on the radiation mechanism at the source or the plasma properties along the path of FRB propagation.

preprint2022arXiv

FAST observations of an extremely active episode of FRB 20201124A: IV. Spin Period Search

We report the properties of more than 800 bursts detected from the repeating fast radio burst (FRB) source FRB 20201124A with the Five-hundred-meter Aperture Spherical radio telescope (FAST) during an extremely active episode on UTC September 25th-28th, 2021 in a series of four papers. In this fourth paper of the series, we present a systematic search of the spin period and linear acceleration of the source object from both 996 individual pulse peaks and the dedispersed time series. No credible spin period was found from this data set. We rule out the presence of significant periodicity in the range between 1 ms to 100 s with a pulse duty cycle $< 0.49\pm0.08$ (when the profile is defined by a von-Mises function, not a boxcar function) and linear acceleration up to $300$ m s$^{-2}$ in each of the four one-hour observing sessions, and up to $0.6$ m s$^{-2}$ in all 4 days. These searches contest theoretical scenarios involving a 1 ms to 100 s isolated magnetar/pulsar with surface magnetic field $<10^{15}$ G and a small duty cycle (such as in a polar-cap emission mode) or a pulsar with a companion star or black hole up to 100 M$_{\rm \odot}$ and $P_b>10$ hours. We also perform a periodicity search of the fine structures and identify 53 unrelated millisecond-timescale &#34;periods&#34; in multi-components with the highest significance of 3.9 $σ$. The &#34;periods&#34; recovered from the fine structures are neither consistent nor harmonically related. Thus they are not likely to come from a spin period. We caution against claiming spin periodicity with significance below $\sim$ 4 $σ$ with multi-components from one-off FRBs. We discuss the implications of our results and the possible connections between FRB multi-components and pulsar micro-structures.

preprint2022arXiv

Frequency-dependent polarization of repeating fast radio bursts-implications for their origin

The polarization of fast radio bursts (FRBs), bright astronomical transients, contains crucial information about their environments. We report polarization measurements of five repeating FRBs, the abundant signals of which enable wide-band observations with two telescopes. A clear trend of lower polarization at lower frequencies was found, which can be well characterized by a single parameter rotation-measure-scatter (σRM) and modeled by multi-path scatter. Sources with higher σRM have higher RM magnitude and scattering timescales. The two sources with the most substantial σRM, FRB 20121102A and FRB 20190520B, are associated with a compact persistent radio source. These properties indicate a complex environment near the repeating FRBs, such as a supernova remnant or a pulsar wind nebula, consistent with their arising from young populations.

preprint2022arXiv

Generalized Intent Discovery: Learning from Open World Dialogue System

Traditional intent classification models are based on a pre-defined intent set and only recognize limited in-domain (IND) intent classes. But users may input out-of-domain (OOD) queries in a practical dialogue system. Such OOD queries can provide directions for future improvement. In this paper, we define a new task, Generalized Intent Discovery (GID), which aims to extend an IND intent classifier to an open-world intent set including IND and OOD intents. We hope to simultaneously classify a set of labeled IND intent classes while discovering and recognizing new unlabeled OOD types incrementally. We construct three public datasets for different application scenarios and propose two kinds of frameworks, pipeline-based and end-to-end for future work. Further, we conduct exhaustive experiments and qualitative analysis to comprehend key challenges and provide new guidance for future GID research.

preprint2022arXiv

Learning Distinctive Margin toward Active Domain Adaptation

Despite plenty of efforts focusing on improving the domain adaptation ability (DA) under unsupervised or few-shot semi-supervised settings, recently the solution of active learning started to attract more attention due to its suitability in transferring model in a more practical way with limited annotation resource on target data. Nevertheless, most active learning methods are not inherently designed to handle domain gap between data distribution, on the other hand, some active domain adaptation methods (ADA) usually requires complicated query functions, which is vulnerable to overfitting. In this work, we propose a concise but effective ADA method called Select-by-Distinctive-Margin (SDM), which consists of a maximum margin loss and a margin sampling algorithm for data selection. We provide theoretical analysis to show that SDM works like a Support Vector Machine, storing hard examples around decision boundaries and exploiting them to find informative and transferable data. In addition, we propose two variants of our method, one is designed to adaptively adjust the gradient from margin loss, the other boosts the selectivity of margin sampling by taking the gradient direction into account. We benchmark SDM with standard active learning setting, demonstrating our algorithm achieves competitive results with good data scalability. Code is available at https://github.com/TencentYoutuResearch/ActiveLearning-SDM

preprint2022arXiv

Lightweight Cross-Lingual Sentence Representation Learning

Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture with just 2 layers for generating memory-efficient cross-lingual sentence representations. We explore different training tasks and observe that current cross-lingual training tasks leave a lot to be desired for this shallow architecture. To ameliorate this, we propose a novel cross-lingual language model, which combines the existing single-word masked language model with the newly proposed cross-lingual token-level reconstruction task. We further augment the training task by the introduction of two computationally-lite sentence-level contrastive learning tasks to enhance the alignment of cross-lingual sentence representation space, which compensates for the learning bottleneck of the lightweight transformer for generative tasks. Our comparisons with competing models on cross-lingual sentence retrieval and multilingual document classification confirm the effectiveness of the newly proposed training tasks for a shallow model.

preprint2022arXiv

Neurosymbolic hybrid approach to driver collision warning

There are two main algorithmic approaches to autonomous driving systems: (1) An end-to-end system in which a single deep neural network learns to map sensory input directly into appropriate warning and driving responses. (2) A mediated hybrid recognition system in which a system is created by combining independent modules that detect each semantic feature. While some researchers believe that deep learning can solve any problem, others believe that a more engineered and symbolic approach is needed to cope with complex environments with less data. Deep learning alone has achieved state-of-the-art results in many areas, from complex gameplay to predicting protein structures. In particular, in image classification and recognition, deep learning models have achieved accuracies as high as humans. But sometimes it can be very difficult to debug if the deep learning model doesn&#39;t work. Deep learning models can be vulnerable and are very sensitive to changes in data distribution. Generalization can be problematic. It&#39;s usually hard to prove why it works or doesn&#39;t. Deep learning models can also be vulnerable to adversarial attacks. Here, we combine deep learning-based object recognition and tracking with an adaptive neurosymbolic network agent, called the Non-Axiomatic Reasoning System (NARS), that can adapt to its environment by building concepts based on perceptual sequences. We achieved an improved intersection-over-union (IOU) object recognition performance of 0.65 in the adaptive retraining model compared to IOU 0.31 in the COCO data pre-trained model. We improved the object detection limits using RADAR sensors in a simulated environment, and demonstrated the weaving car detection capability by combining deep learning-based object detection and tracking with a neurosymbolic model.

preprint2022arXiv

Omni-DETR: Omni-Supervised Object Detection with Transformers

We consider the problem of omni-supervised object detection, which can use unlabeled, fully labeled and weakly labeled annotations, such as image tags, counts, points, etc., for object detection. This is enabled by a unified architecture, Omni-DETR, based on the recent progress on student-teacher framework and end-to-end transformer based object detection. Under this unified architecture, different types of weak labels can be leveraged to generate accurate pseudo labels, by a bipartite matching based filtering mechanism, for the model to learn. In the experiments, Omni-DETR has achieved state-of-the-art results on multiple datasets and settings. And we have found that weak annotations can help to improve detection performance and a mixture of them can achieve a better trade-off between annotation cost and accuracy than the standard complete annotation. These findings could encourage larger object detection datasets with mixture annotations. The code is available at https://github.com/amazon-research/omni-detr.

preprint2022arXiv

Radio detection of an elusive millisecond pulsar in the Globular Cluster NGC 6397

We report the discovery of a new 5.78 ms-period millisecond pulsar (MSP), PSR J1740-5340B (NGC 6397B), in an eclipsing binary system discovered with the Parkes radio telescope (now also known as Murriyang), Australia, and confirmed with the MeerKAT radio telescope in South Africa. The measured orbital period, 1.97 days, is the longest among all eclipsing binaries in globular clusters (GCs) and consistent with that of the coincident X-ray source U18, previously suggested to be a &#39;hidden MSP&#39;. Our XMM-Newton observations during NGC 6397B&#39;s radio quiescent epochs detected no X-ray flares. NGC 6397B is either a transitional MSP or an eclipsing binary in its initial stage of mass transfer after the companion star left the main sequence. The discovery of NGC 6397B potentially reveals a subgroup of extremely faint and heavily obscured binary pulsars, thus providing a plausible explanation to the apparent dearth of binary neutron stars in core-collapsed GCs as well as a critical constraint on the evolution of GCs.

preprint2022arXiv

Simulating high-time resolution radio-telescope observations

We describe a new software package for simulating channelised, high-time resolution data streams from radio telescopes. The software simulates data from the telescope and observing system taking into account the observation strategy, receiver system and digitisation. The signatures of pulsars, fast radio bursts and flare stars are modelled, including frequency-dependent effects such as scattering and scintillation. We also simulate more generic signals using spline curves and images. Models of radio frequency interference include signals from satellites, terrestrial transmitters and impulsive, broadband signals. The simulated signals can also be injected into real data sets. Uses of this software include the production of machine learning training data sets, development and testing of new algorithms to search for anomalous patterns and to characterise processing pipelines.

preprint2021arXiv

A GPU based single-pulse search pipeline (GSP) with database and its application to the commensal radio astronomy FAST survey (CRAFTS)

We developed a GPU based single-pulse search pipeline (GSP) with candidate-archiving database. Largely based upon the infrastructure of Open source pulsar search and analysis toolkit (PRESTO), GSP implements GPU acceleration of the de-dispersion and integrates a candidate-archiving database. We applied GSP to the data streams from the commensal radio astronomy FAST survey (CRAFTS), which resulted in a quasi-real-time processing. The integrated candidate database facilitates synergistic usage of multiple machine-learning tools and thus improves efficient identification of radio pulsars such as rotating radio transients (RRATs) and Fast Radio Bursts (FRBs). We first tested GSP on pilot CRAFTS observations with the FAST Ultra-Wide Band (UWB) receiver. GSP detected all pulsars known from the the Parkes multibeam pulsar survey in the respective sky area covered by the FAST-UWB. GSP also discovered 13 new pulsars. We measured the computational efficiency of GSP to be ~120 times faster than the original PRESTO and ~60 times faster than a MPI-parallelized version of PRESTO.

preprint2021arXiv

A Metamodel and Framework for Artificial General Intelligence From Theory to Practice

This paper introduces a new metamodel-based knowledge representation that significantly improves autonomous learning and adaptation. While interest in hybrid machine learning / symbolic AI systems leveraging, for example, reasoning and knowledge graphs, is gaining popularity, we find there remains a need for both a clear definition of knowledge and a metamodel to guide the creation and manipulation of knowledge. Some of the benefits of the metamodel we introduce in this paper include a solution to the symbol grounding problem, cumulative learning, and federated learning. We have applied the metamodel to problems ranging from time series analysis, computer vision, and natural language understanding and have found that the metamodel enables a wide variety of learning mechanisms ranging from machine learning, to graph network analysis and learning by reasoning engines to interoperate in a highly synergistic way. Our metamodel-based projects have consistently exhibited unprecedented accuracy, performance, and ability to generalize. This paper is inspired by the state-of-the-art approaches to AGI, recent AGI-aspiring work, the granular computing community, as well as Alfred Korzybski&#39;s general semantics. One surprising consequence of the metamodel is that it not only enables a new level of autonomous learning and optimal functioning for machine intelligences, but may also shed light on a path to better understanding how to improve human cognition.

preprint2021arXiv

Distributionally-Constrained Policy Optimization via Unbalanced Optimal Transport

We consider constrained policy optimization in Reinforcement Learning, where the constraints are in form of marginals on state visitations and global action executions. Given these distributions, we formulate policy optimization as unbalanced optimal transport over the space of occupancy measures. We propose a general purpose RL objective based on Bregman divergence and optimize it using Dykstra&#39;s algorithm. The approach admits an actor-critic algorithm for when the state or action space is large, and only samples from the marginals are available. We discuss applications of our approach and provide demonstrations to show the effectiveness of our algorithm.

preprint2021arXiv

Efficient Discretizations of Optimal Transport

Obtaining solutions to Optimal Transportation (OT) problems is typically intractable when the marginal spaces are continuous. Recent research has focused on approximating continuous solutions with discretization methods based on i.i.d. sampling, and has proven convergence as the sample size increases. However, obtaining OT solutions with large sample sizes requires intensive computation effort, that can be prohibitive in practice. In this paper, we propose an algorithm for calculating discretizations with a given number of points for marginal distributions, by minimizing the (entropy-regularized) Wasserstein distance, and result in plans that are comparable to those obtained with much larger numbers of i.i.d. samples. Moreover, a local version of such discretizations which is parallelizable for large scale applications is proposed. We prove bounds for our approximation and demonstrate performance on a wide range of problems.

preprint2021arXiv

Estimating the Number of Infected Cases in COVID-19 Pandemic

The COVID-19 pandemic has caused major disturbance to human life. An important reason behind the widespread social anxiety is the huge uncertainty about the pandemic. A fundamental uncertainty is how many or what percentage of people have been infected. There are published and frequently updated data on various statistics of the pandemic, at local, country or global level. However, due to various reasons, many cases were not included in those reported numbers. We propose a structured approach for the estimation of the number of unreported cases, where we distinguish cases that arrive late in the reported numbers and those who had mild or no symptoms and thus were not captured by any medical system at all. We use post-report data for the estimation of the former and population matching to the latter. We estimate that the reported number of infected cases in the US should be corrected by multiplying a factor of 220.54% as of Apr 20, 2020, while the infection ratio out of the US population is estimated to be 0.53%, implying a case mortality rate at 2.85% which is close to the 3.4% suggested by the WHO in Mar 2020. Towards the end of the summer of 2020, the overall infection ratio of the US rises to 2.49% while the case mortality decreases to 2.09%, and the ratio of asymptomatic cases out of all infected cases reduces from the pre-summer 35-40% to around 20-25%.

preprint2021arXiv

The first evidence for three-dimensional spin-velocity alignment in pulsars

More than 50 years after the discovery of pulsars and confirmation of their association with supernova explosions, the origin of the initial spin and velocity of pulsars remains largely a mystery. The typical space velocities of several hundred km/s have been attributed to &#34;kicks&#34; resulting from asymmetries either in the supernova ejecta or in the neutrino emission. Observations have shown a strong tendency for alignment of the pulsar space velocity and spin axis in young pulsars but, up to now, these comparisons have been restricted to two dimensions. We report here the first evidence for three-dimensional alignment between the spin and velocity vectors, largely based on observations made with the Five-hundred-meter Aperture Spherical radio Telescope of the pulsar PSR~J0538+2817 and its associated supernova remnant S147. Analysis of these and related observations has enabled us to determine the location of the pulsar within the supernova remnant and hence its radial velocity. Current simulations of supernova explosions have difficulty producing such three-dimensional alignment. Our results, which depend on the unprecedented sensitivity of the new observations, add another dimension to the intriguing correlation between pulsar spin-axis and birth-kick directions, thus deepening the mysteries surrounding the birth of neutron stars.

preprint2020arXiv

A Fast Radio Burst discovered in FAST drift scan survey

We report the discovery of a highly dispersed fast radio burst, FRB~181123, from an analysis of $\sim$1500~hr of drift-scan survey data taken using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The pulse has three distinct emission components, which vary with frequency across our 1.0--1.5~GHz observing band. We measure the peak flux density to be $>0.065$~Jy and the corresponding fluence $>0.2$~Jy~ms. Based on the observed dispersion measure of 1812~cm$^{-3}$~pc, we infer a redshift of $\sim 1.9$. From this, we estimate the peak luminosity and isotropic energy to be $\lesssim 2\times10^{43}$~erg~s$^{-1}$ and $\lesssim 2\times10^{40}$~erg, respectively. With only one FRB from the survey detected so far, our constraints on the event rate are limited. We derive a 95\% confidence lower limit for the event rate of 900 FRBs per day for FRBs with fluences $>0.025$~Jy~ms. We performed follow-up observations of the source with FAST for four hours and have not found a repeated burst. We discuss the implications of this discovery for our understanding of the physical mechanisms of FRBs.

preprint2020arXiv

A theory of nonequilibrium steady states in quantum chaotic systems

Nonequilibrium steady state (NESS) is a quasistationary state, in which exist currents that continuously produce entropy, but the local observables are stationary everywhere. We propose a theory of NESS under the framework of quantum chaos. In an isolated quantum system, there exist some initial states for which the thermodynamic limit and the long-time limit are noncommutative. The density matrix $\hat ρ$ of these states displays a universal structure. Suppose that $α$ and $β$ are different eigenstates of the Hamiltonian with energies $E_α$ and $E_β$, respectively. $<α|\hat ρ|β>$ behaves as a random number which approximately follows the Laplace distribution with zero mean. In thermodynamic limit, the variance of $<α|\hat ρ|β>$ is a smooth function of $\left|E_α-E_β\right|$, scaling as $1/(E_α-E_β)^2$ in the limit $\left|E_α-E_β\right|\to 0$. If and only if this scaling law is obeyed, the initial state evolves into NESS in the long time limit. We present numerical evidence of our hypothesis in a few chaotic models. Furthermore, we find that our hypothesis implies the eigenstate thermalization hypothesis (ETH) in a bipartite system.

preprint2020arXiv

Automated classification of stems and leaves of potted plants based on point cloud data

The accurate classification of plant organs is a key step in monitoring the growing status and physiology of plants. A classification method was proposed to classify the leaves and stems of potted plants automatically based on the point cloud data of the plants, which is a nondestructive acquisition. The leaf point training samples were automatically extracted by using the three-dimensional convex hull algorithm, while stem point training samples were extracted by using the point density of a two-dimensional projection. The two training sets were used to classify all the points into leaf points and stem points by utilizing the support vector machine (SVM) algorithm. The proposed method was tested by using the point cloud data of three potted plants and compared with two other methods, which showed that the proposed method can classify leaf and stem points accurately and efficiently.

preprint2020arXiv

Automatic marker-free registration of tree point-cloud data based on rotating projection

Point-cloud data acquired using a terrestrial laser scanner (TLS) play an important role in digital forestry research. Multiple scans are generally used to overcome occlusion effects and obtain complete tree structural information. However, it is time-consuming and difficult to place artificial reflectors in a forest with complex terrain for marker-based registration, a process that reduces registration automation and efficiency. In this study, we propose an automatic coarse-to-fine method for the registration of point-cloud data from multiple scans of a single tree. In coarse registration, point clouds produced by each scan are projected onto a spherical surface to generate a series of two-dimensional (2D) images, which are used to estimate the initial positions of multiple scans. Corresponding feature-point pairs are then extracted from these series of 2D images. In fine registration, point-cloud data slicing and fitting methods are used to extract corresponding central stem and branch centers for use as tie points to calculate fine transformation parameters. To evaluate the accuracy of registration results, we propose a model of error evaluation via calculating the distances between center points from corresponding branches in adjacent scans. For accurate evaluation, we conducted experiments on two simulated trees and a real-world tree. Average registration errors of the proposed method were 0.26m around on simulated tree point clouds, and 0.05m around on real-world tree point cloud.

preprint2020arXiv

Building and Maintaining a Third-Party Library Supply Chain for Productive and Secure SGX Enclave Development

The big data industry is facing new challenges as concerns about privacy leakage soar. One of the remedies to privacy breach incidents is to encapsulate computations over sensitive data within hardware-assisted Trusted Execution Environments (TEE). Such TEE-powered software is called secure enclaves. Secure enclaves hold various advantages against competing for privacy-preserving computation solutions. However, enclaves are much more challenging to build compared with ordinary software. The reason is that the development of TEE software must follow a restrictive programming model to make effective use of strong memory encryption and segregation enforced by hardware. These constraints transitively apply to all third-party dependencies of the software. If these dependencies do not officially support TEE hardware, TEE developers have to spend additional engineering effort in porting them. High development and maintenance cost is one of the major obstacles against adopting TEE-based privacy protection solutions in production. In this paper, we present our experience and achievements with regard to constructing and continuously maintaining a third-party library supply chain for TEE developers. In particular, we port a large collection of Rust third-party libraries into Intel SGX, one of the most mature trusted computing platforms. Our supply chain accepts upstream patches in a timely manner with SGX-specific security auditing. We have been able to maintain the SGX ports of 159 open-source Rust libraries with reasonable operational costs. Our work can effectively reduce the engineering cost of developing SGX enclaves for privacy-preserving data processing and exchange.

preprint2020arXiv

Connecting dynamical quantum phase transitions and topological steady-state transitions by tuning the energy gap

Considerable theoretical and experimental efforts have been devoted to the quench dynamics, in particular, the dynamical quantum phase transition (DQPT) and the steady-state transition. These developments have motivated us to study the quench dynamics of the topological systems, from which we find the connection between these two transitions, that is, the DQPT, accompanied by a nonanalytic behavior as a function of time, always merges into a steady-state transition signaled by the nonanalyticity of observables in the steady limit. As the characteristic time of the DQPT diverges, it exhibits universal scaling behavior, which is related to the scaling behavior at the corresponding steady-state transition.

preprint2020arXiv

Discrete Lorentz symmetry and discrete spacetime translational symmetry in two- and three-dimensional crystals

As is well known, crystals have discrete space translational symmetry. It was recently noticed that one-dimensional crystals possibly have discrete Poincaré symmetry, which contains discrete Lorentz and discrete time translational symmetry as well. In this paper, we classify the discrete Poincaré groups on two- and three-dimensional Bravais lattices. They are the candidate symmetry groups of two- or three-dimensional crystals, respectively. The group is determined by an integer generator $g$, and it reduces to the space group of crystals at $g=2$.

preprint2020arXiv

Dissipative phase transitions in the fully-connected Ising model with $p$-spin interaction

In this paper, we study the driven-dissipative p-spin models for $p\geq 2$. In thermodynamics limit, the equation of motion is derived by using a semiclassical approach. The long-time asymptotic states are obtained analytically, which exhibit multi-stability in some regions of the parameter space. The steady state is unique as the number of spins is finite. But the thermodynamic limit of the steady-state magnetization displays nonanalytic behavior somewhere inside the semiclassical multi-stable region. We find both the first-order and continuous dissipative phase transitions. As the number of spins increases, both the Liouvillian gap and magnetization variance vanish according to a power law at the continuous transition. At the first-order transition, the gap vanishes exponentially accompanied by a jump of magnetization in thermodynamic limit. The properties of transitions depend on the symmetry and semiclassical multistability, being qualitatively different among $p=2$, odd $p$ ($p\geq 3$) and even $p$ ($p\geq 4$).

preprint2020arXiv

First SETI Observations with China&#39;s Five-hundred-meter Aperture Spherical radio Telescope (FAST)

The Search for Extraterrestrial Intelligence (SETI) attempts to address the possibility of the presence of technological civilizations beyond the Earth. Benefiting from high sensitivity, large sky coverage, an innovative feed cabin for China&#39;s Five-hundred-meter Aperture Spherical radio Telescope (FAST), we performed the SETI first observations with FAST&#39;s newly commisioned 19-beam receiver; we report preliminary results in this paper. Using the data stream produced by the SERENDIP VI realtime multibeam SETI spectrometer installed at FAST, as well as its off-line data processing pipelines, we identify and remove four kinds of radio frequency interference(RFI): zone, broadband, multi-beam, and drifting, utilizing the Nebula SETI software pipeline combined with machine learning algorithms. After RFI mitigation, the Nebula pipeline identifies and ranks interesting narrow band candidate ET signals, scoring candidates by the number of times candidate signals have been seen at roughly the same sky position and same frequency, signal strength, proximity to a nearby star or object of interest, along with several other scoring criteria. We show four example candidates groups that demonstrate these RFI mitigation and candidate selection. This preliminary testing on FAST data helps to validate our SETI instrumentation techniques as well as our data processing pipeline.

preprint2020arXiv

Opportunities to Search for Extra-Terrestrial Intelligence with the Five-hundred-meter Aperture Spherical radio Telescope

The discovery of ubiquitous habitable extrasolar planets, combined with revolutionary advances in instrumentation and observational capabilities, has ushered in a renaissance in the search for extra-terrestrial intelligence (SETI). Large scale SETI activities are now underway at numerous international facilities. The Five-hundred-meter Aperture Spherical radio Telescope (FAST) is the largest single-aperture radio telescope in the world, well positioned to conduct sensitive searches for radio emission indicative of exo-intelligence. SETI is one of the five key science goals specified in the original FAST project plan. A collaboration with the Breakthrough Listen Initiative has been initiated in 2016 with a joint statement signed both by Dr. Jun Yan, the then director of the National Astronomical Observatories, Chinese Academy of Sciences (NAOC), and Dr. Peter Worden, the Chairman of the Breakthrough Prize Foundation. In this paper, we highlight some of the unique features of FAST that will allow for novel SETI observations. We identify and describe three different signal types indicative of a technological source, namely, narrow-band, wide-band artificially dispersed, and modulated signals. We here propose observations with FAST to achieve sensitivities never before explored.

preprint2020arXiv

Poincaré crystal on the one-dimensional lattice

In this paper, we develop the quantum theory of particles that has discrete Poincaré symmetry on the one-dimensional Bravais lattice. We review the recently discovered discrete Lorentz symmetry, which is the unique Lorentz symmetry that coexists with the discrete space translational symmetry on a Bravais lattice. The discrete Lorentz transformations and spacetime translations form the discrete Poincaré group, which are represented by unitary operators in a quantum theory. We find the conditions for the existence of representation, which are expressed as the congruence relation between quasi-momentum and quasi-energy. We then build the Lorentz-invariant many-body theory of indistinguishable particles by expressing both the unitary operators and Floquet Hamiltonians in terms of the field operators. Some typical Hamiltonians include the long-range hopping which fluctuates as the distance between sites increases. We calculate the Green&#39;s functions of the lattice theory. The spacetime points where the Green&#39;s function is nonzero display a lattice structure. During the propagation, the particles stay localized on a single or a few sites to preserve the Lorentz symmetry.

preprint2020arXiv

Predicting Large-Chern-Number Phases in a Shaken Optical Dice Lattice

With respect to the quantum anomalous Hall effect (QAHE), the detection of topological nontrivial large-Chern-number phases is an intriguing subject. Motivated by recent research on Floquet topological phases, this study proposes a periodic driving protocol to engineer large-Chern-number phases using QAHE. Herein, spinless ultracold fermionic atoms are studied in a two-dimensional optical dice lattice with nearest-neighbor hopping and a $Λ$/V-type sublattice potential subjected to a circular driving force. Results suggest that large-Chern-number phases exist with Chern numbers equal to $C=-2$, which is consistent with the edge-state energy spectra.

preprint2020arXiv

SCOUT: Self-aware Discriminant Counterfactual Explanations

The problem of counterfactual visual explanations is considered. A new family of discriminant explanations is introduced. These produce heatmaps that attribute high scores to image regions informative of a classifier prediction but not of a counter class. They connect attributive explanations, which are based on a single heat map, to counterfactual explanations, which account for both predicted class and counter class. The latter are shown to be computable by combination of two discriminant explanations, with reversed class pairs. It is argued that self-awareness, namely the ability to produce classification confidence scores, is important for the computation of discriminant explanations, which seek to identify regions where it is easy to discriminate between prediction and counter class. This suggests the computation of discriminant explanations by the combination of three attribution maps. The resulting counterfactual explanations are optimization free and thus much faster than previous methods. To address the difficulty of their evaluation, a proxy task and set of quantitative metrics are also proposed. Experiments under this protocol show that the proposed counterfactual explanations outperform the state of the art while achieving much higher speeds, for popular networks. In a human-learning machine teaching experiment, they are also shown to improve mean student accuracy from chance level to 95\%.

preprint2020arXiv

Sequential Cooperative Bayesian Inference

Cooperation is often implicitly assumed when learning from other agents. Cooperation implies that the agent selecting the data, and the agent learning from the data, have the same goal, that the learner infer the intended hypothesis. Recent models in human and machine learning have demonstrated the possibility of cooperation. We seek foundational theoretical results for cooperative inference by Bayesian agents through sequential data. We develop novel approaches analyzing consistency, rate of convergence and stability of Sequential Cooperative Bayesian Inference (SCBI). Our analysis of the effectiveness, sample efficiency and robustness show that cooperation is not only possible in specific instances but theoretically well-founded in general. We discuss implications for human-human and human-machine cooperation.

preprint2020arXiv

Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier

Long-tail recognition tackles the natural non-uniformly distributed data in real-world scenarios. While modern classifiers perform well on populated classes, its performance degrades significantly on tail classes. Humans, however, are less affected by this since, when confronted with uncertain examples, they simply opt to provide coarser predictions. Motivated by this, a deep realistic taxonomic classifier (Deep-RTC) is proposed as a new solution to the long-tail problem, combining realism with hierarchical predictions. The model has the option to reject classifying samples at different levels of the taxonomy, once it cannot guarantee the desired performance. Deep-RTC is implemented with a stochastic tree sampling during training to simulate all possible classification conditions at finer or coarser levels and a rejection mechanism at inference time. Experiments on the long-tailed version of four datasets, CIFAR100, AWA2, Imagenet, and iNaturalist, demonstrate that the proposed approach preserves more information on all classes with different popularity levels. Deep-RTC also outperforms the state-of-the-art methods in longtailed recognition, hierarchical classification, and learning with rejection literature using the proposed correctly predicted bits (CPB) metric.

preprint2020arXiv

The FAST discovery of an Eclipsing Binary Millisecond Pulsar in the Globular Cluster M92 (NGC 6341)

We report the discovery of an eclipsing binary millisecond pulsar in the globular cluster M92 (NGC6341) with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). PSR J1717+4308A, or M92A, has a pulse frequency of 316.5~Hz (3.16~ms) and a dispersion measure of 35.45 pc cm$^{-3}$. The pulsar is a member of a binary system with an orbital period of 0.20~days around a low-mass companion which has a median mass of $\sim$0.18~\Ms. From observations so far, at least two eclipsing events have been observed in each orbit. The longer one lasted for ~5000~s in the orbital phase range 0.1--0.5. The other lasted for ~500~s and occurred between 1000--2000~s before or after the longer eclipsing event. The lengths of these two eclipsing events also change. These properties suggest that J1717+4308A is a ``red-back&#39;&#39; system with a low-mass main sequence or sub-giant companion. Timing observations of the pulsar and further searches of the data for additional pulsars are ongoing.

preprint2020arXiv

Towards Memory Safe Python Enclave for Security Sensitive Computation

Intel SGX Guard eXtensions (SGX), a hardware-supported trusted execution environment (TEE), is designed to protect security-sensitive applications. However, since enclave applications are developed with memory unsafe languages such as C/C++, traditional memory corruption is not eliminated in SGX. Rust-SGX is the first toolkit providing enclave developers with a memory-language. However, Rust is considered a Systems language and has become the right choice for concurrent applications and web browsers. Many application domains such as Big Data, Machine Learning, Robotics, Computer Vision are more commonly developed in the python programming language. Therefore, Python application developers cannot benefit from secure enclaves like Intel SGX and rust-SGX. To fill this gap, we propose Python-SGX, which is a memory-safe SGX SDK providing enclave developers a memory-safe Python development environment. The key idea is to enable memory-safe Python language in SGX by solving the following key challenges: (1) defining a memory-safe Python interpreter (2)replacing unsafe elements of Python interpreter with safe ones,(3) achieving comparable performance to non-enclave Python applications, and (4) not introducing any unsafe new code or libraries into SGX. We propose to build Python-SGX with PyPy, a Python interpreter written by RPython, which is a subset of Python, and tame unsafe parts in PyPy by formal verification, security hardening, and memory safe language. We have implemented python-SGX and tested it with a series of benchmarks programs. Our evaluation results show that Python-SGX does not cause significant overhead.

preprint2020arXiv

Variational Bayesian Weighted Complex Network Reconstruction

Complex network reconstruction is a hot topic in many fields. Currently, the most popular data-driven reconstruction framework is based on lasso. However, it is found that, in the presence of noise, lasso loses efficiency for weighted networks. This paper builds a new framework to cope with this problem. The key idea is to employ a series of linear regression problems to model the relationship between network nodes, and then to use an efficient variational Bayesian algorithm to infer the unknown coefficients. The numerical experiments conducted on both synthetic and real data demonstrate that the new method outperforms lasso with regard to both reconstruction accuracy and running speed.

preprint2019arXiv

Wide Bandwidth Observations of Pulsars C, D and J in 47 Tucanae

We report the first wideband observations of pulsars C, D and J in the globular cluster 47Tucanae (NGC 104) using the Ultra-Wideband Low (UWL) receiver system recently installed on the Parkes 64 m radio telescope. The wide frequency range of the UWL receiver (704-4032 MHz), along with the well-calibrated system, allowed us to obtain flux density measurements and polarization pulse profiles. The mean pulse profiles have significant linear and circular polarization, allowing for determination of the Faraday rotation measure for each pulsar. Precise measurements of the dispersion measures show a significant deviation in the value for pulsar D compared to earlier results. Searches for new pulsars in the cluster are on-going and we have determined optimal bands for such searches using the Parkes UWL receiver system.

preprint2018arXiv

Critical behavior of order parameter at the nonequilibrium phase transition of the Ising model

After a quench of transverse field, the asymptotic long-time state of Ising model displays a transition from a ferromagnetic phase to a paramagnetic phase as the post-quench field strength increases, which is revealed by the vanishing of the order parameter defined as the averaged magnetization over time. We estimate the critical behavior of the magnetization at this nonequilibrium phase transition by using mean-field approximation. In the vicinity of the critical field, the magnetization vanishes as the inverse of a logarithmic function, which is significantly distinguished from the critical behavior of order parameter at the corresponding equilibrium phase transition, i.e. a power-law function.

preprint2016arXiv

Higgs amplitude mode in massless Dirac fermion systems

The Higgs amplitude mode in superconductors is the condensed matter analogy of Higgs bosons in particle physics. We investigate the time evolution of Higgs amplitude mode in massless Dirac systems, induced by a weak quench of an attractive interaction. We find that the Higgs amplitude mode in the half-filling honeycomb lattice has a logarithmic decaying behaviour, qualitatively different from the $1/\sqrt{t}$ decay in the normal superconductors. Our study is also extended to the doped cases in honeycomb lattice. As for the 3D Dirac semimetal at half filling, we obtain an undamped oscillation of the amplitude mode. Our finding is not only an important supplement to the previous theoretical studies on normal fermion systems, but also provide an experimental signature to characterize the superconductivity in 2D or 3D Dirac systems.