Source author record

Yuan Shi

Yuan Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Artificial Intelligence astro-ph.CO astro-ph.IM Computation and Language Methodology

Catalog footprint

What is connected

4works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Lunar Orbit Measurement of Cosmic Dawn 21 cm Global Spectrum

A redshifted 21 cm line absorption signature is commonly expected from the cosmic dawn era, when the first stars and galaxies formed. The detailed traits of this signal can provide important insight on the cosmic history. However, high precision measurement of this signal is hampered by the ionosphere refraction and absorption, as well as radio frequency interference (RFI). A space observation can solve the problem of the ionosphere, and the Moon can shield the RFI from the Earth. In this paper, we present simulations of the global spectrum measurement in the 30 -- 120 MHz frequency band on the lunar orbit, from the proposed Discovering the Sky at the Longest wavelength (DSL) project. In particular, we consider how the measured signal varies as the satellite moves along the orbit, take into account the blockage of different parts of the sky by the Moon and the antenna response. We estimate the sensitivity for such a 21 cm global spectrum experiment. An RMS noise level of $\le 0.05$ K is expected at 75 MHz after 10 orbits ($\sim$ 1 day) observation, for a frequency channel width of 0.4 MHz. We also study the influence of a frequency-dependent beam, which may generate complex spectral structures in the spectrum. Estimates of the uncertainties in the foreground and 21 cm model parameters are obtained.

preprint2022arXiv

Using Domain Knowledge for Low Resource Named Entity Recognition

In recent years, named entity recognition has always been a popular research in the field of natural language processing, while traditional deep learning methods require a large amount of labeled data for model training, which makes them not suitable for areas where labeling resources are scarce. In addition, the existing cross-domain knowledge transfer methods need to adjust the entity labels for different fields, so as to increase the training cost. To solve these problems, enlightened by a processing method of Chinese named entity recognition, we propose to use domain knowledge to improve the performance of named entity recognition in areas with low resources. The domain knowledge mainly applied by us is domain dictionary and domain labeled data. We use dictionary information for each word to strengthen its word embedding and domain labeled data to reinforce the recognition effect. The proposed model avoids large-scale data adjustments in different domains while handling named entities recognition with low resources. Experiments demonstrate the effectiveness of our method, which has achieved impressive results on the data set in the field of scientific and technological equipment, and the F1 score has been significantly improved compared with many other baseline methods.

preprint2012arXiv

Information-Theoretical Learning of Discriminative Clusters for Unsupervised Domain Adaptation

We study the problem of unsupervised domain adaptation, which aims to adapt classifiers trained on a labeled source domain to an unlabeled target domain. Many existing approaches first learn domain-invariant features and then construct classifiers with them. We propose a novel approach that jointly learn the both. Specifically, while the method identifies a feature space where data in the source and the target domains are similarly distributed, it also learns the feature space discriminatively, optimizing an information-theoretic metric as an proxy to the expected misclassification error on the target domain. We show how this optimization can be effectively carried out with simple gradient-based methods and how hyperparameters can be cross-validated without demanding any labeled data from the target domain. Empirical studies on benchmark tasks of object recognition and sentiment analysis validated our modeling assumptions and demonstrated significant improvement of our method over competing ones in classification accuracies.

preprint2011arXiv

Learning Discriminative Metrics via Generative Models and Kernel Learning

Metrics specifying distances between data points can be learned in a discriminative manner or from generative models. In this paper, we show how to unify generative and discriminative learning of metrics via a kernel learning framework. Specifically, we learn local metrics optimized from parametric generative models. These are then used as base kernels to construct a global kernel that minimizes a discriminative training criterion. We consider both linear and nonlinear combinations of local metric kernels. Our empirical results show that these combinations significantly improve performance on classification tasks. The proposed learning algorithm is also very efficient, achieving order of magnitude speedup in training time compared to previous discriminative baseline methods.