Source author record

Chao Liu

Chao Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

115works

39topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Decision-Aware Semantic State Synchronization in Compute-First Networking

In Compute-First Networking (CFN), an Access Point (AP) makes task offloading decisions based on resource state information reported by a Service Node (SN). A fundamental challenge arises from the trade-off between update overhead and decision accuracy: Frequent state updates consume limited network resources, while infrequent updates lead to stale state views and degraded task performance, especially under high system load. Existing approaches based on periodic updates or Age of Information (AoI) mainly focus on temporal freshness and often overlook whether a state change is actually relevant to offloading decisions. This paper proposes SenseCFN, a decision-aware state synchronization framework for CFN. Instead of synchronizing raw resource states, SenseCFN focuses on identifying state changes that are likely to alter offloading decisions. To this end, we introduce a lightweight semantic state representation that captures decision-relevant system characteristics, along with a Semantic Deviation Index (SDI) to quantify the impact of state shifts on decision outcomes. Based on SDI, the SN triggers updates only when significant decision-impacting changes are detected. Meanwhile, the AP performs offloading decisions using cached semantic states with explicit awareness of potential staleness. The update and offloading policies are jointly optimized using a centralized training with distributed execution (CTDE) approach. Simulation results show that SenseCFN maintains a task success rate of up to 99.6% in saturation-prone scenarios, outperforming baseline methods by more than 25%, while reducing status update frequency by approximately 70% to 96%. These results indicate that decision-aware state synchronization provides an effective and practical alternative to purely time-based update strategies in CFN.

preprint2026arXiv

Evidence-Grounded Multi-Agent Planning Support for Urban Carbon Governance via RAG

Urban carbon governance requires planners to integrate heterogeneous evidence -- emission inventories, statistical yearbooks, policy texts, technical measures, and academic findings -- into actionable, cross-departmental plans. Large Language Models (LLMs) can assist planning workflows, yet their factual reliability and evidential traceability remain critical barriers in professional use. This paper presents an evidence-grounded multi-agent planning support system for urban carbon governance built upon standard text-based Retrieval-Augmented Generation (RAG) (without GraphRAG). We align the system with the typical planning workflow by decomposing tasks into four specialized agents: (i) evidence Q\&A for fact checking and compliance queries, (ii) emission status assessment for diagnostic analysis, (iii) planning recommendation for generating multi-sector governance pathways, and (iv) report integration for producing planning-style deliverables. We evaluate the system in two task families: factual retrieval and comprehensive planning generation. On factual retrieval tasks, introducing RAG increases the average score from below 6 to above 90, and dramatically improves key-field extraction (e.g., region and numeric values near 100\% detection). A real-city case study (Ningbo, China) demonstrates end-to-end report generation with strong relevance, coverage, and coherence in expert review, while also highlighting boundary inconsistencies across data sources as a practical limitation.

preprint2026arXiv

On the Adversarial Robustness of 3D Large Vision-Language Models

3D Vision-Language Models (VLMs), such as PointLLM and GPT4Point, have shown strong reasoning and generalization abilities in 3D understanding tasks. However, their adversarial robustness remains largely unexplored. Prior work in 2D VLMs has shown that the integration of visual inputs significantly increases vulnerability to adversarial attacks, making these models easier to manipulate into generating toxic or misleading outputs. In this paper, we investigate whether incorporating 3D vision similarly compromises the robustness of 3D VLMs. To this end, we present the first systematic study of adversarial robustness in point-based 3D VLMs. We propose two complementary attack strategies: \textit{Vision Attack}, which perturbs the visual token features produced by the 3D encoder and projector to assess the robustness of vision-language alignment; and \textit{Caption Attack}, which directly manipulates output token sequences to evaluate end-to-end system robustness. Each attack includes both untargeted and targeted variants to measure general vulnerability and susceptibility to controlled manipulation. Our experiments reveal that 3D VLMs exhibit significant adversarial vulnerabilities under untargeted attacks, while demonstrating greater resilience against targeted attacks aimed at forcing specific harmful outputs, compared to their 2D counterparts. These findings highlight the importance of improving the adversarial robustness of 3D VLMs, especially as they are deployed in safety-critical applications.

preprint2026arXiv

Transition Matching Distillation for Fast Video Generation

Large video diffusion and flow models have achieved remarkable success in high-quality video generation, but their use in real-time interactive applications remains limited due to their inefficient multi-step sampling process. In this work, we present Transition Matching Distillation (TMD), a novel framework for distilling video diffusion models into efficient few-step generators. The central idea of TMD is to match the multi-step denoising trajectory of a diffusion model with a few-step probability transition process, where each transition is modeled as a lightweight conditional flow. To enable efficient distillation, we decompose the original diffusion backbone into two components: (1) a main backbone, comprising the majority of early layers, that extracts semantic representations at each outer transition step; and (2) a flow head, consisting of the last few layers, that leverages these representations to perform multiple inner flow updates. Given a pretrained video diffusion model, we first introduce a flow head to the model, and adapt it into a conditional flow map. We then apply distribution matching distillation to the student model with flow head rollout in each transition step. Extensive experiments on distilling Wan2.1 1.3B and 14B text-to-video models demonstrate that TMD provides a flexible and strong trade-off between generation speed and visual quality. In particular, TMD outperforms existing distilled models under comparable inference costs in terms of visual fidelity and prompt adherence. Project page: https://research.nvidia.com/labs/genair/tmd

preprint2025arXiv

Introduction to the Chinese Space Station Survey Telescope (CSST)

The Chinese Space Station Survey Telescope (CSST) is an upcoming Stage-IV sky survey telescope, distinguished by its large field of view (FoV), high image quality, and multi-band observation capabilities. It can simultaneously conduct precise measurements of the Universe by performing multi-color photometric imaging and slitless spectroscopic surveys. The CSST is equipped with five scientific instruments, i.e. Multi-band Imaging and Slitless Spectroscopy Survey Camera (SC), Multi-Channel Imager (MCI), Integral Field Spectrograph (IFS), Cool Planet Imaging Coronagraph (CPI-C), and THz Spectrometer (TS). Using these instruments, CSST is expected to make significant contributions and discoveries across various astronomical fields, including cosmology, galaxies and active galactic nuclei (AGN), the Milky Way and nearby galaxies, stars, exoplanets, Solar System objects, astrometry, and transients and variable sources. This review aims to provide a comprehensive overview of the CSST instruments, observational capabilities, data products, and scientific potential.

preprint2025arXiv

Millions of Main-Sequence Binary Stars from Gaia BP/RP Spectra

We present the main-sequence binary (MSMS) Catalog derived from Gaia Data Release 3 BP/RP (XP) spectra. Leveraging the vast sample of low-resolution Gaia XP spectra, we develop a forward modeling approach that maps stellar mass and photometric metallicity to XP spectra using a neural network. Our methodology identifies binary systems through statistical comparison of single- and binary-star model fits, enabling detection of binaries with mass ratios between 0.4 and 1.0 and flux ratios larger than 0.1. From an initial sample of 35 million stars within 1 kpc, we identify 14 million binary candidates and define a high-confidence "golden sample" of 1 million binary systems. This large, homogeneous sample enables detailed statistical analysis of binary properties across diverse Galactic environments, providing new insights into binary star formation and evolution. In addition, the $χ^2$ comparison allows us to distinguish stars with luminous companions from single stars or binaries with dark companions, such as white dwarfs, neutron stars and black hole candidates, improving our understanding of compact object populations.

preprint2023arXiv

Nanoparticles Passive Targeting Allows Optical Imaging of Bone Diseases

Bone health related skeletal disorders are commonly diagnosed by X-ray imaging, but the radiation limits its use. Light excitation and optical imaging through the near-infrared-II window (NIR-II, 1000-1700 nm) can penetrate deep tissues without radiation risk, but the targeting of contrast agent is non-specific. Here, we report that lanthanide-doped nanocrystals can be passively transported by endothelial cells and macrophages from the blood vessels into bone marrow microenvironment. We found that this passive targeting scheme can be effective for longer than two months. We therefore developed an intravital 3D and high-resolution planar imaging instrumentation for bone disease diagnosis. We demonstrated the regular monitoring of 1 mm bone defects for over 10 days, with resolution similar to X-ray imaging result, but more flexible use in prognosis. Moreover, the passive targeting can be used to reveal the early onset inflammation at the joints as the synovitis in the early stage of rheumatoid arthritis. Furthermore, the proposed method is comparable to μCT in recognizing symptoms of osteoarthritis, including the mild hyperostosis in femur which is ~100 μm thicker than normal, and the growth of millimeter-scale osteophyte in the knee joint, which further proves the power and universality of our approach in diagnosis of bone diseases

preprint2022arXiv

A bimodal distribution of haze in Pluto's atmosphere

Pluto, Titan, and Triton make up a unique class of solar system bodies, with icy surfaces and chemically reducing atmospheres rich in organic photochemistry and haze formation. Hazes play important roles in these atmospheres, with physical and chemical processes highly dependent on particle sizes, but the haze size distribution in reducing atmospheres is currently poorly understood. Here we report observational evidence that Pluto's haze particles are bimodally distributed, which successfully reproduces the full phase scattering observations from New Horizons. Combined with previous simulations of Titan's haze, this result suggests that haze particles in reducing atmospheres undergo rapid shape change near pressure levels ~0.5Pa and favors a photochemical rather than a dynamical origin for the formation of Titan's detached haze. It also demonstrates that both oxidizing and reducing atmospheres can produce multi-modal hazes, and encourages reanalysis of observations of hazes on Titan and Triton.

preprint2022arXiv

A Low-Cost, Highly Customizable Solution for Position Estimation in Modular Robots

Accurate position sensing is important for state estimation and control in robotics. Reliable and accurate position sensors are usually expensive and difficult to customize. Incorporating them into systems that have very tight volume constraints such as modular robots are particularly difficult. PaintPots are low-cost, reliable, and highly customizable position sensors, but their performance is highly dependent on the manufacturing and calibration process. This paper presents a Kalman filter with a simplified observation model developed to deal with the non-linearity issues that result in the use of low-cost microcontrollers. In addition, a complete solution for the use of PaintPots in a variety of sensing modalities including manufacturing, characterization, and estimation is presented for an example modular robot, SMORES-EP. This solution can be easily adapted to a wide range of applications.

preprint2022arXiv

A Robust Hot Subdwarfs Identification Method Based on Deep Learning

Hot subdwarf star is a particular type of star that is crucial for studying binary evolution and atmospheric diffusion processes. In recent years, identifying Hot subdwarfs by machine learning methods has become a hot topic, but there are still limitations in automation and accuracy. In this paper, we proposed a robust identification method based on the convolutional neural network (CNN). We first constructed the dataset using the spectral data of LAMOS DR7-V1. We then constructed a hybrid recognition model including an 8-class classification model and a binary classification model. The model achieved an accuracy of 96.17% on the testing set. To further validate the accuracy of the model, we selected 835 Hot subdwarfs that were not involved in the training process from the identified LAMOST catalog (2428, including repeated observations) as the validation set. An accuracy of 96.05% was achieved. On this basis, we used the model to filter and classify all 10,640,255 spectra of LAMOST DR7-V1, and obtained a catalog of 2393 Hot subdwarf candidates, of which 2067 have been confirmed. We found 25 new Hot subdwarfs among the remaining candidates by manual validation. The overall accuracy of the model is 87.42%. Overall, the model presented in this study can effectively identify specific spectra with robust results and high accuracy, and can be further applied to the classification of large-scale spectra and the search of specific targets.

preprint2022arXiv

CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval

Given a natural language description, text-based person retrieval aims to identify images of a target person from a large-scale person image database. Existing methods generally face a \textbf{color over-reliance problem}, which means that the models rely heavily on color information when matching cross-modal data. Indeed, color information is an important decision-making accordance for retrieval, but the over-reliance on color would distract the model from other key clues (e.g. texture information, structural information, etc.), and thereby lead to a sub-optimal retrieval performance. To solve this problem, in this paper, we propose to \textbf{C}apture \textbf{A}ll-round \textbf{I}nformation \textbf{B}eyond \textbf{C}olor (\textbf{CAIBC}) via a jointly optimized multi-branch architecture for text-based person retrieval. CAIBC contains three branches including an RGB branch, a grayscale (GRS) branch and a color (CLR) branch. Besides, with the aim of making full use of all-round information in a balanced and effective way, a mutual learning mechanism is employed to enable the three branches which attend to varied aspects of information to communicate with and learn from each other. Extensive experimental analysis is carried out to evaluate our proposed CAIBC method on the CUHK-PEDES and RSTPReid datasets in both \textbf{supervised} and \textbf{weakly supervised} text-based person retrieval settings, which demonstrates that CAIBC significantly outperforms existing methods and achieves the state-of-the-art performance on all the three tasks.

preprint2022arXiv

CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words

To accelerate software development, developers frequently search and reuse existing code snippets from a large-scale codebase, e.g., GitHub. Over the years, researchers proposed many information retrieval based models for code search, but they fail to connect the semantic gap between query and code. An early successful deep learning based model DeepCS solved this issue by learning the relationship between pairs of code methods and corresponding natural language descriptions. Two major advantages of DeepCS are the capability of understanding irrelevant/noisy keywords and capturing sequential relationships between words in query and code. In this paper, we proposed an IR-based model CodeMatcher that inherits the advantages of DeepCS, while it can leverage the indexing technique in the IR-based model to accelerate the search response time substantially. CodeMatcher first collects metadata for query words to identify irrelevant/noisy ones, then iteratively performs fuzzy search with important query words on the codebase that is indexed by the Elasticsearch tool, and finally reranks a set of returned candidate code according to how the tokens in the candidate code snippet sequentially matched the important words in a query. We verified its effectiveness on a large-scale codebase with ~41k repositories. Experimental results showed that CodeMatcher achieves an MRR of 0.60, outperforming DeepCS, CodeHow, and UNIF by 82%, 62%, and 46% respectively. Our proposed model is over 1.2k times faster than DeepCS. Moreover, CodeMatcher outperforms GitHub and Google search by 46% and 33% respectively in terms of MRR. We also observed that: fusing the advantages of IR-based and DL-based models is promising; improving the quality of method naming helps code search, since method name plays an important role in connecting query and code.

preprint2022arXiv

Enhancing Marine Data Transmission with Socially-Aware Resilient Vessel Networks

With the multi-dimensional exploration towards oceans, enormous sensing data has been generated with significant volume, velocity, variety and heterogeneity. The resulted Big Marine Data (BMD) thus issue unprecedented architectural challenges on existing marine communication systems. Current dominant marine communication technologies, e.g., shore-based cellular stations, high frequency radio, and expensive satellites, extremely suffer from short coverage, low bandwidth, insecurity, and unavailable cross-domain transmission. In this paper, Resilient Vessel Network (RVN) is proposed to fundamentally enhance BMD transmission. RVNs with widespread self-organized vessels and opportunistic connections reveal advantages of ubiquity, resilience, low cost and cross-domain transmission. To efficiently manage opportunistic vessel-to-vessel (V2V) connections for optimal routing, Social Network Analysis (SNA) on historical vessel interactions is applied for vessel familiarity measurement and community detection. The performance of the proposed community-based routing (CBR) is comprehensively evaluated with real datasets of fishing vessel trajectories. It is demonstrated that CBR achieves much lower transmission cost with comparable delivery ratio compared to typical routing algorithms.

preprint2022arXiv

Graph Decipher: A transparent dual-attention graph neural network to understand the message-passing mechanism for the node classification

Graph neural networks can be effectively applied to find solutions for many real-world problems across widely diverse fields. The success of graph neural networks is linked to the message-passing mechanism on the graph, however, the message-aggregating behavior is still not entirely clear in most algorithms. To improve functionality, we propose a new transparent network called Graph Decipher to investigate the message-passing mechanism by prioritizing in two main components: the graph structure and node attributes, at the graph, feature, and global levels on a graph under the node classification task. However, the computation burden now becomes the most significant issue because the relevance of both graph structure and node attributes are computed on a graph. In order to solve this issue, only relevant representative node attributes are extracted by graph feature filters, allowing calculations to be performed in a category-oriented manner. Experiments on seven datasets show that Graph Decipher achieves state-of-the-art performance while imposing a substantially lower computation burden under the node classification task. Additionally, since our algorithm has the ability to explore the representative node attributes by category, it is utilized to alleviate the imbalanced node classification problem on multi-class graph datasets.

preprint2022arXiv

Identification of new classical Be stars from the LAMOST MRS survey

Be stars are B-type main-sequence stars that display broad Balmer emission lines in their spectra. Identification of Be population is essential to further examine the formation and evolutionary models. We report the detection of classical Be (CBe) stars from observations with the Large sky Area Multi-Object fiber Spectroscopic Telescope Medium Resolution Survey of Date Release 7 (LAMOST MRS DR7). We used a deep convolutional neural network, the ResNet, with an 18-layer module to examine the morphology of the H alpha profile. We identified 1,162 candidate Be stars from the collection of 2,260,387 spectra for 789,918 stars in the database. The ResNet network achieves a Be star classification accuracy of 99.5%. Among the detections, 151 of these are prior known Be stars cross-matched from the literature. By applying a three-step test, we identified 183 new CBe stars. We find that 41 CBe stars are members of known open clusters. Based upon an investigation of the kinematics of the identified CBe stars from the Gaia EDR3 astrometric solutions, we identified 16 new runaways. These new identifications will provide a reference for future follow-ups to further investigate their physical properties.

preprint2022arXiv

Immunofluorescence Capillary Imaging Segmentation: Cases Study

Nonunion is one of the challenges faced by orthopedics clinics for the technical difficulties and high costs in photographing interosseous capillaries. Segmenting vessels and filling capillaries are critical in understanding the obstacles encountered in capillary growth. However, existing datasets for blood vessel segmentation mainly focus on the large blood vessels of the body, and the lack of labeled capillary image datasets greatly limits the methodological development and applications of vessel segmentation and capillary filling. Here, we present a benchmark dataset, named IFCIS-155, consisting of 155 2D capillary images with segmentation boundaries and vessel fillings annotated by biomedical experts, and 19 large-scale, high-resolution 3D capillary images. To obtain better images of interosseous capillaries, we leverage state-of-the-art immunofluorescence imaging techniques to highlight the rich vascular morphology of interosseous capillaries. We conduct comprehensive experiments to verify the effectiveness of the dataset and the benchmarking deep learning models (\eg UNet/UNet++ and the modified UNet/UNet++). Our work offers a benchmark dataset for training deep learning models for capillary image segmentation and provides a potential tool for future capillary research. The IFCIS-155 dataset and code are all publicly available at \url{https://github.com/ncclabsustech/IFCIS-55}.

preprint2022arXiv

Implementation of an Automated Learning System for Non-experts

Automated machine learning systems for non-experts could be critical for industries to adopt artificial intelligence to their own applications. This paper detailed the engineering system implementation of an automated machine learning system called YMIR, which completely relies on graphical interface to interact with users. After importing training/validation data into the system, a user without AI knowledge can label the data, train models, perform data mining and evaluation by simply clicking buttons. The paper described: 1) Open implementation of model training and inference through docker containers. 2) Implementation of task and resource management. 3) Integration of Labeling software. 4) Implementation of HCI (Human Computer Interaction) with a rebuilt collaborative development paradigm. We also provide subsequent case study on training models with the system. We hope this paper can facilitate the prosperity of our automated machine learning community from industry application perspective. The code of the system has already been released to GitHub (https://github.com/industryessentials/ymir).

preprint2022arXiv

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

LAMOST-MRS-B is one of the sub-surveys of LAMOST medium-resolution (R~7500) spectroscopic survey. It aims at studying the statistical properties (e.g., binary fraction, orbital period distribution, mass ratio distribution) of binary stars and exotic stars. We intend to observe about 30000 stars (10 mag <= G <= 14.5 mag) with at least 10 visits in five years. We first planned to observe 25 plates around the galactic plane in 2018. Then the plates were reduced to 12 in 2019 because of the limitation of observation. At the same time, two new plates located at the high galactic latitude were added to explore binary properties influenced by the different environments. In this survey project, we set the identified exotic and low-metallicity stars with the highest observation priorities. For the rest of the selected stars, we gave higher priority to the relatively brighter stars in order to obtain high-quality spectra as many as possible. Spectra of 49129 stars have been obtained in LAMOST-MRS-B field and released in DR8, of which 28828 and 3375 stars have been visited more than twice and ten times with SNR >= 10, respectively. Most of the sources are B-, A-, and F-type stars with 0.6 < [Fe/H] < 0.4 dex. We also obtain 347 identified variable and exotic stars and about 250 stars with [Fe/H] < 1 dex. We measure radial velocities (RVs) by using 892233 spectra of the stars. The uncertainties of RV achieve about 1 km/s and 10 km/s1 for 95% of late- and early-type stars, respectively. The datasets presented in this paper are available at http://www.doi.org/10.57760/sciencedb.j00113.00035.

preprint2022arXiv

Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold

The core problem of text-based person retrieval is how to bridge the heterogeneous gap between multi-modal data. Many previous approaches contrive to learning a latent common manifold mapping paradigm following a \textbf{cross-modal distribution consensus prediction (CDCP)} manner. When mapping features from distribution of one certain modality into the common manifold, feature distribution of the opposite modality is completely invisible. That is to say, how to achieve a cross-modal distribution consensus so as to embed and align the multi-modal features in a constructed cross-modal common manifold all depends on the experience of the model itself, instead of the actual situation. With such methods, it is inevitable that the multi-modal data can not be well aligned in the common manifold, which finally leads to a sub-optimal retrieval performance. To overcome this \textbf{CDCP dilemma}, we propose a novel algorithm termed LBUL to learn a Consistent Cross-modal Common Manifold (C$^{3}$M) for text-based person retrieval. The core idea of our method, just as a Chinese saying goes, is to `\textit{san si er hou xing}', namely, to \textbf{Look Before yoU Leap (LBUL)}. The common manifold mapping mechanism of LBUL contains a looking step and a leaping step. Compared to CDCP-based methods, LBUL considers distribution characteristics of both the visual and textual modalities before embedding data from one certain modality into C$^{3}$M to achieve a more solid cross-modal distribution consensus, and hence achieve a superior retrieval accuracy. We evaluate our proposed method on two text-based person retrieval datasets CUHK-PEDES and RSTPReid. Experimental results demonstrate that the proposed LBUL outperforms previous methods and achieves the state-of-the-art performance.

preprint2022arXiv

Mass-Ratio Distribution of Binaries From the LAMOST-MRS Survey

Binary evolution leads to the formation of important objects crucial to the development of astrophysics, but the statistical properties of binary populations are still poorly understood. The LAMOST-MRS has provided a large sample of stars to study the properties of binary populations, especially for the mass ratio distributions and the binary fractions. We have devised a Peak Amplitude Ratio (PAR) approach to derive the mass ratio of a binary system based on results obtained from its spectrum. By computing a cross-correlation function (CCF), we established a relationship between the derived mass ratio and the PARs of the binary systems. By utilizing spectral observations obtained from LAMSOT DR6 & DR7, we applied the PAR approach to form distributions of the derived mass ratio of the binary systems to the spectral types. We selected the mass ratio within the range of $0.6-1.0$ for investigating the mass-ratio distribution. Through a power-law fitting, we obtained the power index $γ$ values of $-0.42\pm0.27$, $0.03\pm0.12$, and $2.12\pm0.19$ for A-, F-, and G-type stars identified in the sample, respectively. The derived $γ$-values display an increasing trend toward lower primary star masses, and G-type binaries tend to be more in twins. The close binary fractions (for $P\lesssim 150\,{\rm d}$ and $q\gtrsim 0.6$) in our sample for A, F and G binaries are $7.6\pm 0.5 \%$, $4.9\pm 0.2 \%$ and $3.7 \pm 0.1 \%$, respectively. Note that the PAR approach can be applied to large spectroscopic surveys of stars.

preprint2022arXiv

Milky Way Mass with K Giants and BHB Stars Using LAMOST, SDSS/SEGUE, and Gaia: 3D Spherical Jeans Equation and Tracer Mass Estimator

We measure the enclosed Milky Way mass profile to Galactocentric distances of $\sim70$ and $\sim50$ kpc using the smooth, diffuse stellar halo samples of Bird et al. The samples are LAMOST and SDSS/SEGUE K giants (KG) and SDSS/SEGUE blue horizontal branch (BHB) stars with accurate metallicities. The 3D kinematics are available through LAMOST and SDSS/SEGUE distances and radial velocities and {\it Gaia} DR2 proper motions. Two methods are used to estimate the enclosed mass: 3D spherical Jeans equation and Evans et al. tracer mass estimator (TME). We remove substructure via the Xue et al. method based on integrals of motion. We evaluate the uncertainties on our estimates due to random sampling noise, systematic distance errors, the adopted density profile, and non-virialization and non-spherical effects of the halo. The tracer density profile remains a limiting systematic in our mass estimates, although within these limits we find reasonable agreement across the different samples and the methods applied. Out to $\sim70$ and $\sim50$ kpc, the Jeans method yields total enclosed masses of $4.3\pm0.95$ (random) $\pm0.6$ (systematic) $\times10^{11}$ M$_\odot$ and $4.1\pm1.2$ (random) $\pm0.6$ (systematic) $\times10^{11}$ M$_\odot$ for the KG and BHB stars, respectively. For the KG and BHB samples we find a dark matter virial mass of $M_{200}=0.55^{+0.15}_{-0.11}$ (random) $\pm0.083$ (systematic) $\times10^{12}$ M$_\odot$ and $M_{200}=1.00^{+0.67}_{-0.33}$ (random) $\pm0.15$ (systematic) $\times10^{12}$ M$_\odot$, respectively.

preprint2022arXiv

MixNN: A design for protecting deep learning models

In this paper, we propose a novel design, called MixNN, for protecting deep learning model structure and parameters. The layers in a deep learning model of MixNN are fully decentralized. It hides communication address, layer parameters and operations, and forward as well as backward message flows among non-adjacent layers using the ideas from mix networks. MixNN has following advantages: 1) an adversary cannot fully control all layers of a model including the structure and parameters, 2) even some layers may collude but they cannot tamper with other honest layers, 3) model privacy is preserved in the training phase. We provide detailed descriptions for deployment. In one classification experiment, we compared a neural network deployed in a virtual machine with the same one using the MixNN design on the AWS EC2. The result shows that our MixNN retains less than 0.001 difference in terms of classification accuracy, while the whole running time of MixNN is about 7.5 times slower than the one running on a single virtual machine.

preprint2022arXiv

On-demand Integrated Quantum Memory for Polarization Qubits

Photonic polarization qubits are widely used in quantum computation and quantum communication due to the robustness in transmission and the easy qubit manipulation. An integrated quantum memory for polarization qubits is a fundamental building block for large-scale integrated quantum networks. However, on-demand storing polarization qubits in an integrated quantum memory is a long-standing challenge due to the anisotropic absorption of solids and the polarization-dependent features of microstructures. Here we demonstrate a reliable on-demand quantum memory for polarization qubits, using a depressed-cladding waveguide fabricated in a 151Eu3+: Y2SiO5 crystal. The site-2 151Eu3+ ions in Y2SiO5 crystal provides a near-uniform absorption for arbitrary polarization states and a new pump sequence is developed to prepare a wideband and enhanced absorption profile. A fidelity of 99.4\pm0.6% is obtained for the qubit storage process with an input of 0.32 photons per pulse, together with a storage bandwidth of 10 MHz. This reliable integrated quantum memory for polarization qubits reveals the potential for use in the construction of integrated quantum networks.

preprint2022arXiv

On-demand multimode optical storage in a laser-written on-chip waveguide

Quantum memory is a fundamental building block for large-scale quantum networks. On-demand optical storage with a large bandwidth, a high multimode capacity and an integrated structure simultaneously is crucial for practical application. However, this has not been demonstrated yet. Here, we fabricate an on-chip waveguide in a $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ crystal with insertion losses of 0.2 dB, and propose a novel pumping scheme to enable spin-wave atomic frequency comb (AFC) storage with a bandwidth of 11 MHz inside the waveguide. Based on this, we demonstrate the storage of 200 temporal modes using the AFC scheme and conditional on-demand storage of 100 temporal modes using the spin-wave AFC scheme. The interference visibility between the readout light field and the reference light field is $99.0\% \pm 0.6\%$ and $97\% \pm 3\%$ for AFC and spin-wave AFC storage, respectively, indicating the coherent nature of this low-loss, multimode and integrated storage device.

preprint2022arXiv

Overview of the LAMOST survey in the first decade

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST), also known as the Guoshoujing Telescope, is a major national scientific facility for astronomical research located in Xinglong, China. Beginning with a pilot survey in 2011, LAMOST has been surveying the night sky for more than 10 years. The LAMOST survey covers various objects in the Universe, from normal stars to peculiar ones, from the Milky Way to other galaxies, and from stellar black holes and their companions to quasars that ignite ancient galaxies. Until the latest data release 8, the LAMOST survey has released spectra for more than 10 million stars, ~220,000 galaxies, and ~71,000 quasars. With this largest celestial spectra database ever constructed, LAMOST has helped astronomers to deepen their understanding of the Universe, especially for our Milky Way galaxy and the millions of stars within it. In this article, we briefly review the characteristics, observations, and scientific achievements of LAMOST. In particular, we show how astrophysical knowledge about the Milky Way has been improved by LAMOST data.

preprint2022arXiv

Planets Across Space and Time (PAST). III. Morphology of the Planetary Radius Valley as a Function of Stellar Age and Metallicity in the Galactic Context Revealed by the LAMOST-Gaia-Kepler Sample

The radius valley, a dip in the radius distribution of exoplanets at ~1.9 Earth radii separates compact rocky Super-Earths and Sub-Neptunes with lower density. Various hypotheses have been put forward to explain the radius valley. Characterizing the radius valley morphology and its correlation to stellar properties will provide crucial observation constraints on its origin mechanism and deepen the understanding of planet formation and evolution. In this paper, the third part of the Planets Across the Space and Time (PAST) series, using the LAMOST-Gaia-Kepler catalog, we perform a systematical investigation into how the radius valley morphology varies in the Galactic context, i.e., thin/thick galactic disks, stellar age and metallicity abundance ([Fe/H] and [alpha/Fe]). We find that (1) The valley becomes more prominent with the increase of both age and [Fe/H]. (2) The number ratio of super-Earths to sub-Neptunes monotonically increases with age but decreases with [Fe/H] and [alpha/Fe]. (3) The average radius of planets above the valley (2.1-6 Earth radii) decreases with age but increases with [Fe/H]. (4) In contrast, the average radius of planets below the valley (R < 1.7 Earth radii) is broadly independent on age and metallicity. Our results demonstrate that the valley morphology as well as the whole planetary radius distribution evolves on a long timescale of giga-years, and metallicities (not only Fe but also other metal elements, e.g., Mg, Si, Ca, Ti) play important roles in planet formation and in the long term planetary evolution.

preprint2022arXiv

Precognition in Task-oriented Dialogue Understanding: Posterior Regularization by Future Context

Task-oriented dialogue systems have become overwhelmingly popular in recent researches. Dialogue understanding is widely used to comprehend users' intent, emotion and dialogue state in task-oriented dialogue systems. Most previous works on such discriminative tasks only models current query or historical conversations. Even if in some work the entire dialogue flow was modeled, it is not suitable for the real-world task-oriented conversations as the future contexts are not visible in such cases. In this paper, we propose to jointly model historical and future information through the posterior regularization method. More specifically, by modeling the current utterance and past contexts as prior, and the entire dialogue flow as posterior, we optimize the KL distance between these distributions to regularize our model during training. And only historical information is used for inference. Extensive experiments on two dialogue datasets validate the effectiveness of our proposed method, achieving superior results compared with all baseline models.

preprint2022arXiv

Recursive Least Squares Policy Control with Echo State Network

The echo state network (ESN) is a special type of recurrent neural networks for processing the time-series dataset. However, limited by the strong correlation among sequential samples of the agent, ESN-based policy control algorithms are difficult to use the recursive least squares (RLS) algorithm to update the ESN's parameters. To solve this problem, we propose two novel policy control algorithms, ESNRLS-Q and ESNRLS-Sarsa. Firstly, to reduce the correlation of training samples, we use the leaky integrator ESN and the mini-batch learning mode. Secondly, to make RLS suitable for training ESN in mini-batch mode, we present a new mean-approximation method for updating the RLS correlation matrix. Thirdly, to prevent ESN from over-fitting, we use the L1 regularization technique. Lastly, to prevent the target state-action value from overestimation, we employ the Mellowmax method. Simulation results show that our algorithms have good convergence performance.

preprint2022arXiv

Rigorous proof of slightly nonlinear Jeans instability in the expanding Newtonian universe

Due to the nonlinearity of the Euler{Poisson equations, it is possible that the nonlinear Jeans instability may lead to a faster density growing rate than the rate in the standard theory of linearized Jeans instability, which motivates us to study the nonlinear Jeans instability. The aim of this article is to develop a method proving the Jeans instability for slightly nonlinear Euler-Poisson equations in the expanding Newtonian universe. The standard proofs of the Jeans instability rely on the Fourier analysis. However, it is difficult to generalize Fourier method to a nonlinear setting, and thus there is no result in the nonlinear analysis of Jeans instability. We firstly develop a non-Fourier-based method to reprove the linearized Jeans instability in the expanding Newtonian universe. Secondly, we generalize this idea to a slightly nonlinear case. This method relies on the Cauchy problem of the Fuchsian system due to the recent developments of this system in mathematics. The fully nonlinear Jeans instability for the Euler-Poisson and Einstein-Euler equations are in progress.

preprint2022arXiv

S4OD: Semi-Supervised learning for Single-Stage Object Detection

Single-stage detectors suffer from extreme foreground-background class imbalance, while two-stage detectors do not. Therefore, in semi-supervised object detection, two-stage detectors can deliver remarkable performance by only selecting high-quality pseudo labels based on classification scores. However, directly applying this strategy to single-stage detectors would aggravate the class imbalance with fewer positive samples. Thus, single-stage detectors have to consider both quality and quantity of pseudo labels simultaneously. In this paper, we design a dynamic self-adaptive threshold (DSAT) strategy in classification branch, which can automatically select pseudo labels to achieve an optimal trade-off between quality and quantity. Besides, to assess the regression quality of pseudo labels in single-stage detectors, we propose a module to compute the regression uncertainty of boxes based on Non-Maximum Suppression. By leveraging only 10% labeled data from COCO, our method achieves 35.0% AP on anchor-free detector (FCOS) and 32.9% on anchor-based detector (RetinaNet).

preprint2022arXiv

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Whiting 1 is a faint and young globular cluster in the halo of the Milky Way, and was suggested to have originated in the Sagittarius spherical dwarf galaxy (Sgr dSph). In this paper, we use the deep DESI Legacy Imaging Surveys to explore tentative spatial connection between Whiting 1 and the Sgr dSph. We redetermine the fundamental parameters of Whiting 1 and use the best-fitting isochrone (age $τ$=6.5 Gyr, metalicity Z=0.005 and $\rm d_{\odot}$=26.9 kpc) to construct a theoretical matched filter for the extra-tidal features searching. Without any smooth technique to the matched filter density map, we detect a round-shape feature with possible leading and trailing tails on either side of the cluster. This raw image is not totally new compared to old discoveries, but confirms that no more large-scale features can be detected under a depth of r<=22.5 mag. In our results, the whole feature stretches 0.1-0.2 degree along the orbit of Whiting 1, which gives a much larger area than the cluster core. The tails on both sides of the cluster align along the orbital direction of the Sgr dSph as well as the cluster itself, which implies that these debris are probably stripped remnants of Whiting 1 by the Milky Way.

preprint2022arXiv

Sparse-Dyn: Sparse Dynamic Graph Multi-representation Learning via Event-based Sparse Temporal Attention Network

Dynamic graph neural networks have been widely used in modeling and representation learning of graph structure data. Current dynamic representation learning focuses on either discrete learning which results in temporal information loss or continuous learning that involves heavy computation. In this work, we proposed a novel dynamic graph neural network, Sparse-Dyn. It adaptively encodes temporal information into a sequence of patches with an equal amount of temporal-topological structure. Therefore, while avoiding the use of snapshots which causes information loss, it also achieves a finer time granularity, which is close to what continuous networks could provide. In addition, we also designed a lightweight module, Sparse Temporal Transformer, to compute node representations through both structural neighborhoods and temporal dynamics. Since the fully-connected attention conjunction is simplified, the computation cost is far lower than the current state-of-the-arts. Link prediction experiments are conducted on both continuous and discrete graph datasets. Through comparing with several state-of-the-art graph embedding baselines, the experimental results demonstrate that Sparse-Dyn has a faster inference speed while having competitive performance.

preprint2022arXiv

The Eclipsing Binaries from the LAMOST Medium-resolution Survey.III. A High-precision Empirical Stellar Mass Library

High-precision stellar mass and radius measured directly from binaries can effectively calibrate the stellar models. However, such a database containing full spectral types and large range of metallicity is still not fully established. A continuous effort of data collecting and analysis are requested to complete the database. In this work, we provide a catalog containing 184 binaries with independent atmospheric parameters and accurate masses and radii as the benchmark of stellar mass and radius. The catalog contains 56 new detached binaries from LAMOST Medium-resolution spectroscopic (MRS) survey and 128 detached eclipsing binaries compiled from previous studies. We obtain the orbital solutions of the new detached binaries with uncertainties of masses and radii smaller than 5%. These new samples densify the distribution of metallicity of the high-precision stellar mass library and add 9 hot stars with Teff>8000 K. Comparisons show that these samples well agree with the PARSEC isochrones in Teff-logg-mass-radius-luminosity space. We compare mass and radius estimates from isochrone and SED fitting, respectively, with those from the binary orbital solution. We find that the precision of the stellar-model dependent mass estimates is >10% and the precision of the radius estimates based on atmospheric parameters is >15%. These give a general view of the uncertainty of the usual approaches to estimate stellar mass and radius.

preprint2022arXiv

The North/South Asymmetry of the Galaxy: Possible Connection to the Vertical Phase Space Snail

The Galaxy is found to be in disequilibrium based on recent findings of the North/South (N/S) asymmetry and the phase mixing signatures, such as a phase spiral (snail) structure in the vertical phase space ($z-V_{z}$). We show that the N/S asymmetry in a tracer population of dwarfs may be quantitatively modeled with a simple phase snail model superimposed on a smooth equilibrium background. As the phase snail intersects with the $z$ axis, the number density is enhanced, and the velocity dispersion ($σ_{z}$) is decreased relative to the other side of the Galactic plane. Fitting only to the observed asymmetric N/S $σ_{z}$ profiles, we obtain reasonable parameters for the phase space snail and the potential utilized in modeling the background, despite the complex dependence of the model on the potential parameters and the significant selection effects of the data. Both the snail shape and the N/S number density difference given by our best-fit model are consistent with previous observations. The equilibrium background implies a local dark matter density of $0.0151^{+0.0050}_{-0.0051}$ ${\rm M}_{\odot}\,{\rm pc}^{-3}$. The vertical bulk motion of our model is similar to the observation, but with a $\sim$1.2 $\rm km\,s^{-1}$ shift. Our work demonstrates the strong correlation between the phase space snail and the N/S asymmetry. Future observational constraints will facilitate more comprehensive snail models to unravel the Milky Way potential and the perturbation history encoded in the snail feature.

preprint2021arXiv

A Catalog of LAMOST Variable Sources Based on Time-domain Photometry of ZTF

The identification and analysis of different variable sources is a hot issue in astrophysical research. The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) spectroscopic survey has accumulated massive spectral data but contains no information about variable sources. Although a few related studies present variable source catalogs for the LAMOST, the studies still have a few deficiencies regarding the type and number of variable sources identified. In this study, we presented a statistical modeling approach to identify variable source candidates. We first crossed the Kepler, Sloan Digital Sky Survey (SDSS), and Zwicky Transient Facility (ZTF) catalogs to obtain light curves data of variable and non-variable sources. The data are then modeled statistically using commonly used variability parameters, respectively. And then, an optimal variable source identification model is determined using the Receiver Operating Characteristic (ROC) curve and four credible evaluation indices such as precision, accuracy, recall, and F1score. Based on this identification model, a catalog of LAMOST variable sources (including 631,769 variable source candidates with a probability greater than 95% and so on) is obtained. To validate the correctness of the catalog, we performed a two-by-two cross-comparison with the GAIA catalog and other published variable source catalogs. We achieved the correct rate ranging from 50% to 100%. Among the 123,756 sources cross-matched, our variable source catalog identifies 85,669 with a correct rate of 69%, which indicates that the variable source catalog presented in this study is credible.

preprint2021arXiv

Binary fraction of O and B-type stars from LAMOST data

Binary stars plays important role in the evolution of stellar populations . The intrinsic binary fraction ($f_{bin}$) of O and B-type (OB) stars in LAMOST DR5 was investigated in this work. We employed a cross-correlation approach to estimate relative radial velocities for each of the stellar spectra. The algorithm described by \cite{2013A&A...550A.107S} was implemented and several simulations were made to assess the performance of the approach. Binary fraction of the OB stars are estimated through comparing the uni-distribution between observations and simulations with the Kolmogorov-Smirnov tests. Simulations show that it is reliable for stars most of whom have $6,7$ and $8$ repeated observations. The uncertainty of orbital parameters of binarity become larger when observational frequencies decrease. By adopting the fixed power exponents of $π=-0.45$ and $κ=-1$ for period and mass ratio distributions, respectively, we obtain that $f_{bin}=0.4_{-0.06}^{+0.05}$ for the samples with more than 3 observations. When we consider the full samples with at least 2 observations, the binary fraction turns out to be $0.37_{-0.03}^{+0.03}$. These two results are consistent with each other in $1σ$.

preprint2021arXiv

BLOCKEYE: Hunting For DeFi Attacks on Blockchain

Decentralized finance, i.e., DeFi, has become the most popular type of application on many public blockchains (e.g., Ethereum) in recent years. Compared to the traditional finance, DeFi allows customers to flexibly participate in diverse blockchain financial services (e.g., lending, borrowing, collateralizing, exchanging etc.) via smart contracts at a relatively low cost of trust. However, the open nature of DeFi inevitably introduces a large attack surface, which is a severe threat to the security of participants funds. In this paper, we proposed BLOCKEYE, a real-time attack detection system for DeFi projects on the Ethereum blockchain. Key capabilities provided by BLOCKEYE are twofold: (1) Potentially vulnerable DeFi projects are identified based on an automatic security analysis process, which performs symbolic reasoning on the data flow of important service states, e.g., asset price, and checks whether they can be externally manipulated. (2) Then, a transaction monitor is installed offchain for a vulnerable DeFi project. Transactions sent not only to that project but other associated projects as well are collected for further security analysis. A potential attack is flagged if a violation is detected on a critical invariant configured in BLOCKEYE, e.g., Benefit is achieved within a very short time and way much bigger than the cost. We applied BLOCKEYE in several popular DeFi projects and managed to discover potential security attacks that are unreported before. A video of BLOCKEYE is available at https://youtu.be/7DjsWBLdlQU.

preprint2021arXiv

Future stability of the FLRW spacetime for a large class of perfect fluids

We establish the future non-linear stability of Friedmann-Lema\^ıtre-Robertson-Walker (FLRW) solutions to the Einstein-Euler equations of the universe filled with a large class of perfect fluids (the equations of state are allowed to be certain nonlinear or linear types both). Several previous results as specific examples can be covered in the results of this article. We emphasize that the future stability of FLRW metric for polytropic fluids with positive cosmological constant has been a difficult problem and can not be directly generalized from the previous known results. Our result in this article has not only covered this difficult case for the polytropic fluids, but also unified more types of fluids in a same scheme of proofs.

preprint2021arXiv

Generation of entanglement between a highly wave-packet-tunable photon and a spin-wave memory in cold atoms

Controls of waveforms (pulse durations) of single photons are important tasks for effectively interconnecting disparate atomic memories in hybrid quantum networks. So far, the waveform control of single photon that is entangled with an atomic memory remains unexplored. Here, we demonstrated control of waveform length of the photon that is entangled with an atomic spin-wave memory by varying light-atom interaction time in cold atoms. The Bell parameter S as a function of the duration of photon pulse is measured, which shows that violations of Bell equality can be achieved for the photon pulse in the duration range from 40 ns to 50 us, where, S=2.64+/-0.02 and S=2.26+/-0.05 for the 40-ns and 50-μs durations, respectively. The measured results show that S parameter decreases with the increase in the pulse duration. We confirm that the increase in photon noise probability per pulse with the pulse-duration is responsible for the S decrease.

preprint2021arXiv

LAMOST Time-Domain Survey: First Results of four $K$2 plates

From Oct. 2019 to Apr. 2020, LAMOST performs a time-domain spectroscopic survey of four $K$2 plates with both low- and med-resolution observations. The low-resolution spectroscopic survey gains 282 exposures ($\approx$46.6 hours) over 25 nights, yielding a total of about 767,000 spectra, and the med-resolution survey takes 177 exposures ($\approx$49.1 hours) over 27 nights, collecting about 478,000 spectra. More than 70%/50% of low-resolution/med-resolution spectra have signal-to-noise ratio higher than 10. We determine stellar parameters (e.g., $T_{\rm eff}$, log$g$, [Fe/H]) and radial velocity (RV) with different methods, including LASP, DD-Payne, and SLAM. In general, these parameter estimations from different methods show good agreement, and the stellar parameter values are consistent with those of APOGEE. We use the $Gaia$ DR2 RV data to calculate a median RV zero point (RVZP) for each spectrograph exposure by exposure, and the RVZP-corrected RVs agree well with the APOGEE data. The stellar evolutionary and spectroscopic masses are estimated based on the stellar parameters, multi-band magnitudes, distances and extinction values. Finally, we construct a binary catalog including about 2700 candidates by analyzing their light curves, fitting the RV data, calculating the binarity parameters from med-resolution spectra, and cross-matching the spatially resolved binary catalog from $Gaia$ EDR3. The LAMOST TD survey is expected to get breakthrough in various scientific topics, such as binary system, stellar activity, and stellar pulsation, etc.

preprint2021arXiv

Noise suppression in a temporal-multimode quantum memory entangled with a photon via asymmetrical photon-collection channel

Quantum interfaces (QIs) that generate entanglement between a multimode atomic memory and a photon forms a multiplexed repeater node and hold promise to greatly improve quantum repeater rates. Recently, the temporal multimode spin-wave memory that is entangled with a photon has been demonstrated with cold atoms. However, due to additional noise generated in multimode operation, the fidelity of spin-wave-photon entanglement significantly decreases with the mode number. So far, the improvement on temporal-multimode entanglement fidelity via suppressing the additional noise remains unexplored. Here, we propose and experimentally demonstrate a scheme that can suppress the additional noise of a temporally-multiplexed QI. The scheme uses an asymmetric channel to collect the photons coming and retrieving from the temporally-multiplexed QI. For making comparisons, we also set up a QI that uses symmetric channel for the photon collections. When the QIs store 14 modes, the measured Bell parameter S for the QIs using the asymmetric and the symmetric photon-collection channels are 2.36+/-0.03 and 2.24+/-0.04, respectively, showing that the QI using the asymmetric channel gives rise to a 3% increase in entanglement fidelity, i.e., a 1.7-fold decrease in the additional noise, compared with the QI using the symmetric one. On the other hand, the 14-mode entanglement QIs that use the asymmetric and symmetric collections preserve the violation of a Bell inequality for storage times up to 25 us and 20 us, respectively, showing that the asymmetric QI has a higher entanglement storage performance.

preprint2021arXiv

The Binarity of Early-type Stars from LAMOST Medium-resolution Spectroscopic Survey

Massive binaries play significant roles in many fields. Identification of massive stars, particularly massive binaries, is of great importance. In this paper, by adopting the technique of measuring the equivalent widths of several spectral lines, we identified 9,382 early-type stars from LAMOST medium-resolution survey and divided the sample into four groups, T1 ($\sim$O-B4), T2 ($\sim$B5), T3 ($\sim$B7), and T4 ($\sim$B8-A). The relative radial velocities $RV_{\rm rel}$ were calculated using the Maximum Likelihood Estimation. The stars with significant changes of $RV_{\rm rel}$ and at least larger than 15.57km s$^{-1}$ were identified as spectroscopic binaries. We found that the observed spectroscopic binary fractions for the four groups are $24.6\%\pm0.5\%$, $20.8\%\pm0.6\%$, $13.7\%\pm0.3\%$, and $7.4\%\pm0.3\%$, respectively. Assuming that orbital period ($P$) and mass ratio ($q$) have intrinsic distributions as $f(P) \propto P^π$ (1\textless$P$\textless1000 days) and $f(q) \propto q^κ$ (0.1\textless$q$\textless1), respectively, we conducted a series of Monte-Carlo simulations to correct observational biases for estimating the intrinsic multiplicity properties. The results show that the intrinsic binary fractions for the four groups are 68$\%\pm8\%$, 52$\%\pm3\%$, 44$\%\pm6\%$, and 44$\%\pm6\%$, respectively. The best estimated values for $π$ are -1$\pm0.1$, -1.1$\pm0.05$, -1.1$\pm0.1$, and -0.6$\pm0.05$, respectively. The $κ$ cannot be constrained for groups T1 and T2 and is -2.4$\pm0.3$ for group T3 and -1.6$\pm0.3$ for group T4. We confirmed the relationship of a decreasing trend in binary fractions towards late-type stars. No correlation between the spectral type and the orbital period distribution has been found yet, possibly due to the limitation of observational cadence.

preprint2021arXiv

The mass of the Milky Way out to 100 kpc using halo stars

We use a distribution function analysis to estimate the mass of the Milky Way out to 100 kpc using a large sample of halo stars. These stars are compiled from the literature, and the vast majority (~98%) have 6D phase-space information. We pay particular attention to systematic effects, such as the dynamical influence of the Large Magellanic Cloud (LMC), and the effect of unrelaxed substructure. The LMC biases the (pre-LMC infall) halo mass estimates towards higher values, while realistic stellar halos from cosmological simulations tend to underestimate the true halo mass. After applying our method to the Milky Way data we find a mass within 100 kpc of M(< 100 kpc) = 6.07 +/- 0.29 (stat.) +/- 1.21 (sys.) x 10^11 M_Sun. For this estimate, we have approximately corrected for the reflex motion induced by the LMC using the Erkal et al. model, which assumes a rigid potential for the LMC and MW. Furthermore, stars that likely belong to the Sagittarius stream are removed, and we include a 5% systematic bias, and a 20% systematic uncertainty based on our tests with cosmological simulations. Assuming the mass-concentration relation for Navarro-Frenk-White haloes, our mass estimate favours a total (pre-LMC infall) Milky Way mass of M_200c = 1.01 +/- 0.24 x 10^12 M_Sun, or (post-LMC infall) mass of M_200c = 1.16 +/- 0.24 x 10^12 M_Sun when a 1.5 x 10^11 M_Sun mass of a rigid LMC is included.

preprint2021arXiv

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

We developed a convolutional neural network (CNN) model to distinguish the double-lined spectroscopic binaries (SB2s) from others based on single exposure medium-resolution spectra ($R\sim 7,500$). The training set consists of a large set of mock spectra of single stars and binaries synthesized based on the MIST stellar evolutionary model and ATLAS9 atmospheric model. Our model reaches a novel theoretic false positive rate by adding a proper penalty on the negative sample (e.g., 0.12\% and 0.16\% for the blue/red arm when the penalty parameter $Λ=16$). Tests show that the performance is as expected and favors FGK-type Main-sequence binaries with high mass ratio ($q \geq 0.7$) and large radial velocity separation ($Δv \geq 50\,\mathrm{km\,s^{-1}}$). Although the real false positive rate can not be estimated reliably, validating on eclipsing binaries identified from Kepler light curves indicates that our model predicts low binary probabilities at eclipsing phases (0, 0.5, and 1.0) as expected. The color-magnitude diagram also helps illustrate its feasibility and capability of identifying FGK MS binaries from spectra. We conclude that this model is reasonably reliable and can provide an automatic approach to identify SB2s with period $\lesssim 10$ days. This work yields a catalog of binary probabilities for over 5 million spectra of 1 million sources from the LAMOST medium-resolution survey (MRS), and a catalog of 2198 SB2 candidates whose physical properties will be analyzed in our following-up paper. Data products are made publicly available at the journal as well as our Github website.

preprint2021arXiv

Two-dimensional charge density wave TaX$_2$ (X=S, Se, Te) from first principles

Transition metal dichalcogenides are rich in their structural phases, e.g. 1T-TaS2 and 1T-TaSe2 form charge density wave (CDW) under low temperature with interesting and exotic properties. Here, we present a systematic study of different structures in two-dimensional TaX2 (X=S, Se, Te) using density functional theory calculations with consideration of van der Waals interaction. All the normal phases present metal characteristics with various ground state and magnetic properties. The lattice reconstruction of CDW drastically affects the electronic and structural characteristics of 1T-TaS2 and 1T-TaSe2, leading to a transition from metal to insulator and an emergence of magnetic moment within periodic atomic clusters called the Star of David. The evaluated Heisenberg couplings indicate the weak ferromagnetic coupling between the clusters in monolayer. Furthermore, in bilayer commensurate CDW cases, we find intriguing phenomenon of the varying magnetic properties with different stacking orders. The magnetic moment in each layer disappears when two layers are coupled, but may sustain in certain stackings of interlayer antiferromagnetic configurations.

preprint2020arXiv

A Convolutional Neural Network-Based Low Complexity Filter

Convolutional Neural Network (CNN)-based filters have achieved significant performance in video artifacts reduction. However, the high complexity of existing methods makes it difficult to be applied in real usage. In this paper, a CNN-based low complexity filter is proposed. We utilize depth separable convolution (DSC) merged with the batch normalization (BN) as the backbone of our proposed CNN-based network. Besides, a weight initialization method is proposed to enhance the training performance. To solve the well known over smoothing problem for the inter frames, a frame-level residual mapping (RM) is presented. We analyze some of the mainstream methods like frame-level and block-level based filters quantitatively and build our CNN-based filter with frame-level control to avoid the extra complexity and artificial boundaries caused by block-level control. In addition, a novel module called RM is designed to restore the distortion from the learned residuals. As a result, we can effectively improve the generalization ability of the learning-based filter and reach an adaptive filtering effect. Moreover, this module is flexible and can be combined with other learning-based filters. The experimental results show that our proposed method achieves significant BD-rate reduction than H.265/HEVC. It achieves about 1.2% BD-rate reduction and 79.1% decrease in FLOPs than VR-CNN. Finally, the measurement on H.266/VVC and ablation studies are also conducted to ensure the effectiveness of the proposed method.

preprint2020arXiv

Anisotropy of the Milky Way's stellar halo using K giants from LAMOST and $Gaia$

The anisotropy parameter $β$ characterizes the extent to which orbits in stellar systems are predominantly radial or tangential, and is likely to constrain, for the stellar halo of the Milky Way, scenarios for its formation and evolution. We have measured the anisotropy $β$ as a function of Galactocentric radius from $5-100$ kpc for over 8600 metal poor ([Fe/H] $<-1.3$) halo K giants from the LAMOST catalog with line-of-sight velocities and distances, matched to proper motions from the second $Gaia$ data release. We construct full 6-D positions and velocities for the K giants to directly measure the 3 components of the velocity dispersion $(σ_r, σ_θ, σ_ϕ)$ (in spherical coordinates). We find that the orbits in the halo are radial over our full Galactocentric distance range reaching over 100 kpc. The anisotropy remains remarkably unchanged with Galactocentric radius from approximately 5 to 25 kpc, with an amplitude that depends on the metallicity of the stars, dropping from $β\approx 0.9$ for $-1.8 \leq$ [Fe/H] $< -1.3$ (for the bulk of the stars) to $β\approx 0.6$ for the lowest metallicities ([Fe/H] $< -1.8$). Considering our sample as a whole, $β\approx0.8$ and, beyond 25 kpc, the orbits gradually become less radial and anisotropy decreases to $β<0.3$ past 100 kpc. Within 8 kpc, $β<0.8$. The measurement of anisotropy is affected by substructure and streams, particularly beyond a Galactocentric distance of approximately 25 kpc, where the Sagittarius stream is prominent in the data. These results are complimentary to recent analysis of simulations by Loebman et al. and of SDSS/$Gaia$ DR1 data by Belokurov et al.

preprint2020arXiv

Automatic Lumbar Spinal CT Image Segmentation with a Dual Densely Connected U-Net

The clinical treatment of degenerative and developmental lumbar spinal stenosis (LSS) is different. Computed tomography (CT) is helpful in distinguishing degenerative and developmental LSS due to its advantage in imaging of osseous and calcified tissues. However, boundaries of the vertebral body, spinal canal and dural sac have low contrast and hard to identify in a CT image, so the diagnosis depends heavily on the knowledge of expert surgeons and radiologists. In this paper, we develop an automatic lumbar spinal CT image segmentation method to assist LSS diagnosis. The main contributions of this paper are the following: 1) a new lumbar spinal CT image dataset is constructed that contains 2393 axial CT images collected from 279 patients, with the ground truth of pixel-level segmentation labels; 2) a dual densely connected U-shaped neural network (DDU-Net) is used to segment the spinal canal, dural sac and vertebral body in an end-to-end manner; 3) DDU-Net is capable of segmenting tissues with large scale-variant, inconspicuous edges (e.g., spinal canal) and extremely small size (e.g., dural sac); and 4) DDU-Net is practical, requiring no image preprocessing such as contrast enhancement, registration and denoising, and the running time reaches 12 FPS. In the experiment, we achieve state-of-the-art performance on the lumbar spinal image segmentation task. We expect that the technique will increase both radiology workflow efficiency and the perceived value of radiology reports for referring clinicians and patients.

preprint2020arXiv

Characterising the Performance of High-Speed Data Converters for RFSoC-based Radio Astronomy Receivers

RF system-on-chip (RFSoC) devices provide the potential for implementing a complete radio astronomy receiver on a single board, but performance of the integrated analogue-to-digital converters is critical. We have evaluated the performance of the data converters in the Xilinx ZU28DR RFSoC, which are 12-bit, 8-fold interleaved converters with a maximum sample speed of 4.096 Giga-sample per second (GSPS). We measured the spurious-free dynamic range (SFDR), signal-to-noise and distortion (SINAD), effective number of bits (ENOB), intermodulation distortion (IMD) and cross-talk between adjacent channels over the bandwidth of 2.048 GHz. We both captured data for off-line analysis with floating-point arithmetic, and implemented a real-time integer arithmetic spectrometer on the RFSoC. The performance of the ADCs is sufficient for radio astronomy applications and close to the vendor specifications in most of the scenarios. We have carried out spectral integrations of up to 100 s and stability tests over tens of hours and find thermal noise-limited performance over these timescales.

preprint2020arXiv

Differential rotation of the halo traced by the K-giant stars

We use K-giant stars selected from the LAMOST DR5 to study the variation of the rotational velocity of the galactic halo at different space positions. Modelling the rotational velocity distribution with both the halo and disk components, we find that the rotational velocity of the halo population decreases almost linearly with increasing vertical distance to the galactic disk plane, $Z$, at fixed galactocentric radius, $R$. The samples are separated into two parts with $6<R<12$ kpc and $12<R<20$ kpc. We derive that the decreasing rates along $Z$ for the two subsamples are $-3.07\pm0.63$ and $-1.89\pm0.37$ km s$^{-1}$ kpc$^{-1}$, respectively. Compared with the TNG simulations, we suggest that this trend is probably caused by the interaction between the disk and halo. The results from the simulations show that only the oblate halo can provide a decreasing rotational velocity with an increasing $Z$. This indicates that the Galactic halo is oblate with galactocentric radius $R<20$ kpc. On the other hand, the flaring of the disk component (mainly the thick disk) is clearly traced by this study, with $R$ between 12 and 20 kpc, the disk can vertically extend to $6\sim10$ kpc above the disk plane. What is more interesting is that, we find the Gaia-Enceladus-Sausage (GES) component has a significant contribution only in the halo with $R<12$ kpc, i.e. a fraction of 23$-$47\%. While in the outer subsample, the contribution is too low to be well constrained.

preprint2020arXiv

Discovery of two nearby post-T Tauri stellar associations

In this work we report the discovery of 2 new stellar associations in close vicinity of the Sun at roughly 180 and 150 pc. These two associations, named as u Tau assoc and e Tau assoc, were detected based on their clustering in a multi-dimensional parameter space including $α$, $δ$, $μ_α$ , $μ_δ$ and $π$ of Gaia. The fitting of pre-main-sequence model isochrones in their color-magnitude diagrams suggests that the two associations are of about 50 Myr old and the group members lower than ${\sim}$0.8 $M_{\odot}$ are at the stage of post-T Tauri.

preprint2020arXiv

Hyperfine Structure and Coherent Dynamics of Rare Earth Spins Explored with Electron-Nuclear Double Resonance at Sub-Kelvin Temperatures

An experimental platform of ultralow-temperature pulsed ENDOR (electron-nuclear double resonance) spectroscopy is constructed for the bulk materials. Coherent property of the coupled electron and nuclear spins of the rare-earth (RE) dopants in a crystal (143Nd3+:Y2SiO5) is investigated from 100 mK to 6 K. At the lowest working temperatures, two-pulse-echo coherence time exceeding 2 ms and 40 ms are achieved for the electron and nuclear spins, while the electronic Zeeman and hyperfine population lifetimes are more than 15 s and 10 min. With the aid of the near-unity electron spin polarization at 100 mK, the complete hyperfine level structure with 16 energy levels is measured using ENDOR technique without the assistance of the reconstructed spin Hamiltonian. These results demonstrate the suitability of the deeply cooled paramagnetic RE-doped solids for memory components aimed for quantum communication and quantum computation. The developed experimental platform is expected to be a powerful tool for paramagnetic materials from various research fields.

preprint2020arXiv

Interpretable Machine Learning Model for Early Prediction of Mortality in Elderly Patients with Multiple Organ Dysfunction Syndrome (MODS): a Multicenter Retrospective Study and Cross Validation

Background: Elderly patients with MODS have high risk of death and poor prognosis. The performance of current scoring systems assessing the severity of MODS and its mortality remains unsatisfactory. This study aims to develop an interpretable and generalizable model for early mortality prediction in elderly patients with MODS. Methods: The MIMIC-III, eICU-CRD and PLAGH-S databases were employed for model generation and evaluation. We used the eXtreme Gradient Boosting model with the SHapley Additive exPlanations method to conduct early and interpretable predictions of patients' hospital outcome. Three types of data source combinations and five typical evaluation indexes were adopted to develop a generalizable model. Findings: The interpretable model, with optimal performance developed by using MIMIC-III and eICU-CRD datasets, was separately validated in MIMIC-III, eICU-CRD and PLAGH-S datasets (no overlapping with training set). The performances of the model in predicting hospital mortality as validated by the three datasets were: AUC of 0.858, sensitivity of 0.834 and specificity of 0.705; AUC of 0.849, sensitivity of 0.763 and specificity of 0.784; and AUC of 0.838, sensitivity of 0.882 and specificity of 0.691, respectively. Comparisons of AUC between this model and baseline models with MIMIC-III dataset validation showed superior performances of this model; In addition, comparisons in AUC between this model and commonly used clinical scores showed significantly better performance of this model. Interpretation: The interpretable machine learning model developed in this study using fused datasets with large sample sizes was robust and generalizable. This model outperformed the baseline models and several clinical scores for early prediction of mortality in elderly ICU patients. The interpretative nature of this model provided clinicians with the ranking of mortality risk features.

preprint2020arXiv

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.

preprint2020arXiv

Measuring the local dark matter density with LAMOST DR5 and Gaia DR2

We apply the vertical Jeans equation to the kinematics of Milky Way stars in the solar neighbourhood to measure the local dark matter density. More than 90,000 G- and K-type dwarf stars are selected from the cross-matched sample of LAMOST DR5 and Gaia DR2 for our analyses. The mass models applied consist of a single exponential stellar disc, a razor thin gas disc and a constant dark matter density. We first consider the simplified vertical Jeans equation which ignores the tilt term and assumes a flat rotation curve. Under a Gaussian prior on the total stellar surface density, the local dark matter density inferred from Markov Chain Monte Carlo simulations is $0.0133_{-0.0022}^{+0.0024}\ {\rm M}_{\odot}\,{\rm pc}^{-3}$. The local dark matter densities for subsamples in an azimuthal angle range of $-10^{\circ} < ϕ< 5^{\circ}$ are consistent within their 1$σ$ errors. However, the northern and southern subsamples show a large discrepancy due to plateaux in the northern and southern vertical velocity dispersion profiles. These plateaux may be the cause of the different estimates of the dark matter density between the north and south. Taking the tilt term into account has little effect on the parameter estimations and does not explain the north and south asymmetry. Taking half of the difference of $σ_{z}$ profiles as unknown systematic errors, we then obtain consistent measurements for the northern and southern subsamples. We discuss the influence of the vertical data range, the scale height of the tracer population, the vertical distribution of stars and the sample size on the uncertainty of the determination of the local dark matter density.

preprint2020arXiv

On the Chemical and Kinematic Consistency Between N-rich Metal-poor Field Stars and Enriched Populations in Globular Clusters

Interesting chemically peculiar field stars may reflect their stellar evolution history and their possible origin in a different environment from where they are found now, which is one of the most important research fields in Galactic archaeology. To explore this further, we have used the CN-CH bands around 4000 A to identify N-rich metal-poor field stars in LAMOST DR3. Here we expand our N-rich metal-poor field star sample to ~100 stars in LAMOST DR5, where 53 of them are newly found in this work. We investigate light elements of the common stars between our sample and APOGEE DR14. While Mg, Al, and Si abundances generally agree with the hypothesis that N-rich metal-poor field stars come from enriched populations in globular clusters, it is still inconclusive for C, N, and O. After integrating the orbits of our N-rich field stars and a control sample of normal metal-poor field stars, we find that N-rich field stars have different orbital parameter distributions compared to the control sample, specifically, apocentric distances, maximum vertical amplitude (Zmax), orbital energy, and z direction angular momentum (Lz). The orbital parameters of N-rich field stars indicate that most of them are inner-halo stars. The kinematics of N-rich field stars support their possible GC origin. The spatial and velocity distributions of our bona fide N-rich field star sample are important observational evidence to constrain simulations of the origin of these interesting objects.

preprint2020arXiv

On-demand quantum storage of photonic qubits in an on-chip waveguide

Photonic quantum memory is the core element in quantum information processing (QIP). For the scalable and convenient practical applications, great efforts have been devoted to the integrated quantum memory based on various waveguides fabricated in solids. However, on-demand storage of qubits, which is an essential requirement for QIP, is still challenging to be implemented using such integrated quantum memory. Here we report the on-demand storage of time-bin qubits in an on-chip waveguide memory on the surface of a $^{151}$Eu$^{3+}$:Y$_2$SiO$_5$ crystal, utilizing the Stark modulated atomic frequency comb protocol. A qubit storage fidelity of $99.3\%\pm0.2\%$ is obtained with a input of 0.5 photons per pulse, far beyond the highest fidelity achievable using the classical measure-and-prepare strategy. The developed integrated quantum memory with the on-demand retrieval capability, represents an important step towards practical applications of integrated quantum nodes in quantum networks.

preprint2020arXiv

Possible evidence of hydrogen emission in the first-overtone and multi-mode RR Lyrae variables

The nature of shock waves in non-fundamental mode RR Lyrae stars remains a mystery because of limited spectroscopic observations. We apply a pattern recognition algorithm on spectroscopic data from SDSS and LAMOST and report the first evidence of hydrogen emission in first-overtone and multi-mode RR Lyrae stars showing the "first apparition", which is the most prominent observational characteristic of shock in RR Lyrae variables. We find ten RRc stars in SDSS, ten RRc stars in LAMOST, and three RRd stars in LAMOST that show blueshifted Balmer emissions. The emission features possibly indicate the existence of shock waves. We calculate the radial velocities of the emission lines, which are related to the physical conditions occurring in the radiative zone of shock waves. Using photometric observations from ZTF, we present a detailed light curve analysis for the frequency components in one of our RRd stars with hydrogen emission, RRdl3, for possible modulations. With the enormous volume of upcoming spectral observations of variable stars, our study raises the possibility of connecting the unexplained Blazhko effect to shock waves in non-fundamental mode RR Lyrae stars.

preprint2020arXiv

Reliable coherent optical memory based on a laser-written waveguide

$\mathrm {^{151}Eu^{3+}}$-doped yttrium silicate ($\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ ) crystal is a unique material that possesses hyperfine states with coherence time up to 6 h. Many efforts have been devoted to the development of this material as optical quantum memories based on the bulk crystals, but integrable structures (such as optical waveguides) that can promote $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$-based quantum memories to practical applications, have not been demonstrated so far. Here we report the fabrication of type 2 waveguides in a $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ crystal using femtosecond-laser micromachining. The resulting waveguides are compatible with single-mode fibers and have the smallest insertion loss of $4.95\ dB$. On-demand light storage is demonstrated in a waveguide by employing the spin-wave atomic frequency comb (AFC) scheme and the revival of silenced echo (ROSE) scheme. We implement a series of interference experiments based on these two schemes to characterize the storage fidelity. Interference visibility of the readout pulse is $0.99\pm 0.03$ for the spin-wave AFC scheme and $0.97\pm 0.02$ for the ROSE scheme, demonstrating the reliability of the integrated optical memory.

preprint2020arXiv

Reverse-engineering Bar Charts Using Neural Networks

Reverse-engineering bar charts extracts textual and numeric information from the visual representations of bar charts to support application scenarios that require the underlying information. In this paper, we propose a neural network-based method for reverse-engineering bar charts. We adopt a neural network-based object detection model to simultaneously localize and classify textual information. This approach improves the efficiency of textual information extraction. We design an encoder-decoder framework that integrates convolutional and recurrent neural networks to extract numeric information. We further introduce an attention mechanism into the framework to achieve high accuracy and robustness. Synthetic and real-world datasets are used to evaluate the effectiveness of the method. To the best of our knowledge, this work takes the lead in constructing a complete neural network-based method of reverse-engineering bar charts.

preprint2020arXiv

The extended Gaia-PS1-SDSS (GPS1+) proper motion catalog

The GPS1 catalog was released in 2017. It delivered precise proper motions for around 350 million sources across three-fourths of the sky down to a magnitude of $r\sim20$\,mag. In this study, we present GPS1+ the extension GPS1 catalog down to $r\sim22.5$\,mag, based on {\it Gaia} DR2, PS1, SDSS and 2MASS astrometry. The GPS1+ totally provides proper motions for $\sim$400 million sources with a characteristic systematic error of less than 0.1\masyr. This catalog is divided into two sub-samples, i.e., the primary and secondary parts. The primary $\sim$264 million sources have either or both of the {\it Gaia} and SDSS astrometry, with a typical precision of 2.0-5.0 \masyr. In this part, $\sim$160 million sources have {\it Gaia} proper motions, we provide another new proper motion for each of them by building a Bayesian model. Relative to {\it Gaia}'s values, the precision is improved by $\sim$0.1\,dex on average at the faint end; $\sim$50 million sources are the objects whose proper motions are missing in {\it Gaia} DR2, we provide their proper motion with a precision of $\sim$4.5\masyr; the remaining $\sim$54 million faint sources are beyond {\it Gaia} detecting capability, we provide their proper motions for the first time with a precision of 7.0 \masyr. However, the secondary $\sim$136 million sources only have PS1 astrometry, the average precision is worse than 15.0 \masyr. All the proper motions have been validated using QSOs and the existing {\it Gaia} proper motions. The catalog will be released on-line and available via the VO-TAP Service, or via the National Astronomical Data Center serviced by China-VO: https://nadc.china-vo.org/data/data/gps1p/f.

preprint2020arXiv

Three New Late-type Hypervelocity Star Candidates from Gaia DR2 with Refined Selection Criteria

Several dozen hypervelocity star (HVS) candidates have been reported based on the second data release of Gaia (Gaia DR2). However, it has been proven that the radial velocities of some Gaia HVS candidates are not reliable. In this paper, we employ refined astrometric criteria to re-examine Gaia DR2, arriving at a more reliable sample of HVS and high velocity star candidates than those found by previous authors.We develop a method called Binary Escape Probability Analysis to identify some HVS candidates. This method allows us to work with stars having only two epochs of measured radial velocity. These stars were usually discarded in previous similar studies. A scrutiny of our final results sheds light on selection effects present in our studies, which we propose to be the focus of future studies. In total, we find three late-type (2 G-type and 1 K-type) HVS and 21 high velocity star candidates, 3 and 11 of which are new, respectively. Judging by their historical trajectories, which we calculate, all three HVS candidates could not have had Galactic center origins. Further monitoring is required to confirm their status.

preprint2020arXiv

Understanding the velocity distribution of the Galactic Bulge with APOGEE and Gaia

We revisit the stellar velocity distribution in the Galactic bulge/bar region with APOGEE DR16 and {\it Gaia} DR2, focusing in particular on the possible high-velocity (HV) peaks and their physical origin. We fit the velocity distributions with two different models, namely with Gauss-Hermite polynomial and Gaussian mixture model (GMM). The result of the fit using Gauss-Hermite polynomials reveals a positive correlation between the mean velocity ($\bar{V}$) and the "skewness" ($h_{3}$) of the velocity distribution, possibly caused by the Galactic bar. The $n=2$ GMM fitting reveals a symmetric longitudinal trend of $|μ_{2}|$ and $σ_{2}$ (the mean velocity and the standard deviation of the secondary component), which is inconsistent to the $x_{2}$ orbital family predictions. Cold secondary peaks could be seen at $|l|\sim6^\circ$. However, with the additional tangential information from {\it Gaia}, we find that the HV stars in the bulge show similar patterns in the radial-tangential velocity distribution ($V_{\rm R}-V_{\rm T}$), regardless of the existence of a distinct cold HV peak. The observed $V_{\rm R}-V_{\rm T}$ (or $V_{\rm GSR}-μ_{l}$) distributions are consistent with the predictions of a simple MW bar model. The chemical abundances and ages inferred from ASPCAP and CANNON suggest that the HV stars in the bulge/bar are generally as old as, if not older than, the other stars in the bulge/bar region.

preprint2019arXiv

Deriving the stellar labels of LAMOST spectra with Stellar LAbel Machine (SLAM)

The LAMOST survey has provided 9 million spectra in its Data Release 5 (DR5) at R$\sim$1800. Extracting precise stellar labels is crucial for such a large sample. In this paper, we report the implementation of the Stellar LAbel Machine (SLAM), which is a data-driven method based on Support Vector Regression (SVR), a robust non-linear regression technique. Thanks to the capability to model highly non-linear problems with SVR, SLAM generally can derive stellar labels over a wide range of spectral types. This gives it a unique capability compared to other popular data-driven methods. To illustrate this capability, we test the performance of SLAM on stars ranging from Teff$\sim$4000 to $\sim$8000 K trained on LAMOST spectra and stellar labels. At g-band signal-to-noise ratio (SNRg) higher than 100, the random uncertainties of Teff, logg and [Fe/H] are 50 K, 0.09 dex, and 0.07 dex, respectively. We then set up another SLAM model trained by APOGEE and LAMOST common stars to demonstrate its capability of dealing with high dimensional problems. The spectra are from LAMOST DR5 and the stellar labels of the training set are from APOGEE DR15, including Teff, logg, [M/H],[$α$/M], [C/M], and [N/M]. The cross-validated scatters at SNRg$\sim$100 are 49 K, 0.10 dex, 0.037 dex,0.026 dex, 0.058 dex, and 0.106 dex for these parameters, respectively. This performance is at the same level as other up-to-date data-driven models. As a byproduct, we also provide the latest catalog of $\sim$1 million LAMOST DR5 K giant stars with SLAM-predicted stellar labels in this work.

preprint2019arXiv

Exploring the spectral \textit{information content} in the LAMOST medium-resolution survey (MRS)

Low-resolution spectra are proved competitive to high-resolution spectra in determining many stellar labels at comparable precision. It is useful to consider the spectral information content when assessing the capability of a stellar spectrum in deriving precise stellar labels. In this work, we quantify the information content brought by the LAMOST-II medium-resolution spectroscopic survey (MRS) using the gradient spectra and the coefficients-of-dependence (CODs). In general, the wavelength coverage of the MRS well constrains the stellar labels but the sensitivities of different stellar labels vary with spectral types and metallicity of the stars of interest and, therefore, affect the performance of the stellar label determination from the MRS spectra. Applying the SLAM to the synthetic spectra which mimic the MRS data, we find the precision of the fundamental stellar parameters Teff, logg and [M/H] are better when combining both the blue and red bands of the MRS. This is especially important for warm stars since the H$α$ line located in the red part plays a more important role in determining the effective temperature for warm stars. With blue and red parts together, we are able to reach similar performance to the low-resolution spectra except for warm stars. However, at [M/H]$\sim-2.0$ dex, the uncertainties of fundamental stellar labels estimated from MRS are substantially larger than those from low-resolution spectra. We also tested the uncertainties of Teff, logg and [M/H] of from MRS data induced from the radial velocity mismatch and find that a mismatch of about 1 km s$^{-1}$, which is typical for LAMOST MRS data, would not significantly affect the stellar label estimates. At last, reference precision limits are calculated using synthetic gradient spectra, according to which we expect abundances of at least 17 elements to be measured precisely from MRS spectra.

preprint2019arXiv

On half-factoriality of transfer Krull monoids

Let $H$ be a transfer Krull monoid over a subset $G_0$ of an abelian group $G$ with finite exponent. Then every non-unit $a\in H$ can be written as a finite product of atoms, say $a=u_1 \cdot \ldots \cdot u_k$. The set $\mathsf L(a)$ of all possible factorization lengths $k$ is called the set of lengths of $a$, and $H$ is said to be half-factorial if $|\mathsf L(a)|=1$ for all $a\in H$. We show that, if $a \in H$ and $|\mathsf L(a^{\lfloor (3\exp(G) - 3)/2 \rfloor})| = 1$, then the smallest divisor-closed submonoid of $H$ containing $a$ is half-factorial. In addition, we prove that, if $G_0$ is finite and $|\mathsf L(\prod_{g\in G_0}g^{2\mathsf{ord}(g)})|=1$, then $H$ is half-factorial.

preprint2019arXiv

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.

preprint2016arXiv

A catalogue of early-type emission-line stars and Hα line profiles from LAMOST DR2

We present a catalogue including 11,204 spectra for 10,436 early-type emission-line stars from LAMOST DR2, among which 9,752 early-type emission-line spectra are newly discovered. For these early-type emission-line stars, we discuss the morphological and physical properties from their low-resolution spectra. In this spectral sample, the H$α$ emission profiles display a wide variety of shapes. Based on the H$α$ line profiles, these spectra are categorized into five distinct classes: single-peak emission, single-peak emission in absorption, double-peak emission, double-peak emission in absorption, and P-Cygni profiles. To better understand what causes the H$α$ line profiles, we divide these objects into four types from the view of physical classification, which include classical Be stars, Herbig Ae/Be stars, close binaries and spectra contaminated by HII regions. The majority of Herbig Ae/Be stars and classical Be stars are identified and separated using the (H-K, K-W1) color-color diagram. We also discuss thirty one binary systems as listed in SIMBAD on-line catalogue and identify 3,600 spectra contaminated by HII regions after cross matching with positions in the Dubout-Crillon catalogue. A statistical analysis of line profiles versus classifications is then conducted in order to understand the distribution of H$α$ profiles for each type in our sample. Finally, we also provide a table of 172 spectra with FeII emission lines and roughly calculate stellar wind velocities for seven spectra with P-Cygni profiles.

preprint2016arXiv

An unsupervised spatiotemporal graphical modeling approach to anomaly detection in distributed CPS

Modern distributed cyber-physical systems (CPSs) encounter a large variety of physical faults and cyber anomalies and in many cases, they are vulnerable to catastrophic fault propagation scenarios due to strong connectivity among the sub-systems. This paper presents a new data-driven framework for system-wide anomaly detection for addressing such issues. The framework is based on a spatiotemporal feature extraction scheme built on the concept of symbolic dynamics for discovering and representing causal interactions among the subsystems of a CPS. The extracted spatiotemporal features are then used to learn system-wide patterns via a Restricted Boltzmann Machine (RBM). The results show that: (1) the RBM free energy in the off-nominal conditions is different from that in the nominal conditions and can be used for anomaly detection; (2) the framework can capture multiple nominal modes with one graphical model; (3) the case studies with simulated data and an integrated building system validate the proposed approach.

preprint2016arXiv

Calibration of LAMOST Stellar Surface Gravities Using the Kepler Asteroseismic Data

Asteroseismology is a powerful tool to precisely determine the evolutionary status and fundamental properties of stars. With the unprecedented precision and nearly continuous photometric data acquired by the NASA Kepler mission, parameters of more than 10$^4$ stars have been determined nearly consistently. However, most studies still use photometric effective temperatures (Teff) and metallicities ([Fe/H]) as inputs, which are not sufficiently accurate as suggested by previous studies. We adopted the spectroscopic Teff and [Fe/H] values based on the LAMOST low-resolution spectra (R~1,800), and combined them with the global oscillation parameters to derive the physical parameters of a large sample of stars. Clear trends were found between Δlogg(LAMOST - seismic) and spectroscopic Teff as well as logg, which may result in an overestimation of up to 0.5 dex for the logg of giants in the LAMOST catalog. We established empirical calibration relations for the logg values of dwarfs and giants. These results can be used for determining the precise distances to these stars based on their spectroscopic parameters.

preprint2016arXiv

Carbon stars from LAMOST DR2 data

In this work, we present the new catalog of carbon stars from the LAMOST DR2 catalog. In total, 894 carbon stars are identified from multiple line indices measured from the stellar spectra. Combining the CN bands in the red end with \ctwo\ and other lines, we are able to identify the carbon stars. Moreover, we also classify the carbon stars into spectral sub-types of \ch, \CR, and \cn. These sub-types approximately show distinct features in the multi-dimensional line indices, implying that in the future we can use them to identify carbon stars from larger spectroscopic datasets. Meanwhile, from the line indices space, while the \cn\ stars are clearly separated from the others, we find no clear separation between \CR\ and \ch\ sub-types. The \CR\ and \ch\ stars seem to smoothly transition from one to another. This may hint that the \CR\ and \ch\ stars may not be different in their origins but look different in their spectra because of different metallicity. Due to the relatively low spectral resolution and lower signal-to-noise ratio, the ratio of $^{12}$C/$^{13}$C is not measured and thus the \cj\ stars are not identified.

preprint2016arXiv

Characterizing the SHARDS of Disrupted Milky Way Satellites with LAMOST

We derive the fraction of substructure in the Galactic halo using a sample of over 10,000 spectroscopically-confirmed halo giant stars from the LAMOST spectroscopic survey. By observing 100 synthetic models along each line of sight with the LAMOST selection function in that sky area, we statistically characterize the expected halo populations. We define as SHARDS (Stellar Halo Accretion Related Debris Structures) any stars in >3-sigma excesses above the model predictions. We find that at least 10% of the Milky Way halo stars from LAMOST are part of SHARDS. By running our algorithm on smooth halos observed with the LAMOST selection function, we show that the LAMOST data contain excess substructure over all Galactocentric radii R_GC < 40 kpc, beyond what is expected due to statistical fluctuations and incomplete sampling of a smooth halo. The level of substructure is consistent with the fraction of stars in SHARDS in model halos created entirely from accreted satellites. This work illustrates the potential of vast spectroscopic surveys with high filling factors over large sky areas to recreate the merging history of the Milky Way.

preprint2016arXiv

Hot Subdwarf Stars Observed in LAMOST DR1 - Atmospheric parameters from single-lined spectra

We present a catalog of 166 spectroscopically identified hot subdwarf stars from LAMOST DR1, 44 of which show the characteristics of cool companions in their optical spectra. Atmospheric parameters of 122 non-composite spectra subdwarf stars were measured by fitting the profiles of hydrogen (H) and helium (He) lines with synthetic spectra from non-LTE model atmospheres. Most of the sdB stars scatter near the Extreme Horizontal Branch in the $T_{\rm eff}-\log{g}$ diagram and two well defined groups can be outlined. A clustering of He-enriched sdO stars appears near $T_{\rm eff}=45\,000$ K and $\log(g) = 5.8$. The sdB population separates into several nearly parallel sequences in the $T_{\rm eff}-{\rm He}$ abundance diagram with clumps corresponding to those in the $T_{\rm eff}-\log{g}$ diagram. Over $38\,000$ K (sdO) stars show abundance extremes, they are either He-rich or He-deficient and we observe only a few stars in the $ -1 < \log(y) < 0$ abundance range. With increasing temperature these extremes become less prominent and the He abundance approaches to $\log(y)\sim-0.5$. A unique property of our sample is that it covers a large range in apparent magnitudes and galactic latitudes, therefore it contains a mix of stars from different populations and galactic environments. Our results are consistent with the findings of Hirsch (2009) and we conclude that He-rich and He-deficient sdB stars ($\log(y) < 1$) probably origin from different populations. We also find that most sdO and sdB stars lie in a narrow strip in the luminosity and helium abundance plane, which suggests that these atmospheric parameters are correlated.

preprint2016arXiv

New tidal debris nearby the Sagittarius leading tail from the LAMOST DR2 M giant stars

We report two new tidal debris nearby the Sagittarius (Sgr) tidal stream in the north Galactic cap identified from the M giant stars in LAMOST DR2 data. The M giant stars with sky area of $210^\circ<$Λ$<290^\circ$, distance of 10--20kpc, and [Fe/H]$<-0.75$ show clear bimodality in velocity distribution. We denote the two peaks as Vel-3+83 for the one within mean velocity of -3kms$^{-1}$ with respect to that of the well observed Sgr leading tail at the same $Λ$ and Vel+162+26 for the other one with mean velocity of 162kms$^{-1}$ with respect to the Sgr leading tail. Although the projected $Λ$--$V_{gar}$ relation of Vel-3+83 is very similar to the Sgr leading tail, the opposite trend in $Λ$--distance relation against the Sgr leading tail suggests Vel-3+83 has a different 3D direction of motion with any branch of the simulated Sgr tidal stream from Law & Majewski. Therefore, we propose it to be a new tidal debris not related to the Sgr stream. Similarly, the another substructure Vel+162+26, which is the same one as the NGC group discovered by Chou et al., also moves toward a different direction with the Sgr stream, implying that it may have different origin with the Sgr tidal stream.

preprint2016arXiv

NLTE Analysis of High Resolution H-band Spectra. I. Neutral Silicon

We investigated the reliability of our silicon atomic model and the influence of non-local thermodynamical equilibrium (NLTE) on the formation of neutral silicon (Si I) lines in the near-infrared (near-IR) H-band. We derived the differential Si abundances for 13 sample stars with high-resolution H-band spectra from the Apache Point Observatory Galactic Evolution Experiment (APOGEE), as well as from optical spectra, both under local thermodynamical equilibrium (LTE) and NLTE conditions. We found that the differences between the Si abundances derived from the H-band and from optical lines for the same stars are less than 0.1 dex when the NLTE effects included, and that NLTE reduces the line-to-line scatter in the H-band spectra for most sample stars. These results suggest that our Si atomic model is appropriate for studying the formation of H-band Si lines. Our calculations show that the NLTE corrections of the Si I H-band lines are negative, i.e. the final Si abundances will be overestimated in LTE. The corrections for strong lines depend on surface gravity, and tend to be larger for giants, reaching ~ -0.2 dex in our sample, and up to ~ -0.4 dex in extreme cases of APOGEE targets. Thus, the NLTE effects should be included in deriving silicon abundances from H-band Si I lines, especially for the cases where only strong lines are available.

preprint2016arXiv

Selecting M-giants with infra-red photometry: Distances, metallicities and the Sagittarius stream

Using a spectroscopically confirmed sample of M-giants, M-dwarfs and quasars from the LAMOST survey, we assess how well WISE $\&$ 2MASS color-cuts can be used to select M-giant stars. The WISE bands are very efficient at separating M-giants from M-dwarfs and we present a simple classification that can produce a clean and relatively complete sample of M-giants. We derive a new photometric relation to estimate the metallicity for M-giants, calibrated using data from the APOGEE survey. We find a strong correlation between the $(W1-W2)$ color and $\rm [M/H]$, where almost all of the scatter is due to photometric uncertainties. We show that previous photometric distance relations, which are mostly based on stellar models, may be biased and devise a new empirical distance relation, investigating trends with metallicity and star formation history. Given these relations, we investigate the properties of M-giants in the Sagittarius stream. The offset in the orbital plane between the leading and trailing tails is reproduced and, by identifying distant M-giants in the direction of the Galactic anti-center, we confirm that the previously detected debris in the outer halo is the apocenter of the trailing tail. We also find tentative evidence supporting an existing overdensity near the leading tail in the Northern Galactic hemisphere, possibly an extension to the trailing tail (so-called Branch C). We have measured the metallicity distribution along the stream, finding a clear metallicity offset between the leading and trailing tails, in agreement with models for the stream formation. We include an online table of M-giants to facilitate further studies.

preprint2015arXiv

A theoretical foundation of the target-decoy search strategy for false discovery rate control in proteomics

Motivation: Target-decoy search (TDS) is currently the most popular strategy for estimating and controlling the false discovery rate (FDR) of peptide identifications in mass spectrometry-based shotgun proteomics. While this strategy is very useful in practice and has been intensively studied empirically, its theoretical foundation has not yet been well established. Result: In this work, we systematically analyze the TDS strategy in a rigorous statistical sense. We prove that the commonly used concatenated TDS provides a conservative estimate of the FDR for any given score threshold, but it cannot rigorously control the FDR. We prove that with a slight modification to the commonly used formula for FDR estimation, the peptide-level FDR can be rigorously controlled based on the concatenated TDS. We show that the spectrum-level FDR control is difficult. We verify the theoretical conclusions with real mass spectrometry data.

preprint2015arXiv

Asteroseismic based estimation of the surface gravity for the LAMOST giant stars

Asteroseismology is one of the most accurate approaches to estimate the surface gravity of a star. However, most of the data from the current spectroscopic surveys do not have asteroseismic measurements, which is very expensive and time consuming. In order to improve the spectroscopic surface gravity estimates for a large amount of survey data with the help of the small subset of the data with seismic measurements, we set up a support vector regression model for the estimation of the surface gravity supervised by 1,374 LAMOST giant stars with Kepler seismic surface gravity. The new approach can reduce the uncertainty of the estimates down to about 0.1 dex, which is better than the LAMOST pipeline by at least a factor of 2, for the spectra with signal-to-noise ratio higher than 20. Compared with the logg estimated from the LAMOST pipeline, the revised logg values provide a significantly improved match to the expected distribution of red clump and RGB stars from stellar isochrones. Moreover, even the red bump stars, which extend to only about 0.1 dex in logg, can be discriminated from the new estimated surface gravity. The method is then applied to about 350,000 LAMOST metal-rich giant stars to provide improved surface gravity estimates. In general, the uncertainty of the distance estimate based on the SVR surface gravity can be reduced to about 12% for the LAMOST data.

preprint2015arXiv

Determining the local dark matter density with LAMOST data

Measurement of the local dark matter density plays an important role in both Galactic dynamics and dark matter direct detection experiments. However, the estimated values from previous works are far from agreeing with each other. In this work, we provide a well-defined observed sample with 1427 G \& K type main-sequence stars from the LAMOST spectroscopic survey, taking into account selection effects, volume completeness, and the stellar populations. We apply a vertical Jeans equation method containing a single exponential stellar disk, a razor thin gas disk, and a constant dark matter density distribution to the sample, and obtain a total surface mass density of $\rm {78.7 ^{+3.9}_{-4.7}\ M_{\odot}\ pc^{-2}}$ up to 1 kpc and a local dark matter density of $0.0159^{+0.0047}_{-0.0057}\,\rm M_{\odot}\,\rm pc^{-3}$. We find that the sampling density (i.e. number of stars per unit volume) of the spectroscopic data contributes to about two-thirds of the uncertainty in the estimated values. We discuss the effect of the tilt term in the Jeans equation and find it has little impact on our measurement. Other issues, such as a non-equilibrium component due to perturbations and contamination by the thick disk population, are also discussed.

preprint2015arXiv

Estimation of distances to stars with stellar parameters from LAMOST

We present a method to estimate distances to stars with spectroscopically derived stellar parameters. The technique is a Bayesian approach with likelihood estimated via comparison of measured parameters to a grid of stellar isochrones, and returns a posterior probability density function for each star's absolute magnitude. This technique is tailored specifically to data from the Large Sky Area Multi-object Fiber Spectroscopic Telescope (LAMOST) survey. Because LAMOST obtains roughly 3000 stellar spectra simultaneously within each ~5-degree diameter "plate" that is observed, we can use the stellar parameters of the observed stars to account for the stellar luminosity function and target selection effects. This removes biasing assumptions about the underlying populations, both due to predictions of the luminosity function from stellar evolution modeling, and from Galactic models of stellar populations along each line of sight. Using calibration data of stars with known distances and stellar parameters, we show that our method recovers distances for most stars within ~20%, but with some systematic overestimation of distances to halo giants. We apply our code to the LAMOST database, and show that the current precision of LAMOST stellar parameters permits measurements of distances with ~40% error bars. This precision should improve as the LAMOST data pipelines continue to be refined.

preprint2015arXiv

Kinematics of the X-shaped Milky Way Bulge: Expectations from a Self-consistent N-body Model

We explore the kinematics (both the radial velocity and the proper motion) of the vertical X-shaped feature in the Milky Way with an N-body bar/bulge model. From the solar perspective, the distance distribution of particles is double-peaked in fields passing through the X-shape. The separation and amplitude ratio between the two peaks qualitatively match the observed trends towards the Galactic bulge. We confirm clear signatures of cylindrical rotation in the pattern of mean radial velocity across the bar/bulge region. We also find possible imprints of coherent orbital motion inside the bar structure in the radial velocity distribution along l=0 degree, where the near and far sides of the bar/bulge show excesses of approaching and receding particles. The coherent orbital motion is also reflected in the slight displacement of the zero-velocity-line in the mean radial velocity, and the displacement of the maximum/minimum in the mean longitudinal proper motion across the bulge region. We find some degree of anisotropy in the stellar velocity within the X-shape, but the underlying orbital family of the X-shape cannot be clearly distinguished. Two potential applications of the X-shape in previous literature are tested, i.e., bulge rotation and Galactic center measurements. We find that the proper motion difference between the two sides of the X-shape can be used to estimate the mean azimuthal streaming motion of the bulge, but not the pattern speed of the bar. We also demonstrate that the Galactic center can be located with the X-shape, but the accuracy depends on the fitting scheme, the number of fields, and their latitudinal coverage.

preprint2015arXiv

Member candidates of the star clusters from LAMOST DR2 data

In this work, we provide 2189 photometric- and kinematic-selected member candidates of 24 star clusters from the LAMOST DR2 catalog. We perform two-step membership identification: selection along the stellar track in the color-magnitude diagram, i.e., photometric identification, and the selection from the distribution of radial velocities, i.e. the kinematic identification. We find that the radial velocity from the LAMOST data are very helpful in the membership identification. The mean probability of membership is 40\% for the radial velocity selected sample. With these 24 star clusters, we investigate the performance of the radial velocity and metallicity estimated in the LAMOST pipeline. We find that the systematic offset in radial velocity and metallicity are $0.85\pm1.26$\,\kms\ and $-0.08\pm0.04$\,dex, with dispersions of $5.47_{-0.71}^{+1.16}$\,\kms\ and $0.13_{-0.02}^{+0.04}$\,dex, respectively. Finally, we propose that the photometric member candidates of the clusters covered by the LAMOST footprints should be assigned higher priority so that more member stars can be observed.

preprint2015arXiv

Rings and Radial Waves in the Disk of the Milky Way

We show that in the anticenter region, between Galactic longitudes of $110^\circ<l<229^\circ$, there is an oscillating asymmetry in the main sequence star counts on either side of the Galactic plane using data from the Sloan Digital Sky Survey. This asymmetry oscillates from more stars in the north at distances of about 2 kpc from the Sun to more stars in the south at 4-6 kpc from the Sun to more stars in the north at distances of 8-10 kpc from the Sun. We also see evidence that there are more stars in the south at distances of 12-16 kpc from the Sun. The three more distant asymmetries form roughly concentric rings around the Galactic center, opening in the direction of the Milky Way's spiral arms. The northern ring, 9 kpc from the Sun, is easily identified with the previously discovered Monoceros Ring. Parts of the southern ring at 14 kpc from the Sun (which we call the TriAnd Ring) have previously been identified as related to the Monoceros Ring and others have been called the Triangulum Andromeda Overdensity. The two nearer oscillations are approximated by a toy model in which the disk plane is offset by of the order 100 pc up and then down at different radii. We also show that the disk is not azimuthally symmetric around the Galactic anticenter and that there could be a correspondence between our observed oscillations and the spiral structure of the Galaxy. Our observations suggest that the TriAnd and Monoceros Rings (which extend to at least 25 kpc from the Galactic center) are primarily the result of disk oscillations.

preprint2015arXiv

Spectral classification of stars based on LAMOST spectra

In this work, we select the high signal-to-noise ratio spectra of stars from the LAMOST data andmap theirMK classes to the spectral features. The equivalentwidths of the prominent spectral lines, playing the similar role as the multi-color photometry, form a clean stellar locus well ordered by MK classes. The advantage of the stellar locus in line indices is that it gives a natural and continuous classification of stars consistent with either the broadly used MK classes or the stellar astrophysical parameters. We also employ a SVM-based classification algorithm to assignMK classes to the LAMOST stellar spectra. We find that the completenesses of the classification are up to 90% for A and G type stars, while it is down to about 50% for OB and K type stars. About 40% of the OB and K type stars are mis-classified as A and G type stars, respectively. This is likely owe to the difference of the spectral features between the late B type and early A type stars or between the late G and early K type stars are very weak. The relative poor performance of the automatic MK classification with SVM suggests that the directly use of the line indices to classify stars is likely a more preferable choice.

preprint2015arXiv

The K giant stars from the LAMOST survey data II: the Hercules stream in radial migration

We estimate the age for the individual stars located at the lower part of the red giant branch from the LAMOST DR2 K giant sample. Taking into account the selection effects and the volume completeness, the age--metallicity map for the stars located between 0.3 and 1.5 kpc from the Sun is obtained. A significant substructure (denoted as the \it{narrow stripe}) located from (age, [Fe/H])$\sim$(5, 0.4) to (10 Gyr, -0.4 dex) in the age--metallicity map is clearly identified. Moreover, the \it{narrow stripe} stars are found the dominate contributors to several velocity substructures, including the well-known Hercules stream. The substantially large difference between the observed guiding-center radii and the birth radii inferred from the age--metallicity relation is evident that the \it{narrow stripe} stars have been radially migrated from about R$\sim4$ kpc to the solar neighborhood. This implies that the Hercules stream may not be owe to the resonance associated with the bar, but may be the kinematic imprint of the inner disk and later moved out due to radial migration. We estimate that the traveling speed of the radial migration are roughly 1.1$\pm0.1$ kpc Gyr$^{-1}$, equivalent with about $1.1\pm0.1$ km s$^{-1}$. This is in agreement with the median $v_R$ of $2.6^{+1.8}_{-1.9}$ km s$^{-1}$ of the \it{narrow stripe}. We also obtain that about one third stars in the solar neighborhood are radially migrated from around 4 kpc. Finally, we find that the radial migration does not lead to additional disk thickening according to the distribution of $z_{max}$.

preprint2014arXiv

A Bayesian Method for the Extinction

We propose a Bayesian method to measure the total Galactic extinction parameters, $R_V$ and $A_V$. Validation tests based on the simulated data indicate that the method can achieve the accuracy of around 0.01\,mag. We apply this method to the SDSS BHB stars in the northern Galactic cap and find that the derived extinctions are highly consistent with those from \cite{SFD98}. It suggests that the Bayesian method is promising for the extinction estimation, even the reddening values are close to the observational errors.

preprint2014arXiv

Comment on "Fault-Tolerate Quantum Private Comparison Based on GHZ States and ECC"

A two-party quantum private comparison scheme using GHZ states and error-correcting code (ECC) was introduced in Li et al.'s paper [Int. J. Theor. Phys. 52: 2818-2815, 2013], which holds the capability of fault-tolerate and could be performed in a none-ideal scenario. However, this study points out there exists a fatal loophole under a special attack, namely the twice-Hadamard-CNOT attack. A malicious party may intercept the other's particles, firstly executes the Hadamard operations on these intercepted particles and his (her) own ones respectively, and then sequentially performs twice CNOT operations on them and the auxiliary particles prepared in advance. As a result, the secret input will be revealed without being detected through measuring the auxiliary particles. For resisting this special attack, an improvement is proposed by applying a permutation operator before TP sends the particle sequences to all the participants.

preprint2014arXiv

Complex-based analysis of dysregulated cellular processes in cancer

Background: Differential expression analysis of (individual) genes is often used to study their roles in diseases. However, diseases such as cancer are a result of the combined effect of multiple genes. Gene products such as proteins seldom act in isolation, but instead constitute stable multi-protein complexes performing dedicated functions. Therefore, complexes aggregate the effect of individual genes (proteins) and can be used to gain a better understanding of cancer mechanisms. Here, we observe that complexes show considerable changes in their expression, in turn directed by the concerted action of transcription factors (TFs), across cancer conditions. We seek to gain novel insights into cancer mechanisms through a systematic analysis of complexes and their transcriptional regulation. Results: We integrated large-scale protein-interaction (PPI) and gene-expression datasets to identify complexes that exhibit significant changes in their expression across different conditions in cancer. We devised a log-linear model to relate these changes to the differential regulation of complexes by TFs. The application of our model on two case studies involving pancreatic and familial breast tumour conditions revealed: (i) complexes in core cellular processes, especially those responsible for maintaining genome stability and cell proliferation (e.g. DNA damage repair and cell cycle) show considerable changes in expression; (ii) these changes include decrease and countering increase for different sets of complexes indicative of compensatory mechanisms coming into play in tumours; and (iii) TFs work in cooperative and counteractive ways to regulate these mechanisms. Such aberrant complexes and their regulating TFs play vital roles in the initiation and progression of cancer.

preprint2014arXiv

Exploring the total Galactic extinction with SDSS BHB stars

Aims: We used 12,530 photometrically-selected blue horizontal branch (BHB) stars from the Sloan Digital Sky Survey (SDSS) to estimate the total extinction of the Milky Way at the high Galactic latitudes, $R_V$ and $A_V$ in each line of sight. Methods: A Bayesian method was developed to estimate the reddening values in the given lines of sight. Based on the most likely values of reddening in multiple colors, we were able to derive the values of $R_V$ and $A_V$. Results: We selected 94 zero-reddened BHB stars from seven globular clusters as the template. The reddening in the four SDSS colors for the northern Galactic cap were estimated by comparing the field BHB stars with the template stars. The accuracy of this estimation is around 0.01\,mag for most lines of sight. We also obtained $<R_V>$ to be around 2.40$\pm1.05$ and $A_V$ map within an uncertainty of 0.1\,mag. The results, including reddening values in the four SDSS colors, $A_V$, and $R_V$ in each line of sight, are released on line. In this work, we employ an up-to-date parallel technique on GPU card to overcome time-consuming computations. We plan to release online the C++ CUDA code used for this analysis. Conclusions: The extinction map derived from BHB stars is highly consistent with that from Schlegel, Finkbeiner & Davis(1998). The derived $R_V$ is around 2.40$\pm1.05$. The contamination probably makes the $R_V$ be larger.

preprint2014arXiv

Fixing the Reference Frame for PPMXL Proper Motions Using Extragalactic Sources

We quantify and correct systematic errors in PPMXL proper motions using extragalactic sources from the first two LAMOST data releases and the Veron-Cetty & Veron Catalog of Quasars. Although the majority of the sources are from the Veron catalog, LAMOST makes important contributions in regions that are not well-sampled by previous catalogs, particularly at low Galactic latitudes and in the south Galactic cap. We show that quasars in PPMXL have measureable and significant proper motions, which reflect the systematic zero-point offsets present in the catalog. We confirm the global proper motion shifts seen by Wu, Ma, & Zhou (2011), and additionally find smaller-scale fluctuations of the QSO-derived corrections to an absolute frame. We average the proper motions of 158,106 extragalactic objects in bins of 3x3 degrees and present a table of proper motion corrections.

preprint2014arXiv

Pulse-Doppler Signal Processing with Quadrature Compressive Sampling

Quadrature compressive sampling (QuadCS) is a newly introduced sub-Nyquist sampling for acquiring inphase and quadrature (I/Q) components of radio-frequency signals. For applications to pulse-Doppler radars, the QuadCS outputs can be arranged in 2-dimensional data similar to that by Nyquist sampling. This paper develops a compressive sampling pulse-Doppler (CoSaPD) processing scheme from the sub-Nyquist samples. The CoSaPD scheme follows Doppler estimation/detection and range estimation and is conducted on the sub-Nyquist samples without recovering the Nyquist samples. The Doppler estimation is realized through spectrum analyzer as in classic processing. The detection is done on the Doppler bin data. The range estimation is performed through sparse recovery algorithms on the detected targets and thus the computational load is reduced. The detection threshold can be set at a low value for improving detection probability and then the introduced false targets are removed in the range estimation stage through inherent detection characteristic in the recovery algorithms. Simulation results confirm our findings. The CoSaPD scheme with the data at one eighth the Nyquist rate and for SNR above -25dB can achieve performance of the classic processing with Nyquist samples.

preprint2014arXiv

The binarity of Milky Way F,G,K stars as a function of effective temperature and metallicity

We estimate the fraction of F,G,K stars with close binary companions by analysing multi-epoch stellar spectra from SDSS and LAMOST for radial velocity (RV) variations. We employ a Bayesian method to infer the maximum likelihood of the fraction of binary stars with orbital periods of 1000 days or shorter, assuming a simple model distribution for a binary population with circular orbits. The overall inferred fraction of stars with such a close binary companion is 43.0% \pm 2.0% for a sample of F, G, K stars from SDSS SEGUE, and 30% \pm 8.0% in a similar sample from LAMOST. The apparent close binary fraction decreases with the stellar effective temperature. We divide the SEGUE and LEGUE data into three subsamples with different metallicity ([Fe/H] < -1.1; -1.1 < [Fe/H] < -0.6; -0.6 < [Fe/H]), for which the inferred close binary fractions are 56% \pm 5.0%, 56.0% \pm 3%, and 30% \pm 5.7%. The metal-rich stars from our sample are therefore substantially less likely to possess a close binary companion than otherwise similar stars drawn from metal-poor populations. The different ages and formation environments of the Milky Way's thin disk, thick disk and halo may contribute to explaining these observations. Alternatively metallicity may have a significant effect on the formation and/or evolution of binary stars.

preprint2014arXiv

The First Hypervelocity Star from the LAMOST Survey

We report the first hypervelocity star (HVS) discovered from the LAMOST spectroscopic survey. It is a B-type star with a heliocentric radial velocity about 620 km/s, which projects to a Galactocentric radial velocity component of ~477 km/s. With a heliocentric distance of ~13 kpc and an apparent magnitude of ~13 mag, it is the nearest bright HVS currently known. With a mass of ~9Msun, it is one of the three most massive HVSs discovered so far. The star is clustered on the sky with many other known HVSs, with the position suggesting a possible connection to Galactic center structures. With the current poorly-determined proper motion, a Galactic center origin of this HVS remains consistent with the data at the 1sigma level, while a disk run-away origin cannot be excluded. We discuss the potential of the LAMOST survey to discover a large statistical sample of HVSs of different types.

preprint2014arXiv

The K giant stars from the LAMOST survey data I: identification, metallicity, and distance

We present a support vector machine classifier to identify the K giant stars from the LAMOST survey directly using their spectral line features. The completeness of the identification is about 75% for tests based on LAMOST stellar parameters. The contamination in the identified K giant sample is lower than 2.5%. Applying the classification method to about 2 million LAMOST spectra observed during the pilot survey and the first year survey, we select 298,036 K giant candidates. The metallicities of the sample are also estimated with uncertainty of $0.13\sim0.29$\,dex based on the equivalent widths of Mg$_{\rm b}$ and iron lines. A Bayesian method is then developed to estimate the posterior probability of the distance for the K giant stars, based on the estimated metallicity and 2MASS photometry. The synthetic isochrone-based distance estimates have been calibrated using 7 globular clusters with a wide range of metallicities. The uncertainty of the estimated distance modulus at $K=11$\,mag, which is the median brightness of the K giant sample, is about 0.6\,mag, corresponding to $\sim30$% in distance. As a scientific verification case, the trailing arm of the Sagittarius stream is clearly identified with the selected K giant sample. Moreover, at about 80\,kpc from the Sun, we use our K giant stars to confirm a detection of stream members near the apo-center of the trailing tail. These rediscoveries of the features of the Sagittarius stream illustrate the potential of the LAMOST survey for detecting substructures in the halo of the Milky Way.

preprint2014arXiv

The Nearest High-Velocity Stars Revealed by LAMOST Data Release 1

We report the discovery of 28 candidate high-velocity stars (HVSs) at heliocentric distances of less than 3 kpc, based on the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Data Release 1. Our sample of HVS candidates covers a much broader color range than the equivalent ranges discussed in previous studies and comprises the first and largest sample of HVSs in the solar neighborhood. The sufficiently accurate observed and derived parameters for all candidates allow us to ascertain their nature as genuine HVSs, while a subset of 12 objects represents the most promising candidates. Our results also highlight the great potential of discovering statistically large numbers of HVSs of different spectral types in LAMOST survey data. This will ultimately enable us to achieve a better understanding of the nature of Galactic HVSs and their ejection mechanisms, and to constrain the structure of the Galaxy.

preprint2014arXiv

The velocity distribution in the solar neighbourhood from the LAMOST pilot survey

We use about 15,000 F/G nearby dwarf stars selected from the LAMOST pilot survey to map the U-V velocity distribution in the solar neighbourhood. An extreme deconvolution algorithm is applied to reconstruct an empirical multi-Gaussian model. In addition to the well known substructures, e.g., Sirius, Coma Berenices, Hyades-Pleiades over-densities, several new substructures are unveiled. A ripple-like structure from (U, V) = (-120, -5) to (103, -32)km/s is clearly seen in the U-V distribution. This structure seems associated with resonance induced by the Galactic bar, since it is extended in U while having a small dispersion in V at the same time. A ridge structure between (U, V) = (-60, 40) and (-15, 15) km/s is also found. Although similar substructures have been seen in the Hipparcos data, their origin is still unclear. Another compact over-density is seen at (U, V) = (-102, -24). With this large data sample, we find that the substructure located at V~70 km/s and the Arcturus group are essentially parallel in V, which may indicate that they originate from an unrelaxed disk component perturbed by the rotating bar.

preprint2013arXiv

DA white dwarfs observed in LAMOST pilot survey

A total of $\sim640,000$ objects from LAMOST pilot survey have been publicly released. In this work, we present a catalog of DA white dwarfs from the entire pilot survey. We outline a new algorithm for the selection of white dwarfs by fitting Sérsic profiles to the Balmer H$β$, H$γ$ and H$δ$ lines of the spectra, and calculating the equivalent width of the CaII K line. 2964 candidates are selected by constraining the fitting parameters and the equivalent width of CaII K line. All the spectra of candidates are visually inspected. We identify 230 (59 of them are already in Villanova and SDSS WD catalog) DA white dwarfs, 20 of which are DA white dwarfs with non-degenerate companions. In addition, 128 candidates are classified as DA white dwarf/subdwarfs, which means the classifications are ambiguous. The result is consistent with the expected DA white dwarf number estimated based on the LEGUE target selection algorithm.

preprint2013arXiv

Radiation driven outflow in active galactic nuclei: the feedback effects of scattered and reprocessed photons

We perform time-dependent, 2DHD numerical simulations to study the dynamics of a slowly rotating accretion flow from sub-pc to pc scales under the irradiation from the central AGN. Compared to previous work, we improve the calculation of the radiative force due to X-rays. More importantly, in addition to radiative pressure and radiative heating/cooling directly from the central AGN, in the momentum equation we also include the force due to the scattered and reprocessed photons. We find that the accretion flow properties change significantly due to this "re-radiation" effect. The inflow rate at the inner boundary is reduced, while the outflow rate at the outer boundary is enhanced by about one order of magnitude. This effect is more significant when the density at the outer boundary is higher. The properties of outflows such as velocity, momentum and energy fluxes, and the ratio of outflow rate and the accretion rate, are calculated. We find that the efficiency of transferring the radiation power into the kinetic power of outflow is typically $10^{-3}$, far below the value of $\sim 0.05$ which is assumed in some cosmological simulations. The effect of the temperature of the gas at the outer boundary ($T_0$) is investigated. When $T_0$ is high, the emitted luminosity of the accretion flow oscillates. This is because in this case the gas around the Bondi radius can be more easily heated to be above the virial temperature due to its high internal energy. Another question we hope to address is the so-called "sub-Eddington" puzzle. Observationally, the luminosity of almost all AGNs are sub-Eddington, while theoretically the luminosity of an accretion flow can easily be super-Eddington. We find that even when the re-radiation effect is included and outflow does become much stronger, the luminosity, while reduced, can still be super-Eddington.

preprint2013arXiv

Same Initial States Attack in Yang et al.'s Quantum Private Comparison Protocol and the Improvement

In Yang et al.'s literatures (J. Phys. A: Math. 42, 055305, 2009; J. Phys. A:Math. 43, 209801, 2010), a quantum private comparison protocol based on Bell states and hash function is proposed, which aims to securely compare the equality of two participants' information with the help of a dishonest third party (TP). However, this study will point out their protocol cannot resist a special kind of attack, TP's same initial states attack, which is presented in this paper. That is, the dishonest TP can disturb the comparison result without being detected through preparing the same initial states. Finally, a simple improvement is given to avoid the attack.

preprint2013arXiv

Substructure in bulk velocities of Milky Way disk stars

We find that Galactic disk stars near the anticenter exhibit velocity asymmetries in both the Galactocentric radial and vertical components across the mid-plane as well as azimuthally. These findings are based on LAMOST spectroscopic velocities for a sample of ~400,000 F-type stars, combined with proper motions from the PPMXL catalog for which we have derived corrections to the zero points based in part on spectroscopically discovered galaxies and QSOs from LAMOST. In the region within 2 kpc outside the Sun's radius and +/-2 kpc from the Galactic midplane, we show that stars above the plane exhibit net outward radial motions with downward vertical velocities, while stars below the plane have roughly the opposite behavior. We discuss this in the context of other recent findings, and conclude that we are likely seeing the signature of vertical disturbances to the disk due to an external perturbation.

preprint2013arXiv

The Gravitational Potential Near the Sun From SEGUE K-dwarf Kinematics

To constrain the Galactic gravitational potential near the Sun ($\sim$1.5 kpc), we derive and model the spatial and velocity distribution for a sample of 9000 K-dwarfs that have spectra from SDSS/SEGUE, which yield radial velocities and abundances ([Fe/H] & [$α$/Fe]). We first derive the spatial density distribution for stars of three abundance-selected sub-populations by accounting for the survey's selection function. The vertical profile of these sub-populations are simple exponentials and their vertical dispersion profile is nearly isothermal. To model these data, we apply the `vertical' Jeans Equation, which relates the observable tracer number density and vertical velocity dispersion to the gravitational potential or vertical force. We explore a number of functional forms for the vertical force law, and fit the dispersion and density profiles of all abundance selected sub-populations simultaneously in the same potential, and explore all parameter co-variances using MCMC. Our fits constrain a disk {\it mass} scale height $\lesssim$ 300 pc and the total surface mass density to be $67 \pm 6 M_{\odot} {\rm pc^{-2}}$ at $|z| = 1.0$ kpc of which the contribution from all stars is $42 \pm 5 M_{\odot} {\rm pc^{-2}}$ (presuming a contribution from cold gas of $13 M_{\odot} {\rm pc^{-2}}$). We find significant constraints on the local dark matter density of $0.0065\pm0.0023 M_{\odot} {\rm pc^{-3}}$ ($0.25\pm0.09 {\rm GeV cm^{-3}} $). Together with recent experiments this firms up the best estimate of $0.0075\pm0.0021 M_{\odot} {\rm pc^{-3}}$ ($0.28\pm0.08 {\rm GeV cm^{-3}} $), consistent with global fits of approximately round dark matter halos to kinematic data in the outskirts of the Galaxy.

preprint2013arXiv

Triggering star formation by both radiative and mechanical active galactic nucleus feedback

We perform two dimensional hydrodynamic numerical simulations to study the positive active galactic nucleus feedback which triggers, rather than suppresses, star formation. Recently, it was shown by Nayakshin et al. and Ishibashi et al. that star formation occurs when the cold interstellar medium is squeezed by the impact of mass outflow or radiation pressure, respectively. Mass outflow is ubiquitous in this astrophysical context, and radiation pressure is also important if the AGN is luminous. For the first time in this subject, we incorporate both mass outflow feedback and radiative feedback into our model. Consequently, the ISM is shocked into shells by the AGN feedback, and these shells soon fragment into clumps and filaments because of Rayleigh-Taylor and thermal instabilities. We have two major findings: (1) the star formation rate can indeed be very large in the clumps and filaments. However, the resultant star formation rate density is too large compared with previous works, which is mainly because we ignore the fact that most of the stars that are formed would be disrupted when they move away from the galactic center. (2) Although radiation pressure feedback has a limited effect, when mass outflow feedback is also included, they reinforce each other. Specifically, in the gas-poor case, mass outflow is always the dominant contributor to feedback.

preprint2012arXiv

A new determination of the local dark matter density from the kinematics of K dwarfs

We apply a new method to determine the local disc matter and dark halo matter density to kinematic and position data for \sim2000 K dwarf stars taken from the literature. Our method assumes only that the disc is locally in dynamical equilibrium, and that the 'tilt' term in the Jeans equations is small up to \sim1 kpc above the plane. We present a new calculation of the photometric distances to the K dwarf stars, and use a Monte Carlo Markov Chain to marginalise over uncertainties in both the baryonic mass distribution, and the velocity and distance errors for each individual star. We perform a series of tests to demonstrate that our results are insensitive to plausible systematic errors in our distance calibration, and we show that our method recovers the correct answer from a dynamically evolved N-body simulation of the Milky Way. We find a local dark matter density of ρdm = 0.025+0.014-0.013 M\odotpc^{-3} (0.95+0.53-0.49 GeV cm^{-3}) at 90% confidence assuming no correction for the non-flatness of the local rotation curve, and ρdm = 0.022+0.015-0.013 M\odotpc^-3 (0.85+0.57-0.50 GeV cm^{-3}) if the correction is included. Our 90% lower bound on ρdm is larger than the canonical value typically assumed in the literature, and is at mild tension with extrapolations from the rotation curve that assume a spherical halo. Our result can be explained by a larger normalisation for the local Milky Way rotation curve, an oblate dark matter halo, a local disc of dark matter, or some combination of these.

preprint2012arXiv

A resonant feature near the Perseus arm revealed by red clump stars

We investigate the extinction together with the radial velocity dispersion and distribution of red clump stars in the anti-center direction using spectra obtained with Hectospec on the MMT. We find that extinction peaks at Galactocentric radii of about 9.5 and 12.5 kpc, right in front of the locations of the Perseus and Outer arms and in line with the relative position of dust and stars in external spiral galaxies. The radial velocity dispersion peaks around 10kpc, which coincides with the location of the Perseus arm, yields an estimated arm-interarm density contrast of 1.3-1.5 and is in agreement with previous studies. Finally, we discover that the radial velocity distribution bifurcates around 10-11 kpc into two peaks at +27 km/s and -4 km/s. This seems to be naturally explained by the presence of the outer Lindblad resonance of the Galactic bar, but further observations will be needed to understand if the corotation resonance of the spirals arms also plays a role.

preprint2012arXiv

An Algorithm for Preferential Selection of Spectroscopic Targets in LEGUE

We describe a general target selection algorithm that is applicable to any survey in which the number of available candidates is much larger than the number of objects to be observed. This routine aims to achieve a balance between a smoothly-varying, well-understood selection function and the desire to preferentially select certain types of targets. Some target-selection examples are shown that illustrate different possibilities of emphasis functions. Although it is generally applicable, the algorithm was developed specifically for the LAMOST Experiment for Galactic Understanding and Exploration (LEGUE) survey that will be carried out using the Chinese Guo Shou Jing Telescope. In particular, this algorithm was designed for the portion of LEGUE targeting the Galactic halo, in which we attempt to balance a variety of science goals that require stars at fainter magnitudes than can be completely sampled by LAMOST. This algorithm has been implemented for the halo portion of the LAMOST pilot survey, which began in October 2011.

preprint2012arXiv

Chemo-orbital evidence from SDSS/SEGUE G-type dwarf stars for a mixed origin of the Milky Way's thick disk

We combine the estimated metallicities [Fe/H], abundances [α/Fe], positions and motions of a sample of 27,500 local (7<R/kpc<9, 0.5<|z|/kpc<2.5) SDSS/SEGUE G-type dwarf stars to investigate the chemo-orbital properties of the Milky Way's disk around the Sun. When we derive the orbital properties reflecting angular momentum, circularity, and thickness as function of [α/Fe] vs. [Fe/H], we find that there is a smooth variation with [α/Fe], a proxy for age. At the same time, the orbital properties of the old stars with [α/Fe]$\gtrsim$0.25 do show a transition with [Fe/H]: below [Fe/H]$\simeq$-0.6 the orbital angular momentum decreases, and the orbits become significantly non-circular and thicker. Radial migration of stars into the Solar neighborhood would naturally result in a smooth variation in the orbital properties, but the latter old metal-poor stars form a clear challenge, in particular because a basic feature of radial migration is that stars remain on near-circular orbits. When we next select stars on near-circular orbits, we indeed find besides the α-young 'thin-disk' stars a significant contribution to the α-old 'thick-disk' metal-rich stars. However, the remaining α-old 'thick-disk' stars on eccentric orbits, including nearly all old metal-poor stars, are difficult to explain with radial migration alone, but might have formed through early-on gas-rich mergers. We thus find chemo-orbital evidence that the thicker component of the Milky Way disk is not distinct from the thin component as expected from smooth internal evolution through radial migration, except for the old metal-poor stars with different orbital properties which could be part of a distinct thick-disk component formed through an external mechanism.

preprint2012arXiv

LAMOST Experiment for Galactic Understanding and Exploration (LEGUE) The survey science plan

We describe the current plans for a spectroscopic survey of millions of stars in the Milky Way galaxy using the Guo Shou Jing Telescope (GSJT, formerly the Large Area Multi-Object Spectroscopic Telescope - LAMOST). The survey will obtain spectra for 2.5 million stars brighter than $r<19$ during dark/grey time, and 5 million stars brighter than $r<17$ or $J<16$ on nights that are moonlit or have low transparency. The survey will begin in fall of 2012, and will run for at least four years. The telescope design constrains the optimal declination range for observations to $10^\circ<δ<50^\circ$, and site conditions lead to an emphasis on stars in the direction of the Galactic anticenter. The survey is divided into three parts with different target selection strategies: disk, anticenter, and spheroid. The resulting dataset will be used to study the merger history of the Milky Way, the substructure and evolution of the disks, the nature of the first generation of stars through identification of the lowest metallicity stars, and star formation through study of open clusters and the OB associations. Detailed design of the LEGUE survey will be completed after a review of the results of the pilot survey in summer 2012.

preprint2012arXiv

The LEGUE High Latitude Bright Survey Design for the LAMOST Pilot Survey

We describe the footprint and input catalog for bright nights in the LAMOST Pilot Survey, which began in October 2011. Targets are selected from two stripes in the north and south Galactic Cap regions, centered at $α$= 29$^\circ$, with 10$^\circ$ width in declination, covering right ascension of 135$^\circ-290^\circ$ and -30$^\circ$ to 30$^\circ$ respectively. We selected spectroscopic targets from a combination of the SDSS and 2MASS point source catalogs. The catalog of stars defining the field centers (as required by the Shack-Hartmann wavefront sensor at the center of the LAMOST field) consists of all V < 8m stars from the Hipparcos catalog. We employ a statistical selection algorithm that assigns priorities to targets based on their positions in multidimensional color/magnitude space. This scheme overemphasizes rare objects and de-emphasizes more populated regions of magnitude and color phase space, while ensuring a smooth, well-understood selection function. A demonstration of plate design is presented based on the Shack-Hartmann star catalog and an input catalog that was generated by our target selection routines.

preprint2012arXiv

The LEGUE Input Catalogue for Dark Night Observing in the LAMOST Pilot Survey

We outline the design of the dark nights portion of the LAMOST Pilot Survey, which began observations in October 2011. In particular, we focus on Milky Way stellar candidates that are targeted for the LEGUE (LAMOST Experiment for Galactic Understanding and Exploration) survey. We discuss the regions of sky in which spectroscopic candidates were selected, and the motivations for selecting each of these sky areas. Some limitations due to the unique design of the telescope are discussed, including the requirement that a bright (V < 8) star be placed at the center of each plate for wavefront sensing and active optics corrections. The target selection categories and scientific goals motivating them are briefly discussed, followed by a detailed overview of how these selection functions were realized. We illustrate the difference between the overall input catalog - Sloan Digital Sky Survey (SDSS) photometry - and the final targets selected for LAMOST observation.

preprint2012arXiv

The selection of LEGUE disk targets for LAMOST's pilot survey

We describe the target selection algorithm for the low latitude disk portion of the LAMOST Pilot Survey, which aims to test systems in preparation for the LAMOST spectroscopic survey. We use the PPMXL (Roeser et al. 2010) astrometric catalog, which provides positions, proper motions, B/R/I magnitudes (mostly) from USNO-B (Monet et al. 2003) and J/H/Ks from The Two Micron All Sky Survey (2MASS, see Skrutskie et al. 2006) as well. We chose 8 plates along the Galactic plane, in the region $0^\circ<α<67^\circ$ and $42^\circ<δ<59^\circ$, that cover 22 known open clusters with a range of ages. Adjacent plates may have small overlapping. Each plate covers an area $2.5^\circ$ in radius,with central star (for Shack-Hartmann guider) brighter than $\sim8^{\rm th}$ magnitude. For each plate, we create an input catalog in the magnitude range $11.3<Imag<16.3$ and $Bmag$ available from PPMXL. The stars are selected to satisfy the requirements of the fiber positioning system and have a uniform distribution in the $I$ vs. $B-I$ color-magnitude diagram. Our final input catalog consists of 12,000 objects on each of 8 plates that are observable during the winter observing season in Xinglong Station of the National Astronomical Observatory of China.

preprint2012arXiv

The site conditions of the Guo Shou Jing Telescope

The weather at Xinglong Observing Station, where the Guo Shou Jing Telescope (GSJT) is located, is strongly affected by the monsoon climate in north-east China. The LAMOST survey strategy is constrained by these weather patterns. In this paper, we present a statistics on observing hours from 2004 to 2007, and the sky brightness, seeing, and sky transparency from 1995 to 2011 at the site. We investigate effects of the site conditions on the survey plan. Operable hours each month shows strong correlation with season: on average there are 8 operable hours per night available in December, but only 1-2 hours in July and August. The seeing and the sky transparency also vary with seasons. Although the seeing is worse in windy winters, and the atmospheric extinction is worse in the spring and summer, the site is adequate for the proposed scientific program of LAMOST survey. With a Monte Carlo simulation using historical data on the site condition, we find that the available observation hours constrain the survey footprint from 22h to 16h in right ascension; the sky brightness allows LAMOST to obtain the limit magnitude of V = 19.5mag with S/N = 10.

preprint2012arXiv

The spatial structure of mono-abundance sub-populations of the Milky Way disk

The spatial, kinematic, and elemental-abundance structure of the Milky Way's stellar disk is complex, and has been difficult to dissect with local spectroscopic or global photometric data. Here, we develop and apply a rigorous density modeling approach for Galactic spectroscopic surveys that enables investigation of the global spatial structure of stellar sub-populations in narrow bins of [α/Fe] and [Fe/H], using 23,767 G-type dwarfs from SDSS/SEGUE. We fit models for the number density of each such mono-abundance component, properly accounting for the complex spectroscopic SEGUE sampling of the underlying stellar population. We find that each mono-abundance sub-population has a simple spatial structure that can be described by a single exponential in both the vertical and radial direction, with continuously increasing scale heights (~200 pc to 1 kpc) and decreasing scale lengths (>4.5 kpc to 2 kpc) for increasingly older sub-populations, as indicated by their lower metallicities and [α/Fe] enhancements. That the abundance-selected sub-components with the largest scale heights have the shortest scale lengths is in sharp contrast with purely geometric `thick--thin disk' decompositions. To the extent that [α/Fe] is an adequate proxy for age, our results directly show that older disk sub-populations are more centrally concentrated, which implies inside-out formation of galactic disks. The fact that the largest scale-height sub-components are most centrally concentrated in the Milky Way is an almost inevitable consequence of explaining the vertical structure of the disk through internal evolution. Whether the simple spatial structure of the mono-abundance sub-components, and the striking correlations between age, scale length, and scale height can be plausibly explained by satellite accretion or other external heating remains to be seen.

preprint2012arXiv

TrueLabel + Confusions: A Spectrum of Probabilistic Models in Analyzing Multiple Ratings

This paper revisits the problem of analyzing multiple ratings given by different judges. Different from previous work that focuses on distilling the true labels from noisy crowdsourcing ratings, we emphasize gaining diagnostic insights into our in-house well-trained judges. We generalize the well-known DawidSkene model (Dawid & Skene, 1979) to a spectrum of probabilistic models under the same "TrueLabel + Confusion" paradigm, and show that our proposed hierarchical Bayesian model, called HybridConfusion, consistently outperforms DawidSkene on both synthetic and real-world data sets.

preprint2011arXiv

Quantifying Kinematic Substructure in the Milky Way's Stellar Halo

We present and analyze the positions, distances, and radial velocities for over 4000 blue horizontal-branch (BHB) stars in the Milky Way's halo, drawn from SDSS DR8. We search for position-velocity substructure in these data, a signature of the hierarchical assembly of the stellar halo. Using a cumulative "close pair distribution" (CPD) as a statistic in the 4-dimensional space of sky position, distance, and velocity, we quantify the presence of position-velocity substructure at high statistical significance among the BHB stars: pairs of BHB stars that are close in position on the sky tend to have more similar distances and radial velocities compared to a random sampling of these overall distributions. We make analogous mock-observations of 11 numerical halo formation simulations, in which the stellar halo is entirely composed of disrupted satellite debris, and find a level of substructure comparable to that seen in the actually observed BHB star sample. This result quantitatively confirms the hierarchical build-up of the stellar halo through a signature in phase (position-velocity) space. In detail, the structure present in the BHB stars is somewhat less prominent than that seen in most simulated halos, quite possibly because BHB stars represent an older sub-population. BHB stars located beyond 20 kpc from the Galactic center exhibit stronger substructure than at $\rm r_{gc} < 20$ kpc.

preprint2010arXiv

Observational Evidence from SDSS for a Merger Origin of the Milky Way's Thick Disk

We test competing models that aim at explaining the nature of stars in the Milky Way that are well away (|z|$\gtrsim$ 1kpc) from the midplane, the so-called thick disk: the stars may have gotten there through orbital migration, through satellite mergers and accretion, or through heating of pre-existing thin disk stars. Sales et al. (2009) proposed the eccentricity distribution of thick disk stars as a diagnostic to differentiate between these mechanisms. Drawing on SDSS DR7, we have assembled a sample of 34,223 G-dwarfs with 6-D phase-space information and metallicities, and have derived orbital eccentricities for them. Comparing the resulting eccentricity distributions, p(e|z), with the models, we find that: a) the observed p(e|z) is inconsistent with that predicted by orbital migration only, as there are more observed stars of high and of very low eccentricity; b) scenarios where the thick disk is made predominantly through abrupt heating of a pre-existing thin disk are also inconsistent, as they predict more high-eccentricity stars than observed; c) the observed p(e|z) fits well with a "gas-rich merger" scenario, where most thick disk stars were born from unsettled gas in situ.

Chao Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

115 published item(s)

Decision-Aware Semantic State Synchronization in Compute-First Networking

Evidence-Grounded Multi-Agent Planning Support for Urban Carbon Governance via RAG

On the Adversarial Robustness of 3D Large Vision-Language Models

Transition Matching Distillation for Fast Video Generation

Introduction to the Chinese Space Station Survey Telescope (CSST)

Millions of Main-Sequence Binary Stars from Gaia BP/RP Spectra

Nanoparticles Passive Targeting Allows Optical Imaging of Bone Diseases

A bimodal distribution of haze in Pluto's atmosphere

A Low-Cost, Highly Customizable Solution for Position Estimation in Modular Robots

A Robust Hot Subdwarfs Identification Method Based on Deep Learning

CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval

CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words

Enhancing Marine Data Transmission with Socially-Aware Resilient Vessel Networks

Graph Decipher: A transparent dual-attention graph neural network to understand the message-passing mechanism for the node classification

Identification of new classical Be stars from the LAMOST MRS survey

Immunofluorescence Capillary Imaging Segmentation: Cases Study

Implementation of an Automated Learning System for Non-experts

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold

Mass-Ratio Distribution of Binaries From the LAMOST-MRS Survey

Milky Way Mass with K Giants and BHB Stars Using LAMOST, SDSS/SEGUE, and Gaia: 3D Spherical Jeans Equation and Tracer Mass Estimator

MixNN: A design for protecting deep learning models

On-demand Integrated Quantum Memory for Polarization Qubits

On-demand multimode optical storage in a laser-written on-chip waveguide

Overview of the LAMOST survey in the first decade

Planets Across Space and Time (PAST). III. Morphology of the Planetary Radius Valley as a Function of Stellar Age and Metallicity in the Galactic Context Revealed by the LAMOST-Gaia-Kepler Sample

Precognition in Task-oriented Dialogue Understanding: Posterior Regularization by Future Context

Recursive Least Squares Policy Control with Echo State Network

Rigorous proof of slightly nonlinear Jeans instability in the expanding Newtonian universe

S4OD: Semi-Supervised learning for Single-Stage Object Detection

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Sparse-Dyn: Sparse Dynamic Graph Multi-representation Learning via Event-based Sparse Temporal Attention Network

The Eclipsing Binaries from the LAMOST Medium-resolution Survey.III. A High-precision Empirical Stellar Mass Library

The North/South Asymmetry of the Galaxy: Possible Connection to the Vertical Phase Space Snail

A Catalog of LAMOST Variable Sources Based on Time-domain Photometry of ZTF

Binary fraction of O and B-type stars from LAMOST data

BLOCKEYE: Hunting For DeFi Attacks on Blockchain

Future stability of the FLRW spacetime for a large class of perfect fluids

Generation of entanglement between a highly wave-packet-tunable photon and a spin-wave memory in cold atoms

LAMOST Time-Domain Survey: First Results of four $K$2 plates

Noise suppression in a temporal-multimode quantum memory entangled with a photon via asymmetrical photon-collection channel

The Binarity of Early-type Stars from LAMOST Medium-resolution Spectroscopic Survey

The mass of the Milky Way out to 100 kpc using halo stars

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

Two-dimensional charge density wave TaX$_2$ (X=S, Se, Te) from first principles

A Convolutional Neural Network-Based Low Complexity Filter

Anisotropy of the Milky Way's stellar halo using K giants from LAMOST and $Gaia$

Automatic Lumbar Spinal CT Image Segmentation with a Dual Densely Connected U-Net

Characterising the Performance of High-Speed Data Converters for RFSoC-based Radio Astronomy Receivers

Differential rotation of the halo traced by the K-giant stars

Discovery of two nearby post-T Tauri stellar associations

Hyperfine Structure and Coherent Dynamics of Rare Earth Spins Explored with Electron-Nuclear Double Resonance at Sub-Kelvin Temperatures

Interpretable Machine Learning Model for Early Prediction of Mortality in Elderly Patients with Multiple Organ Dysfunction Syndrome (MODS): a Multicenter Retrospective Study and Cross Validation

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Measuring the local dark matter density with LAMOST DR5 and Gaia DR2

On the Chemical and Kinematic Consistency Between N-rich Metal-poor Field Stars and Enriched Populations in Globular Clusters

On-demand quantum storage of photonic qubits in an on-chip waveguide

Possible evidence of hydrogen emission in the first-overtone and multi-mode RR Lyrae variables

Reliable coherent optical memory based on a laser-written waveguide

Reverse-engineering Bar Charts Using Neural Networks

The extended Gaia-PS1-SDSS (GPS1+) proper motion catalog

Three New Late-type Hypervelocity Star Candidates from Gaia DR2 with Refined Selection Criteria

Understanding the velocity distribution of the Galactic Bulge with APOGEE and Gaia

Deriving the stellar labels of LAMOST spectra with Stellar LAbel Machine (SLAM)

Exploring the spectral \textit{information content} in the LAMOST medium-resolution survey (MRS)

On half-factoriality of transfer Krull monoids

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

A catalogue of early-type emission-line stars and Hα line profiles from LAMOST DR2

An unsupervised spatiotemporal graphical modeling approach to anomaly detection in distributed CPS

Calibration of LAMOST Stellar Surface Gravities Using the Kepler Asteroseismic Data

Carbon stars from LAMOST DR2 data

Characterizing the SHARDS of Disrupted Milky Way Satellites with LAMOST

Hot Subdwarf Stars Observed in LAMOST DR1 - Atmospheric parameters from single-lined spectra

New tidal debris nearby the Sagittarius leading tail from the LAMOST DR2 M giant stars