Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
67works
0followers
34topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

67 published item(s)

preprint2026arXiv

Decision-Aware Semantic State Synchronization in Compute-First Networking

In Compute-First Networking (CFN), an Access Point (AP) makes task offloading decisions based on resource state information reported by a Service Node (SN). A fundamental challenge arises from the trade-off between update overhead and decision accuracy: Frequent state updates consume limited network resources, while infrequent updates lead to stale state views and degraded task performance, especially under high system load. Existing approaches based on periodic updates or Age of Information (AoI) mainly focus on temporal freshness and often overlook whether a state change is actually relevant to offloading decisions. This paper proposes SenseCFN, a decision-aware state synchronization framework for CFN. Instead of synchronizing raw resource states, SenseCFN focuses on identifying state changes that are likely to alter offloading decisions. To this end, we introduce a lightweight semantic state representation that captures decision-relevant system characteristics, along with a Semantic Deviation Index (SDI) to quantify the impact of state shifts on decision outcomes. Based on SDI, the SN triggers updates only when significant decision-impacting changes are detected. Meanwhile, the AP performs offloading decisions using cached semantic states with explicit awareness of potential staleness. The update and offloading policies are jointly optimized using a centralized training with distributed execution (CTDE) approach. Simulation results show that SenseCFN maintains a task success rate of up to 99.6% in saturation-prone scenarios, outperforming baseline methods by more than 25%, while reducing status update frequency by approximately 70% to 96%. These results indicate that decision-aware state synchronization provides an effective and practical alternative to purely time-based update strategies in CFN.

preprint2026arXiv

Evidence-Grounded Multi-Agent Planning Support for Urban Carbon Governance via RAG

Urban carbon governance requires planners to integrate heterogeneous evidence -- emission inventories, statistical yearbooks, policy texts, technical measures, and academic findings -- into actionable, cross-departmental plans. Large Language Models (LLMs) can assist planning workflows, yet their factual reliability and evidential traceability remain critical barriers in professional use. This paper presents an evidence-grounded multi-agent planning support system for urban carbon governance built upon standard text-based Retrieval-Augmented Generation (RAG) (without GraphRAG). We align the system with the typical planning workflow by decomposing tasks into four specialized agents: (i) evidence Q\&A for fact checking and compliance queries, (ii) emission status assessment for diagnostic analysis, (iii) planning recommendation for generating multi-sector governance pathways, and (iv) report integration for producing planning-style deliverables. We evaluate the system in two task families: factual retrieval and comprehensive planning generation. On factual retrieval tasks, introducing RAG increases the average score from below 6 to above 90, and dramatically improves key-field extraction (e.g., region and numeric values near 100\% detection). A real-city case study (Ningbo, China) demonstrates end-to-end report generation with strong relevance, coverage, and coherence in expert review, while also highlighting boundary inconsistencies across data sources as a practical limitation.

preprint2026arXiv

On the Adversarial Robustness of 3D Large Vision-Language Models

3D Vision-Language Models (VLMs), such as PointLLM and GPT4Point, have shown strong reasoning and generalization abilities in 3D understanding tasks. However, their adversarial robustness remains largely unexplored. Prior work in 2D VLMs has shown that the integration of visual inputs significantly increases vulnerability to adversarial attacks, making these models easier to manipulate into generating toxic or misleading outputs. In this paper, we investigate whether incorporating 3D vision similarly compromises the robustness of 3D VLMs. To this end, we present the first systematic study of adversarial robustness in point-based 3D VLMs. We propose two complementary attack strategies: \textit{Vision Attack}, which perturbs the visual token features produced by the 3D encoder and projector to assess the robustness of vision-language alignment; and \textit{Caption Attack}, which directly manipulates output token sequences to evaluate end-to-end system robustness. Each attack includes both untargeted and targeted variants to measure general vulnerability and susceptibility to controlled manipulation. Our experiments reveal that 3D VLMs exhibit significant adversarial vulnerabilities under untargeted attacks, while demonstrating greater resilience against targeted attacks aimed at forcing specific harmful outputs, compared to their 2D counterparts. These findings highlight the importance of improving the adversarial robustness of 3D VLMs, especially as they are deployed in safety-critical applications.

preprint2026arXiv

Transition Matching Distillation for Fast Video Generation

Large video diffusion and flow models have achieved remarkable success in high-quality video generation, but their use in real-time interactive applications remains limited due to their inefficient multi-step sampling process. In this work, we present Transition Matching Distillation (TMD), a novel framework for distilling video diffusion models into efficient few-step generators. The central idea of TMD is to match the multi-step denoising trajectory of a diffusion model with a few-step probability transition process, where each transition is modeled as a lightweight conditional flow. To enable efficient distillation, we decompose the original diffusion backbone into two components: (1) a main backbone, comprising the majority of early layers, that extracts semantic representations at each outer transition step; and (2) a flow head, consisting of the last few layers, that leverages these representations to perform multiple inner flow updates. Given a pretrained video diffusion model, we first introduce a flow head to the model, and adapt it into a conditional flow map. We then apply distribution matching distillation to the student model with flow head rollout in each transition step. Extensive experiments on distilling Wan2.1 1.3B and 14B text-to-video models demonstrate that TMD provides a flexible and strong trade-off between generation speed and visual quality. In particular, TMD outperforms existing distilled models under comparable inference costs in terms of visual fidelity and prompt adherence. Project page: https://research.nvidia.com/labs/genair/tmd

preprint2025arXiv

Introduction to the Chinese Space Station Survey Telescope (CSST)

The Chinese Space Station Survey Telescope (CSST) is an upcoming Stage-IV sky survey telescope, distinguished by its large field of view (FoV), high image quality, and multi-band observation capabilities. It can simultaneously conduct precise measurements of the Universe by performing multi-color photometric imaging and slitless spectroscopic surveys. The CSST is equipped with five scientific instruments, i.e. Multi-band Imaging and Slitless Spectroscopy Survey Camera (SC), Multi-Channel Imager (MCI), Integral Field Spectrograph (IFS), Cool Planet Imaging Coronagraph (CPI-C), and THz Spectrometer (TS). Using these instruments, CSST is expected to make significant contributions and discoveries across various astronomical fields, including cosmology, galaxies and active galactic nuclei (AGN), the Milky Way and nearby galaxies, stars, exoplanets, Solar System objects, astrometry, and transients and variable sources. This review aims to provide a comprehensive overview of the CSST instruments, observational capabilities, data products, and scientific potential.

preprint2025arXiv

Millions of Main-Sequence Binary Stars from Gaia BP/RP Spectra

We present the main-sequence binary (MSMS) Catalog derived from Gaia Data Release 3 BP/RP (XP) spectra. Leveraging the vast sample of low-resolution Gaia XP spectra, we develop a forward modeling approach that maps stellar mass and photometric metallicity to XP spectra using a neural network. Our methodology identifies binary systems through statistical comparison of single- and binary-star model fits, enabling detection of binaries with mass ratios between 0.4 and 1.0 and flux ratios larger than 0.1. From an initial sample of 35 million stars within 1 kpc, we identify 14 million binary candidates and define a high-confidence "golden sample" of 1 million binary systems. This large, homogeneous sample enables detailed statistical analysis of binary properties across diverse Galactic environments, providing new insights into binary star formation and evolution. In addition, the $χ^2$ comparison allows us to distinguish stars with luminous companions from single stars or binaries with dark companions, such as white dwarfs, neutron stars and black hole candidates, improving our understanding of compact object populations.

preprint2023arXiv

Nanoparticles Passive Targeting Allows Optical Imaging of Bone Diseases

Bone health related skeletal disorders are commonly diagnosed by X-ray imaging, but the radiation limits its use. Light excitation and optical imaging through the near-infrared-II window (NIR-II, 1000-1700 nm) can penetrate deep tissues without radiation risk, but the targeting of contrast agent is non-specific. Here, we report that lanthanide-doped nanocrystals can be passively transported by endothelial cells and macrophages from the blood vessels into bone marrow microenvironment. We found that this passive targeting scheme can be effective for longer than two months. We therefore developed an intravital 3D and high-resolution planar imaging instrumentation for bone disease diagnosis. We demonstrated the regular monitoring of 1 mm bone defects for over 10 days, with resolution similar to X-ray imaging result, but more flexible use in prognosis. Moreover, the passive targeting can be used to reveal the early onset inflammation at the joints as the synovitis in the early stage of rheumatoid arthritis. Furthermore, the proposed method is comparable to μCT in recognizing symptoms of osteoarthritis, including the mild hyperostosis in femur which is ~100 μm thicker than normal, and the growth of millimeter-scale osteophyte in the knee joint, which further proves the power and universality of our approach in diagnosis of bone diseases

preprint2022arXiv

A bimodal distribution of haze in Pluto's atmosphere

Pluto, Titan, and Triton make up a unique class of solar system bodies, with icy surfaces and chemically reducing atmospheres rich in organic photochemistry and haze formation. Hazes play important roles in these atmospheres, with physical and chemical processes highly dependent on particle sizes, but the haze size distribution in reducing atmospheres is currently poorly understood. Here we report observational evidence that Pluto's haze particles are bimodally distributed, which successfully reproduces the full phase scattering observations from New Horizons. Combined with previous simulations of Titan's haze, this result suggests that haze particles in reducing atmospheres undergo rapid shape change near pressure levels ~0.5Pa and favors a photochemical rather than a dynamical origin for the formation of Titan's detached haze. It also demonstrates that both oxidizing and reducing atmospheres can produce multi-modal hazes, and encourages reanalysis of observations of hazes on Titan and Triton.

preprint2022arXiv

A Low-Cost, Highly Customizable Solution for Position Estimation in Modular Robots

Accurate position sensing is important for state estimation and control in robotics. Reliable and accurate position sensors are usually expensive and difficult to customize. Incorporating them into systems that have very tight volume constraints such as modular robots are particularly difficult. PaintPots are low-cost, reliable, and highly customizable position sensors, but their performance is highly dependent on the manufacturing and calibration process. This paper presents a Kalman filter with a simplified observation model developed to deal with the non-linearity issues that result in the use of low-cost microcontrollers. In addition, a complete solution for the use of PaintPots in a variety of sensing modalities including manufacturing, characterization, and estimation is presented for an example modular robot, SMORES-EP. This solution can be easily adapted to a wide range of applications.

preprint2022arXiv

A Robust Hot Subdwarfs Identification Method Based on Deep Learning

Hot subdwarf star is a particular type of star that is crucial for studying binary evolution and atmospheric diffusion processes. In recent years, identifying Hot subdwarfs by machine learning methods has become a hot topic, but there are still limitations in automation and accuracy. In this paper, we proposed a robust identification method based on the convolutional neural network (CNN). We first constructed the dataset using the spectral data of LAMOS DR7-V1. We then constructed a hybrid recognition model including an 8-class classification model and a binary classification model. The model achieved an accuracy of 96.17% on the testing set. To further validate the accuracy of the model, we selected 835 Hot subdwarfs that were not involved in the training process from the identified LAMOST catalog (2428, including repeated observations) as the validation set. An accuracy of 96.05% was achieved. On this basis, we used the model to filter and classify all 10,640,255 spectra of LAMOST DR7-V1, and obtained a catalog of 2393 Hot subdwarf candidates, of which 2067 have been confirmed. We found 25 new Hot subdwarfs among the remaining candidates by manual validation. The overall accuracy of the model is 87.42%. Overall, the model presented in this study can effectively identify specific spectra with robust results and high accuracy, and can be further applied to the classification of large-scale spectra and the search of specific targets.

preprint2022arXiv

CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval

Given a natural language description, text-based person retrieval aims to identify images of a target person from a large-scale person image database. Existing methods generally face a \textbf{color over-reliance problem}, which means that the models rely heavily on color information when matching cross-modal data. Indeed, color information is an important decision-making accordance for retrieval, but the over-reliance on color would distract the model from other key clues (e.g. texture information, structural information, etc.), and thereby lead to a sub-optimal retrieval performance. To solve this problem, in this paper, we propose to \textbf{C}apture \textbf{A}ll-round \textbf{I}nformation \textbf{B}eyond \textbf{C}olor (\textbf{CAIBC}) via a jointly optimized multi-branch architecture for text-based person retrieval. CAIBC contains three branches including an RGB branch, a grayscale (GRS) branch and a color (CLR) branch. Besides, with the aim of making full use of all-round information in a balanced and effective way, a mutual learning mechanism is employed to enable the three branches which attend to varied aspects of information to communicate with and learn from each other. Extensive experimental analysis is carried out to evaluate our proposed CAIBC method on the CUHK-PEDES and RSTPReid datasets in both \textbf{supervised} and \textbf{weakly supervised} text-based person retrieval settings, which demonstrates that CAIBC significantly outperforms existing methods and achieves the state-of-the-art performance on all the three tasks.

preprint2022arXiv

CodeMatcher: Searching Code Based on Sequential Semantics of Important Query Words

To accelerate software development, developers frequently search and reuse existing code snippets from a large-scale codebase, e.g., GitHub. Over the years, researchers proposed many information retrieval based models for code search, but they fail to connect the semantic gap between query and code. An early successful deep learning based model DeepCS solved this issue by learning the relationship between pairs of code methods and corresponding natural language descriptions. Two major advantages of DeepCS are the capability of understanding irrelevant/noisy keywords and capturing sequential relationships between words in query and code. In this paper, we proposed an IR-based model CodeMatcher that inherits the advantages of DeepCS, while it can leverage the indexing technique in the IR-based model to accelerate the search response time substantially. CodeMatcher first collects metadata for query words to identify irrelevant/noisy ones, then iteratively performs fuzzy search with important query words on the codebase that is indexed by the Elasticsearch tool, and finally reranks a set of returned candidate code according to how the tokens in the candidate code snippet sequentially matched the important words in a query. We verified its effectiveness on a large-scale codebase with ~41k repositories. Experimental results showed that CodeMatcher achieves an MRR of 0.60, outperforming DeepCS, CodeHow, and UNIF by 82%, 62%, and 46% respectively. Our proposed model is over 1.2k times faster than DeepCS. Moreover, CodeMatcher outperforms GitHub and Google search by 46% and 33% respectively in terms of MRR. We also observed that: fusing the advantages of IR-based and DL-based models is promising; improving the quality of method naming helps code search, since method name plays an important role in connecting query and code.

preprint2022arXiv

Enhancing Marine Data Transmission with Socially-Aware Resilient Vessel Networks

With the multi-dimensional exploration towards oceans, enormous sensing data has been generated with significant volume, velocity, variety and heterogeneity. The resulted Big Marine Data (BMD) thus issue unprecedented architectural challenges on existing marine communication systems. Current dominant marine communication technologies, e.g., shore-based cellular stations, high frequency radio, and expensive satellites, extremely suffer from short coverage, low bandwidth, insecurity, and unavailable cross-domain transmission. In this paper, Resilient Vessel Network (RVN) is proposed to fundamentally enhance BMD transmission. RVNs with widespread self-organized vessels and opportunistic connections reveal advantages of ubiquity, resilience, low cost and cross-domain transmission. To efficiently manage opportunistic vessel-to-vessel (V2V) connections for optimal routing, Social Network Analysis (SNA) on historical vessel interactions is applied for vessel familiarity measurement and community detection. The performance of the proposed community-based routing (CBR) is comprehensively evaluated with real datasets of fishing vessel trajectories. It is demonstrated that CBR achieves much lower transmission cost with comparable delivery ratio compared to typical routing algorithms.

preprint2022arXiv

Graph Decipher: A transparent dual-attention graph neural network to understand the message-passing mechanism for the node classification

Graph neural networks can be effectively applied to find solutions for many real-world problems across widely diverse fields. The success of graph neural networks is linked to the message-passing mechanism on the graph, however, the message-aggregating behavior is still not entirely clear in most algorithms. To improve functionality, we propose a new transparent network called Graph Decipher to investigate the message-passing mechanism by prioritizing in two main components: the graph structure and node attributes, at the graph, feature, and global levels on a graph under the node classification task. However, the computation burden now becomes the most significant issue because the relevance of both graph structure and node attributes are computed on a graph. In order to solve this issue, only relevant representative node attributes are extracted by graph feature filters, allowing calculations to be performed in a category-oriented manner. Experiments on seven datasets show that Graph Decipher achieves state-of-the-art performance while imposing a substantially lower computation burden under the node classification task. Additionally, since our algorithm has the ability to explore the representative node attributes by category, it is utilized to alleviate the imbalanced node classification problem on multi-class graph datasets.

preprint2022arXiv

Identification of new classical Be stars from the LAMOST MRS survey

Be stars are B-type main-sequence stars that display broad Balmer emission lines in their spectra. Identification of Be population is essential to further examine the formation and evolutionary models. We report the detection of classical Be (CBe) stars from observations with the Large sky Area Multi-Object fiber Spectroscopic Telescope Medium Resolution Survey of Date Release 7 (LAMOST MRS DR7). We used a deep convolutional neural network, the ResNet, with an 18-layer module to examine the morphology of the H alpha profile. We identified 1,162 candidate Be stars from the collection of 2,260,387 spectra for 789,918 stars in the database. The ResNet network achieves a Be star classification accuracy of 99.5%. Among the detections, 151 of these are prior known Be stars cross-matched from the literature. By applying a three-step test, we identified 183 new CBe stars. We find that 41 CBe stars are members of known open clusters. Based upon an investigation of the kinematics of the identified CBe stars from the Gaia EDR3 astrometric solutions, we identified 16 new runaways. These new identifications will provide a reference for future follow-ups to further investigate their physical properties.

preprint2022arXiv

Immunofluorescence Capillary Imaging Segmentation: Cases Study

Nonunion is one of the challenges faced by orthopedics clinics for the technical difficulties and high costs in photographing interosseous capillaries. Segmenting vessels and filling capillaries are critical in understanding the obstacles encountered in capillary growth. However, existing datasets for blood vessel segmentation mainly focus on the large blood vessels of the body, and the lack of labeled capillary image datasets greatly limits the methodological development and applications of vessel segmentation and capillary filling. Here, we present a benchmark dataset, named IFCIS-155, consisting of 155 2D capillary images with segmentation boundaries and vessel fillings annotated by biomedical experts, and 19 large-scale, high-resolution 3D capillary images. To obtain better images of interosseous capillaries, we leverage state-of-the-art immunofluorescence imaging techniques to highlight the rich vascular morphology of interosseous capillaries. We conduct comprehensive experiments to verify the effectiveness of the dataset and the benchmarking deep learning models (\eg UNet/UNet++ and the modified UNet/UNet++). Our work offers a benchmark dataset for training deep learning models for capillary image segmentation and provides a potential tool for future capillary research. The IFCIS-155 dataset and code are all publicly available at \url{https://github.com/ncclabsustech/IFCIS-55}.

preprint2022arXiv

Implementation of an Automated Learning System for Non-experts

Automated machine learning systems for non-experts could be critical for industries to adopt artificial intelligence to their own applications. This paper detailed the engineering system implementation of an automated machine learning system called YMIR, which completely relies on graphical interface to interact with users. After importing training/validation data into the system, a user without AI knowledge can label the data, train models, perform data mining and evaluation by simply clicking buttons. The paper described: 1) Open implementation of model training and inference through docker containers. 2) Implementation of task and resource management. 3) Integration of Labeling software. 4) Implementation of HCI (Human Computer Interaction) with a rebuilt collaborative development paradigm. We also provide subsequent case study on training models with the system. We hope this paper can facilitate the prosperity of our automated machine learning community from industry application perspective. The code of the system has already been released to GitHub (https://github.com/industryessentials/ymir).

preprint2022arXiv

LAMOST medium-resolution spectroscopic survey of binarity and exotic star (LAMOST-MRS-B): Observation strategy and target selection

LAMOST-MRS-B is one of the sub-surveys of LAMOST medium-resolution (R~7500) spectroscopic survey. It aims at studying the statistical properties (e.g., binary fraction, orbital period distribution, mass ratio distribution) of binary stars and exotic stars. We intend to observe about 30000 stars (10 mag <= G <= 14.5 mag) with at least 10 visits in five years. We first planned to observe 25 plates around the galactic plane in 2018. Then the plates were reduced to 12 in 2019 because of the limitation of observation. At the same time, two new plates located at the high galactic latitude were added to explore binary properties influenced by the different environments. In this survey project, we set the identified exotic and low-metallicity stars with the highest observation priorities. For the rest of the selected stars, we gave higher priority to the relatively brighter stars in order to obtain high-quality spectra as many as possible. Spectra of 49129 stars have been obtained in LAMOST-MRS-B field and released in DR8, of which 28828 and 3375 stars have been visited more than twice and ten times with SNR >= 10, respectively. Most of the sources are B-, A-, and F-type stars with 0.6 < [Fe/H] < 0.4 dex. We also obtain 347 identified variable and exotic stars and about 250 stars with [Fe/H] < 1 dex. We measure radial velocities (RVs) by using 892233 spectra of the stars. The uncertainties of RV achieve about 1 km/s and 10 km/s1 for 95% of late- and early-type stars, respectively. The datasets presented in this paper are available at http://www.doi.org/10.57760/sciencedb.j00113.00035.

preprint2022arXiv

Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold

The core problem of text-based person retrieval is how to bridge the heterogeneous gap between multi-modal data. Many previous approaches contrive to learning a latent common manifold mapping paradigm following a \textbf{cross-modal distribution consensus prediction (CDCP)} manner. When mapping features from distribution of one certain modality into the common manifold, feature distribution of the opposite modality is completely invisible. That is to say, how to achieve a cross-modal distribution consensus so as to embed and align the multi-modal features in a constructed cross-modal common manifold all depends on the experience of the model itself, instead of the actual situation. With such methods, it is inevitable that the multi-modal data can not be well aligned in the common manifold, which finally leads to a sub-optimal retrieval performance. To overcome this \textbf{CDCP dilemma}, we propose a novel algorithm termed LBUL to learn a Consistent Cross-modal Common Manifold (C$^{3}$M) for text-based person retrieval. The core idea of our method, just as a Chinese saying goes, is to `\textit{san si er hou xing}&#39;, namely, to \textbf{Look Before yoU Leap (LBUL)}. The common manifold mapping mechanism of LBUL contains a looking step and a leaping step. Compared to CDCP-based methods, LBUL considers distribution characteristics of both the visual and textual modalities before embedding data from one certain modality into C$^{3}$M to achieve a more solid cross-modal distribution consensus, and hence achieve a superior retrieval accuracy. We evaluate our proposed method on two text-based person retrieval datasets CUHK-PEDES and RSTPReid. Experimental results demonstrate that the proposed LBUL outperforms previous methods and achieves the state-of-the-art performance.

preprint2022arXiv

Mass-Ratio Distribution of Binaries From the LAMOST-MRS Survey

Binary evolution leads to the formation of important objects crucial to the development of astrophysics, but the statistical properties of binary populations are still poorly understood. The LAMOST-MRS has provided a large sample of stars to study the properties of binary populations, especially for the mass ratio distributions and the binary fractions. We have devised a Peak Amplitude Ratio (PAR) approach to derive the mass ratio of a binary system based on results obtained from its spectrum. By computing a cross-correlation function (CCF), we established a relationship between the derived mass ratio and the PARs of the binary systems. By utilizing spectral observations obtained from LAMSOT DR6 & DR7, we applied the PAR approach to form distributions of the derived mass ratio of the binary systems to the spectral types. We selected the mass ratio within the range of $0.6-1.0$ for investigating the mass-ratio distribution. Through a power-law fitting, we obtained the power index $γ$ values of $-0.42\pm0.27$, $0.03\pm0.12$, and $2.12\pm0.19$ for A-, F-, and G-type stars identified in the sample, respectively. The derived $γ$-values display an increasing trend toward lower primary star masses, and G-type binaries tend to be more in twins. The close binary fractions (for $P\lesssim 150\,{\rm d}$ and $q\gtrsim 0.6$) in our sample for A, F and G binaries are $7.6\pm 0.5 \%$, $4.9\pm 0.2 \%$ and $3.7 \pm 0.1 \%$, respectively. Note that the PAR approach can be applied to large spectroscopic surveys of stars.

preprint2022arXiv

Milky Way Mass with K Giants and BHB Stars Using LAMOST, SDSS/SEGUE, and Gaia: 3D Spherical Jeans Equation and Tracer Mass Estimator

We measure the enclosed Milky Way mass profile to Galactocentric distances of $\sim70$ and $\sim50$ kpc using the smooth, diffuse stellar halo samples of Bird et al. The samples are LAMOST and SDSS/SEGUE K giants (KG) and SDSS/SEGUE blue horizontal branch (BHB) stars with accurate metallicities. The 3D kinematics are available through LAMOST and SDSS/SEGUE distances and radial velocities and {\it Gaia} DR2 proper motions. Two methods are used to estimate the enclosed mass: 3D spherical Jeans equation and Evans et al. tracer mass estimator (TME). We remove substructure via the Xue et al. method based on integrals of motion. We evaluate the uncertainties on our estimates due to random sampling noise, systematic distance errors, the adopted density profile, and non-virialization and non-spherical effects of the halo. The tracer density profile remains a limiting systematic in our mass estimates, although within these limits we find reasonable agreement across the different samples and the methods applied. Out to $\sim70$ and $\sim50$ kpc, the Jeans method yields total enclosed masses of $4.3\pm0.95$ (random) $\pm0.6$ (systematic) $\times10^{11}$ M$_\odot$ and $4.1\pm1.2$ (random) $\pm0.6$ (systematic) $\times10^{11}$ M$_\odot$ for the KG and BHB stars, respectively. For the KG and BHB samples we find a dark matter virial mass of $M_{200}=0.55^{+0.15}_{-0.11}$ (random) $\pm0.083$ (systematic) $\times10^{12}$ M$_\odot$ and $M_{200}=1.00^{+0.67}_{-0.33}$ (random) $\pm0.15$ (systematic) $\times10^{12}$ M$_\odot$, respectively.

preprint2022arXiv

MixNN: A design for protecting deep learning models

In this paper, we propose a novel design, called MixNN, for protecting deep learning model structure and parameters. The layers in a deep learning model of MixNN are fully decentralized. It hides communication address, layer parameters and operations, and forward as well as backward message flows among non-adjacent layers using the ideas from mix networks. MixNN has following advantages: 1) an adversary cannot fully control all layers of a model including the structure and parameters, 2) even some layers may collude but they cannot tamper with other honest layers, 3) model privacy is preserved in the training phase. We provide detailed descriptions for deployment. In one classification experiment, we compared a neural network deployed in a virtual machine with the same one using the MixNN design on the AWS EC2. The result shows that our MixNN retains less than 0.001 difference in terms of classification accuracy, while the whole running time of MixNN is about 7.5 times slower than the one running on a single virtual machine.

preprint2022arXiv

On-demand Integrated Quantum Memory for Polarization Qubits

Photonic polarization qubits are widely used in quantum computation and quantum communication due to the robustness in transmission and the easy qubit manipulation. An integrated quantum memory for polarization qubits is a fundamental building block for large-scale integrated quantum networks. However, on-demand storing polarization qubits in an integrated quantum memory is a long-standing challenge due to the anisotropic absorption of solids and the polarization-dependent features of microstructures. Here we demonstrate a reliable on-demand quantum memory for polarization qubits, using a depressed-cladding waveguide fabricated in a 151Eu3+: Y2SiO5 crystal. The site-2 151Eu3+ ions in Y2SiO5 crystal provides a near-uniform absorption for arbitrary polarization states and a new pump sequence is developed to prepare a wideband and enhanced absorption profile. A fidelity of 99.4\pm0.6% is obtained for the qubit storage process with an input of 0.32 photons per pulse, together with a storage bandwidth of 10 MHz. This reliable integrated quantum memory for polarization qubits reveals the potential for use in the construction of integrated quantum networks.

preprint2022arXiv

On-demand multimode optical storage in a laser-written on-chip waveguide

Quantum memory is a fundamental building block for large-scale quantum networks. On-demand optical storage with a large bandwidth, a high multimode capacity and an integrated structure simultaneously is crucial for practical application. However, this has not been demonstrated yet. Here, we fabricate an on-chip waveguide in a $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ crystal with insertion losses of 0.2 dB, and propose a novel pumping scheme to enable spin-wave atomic frequency comb (AFC) storage with a bandwidth of 11 MHz inside the waveguide. Based on this, we demonstrate the storage of 200 temporal modes using the AFC scheme and conditional on-demand storage of 100 temporal modes using the spin-wave AFC scheme. The interference visibility between the readout light field and the reference light field is $99.0\% \pm 0.6\%$ and $97\% \pm 3\%$ for AFC and spin-wave AFC storage, respectively, indicating the coherent nature of this low-loss, multimode and integrated storage device.

preprint2022arXiv

Overview of the LAMOST survey in the first decade

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST), also known as the Guoshoujing Telescope, is a major national scientific facility for astronomical research located in Xinglong, China. Beginning with a pilot survey in 2011, LAMOST has been surveying the night sky for more than 10 years. The LAMOST survey covers various objects in the Universe, from normal stars to peculiar ones, from the Milky Way to other galaxies, and from stellar black holes and their companions to quasars that ignite ancient galaxies. Until the latest data release 8, the LAMOST survey has released spectra for more than 10 million stars, ~220,000 galaxies, and ~71,000 quasars. With this largest celestial spectra database ever constructed, LAMOST has helped astronomers to deepen their understanding of the Universe, especially for our Milky Way galaxy and the millions of stars within it. In this article, we briefly review the characteristics, observations, and scientific achievements of LAMOST. In particular, we show how astrophysical knowledge about the Milky Way has been improved by LAMOST data.

preprint2022arXiv

Planets Across Space and Time (PAST). III. Morphology of the Planetary Radius Valley as a Function of Stellar Age and Metallicity in the Galactic Context Revealed by the LAMOST-Gaia-Kepler Sample

The radius valley, a dip in the radius distribution of exoplanets at ~1.9 Earth radii separates compact rocky Super-Earths and Sub-Neptunes with lower density. Various hypotheses have been put forward to explain the radius valley. Characterizing the radius valley morphology and its correlation to stellar properties will provide crucial observation constraints on its origin mechanism and deepen the understanding of planet formation and evolution. In this paper, the third part of the Planets Across the Space and Time (PAST) series, using the LAMOST-Gaia-Kepler catalog, we perform a systematical investigation into how the radius valley morphology varies in the Galactic context, i.e., thin/thick galactic disks, stellar age and metallicity abundance ([Fe/H] and [alpha/Fe]). We find that (1) The valley becomes more prominent with the increase of both age and [Fe/H]. (2) The number ratio of super-Earths to sub-Neptunes monotonically increases with age but decreases with [Fe/H] and [alpha/Fe]. (3) The average radius of planets above the valley (2.1-6 Earth radii) decreases with age but increases with [Fe/H]. (4) In contrast, the average radius of planets below the valley (R < 1.7 Earth radii) is broadly independent on age and metallicity. Our results demonstrate that the valley morphology as well as the whole planetary radius distribution evolves on a long timescale of giga-years, and metallicities (not only Fe but also other metal elements, e.g., Mg, Si, Ca, Ti) play important roles in planet formation and in the long term planetary evolution.

preprint2022arXiv

Precognition in Task-oriented Dialogue Understanding: Posterior Regularization by Future Context

Task-oriented dialogue systems have become overwhelmingly popular in recent researches. Dialogue understanding is widely used to comprehend users&#39; intent, emotion and dialogue state in task-oriented dialogue systems. Most previous works on such discriminative tasks only models current query or historical conversations. Even if in some work the entire dialogue flow was modeled, it is not suitable for the real-world task-oriented conversations as the future contexts are not visible in such cases. In this paper, we propose to jointly model historical and future information through the posterior regularization method. More specifically, by modeling the current utterance and past contexts as prior, and the entire dialogue flow as posterior, we optimize the KL distance between these distributions to regularize our model during training. And only historical information is used for inference. Extensive experiments on two dialogue datasets validate the effectiveness of our proposed method, achieving superior results compared with all baseline models.

preprint2022arXiv

Recursive Least Squares Policy Control with Echo State Network

The echo state network (ESN) is a special type of recurrent neural networks for processing the time-series dataset. However, limited by the strong correlation among sequential samples of the agent, ESN-based policy control algorithms are difficult to use the recursive least squares (RLS) algorithm to update the ESN&#39;s parameters. To solve this problem, we propose two novel policy control algorithms, ESNRLS-Q and ESNRLS-Sarsa. Firstly, to reduce the correlation of training samples, we use the leaky integrator ESN and the mini-batch learning mode. Secondly, to make RLS suitable for training ESN in mini-batch mode, we present a new mean-approximation method for updating the RLS correlation matrix. Thirdly, to prevent ESN from over-fitting, we use the L1 regularization technique. Lastly, to prevent the target state-action value from overestimation, we employ the Mellowmax method. Simulation results show that our algorithms have good convergence performance.

preprint2022arXiv

Rigorous proof of slightly nonlinear Jeans instability in the expanding Newtonian universe

Due to the nonlinearity of the Euler{Poisson equations, it is possible that the nonlinear Jeans instability may lead to a faster density growing rate than the rate in the standard theory of linearized Jeans instability, which motivates us to study the nonlinear Jeans instability. The aim of this article is to develop a method proving the Jeans instability for slightly nonlinear Euler-Poisson equations in the expanding Newtonian universe. The standard proofs of the Jeans instability rely on the Fourier analysis. However, it is difficult to generalize Fourier method to a nonlinear setting, and thus there is no result in the nonlinear analysis of Jeans instability. We firstly develop a non-Fourier-based method to reprove the linearized Jeans instability in the expanding Newtonian universe. Secondly, we generalize this idea to a slightly nonlinear case. This method relies on the Cauchy problem of the Fuchsian system due to the recent developments of this system in mathematics. The fully nonlinear Jeans instability for the Euler-Poisson and Einstein-Euler equations are in progress.

preprint2022arXiv

S4OD: Semi-Supervised learning for Single-Stage Object Detection

Single-stage detectors suffer from extreme foreground-background class imbalance, while two-stage detectors do not. Therefore, in semi-supervised object detection, two-stage detectors can deliver remarkable performance by only selecting high-quality pseudo labels based on classification scores. However, directly applying this strategy to single-stage detectors would aggravate the class imbalance with fewer positive samples. Thus, single-stage detectors have to consider both quality and quantity of pseudo labels simultaneously. In this paper, we design a dynamic self-adaptive threshold (DSAT) strategy in classification branch, which can automatically select pseudo labels to achieve an optimal trade-off between quality and quantity. Besides, to assess the regression quality of pseudo labels in single-stage detectors, we propose a module to compute the regression uncertainty of boxes based on Non-Maximum Suppression. By leveraging only 10% labeled data from COCO, our method achieves 35.0% AP on anchor-free detector (FCOS) and 32.9% on anchor-based detector (RetinaNet).

preprint2022arXiv

Searching Extra-tidal Features around the Globular Cluster Whiting 1

Whiting 1 is a faint and young globular cluster in the halo of the Milky Way, and was suggested to have originated in the Sagittarius spherical dwarf galaxy (Sgr dSph). In this paper, we use the deep DESI Legacy Imaging Surveys to explore tentative spatial connection between Whiting 1 and the Sgr dSph. We redetermine the fundamental parameters of Whiting 1 and use the best-fitting isochrone (age $τ$=6.5 Gyr, metalicity Z=0.005 and $\rm d_{\odot}$=26.9 kpc) to construct a theoretical matched filter for the extra-tidal features searching. Without any smooth technique to the matched filter density map, we detect a round-shape feature with possible leading and trailing tails on either side of the cluster. This raw image is not totally new compared to old discoveries, but confirms that no more large-scale features can be detected under a depth of r<=22.5 mag. In our results, the whole feature stretches 0.1-0.2 degree along the orbit of Whiting 1, which gives a much larger area than the cluster core. The tails on both sides of the cluster align along the orbital direction of the Sgr dSph as well as the cluster itself, which implies that these debris are probably stripped remnants of Whiting 1 by the Milky Way.

preprint2022arXiv

Sparse-Dyn: Sparse Dynamic Graph Multi-representation Learning via Event-based Sparse Temporal Attention Network

Dynamic graph neural networks have been widely used in modeling and representation learning of graph structure data. Current dynamic representation learning focuses on either discrete learning which results in temporal information loss or continuous learning that involves heavy computation. In this work, we proposed a novel dynamic graph neural network, Sparse-Dyn. It adaptively encodes temporal information into a sequence of patches with an equal amount of temporal-topological structure. Therefore, while avoiding the use of snapshots which causes information loss, it also achieves a finer time granularity, which is close to what continuous networks could provide. In addition, we also designed a lightweight module, Sparse Temporal Transformer, to compute node representations through both structural neighborhoods and temporal dynamics. Since the fully-connected attention conjunction is simplified, the computation cost is far lower than the current state-of-the-arts. Link prediction experiments are conducted on both continuous and discrete graph datasets. Through comparing with several state-of-the-art graph embedding baselines, the experimental results demonstrate that Sparse-Dyn has a faster inference speed while having competitive performance.

preprint2022arXiv

The Eclipsing Binaries from the LAMOST Medium-resolution Survey.III. A High-precision Empirical Stellar Mass Library

High-precision stellar mass and radius measured directly from binaries can effectively calibrate the stellar models. However, such a database containing full spectral types and large range of metallicity is still not fully established. A continuous effort of data collecting and analysis are requested to complete the database. In this work, we provide a catalog containing 184 binaries with independent atmospheric parameters and accurate masses and radii as the benchmark of stellar mass and radius. The catalog contains 56 new detached binaries from LAMOST Medium-resolution spectroscopic (MRS) survey and 128 detached eclipsing binaries compiled from previous studies. We obtain the orbital solutions of the new detached binaries with uncertainties of masses and radii smaller than 5%. These new samples densify the distribution of metallicity of the high-precision stellar mass library and add 9 hot stars with Teff>8000 K. Comparisons show that these samples well agree with the PARSEC isochrones in Teff-logg-mass-radius-luminosity space. We compare mass and radius estimates from isochrone and SED fitting, respectively, with those from the binary orbital solution. We find that the precision of the stellar-model dependent mass estimates is >10% and the precision of the radius estimates based on atmospheric parameters is >15%. These give a general view of the uncertainty of the usual approaches to estimate stellar mass and radius.

preprint2022arXiv

The North/South Asymmetry of the Galaxy: Possible Connection to the Vertical Phase Space Snail

The Galaxy is found to be in disequilibrium based on recent findings of the North/South (N/S) asymmetry and the phase mixing signatures, such as a phase spiral (snail) structure in the vertical phase space ($z-V_{z}$). We show that the N/S asymmetry in a tracer population of dwarfs may be quantitatively modeled with a simple phase snail model superimposed on a smooth equilibrium background. As the phase snail intersects with the $z$ axis, the number density is enhanced, and the velocity dispersion ($σ_{z}$) is decreased relative to the other side of the Galactic plane. Fitting only to the observed asymmetric N/S $σ_{z}$ profiles, we obtain reasonable parameters for the phase space snail and the potential utilized in modeling the background, despite the complex dependence of the model on the potential parameters and the significant selection effects of the data. Both the snail shape and the N/S number density difference given by our best-fit model are consistent with previous observations. The equilibrium background implies a local dark matter density of $0.0151^{+0.0050}_{-0.0051}$ ${\rm M}_{\odot}\,{\rm pc}^{-3}$. The vertical bulk motion of our model is similar to the observation, but with a $\sim$1.2 $\rm km\,s^{-1}$ shift. Our work demonstrates the strong correlation between the phase space snail and the N/S asymmetry. Future observational constraints will facilitate more comprehensive snail models to unravel the Milky Way potential and the perturbation history encoded in the snail feature.

preprint2021arXiv

A Catalog of LAMOST Variable Sources Based on Time-domain Photometry of ZTF

The identification and analysis of different variable sources is a hot issue in astrophysical research. The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) spectroscopic survey has accumulated massive spectral data but contains no information about variable sources. Although a few related studies present variable source catalogs for the LAMOST, the studies still have a few deficiencies regarding the type and number of variable sources identified. In this study, we presented a statistical modeling approach to identify variable source candidates. We first crossed the Kepler, Sloan Digital Sky Survey (SDSS), and Zwicky Transient Facility (ZTF) catalogs to obtain light curves data of variable and non-variable sources. The data are then modeled statistically using commonly used variability parameters, respectively. And then, an optimal variable source identification model is determined using the Receiver Operating Characteristic (ROC) curve and four credible evaluation indices such as precision, accuracy, recall, and F1score. Based on this identification model, a catalog of LAMOST variable sources (including 631,769 variable source candidates with a probability greater than 95% and so on) is obtained. To validate the correctness of the catalog, we performed a two-by-two cross-comparison with the GAIA catalog and other published variable source catalogs. We achieved the correct rate ranging from 50% to 100%. Among the 123,756 sources cross-matched, our variable source catalog identifies 85,669 with a correct rate of 69%, which indicates that the variable source catalog presented in this study is credible.

preprint2021arXiv

Binary fraction of O and B-type stars from LAMOST data

Binary stars plays important role in the evolution of stellar populations . The intrinsic binary fraction ($f_{bin}$) of O and B-type (OB) stars in LAMOST DR5 was investigated in this work. We employed a cross-correlation approach to estimate relative radial velocities for each of the stellar spectra. The algorithm described by \cite{2013A&A...550A.107S} was implemented and several simulations were made to assess the performance of the approach. Binary fraction of the OB stars are estimated through comparing the uni-distribution between observations and simulations with the Kolmogorov-Smirnov tests. Simulations show that it is reliable for stars most of whom have $6,7$ and $8$ repeated observations. The uncertainty of orbital parameters of binarity become larger when observational frequencies decrease. By adopting the fixed power exponents of $π=-0.45$ and $κ=-1$ for period and mass ratio distributions, respectively, we obtain that $f_{bin}=0.4_{-0.06}^{+0.05}$ for the samples with more than 3 observations. When we consider the full samples with at least 2 observations, the binary fraction turns out to be $0.37_{-0.03}^{+0.03}$. These two results are consistent with each other in $1σ$.

preprint2021arXiv

BLOCKEYE: Hunting For DeFi Attacks on Blockchain

Decentralized finance, i.e., DeFi, has become the most popular type of application on many public blockchains (e.g., Ethereum) in recent years. Compared to the traditional finance, DeFi allows customers to flexibly participate in diverse blockchain financial services (e.g., lending, borrowing, collateralizing, exchanging etc.) via smart contracts at a relatively low cost of trust. However, the open nature of DeFi inevitably introduces a large attack surface, which is a severe threat to the security of participants funds. In this paper, we proposed BLOCKEYE, a real-time attack detection system for DeFi projects on the Ethereum blockchain. Key capabilities provided by BLOCKEYE are twofold: (1) Potentially vulnerable DeFi projects are identified based on an automatic security analysis process, which performs symbolic reasoning on the data flow of important service states, e.g., asset price, and checks whether they can be externally manipulated. (2) Then, a transaction monitor is installed offchain for a vulnerable DeFi project. Transactions sent not only to that project but other associated projects as well are collected for further security analysis. A potential attack is flagged if a violation is detected on a critical invariant configured in BLOCKEYE, e.g., Benefit is achieved within a very short time and way much bigger than the cost. We applied BLOCKEYE in several popular DeFi projects and managed to discover potential security attacks that are unreported before. A video of BLOCKEYE is available at https://youtu.be/7DjsWBLdlQU.

preprint2021arXiv

Future stability of the FLRW spacetime for a large class of perfect fluids

We establish the future non-linear stability of Friedmann-Lema\^ıtre-Robertson-Walker (FLRW) solutions to the Einstein-Euler equations of the universe filled with a large class of perfect fluids (the equations of state are allowed to be certain nonlinear or linear types both). Several previous results as specific examples can be covered in the results of this article. We emphasize that the future stability of FLRW metric for polytropic fluids with positive cosmological constant has been a difficult problem and can not be directly generalized from the previous known results. Our result in this article has not only covered this difficult case for the polytropic fluids, but also unified more types of fluids in a same scheme of proofs.

preprint2021arXiv

Generation of entanglement between a highly wave-packet-tunable photon and a spin-wave memory in cold atoms

Controls of waveforms (pulse durations) of single photons are important tasks for effectively interconnecting disparate atomic memories in hybrid quantum networks. So far, the waveform control of single photon that is entangled with an atomic memory remains unexplored. Here, we demonstrated control of waveform length of the photon that is entangled with an atomic spin-wave memory by varying light-atom interaction time in cold atoms. The Bell parameter S as a function of the duration of photon pulse is measured, which shows that violations of Bell equality can be achieved for the photon pulse in the duration range from 40 ns to 50 us, where, S=2.64+/-0.02 and S=2.26+/-0.05 for the 40-ns and 50-μs durations, respectively. The measured results show that S parameter decreases with the increase in the pulse duration. We confirm that the increase in photon noise probability per pulse with the pulse-duration is responsible for the S decrease.

preprint2021arXiv

LAMOST Time-Domain Survey: First Results of four $K$2 plates

From Oct. 2019 to Apr. 2020, LAMOST performs a time-domain spectroscopic survey of four $K$2 plates with both low- and med-resolution observations. The low-resolution spectroscopic survey gains 282 exposures ($\approx$46.6 hours) over 25 nights, yielding a total of about 767,000 spectra, and the med-resolution survey takes 177 exposures ($\approx$49.1 hours) over 27 nights, collecting about 478,000 spectra. More than 70%/50% of low-resolution/med-resolution spectra have signal-to-noise ratio higher than 10. We determine stellar parameters (e.g., $T_{\rm eff}$, log$g$, [Fe/H]) and radial velocity (RV) with different methods, including LASP, DD-Payne, and SLAM. In general, these parameter estimations from different methods show good agreement, and the stellar parameter values are consistent with those of APOGEE. We use the $Gaia$ DR2 RV data to calculate a median RV zero point (RVZP) for each spectrograph exposure by exposure, and the RVZP-corrected RVs agree well with the APOGEE data. The stellar evolutionary and spectroscopic masses are estimated based on the stellar parameters, multi-band magnitudes, distances and extinction values. Finally, we construct a binary catalog including about 2700 candidates by analyzing their light curves, fitting the RV data, calculating the binarity parameters from med-resolution spectra, and cross-matching the spatially resolved binary catalog from $Gaia$ EDR3. The LAMOST TD survey is expected to get breakthrough in various scientific topics, such as binary system, stellar activity, and stellar pulsation, etc.

preprint2021arXiv

Noise suppression in a temporal-multimode quantum memory entangled with a photon via asymmetrical photon-collection channel

Quantum interfaces (QIs) that generate entanglement between a multimode atomic memory and a photon forms a multiplexed repeater node and hold promise to greatly improve quantum repeater rates. Recently, the temporal multimode spin-wave memory that is entangled with a photon has been demonstrated with cold atoms. However, due to additional noise generated in multimode operation, the fidelity of spin-wave-photon entanglement significantly decreases with the mode number. So far, the improvement on temporal-multimode entanglement fidelity via suppressing the additional noise remains unexplored. Here, we propose and experimentally demonstrate a scheme that can suppress the additional noise of a temporally-multiplexed QI. The scheme uses an asymmetric channel to collect the photons coming and retrieving from the temporally-multiplexed QI. For making comparisons, we also set up a QI that uses symmetric channel for the photon collections. When the QIs store 14 modes, the measured Bell parameter S for the QIs using the asymmetric and the symmetric photon-collection channels are 2.36+/-0.03 and 2.24+/-0.04, respectively, showing that the QI using the asymmetric channel gives rise to a 3% increase in entanglement fidelity, i.e., a 1.7-fold decrease in the additional noise, compared with the QI using the symmetric one. On the other hand, the 14-mode entanglement QIs that use the asymmetric and symmetric collections preserve the violation of a Bell inequality for storage times up to 25 us and 20 us, respectively, showing that the asymmetric QI has a higher entanglement storage performance.

preprint2021arXiv

The Binarity of Early-type Stars from LAMOST Medium-resolution Spectroscopic Survey

Massive binaries play significant roles in many fields. Identification of massive stars, particularly massive binaries, is of great importance. In this paper, by adopting the technique of measuring the equivalent widths of several spectral lines, we identified 9,382 early-type stars from LAMOST medium-resolution survey and divided the sample into four groups, T1 ($\sim$O-B4), T2 ($\sim$B5), T3 ($\sim$B7), and T4 ($\sim$B8-A). The relative radial velocities $RV_{\rm rel}$ were calculated using the Maximum Likelihood Estimation. The stars with significant changes of $RV_{\rm rel}$ and at least larger than 15.57km s$^{-1}$ were identified as spectroscopic binaries. We found that the observed spectroscopic binary fractions for the four groups are $24.6\%\pm0.5\%$, $20.8\%\pm0.6\%$, $13.7\%\pm0.3\%$, and $7.4\%\pm0.3\%$, respectively. Assuming that orbital period ($P$) and mass ratio ($q$) have intrinsic distributions as $f(P) \propto P^π$ (1\textless$P$\textless1000 days) and $f(q) \propto q^κ$ (0.1\textless$q$\textless1), respectively, we conducted a series of Monte-Carlo simulations to correct observational biases for estimating the intrinsic multiplicity properties. The results show that the intrinsic binary fractions for the four groups are 68$\%\pm8\%$, 52$\%\pm3\%$, 44$\%\pm6\%$, and 44$\%\pm6\%$, respectively. The best estimated values for $π$ are -1$\pm0.1$, -1.1$\pm0.05$, -1.1$\pm0.1$, and -0.6$\pm0.05$, respectively. The $κ$ cannot be constrained for groups T1 and T2 and is -2.4$\pm0.3$ for group T3 and -1.6$\pm0.3$ for group T4. We confirmed the relationship of a decreasing trend in binary fractions towards late-type stars. No correlation between the spectral type and the orbital period distribution has been found yet, possibly due to the limitation of observational cadence.

preprint2021arXiv

The mass of the Milky Way out to 100 kpc using halo stars

We use a distribution function analysis to estimate the mass of the Milky Way out to 100 kpc using a large sample of halo stars. These stars are compiled from the literature, and the vast majority (~98%) have 6D phase-space information. We pay particular attention to systematic effects, such as the dynamical influence of the Large Magellanic Cloud (LMC), and the effect of unrelaxed substructure. The LMC biases the (pre-LMC infall) halo mass estimates towards higher values, while realistic stellar halos from cosmological simulations tend to underestimate the true halo mass. After applying our method to the Milky Way data we find a mass within 100 kpc of M(< 100 kpc) = 6.07 +/- 0.29 (stat.) +/- 1.21 (sys.) x 10^11 M_Sun. For this estimate, we have approximately corrected for the reflex motion induced by the LMC using the Erkal et al. model, which assumes a rigid potential for the LMC and MW. Furthermore, stars that likely belong to the Sagittarius stream are removed, and we include a 5% systematic bias, and a 20% systematic uncertainty based on our tests with cosmological simulations. Assuming the mass-concentration relation for Navarro-Frenk-White haloes, our mass estimate favours a total (pre-LMC infall) Milky Way mass of M_200c = 1.01 +/- 0.24 x 10^12 M_Sun, or (post-LMC infall) mass of M_200c = 1.16 +/- 0.24 x 10^12 M_Sun when a 1.5 x 10^11 M_Sun mass of a rigid LMC is included.

preprint2021arXiv

The Spectroscopic Binaries from LAMOST Medium-Resolution Survey (MRS). I. Searching for Double-lined Spectroscopic Binaries (SB2s) with Convolutional Neural Network

We developed a convolutional neural network (CNN) model to distinguish the double-lined spectroscopic binaries (SB2s) from others based on single exposure medium-resolution spectra ($R\sim 7,500$). The training set consists of a large set of mock spectra of single stars and binaries synthesized based on the MIST stellar evolutionary model and ATLAS9 atmospheric model. Our model reaches a novel theoretic false positive rate by adding a proper penalty on the negative sample (e.g., 0.12\% and 0.16\% for the blue/red arm when the penalty parameter $Λ=16$). Tests show that the performance is as expected and favors FGK-type Main-sequence binaries with high mass ratio ($q \geq 0.7$) and large radial velocity separation ($Δv \geq 50\,\mathrm{km\,s^{-1}}$). Although the real false positive rate can not be estimated reliably, validating on eclipsing binaries identified from Kepler light curves indicates that our model predicts low binary probabilities at eclipsing phases (0, 0.5, and 1.0) as expected. The color-magnitude diagram also helps illustrate its feasibility and capability of identifying FGK MS binaries from spectra. We conclude that this model is reasonably reliable and can provide an automatic approach to identify SB2s with period $\lesssim 10$ days. This work yields a catalog of binary probabilities for over 5 million spectra of 1 million sources from the LAMOST medium-resolution survey (MRS), and a catalog of 2198 SB2 candidates whose physical properties will be analyzed in our following-up paper. Data products are made publicly available at the journal as well as our Github website.

preprint2021arXiv

Two-dimensional charge density wave TaX$_2$ (X=S, Se, Te) from first principles

Transition metal dichalcogenides are rich in their structural phases, e.g. 1T-TaS2 and 1T-TaSe2 form charge density wave (CDW) under low temperature with interesting and exotic properties. Here, we present a systematic study of different structures in two-dimensional TaX2 (X=S, Se, Te) using density functional theory calculations with consideration of van der Waals interaction. All the normal phases present metal characteristics with various ground state and magnetic properties. The lattice reconstruction of CDW drastically affects the electronic and structural characteristics of 1T-TaS2 and 1T-TaSe2, leading to a transition from metal to insulator and an emergence of magnetic moment within periodic atomic clusters called the Star of David. The evaluated Heisenberg couplings indicate the weak ferromagnetic coupling between the clusters in monolayer. Furthermore, in bilayer commensurate CDW cases, we find intriguing phenomenon of the varying magnetic properties with different stacking orders. The magnetic moment in each layer disappears when two layers are coupled, but may sustain in certain stackings of interlayer antiferromagnetic configurations.

preprint2020arXiv

A Convolutional Neural Network-Based Low Complexity Filter

Convolutional Neural Network (CNN)-based filters have achieved significant performance in video artifacts reduction. However, the high complexity of existing methods makes it difficult to be applied in real usage. In this paper, a CNN-based low complexity filter is proposed. We utilize depth separable convolution (DSC) merged with the batch normalization (BN) as the backbone of our proposed CNN-based network. Besides, a weight initialization method is proposed to enhance the training performance. To solve the well known over smoothing problem for the inter frames, a frame-level residual mapping (RM) is presented. We analyze some of the mainstream methods like frame-level and block-level based filters quantitatively and build our CNN-based filter with frame-level control to avoid the extra complexity and artificial boundaries caused by block-level control. In addition, a novel module called RM is designed to restore the distortion from the learned residuals. As a result, we can effectively improve the generalization ability of the learning-based filter and reach an adaptive filtering effect. Moreover, this module is flexible and can be combined with other learning-based filters. The experimental results show that our proposed method achieves significant BD-rate reduction than H.265/HEVC. It achieves about 1.2% BD-rate reduction and 79.1% decrease in FLOPs than VR-CNN. Finally, the measurement on H.266/VVC and ablation studies are also conducted to ensure the effectiveness of the proposed method.

preprint2020arXiv

Anisotropy of the Milky Way&#39;s stellar halo using K giants from LAMOST and $Gaia$

The anisotropy parameter $β$ characterizes the extent to which orbits in stellar systems are predominantly radial or tangential, and is likely to constrain, for the stellar halo of the Milky Way, scenarios for its formation and evolution. We have measured the anisotropy $β$ as a function of Galactocentric radius from $5-100$ kpc for over 8600 metal poor ([Fe/H] $<-1.3$) halo K giants from the LAMOST catalog with line-of-sight velocities and distances, matched to proper motions from the second $Gaia$ data release. We construct full 6-D positions and velocities for the K giants to directly measure the 3 components of the velocity dispersion $(σ_r, σ_θ, σ_ϕ)$ (in spherical coordinates). We find that the orbits in the halo are radial over our full Galactocentric distance range reaching over 100 kpc. The anisotropy remains remarkably unchanged with Galactocentric radius from approximately 5 to 25 kpc, with an amplitude that depends on the metallicity of the stars, dropping from $β\approx 0.9$ for $-1.8 \leq$ [Fe/H] $< -1.3$ (for the bulk of the stars) to $β\approx 0.6$ for the lowest metallicities ([Fe/H] $< -1.8$). Considering our sample as a whole, $β\approx0.8$ and, beyond 25 kpc, the orbits gradually become less radial and anisotropy decreases to $β<0.3$ past 100 kpc. Within 8 kpc, $β<0.8$. The measurement of anisotropy is affected by substructure and streams, particularly beyond a Galactocentric distance of approximately 25 kpc, where the Sagittarius stream is prominent in the data. These results are complimentary to recent analysis of simulations by Loebman et al. and of SDSS/$Gaia$ DR1 data by Belokurov et al.

preprint2020arXiv

Automatic Lumbar Spinal CT Image Segmentation with a Dual Densely Connected U-Net

The clinical treatment of degenerative and developmental lumbar spinal stenosis (LSS) is different. Computed tomography (CT) is helpful in distinguishing degenerative and developmental LSS due to its advantage in imaging of osseous and calcified tissues. However, boundaries of the vertebral body, spinal canal and dural sac have low contrast and hard to identify in a CT image, so the diagnosis depends heavily on the knowledge of expert surgeons and radiologists. In this paper, we develop an automatic lumbar spinal CT image segmentation method to assist LSS diagnosis. The main contributions of this paper are the following: 1) a new lumbar spinal CT image dataset is constructed that contains 2393 axial CT images collected from 279 patients, with the ground truth of pixel-level segmentation labels; 2) a dual densely connected U-shaped neural network (DDU-Net) is used to segment the spinal canal, dural sac and vertebral body in an end-to-end manner; 3) DDU-Net is capable of segmenting tissues with large scale-variant, inconspicuous edges (e.g., spinal canal) and extremely small size (e.g., dural sac); and 4) DDU-Net is practical, requiring no image preprocessing such as contrast enhancement, registration and denoising, and the running time reaches 12 FPS. In the experiment, we achieve state-of-the-art performance on the lumbar spinal image segmentation task. We expect that the technique will increase both radiology workflow efficiency and the perceived value of radiology reports for referring clinicians and patients.

preprint2020arXiv

Characterising the Performance of High-Speed Data Converters for RFSoC-based Radio Astronomy Receivers

RF system-on-chip (RFSoC) devices provide the potential for implementing a complete radio astronomy receiver on a single board, but performance of the integrated analogue-to-digital converters is critical. We have evaluated the performance of the data converters in the Xilinx ZU28DR RFSoC, which are 12-bit, 8-fold interleaved converters with a maximum sample speed of 4.096 Giga-sample per second (GSPS). We measured the spurious-free dynamic range (SFDR), signal-to-noise and distortion (SINAD), effective number of bits (ENOB), intermodulation distortion (IMD) and cross-talk between adjacent channels over the bandwidth of 2.048 GHz. We both captured data for off-line analysis with floating-point arithmetic, and implemented a real-time integer arithmetic spectrometer on the RFSoC. The performance of the ADCs is sufficient for radio astronomy applications and close to the vendor specifications in most of the scenarios. We have carried out spectral integrations of up to 100 s and stability tests over tens of hours and find thermal noise-limited performance over these timescales.

preprint2020arXiv

Differential rotation of the halo traced by the K-giant stars

We use K-giant stars selected from the LAMOST DR5 to study the variation of the rotational velocity of the galactic halo at different space positions. Modelling the rotational velocity distribution with both the halo and disk components, we find that the rotational velocity of the halo population decreases almost linearly with increasing vertical distance to the galactic disk plane, $Z$, at fixed galactocentric radius, $R$. The samples are separated into two parts with $6<R<12$ kpc and $12<R<20$ kpc. We derive that the decreasing rates along $Z$ for the two subsamples are $-3.07\pm0.63$ and $-1.89\pm0.37$ km s$^{-1}$ kpc$^{-1}$, respectively. Compared with the TNG simulations, we suggest that this trend is probably caused by the interaction between the disk and halo. The results from the simulations show that only the oblate halo can provide a decreasing rotational velocity with an increasing $Z$. This indicates that the Galactic halo is oblate with galactocentric radius $R<20$ kpc. On the other hand, the flaring of the disk component (mainly the thick disk) is clearly traced by this study, with $R$ between 12 and 20 kpc, the disk can vertically extend to $6\sim10$ kpc above the disk plane. What is more interesting is that, we find the Gaia-Enceladus-Sausage (GES) component has a significant contribution only in the halo with $R<12$ kpc, i.e. a fraction of 23$-$47\%. While in the outer subsample, the contribution is too low to be well constrained.

preprint2020arXiv

Discovery of two nearby post-T Tauri stellar associations

In this work we report the discovery of 2 new stellar associations in close vicinity of the Sun at roughly 180 and 150 pc. These two associations, named as u Tau assoc and e Tau assoc, were detected based on their clustering in a multi-dimensional parameter space including $α$, $δ$, $μ_α$ , $μ_δ$ and $π$ of Gaia. The fitting of pre-main-sequence model isochrones in their color-magnitude diagrams suggests that the two associations are of about 50 Myr old and the group members lower than ${\sim}$0.8 $M_{\odot}$ are at the stage of post-T Tauri.

preprint2020arXiv

Hyperfine Structure and Coherent Dynamics of Rare Earth Spins Explored with Electron-Nuclear Double Resonance at Sub-Kelvin Temperatures

An experimental platform of ultralow-temperature pulsed ENDOR (electron-nuclear double resonance) spectroscopy is constructed for the bulk materials. Coherent property of the coupled electron and nuclear spins of the rare-earth (RE) dopants in a crystal (143Nd3+:Y2SiO5) is investigated from 100 mK to 6 K. At the lowest working temperatures, two-pulse-echo coherence time exceeding 2 ms and 40 ms are achieved for the electron and nuclear spins, while the electronic Zeeman and hyperfine population lifetimes are more than 15 s and 10 min. With the aid of the near-unity electron spin polarization at 100 mK, the complete hyperfine level structure with 16 energy levels is measured using ENDOR technique without the assistance of the reconstructed spin Hamiltonian. These results demonstrate the suitability of the deeply cooled paramagnetic RE-doped solids for memory components aimed for quantum communication and quantum computation. The developed experimental platform is expected to be a powerful tool for paramagnetic materials from various research fields.

preprint2020arXiv

Interpretable Machine Learning Model for Early Prediction of Mortality in Elderly Patients with Multiple Organ Dysfunction Syndrome (MODS): a Multicenter Retrospective Study and Cross Validation

Background: Elderly patients with MODS have high risk of death and poor prognosis. The performance of current scoring systems assessing the severity of MODS and its mortality remains unsatisfactory. This study aims to develop an interpretable and generalizable model for early mortality prediction in elderly patients with MODS. Methods: The MIMIC-III, eICU-CRD and PLAGH-S databases were employed for model generation and evaluation. We used the eXtreme Gradient Boosting model with the SHapley Additive exPlanations method to conduct early and interpretable predictions of patients&#39; hospital outcome. Three types of data source combinations and five typical evaluation indexes were adopted to develop a generalizable model. Findings: The interpretable model, with optimal performance developed by using MIMIC-III and eICU-CRD datasets, was separately validated in MIMIC-III, eICU-CRD and PLAGH-S datasets (no overlapping with training set). The performances of the model in predicting hospital mortality as validated by the three datasets were: AUC of 0.858, sensitivity of 0.834 and specificity of 0.705; AUC of 0.849, sensitivity of 0.763 and specificity of 0.784; and AUC of 0.838, sensitivity of 0.882 and specificity of 0.691, respectively. Comparisons of AUC between this model and baseline models with MIMIC-III dataset validation showed superior performances of this model; In addition, comparisons in AUC between this model and commonly used clinical scores showed significantly better performance of this model. Interpretation: The interpretable machine learning model developed in this study using fused datasets with large sample sizes was robust and generalizable. This model outperformed the baseline models and several clinical scores for early prediction of mortality in elderly ICU patients. The interpretative nature of this model provided clinicians with the ranking of mortality risk features.

preprint2020arXiv

LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan

Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.

preprint2020arXiv

Measuring the local dark matter density with LAMOST DR5 and Gaia DR2

We apply the vertical Jeans equation to the kinematics of Milky Way stars in the solar neighbourhood to measure the local dark matter density. More than 90,000 G- and K-type dwarf stars are selected from the cross-matched sample of LAMOST DR5 and Gaia DR2 for our analyses. The mass models applied consist of a single exponential stellar disc, a razor thin gas disc and a constant dark matter density. We first consider the simplified vertical Jeans equation which ignores the tilt term and assumes a flat rotation curve. Under a Gaussian prior on the total stellar surface density, the local dark matter density inferred from Markov Chain Monte Carlo simulations is $0.0133_{-0.0022}^{+0.0024}\ {\rm M}_{\odot}\,{\rm pc}^{-3}$. The local dark matter densities for subsamples in an azimuthal angle range of $-10^{\circ} < ϕ< 5^{\circ}$ are consistent within their 1$σ$ errors. However, the northern and southern subsamples show a large discrepancy due to plateaux in the northern and southern vertical velocity dispersion profiles. These plateaux may be the cause of the different estimates of the dark matter density between the north and south. Taking the tilt term into account has little effect on the parameter estimations and does not explain the north and south asymmetry. Taking half of the difference of $σ_{z}$ profiles as unknown systematic errors, we then obtain consistent measurements for the northern and southern subsamples. We discuss the influence of the vertical data range, the scale height of the tracer population, the vertical distribution of stars and the sample size on the uncertainty of the determination of the local dark matter density.

preprint2020arXiv

On the Chemical and Kinematic Consistency Between N-rich Metal-poor Field Stars and Enriched Populations in Globular Clusters

Interesting chemically peculiar field stars may reflect their stellar evolution history and their possible origin in a different environment from where they are found now, which is one of the most important research fields in Galactic archaeology. To explore this further, we have used the CN-CH bands around 4000 A to identify N-rich metal-poor field stars in LAMOST DR3. Here we expand our N-rich metal-poor field star sample to ~100 stars in LAMOST DR5, where 53 of them are newly found in this work. We investigate light elements of the common stars between our sample and APOGEE DR14. While Mg, Al, and Si abundances generally agree with the hypothesis that N-rich metal-poor field stars come from enriched populations in globular clusters, it is still inconclusive for C, N, and O. After integrating the orbits of our N-rich field stars and a control sample of normal metal-poor field stars, we find that N-rich field stars have different orbital parameter distributions compared to the control sample, specifically, apocentric distances, maximum vertical amplitude (Zmax), orbital energy, and z direction angular momentum (Lz). The orbital parameters of N-rich field stars indicate that most of them are inner-halo stars. The kinematics of N-rich field stars support their possible GC origin. The spatial and velocity distributions of our bona fide N-rich field star sample are important observational evidence to constrain simulations of the origin of these interesting objects.

preprint2020arXiv

On-demand quantum storage of photonic qubits in an on-chip waveguide

Photonic quantum memory is the core element in quantum information processing (QIP). For the scalable and convenient practical applications, great efforts have been devoted to the integrated quantum memory based on various waveguides fabricated in solids. However, on-demand storage of qubits, which is an essential requirement for QIP, is still challenging to be implemented using such integrated quantum memory. Here we report the on-demand storage of time-bin qubits in an on-chip waveguide memory on the surface of a $^{151}$Eu$^{3+}$:Y$_2$SiO$_5$ crystal, utilizing the Stark modulated atomic frequency comb protocol. A qubit storage fidelity of $99.3\%\pm0.2\%$ is obtained with a input of 0.5 photons per pulse, far beyond the highest fidelity achievable using the classical measure-and-prepare strategy. The developed integrated quantum memory with the on-demand retrieval capability, represents an important step towards practical applications of integrated quantum nodes in quantum networks.

preprint2020arXiv

Possible evidence of hydrogen emission in the first-overtone and multi-mode RR Lyrae variables

The nature of shock waves in non-fundamental mode RR Lyrae stars remains a mystery because of limited spectroscopic observations. We apply a pattern recognition algorithm on spectroscopic data from SDSS and LAMOST and report the first evidence of hydrogen emission in first-overtone and multi-mode RR Lyrae stars showing the &#34;first apparition&#34;, which is the most prominent observational characteristic of shock in RR Lyrae variables. We find ten RRc stars in SDSS, ten RRc stars in LAMOST, and three RRd stars in LAMOST that show blueshifted Balmer emissions. The emission features possibly indicate the existence of shock waves. We calculate the radial velocities of the emission lines, which are related to the physical conditions occurring in the radiative zone of shock waves. Using photometric observations from ZTF, we present a detailed light curve analysis for the frequency components in one of our RRd stars with hydrogen emission, RRdl3, for possible modulations. With the enormous volume of upcoming spectral observations of variable stars, our study raises the possibility of connecting the unexplained Blazhko effect to shock waves in non-fundamental mode RR Lyrae stars.

preprint2020arXiv

Reliable coherent optical memory based on a laser-written waveguide

$\mathrm {^{151}Eu^{3+}}$-doped yttrium silicate ($\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ ) crystal is a unique material that possesses hyperfine states with coherence time up to 6 h. Many efforts have been devoted to the development of this material as optical quantum memories based on the bulk crystals, but integrable structures (such as optical waveguides) that can promote $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$-based quantum memories to practical applications, have not been demonstrated so far. Here we report the fabrication of type 2 waveguides in a $\mathrm {^{151}Eu^{3+}:Y_2SiO_5}$ crystal using femtosecond-laser micromachining. The resulting waveguides are compatible with single-mode fibers and have the smallest insertion loss of $4.95\ dB$. On-demand light storage is demonstrated in a waveguide by employing the spin-wave atomic frequency comb (AFC) scheme and the revival of silenced echo (ROSE) scheme. We implement a series of interference experiments based on these two schemes to characterize the storage fidelity. Interference visibility of the readout pulse is $0.99\pm 0.03$ for the spin-wave AFC scheme and $0.97\pm 0.02$ for the ROSE scheme, demonstrating the reliability of the integrated optical memory.

preprint2020arXiv

Reverse-engineering Bar Charts Using Neural Networks

Reverse-engineering bar charts extracts textual and numeric information from the visual representations of bar charts to support application scenarios that require the underlying information. In this paper, we propose a neural network-based method for reverse-engineering bar charts. We adopt a neural network-based object detection model to simultaneously localize and classify textual information. This approach improves the efficiency of textual information extraction. We design an encoder-decoder framework that integrates convolutional and recurrent neural networks to extract numeric information. We further introduce an attention mechanism into the framework to achieve high accuracy and robustness. Synthetic and real-world datasets are used to evaluate the effectiveness of the method. To the best of our knowledge, this work takes the lead in constructing a complete neural network-based method of reverse-engineering bar charts.

preprint2020arXiv

The extended Gaia-PS1-SDSS (GPS1+) proper motion catalog

The GPS1 catalog was released in 2017. It delivered precise proper motions for around 350 million sources across three-fourths of the sky down to a magnitude of $r\sim20$\,mag. In this study, we present GPS1+ the extension GPS1 catalog down to $r\sim22.5$\,mag, based on {\it Gaia} DR2, PS1, SDSS and 2MASS astrometry. The GPS1+ totally provides proper motions for $\sim$400 million sources with a characteristic systematic error of less than 0.1\masyr. This catalog is divided into two sub-samples, i.e., the primary and secondary parts. The primary $\sim$264 million sources have either or both of the {\it Gaia} and SDSS astrometry, with a typical precision of 2.0-5.0 \masyr. In this part, $\sim$160 million sources have {\it Gaia} proper motions, we provide another new proper motion for each of them by building a Bayesian model. Relative to {\it Gaia}&#39;s values, the precision is improved by $\sim$0.1\,dex on average at the faint end; $\sim$50 million sources are the objects whose proper motions are missing in {\it Gaia} DR2, we provide their proper motion with a precision of $\sim$4.5\masyr; the remaining $\sim$54 million faint sources are beyond {\it Gaia} detecting capability, we provide their proper motions for the first time with a precision of 7.0 \masyr. However, the secondary $\sim$136 million sources only have PS1 astrometry, the average precision is worse than 15.0 \masyr. All the proper motions have been validated using QSOs and the existing {\it Gaia} proper motions. The catalog will be released on-line and available via the VO-TAP Service, or via the National Astronomical Data Center serviced by China-VO: https://nadc.china-vo.org/data/data/gps1p/f.

preprint2020arXiv

Three New Late-type Hypervelocity Star Candidates from Gaia DR2 with Refined Selection Criteria

Several dozen hypervelocity star (HVS) candidates have been reported based on the second data release of Gaia (Gaia DR2). However, it has been proven that the radial velocities of some Gaia HVS candidates are not reliable. In this paper, we employ refined astrometric criteria to re-examine Gaia DR2, arriving at a more reliable sample of HVS and high velocity star candidates than those found by previous authors.We develop a method called Binary Escape Probability Analysis to identify some HVS candidates. This method allows us to work with stars having only two epochs of measured radial velocity. These stars were usually discarded in previous similar studies. A scrutiny of our final results sheds light on selection effects present in our studies, which we propose to be the focus of future studies. In total, we find three late-type (2 G-type and 1 K-type) HVS and 21 high velocity star candidates, 3 and 11 of which are new, respectively. Judging by their historical trajectories, which we calculate, all three HVS candidates could not have had Galactic center origins. Further monitoring is required to confirm their status.

preprint2020arXiv

Understanding the velocity distribution of the Galactic Bulge with APOGEE and Gaia

We revisit the stellar velocity distribution in the Galactic bulge/bar region with APOGEE DR16 and {\it Gaia} DR2, focusing in particular on the possible high-velocity (HV) peaks and their physical origin. We fit the velocity distributions with two different models, namely with Gauss-Hermite polynomial and Gaussian mixture model (GMM). The result of the fit using Gauss-Hermite polynomials reveals a positive correlation between the mean velocity ($\bar{V}$) and the &#34;skewness&#34; ($h_{3}$) of the velocity distribution, possibly caused by the Galactic bar. The $n=2$ GMM fitting reveals a symmetric longitudinal trend of $|μ_{2}|$ and $σ_{2}$ (the mean velocity and the standard deviation of the secondary component), which is inconsistent to the $x_{2}$ orbital family predictions. Cold secondary peaks could be seen at $|l|\sim6^\circ$. However, with the additional tangential information from {\it Gaia}, we find that the HV stars in the bulge show similar patterns in the radial-tangential velocity distribution ($V_{\rm R}-V_{\rm T}$), regardless of the existence of a distinct cold HV peak. The observed $V_{\rm R}-V_{\rm T}$ (or $V_{\rm GSR}-μ_{l}$) distributions are consistent with the predictions of a simple MW bar model. The chemical abundances and ages inferred from ASPCAP and CANNON suggest that the HV stars in the bulge/bar are generally as old as, if not older than, the other stars in the bulge/bar region.

preprint2019arXiv

Deriving the stellar labels of LAMOST spectra with Stellar LAbel Machine (SLAM)

The LAMOST survey has provided 9 million spectra in its Data Release 5 (DR5) at R$\sim$1800. Extracting precise stellar labels is crucial for such a large sample. In this paper, we report the implementation of the Stellar LAbel Machine (SLAM), which is a data-driven method based on Support Vector Regression (SVR), a robust non-linear regression technique. Thanks to the capability to model highly non-linear problems with SVR, SLAM generally can derive stellar labels over a wide range of spectral types. This gives it a unique capability compared to other popular data-driven methods. To illustrate this capability, we test the performance of SLAM on stars ranging from Teff$\sim$4000 to $\sim$8000 K trained on LAMOST spectra and stellar labels. At g-band signal-to-noise ratio (SNRg) higher than 100, the random uncertainties of Teff, logg and [Fe/H] are 50 K, 0.09 dex, and 0.07 dex, respectively. We then set up another SLAM model trained by APOGEE and LAMOST common stars to demonstrate its capability of dealing with high dimensional problems. The spectra are from LAMOST DR5 and the stellar labels of the training set are from APOGEE DR15, including Teff, logg, [M/H],[$α$/M], [C/M], and [N/M]. The cross-validated scatters at SNRg$\sim$100 are 49 K, 0.10 dex, 0.037 dex,0.026 dex, 0.058 dex, and 0.106 dex for these parameters, respectively. This performance is at the same level as other up-to-date data-driven models. As a byproduct, we also provide the latest catalog of $\sim$1 million LAMOST DR5 K giant stars with SLAM-predicted stellar labels in this work.

preprint2019arXiv

Exploring the spectral \textit{information content} in the LAMOST medium-resolution survey (MRS)

Low-resolution spectra are proved competitive to high-resolution spectra in determining many stellar labels at comparable precision. It is useful to consider the spectral information content when assessing the capability of a stellar spectrum in deriving precise stellar labels. In this work, we quantify the information content brought by the LAMOST-II medium-resolution spectroscopic survey (MRS) using the gradient spectra and the coefficients-of-dependence (CODs). In general, the wavelength coverage of the MRS well constrains the stellar labels but the sensitivities of different stellar labels vary with spectral types and metallicity of the stars of interest and, therefore, affect the performance of the stellar label determination from the MRS spectra. Applying the SLAM to the synthetic spectra which mimic the MRS data, we find the precision of the fundamental stellar parameters Teff, logg and [M/H] are better when combining both the blue and red bands of the MRS. This is especially important for warm stars since the H$α$ line located in the red part plays a more important role in determining the effective temperature for warm stars. With blue and red parts together, we are able to reach similar performance to the low-resolution spectra except for warm stars. However, at [M/H]$\sim-2.0$ dex, the uncertainties of fundamental stellar labels estimated from MRS are substantially larger than those from low-resolution spectra. We also tested the uncertainties of Teff, logg and [M/H] of from MRS data induced from the radial velocity mismatch and find that a mismatch of about 1 km s$^{-1}$, which is typical for LAMOST MRS data, would not significantly affect the stellar label estimates. At last, reference precision limits are calculated using synthetic gradient spectra, according to which we expect abundances of at least 17 elements to be measured precisely from MRS spectra.

preprint2019arXiv

On half-factoriality of transfer Krull monoids

Let $H$ be a transfer Krull monoid over a subset $G_0$ of an abelian group $G$ with finite exponent. Then every non-unit $a\in H$ can be written as a finite product of atoms, say $a=u_1 \cdot \ldots \cdot u_k$. The set $\mathsf L(a)$ of all possible factorization lengths $k$ is called the set of lengths of $a$, and $H$ is said to be half-factorial if $|\mathsf L(a)|=1$ for all $a\in H$. We show that, if $a \in H$ and $|\mathsf L(a^{\lfloor (3\exp(G) - 3)/2 \rfloor})| = 1$, then the smallest divisor-closed submonoid of $H$ containing $a$ is half-factorial. In addition, we prove that, if $G_0$ is finite and $|\mathsf L(\prod_{g\in G_0}g^{2\mathsf{ord}(g)})|=1$, then $H$ is half-factorial.

preprint2019arXiv

Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars

We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.