Researcher profile

Yuan-Sen Ting

Yuan-Sen Ting contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
46works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

46 published item(s)

preprint2026arXiv

Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation

Observational astronomy relies on visual feature identification to detect critical astrophysical phenomena. While machine learning (ML) increasingly automates this process, models often struggle with generalization in large-scale surveys due to the limited representativeness of labeled datasets, whether from simulations or human annotation, a challenge pronounced for rare yet scientifically valuable objects. To address this, we propose a conditional diffusion model to synthesize realistic galaxy images for augmenting ML training data (hereafter GalaxySD). Leveraging the Galaxy Zoo 2 dataset which contains visual feature, galaxy image pairs from volunteer annotation, we demonstrate that GalaxySD generates diverse, high-fidelity galaxy images that closely adhere to the specified morphological feature conditions. Moreover, this model enables generative extrapolation to project well-annotated data into unseen domains and advancing rare object detection. Integrating synthesized images into ML pipelines improves performance in standard morphology classification, boosting completeness and purity by up to 30% across key metrics. For rare object detection, using early-type galaxies with prominent dust lane features (~0.1% in GZ2 dataset) as a test case, our approach doubled the number of detected instances, from 352 to 872, compared to previous studies based on visual inspection. This study highlights the power of generative models to bridge gaps between scarce labeled data and the vast, uncharted parameter space of observational astronomy and sheds insight for future astrophysical foundation model developments. Our project homepage is available at https://galaxysd-webpage.streamlit.app/.

preprint2025arXiv

Millions of Main-Sequence Binary Stars from Gaia BP/RP Spectra

We present the main-sequence binary (MSMS) Catalog derived from Gaia Data Release 3 BP/RP (XP) spectra. Leveraging the vast sample of low-resolution Gaia XP spectra, we develop a forward modeling approach that maps stellar mass and photometric metallicity to XP spectra using a neural network. Our methodology identifies binary systems through statistical comparison of single- and binary-star model fits, enabling detection of binaries with mass ratios between 0.4 and 1.0 and flux ratios larger than 0.1. From an initial sample of 35 million stars within 1 kpc, we identify 14 million binary candidates and define a high-confidence "golden sample" of 1 million binary systems. This large, homogeneous sample enables detailed statistical analysis of binary properties across diverse Galactic environments, providing new insights into binary star formation and evolution. In addition, the $χ^2$ comparison allows us to distinguish stars with luminous companions from single stars or binaries with dark companions, such as white dwarfs, neutron stars and black hole candidates, improving our understanding of compact object populations.

preprint2024arXiv

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

We explore the potential of enhancing LLM performance in astronomy-focused question-answering through targeted, continual pre-training. By employing a compact 7B-parameter LLaMA-2 model and focusing exclusively on a curated set of astronomy corpora -- comprising abstracts, introductions, and conclusions -- we achieve notable improvements in specialized topic comprehension. While general LLMs like GPT-4 excel in broader question-answering scenarios due to superior reasoning capabilities, our findings suggest that continual pre-training with limited resources can still enhance model performance on specialized topics. Additionally, we present an extension of AstroLLaMA: the fine-tuning of the 7B LLaMA model on a domain-specific conversational dataset, culminating in the release of the chat-enabled AstroLLaMA for community use. Comprehensive quantitative benchmarking is currently in progress and will be detailed in an upcoming full paper. The model, AstroLLaMA-Chat, is now available at https://huggingface.co/universeTBD, providing the first open-source conversational AI tool tailored for the astronomy community.

preprint2022arXiv

A Panchromatic Study of Massive Stars in the Extremely Metal-Poor Local Group Dwarf Galaxy Leo A

We characterize massive stars (M>8 M_sun) in the nearby (D~0.8 Mpc) extremely metal-poor (Z~5% Z_sun) galaxy Leo A using Hubble Space Telescope ultra-violet (UV), optical, and near-infrared (NIR) imaging along with Keck/LRIS and MMT/Binospec optical spectroscopy for 18 main sequence OB stars. We find that: (a) 12 of our 18 stars show emission lines, despite not being associated with an H II region, suggestive of stellar activity (e.g., mass loss, accretion, binary star interaction), which is consistent with previous predictions of enhanced activity at low metallicity; (b) 6 are Be stars, which are the first to be spectroscopically studied at such low metallicity -- these Be stars have unusual panchromatic SEDs; (c) for stars well-fit by the TLUSTY non-local thermodynamic equilibrium (non-LTE) models, the photometric and spectroscopic values of T_eff and log(g) agree to within ~0.01 dex and ~0.18 dex, respectively, indicating that NUV/optical/NIR imaging can be used to reliably characterize massive (M ~ 8-30 M_sun) main sequence star properties relative to optical spectroscopy; (d) the properties of the most massive stars in H II regions are consistent with constraints from previous nebular emission line studies; and (e) 13 stars with M>8 M_sun are >40 pc from a known star cluster or H II region. Our sample comprises ~50% of all known massive stars at Z < 10% Z_sun with derived stellar parameters, high-quality optical spectra, and panchromatic photometry.

preprint2022arXiv

A Tilt in the Dark Matter Halo of the Galaxy

Recent observations of the stellar halo have uncovered the debris of an ancient merger, Gaia-Sausage-Enceladus, estimated to have occurred ~8 Gyr ago. Follow-up studies have associated GSE with a large-scale tilt in the stellar halo that links two well-known stellar over-densities in diagonally opposing octants of the Galaxy (the Hercules-Aquila Cloud and Virgo Overdensity; HAC and VOD). In this paper, we study the plausibility of such unmixed merger debris persisting over several Gyr in the Galactic halo. We employ the simulated stellar halo from Naidu et al. (2021), which reproduces several key properties of the merger remnant, including the large-scale tilt. By integrating the orbits of these simulated stellar halo particles, we show that adoption of a spherical halo potential results in rapid phase mixing of the asymmetry. However, adopting a tilted halo potential preserves the initial asymmetry in the stellar halo for many Gyr. The asymmetry is preserved even when a realistic growing disk is added to the potential. These results suggest that HAC and VOD are long-lived structures that are associated with GSE and that the dark matter halo of the Galaxy is tilted with respect to the disk and aligned in the direction of HAC-VOD. Such halo-disk misalignment is common in modern cosmological simulations. Lastly, we study the relationship between the local and global stellar halo in light of a tilted global halo comprised of highly radial orbits. We find that the local halo offers a dynamically biased view of the global halo due to its displacement from the Galactic Center.

preprint2022arXiv

An Unsupervised Learning Approach for Quasar Continuum Prediction

Modeling quasar spectra is a fundamental task in astrophysics as quasars are the tell-tale sign of cosmic evolution. We introduce a novel unsupervised learning algorithm, Quasar Factor Analysis (QFA), for recovering the intrinsic quasar continua from noisy quasar spectra. QFA assumes that the Ly$α$ forest can be approximated as a Gaussian process, and the continuum can be well described as a latent factor model. We show that QFA can learn, through unsupervised learning and directly from the quasar spectra, the quasar continua and Ly$α$ forest simultaneously. Compared to previous methods, QFA achieves state-of-the-art performance for quasar continuum prediction robustly but without the need for predefined training continua. In addition, the generative and probabilistic nature of QFA paves the way to understanding the evolution of black holes as well as performing out-of-distribution detection and other Bayesian downstream inferences.

preprint2022arXiv

Astroconformer: Inferring Surface Gravity of Stars from Stellar Light Curves with Transformer

We introduce Astroconformer, a Transformer-based model to analyze stellar light curves from the Kepler mission. We demonstrate that Astrconformer can robustly infer the stellar surface gravity as a supervised task. Importantly, as Transformer captures long-range information in the time series, it outperforms the state-of-the-art data-driven method in the field, and the critical role of self-attention is proved through ablation experiments. Furthermore, the attention map from Astroconformer exemplifies the long-range correlation information learned by the model, leading to a more interpretable deep learning approach for asteroseismology. Besides data from Kepler, we also show that the method can generalize to sparse cadence light curves from the Rubin Observatory, paving the way for the new era of asteroseismology, harnessing information from long-cadence ground-based observations.

preprint2022arXiv

Deep Potential: Recovering the gravitational potential from a snapshot of phase space

One of the major goals of the field of Milky Way dynamics is to recover the gravitational potential field. Mapping the potential would allow us to determine the spatial distribution of matter - both baryonic and dark - throughout the Galaxy. We present a novel method for determining the gravitational field from a snapshot of the phase-space positions of stars, based only on minimal physical assumptions, which makes use of recently developed tools from the field of deep learning. We first train a normalizing flow on a sample of observed six-dimensional phase-space coordinates of stars, obtaining a smooth, differentiable approximation of the distribution function. Using the Collisionless Boltzmann Equation, we then find the gravitational potential - represented by a feed-forward neural network - that renders this distribution function stationary. This method, which we term &#34;Deep Potential,&#34; is more flexible than previous parametric methods, which fit restricted classes of analytic models of the distribution function and potential to the data. We demonstrate Deep Potential on mock datasets, and demonstrate its robustness under various non-ideal conditions. Deep Potential is a promising approach to mapping the density of the Milky Way and other stellar systems, using rich datasets of stellar positions and kinematics now being provided by Gaia and ground-based spectroscopic surveys.

preprint2022arXiv

Detecting the non-Gaussianity of the 21-cm signal during reionisation with the Wavelet Scattering Transform

Detecting the 21-cm hyperfine transition from neutral hydrogen in the intergalactic medium is our best probe for understanding the astrophysical processes driving the Epoch of Reionisation (EoR). The primary means for a detection of this 21-cm signal is through a statistical measurement of the spatial fluctuations using the 21-cm power spectrum (PS). However, the 21-cm signal is non-Gaussian meaning the PS, which only measures the Gaussian fluctuations, is sub-optimal for characterising all of the available information. The upcoming Square Kilometre Array (SKA) will perform a deep, 1000 hr observation over 100 deg$.^{2}$ specifically designed to recover direct images of the 21-cm signal. In this work, we use the Wavelet Scattering Transform (WST) to extract the non-Gaussian information directly from these two-dimensional images of the 21-cm signal. The key advantage of the WST is its stability with respect to statistical noise for measuring non-Gaussian information, unlike the bispectrum whose statistical noise diverges. We introduce a novel method to isolate this non-Gaussian information from mock 21-cm images and demonstrate its detection at 150 (177)~MHz ($z\sim8.5$ and $\sim7$) for a fiducial model with signal-to-noise of $\sim$5~(8) assuming perfect foreground removal and $\sim2$~(3) assuming foreground wedge avoidance.

preprint2022arXiv

Exploring the cosmic 21-cm signal from the Epoch of Reionisation using the Wavelet Scattering Transform

Detecting the cosmic 21-cm signal during the Epoch of Reionisation and Cosmic Dawn will reveal insights into the properties of the first galaxies and advance cosmological parameter estimation. Until recently, the primary focus for astrophysical parameter inference from the 21-cm signal centred on the power spectrum (PS). However, the cosmic 21-cm signal is highly non-Gaussian rendering the PS sub-optimal for characterising the cosmic signal. In this work, we introduce a new technique to analyse the non-Gaussian information in images of the 21-cm signal called the Wavelet Scattering Transform (WST). This approach closely mirrors that of convolutional neural networks with the added advantage of not requiring tuning or training of a neural network. Instead, it compresses the 2D spatial information into a set of coefficients making it easier to interpret while also providing a robust statistical description of the non-Gaussian information contained in the cosmic 21-cm signal. First, we explore the application of the WST to mock 21-cm images to gain valuable physical insights by comparing to the known behaviour from the 21-cm PS. Then we quantitatively explore the WST applied to the 21-cm signal by extracting astrophysical parameter constraints using Fisher Matrices from a realistic 1000 hr mock observation with the Square Kilometre Array. We find that: (i) the WST applied only to 2D images can outperform the 3D spherically averaged 21-cm PS, (ii) the excision of foreground contaminated modes can degrade the constraining power by a factor of ~1.5-2 with the WST and (iii) higher cadences between the 21-cm images can further improve the constraining power.

preprint2022arXiv

Galaxy Merger Reconstruction with Equivariant Graph Normalizing Flows

A key yet unresolved question in modern-day astronomy is how galaxies formed and evolved under the paradigm of the $Λ$CDM model. A critical limiting factor lies in the lack of robust tools to describe the merger history through a statistical model. In this work, we employ a generative graph network, E(n) Equivariant Graph Normalizing Flows Model. We demonstrate that, by treating the progenitors as a graph, our model robustly recovers their distributions, including their masses, merging redshifts and pairwise distances at redshift z=2 conditioned on their z=0 properties. The generative nature of the model enables other downstream tasks, including likelihood-free inference, detecting anomalies and identifying subtle correlations of progenitor features.

preprint2022arXiv

How Many Elements Matter?

Some studies of stars&#39; multi-element abundance distributions suggest at least 5-7 significant dimensions, but others show that many elemental abundances can be predicted to high accuracy from [Fe/H] and [Mg/Fe] (or [Fe/H] and age) alone. We show that both propositions can be, and are, simultaneously true. We adopt a machine learning technique known as normalizing flow to reconstruct the probability distribution of Milky Way disk stars in the space of 15 elemental abundances measured by APOGEE. Conditioning on Teff and log g minimizes the differential systematics. After further conditioning on [Fe/H] and [Mg/Fe], the residual scatter for most abundances is $σ_{[X/{\rm H}]} \lesssim 0.02$ dex, consistent with APOGEE&#39;s reported statistical uncertainties of $\sim$0.01-0.015 dex and intrinsic scatter of 0.01-0.02 dex. Despite the small scatter, residual abundances display clear correlations between elements, which we show are too large to be explained by measurement uncertainties or by the finite sampling noise. We must condition on at least seven elements to reduce correlations to a level consistent with observational uncertainties. Our results demonstrate that cross-element correlations are a much more sensitive probe of hidden structure than dispersion, and they can be measured precisely in a large sample even if star-by-star measurement noise is comparable to the intrinsic scatter. We conclude that many elements have an independent story to tell, even for the &#34;mundane&#34; disk stars and elements produced by core-collapse and Type Ia supernovae. The only way to learn these lessons is to measure the abundances directly, and not merely infer them.

preprint2022arXiv

Li-rich Giants in LAMOST Survey. III. The statistical analysis of Li-rich giants

The puzzle of Li-rich giant is still unsolved, contradicting the prediction of the standard stellar models. Although the exact evolutionary stages play a key role in the knowledge of Li-rich giants, a limited number of Li-rich giants have been taken with high-quality asteroseismic parameters to clearly distinguish the stellar evolutionary stages. Based on the LAMOST Data Release 7 (DR7), we applied a data-driven neural network method to derive the parameters for giant stars, which contain the largest number of Li-rich giants. The red giant stars are classified into three stages of Red Giant Branch (RGB), Primary Red Clump (PRC), and Secondary Red Clump (SRC) relying on the estimated asteroseismic parameters. In the statistical analysis of the properties (i.e. stellar mass, carbon, nitrogen, Li-rich distribution, and frequency) of Li-rich giants, we found that: (1) Most of the Li-rich RGB stars are suggested to be the descendants of Li-rich pre-RGB stars and/or the result of engulfment of planet or substellar companions; (2) The massive Li-rich SRC stars could be the natural consequence of Li depletion from the high-mass Li-rich RGB stars. (3) Internal mixing processes near the helium flash can account for the phenomenon of Li-rich on PRC that dominated the Li-rich giants. Based on the comparison of [C/N] distributions between Li-rich and normal PRC stars, the Li-enriched processes probably depend on the stellar mass.

preprint2022arXiv

Live Fast, Die $α$-Enhanced: The Mass-Metallicity-$α$ Relation of the Milky Way&#39;s Disrupted Dwarf Galaxies

The Milky Way&#39;s satellite galaxies (&#34;surviving dwarfs&#34;) have been studied for decades as unique probes of chemical evolution in the low-mass regime. Here we extend such studies to the &#34;disrupted dwarfs&#34;, whose debris constitutes the stellar halo. We present abundances ([Fe/H], [$α$/Fe]) and stellar masses for nine disrupted dwarfs with $M_{\star}\approx10^{6}-10^{9}M_{\odot}$ from the H3 Survey (Sagittarius, $Gaia$-Sausage-Enceladus, Helmi Streams, Sequoia, Wukong/LMS-1, Cetus, Thamnos, I&#39;itoi, Orphan/Chenab). The surviving and disrupted dwarfs are chemically distinct: at fixed mass, the disrupted dwarfs are systematically metal-poor and $α$-enhanced. The disrupted dwarfs define a mass-metallicity relation (MZR) with a similar slope as the $z=0$ MZR followed by the surviving dwarfs, but offset to lower metallicities by $Δ$[Fe/H]$\approx0.3-0.4$ dex. Dwarfs with larger offsets from the $z=0$ MZR are more $α$-enhanced. In simulations as well as observations, galaxies with higher $Δ$[Fe/H] formed at higher redshifts -- exploiting this, we infer the disrupted dwarfs have typical star-formation truncation redshifts of $z_{\rm{trunc}}{\sim}1-2$. We compare the chemically inferred $z_{\rm{trunc}}$ with dynamically inferred accretion redshifts and find almost all dwarfs are quenched only after accretion. The differences between disrupted and surviving dwarfs are likely because the disrupted dwarfs assembled their mass rapidly, at higher redshifts, and within denser dark matter halos that formed closer to the Galaxy. Our results place novel archaeological constraints on low-mass galaxies inaccessible to direct high-$z$ studies: (i) the redshift evolution of the MZR along parallel tracks but offset to lower metallicities extends to $M_{\star}\approx10^{6}-10^{9}M_{\odot}$; (ii) galaxies at $z\approx2-3$ are $α$-enhanced with [$α$/Fe]$\approx0.4$.

preprint2022arXiv

Mass and Age determination of the LAMOST data with different Machine Learning methods

We present a catalog of 948,216 stars with mass label and a catalog of 163,105 red clump (RC) stars with mass and age labels simultaneously. The training dataset is cross matched from the LAMOST (The Large Sky Area Multi-Object Fiber Spectroscopic Telescope) DR5 and high resolution asteroseismology data, mass and age are predicted by random forest method or convex hull algorithm. The stellar parameters with high correlation with mass and age are extracted and the test dataset shows that the median relative error of the prediction model for the mass of large sample is 3\% and meanwhile, the mass and age of red clump stars are 4\% and 7\%. We also compare the predicted age of red clump stars with the recent works and find that the final uncertainty of the RC sample could reach 18\% for age and 9\% for mass, in the meantime, final precision of the mass for large sample with different type of stars could reach 13\% without considering systematics, all these are implying that this method could be widely used in the future. Moreover, we explore the performance of different machine learning methods for our sample, including bayesian linear regression (BYS), gradient boosting decision Tree (GBDT), multilayer perceptron (MLP), multiple linear regression (MLR), random forest (RF) and support vector regression (SVR). Finally we find that the performance of nonlinear model is generally better than that of linear model, and the GBDT and RF methods are relatively better.

preprint2022arXiv

Reliable stellar abundances of individual stars with the MUSE integral-field spectrograph

We present a novel approach to deriving stellar labels for stars observed in MUSE fields making use of data-driven machine learning methods. Taking advantage of the comparable spectral properties (resolution, wavelength coverage) of the LAMOST and MUSE instruments, we adopt the Data-Driven Payne (DD-Payne) model used on LAMOST observations and apply it to stars observed in MUSE fields. Remarkably, in spite of instrumental differences, according to the cross-validation of 27 LAMOST-MUSE common stars, we are able to determine stellar labels with precision better than 75K in $T_{\rm eff}$, 0.15 dex in $\log g$, and 0.1 dex in abundances of [Fe/H], [Mg/Fe], [Si/Fe], [Ti/Fe], [C/Fe], [Ni/Fe] and [Cr/Fe] for current MUSE observations over a parameter range of 3800<$T_{\rm eff}$<7000 K, -1.5<[Fe/H]<0.5 dex. To date, MUSE has been used to target 13,000 fields across the southern sky since it was first commissioned six years ago and it is unique in its ability to study dense star fields such as globular clusters or the Milky Way bulge. Our method will enable the automated determination of stellar parameters for all stars in these fields. Additionally, it opens the door for applications to data collected by other spectrographs having resolution similar to LAMOST. With the upcoming BlueMUSE and MAVIS, we will gain access to a whole new range of chemical abundances with higher precision, especially critical s-process elements such as [Y/Fe] and [Ba/Fe] that provide key age diagnostics for stellar targets.

preprint2022arXiv

The eccentricity distribution of wide binaries and their individual measurements

Eccentricity of wide binaries is difficult to measure due to their long orbital periods. With Gaia&#39;s high-precision astrometric measurements, eccentricity of a wide binary can be constrained by the angle between the separation vector and the relative velocity vector (the $v$-$r$ angle). In this paper, by using the $v$-$r$ angles of wide binaries in Gaia Early Data Release 3, we develop a Bayesian approach to measure the eccentricity distribution as a function of binary separations. Furthermore, we infer the eccentricities of individual wide binaries and make them publicly available. Our results show that the eccentricity distribution of wide binaries at $10^2$ AU is close to uniform and becomes superthermal at $>10^{3}$ AU, suggesting two formation mechanisms dominating at different separation regimes. The close binary formation, most likely disk fragmentation, results in a uniform eccentricity distribution at $<10^{2}$ AU. The wide binary formation that leads to highly eccentric wide binaries at $>10^{3}$ AU may be turbulent fragmentation and/or the dynamical unfolding of compact triples. With Gaia, measuring eccentricities is now possible for a large number of wide binaries, opening a new window to understanding binary formation and evolution.

preprint2022arXiv

The GALAH Survey: A New Sample of Extremely Metal-Poor Stars Using A Machine Learning Classification Algorithm

Extremely Metal-Poor (EMP) stars provide a valuable probe of early chemical enrichment in the Milky Way. Here we leverage a large sample of $\sim600,000$ high-resolution stellar spectra from the GALAH survey plus a machine learning algorithm to find 54 candidates with estimated [Fe/H]~$\leq$~-3.0, 6 of which have [Fe/H]~$\leq$~-3.5. Our sample includes $\sim 20 \%$ main sequence EMP candidates, unusually high for \emp surveys. We find the magnitude-limited metallicity distribution function of our sample is consistent with previous work that used more complex selection criteria. The method we present has significant potential for application to the next generation of massive stellar spectroscopic surveys, which will expand the available spectroscopic data well into the millions of stars.

preprint2022arXiv

The GALAH Survey: Chemical tagging and chrono-chemodynamics of accreted halo stars with GALAH+ DR3 and $Gaia$ eDR3

Since the advent of $Gaia$ astrometry, it is possible to identify massive accreted systems within the Galaxy through their unique dynamical signatures. One such system, $Gaia$-Sausage-Enceladus (GSE), appears to be an early &#34;building block&#34; given its virial mass $> 10^{10}\,\mathrm{M_\odot}$ at infall ($z\sim1-3$). In order to separate the progenitor population from the background stars, we investigate its chemical properties with up to 30 element abundances from the GALAH+ Survey Data Release 3 (DR3). To inform our choice of elements for purely chemically selecting accreted stars, we analyse 4164 stars with low-$α$ abundances and halo kinematics. These are most different to the Milky Way stars for abundances of Mg, Si, Na, Al, Mn, Fe, Ni, and Cu. Based on the significance of abundance differences and detection rates, we apply Gaussian mixture models to various element abundance combinations. We find the most populated and least contaminated component, which we confirm to represent GSE, contains 1049 stars selected via [Na/Fe] vs. [Mg/Mn] in GALAH+ DR3. We provide tables of our selections and report the chrono-chemodynamical properties (age, chemistry, and dynamics). Through a previously reported clean dynamical selection of GSE stars, including $30 < \sqrt{J_R~/~\mathrm{kpc\,km\,s^{-1}}} < 55$, we can characterise an unprecedented 24 abundances of this structure with GALAH+ DR3. Our chemical selection allows us to prevent circular reasoning and characterise the dynamical properties of the GSE, for example mean $\sqrt{J_R~/~\mathrm{kpc\,km\,s^{-1}}} = 26_{-14}^{+9}$. We find only $(29\pm1)\%$ of the GSE stars within the clean dynamical selection region. Our methodology will improve future studies of accreted structures and their importance for the formation of the Milky Way.

preprint2022arXiv

Unsupervised Learning for Stellar Spectra with Deep Normalizing Flows

Stellar spectra encode detailed information about the stars. However, most machine learning approaches in stellar spectroscopy focus on supervised learning. We introduce Mendis, an unsupervised learning method, which adopts normalizing flows consisting of Neural Spline Flows and GLOW to describe the complex distribution of spectral space. A key advantage of Mendis is that we can describe the conditional distribution of spectra, conditioning on stellar parameters, to unveil the underlying structures of the spectra further. In particular, our study demonstrates that Mendis can robustly capture the pixel correlations in the spectra leading to the possibility of detecting unknown atomic transitions from stellar spectra. The probabilistic nature of Mendis also enables a rigorous determination of outliers in extensive spectroscopic surveys without the need to measure elemental abundances through existing analysis pipelines beforehand.

preprint2022arXiv

Wide binaries from the H3 survey: the thick disk and halo have similar wide binary fractions

Due to the different environments in the Milky Way&#39;s disk and halo, comparing wide binaries in the disk and halo is key to understanding wide binary formation and evolution. By using Gaia Early Data Release 3, we search for resolved wide binary companions in the H3 survey, a spectroscopic survey that has compiled $\sim$150,000 spectra for thick-disk and halo stars to date. We identify 800 high-confidence (a contamination rate of 4%) wide binaries and two resolved triples, with binary separations mostly between $10^3$-$10^5$ AU and a lowest [Fe/H] of $-2.7$. Based on their Galactic kinematics, 33 of them are halo wide binaries, and most of those are associated with the accreted Gaia-Sausage-Enceladus galaxy. The wide binary fraction in the thick disk decreases toward the low metallicity end, consistent with the previous findings for the thin disk. Our key finding is that the halo wide binary fraction is consistent with the thick-disk stars at a fixed [Fe/H]. There is no significant dependence of the wide binary fraction on the $α$-captured abundance. Therefore, the wide binary fraction is mainly determined by the iron abundance, not their disk or halo origin nor the $α$-captured abundance. Our results suggest that the formation environments play a major role for the wide binary fraction, instead of other processes like radial migration that only apply to disk stars.

preprint2022arXiv

Wide twin binaries are extremely eccentric: evidence of twin binary formation in circumbinary disks

The Gaia mission recently revealed an excess population of equal-mass &#34;twin&#34; wide binaries, with mass ratio $q\gtrsim 0.95$, extending to separations of at least 1000 AU. The origin of this population is an enigma: twin binaries are thought to form via correlated accretion in circumbinary disks, but the typical observed protostellar disks have radii of $\sim100$ AU, far smaller than the separations of the widest twins. Here, we infer the eccentricity distribution of wide twins from the distribution of their $v$-$r$ angles, i.e., the angle between the components&#39; separation and relative velocity vectors. We find that wide twins must be on extremely eccentric orbits. For the excess-twin population at 400-1000 AU, we infer a near-delta function excess of high-eccentricity system, with eccentricity $0.95 \lesssim e \leq 1$. These high eccentricities for wide twins imply pericenter distances of order $10$ AU and suggest that their orbits were scattered via dynamical interactions in their birth environments, consistent with a scenario in which twins are born in circumbinary disks and subsequently widened. These results further establish twin wide binaries as a distinct population and imply that wide twins can be used as a probe of the dynamical history of stellar populations.

preprint2022arXiv

Zeta-Payne: a fully automated spectrum analysis algorithm for the Milky Way Mapper program of the SDSS-V survey

The Sloan Digital Sky Survey has recently initiated its 5th survey generation (SDSS-V), with a central focus on stellar spectroscopy. In particular, SDSS-V Milky Way Mapper program will deliver multi-epoch optical and near-infrared spectra for more than 5 million stars across the entire sky, covering a large range in stellar mass, surface temperature, evolutionary stage, and age. About 10% of those spectra will be of hot stars of OBAF spectral types, for whose analysis no established survey pipelines exist. Here we present the spectral analysis algorithm, Zeta-Payne, developed specifically to obtain stellar labels from SDSS-V spectra of stars with these spectral types and drawing on machine learning tools. We provide details of the algorithm training, its test on artificial spectra, and its validation on two control samples of real stars. Analysis with Zeta-Payne leads to only modest internal uncertainties in the near-IR with APOGEE (optical with BOSS): 3-10% (1-2%) for Teff, 5-30% (5-25%) for v*sin(i), 1.7-6.3 km/s(0.7-2.2 km/s) for RV, $<0.1$ dex ($<0.05$ dex) for log(g), and 0.4-0.5 dex (0.1 dex) for [M/H] of the star, respectively. We find a good agreement between atmospheric parameters of OBAF-type stars when inferred from their high- and low-resolution optical spectra. For most stellar labels the APOGEE spectra are (far) less informative than the BOSS spectra of these stars, while log(g), v*sin(i), and [M/H] are in most cases too uncertain for meaningful astrophysical interpretation. This makes BOSS low-resolution optical spectra better for stellar labels of OBAF-type stars, unless the latter are subject to high levels of extinction.

preprint2021arXiv

Chemical Cartography with APOGEE: Mapping Disk Populations with a Two-Process Model and Residual Abundances

We apply a novel statistical analysis to measurements of 16 elemental abundances in 34,410 Milky Way disk stars from the final data release (DR17) of APOGEE-2. Building on recent work, we fit median abundance ratio trends [X/Mg] vs. [Mg/H] with a 2-process model, which decomposes abundance patterns into a &#34;prompt&#34; component tracing core collapse supernovae and a &#34;delayed&#34; component tracing Type Ia supernovae. For each sample star, we fit the amplitudes of these two components, then compute the residuals Δ[X/H] from this two-parameter fit. The rms residuals range from ~0.01-0.03 dex for the most precisely measured APOGEE abundances to ~0.1 dex for Na, V, and Ce. The correlations of residuals reveal a complex underlying structure, including a correlated element group comprised of Ca, Na, Al, K, Cr, and Ce and a separate group comprised of Ni, V, Mn, and Co. Selecting stars poorly fit by the 2-process model reveals a rich variety of physical outliers and sometimes subtle measurement errors. Residual abundances allow comparison of populations controlled for differences in metallicity and [α/Fe]. Relative to the main disk (R=3-13 kpc, |Z|<2 kpc), we find nearly identical abundance patterns in the outer disk (R=15-17 kpc), 0.05-0.2 dex depressions of multiple elements in LMC and Gaia Sausage/Enceladus stars, and wild deviations (0.4-1 dex) of multiple elements in ωCen. Residual abundance analysis opens new opportunities for discovering chemically distinctive stars and stellar populations, for empirically constraining nucleosynthetic yields, and for testing chemical evolution models that include stochasticity in the production and redistribution of elements.

preprint2021arXiv

Evidence from Disrupted Halo Dwarfs that $r$-process Enrichment via Neutron Star Mergers is Delayed by $\gtrsim500$ Myrs

The astrophysical origins of $r$-process elements remain elusive. Neutron star mergers (NSMs) and special classes of core-collapse supernovae (rCCSNe) are leading candidates. Due to these channels&#39; distinct characteristic timescales (rCCSNe: prompt, NSMs: delayed), measuring $r$-process enrichment in galaxies of similar mass, but differing star-formation durations might prove informative. Two recently discovered disrupted dwarfs in the Milky Way&#39;s stellar halo, Kraken and \textit{Gaia}-Sausage Enceladus (GSE), afford precisely this opportunity: both have $M_{\star}\approx10^{8}M_{\rm{\odot}}$, but differing star-formation durations of ${\approx}2$ Gyrs and ${\approx}3.6$ Gyrs. Here we present $R\approx50,000$ Magellan/MIKE spectroscopy for 31 stars from these systems, detecting the $r$-process element Eu in all stars. Stars from both systems have similar [Mg/H]$\approx-1$, but Kraken has a median [Eu/Mg]$\approx-0.1$ while GSE has an elevated [Eu/Mg]$\approx0.2$. With simple models we argue NSM enrichment must be delayed by $500-1000$ Myrs to produce this difference. rCCSNe must also contribute, especially at early epochs, otherwise stars formed during the delay period would be Eu-free. In this picture, rCCSNe account for $\approx50\%$ of the Eu in Kraken, $\approx25\%$ in GSE, and $\approx15\%$ in dwarfs with extended star-formation durations like Sagittarius. The inferred delay time for NSM enrichment is $10-100\times$ longer than merger delay times from stellar population synthesis -- this is not necessarily surprising because the enrichment delay includes time taken for NSM ejecta to be incorporated into subsequent generations of stars. For example, this may be due to natal kicks that result in $r$-enriched material deposited far from star-forming gas, which then takes $\approx10^{8}-10^{9}$ years to cool in these galaxies.

preprint2021arXiv

Stellar labels for hot stars from low-resolution spectra - I. the HotPayne method and results for 330,000 stars from LAMOST DR6

We set out to determine stellar labels from low-resolution survey spectra of hot, OBA stars with effective temperature (Teff) higher than 7500K. This fills a gap in the scientific analysis of large spectroscopic stellar surveys such as LAMOST, which offers spectra for millions of stars at R=1800. We first explore the theoretical information content of such spectra for determining stellar labels, via the Cramér-Rao bound. We show that in the limit of perfect model spectra and observed spectra with S/N of 100, precise estimates are possible for a wide range of stellar labels: not only the effective temperature Teff, surface gravity logg, and projected rotation velocity vsini, but also the micro-turbulence velocity, Helium abundance and the elemental abundances [C/H], [N/H], [O/H], [Si/H], [S/H], and [Fe/H]. Our analysis illustrates that the temperature regime of around 9500K is challenging, as the dominant Balmer and Paschen line strength vary little with Teff. We implement the simultaneous fitting of these 11 stellar labels to LAMOST hot-star spectra using the Payne approach, drawing on Kurucz&#39;s ATLAS12/SYNTHE LTE spectra as the underlying models. We then obtain stellar parameter estimates for a sample of about 330,000 hot stars with LAMOST spectra, an increase by about two orders of magnitude in sample size. Among them, about 260,000 have good Gaia parallaxes (S/N>5), and more than 95 percent of them are luminous stars, mostly on the main sequence; the rest reflects lower luminosity evolved stars, such as hot subdwarfs and white dwarfs. We show that the fidelity of the abundance estimates is limited by the systematics of the underlying models, as they do not account for NLTE effects. Finally, we show the detailed distribution of vsini of stars with 8000-15,000K, illustrating that it extends to a sharp cut-off at the critical rotation velocity, across a wide range of temperatures.

preprint2021arXiv

The GALAH survey: tracing the Galactic disk with Open Clusters

Open clusters are unique tracers of the history of our own Galaxy&#39;s disk. According to our membership analysis based on \textit{Gaia} astrometry, out of the 226 potential clusters falling in the footprint of GALAH or APOGEE, we find that 205 have secure members that were observed by at least one of the survey. Furthermore, members of 134 clusters have high-quality spectroscopic data that we use to determine their chemical composition. We leverage this information to study the chemical distribution throughout the Galactic disk of 21 elements, from C to Eu. The radial metallicity gradient obtained from our analysis is $-$0.076$\pm$0.009 dex kpc$^{-1}$, which is in agreement with previous works based on smaller samples. Furthermore, the gradient in the [Fe/H] - guiding radius (r$_{\rm guid}$) plane is $-$0.073$\pm$0.008 dex kpc$^{-1}$. We show consistently that open clusters trace the distribution of chemical elements throughout the Galactic disk differently than field stars. In particular, at given radius, open clusters show an age-metallicity relation that has less scatter than field stars. As such scatter is often interpreted as an effect of radial migration, we suggest that these differences are due to the physical selection effect imposed by our Galaxy: clusters that would have migrated significantly also had higher chances to get destroyed. Finally, our results reveal trends in the [X/Fe]$-$r$_{\rm guid}$$-$age space, which are important to understand production rates of different elements as a function of space and time.

preprint2021arXiv

The Mass of the Milky Way from the H3 Survey

The mass of the Milky Way is a critical quantity which, despite decades of research, remains uncertain within a factor of two. Until recently, most studies have used dynamical tracers in the inner regions of the halo, relying on extrapolations to estimate the mass of the Milky Way. In this paper, we extend the hierarchical Bayesian model applied in Eadie & Jurić (2019) to study the mass distribution of the Milky Way halo; the new model allows for the use of all available 6D phase-space measurements. We use kinematic data of halo stars out to $142~{\rm kpc}$, obtained from the H3 Survey and $\textit{Gaia}$ EDR3, to infer the mass of the Galaxy. Inference is carried out with the No-U-Turn sampler, a fast and scalable extension of Hamiltonian Monte Carlo. We report a median mass enclosed within $100~{\rm kpc}$ of $\rm M(<100 \; kpc) = 0.69_{-0.04}^{+0.05} \times 10^{12} \; M_\odot$ (68% Bayesian credible interval), or a virial mass of $\rm M_{200} = M(<216.2_{-7.5}^{+7.5} \; kpc) = 1.08_{-0.11}^{+0.12} \times 10^{12} \; M_\odot$, in good agreement with other recent estimates. We analyze our results using posterior predictive checks and find limitations in the model&#39;s ability to describe the data. In particular, we find sensitivity with respect to substructure in the halo, which limits the precision of our mass estimates to $\sim 15\%$.

preprint2020arXiv

A Diffuse Metal-Poor Component of the Sagittarius Stream Revealed by the H3 Survey

The tidal disruption of the Sagittarius dwarf galaxy has generated a spectacular stream of stars wrapping around the entire Galaxy. We use data from $Gaia$ and the H3 Stellar Spectroscopic Survey to identify 823 high-quality Sagittarius members based on their angular momenta. The H3 Survey is largely unbiased in metallicity, and so our sample of Sagittarius members is similarly unbiased. Stream stars span a wide range in [Fe/H] from $-0.2$ to $\approx -3.0$, with a mean overall metallicity of $\langle$[Fe/H]$\rangle=-0.99$. We identify a strong metallicity-dependence to the kinematics of the stream members. At [Fe/H]$\gt -0.8$ nearly all members belong to the well-known cold ($σ_v \lt 20$ km/s) leading and trailing arms. At intermediate metallicities ($-1.9 \lt$[Fe/H]$\lt -0.8$) a significant population (24$\%$) emerges of stars that are kinematically offset from the cold arms. These stars also appear to have hotter kinematics. At the lowest metallicities ([Fe/H]$\lesssim-2$), the majority of stars (69$\%$) belong to this kinematically-offset diffuse population. Comparison to simulations suggests that the diffuse component was stripped from the Sagittarius progenitor at earlier epochs, and therefore resided at larger radius on average, compared to the colder metal-rich component. We speculate that this kinematically diffuse, low metallicity, population is the stellar halo of the Sagittarius progenitor system.

preprint2020arXiv

A Mystery in Chamaeleon: Serendipitous Discovery of a Galactic Symbiotic Nova

We present the serendipitous discovery of a low luminosity nova occurring in a symbiotic binary star system in the Milky Way. We lay out the extensive archival data alongside new follow-up observations related to the stellar object V$^*$ CN Cha in the constellation of Chamaeleon. The object had long period ($\sim\! 250\,$day), high amplitude ($\sim\! 3\,$mag) optical variability in its recent past, preceding an increase in optical brightness by $\sim\! 8\,$magnitudes and a persistence at this luminosity for about 3 years, followed by a period of $\sim\! 1.4\,{\rm mag}\,{\rm yr}^{-1}$ dimming. The object&#39;s current optical luminosity seems to be dominated by H$α$ emission, which also exhibits blue-shifted absorption (a P-Cygni-like profile). After consideration of a number of theories to explain these myriad observations, we determine that V$^*$ CN Cha is most likely a symbiotic (an evolved star-white dwarf binary) system which has undergone a long-duration, low luminosity, nova. Interpreted in this way, the outburst in V$^*$ CN Cha is among the lowest luminosity novae ever observed.

preprint2020arXiv

Ancient Very Metal-Poor Stars Associated With the Galactic Disk in the H3 Survey

Ancient, very metal-poor stars offer a window into the earliest epochs of galaxy formation and assembly. We combine data from the H3 Spectroscopic Survey and Gaia to measure metallicities, abundances of $α$ elements, stellar ages, and orbital properties of a sample of 482 very metal-poor (VMP; [Fe/H]$<-2$) stars in order to constrain their origins. This sample is confined to $1\lesssim |Z| \lesssim3$ kpc from the Galactic plane. We find that >70% of VMP stars near the disk are on prograde orbits and this fraction increases toward lower metallicities. This result unexpected if metal-poor stars are predominantly accreted from many small systems with no preferred orientation, as such a scenario would imply a mostly isotropic distribution. Furthermore, we find there is some evidence for higher fractions of prograde orbits amongst stars with lower [$α$/Fe]. Isochrone-based ages for main sequence turn-off stars reveal that these VMP stars are uniformly old ($\approx12$ Gyr) irrespective of the $α$ abundance and metallicity, suggesting that the metal-poor population was not born from the same well-mixed gas disk. We speculate that the VMP population has a heterogeneous origin, including both in-situ formation in the ancient disk and accretion from a satellite with the same direction of rotation as the ancient disk at early times. Our precisely measured ages for these VMP stars on prograde orbits show that the Galaxy has had a relatively quiescent merging history over most of cosmic time, and implies the angular momentum alignment of the Galaxy has been in place for at least 12 Gyr.

preprint2020arXiv

Chemically peculiar A and F stars with enhanced s-process and iron-peak elements: stellar radiative acceleration at work

We present $\gtrsim 15,000$ metal-rich (${\rm [Fe/H]}>-0.2$dex) A and F stars whose surface abundances deviate strongly from Solar abundance ratios and cannot plausibly reflect their birth material composition. These stars are identified by their high [Ba/Fe] abundance ratios (${\rm [Ba/Fe]}>1.0$dex) in the LAMOST DR5 spectra analyzed by Xiang et al. (2019). They are almost exclusively main sequence and subgiant stars with $T_{\rm eff}\gtrsim6300$K. Their distribution in the Kiel diagram ($T_{\rm eff}$--$\log g$) traces a sharp border at low temperatures along a roughly fixed-mass trajectory (around $1.4M_\odot)$ that corresponds to an upper limit in convective envelope mass fraction of around $10^{-4}$. Most of these stars exhibit distinctly enhanced abundances of iron-peak elements (Cr, Mn, Fe, Ni) but depleted abundances of Mg and Ca. Rotational velocity measurements from GALAH DR2 show that the majority of these stars rotate slower than typical stars in an equivalent temperature range. These characteristics suggest that they are related to the so-called Am/Fm stars. Their abundance patterns are qualitatively consistent with the predictions of stellar evolution models that incorporate radiative acceleration, suggesting they are a consequence of stellar internal evolution particularly involving the competition between gravitational settling and radiative acceleration. These peculiar stars constitute 40% of the whole population of stars with mass above 1.5$M_\odot$, affirming that &#34;peculiar&#34; photospheric abundances due to stellar evolution effects are a ubiquitous phenomenon for these intermediate-mass stars. This large sample of Ba-enhanced chemically peculiar A/F stars with individual element abundances provides the statistics to test more stringently the mechanisms that alter the surface abundances in stars with radiative envelopes.

preprint2020arXiv

Cycle-StarNet: Bridging the gap between theory and data by leveraging large datasets

The advancements in stellar spectroscopy data acquisition have made it necessary to accomplish similar improvements in efficient data analysis techniques. Current automated methods for analyzing spectra are either (a) data-driven, which requires prior knowledge of stellar parameters and elemental abundances, or (b) based on theoretical synthetic models that are susceptible to the gap between theory and practice. In this study, we present a hybrid generative domain adaptation method that turns simulated stellar spectra into realistic spectra by applying unsupervised learning to large spectroscopic surveys. We apply our technique to the APOGEE H-band spectra at R=22,500 and the Kurucz synthetic models. As a proof of concept, two case studies are presented. The first of which is the calibration of synthetic data to become consistent with observations. To accomplish this, synthetic models are morphed into spectra that resemble observations, thereby reducing the gap between theory and observations. Fitting the observed spectra shows an improved average reduced $χ_R^2$ from 1.97 to 1.22, along with a reduced mean residual from 0.16 to -0.01 in normalized flux. The second case study is the identification of the elemental source of missing spectral lines in the synthetic modelling. A mock dataset is used to show that absorption lines can be recovered when they are absent in one of the domains. This method can be applied to other fields, which use large data sets and are currently limited by modelling accuracy. The code used in this study is made publicly available on github.

preprint2020arXiv

Discovery of ubiquitous lithium production in low-mass stars

The vast majority of stars with mass similar to the Sun are expected to only destroy lithium over the course of their lives, via low-temperature nuclear burning. This has now been supported by observations of hundreds of thousands of red giant stars (Brown et al. 1989, Kumar et al. 2011, Deepak et al. 2019, Singh et al. 2019, Casey et al. 2019). Here we perform the first large-scale systematic investigation into the Li content of stars in the red clump phase of evolution, which directly follows the red giant branch phase. Surprisingly we find that all red clump stars have high levels of lithium for their evolutionary stage. On average the lithium content increases by a factor of 40 after the end of the red giant branch stage. This suggests that all low-mass stars undergo a lithium production phase between the tip of the red giant branch and the red clump. We demonstrate that our finding is not predicted by stellar theory, revealing a stark tension between observations and models. We also show that the heavily studied (Brown et al. 1989, Reddy et al. 2005, Kumar et al. 2011, Singh et al. 2019, Casey et al. 2019) very Li-rich giants, with A(Li) $> +1.5$ dex, represent only the extreme tail of the lithium enhancement distribution, comprising 3% of red clump stars. Our findings suggest a new definition limit for Li-richness in red clump stars, A(Li) $> -0.9$ dex, which is much lower than the limit of A(Li) $> +1.5$ dex used over many decades (Brown et al. 1989, Castilho et al. 1995, Reddy et al. 2005, Carlberg et al. 2016, Casey et al. 2019, Holanda et al. 2020).

preprint2020arXiv

Forecasting Chemical Abundance Precision for Extragalactic Stellar Archaeology

Increasingly powerful and multiplexed spectroscopic facilities promise detailed chemical abundance patterns for millions of resolved stars in galaxies beyond the Milky Way (MW). Here, we employ the Cramér-Rao Lower Bound (CRLB) to forecast the precision to which stellar abundances for metal-poor, low-mass stars outside the MW can be measured for 41 current (e.g., Keck, MMT, VLT, DESI) and planned (e.g., MSE, JWST, ELTs) spectrograph configurations. We show that moderate resolution ($R\lesssim5000$) spectroscopy at blue-optical wavelengths ($λ\lesssim4500$ Å) (i) enables the recovery of 2-4 times as many elements as red-optical spectroscopy ($5000\lesssimλ\lesssim10000$ Å) at similar or higher resolutions ($R\sim 10000$) and (ii) can constrain the abundances of several neutron capture elements to $\lesssim$0.3 dex. We further show that high-resolution ($R\gtrsim 20000$), low S/N ($\sim$10 pixel$^{-1}$) spectra contain rich abundance information when modeled with full spectral fitting techniques. We demonstrate that JWST/NIRSpec and ELTs can recover (i) $\sim$10 and 30 elements, respectively, for metal-poor red giants throughout the Local Group and (ii) [Fe/H] and [$α$/Fe] for resolved stars in galaxies out to several Mpc with modest integration times. We show that select literature abundances are within a factor of $\sim$2 (or better) of our CRLBs. We suggest that, like ETCs, CRLBs should be used when planning stellar spectroscopic observations. We include an open source python package, \texttt{Chem-I-Calc}, that allows users to compute CRLBs for spectrographs of their choosing.

preprint2020arXiv

From the Inner to Outer Milky Way: A Photometric Sample of 2.6 Million Red Clump Stars

Large pristine samples of red clump stars are highly sought after given that they are standard candles and give precise distances even at large distances. However, it is difficult to cleanly select red clumps stars because they can have the same T$_{\mathrm{eff}}$ and log $g$ as red giant branch stars. Recently, it was shown that the asteroseismic parameters, $\rmΔ$P and $\rm{Δν}$, which are used to accurately select red clump stars, can be derived from spectra using the change in the surface carbon to nitrogen ratio ([C/N]) caused by mixing during the red giant branch. This change in [C/N] can also impact the spectral energy distribution. In this study, we predict the $\rmΔ$P, $\rm{Δν}$, T$_{\mathrm{eff}}$ and log $g$ using 2MASS, AllWISE, \gaia, and Pan-STARRS data in order to select a clean sample of red clump stars. We achieve a contamination rate of $\sim$20\%, equivalent to what is achieved when selecting from T$_{\mathrm{eff}}$ and log $g$ derived from low resolution spectra. Finally, we present two red clump samples. One sample has a contamination rate of $\sim$ 20\% and $\sim$ 405,000 red clump stars. The other has a contamination of $\sim$ 33\% and $\sim$ 2.6 million red clump stars which includes $\sim$ 75,000 stars at distances $>$ 10 kpc. For |b|>30 degrees we find $\sim$ 15,000 stars with contamination rate of $\sim$ 9\%. The scientific potential of this catalog for studying the structure and formation history of the Galaxy is vast given that it includes millions of precise distances to stars in the inner bulge and distant halo where astrometric distances are imprecise.

preprint2020arXiv

Interpreting Stellar Spectra with Unsupervised Domain Adaptation

We discuss how to achieve mapping from large sets of imperfect simulations and observational data with unsupervised domain adaptation. Under the hypothesis that simulated and observed data distributions share a common underlying representation, we show how it is possible to transfer between simulated and observed domains. Driven by an application to interpret stellar spectroscopic sky surveys, we construct the domain transfer pipeline from two adversarial autoencoders on each domains with a disentangling latent space, and a cycle-consistency constraint. We then construct a differentiable pipeline from physical stellar parameters to realistic observed spectra, aided by a supplementary generative surrogate physics emulator network. We further exemplify the potential of the method on the reconstructed spectra quality and to discover new spectral features associated to elemental abundances.

preprint2020arXiv

Keeping it Cool: Much Orbit Migration, yet Little Heating, in the Galactic Disk

A star in the Milky Way&#39;s disk can now be at a Galactocentric radius quite distant from its birth radius for two reasons: either its orbit has become eccentric through radial heating, which increases its radial action $J_R$ (`blurring&#39;); or merely its angular momentum $L_z$ has changed and thereby its guiding radius (`churning&#39;). We know that radial orbit migration is strong in the Galactic low-$α$ disk and set out to quantify the relative importance of these two effects, by devising and applying a parameterized model for the distribution $p(L_z, J_R, τ, \mathrm[Fe/H])$ in the stellar disk. This model describes the orbit evolution for stars of age $τ$ and metallicity [Fe/H], presuming coeval stars were initially born on (near-)circular orbits, and with a unique [Fe/H] at a given birth angular momentum and age. We fit this model to APOGEE red clump stars, accounting for the complex selection function of the survey. The best fit model implies changes of angular momentum of $\sqrt{\langle ΔL_z \rangle^2} \approx 619\, \mathrm{kpc~km/s~}(τ/\mathrm{6~Gyr})^{0.5}$, and changes of radial action as $\sqrt{\langle ΔJ_R \rangle^2} \approx 63\, \mathrm{kpc~km/s~} (τ/\mathrm{6~Gyr})^{0.6}$ at 8 kpc. This suggests that the secular orbit evolution of the disk is dominated by diffusion in angular momentum, with radial heating being an order of magnitude lower.

preprint2020arXiv

Milky Way Tomography with the SkyMapper Southern Survey. II. Photometric Re-calibration of SMSS DR2

We apply the spectroscopy-based stellar-color regression (SCR) method to perform an accurate photometric re-calibration of the second data release from the SkyMapper Southern Survey (SMSS DR2). From comparison with a sample of over 200,000 dwarf stars with stellar atmospheric parameters taken from GALAH+ DR3 and with accurate, homogeneous photometry from $Gaia$ DR2, zero-point offsets are detected in the original photometric catalog of SMSS DR2, in particular for the gravity- and metallicity-sensitive $uv$ bands. For $uv$ bands, the zero-point offsets are close to zero at very low extinction, and then steadily increase with $E (B - V)$, reaching as large as 0.174 and 0.134 mag respectively, at $E (B - V) \sim 0.5$ mag. These offsets largely arise from the adopted dust term in the transformations used by SMSS DR2 to construct photometric calibrators from the ATLAS reference catalog. For the $gr$ bands, the zero-point offsets exhibit negligible variations with SFD $E(B - V )$, due to their tiny coefficients on the dust term in the transformation. Our study also reveals small, but significant, spatial variations of the zero-point offsets in all $uvgr$ bands. External checks using Strömgren photometry, WD loci and the SDSS Stripe 82 standard-star catalog independently confirm the zero-points found by our revised SCR method.

preprint2020arXiv

MINESweeper: Spectrophotometric Modeling of Stars in the Gaia Era

We present MINESweeper, a tool to measure stellar parameters by jointly fitting observed spectra and broadband photometry to model isochrones and spectral libraries. This approach enables the measurement of spectrophotometric distances, in addition to stellar parameters such as Teff, log(g), [Fe/H], [a/Fe], and radial velocity. MINESweeper employs a Bayesian framework and can easily incorporate a variety of priors, including Gaia parallaxes. Mock data are fit in order to demonstrate how the precision of derived parameters depends on evolutionary phase and SNR. We then fit a selection of data in order to validate the model outputs. Fits to a variety of benchmark stars including Procyon, Arcturus, and the Sun result in derived stellar parameters that are in good agreement with the literature. We then fit combined spectra and photometry of stars in the open and globular clusters M92, M13, M3, M107, M71, and M67. Derived distances, [Fe/H], [a/Fe], and log(g)-Teff, relations are in overall good agreement with literature values, although there are trends between metallicity and log(g), within clusters that point to systematic uncertainties at the ~0.1 dex level. Finally, we fit a large sample of stars from the H3 Spectroscopic Survey in which high quality Gaia parallaxes are also available. These stars are fit without the Gaia parallaxes so that the geometric parallaxes can serve as an independent test of the spectrophotometric distances. Comparison between the two reveals good agreement within their formal uncertainties after accounting for the Gaia zero point uncertainties.

preprint2020arXiv

The GALAH Survey: A new constraint on cosmological lithium and Galactic lithium evolution from warm dwarf stars

Lithium depletion and enrichment in the cosmos is not yet well understood. To help tighten constraints on stellar and Galactic evolution models, we present the largest high-resolution analysis of Li abundances A(Li) to date, with results for over 100 000 GALAH field stars spanning effective temperatures $5900\,\mathrm{K} \lesssim \rm{T_{eff}} \lesssim7000\,\mathrm{K}$ and metallicities $-3 \lesssim \rm[Fe/H] \lesssim +0.5$. We separated these stars into two groups, on the warm and cool side of the so-called Li-dip, a localised region of the Kiel diagram wherein lithium is severely depleted. We discovered that stars in these two groups show similar trends in the A(Li)-[Fe/H] plane, but with a roughly constant offset in A(Li) of 0.4 dex, the warm group having higher Li abundances. At $\rm[Fe/H]\gtrsim-0.5$, a significant increasing in Li abundance with increasing metallicity is evident in both groups, signalling the onset of significant Galactic production. At lower metallicity, stars in the cool group sit on the Spite plateau, showing a reduced lithium of around 0.4 dex relative to the primordial value predicted from Big Bang nucleosynthesis (BBN). However, stars in the warm group between [Fe/H] = -1.0 and -0.5, form an elevated plateau that is largely consistent with the BBN prediction. This may indicate that these stars in fact preserve the primordial Li produced in the early Universe.

preprint2020arXiv

The Predicted Properties of Helium-Enriched Globular Cluster Progenitors at High Redshift

Globular cluster progenitors may have been detected by \textit{HST}, and are predicted to be observable with \textit{JWST} and ground-based extremely-large telescopes with adaptive optics. This has the potential to elucidate the issue of globular cluster formation and the origins of significantly helium-enriched subpopulations, a problem in Galactic astronomy with no satisfactory theoretical solution. Given this context, we use model stellar tracks and isochrones to investigate the predicted observational properties of helium-enriched stellar populations in globular cluster progenitors. We find that, relative to helium-normal populations, helium-enriched ($ΔY=+0.12$) stellar populations similar to those inferred in the most massive globular clusters, are expected, modulo some rapid fluctuations in the first $\sim$30 Myr, to be brighter and redder in the rest frame. At fixed age, stellar mass, and metallicity, a helium-enriched population is predicted to converge to being $\sim$0.40 mag brighter at $λ\approx 2.0\, μm$, and to be 0.30 mag redder in the \textit{JWST}-NIRCam colour $(F070W-F200W)$, and to actually be fainter for $λ\lesssim 0.50 \, μm$. Separately, we find that the time-integrated shift in ionizing radiation is a negligible $\sim 5\%$, though we show that the Lyman-$α$ escape fraction could end up higher for helium-enriched stars.

preprint2020arXiv

Timing the Early Assembly of the Milky Way with the H3 Survey

The archaeological record of stars in the Milky Way opens a uniquely detailed window into the early formation and assembly of galaxies. Here we use 11,000 main-sequence turn-off stars with well-measured ages, [Fe/H], [$α$/Fe], and orbits from the H3 Survey and Gaia to time the major events in the early Galaxy. Located beyond the Galactic plane, $1\lesssim |Z|/\rm kpc \lesssim4$, this sample contains three chemically distinct groups: a low metallicity population, and low-$α$ and high-$α$ groups at higher metallicity. The age and orbit distributions of these populations show that: 1) the high-$α$ group, which includes both disk stars and the in-situ halo, has a star-formation history independent of eccentricity that abruptly truncated $8.3\pm0.1$ Gyr ago ($z\simeq1$); 2) the low metallicity population, which we identify as the accreted stellar halo, is on eccentric orbits and its star formation truncated $10.2.^{+0.2}_{-0.1}$ Gyr ago ($z\simeq2$); 3) the low-$α$ population is primarily on low eccentricity orbits and the bulk of its stars formed less than 8 Gyr ago. These results suggest a scenario in which the Milky Way accreted a satellite galaxy at $z\approx2$ that merged with the early disk by $z\approx1$. This merger truncated star formation in the early high-$α$ disk and perturbed a fraction of that disk onto halo-like orbits. The merger enabled the formation of a chemically distinct, low-$α$ disk at $z\lesssim1$. The lack of any stars on halo-like orbits at younger ages indicates that this event was the last significant disturbance to the Milky Way disk.

preprint2019arXiv

Abundance Estimates for 16 Elements in 6 Million Stars from LAMOST DR5 Low-Resolution Spectra

We present the determination of stellar parameters and individual elemental abundances for 6 million stars from $\sim$8 million low-resolution ($R\sim1800$) spectra from LAMOST DR5. This is based on a modeling approach that we dub $The$ $Data$--$Driven$ $Payne$ ($DD$--$Payne$), which inherits essential ingredients from both {\it The Payne} \citep{Ting2019} and $The$ $Cannon$ \citep{Ness2015}. It is a data-driven model that incorporates constraints from theoretical spectral models to ensure the derived abundance estimates are physically sensible. Stars in LAMOST DR5 that are in common with either GALAH DR2 or APOGEE DR14 are used to train a model that delivers stellar parameters ($T_{\rm eff}$, $\log g$, $V_{\rm mic}$) and abundances for 16 elements (C, N, O, Na, Mg, Al, Si, Ca, Ti, Cr, Mn, Fe, Co, Ni, Cu, and Ba) when applied to LAMOST spectra. Cross-validation and repeat observations suggest that, for ${\rm S/N}_{\rm pix}\ge 50$, the typical internal abundance precision is 0.03--0.1\,dex for the majority of these elements, with 0.2--0.3\,dex for Cu and Ba, and the internal precision of $T_{\rm eff}$ and $\log g$ is better than 30\,K and 0.07\,dex, respectively. Abundance systematics at the $\sim$0.1\,dex level are present in these estimates, but are inherited from the high-resolution surveys&#39; training labels. For some elements, GALAH provides more robust training labels, for others, APOGEE. We provide flags to guide the quality of the label determination and to identify binary/multiple stars in LAMOST DR5. The abundance catalogs are publicly accessible via \href{url}{http://dr5.lamost.org/doc/vac}.

preprint2019arXiv

Identical or fraternal twins? : The chemical homogeneity of wide binaries from Gaia DR2

One of the high-level goals of Galactic archaeology is chemical tagging of stars across the Milky Way to piece together its assembly history. For this to work, stars born together must be uniquely chemically homogeneous. Wide binary systems are an important laboratory to test this underlying assumption. Here we present the detailed chemical abundance patterns of 50 stars across 25 wide binary systems comprised of main-sequence stars of similar spectral type identified in Gaia DR2 with the aim of quantifying their level of chemical homogeneity. Using high-resolution spectra obtained with McDonald Observatory, we derive stellar atmospheric parameters and precise detailed chemical abundances for light/odd-Z (Li, C, Na, Al, Sc, V, Cu), $α$ (Mg, Si, Ca), Fe-peak (Ti, Cr, Mn, Fe, Co, Ni, Zn), and neutron capture (Sr, Y, Zr, Ba, La, Nd, Eu) elements. Results indicate that 80% (20 pairs) of the systems are homogeneous in [Fe/H] at levels below 0.02 dex. These systems are also chemically homogeneous in all elemental abundances studied, with offsets and dispersions consistent with measurement uncertainties. We also find that wide binary systems are far more chemically homogeneous than random pairings of field stars of similar spectral type. These results indicate that wide binary systems tend to be chemically homogeneous but in some cases they can differ in their detailed elemental abundances at a level of [X/H] ~ 0.10 dex, overall implying chemical tagging in broad strokes can work.

preprint2019arXiv

The GALAH Survey: Temporal Chemical Enrichment of the Galactic Disk

We present isochrone ages and initial bulk metallicities ($\rm [Fe/H]_{bulk}$, by accounting for diffusion) of 163,722 stars from the GALAH Data Release 2, mainly composed of main sequence turn-off stars and subgiants ($\rm 7000 K>T_{eff}>4000 K$ and $\rm log g>3$ dex). The local age-metallicity relationship (AMR) is nearly flat but with significant scatter at all ages; the scatter is even higher when considering the observed surface abundances. After correcting for selection effects, the AMR appear to have intrinsic structures indicative of two star formation events, which we speculate are connected to the thin and thick disks in the solar neighborhood. We also present abundance ratio trends for 16 elements as a function of age, across different $\rm [Fe/H]_{bulk}$ bins. In general, we find the trends in terms of [X/Fe] vs age from our far larger sample to be compatible with studies based on small ($\sim$ 100 stars) samples of solar twins but we now extend it to both sub- and super-solar metallicities. The $α$-elements show differing behaviour: the hydrostatic $α$-elements O and Mg show a steady decline with time for all metallicities while the explosive $α$-elements Si, Ca and Ti are nearly constant during the thin disk epoch (ages $\lessapprox $ 12 Gyr). The s-process elements Y and Ba show increasing [X/Fe] with time while the r-process element Eu have the opposite trend, thus favouring a primary production from sources with a short time-delay such as core-collapse supernovae over long-delay events such as neutron star mergers.