Researcher profile

Rachel Mandelbaum

Rachel Mandelbaum contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
29works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

29 published item(s)

preprint2026arXiv

Diagnosing the Effects of Spectroscopic Training Set Imperfection on Photometric Redshift Performance

Most LSST extragalactic science will rely on photometric redshifts (photo-$z$) to extract distance information for the galaxies. However, an incomplete or non-representative training set can introduce bias into photo-$z$ estimation. It is necessary to understand how various forms of training set imperfection, such as incompleteness and non-trivial spectroscopic target selection, affect photo-$z$ estimation algorithms, and to identify metrics best-suited to quantify the impact. This work aims to systematically study metrics for diagnosing how various photo-$z$ methods react to certain types of training set incompleteness and non-representativeness. We use methods available through the open-source Python library Redshift Assessment Infrastructure Layers (RAIL) to systematically test the algorithms CMNN, GPz, FlexZBoost, and PZFlow on mock training data degraded in accordance with several existing spectroscopic sky surveys, as well as under conditions of inverse redshift incompleteness, which approximately mimics observed patterns of incompleteness at high redshift. We employ the algorithm TrainZ as a control. Finally, we quantify photo-$z$ algorithm performance using a variety of statistical metrics implemented externally to RAIL. We determine that the Kullback-Liebler Divergence, Wasserstein Distance, and Probability Integral Transform are particularly informative metrics with which to assess the impact of training set imperfection on algorithmic performance. We also find that inverse redshift incompleteness effects alone lack the complexity to realistically represent anticipated training data.

preprint2026arXiv

Hyrax: An Extensible Framework for Rapid ML Experimentation and Unsupervised Discovery in the Era of Rubin, Roman, and Euclid

The NSF-DOE Vera C. Rubin Observatory, Roman Space Telescope, Euclid, and other next-generation surveys will deliver imaging, spectroscopic, and time-domain data at scales that increasingly shift the bottleneck in astronomical machine learning (ML) projects from model design to infrastructure. We present Hyrax, an open-source, modular, GPU-enabled Python framework that supports the full ML lifecycle in astronomy: from data acquisition and training to inference and experiment comparison, with capabilities including multimodal dataset support, integrated vector databases for similarity search, and interactive two- and three-dimensional latent-space exploration for unsupervised discovery. We demonstrate Hyrax's versatility through five representative applications on real survey data: (i) unsupervised representation learning on $\sim 4\times10^5$ Rubin Legacy Survey of Space and Time (LSST) Data Preview 1 (DP1) galaxies, surfacing new merger and low-surface-brightness candidates missing from reference Euclid and Dark Energy Survey catalogs, while also isolating imaging artifacts -- all without labeled training data; (ii) hybrid density-based clustering for identifying cluster-scale gravitational lens candidates in DP1 data; (iii) multimodal early-time transient classification in the Zwicky Transient Facility leveraging light curves, spectra, images, and metadata; (iv) supervised false-positive filtering in shift-and-stack searches for distant solar system objects in the Dark Energy Camera Ecliptic Exploration Project survey; and (v) supervised detection of semi-resolved dwarf galaxies in Hyper Suprime-Cam and LSST-like imaging using synthetic source injection. Together, these results demonstrate that Hyrax provides astronomy-specific ML infrastructure that enables systematic discovery and rapid methodological iteration across next-generation astronomical surveys.

preprint2022arXiv

From Data to Software to Science with the Rubin Observatory LSST

The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) dataset will dramatically alter our understanding of the Universe, from the origins of the Solar System to the nature of dark matter and dark energy. Much of this research will depend on the existence of robust, tested, and scalable algorithms, software, and services. Identifying and developing such tools ahead of time has the potential to significantly accelerate the delivery of early science from LSST. Developing these collaboratively, and making them broadly available, can enable more inclusive and equitable collaboration on LSST science. To facilitate such opportunities, a community workshop entitled "From Data to Software to Science with the Rubin Observatory LSST" was organized by the LSST Interdisciplinary Network for Collaboration and Computing (LINCC) and partners, and held at the Flatiron Institute in New York, March 28-30th 2022. The workshop included over 50 in-person attendees invited from over 300 applications. It identified seven key software areas of need: (i) scalable cross-matching and distributed joining of catalogs, (ii) robust photometric redshift determination, (iii) software for determination of selection functions, (iv) frameworks for scalable time-series analyses, (v) services for image access and reprocessing at scale, (vi) object image access (cutouts) and analysis at scale, and (vii) scalable job execution systems. This white paper summarizes the discussions of this workshop. It considers the motivating science use cases, identified cross-cutting algorithms, software, and services, their high-level technical specifications, and the principles of inclusive collaborations needed to develop them. We provide it as a useful roadmap of needs, as well as to spur action and collaboration between groups and individuals looking to develop reusable software for early LSST science.

preprint2022arXiv

Galaxies and Halos on Graph Neural Networks: Deep Generative Modeling Scalar and Vector Quantities for Intrinsic Alignment

In order to prepare for the upcoming wide-field cosmological surveys, large simulations of the Universe with realistic galaxy populations are required. In particular, the tendency of galaxies to naturally align towards overdensities, an effect called intrinsic alignments (IA), can be a major source of systematics in the weak lensing analysis. As the details of galaxy formation and evolution relevant to IA cannot be simulated in practice on such volumes, we propose as an alternative a Deep Generative Model. This model is trained on the IllustrisTNG-100 simulation and is capable of sampling the orientations of a population of galaxies so as to recover the correct alignments. In our approach, we model the cosmic web as a set of graphs, where the graphs are constructed for each halo, and galaxy orientations as a signal on those graphs. The generative model is implemented on a Generative Adversarial Network architecture and uses specifically designed Graph-Convolutional Networks sensitive to the relative 3D positions of the vertices. Given (sub)halo masses and tidal fields, the model is able to learn and predict scalar features such as galaxy and dark matter subhalo shapes; and more importantly, vector features such as the 3D orientation of the major axis of the ellipsoid and the complex 2D ellipticities. For correlations of 3D orientations the model is in good quantitative agreement with the measured values from the simulation, except for at very small and transition scales. For correlations of 2D ellipticities, the model is in good quantitative agreement with the measured values from the simulation on all scales. Additionally, the model is able to capture the dependence of IA on mass, morphological type and central/satellite type.

preprint2022arXiv

Galaxy Distribution Incompleteness Testing Using Self-Organizing Maps

The calibration of redshift distributions for photometric samples using spectroscopic surveys is plagued by the difficulty in modelling the selection functions of spectroscopic surveys. In this work, we analyse how these selection functions impact redshift inference and quantify the induced biases using local calibration tests in photometry space. The study is carried out using simulations that mimic the radial selection function of a spectroscopic survey and an accompanying mock catalog of a photometric galaxy survey catalog. We use a self-organizing map to partition the photometry space and perform a local $χ^2$ test to study the probability calibration of redshift inferences that use the spectroscopic data for calibration. The goal of this work is to investigate the effect of uncorrected selection functions in the calibration data on redshift prediction accuracy and critically discuss mitigation methods. In particular we test culling-based bias correction techniques, that aim to remove redshift calibration biases by identifying regions in photometry with few spectroscopic calibration data, and propose avenues for future research. We found that removing regions in color-magnitude space that are underpopulated with spectroscopic calibration data does not remove all biases in redshift inference induced by the selection function.

preprint2022arXiv

GREAT3 results I: systematic errors in shear estimation and the impact of real galaxy morphology

We present first results from the third GRavitational lEnsing Accuracy Testing (GREAT3) challenge, the third in a sequence of challenges for testing methods of inferring weak gravitational lensing shear distortions from simulated galaxy images. GREAT3 was divided into experiments to test three specific questions, and included simulated space- and ground-based data with constant or cosmologically-varying shear fields. The simplest (control) experiment included parametric galaxies with a realistic distribution of signal-to-noise, size, and ellipticity, and a complex point spread function (PSF). The other experiments tested the additional impact of realistic galaxy morphology, multiple exposure imaging, and the uncertainty about a spatially-varying PSF; the last two questions will be explored in Paper II. The 24 participating teams competed to estimate lensing shears to within systematic error tolerances for upcoming Stage-IV dark energy surveys, making 1525 submissions overall. GREAT3 saw considerable variety and innovation in the types of methods applied. Several teams now meet or exceed the targets in many of the tests conducted (to within the statistical errors). We conclude that the presence of realistic galaxy morphology in simulations changes shear calibration biases by $\sim 1$ per cent for a wide range of methods. Other effects such as truncation biases due to finite galaxy postage stamps, and the impact of galaxy type as measured by the Sérsic index, are quantified for the first time. Our results generalize previous studies regarding sensitivities to galaxy size and signal-to-noise, and to PSF properties such as seeing and defocus. Almost all methods' results support the simple model in which additive shear biases depend linearly on PSF ellipticity.

preprint2022arXiv

Intrinsic alignments of bulges and discs

Galaxies exhibit coherent alignments with local structure in the Universe. This effect, called Intrinsic Alignments (IA), is an important contributor to the systematic uncertainties for wide-field weak lensing surveys. On cosmological distance scales, intrinsic shape alignments have been observed in red galaxies, which are usually bulge-dominated; while blue galaxies, which are mostly disc-dominated, exhibit shape alignments consistent with a null detection. However, disc-dominated galaxies typically consist of two prominent structures: disc and bulge. Since the bulge component has similar properties as elliptical galaxies and is thought to have formed in a similar fashion, naturally one could ask whether the bulge components exhibit similar alignments as ellipticals? In this paper, we investigate how different components of galaxies exhibit IA in the TNG100-1 cosmological hydrodynamical simulation, as well as the dependence of IA on the fraction of stars in rotation-dominated structures at $z=0$. The measurements were controlled for mass differences between the samples. We find that the bulges exhibit significantly higher IA signals, with a nonlinear alignment model amplitude of $A_I = 2.98^{+0.36}_{-0.37}$ compared to the amplitude for the galaxies as a whole (both components), $A_I = 1.13^{+0.37}_{-0.35}$. The results for bulges are statistically consistent with those for elliptical galaxies, which have $A_I = 3.47^{+0.57}_{-0.57}$. These results highlight the importance of studying galaxy dynamics in order to understand galaxy alignments and their cosmological implications.

preprint2022arXiv

Snowmass2021 Cosmic Frontier White Paper: Enabling Flagship Dark Energy Experiments to Reach their Full Potential

A new generation of powerful dark energy experiments will open new vistas for cosmology in the next decade. However, these projects cannot reach their utmost potential without data from other telescopes. This white paper focuses in particular on the compelling benefits of ground-based spectroscopic and photometric observations to complement the Vera C. Rubin Observatory, as well as smaller programs in aid of a DESI-2 experiment and CMB-S4. These additional data sets will both improve dark energy constraints from these flagship projects beyond what would possible on their own and open completely new windows into fundamental physics. For example, additional photometry and single-object spectroscopy will provide necessary follow-up information for supernova and strong lensing cosmology, while highly-multiplexed spectroscopy both from smaller facilities over wide fields and from larger facilities over narrower regions of sky will yield more accurate photometric redshift estimates for weak lensing and galaxy clustering measurements from the Rubin Observatory, provide critical spectroscopic host galaxy redshifts for supernova Hubble diagrams, provide improved understanding of limiting astrophysical systematic effects, and enable new measurements that probe the nature of gravity. A common thread is that access to complementary data from a range of telescopes/instruments would have a substantial impact on the rate of advance of dark energy science in the coming years.

preprint2022arXiv

Snowmass2021 Cosmic Frontier White Paper: Rubin Observatory after LSST

The Vera C. Rubin Observatory will begin the Legacy Survey of Space and Time (LSST) in 2024, spanning an area of 18,000 square degrees in six bands, with more than 800 observations of each field over ten years. The unprecedented data set will enable great advances in the study of the formation and evolution of structure and exploration of physics of the dark universe. The observations will hold clues about the cause for the accelerated expansion of the universe and possibly the nature of dark matter. During the next decade, LSST will be able to confirm or dispute if tensions seen today in cosmological data are due to new physics. New and unexpected phenomena could confirm or disrupt our current understanding of the universe. Findings from LSST will guide the path forward post-LSST. The Rubin Observatory will still be a uniquely powerful facility even then, capable of revealing further insights into the physics of the dark universe. These could be obtained via innovative observing strategies, e.g., targeting new probes at shorter timescales than with LSST, or via modest instrumental changes, e.g., new filters, or through an entirely new instrument for the focal plane. This White Paper highlights some of the opportunities in each scenario from Rubin observations after LSST.

preprint2022arXiv

Snowmass2021: Opportunities from Cross-survey Analyses of Static Probes

Cosmological data in the next decade will be characterized by high-precision, multi-wavelength measurements of thousands of square degrees of the same patches of sky. By performing multi-survey analyses that harness the correlated nature of these datasets, we will gain access to new science, and increase the precision and robustness of science being pursued by each individual survey. However, effective application of such analyses requires a qualitatively new level of investment in cross-survey infrastructure, including simulations, associated modeling, coordination of data sharing, and survey strategy. The scientific gains from this new level of investment are multiplicative, as the benefits can be reaped by even present-day instruments, and can be applied to new instruments as they come online.

preprint2022arXiv

Snowmass2021: Vera C. Rubin Observatory as a Flagship Dark Matter Experiment

Establishing that Vera C. Rubin Observatory is a flagship dark matter experiment is an essential pathway toward understanding the physical nature of dark matter. In the past two decades, wide-field astronomical surveys and terrestrial laboratories have jointly created a phase transition in the ecosystem of dark matter models and probes. Going forward, any robust understanding of dark matter requires astronomical observations, which still provide the only empirical evidence for dark matter to date. We have a unique opportunity right now to create a dark matter experiment with Rubin Observatory Legacy Survey of Space and Time (LSST). This experiment will be a coordinated effort to perform dark matter research, and provide a large collaborative team of scientists with the necessary organizational and funding supports. This approach leverages existing investments in Rubin. Studies of dark matter with Rubin LSST will also guide the design of, and confirm the results from, other dark matter experiments. Supporting a collaborative team to carry out a dark matter experiment with Rubin LSST is the key to achieving the dark matter science goals that have already been identified as high priority by the high-energy physics and astronomy communities.

preprint2022arXiv

The Third Gravitational Lensing Accuracy Testing (GREAT3) Challenge Handbook

The GRavitational lEnsing Accuracy Testing 3 (GREAT3) challenge is the third in a series of image analysis challenges, with a goal of testing and facilitating the development of methods for analyzing astronomical images that will be used to measure weak gravitational lensing. This measurement requires extremely precise estimation of very small galaxy shape distortions, in the presence of far larger intrinsic galaxy shapes and distortions due to the blurring kernel caused by the atmosphere, telescope optics, and instrumental effects. The GREAT3 challenge is posed to the astronomy, machine learning, and statistics communities, and includes tests of three specific effects that are of immediate relevance to upcoming weak lensing surveys, two of which have never been tested in a community challenge before. These effects include realistically complex galaxy models based on high-resolution imaging from space; spatially varying, physically-motivated blurring kernel; and combination of multiple different exposures. To facilitate entry by people new to the field, and for use as a diagnostic tool, the simulation software for the challenge is publicly available, though the exact parameters used for the challenge are blinded. Sample scripts to analyze the challenge data using existing methods will also be provided. See http://great3challenge.info and http://great3.projects.phys.ucl.ac.uk/leaderboard/ for more information.

preprint2022arXiv

The three-year shear catalog of the Subaru Hyper Suprime-Cam SSP Survey

We present the galaxy shear catalog that will be used for the three-year cosmological weak gravitational lensing analyses using data from the Wide layer of the Hyper Suprime-Cam (HSC) Subaru Strategic Program (SSP) Survey. The galaxy shapes are measured from the $i$-band imaging data acquired from 2014 to 2019 and calibrated with image simulations that resemble the observing conditions of the survey based on training galaxy images from the Hubble Space Telescope in the COSMOS region. The catalog covers an area of 433.48 deg$^2$ of the northern sky, split into six fields. The mean $i$-band seeing is 0.59 arcsec. With conservative galaxy selection criteria (e.g., $i$-band magnitude brighter than 24.5), the observed raw galaxy number density is 22.9 arcmin$^{-2}$, and the effective galaxy number density is 19.9 arcmin$^{-2}$. The calibration removes the galaxy property-dependent shear estimation bias to a level: $|δm|<9\times 10^{-3}$. The bias residual $δm$ shows no dependence on redshift in the range $0<z\leq 3$. We define the requirements for cosmological weak lensing science for this shear catalog, and quantify potential systematics in the catalog using a series of internal null tests for systematics related to point-spread function modelling and shear estimation. A variety of the null tests are statistically consistent with zero or within requirements, but (i) there is evidence for PSF model shape residual correlations; and (ii) star-galaxy shape correlations reveal additive systematics. Both effects become significant on $>1$ degree scales and will require mitigation during the inference of cosmological parameters using cosmic shear measurements.

preprint2022arXiv

Validating Synthetic Galaxy Catalogs for Dark Energy Science in the LSST Era

Large simulation efforts are required to provide synthetic galaxy catalogs for ongoing and upcoming cosmology surveys. These extragalactic catalogs are being used for many diverse purposes covering a wide range of scientific topics. In order to be useful, they must offer realistically complex information about the galaxies they contain. Hence, it is critical to implement a rigorous validation procedure that ensures that the simulated galaxy properties faithfully capture observations and delivers an assessment of the level of realism attained by the catalog. We present here a suite of validation tests that have been developed by the Rubin Observatory Legacy Survey of Space and Time (LSST) Dark Energy Science Collaboration (DESC). We discuss how the inclusion of each test is driven by the scientific targets for static ground-based dark energy science and by the availability of suitable validation data. The validation criteria that are used to assess the performance of a catalog are flexible and depend on the science goals. We illustrate the utility of this suite by showing examples for the validation of cosmoDC2, the extragalactic catalog recently released for the LSST DESC second Data Challenge.

preprint2021arXiv

Impact of Image Persistence in the Roman Space Telescope High-Latitude Survey

The High Latitude Survey of the Nancy Grace Roman Space Telescope is expected to measure the positions and shapes of hundreds of millions of galaxies in an area of 2220 deg$^2$. This survey will provide high-quality weak lensing data with unprecedented systematics control. The Roman Space Telescope will survey the sky in near infrared (NIR) bands using Teledyne H4RG HgCdTe photodiode arrays. These NIR arrays exhibit an effect called persistence: charges that are trapped in the photodiodes during earlier exposures are gradually released into later exposures, leading to contamination of the images and potentially to errors in measured galaxy properties such as fluxes and shapes. In this work, we use image simulations that incorporate the persistence effect to study its impact on galaxy shape measurements and weak lensing signals. No significant spatial correlations are found between the galaxy shape changes induced by persistence. On the scales of interest for weak lensing cosmology, the effect of persistence on the weak lensing correlation function is about two orders of magnitude lower than the Roman Space Telescope additive shear error budget, indicating that the persistence effect is expected to be a subdominant contributor to the systematic error budget for weak lensing with the Roman Space Telescope given its current design.

preprint2021arXiv

Optimization of the Observing Cadence for the Rubin Observatory Legacy Survey of Space and Time: a pioneering process of community-focused experimental design

Vera C. Rubin Observatory is a ground-based astronomical facility under construction, a joint project of the National Science Foundation and the U.S. Department of Energy, designed to conduct a multi-purpose 10-year optical survey of the southern hemisphere sky: the Legacy Survey of Space and Time. Significant flexibility in survey strategy remains within the constraints imposed by the core science goals of probing dark energy and dark matter, cataloging the Solar System, exploring the transient optical sky, and mapping the Milky Way. The survey&#39;s massive data throughput will be transformational for many other astrophysics domains and Rubin&#39;s data access policy sets the stage for a huge potential users&#39; community. To ensure that the survey science potential is maximized while serving as broad a community as possible, Rubin Observatory has involved the scientific community at large in the process of setting and refining the details of the observing strategy. The motivation, history, and decision-making process of this strategy optimization are detailed in this paper, giving context to the science-driven proposals and recommendations for the survey strategy included in this Focus Issue.

preprint2021arXiv

The Impact of Light Polarization Effects on Weak Lensing Systematics

A fraction of the light observed from edge-on disk galaxies is polarized due to two physical effects: selective extinction by dust grains aligned with the magnetic field, and scattering of the anisotropic starlight field. Since the reflection and transmission coefficients of the reflecting and refracting surfaces in an optical system depend on the polarization of incoming rays, this optical polarization produces both (a) a selection bias in favor of galaxies with specific orientations and (b) a polarization-dependent PSF. In this work we build toy models to obtain for the first time an estimate for the impact of polarization on PSF shapes and the impact of the selection bias due to the polarization effect on the measurement of the ellipticity used in shear measurements. In particular, we are interested in determining if this effect will be significant for WFIRST. We show that the systematic uncertainties in the ellipticity components are $8\times 10^{-5}$ and $1.1 \times 10^{-4}$ due to the selection bias and PSF errors respectively. Compared to the overall requirements on knowledge of the WFIRST PSF ellipticity ($4.7\times 10^{-4}$ per component), both of these systematic uncertainties are sufficiently close to the WFIRST tolerance level that more detailed studies of the polarization effects or more stringent requirements on polarization-sensitive instrumentation for WFIRST are required.

preprint2021arXiv

The Impact of Observing Strategy on Cosmological Constraints with LSST

The generation-defining Vera C. Rubin Observatory will make state-of-the-art measurements of both the static and transient universe through its Legacy Survey for Space and Time (LSST). With such capabilities, it is immensely challenging to optimize the LSST observing strategy across the survey&#39;s wide range of science drivers. Many aspects of the LSST observing strategy relevant to the LSST Dark Energy Science Collaboration, such as survey footprint definition, single visit exposure time and the cadence of repeat visits in different filters, are yet to be finalized. Here, we present metrics used to assess the impact of observing strategy on the cosmological probes considered most sensitive to survey design; these are large-scale structure, weak lensing, type Ia supernovae, kilonovae and strong lens systems (as well as photometric redshifts, which enable many of these probes). We evaluate these metrics for over 100 different simulated potential survey designs. Our results show that multiple observing strategy decisions can profoundly impact cosmological constraints with LSST; these include adjusting the survey footprint, ensuring repeat nightly visits are taken in different filters and enforcing regular cadence. We provide public code for our metrics, which makes them readily available for evaluating further modifications to the survey design. We conclude with a set of recommendations and highlight observing strategy factors that require further research.

preprint2021arXiv

The LSST DESC DC2 Simulated Sky Survey

We describe the simulated sky survey underlying the second data challenge (DC2) carried out in preparation for analysis of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) by the LSST Dark Energy Science Collaboration (LSST DESC). Significant connections across multiple science domains will be a hallmark of LSST; the DC2 program represents a unique modeling effort that stresses this interconnectivity in a way that has not been attempted before. This effort encompasses a full end-to-end approach: starting from a large N-body simulation, through setting up LSST-like observations including realistic cadences, through image simulations, and finally processing with Rubin&#39;s LSST Science Pipelines. This last step ensures that we generate data products resembling those to be delivered by the Rubin Observatory as closely as is currently possible. The simulated DC2 sky survey covers six optical bands in a wide-fast-deep (WFD) area of approximately 300 deg^2 as well as a deep drilling field (DDF) of approximately 1 deg^2. We simulate 5 years of the planned 10-year survey. The DC2 sky survey has multiple purposes. First, the LSST DESC working groups can use the dataset to develop a range of DESC analysis pipelines to prepare for the advent of actual data. Second, it serves as a realistic testbed for the image processing software under development for LSST by the Rubin Observatory. In particular, simulated data provide a controlled way to investigate certain image-level systematic effects. Finally, the DC2 sky survey enables the exploration of new scientific ideas in both static and time-domain cosmology.

preprint2021arXiv

The Role of Machine Learning in the Next Decade of Cosmology

In recent years, machine learning (ML) methods have remarkably improved how cosmologists can interpret data. The next decade will bring new opportunities for data-driven cosmological discovery, but will also present new challenges for adopting ML methodologies and understanding the results. ML could transform our field, but this transformation will require the astronomy community to both foster and promote interdisciplinary research endeavors.

preprint2020arXiv

Cosmological constraints from galaxy-lensing cross correlations using BOSS galaxies with SDSS and CMB lensing

We present cosmological parameter constraints based on a joint modeling of galaxy-lensing cross correlations and galaxy clustering measurements in the SDSS, marginalizing over small-scale modeling uncertainties using mock galaxy catalogs, without explicit modeling of galaxy bias. We show that our modeling method is robust to the impact of different choices for how galaxies occupy dark matter halos and to the impact of baryonic physics (at the $\sim2\%$ level in cosmological parameters) and test for the impact of covariance on the likelihood analysis and of the survey window function on the theory computations. Applying our results to the measurements using galaxy samples from BOSS and lensing measurements using shear from SDSS galaxies and CMB lensing from Planck, with conservative scale cuts, we obtain $S_8\equiv\left(\frac{σ_8}{0.8228}\right)^{0.8}\left(\frac{Ω_m}{0.307}\right)^{0.6}=0.85\pm0.05$ (stat.) using LOWZ $\times$ SDSS galaxy lensing, and $S_8=0.91\pm0.1$ (stat.) using combination of LOWZ and CMASS $\times$ Planck CMB lensing. We estimate the systematic uncertainty in the galaxy-galaxy lensing measurements to be $\sim6\%$ (dominated by photometric redshift uncertainties) and in the galaxy-CMB lensing measurements to be $\sim3\%$, from small scale modeling uncertainties including baryonic physics.

preprint2020arXiv

Estimating redshift distributions using Hierarchical Logistic Gaussian processes

This work uses hierarchical logistic Gaussian processes to infer true redshift distributions of samples of galaxies, through their cross-correlations with spatially overlapping spectroscopic samples. We demonstrate that this method can accurately estimate these redshift distributions in a fully Bayesian manner jointly with galaxy-dark matter bias models. We forecast how systematic biases in the redshift-dependent galaxy-dark matter bias model affect redshift inference. Using published galaxy-dark matter bias measurements from the Illustris simulation, we compare these systematic biases with the statistical error budget from a forecasted weak gravitational lensing measurement. If the redshift-dependent galaxy-dark matter bias model is mis-specified, redshift inference can be biased. This can propagate into relative biases in the weak lensing convergence power spectrum on the 10% - 30% level. We, therefore, showcase a methodology to detect these sources of error using Bayesian model selection techniques. Furthermore, we discuss the improvements that can be gained from incorporating prior information from Bayesian template fitting into the model, both in redshift prediction accuracy and in the detection of systematic modeling biases.

preprint2020arXiv

Optimising LSST Observing Strategy for Weak Lensing Systematics

The LSST survey will provide unprecedented statistical power for measurements of dark energy. Consequently, controlling systematic uncertainties is becoming more important than ever. The LSST observing strategy will affect the statistical uncertainty and systematics control for many science cases; here, we focus on weak lensing systematics. The fact that the LSST observing strategy involves hundreds of visits to the same sky area provides new opportunities for systematics mitigation. We explore these opportunities by testing how different dithering strategies (pointing offsets and rotational angle of the camera in different exposures) affect additive weak lensing shear systematics on a baseline operational simulation, using the $ρ-$statistics formalism. Some dithering strategies improve systematics control at the end of the survey by a factor of up to $\sim 3-4$ better than others. We find that a random translational dithering strategy, applied with random rotational dithering at every filter change, is the most effective of those strategies tested in this work at averaging down systematics. Adopting this dithering algorithm, we explore the effect of varying the area of the survey footprint, exposure time, number of exposures in a visit, and exposure to the Galactic plane. We find that any change that increases the average number of exposures (in filters relevant to weak lensing) reduces the additive shear systematics. Some ways to achieve this increase may not be favorable for the weak lensing statistical constraining power or for other probes, and we explore the relative trade-offs between these options given constraints on the overall survey parameters.

preprint2020arXiv

The LSST DESC Data Challenge 1: Generation and Analysis of Synthetic Images for Next Generation Surveys

Data Challenge 1 (DC1) is the first synthetic dataset produced by the Rubin Observatory Legacy Survey of Space and Time (LSST) Dark Energy Science Collaboration (DESC). DC1 is designed to develop and validate data reduction and analysis and to study the impact of systematic effects that will affect the LSST dataset. DC1 is comprised of $r$-band observations of 40 deg$^{2}$ to 10-year LSST depth. We present each stage of the simulation and analysis process: a) generation, by synthesizing sources from cosmological N-body simulations in individual sensor-visit images with different observing conditions; b) reduction using a development version of the LSST Science Pipelines; and c) matching to the input cosmological catalog for validation and testing. We verify that testable LSST requirements pass within the fidelity of DC1. We establish a selection procedure that produces a sufficiently clean extragalactic sample for clustering analyses and we discuss residual sample contamination, including contributions from inefficiency in star-galaxy separation and imperfect deblending. We compute the galaxy power spectrum on the simulated field and conclude that: i) survey properties have an impact of 50\% of the statistical uncertainty for the scales and models used in DC1 ii) a selection to eliminate artifacts in the catalogs is necessary to avoid biases in the measured clustering; iii) the presence of bright objects has a significant impact (2- to 6-$σ$) in the estimated power spectra at small scales ($\ell > 1200$), highlighting the impact of blending in studies at small angular scales in LSST;

preprint2020arXiv

Validation of PSF Models for HST and Other Space-Based Observations

Forthcoming space-based observations will require high-quality point-spread function (PSF) models for weak gravitational lensing measurements. One approach to generating these models is using a wavefront model based on the known telescope optics. We present an empirical framework for validating such models to confirm that they match the actual PSF to within requirements by comparing the models to the observed light distributions of isolated stars. We apply this framework to Tiny Tim, the standard tool for generating model PSFs for the Hubble Space Telescope (HST), testing its models against images taken by HST&#39;s Advanced Camera for Surveys in the Wide Field Channel. We show that Tiny Tim&#39;s models, in the default configuration, differ significantly from the observed PSFs, most notably in their sizes. We find that the quality of Tiny Tim PSFs can be improved through fitting the full set of Zernike polynomial coefficients which characterise the optics, to the point where the practical significance of the difference between model and observed PSFs is negligible for most use cases, resulting in additive and multiplicative biases both of order approximately 4e-4. We also show that most of this improvement can be retained through using an updated set of Zernike coefficients, which we provide.

preprint2019arXiv

CosmoDC2: A Synthetic Sky Catalog for Dark Energy Science with LSST

This paper introduces cosmoDC2, a large synthetic galaxy catalog designed to support precision dark energy science with the Large Synoptic Survey Telescope (LSST). CosmoDC2 is the starting point for the second data challenge (DC2) carried out by the LSST Dark Energy Science Collaboration (LSST DESC). The catalog is based on a trillion-particle, 4.225 Gpc^3 box cosmological N-body simulation, the `Outer Rim&#39; run. It covers 440 deg^2 of sky area to a redshift of z=3 and is complete to a magnitude depth of 28 in the r-band. Each galaxy is characterized by a multitude of properties including stellar mass, morphology, spectral energy distributions, broadband filter magnitudes, host halo information and weak lensing shear. The size and complexity of cosmoDC2 requires an efficient catalog generation methodology; our approach is based on a new hybrid technique that combines data-driven empirical approaches with semi-analytic galaxy modeling. A wide range of observation-based validation tests has been implemented to ensure that cosmoDC2 enables the science goals of the planned LSST DESC DC2 analyses. This paper also represents the official release of the cosmoDC2 data set, including an efficient reader that facilitates interaction with the data.

preprint2019arXiv

Galaxy-Galaxy Lensing in HSC: Validation Tests and the Impact of Heterogeneous Spectroscopic Training Sets

Although photometric redshifts (photo-z&#39;s) are crucial ingredients for current and upcoming large-scale surveys, the high-quality spectroscopic redshifts currently available to train, validate, and test them are substantially non-representative in both magnitude and color. We investigate the nature and structure of this bias by tracking how objects from a heterogeneous training sample contribute to photo-z predictions as a function of magnitude and color, and illustrate that the underlying redshift distribution at fixed color can evolve strongly as a function of magnitude. We then test the robustness of the galaxy-galaxy lensing signal in 120 deg$^2$ of HSC-SSP DR1 data to spectroscopic completeness and photo-z biases, and find that their impacts are sub-dominant to current statistical uncertainties. Our methodology provides a framework to investigate how spectroscopic incompleteness can impact photo-z-based weak lensing predictions in future surveys such as LSST and WFIRST.

preprint2019arXiv

The evolution of galaxy intrinsic alignments in the MassiveBlack II universe

We investigate the redshift evolution of the intrinsic alignments (IA) of galaxies in the \texttt{MassiveBlackII} (MBII) simulation. We select galaxy samples above fixed subhalo mass cuts ($M_h>10^{11,12,13}~M_{\odot}/h$) at $z=0.6$ and trace their progenitors to $z=3$ along their merger trees. Dark matter components of $z=0.6$ galaxies are more spherical than their progenitors while stellar matter components tend to be less spherical than their progenitors. The distribution of the galaxy-subhalo misalignment angle peaks at $\sim10~\mathrm{deg}$ with a mild increase with time. The evolution of the ellipticity-direction~(ED) correlation amplitude $ω(r)$ of galaxies (which quantifies the tendency of galaxies to preferentially point towards surrounding matter overdensities) is governed by the evolution in the alignment of underlying dark matter~(DM) subhaloes to the matter density of field, as well as the alignment between galaxies and their DM subhaloes. At scales $\sim1~\mathrm{cMpc}/h$, the alignment between DM subhaloes and matter overdensity gets suppressed with time, whereas the alignment between galaxies and DM subhaloes is enhanced. These competing tendencies lead to a complex redshift evolution of $ω(r)$ for galaxies at $\sim1~\mathrm{cMpc}/h$. At scales $>1~\mathrm{cMpc}/h$, alignment between DM subhaloes and matter overdensity does not evolve significantly; the evolution of the galaxy-subhalo misalignment therefore leads to an increase in $ω(r)$ for galaxies by a factor of $\sim4$ from $z=3$ to $0.6$ at scales $>1~\mathrm{cMpc}/h$. The balance between competing physical effects is scale dependant, leading to different conclusions at much smaller scales($\sim0.1~\mathrm{Mpc}/h$).

preprint2019arXiv

Tomographic galaxy clustering with the Subaru Hyper Suprime-Cam first year public data release

We analyze the clustering of galaxies in the first public data release of the HSC Subaru Strategic Program. Despite the relatively small footprints of the observed fields, the data are an excellent proxy for the deep photometric datasets that will be acquired by LSST, and are therefore an ideal test bed for the analysis methods being implemented by the LSST DESC. We select a magnitude limited sample with $i<24.5$ and analyze it in four redshift bins covering $0.15\lesssim z \lesssim1.5$. We carry out a Fourier-space analysis of the two-point clustering of this sample, including all auto- and cross-correlations. We demonstrate the use of map-level deprojection methods to account for fluctuations in the galaxy number density caused by observational systematics. Through an HOD analysis, we place constraints on the characteristic halo masses of this sample, finding a good fit up to scales $k_{\rm max}=1\,{\rm Mpc}^{-1}$, including both auto- and cross-correlations. Our results show monotonically decreasing average halo masses, which can be interpreted in terms of the drop-out of red galaxies at high redshifts for a flux-limited sample. In terms of photometric redshift systematics, we show that additional care is needed in order to marginalize over uncertainties in the redshift distribution in galaxy clustering, and that these uncertainties can be constrained by including cross-correlations. We are able to make a $\sim3σ$ detection of lensing magnification in the HSC data. Our results are stable to variations in $σ_8$ and $Ω_c$ and we find constraints that agree well with measurements from Planck and low-redshift probes. Finally, we use our pipeline to study the clustering of galaxies as a function of limiting flux, and provide a simple fitting function for the linear galaxy bias for magnitude limited samples as a function of limiting magnitude and redshift. [abridged]