Researcher profile

Duncan Campbell

Duncan Campbell contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
10 works
0 followers
4 topics
4 close collaborators

Published work

10 published items

preprint | 2022 | arXiv

Galaxies and Halos on Graph Neural Networks: Deep Generative Modeling Scalar and Vector Quantities for Intrinsic Alignment

In order to prepare for the upcoming wide-field cosmological surveys, large simulations of the Universe with realistic galaxy populations are required. In particular, the tendency of galaxies to naturally align towards overdensities, an effect called intrinsic alignments (IA), can be a major source of systematics in the weak lensing analysis. As the details of galaxy formation and evolution relevant to IA cannot be simulated in practice on such volumes, we propose as an alternative a Deep Generative Model. This model is trained on the IllustrisTNG-100 simulation and is capable of sampling the orientations of a population of galaxies so as to recover the correct alignments. In our approach, we model the cosmic web as a set of graphs, where the graphs are constructed for each halo, and galaxy orientations as a signal on those graphs. The generative model is implemented on a Generative Adversarial Network architecture and uses specifically designed Graph-Convolutional Networks sensitive to the relative 3D positions of the vertices. Given (sub)halo masses and tidal fields, the model is able to learn and predict scalar features such as galaxy and dark matter subhalo shapes; and more importantly, vector features such as the 3D orientation of the major axis of the ellipsoid and the complex 2D ellipticities. For correlations of 3D orientations the model is in good quantitative agreement with the measured values from the simulation, except at very small and transition scales. For correlations of 2D ellipticities, the model is in good quantitative agreement with the measured values from the simulation on all scales. Additionally, the model is able to capture the dependence of IA on mass, morphological type and central/satellite type.

preprint | 2022 | arXiv

Validating Synthetic Galaxy Catalogs for Dark Energy Science in the LSST Era

Large simulation efforts are required to provide synthetic galaxy catalogs for ongoing and upcoming cosmology surveys. These extragalactic catalogs are being used for many diverse purposes covering a wide range of scientific topics. In order to be useful, they must offer realistically complex information about the galaxies they contain. Hence, it is critical to implement a rigorous validation procedure that ensures that the simulated galaxy properties faithfully capture observations and delivers an assessment of the level of realism attained by the catalog. We present here a suite of validation tests that have been developed by the Rubin Observatory Legacy Survey of Space and Time (LSST) Dark Energy Science Collaboration (DESC). We discuss how the inclusion of each test is driven by the scientific targets for static ground-based dark energy science and by the availability of suitable validation data. The validation criteria that are used to assess the performance of a catalog are flexible and depend on the science goals. We illustrate the utility of this suite by showing examples for the validation of cosmoDC2, the extragalactic catalog recently released for the LSST DESC second Data Challenge.

preprint | 2021 | arXiv

Planning as Inference in Epidemiological Models

In this work we demonstrate how to automate parts of the infectious disease-control policy-making process by performing inference in existing epidemiological models. The inference tasks undertaken include computing the posterior distribution over those simulation model parameters that are controllable through direct policy-making choices and that give rise to acceptable disease progression outcomes. Among other things, we illustrate the use of a probabilistic programming language that automates inference in existing simulators. Neither the full capabilities of this tool for automating inference nor its utility for planning is widely disseminated at the current time. Timely gains in understanding of how such simulation-based models and inference automation tools can be applied in support of policymaking could lead to less economically damaging policy prescriptions, particularly during the current COVID-19 pandemic.
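
To make the idea of a posterior over controllable parameters concrete, here is a minimal sketch. It is not the paper's method: the paper uses a probabilistic programming language on existing simulators, whereas this sketch swaps in plain rejection sampling on a toy discrete-time SIR model. The function names, the uniform prior, the baseline rates, and the 10% peak threshold are all hypothetical choices made for illustration.

```python
import random


def simulate_sir(beta, gamma=0.1, days=200, i0=0.001):
    """Toy discrete-time SIR model on population fractions.

    Returns the peak infected fraction seen over the simulated window.
    """
    s, i = 1.0 - i0, i0
    peak = i
    for _ in range(days):
        new_infections = beta * s * i
        recoveries = gamma * i
        s -= new_infections
        i += new_infections - recoveries
        peak = max(peak, i)
    return peak


def acceptable_policies(n_samples=2000, max_peak=0.1, baseline_beta=0.3, seed=0):
    """Rejection sampling: keep policy values whose simulated outcome is acceptable.

    The controllable parameter is a fractional reduction in transmission,
    drawn from a hypothetical uniform prior on [0, 1].
    """
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_samples):
        reduction = rng.random()
        beta = baseline_beta * (1.0 - reduction)
        if simulate_sir(beta) <= max_peak:
            accepted.append(reduction)
    return accepted


if __name__ == "__main__":
    samples = acceptable_policies()
    print(f"accepted {len(samples)} of 2000 prior draws")
    print(f"smallest accepted reduction: {min(samples):.2f}")
```

The accepted samples approximate the posterior over the policy parameter conditioned on the outcome being acceptable. A planner would then look for the least costly value in that set.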

preprint | 2020 | arXiv

Void Galaxies Follow a Distinct Evolutionary Path in the Environmental COntext Catalog

We measure the environmental dependence, where environment is defined by the distance to the third nearest neighbor, of multiple galaxy properties inside the Environmental COntext (ECO) catalog. We focus primarily on void galaxies, which we define as the $10 \%$ of galaxies having the lowest local density. We compare the properties of void and non-void galaxies: baryonic mass, color, fractional stellar mass growth rate (FSMGR), morphology, and gas-to-stellar-mass ratio (estimated from a combination of HI data and photometric gas fractions calibrated with the RESOLVE survey). Our void galaxies typically have lower baryonic masses than galaxies in denser environments, and they display the properties expected of a lower mass population: they have more late-types, are bluer, have higher FSMGR, and are more gas rich. We control for baryonic mass and investigate the extent to which void galaxies are different at fixed mass. Void galaxies are bluer, more gas-rich, and more star forming at fixed mass than non-void galaxies, which is a possible signature of galaxy assembly bias. Furthermore, we show that these trends persist even at fixed mass and morphology, and we find that voids host a distinct population of early-types that are bluer and more star-forming than the typical red and quenched early-types. In addition to these empirical observational results, we also present theoretical results from mock catalogs with built-in galaxy assembly bias. We show that a simple matching of galaxy properties to (sub)halo properties, such as mass and age, can recover the observed environmental trends in ECO galaxies.
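
The environment definition in this abstract (distance to the third nearest neighbor, with the lowest-density 10% labeled as void galaxies) can be sketched as follows. This is an illustrative brute-force version that assumes plain 3D Euclidean distances. The abstract does not say how the catalog measures distance, and the function names are hypothetical.

```python
import math


def kth_neighbor_distance(points, k=3):
    """Distance from each point to its k-th nearest neighbor (brute force)."""
    distances = []
    for i, p in enumerate(points):
        others = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        distances.append(others[k - 1])
    return distances


def select_void_galaxies(points, k=3, fraction=0.10):
    """Indices of the `fraction` of points with the lowest local density.

    A larger k-th neighbor distance means a lower local density, so the
    points are ranked by that distance in descending order.
    """
    d_k = kth_neighbor_distance(points, k)
    n_void = max(1, round(fraction * len(points)))
    order = sorted(range(len(points)), key=lambda i: d_k[i], reverse=True)
    return sorted(order[:n_void])


if __name__ == "__main__":
    # 18 galaxies on a compact grid plus 2 isolated ones.
    cluster = [(float(x), float(y), 0.0) for x in range(6) for y in range(3)]
    isolated = [(100.0, 100.0, 0.0), (-100.0, -50.0, 0.0)]
    print(select_void_galaxies(cluster + isolated))
```

For catalogs of realistic size, a spatial index such as a k-d tree would replace the quadratic loop.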

preprint | 2019 | arXiv

CosmoDC2: A Synthetic Sky Catalog for Dark Energy Science with LSST

This paper introduces cosmoDC2, a large synthetic galaxy catalog designed to support precision dark energy science with the Large Synoptic Survey Telescope (LSST). CosmoDC2 is the starting point for the second data challenge (DC2) carried out by the LSST Dark Energy Science Collaboration (LSST DESC). The catalog is based on a trillion-particle, 4.225 Gpc^3 box cosmological N-body simulation, the 'Outer Rim' run. It covers 440 deg^2 of sky area to a redshift of z=3 and is complete to a magnitude depth of 28 in the r-band. Each galaxy is characterized by a multitude of properties including stellar mass, morphology, spectral energy distributions, broadband filter magnitudes, host halo information and weak lensing shear. The size and complexity of cosmoDC2 require an efficient catalog generation methodology; our approach is based on a new hybrid technique that combines data-driven empirical approaches with semi-analytic galaxy modeling. A wide range of observation-based validation tests has been implemented to ensure that cosmoDC2 enables the science goals of the planned LSST DESC DC2 analyses. This paper also represents the official release of the cosmoDC2 data set, including an efficient reader that facilitates interaction with the data.

preprint | 2019 | arXiv

Generating Synthetic Cosmological Data with GalSampler

As part of the effort to meet the needs of the Large Synoptic Survey Telescope Dark Energy Science Collaboration (LSST DESC) for accurate, realistically complex mock galaxy catalogs, we have developed GalSampler, an open-source python package that assists in generating large volumes of synthetic cosmological data. The key idea behind GalSampler is to recast hydrodynamical simulations and semi-analytic models as physically-motivated galaxy libraries. GalSampler populates a new, larger-volume halo catalog with galaxies drawn from the baseline library; by using weighted sampling guided by empirical modeling techniques, GalSampler inherits statistical accuracy from the empirical model and physically-motivated complexity from the baseline library. We have recently used GalSampler to produce the cosmoDC2 extragalactic catalog made for the LSST DESC Data Challenge 2. Using cosmoDC2 as a guiding example, we outline how GalSampler can continue to support ongoing and near-future galaxy surveys such as the Dark Energy Survey (DES), the Dark Energy Spectroscopic Instrument (DESI), WFIRST, and Euclid.

preprint | 2019 | arXiv

How to Optimally Constrain Galaxy Assembly Bias: Supplement Projected Correlation Functions with Count-in-cells Statistics

Most models for the connection between galaxies and their haloes ignore the possibility that galaxy properties may be correlated with halo properties other than mass, a phenomenon known as galaxy assembly bias. Yet, it is known that such correlations can lead to systematic errors in the interpretation of survey data. At present, the degree to which galaxy assembly bias may be present in the real Universe, and the best strategies for constraining it remain uncertain. We study the ability of several observables to constrain galaxy assembly bias from redshift survey data using the decorated halo occupation distribution (dHOD), an empirical model of the galaxy--halo connection that incorporates assembly bias. We cover an expansive set of observables, including the projected two-point correlation function $w_{\mathrm{p}}(r_{\mathrm{p}})$, the galaxy--galaxy lensing signal $ΔΣ(r_{\mathrm{p}})$, the void probability function $\mathrm{VPF}(r)$, the distributions of counts-in-cylinders $P(N_{\mathrm{CIC}})$, and counts-in-annuli $P(N_{\mathrm{CIA}})$, and the distribution of the ratio of counts in cylinders of different sizes $P(N_2/N_5)$. We find that despite the frequent use of the combination $w_{\mathrm{p}}(r_{\mathrm{p}})+ΔΣ(r_{\mathrm{p}})$ in interpreting galaxy data, the count statistics, $P(N_{\mathrm{CIC}})$ and $P(N_{\mathrm{CIA}})$, are generally more efficient in constraining galaxy assembly bias when combined with $w_{\mathrm{p}}(r_{\mathrm{p}})$. Constraints based upon $w_{\mathrm{p}}(r_{\mathrm{p}})$ and $ΔΣ(r_{\mathrm{p}})$ share common degeneracy directions in the parameter space, while combinations of $w_{\mathrm{p}}(r_{\mathrm{p}})$ with the count statistics are more complementary. Therefore, we strongly suggest that count statistics should be used to complement the canonical observables in future studies of the galaxy--halo connection.
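
As a rough illustration of the counts-in-cylinders statistic $P(N_{\mathrm{CIC}})$ named above, the sketch below counts, for each galaxy, the companions inside a cylinder centered on it and aligned with the line of sight. The cylinder radius and half-length, the choice of the z axis as the line of sight, and the function names are assumptions. The sketch ignores periodic boundaries and redshift-space effects, which a real analysis would have to handle.

```python
import math
from collections import Counter


def counts_in_cylinders(galaxies, r_p, half_length):
    """Number of companions inside a line-of-sight cylinder around each galaxy.

    galaxies: list of (x, y, z) positions, with z taken as the line of sight.
    r_p: projected (x-y) radius of the cylinder.
    half_length: half the cylinder length along z.
    """
    counts = []
    for i, (x, y, z) in enumerate(galaxies):
        n = 0
        for j, (x2, y2, z2) in enumerate(galaxies):
            if i == j:
                continue
            if math.hypot(x2 - x, y2 - y) <= r_p and abs(z2 - z) <= half_length:
                n += 1
        counts.append(n)
    return counts


def count_distribution(counts):
    """Empirical P(N): fraction of galaxies with each companion count."""
    total = len(counts)
    histogram = Counter(counts)
    return {n: histogram[n] / total for n in sorted(histogram)}


if __name__ == "__main__":
    sample = [(0, 0, 0), (1, 0, 5), (0, 1, -5), (10, 10, 0), (0, 0, 30)]
    n_cic = counts_in_cylinders(sample, r_p=2.0, half_length=10.0)
    print(n_cic)
    print(count_distribution(n_cic))
```

Counts-in-annuli follow the same pattern, with the projected-distance test replaced by an inner and an outer radius.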

preprint | 2017 | arXiv

Brightest galaxies as halo centre tracers in SDSS DR7

Determining the positions of halo centres in large-scale structure surveys is crucial for many cosmological studies. A common assumption is that halo centres correspond to the location of their brightest member galaxies. In this paper, we study the dynamics of brightest galaxies with respect to other halo members in the Sloan Digital Sky Survey DR7. Specifically, we look at the line-of-sight velocity and spatial offsets between brightest galaxies and their neighbours. We compare those to detailed mock catalogues, constructed from high-resolution, dark-matter-only $N$-body simulations, in which it is assumed that satellite galaxies trace dark matter subhaloes. This allows us to place constraints on the fraction $f_{\rm BNC}$ of haloes in which the brightest galaxy is not the central. Compared to previous studies we explicitly take into account the unrelaxed state of the host haloes, velocity offsets of halo cores and correlations between $f_{\rm BNC}$ and the satellite occupation. We find that $f_{\rm BNC}$ strongly decreases with the luminosity of the brightest galaxy and increases with the mass of the host halo. Overall, in the halo mass range $10^{13} - 10^{14.5} h^{-1} M_\odot$ we find $f_{\rm BNC} \sim 30\%$, in good agreement with a previous study by Skibba et al. We discuss the implications of these findings for studies inferring the galaxy--halo connection from satellite kinematics, models of the conditional luminosity function and galaxy formation in general.

preprint | 2017 | arXiv

The Galaxy Clustering Crisis in Abundance Matching

Galaxy clustering on small scales is significantly under-predicted by sub-halo abundance matching (SHAM) models that populate (sub-)haloes with galaxies based on peak halo mass, $M_{\rm peak}$. SHAM models based on the peak maximum circular velocity, $V_{\rm peak}$, have had much better success. The primary reason $M_{\rm peak}$ based models fail is the relatively low abundance of satellite galaxies produced in these models compared to those based on $V_{\rm peak}$. Despite success in predicting clustering, a simple $V_{\rm peak}$ based SHAM model results in predictions for galaxy growth that are at odds with observations. We evaluate three possible remedies that could "save" mass-based SHAM: (1) SHAM models require a significant population of "orphan" galaxies as a result of artificial disruption/merging of sub-haloes in modern high resolution dark matter simulations; (2) satellites must grow significantly after their accretion; and (3) stellar mass is significantly affected by halo assembly history. No solution is entirely satisfactory. However, regardless of the particulars, we show that popular SHAM models based on $M_{\rm peak}$ cannot be complete physical models as presented. Either $V_{\rm peak}$ truly is a better predictor of stellar mass at $z\sim 0$ and it remains to be seen how the correlation between stellar mass and $V_{\rm peak}$ comes about, or SHAM models are missing vital component(s) that significantly affect galaxy clustering.
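
The core step of abundance matching is a rank-ordered assignment, sketched below in its simplest form. The sketch makes several assumptions: it has no scatter, and equal-length lists stand in for matching cumulative number densities in equal volumes. The function name and the toy values are hypothetical and do not come from the paper.

```python
def abundance_match(halo_property, galaxy_property):
    """Rank-order matching with no scatter.

    The halo with the i-th largest property value receives the i-th largest
    galaxy value. The result is aligned with the input halo list.
    """
    if len(halo_property) != len(galaxy_property):
        raise ValueError("halo and galaxy lists must have the same length")
    halo_order = sorted(
        range(len(halo_property)), key=lambda i: halo_property[i], reverse=True
    )
    ranked_galaxies = sorted(galaxy_property, reverse=True)
    assigned = [None] * len(halo_property)
    for rank, halo_index in enumerate(halo_order):
        assigned[halo_index] = ranked_galaxies[rank]
    return assigned


if __name__ == "__main__":
    # Toy (sub)haloes: the same objects ranked by two different properties.
    m_peak = [1e12, 5e13, 3e11]    # peak halo mass
    v_peak = [150.0, 400.0, 180.0]  # peak maximum circular velocity
    stellar_mass = [1e10, 1e11, 5e9]
    print(abundance_match(m_peak, stellar_mass))
    print(abundance_match(v_peak, stellar_mass))
```

The two calls give different assignments for the first and third haloes. In this toy setting, that shows how the choice between $M_{\rm peak}$ and $V_{\rm peak}$ changes which (sub)haloes host the more massive galaxies.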

preprint | 2017 | arXiv

The Immitigable Nature of Assembly Bias: The Impact of Halo Definition on Assembly Bias

Dark matter halo clustering depends not only on halo mass, but also on other properties such as concentration and shape. This phenomenon is known broadly as assembly bias. We explore the dependence of assembly bias on halo definition, parametrized by spherical overdensity parameter, $Δ$. We summarize the strength of concentration-, shape-, and spin-dependent halo clustering as a function of halo mass and halo definition. Concentration-dependent clustering depends strongly on mass at all $Δ$. For conventional halo definitions ($Δ\sim 200\mathrm{m}-600\mathrm{m}$), concentration-dependent clustering at low mass is driven by a population of haloes that is altered through interactions with neighbouring haloes. Concentration-dependent clustering can be greatly reduced through a mass-dependent halo definition with $Δ\sim 20\mathrm{m}-40\mathrm{m}$ for haloes with $M_{200\mathrm{m}} \lesssim 10^{12}\, h^{-1}\mathrm{M}_{\odot}$. Smaller $Δ$ implies larger radii and mitigates assembly bias at low mass by subsuming altered, so-called backsplash haloes into now larger host haloes. At higher masses ($M_{200\mathrm{m}} \gtrsim 10^{13}\, h^{-1}\mathrm{M}_{\odot}$) larger overdensities, $Δ\gtrsim 600\mathrm{m}$, are necessary. Shape- and spin-dependent clustering are significant for all halo definitions that we explore and exhibit a relatively weaker mass dependence. Generally, both the strength and the sense of assembly bias depend on halo definition, varying significantly even among common definitions. We identify no halo definition that mitigates all manifestations of assembly bias. A halo definition that mitigates assembly bias based on one halo property (e.g., concentration) must be mass dependent. The halo definitions that best mitigate concentration-dependent halo clustering do not coincide with the expected average splashback radii at fixed halo mass.
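
For orientation, the spherical overdensity definition that the abstract varies can be written in the standard form below. This is the usual convention, not a formula quoted from the paper. It assumes that the suffix "m" means $Δ$ is measured relative to the mean matter density, written here as $\bar{\rho}_{\mathrm{m}}$.

```latex
M_{\Delta} = \frac{4}{3}\pi R_{\Delta}^{3}\,\Delta\,\bar{\rho}_{\mathrm{m}}
\quad\Longrightarrow\quad
R_{\Delta} = \left(\frac{3 M_{\Delta}}{4\pi\,\Delta\,\bar{\rho}_{\mathrm{m}}}\right)^{1/3}
```

Because the mean enclosed density of a halo falls with radius, lowering $Δ$ moves the boundary $R_{Δ}$ outward. This is consistent with the abstract's statement that smaller $Δ$ implies larger radii.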