Researcher profile

Rupert A. C. Croft

Rupert A. C. Croft contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2022arXiv

Deep Learning nearby galaxy peculiar velocities

We explore how information in images of nearby galaxies can be used to estimate their distance. We train a convolutional Neural Network (NN) to do this, using galaxy images from the Illustris simulation. We show that if the NN is trained on data with random errors added to the true distance (representing training using spectroscopic redshift instead of actual distance), then the NN can predict distances in a test dataset with greater accuracy than it was given in the training set. This is not unusual, as often NNs are trained on data with added noise, in order to increase robustness. In this case, however, it offers a route to estimating peculiar velocities of nearby galaxies. Given a galaxy with a known spectroscopic redshift one can use the NN-predicted distance to make an estimate of the peculiar velocity. Trying this using relatively low resolution (1.4 arcsec per pixel) simulated galaxy images we find fractional RMS distance errors of 7.7% for galaxies at a mean distance of 75 Mpc from the observer, leading to RMS peculiar velocity errors of 440 km/s. In a companion paper we apply the technique to 145,115 nearby galaxies from the NASA Sloan Atlas.

preprint2022arXiv

The BlueTides Mock Image Catalogue: Simulated observations of high-redshift galaxies and predictions for JWST imaging surveys

We present a mock image catalogue of ~100,000 MUV=-22.5 to -19.6 mag galaxies at z=7-12 from the BlueTides cosmological simulation. We create mock images of each galaxy with the James Webb (JWST), Hubble, Roman, and Euclid Space Telescopes, as well as Subaru, and VISTA, with a range of near- and mid-infrared filters. We perform photometry on the mock images to estimate the success of these instruments for detecting high-z galaxies. We predict that JWST will have unprecedented power in detecting high-z galaxies, with a 95% completeness limit at least 2.5 magnitudes fainter than VISTA and Subaru, 1.1 magnitudes fainter than Hubble, and 0.9 magnitudes fainter than Roman, for the same wavelength and exposure time. Focusing on JWST, we consider a range of exposure times and filters, and find that the NIRCam F356W and F277W filters will detect the faintest galaxies, with 95% completeness at m=27.4 mag in 10ks exposures. We also predict the number of high-z galaxies that will be discovered by upcoming JWST imaging surveys. We predict that the COSMOS-Web survey will detect ~1000 MUV<-20.1 mag galaxies at 6.5<z<7.5, by virtue of its large survey area. JADES-Medium will detect almost 100% of MUV<-20 mag galaxies at z<8.5 due to its significant depth, however with its smaller survey area it will detect only ~100 of these galaxies at 6.5<z<7.5. Cosmic variance results in a large range in the number of predicted galaxies each survey will detect, which is more evident in smaller surveys such as CEERS and the PEARLS NEP and GOODS-S fields.

preprint2022arXiv

The Impact of Dust on the Sizes of Galaxies in the Epoch of Reionization

We study the sizes of galaxies in the Epoch of Reionization using a sample of ~100,000 galaxies from the BlueTides cosmological hydrodynamical simulation from z=7 to 11. We measure the galaxy sizes from stellar mass and luminosity maps, defining the effective radius as the minimum radius which could enclose the pixels containing 50% of the total mass/light in the image. We find an inverse relationship between stellar mass and effective half-mass radius, suggesting that the most massive galaxies are more compact and dense than lower mass galaxies, which have flatter mass distributions. We find a mildly negative relation between intrinsic far-ultraviolet luminosity and size, while we find a positive size-luminosity relation when measured from dust-attenuated images. This suggests that dust is the predominant cause of the observed positive size-luminosity relation, with dust preferentially attenuating bright sight lines resulting in a flatter emission profile and thus larger measured effective radii. We study the size-luminosity relation across the rest-frame ultraviolet and optical, and find that the slope decreases at longer wavelengths; this is a consequence of the relation being caused by dust, which produces less attenuation at longer wavelengths. We find that the far-ultraviolet size-luminosity relation shows mild evolution from z=7 to 11, and galaxy size evolves with redshift as $R\propto(1+z)^{-m}$, where $m=0.662\pm0.009$. Finally, we investigate the sizes of z=7 quasar host galaxies, and find that while the intrinsic sizes of quasar hosts are small relative to the overall galaxy sample, they have comparable sizes when measured from dust-attenuated images.

preprint2021arXiv

Deep Forest: Neural Network reconstruction of intergalactic medium temperature

We explore the use of Deep Learning to infer the temperature of the intergalactic medium from the transmitted flux in the high redshift Lyman-alpha forest. We train Neural Networks on sets of simulated spectra from redshift z=2-3 outputs of cosmological hydrodynamic simulations, including high temperature regions added in post-processing to approximate bubbles heated by Helium-II reionization. We evaluate how well the trained networks are able to reconstruct the temperature from the effect of Doppler broadening in the simulated input Lyman-alpha forest absorption spectra. We find that for spectra with high resolution (10 km/s pixel) and moderate signal to noise (20-50), the neural network is able to reconstruct the IGM temperature smoothed on scales of 6 Mpc/h quite well. Concentrating on discontinuities we find that high temperature regions of width 25 Mpc/h and temperature 20,000 K can be fairly easily detected and characterized. We show an example where multiple sightlines are combined to yield tomographic images of hot bubbles. Deep Learning techniques may be useful in this way to help us understand the complex temperature structure of the intergalactic medium around the time of Helium reionization.

preprint2020arXiv

Large Scale Structure Reconstruction with Short-Wavelength Modes

Large scale density modes are difficult to measure because they are sensitive to systematic observational errors in galaxy surveys, but we can study them indirectly by observing their impact on small scale perturbations. Cosmological perturbation theory predicts that second-order density inhomogeneities are a convolution of a short- and a long-wavelength mode. This arises physically because small scale structures grow at different rates depending on the large scale environment in which they reside. This induces an off-diagonal term in the two-point statistics in Fourier space that we use as the basis for a quadratic estimator for the large scale field. We demonstrate that this quadratic estimator works well on an N-body simulation of size (2.5 h^{-1} Gpc)^3. In particular, the quadratic estimator successfully reconstructs the long-wavelength modes using only small-scale information. This opens up novel opportunities to study structure on the largest observable scales.

preprint2020arXiv

Trend Filtering -- I. A Modern Statistical Tool for Time-Domain Astronomy and Astronomical Spectroscopy

The problem of denoising a one-dimensional signal possessing varying degrees of smoothness is ubiquitous in time-domain astronomy and astronomical spectroscopy. For example, in the time domain, an astronomical object may exhibit a smoothly varying intensity that is occasionally interrupted by abrupt dips or spikes. Likewise, in the spectroscopic setting, a noiseless spectrum typically contains intervals of relative smoothness mixed with localized higher frequency components such as emission peaks and absorption lines. In this work, we present trend filtering, a modern nonparametric statistical tool that yields significant improvements in this broad problem space of denoising $spatially$ $heterogeneous$ signals. When the underlying signal is spatially heterogeneous, trend filtering is superior to any statistical estimator that is a linear combination of the observed data---including kernel smoothers, LOESS, smoothing splines, Gaussian process regression, and many other popular methods. Furthermore, the trend filtering estimate can be computed with practical and scalable efficiency via a specialized convex optimization algorithm, e.g. handling sample sizes of $n\gtrsim10^7$ within a few minutes. In a companion paper, we explicitly demonstrate the broad utility of trend filtering to observational astronomy by carrying out a diverse set of spectroscopic and time-domain analyses.

preprint2020arXiv

Trend Filtering -- II. Denoising Astronomical Signals with Varying Degrees of Smoothness

Trend filtering---first introduced into the astronomical literature in Paper I of this series---is a state-of-the-art statistical tool for denoising one-dimensional signals that possess varying degrees of smoothness. In this work, we demonstrate the broad utility of trend filtering to observational astronomy by discussing how it can contribute to a variety of spectroscopic and time-domain studies. The observations we discuss are (1) the Lyman-$α$ forest of quasar spectra; (2) more general spectroscopy of quasars, galaxies, and stars; (3) stellar light curves with planetary transits; (4) eclipsing binary light curves; and (5) supernova light curves. We study the Lyman-$α$ forest in the greatest detail---using trend filtering to map the large-scale structure of the intergalactic medium along quasar-observer lines of sight. The remaining studies share broad themes of: (1) estimating observable parameters of light curves and spectra; and (2) constructing observational spectral/light-curve templates. We also briefly discuss the utility of trend filtering as a tool for one-dimensional data reduction and compression.

preprint2019arXiv

QSO obscuration at high redshift ($z \gtrsim 7$): Predictions from the BlueTides simulation

High-$z$ AGNs hosted in gas rich galaxies are expected to grow through significantly obscured accretion phases. This may limit or bias their observability. In this work, we use \textsc{BlueTides}, a large volume cosmological simulation of galaxy formation to examine quasar obscuration for the highest-redshift ($z \geq 7$) supermassive black holes residing in the center of galaxies. We find that for the bright quasars, most of the high column density gas ($>90\%$) resides in the innermost regions of the host galaxy, (typically within $< 10$ ckpc), while the gas in the outskirts is a minor contributor to the $N_\mathrm H$. The brightest quasars can have large angular variations in galactic obscuration, over 2 orders of magnitude, where the lines of sight with the lowest obscuration are those formed via strong gas outflows driven by AGN feedback. We find that for the overall AGN population, the mean $N_\mathrm H$ is generally larger for high luminosity and BH mass, while the $N_\mathrm H$ distribution is significantly broadened, developing a low $N_\mathrm H $ wing due to the angular variations driven by the AGN outflows/feedback. The obscured fraction P($N_{\rm H} > 10^{23} {\rm cm}^{-2}$) typically range from 0.6 to 1.0 for increasing $L_{X}$ (with $L_X > 10^{43} \rm{ergs/s}$), with no clear trend of redshift evolution. With respect to the galaxy host property, we find a linear relation between $N_{\rm H}$, $M_*$ and $M_{\rm H_2}$ with $\log N_{\rm H} = (0.24 \pm 0.03) \log M_{*} + (20.7 \pm 0.3)$ and $\log N_{\rm H} = (0.47 \pm 0.03) \log M_{\rm H_2} + (18.4 \pm 0.3)$. The dust optical depth in the UV band $τ_{\mathrm UV}$ has tight positive correlation with $N_{\rm H}$. Our dust extincted UVLF is about 1.5 dex lower than the intrinsic UVLF, implying that more than 99\% of the $z \sim 7$ AGNs are heavily dust extincted and therefore would be missed by the UV band observation.

preprint2019arXiv

Towards Machine-assisted Meta-Studies: The Hubble Constant

We present an approach for automatic extraction of measured values from the astrophysical literature, using the Hubble constant for our pilot study. Our rules-based model -- a classical technique in natural language processing -- has successfully extracted 298 measurements of the Hubble constant, with uncertainties, from the 208,541 available arXiv astrophysics papers. We have also created an artificial neural network classifier to identify papers in arXiv which report novel measurements. From the analysis of our results we find that reporting measurements with uncertainties and the correct units is critical information when distinguishing novel measurements in free text. Our results correctly highlight the current tension for measurements of the Hubble constant and recover the $3.5σ$ discrepancy -- demonstrating that the tool presented in this paper is useful for meta-studies of astrophysical measurements from a large number of publications.