Researcher profile

Hugh Dickinson

Hugh Dickinson contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2022arXiv

Detecting gravitational lenses using machine learning: exploring interpretability and sensitivity to rare lensing configurations

Forthcoming large imaging surveys such as Euclid and the Vera Rubin Observatory Legacy Survey of Space and Time are expected to find more than $10^5$ strong gravitational lens systems, including many rare and exotic populations such as compound lenses, but these $10^5$ systems will be interspersed among much larger catalogues of $\sim10^9$ galaxies. This volume of data is too much for visual inspection by volunteers alone to be feasible and gravitational lenses will only appear in a small fraction of these data which could cause a large amount of false positives. Machine learning is the obvious alternative but the algorithms' internal workings are not obviously interpretable, so their selection functions are opaque and it is not clear whether they would select against important rare populations. We design, build, and train several Convolutional Neural Networks (CNNs) to identify strong gravitational lenses using VIS, Y, J, and H bands of simulated data, with F1 scores between 0.83 and 0.91 on 100,000 test set images. We demonstrate for the first time that such CNNs do not select against compound lenses, obtaining recall scores as high as 76\% for compound arcs and 52\% for double rings. We verify this performance using Hubble Space Telescope (HST) and Hyper Suprime-Cam (HSC) data of all known compound lens systems. Finally, we explore for the first time the interpretability of these CNNs using Deep Dream, Guided Grad-CAM, and by exploring the kernels of the convolutional layers, to illuminate why CNNs succeed in compound lens selection.

preprint2022arXiv

Galaxy Zoo: Clump Scout: Surveying the Local Universe for Giant Star-forming Clumps

Massive, star-forming clumps are a common feature of high-redshift star-forming galaxies. How they formed, and why they are so rare at low redshift, remains unclear. In this paper we identify the largest yet sample of clumpy galaxies (7,052) at low redshift using data from the citizen science project \textit{Galaxy Zoo: Clump Scout}, in which volunteers classified over 58,000 Sloan Digital Sky Survey (SDSS) galaxies spanning redshift $0.02 < z < 0.15$. We apply a robust completeness correction by comparing with simulated clumps identified by the same method. Requiring that the ratio of clump-to-galaxy flux in the SDSS $u$ band be greater than 8\% (similar to clump definitions used by other works), we estimate the fraction of local galaxies hosting at least one clump ($f_{clumpy}$) to be $2.68_{-0.30}^{+0.33}\%$. We also compute the same fraction with a less stringent cut of 3\% ($11.33_{-1.16}^{+0.89}\%$), as the higher number count and lower statistical noise of this fraction permits sharper comparison with future low-redshift clumpy galaxy studies. Our results reveal a sharp decline in $f_{clumpy}$ over $0 < z < 0.5$. The minor merger rate remains roughly constant over the same span, so we suggest that minor mergers are unlikely to be the primary driver of clump formation. Instead, the rate of galaxy turbulence is a better tracer for $f_{clumpy}$ over $0 < z < 1.5$ for galaxies of all masses, which supports the idea that clump formation is primarily driven by violent disk instability for all galaxy populations during this period.

preprint2022arXiv

Practical Galaxy Morphology Tools from Deep Supervised Representation Learning

Astronomers have typically set out to solve supervised machine learning problems by creating their own representations from scratch. We show that deep learning models trained to answer every Galaxy Zoo DECaLS question learn meaningful semantic representations of galaxies that are useful for new tasks on which the models were never trained. We exploit these representations to outperform several recent approaches at practical tasks crucial for investigating large galaxy samples. The first task is identifying galaxies of similar morphology to a query galaxy. Given a single galaxy assigned a free text tag by humans (e.g. &#34;#diffuse&#34;), we can find galaxies matching that tag for most tags. The second task is identifying the most interesting anomalies to a particular researcher. Our approach is 100% accurate at identifying the most interesting 100 anomalies (as judged by Galaxy Zoo 2 volunteers). The third task is adapting a model to solve a new task using only a small number of newly-labelled galaxies. Models fine-tuned from our representation are better able to identify ring galaxies than models fine-tuned from terrestrial images (ImageNet) or trained from scratch. We solve each task with very few new labels; either one (for the similarity search) or several hundred (for anomaly detection or fine-tuning). This challenges the longstanding view that deep supervised methods require new large labelled datasets for practical use in astronomy. To help the community benefit from our pretrained models, we release our fine-tuning code Zoobot. Zoobot is accessible to researchers with no prior experience in deep learning.

preprint2019arXiv

Modeling with the Crowd: Optimizing the Human-Machine Partnership with Zooniverse

LSST and Euclid must address the daunting challenge of analyzing the unprecedented volumes of imaging and spectroscopic data that these next-generation instruments will generate. A promising approach to overcoming this challenge involves rapid, automatic image processing using appropriately trained Deep Learning (DL) algorithms. However, reliable application of DL requires large, accurately labeled samples of training data. Galaxy Zoo Express (GZX) is a recent experiment that simulated using Bayesian inference to dynamically aggregate binary responses provided by citizen scientists via the Zooniverse crowd-sourcing platform in real time. The GZX approach enables collaboration between human and machine classifiers and provides rapidly generated, reliably labeled datasets, thereby enabling online training of accurate machine classifiers. We present selected results from GZX and show how the Bayesian aggregation engine it uses can be extended to efficiently provide object-localization and bounding-box annotations of two-dimensional data with quantified reliability. DL algorithms that are trained using these annotations will facilitate numerous panchromatic data modeling tasks including morphological classification and substructure detection in direct imaging, as well as decontamination and emission line identification for slitless spectroscopy. Effectively combining the speed of modern computational analyses with the human capacity to extrapolate from few examples will be critical if the potential of forthcoming large-scale surveys is to be realized.

preprint2017arXiv

GAMBIT: The Global and Modular Beyond-the-Standard-Model Inference Tool

We describe the open-source global fitting package GAMBIT: the Global And Modular Beyond-the-Standard-Model Inference Tool. GAMBIT combines extensive calculations of observables and likelihoods in particle and astroparticle physics with a hierarchical model database, advanced tools for automatically building analyses of essentially any model, a flexible and powerful system for interfacing to external codes, a suite of different statistical methods and parameter scanning algorithms, and a host of other utilities designed to make scans faster, safer and more easily-extendible than in the past. Here we give a detailed description of the framework, its design and motivation, and the current models and other specific components presently implemented in GAMBIT. Accompanying papers deal with individual modules and present first GAMBIT results. GAMBIT can be downloaded from gambit.hepforge.org.

preprint2012arXiv

Handling Systematic Uncertainties and Combined Source Analyses for Atmospheric Cherenkov Telescopes

In response to an increasing availability of statistically rich observational data sets, the performance and applicability of traditional Atmospheric Cherenkov Telescope analyses in the regime of systematically dominated measurement uncertainties is examined. In particular, the effect of systematic uncertainties affecting the relative normalisation of fiducial ON and OFF-source sampling regions - often denoted as α - is investigated using combined source analysis as a representative example case. The traditional summation of accumulated ON and OFF-source event counts is found to perform sub-optimally in the studied contexts and requires careful calibration to correct for unexpected and potentially misleading statistical behaviour. More specifically, failure to recognise and correct for erroneous estimates of α is found to produce substantial overestimates of the combined population significance which worsen with increasing target multiplicity. An alternative joint likelihood technique is introduced, which is designed to treat systematic uncertainties in a uniform and statistically robust manner. This alternate method is shown to yield dramatically enhanced performance and reliability with respect to the more traditional approach.