Researcher profile

Benjamin D. Youngman

Benjamin D. Youngman contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - Baseline
3works
0followers
1topics
2close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2020arXiv

Flexible models for nonstationary dependence: Methodology and examples

There are many situations when modelling environmental phenomena for which it is not appropriate to assume a stationary dependence structure. \cite{sampson1992} proposed an approach to allowing nonstationarity in dependence based on a deformed space: coordinates from original geographic "$G$" space are mapped to a new dispersion "$D$" space in which stationary dependence is a reasonable assumption. \cite{sampson1992} achieve this with two deformation functions, which are chosen as thin plate splines, each representing how one of the two coordinates in $D$-space relates to the original $G$-space coordinates. This works extends the deformation approach, and the dimension expansion approach of \cite{bornn2012}, to a regression-based framework in which all dimensions in $D$-space are treated as "smooths" as found, for example, in generalized additive models. The framework offers an intuitive and user-friendly approach to specifying $D$-space, allows different levels of smoothing for dimensions in $D$-space, and allows objective inference for all model parameters. Furthermore, a numerical approach is proposed to avoid non-bijective deformations, should they occur, which applies to any deformation. The proposed framework is demonstrated on the solar radiation data studied in \cite{sampson1992}, and then on an example related to risk analysis, which culminates in producing simulations of extreme rainfall for part of Colorado, US.

preprint2016arXiv

Inference for spatial processes using imperfect data from measurements and numerical simulations

We present a framework for inference for spatial processes that have actual values imperfectly represented by data. Environmental processes represented as spatial fields, either at fixed time points, or aggregated over fixed time periods, are studied. Data from both measurements and simulations performed by complex computer models are used to infer actual values of the spatial fields. Methods from geostatistics and statistical emulation are used to explicitly capture discrepancies between a spatial field's actual and simulated values. A geostatistical model captures spatial discrepancy: the difference in spatial structure between simulated and actual values. An emulator represents the intensity discrepancy: the bias in simulated values of given intensity. Measurement error is also represented. Gaussian process priors represent each source of error, which gives an analytical expression for the posterior distribution for the actual spatial field. Actual footprints for 50 European windstorms, which represent maximum wind gust speeds on a grid over a 72-hour period, are derived from wind gust speed measurements taken at stations across Europe and output simulated from a downscaled version of the Met Office Unified Model. The derived footprints have realistic spatial structure, and gust speeds closer to the measurements than originally simulated.

preprint2014arXiv

Calibration of Complex Computer Simulators using Likelihood Emulation

We calibrate a Natural History Model, which is a class of computer simulator used in the health industry, and here has been used to characterise bowel cancer incidence for the UK. The simulator tracks the development of bowel cancer in a sample of people, and its output mostly stratifies bowel cancer occurrence by patient age and bowel cancer type. Its output relies on 25 unknown inputs, which we are required to calibrate. In order to do this we must address that not only is the output count data, but it is also stochastic, due to the simulation procedure. We cannot feasibly achieve calibration of the simulator using Monte Carlo methods alone, as it is of `moderate' computational expense. To achieve a reliable calibration, we must also specify its discrepancy: how, when calibrated, it differs from reality. We propose a method for calibration that combines a statistical emulator for the likelihood function with importance sampling. The emulator provides an interim sample of inputs at which the simulator is run, from which the likelihood is calculated. Importance sampling is then used to re-weight the inputs and provide a final sample of calibrated inputs. Re-calculating the importance weights incurs little computational cost, and so we can easily investigate how different discrepancy specifications affect calibration.