Researcher profile

David B. Stephenson

David B. Stephenson contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
7works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2021arXiv

Towards reliable projections of global mean surface temperature

Quantifying the risk of global warming exceeding critical targets such as 2.0 K requires reliable projections of uncertainty as well as best estimates of Global Mean Surface Temperature (GMST). However, uncertainty bands on GMST projections are often calculated heuristically and have several potential shortcomings. In particular, the uncertainty bands shown in IPCC plume projections of GMST are based on the distribution of GMST anomalies from climate model runs and so are strongly determined by model characteristics with little influence from observations of the real-world. Physically motivated time-series approaches are proposed based on fitting energy balance models (EBMs) to climate model outputs and observations in order to constrain future projections. It is shown that EBMs fitted to one forcing scenario will not produce reliable projections when different forcing scenarios are applied. The errors in the EBM projections can be interpreted as arising due to a discrepancy in the effective forcing felt by the model. A simple time-series approach to correcting the projections is proposed based on learning the evolution of the forcing discrepancy so that it can be projected into the future. These approaches give reliable projections of GMST when tested in a perfect model setting, and when applied to observations lead to well constrained projections with lower mean warming and narrower projection bands than previous estimates. Despite the reduced uncertainty, the lower warming leads to a greatly reduced probability of exceeding the 2.0 K warming target.

preprint2020arXiv

On constraining projections of future climate using observations and simulations from multiple climate models

Numerical climate models are used to project future climate change due to both anthropogenic and natural causes. Differences between projections from different climate models are a major source of uncertainty about future climate. Emergent relationships shared by multiple climate models have the potential to constrain our uncertainty when combined with historical observations. We combine projections from 13 climate models with observational data to quantify the impact of emergent relationships on projections of future warming in the Arctic at the end of the 21st century. We propose a hierarchical Bayesian framework based on a coexchangeable representation of the relationship between climate models and the Earth system. We show how emergent constraints fit into the coexchangeable representation, and extend it to account for internal variability simulated by the models and natural variability in the Earth system. Our analysis shows that projected warming in some regions of the Arctic may be more than 2C lower and our uncertainty reduced by up to 30% when constrained by historical observations. A detailed theoretical comparison with existing multi-model projection frameworks is also provided. In particular, we show that projections may be biased if we do not account for internal variability in climate model predictions.

preprint2016arXiv

Inference for spatial processes using imperfect data from measurements and numerical simulations

We present a framework for inference for spatial processes that have actual values imperfectly represented by data. Environmental processes represented as spatial fields, either at fixed time points, or aggregated over fixed time periods, are studied. Data from both measurements and simulations performed by complex computer models are used to infer actual values of the spatial fields. Methods from geostatistics and statistical emulation are used to explicitly capture discrepancies between a spatial field's actual and simulated values. A geostatistical model captures spatial discrepancy: the difference in spatial structure between simulated and actual values. An emulator represents the intensity discrepancy: the bias in simulated values of given intensity. Measurement error is also represented. Gaussian process priors represent each source of error, which gives an analytical expression for the posterior distribution for the actual spatial field. Actual footprints for 50 European windstorms, which represent maximum wind gust speeds on a grid over a 72-hour period, are derived from wind gust speed measurements taken at stations across Europe and output simulated from a downscaled version of the Met Office Unified Model. The derived footprints have realistic spatial structure, and gust speeds closer to the measurements than originally simulated.

preprint2015arXiv

A Bayesian framework for verification and recalibration of ensemble forecasts: How uncertain is NAO predictability?

Predictability estimates of ensemble prediction systems are uncertain due to limited numbers of past forecasts and observations. To account for such uncertainty, this paper proposes a Bayesian inferential framework that provides a simple 6-parameter representation of ensemble forecasting systems and the corresponding observations. The framework is probabilistic, and thus allows for quantifying uncertainty in predictability measures such as correlation skill and signal-to-noise ratios. It also provides a natural way to produce recalibrated probabilistic predictions from uncalibrated ensembles forecasts. The framework is used to address important questions concerning the skill of winter hindcasts of the North Atlantic Oscillation for 1992-2011 issued by the Met Office GloSea5 climate prediction system. Although there is much uncertainty in the correlation between ensemble mean and observations, there is strong evidence of skill: the 95% credible interval of the correlation coefficient of [0.19,0.68] does not overlap zero. There is also strong evidence that the forecasts are not exchangeable with the observations: With over 99% certainty, the signal-to-noise ratio of the forecasts is smaller than the signal-to-noise ratio of the observations, which suggests that raw forecasts should not be taken as representative scenarios of the observations. Forecast recalibration is thus required, which can be coherently addressed within the proposed framework.

preprint2015arXiv

Evaluating ensemble forecasts by the Ignorance score -- Correcting the finite-ensemble bias

This study considers the application of the Ignorance Score (also known as the Logarithmic Score) in the context of ensemble verification. In particular, we consider the case where an ensemble forecast is transformed to a Normal forecast distribution, and this distribution is evaluated by the Ignorance Score. It is shown that the standard Ignorance score is biased with respect to the ensemble size, such that larger ensembles yield systematically better expected scores. A new estimator of the Ignorance score is derived which is unbiased with respect to the ensemble size. In an application to seasonal climate predictions it is shown that the standard Ignorance score assigns better expected scores to simple climatological ensembles or biased ensembles that have many members, than to physical dynamical and unbiased ensembles with fewer members. By contrast, the new bias-corrected Ignorance score ranks the physical dynamical and unbiased ensembles better than the climatological and biased ones, independent of ensemble size. It is shown that the unbiased estimator has smaller estimator variance and error than the standard estimator, and that it is a fair verification score, which is optimized if the ensemble members are statistically consistent with the observations. The finite ensemble bias of ensemble verification scores is discussed more broadly. It is argued that a bias-correction is appropriate when forecast systems with different ensemble sizes are compared, and when an evaluation of the underlying distribution of the ensemble is of interest; possible applications to unbiased parameter estimation are discussed.

preprint2015arXiv

Spatio-temporal modelling of extreme storms

A flexible spatio-temporal model is implemented to analyse extreme extra-tropical cyclones objectively identified over the Atlantic and Europe in 6-hourly re-analyses from 1979-2009. Spatial variation in the extremal properties of the cyclones is captured using a 150 cell spatial regularisation, latitude as a covariate, and spatial random effects. The North Atlantic Oscillation (NAO) is also used as a covariate and is found to have a significant effect on intensifying extremal storm behaviour, especially over Northern Europe and the Iberian peninsula. Estimates of lower bounds on minimum sea-level pressure are typically 10-50 hPa below the minimum values observed for historical storms with largest differences occurring when the NAO index is positive.

preprint2011arXiv

On the visualisation, verification and recalibration of ternary probabilistic forecasts

We develop a geometrical interpretation of ternary probabilistic forecasts in which forecasts and observations are regarded as points inside a triangle. Within the triangle, we define a continuous colour palette in which hue and colour saturation are defined with reference to the observed climatology. In contrast to current methods, forecast maps created with this colour scheme convey all of the information present in each ternary forecast. The geometrical interpretation is then extended to verification under quadratic scoring rules (of which the Brier Score and the Ranked Probability Score are well--known examples). Each scoring rule defines an associated triangle in which the square roots of the score, the reliability, the uncertainty and the resolution all have natural interpretations as root--mean--square distances. This leads to our proposal for a Ternary Reliability Diagram in which data relating to verification and calibration can be summarised. We illustrate these ideas with data relating to seasonal forecasting of precipitation in South America, including an example of nonlinear forecast calibration. Codes implementing these ideas have been produced using the statistical software package R and are available from the authors.