Researcher profile

Thomas Opitz

Thomas Opitz contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 (Emerging) | Verification L1 | Unclaimed author
11 works
0 followers
4 topics
4 close collaborators

Published work

11 published items

Preprint, 2022, arXiv

A modeler's guide to extreme value software

This review paper surveys recent developments in software implementations for extreme value analyses since the publication of Stephenson and Gilleland (2006) and Gilleland et al. (2013), here with a focus on numerical challenges. We provide a comparative review by topic and highlight differences in existing routines, along with listing areas where software development is lacking. The online supplement contains two vignettes providing a comparison of implementations of frequentist and Bayesian estimation of univariate extreme value models.
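
As a rough illustration of the kind of univariate fit such software performs, the sketch below estimates a generalized extreme value (GEV) distribution by maximum likelihood with SciPy. It is not taken from the paper or its vignettes: the simulated data, the parameter values and the helper name fit_gev are assumptions made for this example. One numerical detail worth knowing is that SciPy's shape parameter c has the opposite sign to the usual shape parameter xi.

```python
import numpy as np
from scipy.stats import genextreme


def fit_gev(block_maxima):
    """Maximum likelihood GEV fit, returned as (xi, mu, sigma).

    SciPy parameterizes the GEV with shape c = -xi, so the sign is
    flipped here to match the usual extreme-value convention.
    """
    c, loc, scale = genextreme.fit(block_maxima)
    return -c, loc, scale


# Simulated block maxima with assumed "true" parameters (illustration only).
rng = np.random.default_rng(0)
true_xi, true_mu, true_sigma = 0.1, 10.0, 2.0
sample = genextreme.rvs(
    c=-true_xi, loc=true_mu, scale=true_sigma, size=2000, random_state=rng
)

xi_hat, mu_hat, sigma_hat = fit_gev(sample)

# 100-block return level: the quantile exceeded with probability 1/100.
return_level_100 = genextreme.ppf(
    1 - 1 / 100, c=-xi_hat, loc=mu_hat, scale=sigma_hat
)
```

Forgetting the sign flip between c and xi is an easy way to turn a heavy-tailed fit into a bounded-tail one, which is why the helper converts it once at the boundary.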

Preprint, 2022, arXiv

Exact Simulation of Max-Infinitely Divisible Processes

Max-infinitely divisible (max-id) processes play a central role in extreme-value theory and include the subclass of all max-stable processes. They allow for a constructive representation based on the pointwise maximum of random functions drawn from a Poisson point process defined on a suitable function space. Simulating from a max-id process is often difficult due to its complex stochastic structure, while calculating its joint density in high dimensions is often numerically infeasible. Therefore, exact and efficient simulation techniques for max-id processes are useful tools for studying the characteristics of the process and for drawing statistical inferences. Inspired by the simulation algorithms for max-stable processes, theory and algorithms to generalize simulation approaches tailored for certain flexible (existing or new) classes of max-id processes are presented. Efficient simulation for a large class of models can be achieved by implementing an adaptive rejection sampling scheme to sidestep a numerical integration step in the algorithm. The results of a simulation study highlight that our simulation algorithm works as expected and is highly accurate and efficient, such that it clearly outperforms customary approximate sampling schemes. As a by-product, new max-id models, which can be represented as pointwise maxima of general location-scale mixtures and possess flexible tail dependence structures capturing a wide range of asymptotic dependence scenarios, are also developed.

Preprint, 2022, arXiv

Joint modeling of landslide counts and sizes using spatial marked point processes with sub-asymptotic mark distributions

To accurately quantify landslide hazard in a region of Turkey, we develop new marked point process models within a Bayesian hierarchical framework for the joint prediction of landslide counts and sizes. To accommodate for the dominant role of the few largest landslides in aggregated sizes, we leverage mark distributions with strong justification from extreme-value theory, thus bridging the two broad areas of statistics of extremes and marked point patterns. At the data level, we assume a Poisson distribution for landslide counts, while we compare different "sub-asymptotic" distributions for landslide sizes to flexibly model their upper and lower tails. At the latent level, Poisson intensities and the median of the size distribution vary spatially in terms of fixed and random effects, with shared spatial components capturing cross-correlation between landslide counts and sizes. We robustly model spatial dependence using intrinsic conditional autoregressive priors. Our novel models are fitted efficiently using a customized adaptive Markov chain Monte Carlo algorithm. We show that, for our dataset, sub-asymptotic mark distributions provide improved predictions of large landslide sizes compared to more traditional choices. To showcase the benefits of joint occurrence-size models and illustrate their usefulness for risk assessment, we map landslide hazard along major roads.

Preprint, 2020, arXiv

Bayesian space-time gap filling for inference on extreme hot-spots: an application to Red Sea surface temperatures

We develop a method for probabilistic prediction of extreme value hot-spots in a spatio-temporal framework, tailored to big datasets containing important gaps. In this setting, direct calculation of summaries from data, such as the minimum over a space-time domain, is not possible. To obtain predictive distributions for such cluster summaries, we propose a two-step approach. We first model marginal distributions with a focus on accurate modeling of the right tail and then, after transforming the data to a standard Gaussian scale, we estimate a Gaussian space-time dependence model defined locally in the time domain for the space-time subregions where we want to predict. In the first step, we detrend the mean and standard deviation of the data and fit a spatially resolved generalized Pareto distribution to apply a correction of the upper tail. To ensure spatial smoothness of the estimated trends, we either pool data using nearest-neighbor techniques, or apply generalized additive regression modeling. To cope with high space-time resolution of data, the local Gaussian models use a Markov representation of the Matérn correlation function based on the stochastic partial differential equations (SPDE) approach. In the second step, they are fitted in a Bayesian framework through the integrated nested Laplace approximation implemented in R-INLA. Finally, posterior samples are generated to provide statistical inferences through Monte-Carlo estimation. Motivated by the 2019 Extreme Value Analysis data challenge, we illustrate our approach to predict the distribution of local space-time minima in anomalies of Red Sea surface temperatures, using a gridded dataset (11315 days, 16703 pixels) with artificially generated gaps. In particular, we show the improved performance of our two-step approach over a purely Gaussian model without tail transformations.
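
The marginal step described above (tail correction with a generalized Pareto distribution, then a transform to the standard Gaussian scale) can be sketched as follows. This is a simplified illustration, not the authors' implementation: it skips the detrending of mean and standard deviation, treats a single series rather than a spatially resolved fit, and the function name to_gaussian_scale and the 0.95 threshold level are assumptions.

```python
import numpy as np
from scipy.stats import genpareto, norm


def to_gaussian_scale(x, threshold_prob=0.95):
    """Map data to a standard Gaussian scale with a GP-corrected upper tail.

    Below the threshold the empirical distribution function is used;
    above it, a generalized Pareto tail fitted to the exceedances.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    u = np.quantile(x, threshold_prob)

    # Empirical probabilities from ranks, kept strictly inside (0, 1).
    ranks = np.argsort(np.argsort(x)) + 1
    p = ranks / (n + 1.0)

    # Fit the GP distribution to threshold exceedances (location fixed at 0).
    exceed = x > u
    xi, _, sigma = genpareto.fit(x[exceed] - u, floc=0.0)

    # Tail probabilities: P(X <= x) = 1 - P(X > u) * P(X - u > x - u | X > u).
    p_exceed = exceed.mean()
    p[exceed] = 1.0 - p_exceed * genpareto.sf(
        x[exceed] - u, xi, loc=0.0, scale=sigma
    )

    # Guard against probabilities of exactly 0 or 1 before the probit map.
    p = np.clip(p, 1e-10, 1 - 1e-10)
    return norm.ppf(p)


# Skewed toy data standing in for detrended observations (assumption).
rng = np.random.default_rng(1)
x = rng.gamma(2.0, size=5000)
z = to_gaussian_scale(x)
```

After this transform a Gaussian dependence model can be fitted to z, and predictions are mapped back through the inverse of the same marginal transform.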

Preprint, 2020, arXiv

High-resolution Bayesian mapping of landslide hazard with unobserved trigger event

Statistical models for landslide hazard enable mapping of risk factors and landslide occurrence intensity by using geomorphological covariates available at high spatial resolution. However, the spatial distribution of the triggering event (e.g., precipitation or earthquakes) is often not directly observed. In this paper, we develop Bayesian spatial hierarchical models for point patterns of landslide occurrences using different types of log-Gaussian Cox processes. Starting from a competitive baseline model that captures the unobserved precipitation trigger through a spatial random effect at slope unit resolution, we explore novel complex model structures that take clusters of events arising at small spatial scales into account, as well as nonlinear or spatially-varying covariate effects. For a 2009 event of around 4000 precipitation-triggered landslides in Sicily, Italy, we show how to fit our proposed models efficiently using the integrated nested Laplace approximation (INLA), and rigorously compare the performance of our models both from a statistical and applied perspective. In this context, we argue that model comparison should not be based on a single criterion, and that different models of various complexity may provide insights into complementary aspects of the same applied problem. In our application, our models are found to have mostly the same spatial predictive performance, implying that key to successful prediction is the inclusion of a slope-unit resolved random effect capturing the precipitation trigger. Interestingly, a parsimonious formulation of space-varying slope effects reflects a physical interpretation of the precipitation trigger: in subareas with weak trigger, the slope steepness is shown to be mostly irrelevant.
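
For readers unfamiliar with log-Gaussian Cox processes, the following sketch simulates one on a regular grid over the unit square: a Gaussian random field gives the log-intensity, and counts per cell are Poisson given that field. It only illustrates the model class, not the fitting with INLA, and every name and parameter value (simulate_lgcp, the exponential covariance, the grid size) is an assumption chosen for the example.

```python
import numpy as np


def simulate_lgcp(n_cells=20, mean_log_intensity=5.0, sigma=0.5,
                  corr_range=0.2, seed=0):
    """Simulate gridded counts from a log-Gaussian Cox process.

    Returns the latent log-intensity field and the Poisson counts,
    both as (n_cells, n_cells) arrays on the unit square.
    """
    rng = np.random.default_rng(seed)

    # Cell centres of a regular grid on [0, 1] x [0, 1].
    centers = (np.arange(n_cells) + 0.5) / n_cells
    gx, gy = np.meshgrid(centers, centers)
    pts = np.column_stack([gx.ravel(), gy.ravel()])

    # Exponential covariance (a Matern covariance with smoothness 1/2).
    dists = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    cov = sigma**2 * np.exp(-dists / corr_range)

    # Sample the Gaussian field through a Cholesky factor (small jitter
    # added for numerical stability).
    chol = np.linalg.cholesky(cov + 1e-8 * np.eye(len(pts)))
    field = mean_log_intensity + chol @ rng.standard_normal(len(pts))

    # Given the field, cell counts are independent Poisson variables with
    # mean intensity * cell area.
    cell_area = 1.0 / n_cells**2
    counts = rng.poisson(np.exp(field) * cell_area)

    return field.reshape(n_cells, n_cells), counts.reshape(n_cells, n_cells)


log_intensity, counts = simulate_lgcp()
```

In an applied model the constant mean_log_intensity would be replaced by a linear predictor built from covariates, with the random field playing the role of the unobserved trigger effect.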

Preprint, 2020, arXiv

Landscape allocation: stochastic generators and statistical inference

In agricultural landscapes, the composition and spatial configuration of cultivated and semi-natural elements strongly impact species dynamics, their interactions and habitat connectivity. To allow for landscape structural analysis and scenario generation, we here develop statistical tools for real landscapes composed of geometric elements including 2D patches but also 1D linear elements such as hedges. We design generative stochastic models that combine a multiplex network representation and Gibbs energy terms to characterize the distributional behavior of landscape descriptors for land-use categories. We implement Metropolis-Hastings for this new class of models to sample agricultural scenarios featuring parameter-controlled spatial and temporal patterns (e.g., geometry, connectivity, crop-rotation). Pseudolikelihood-based inference allows studying the relevance of model components in real landscapes through statistical and functional validation, the latter achieved by comparing commonly used landscape metrics between observed and simulated landscapes. Models fitted to subregions of the Lower Durance Valley (France) indicate strong deviation from random allocation, and they realistically capture small-scale landscape patterns. In summary, our approach of statistical modeling improves the understanding of structural and functional aspects of agro-ecosystems, and it enables simulation-based theoretical analysis of how landscape patterns shape biological and ecological processes.
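
The combination of Gibbs energy terms and Metropolis-Hastings sampling can be illustrated on a much simpler toy model than the one in the paper. The sketch below allocates land-use labels on a square grid with a single energy term that rewards neighbouring cells sharing a label; it does not include the multiplex network, patch geometry or linear elements described above, and all names and parameter values are assumptions.

```python
import numpy as np


def same_neighbor_pairs(grid):
    """Count horizontally and vertically adjacent cells with equal labels."""
    return int(np.sum(grid[:, :-1] == grid[:, 1:])
               + np.sum(grid[:-1, :] == grid[1:, :]))


def local_agreement(grid, i, j, label):
    """Number of the four neighbours of cell (i, j) that carry `label`."""
    n_rows, n_cols = grid.shape
    total = 0
    for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        a, b = i + di, j + dj
        if 0 <= a < n_rows and 0 <= b < n_cols and grid[a, b] == label:
            total += 1
    return total


def metropolis_hastings(shape, n_labels, beta, n_steps, seed=0):
    """Sample labels with density proportional to exp(beta * same pairs)."""
    rng = np.random.default_rng(seed)
    grid = rng.integers(n_labels, size=shape)
    for _ in range(n_steps):
        # Symmetric proposal: pick a cell and a new label uniformly.
        i = rng.integers(shape[0])
        j = rng.integers(shape[1])
        proposal = rng.integers(n_labels)
        # Change in the number of agreeing neighbour pairs if accepted.
        delta = (local_agreement(grid, i, j, proposal)
                 - local_agreement(grid, i, j, grid[i, j]))
        # Accept with probability min(1, exp(beta * delta)).
        if np.log(rng.random()) < beta * delta:
            grid[i, j] = proposal
    return grid


random_grid = metropolis_hastings((15, 15), 3, beta=0.0, n_steps=20000)
clustered_grid = metropolis_hastings((15, 15), 3, beta=1.5, n_steps=20000)
```

With beta set to zero the sampler reproduces random allocation, so comparing same_neighbor_pairs between the two grids shows how a single energy parameter controls spatial clustering.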

Preprint, 2020, arXiv

Max-infinitely divisible models and inference for spatial extremes

For many environmental processes, recent studies have shown that the dependence strength is decreasing when quantile levels increase. This implies that the popular max-stable models are inadequate to capture the rate of joint tail decay, and to estimate joint extremal probabilities beyond observed levels. We here develop a more flexible modeling framework based on the class of max-infinitely divisible processes, which extend max-stable processes while retaining dependence properties that are natural for maxima. We propose two parametric constructions for max-infinitely divisible models, which relax the max-stability property but remain close to some popular max-stable models obtained as special cases. The first model considers maxima over a finite, random number of independent observations, while the second model generalizes the spectral representation of max-stable processes. Inference is performed using a pairwise likelihood. We illustrate the benefits of our new modeling framework on Dutch wind gust maxima calculated over different time units. Results strongly suggest that our proposed models outperform other natural models, such as the Student-t copula process and its max-stable limit, even for large block sizes.
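
The idea of taking maxima over a finite, random number of independent observations can be illustrated with a toy bivariate example. The sketch below is not the parametric model of the paper: the Poisson count, the Gaussian vectors and all parameter values are assumptions. If N is Poisson with mean rate and the vectors have joint distribution function F, the componentwise maximum has distribution function G(x, y) = exp(-rate * (1 - F(x, y))). Any power G^(1/k) has the same form with rate / k, which is the max-infinite divisibility property.

```python
import numpy as np
from scipy.stats import multivariate_normal


def simulate_poisson_maxima(n_rep, rate, rho, rng):
    """Componentwise maxima over a Poisson number of bivariate Gaussians.

    Replicates with zero observations are set to -inf in both components.
    """
    chol = np.linalg.cholesky(np.array([[1.0, rho], [rho, 1.0]]))
    counts = rng.poisson(rate, size=n_rep)

    # Draw all vectors at once and remember which replicate owns each one.
    draws = rng.standard_normal((counts.sum(), 2)) @ chol.T
    owner = np.repeat(np.arange(n_rep), counts)

    maxima = np.full((n_rep, 2), -np.inf)
    np.maximum.at(maxima, owner, draws)  # unbuffered running maximum
    return maxima


def poisson_maxima_cdf(x, y, rate, rho):
    """Theoretical joint distribution function exp(-rate * (1 - F(x, y)))."""
    cov = np.array([[1.0, rho], [rho, 1.0]])
    f = multivariate_normal(mean=np.zeros(2), cov=cov).cdf(np.array([x, y]))
    return float(np.exp(-rate * (1.0 - f)))


rng = np.random.default_rng(2)
maxima = simulate_poisson_maxima(n_rep=20000, rate=5.0, rho=0.6, rng=rng)

empirical = np.mean((maxima[:, 0] <= 1.5) & (maxima[:, 1] <= 1.5))
theoretical = poisson_maxima_cdf(1.5, 1.5, rate=5.0, rho=0.6)
```

Comparing empirical with theoretical gives a quick sanity check that the simulated maxima follow the stated distribution function.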

Preprint, 2020, arXiv

Modeling Non-Stationary Temperature Maxima Based on Extremal Dependence Changing with Event Magnitude

The modeling of spatio-temporal trends in temperature extremes can help better understand the structure and frequency of heatwaves in a changing climate. Here, we study annual temperature maxima over Southern Europe using a century-spanning dataset observed at 44 monitoring stations. Extending the spectral representation of max-stable processes, our modeling framework relies on a novel construction of max-infinitely divisible processes, which include covariates to capture spatio-temporal non-stationarities. Our new model keeps a popular max-stable process on the boundary of the parameter space, while flexibly capturing weakening extremal dependence at increasing quantile levels and asymptotic independence. This is achieved by linking the overall magnitude of a spatial event to its spatial correlation range, in such a way that more extreme events become less spatially dependent, thus more localized. Our model reveals salient features of the spatio-temporal variability of European temperature extremes, and it clearly outperforms natural alternative models. Results show that the spatial extent of heatwaves is smaller for more severe events at higher altitudes, and that recent heatwaves are moderately wider. Our probabilistic assessment of the 2019 annual maxima confirms the severity of the 2019 heatwaves both spatially and at individual sites, especially when compared to climatic conditions prevailing in 1950-1975.

Preprint, 2020, arXiv

Point-process based Bayesian modeling of space-time structures of forest fire occurrences in Mediterranean France

Due to climate change and human activity, wildfires are expected to become more frequent and extreme worldwide, causing economic and ecological disasters. The deployment of preventive measures and operational forecasts can be aided by stochastic modeling that helps to understand and quantify the mechanisms governing the occurrence intensity. We here develop a point process framework for wildfire ignition points observed in the French Mediterranean basin since 1995, and we fit a spatio-temporal log-Gaussian Cox process with monthly temporal resolution in a Bayesian framework using the integrated nested Laplace approximation (INLA). Human activity is the main direct cause of wildfires and is indirectly measured through a number of appropriately defined proxies related to land-use covariates (urbanization, road network) in our approach, and we further integrate covariates of climatic and environmental conditions to explain wildfire occurrences. We include spatial random effects with Matérn covariance and temporal autoregression at yearly resolution. Two major methodological challenges are tackled: first, handling and unifying multi-scale structures in data is achieved through computer-intensive preprocessing steps with GIS software and kriging techniques; second, INLA-based estimation with high-dimensional response vectors and latent models is facilitated through intra-year subsampling, taking into account the occurrence structure of wildfires.

Preprint, 2020, arXiv

Semi-parametric resampling with extremes

Nonparametric resampling methods such as Direct Sampling are powerful tools to simulate new datasets preserving important data features such as spatial patterns from observed datasets while using only minimal assumptions. However, such methods cannot generate extreme events beyond the observed range of data values. We here propose using tools from extreme value theory for stochastic processes to extrapolate observed data towards yet unobserved high quantiles. Original data are first enriched with new values in the tail region, and then classical resampling algorithms are applied to enriched data. In a first approach to enrichment that we label "naive resampling", we generate an independent sample of the marginal distribution while keeping the rank order of the observed data. We point out inaccuracies of this approach around the most extreme values, and therefore develop a second approach that works for datasets with many replicates. It is based on the asymptotic representation of extreme events through two stochastically independent components: a magnitude variable, and a profile field describing spatial variation. To generate enriched data, we fix a target range of return levels of the magnitude variable, and we resample magnitudes constrained to this range. We then use the second approach to generate heatwave scenarios of yet unobserved magnitude over France, based on daily temperature reanalysis training data for the years 2010 to 2016.
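
The first enrichment approach, which draws an independent sample from the marginal distribution while keeping the rank order of the observed data, can be sketched in a few lines. This is an illustration under assumptions: the helper name rank_preserving_enrichment is hypothetical, and the Gaussian marginal model used to draw new values is only a placeholder for whatever extreme-value-based marginal model is actually fitted.

```python
import numpy as np


def rank_preserving_enrichment(observed, new_sample):
    """Replace observed values by new ones while keeping their rank order.

    The k-th smallest observed value is replaced by the k-th smallest
    value of `new_sample`, so spatial patterns of ranks are unchanged.
    """
    observed = np.asarray(observed, dtype=float)
    new_sorted = np.sort(np.asarray(new_sample, dtype=float).ravel())
    if new_sorted.size != observed.size:
        raise ValueError("new_sample must have as many values as observed")
    # Rank (0-based) of every observed value within the flattened data.
    ranks = np.argsort(np.argsort(observed, axis=None))
    return new_sorted[ranks].reshape(observed.shape)


rng = np.random.default_rng(3)
observed = rng.normal(size=(10, 10))  # toy spatial field (assumption)

# Placeholder marginal model: a Gaussian with the observed mean and spread.
new_sample = rng.normal(observed.mean(), observed.std(), size=observed.size)

enriched = rank_preserving_enrichment(observed, new_sample)
```

Because only the ranks are reused, the largest enriched value sits where the largest observed value was, which is one way to see why this approach can be inaccurate around the most extreme values.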

Preprint, 2020, arXiv

Spatial hierarchical modeling of threshold exceedances using rate mixtures

We develop new flexible univariate models for light-tailed and heavy-tailed data, which extend a hierarchical representation of the generalized Pareto (GP) limit for threshold exceedances. These models can accommodate departure from asymptotic threshold stability in finite samples while keeping the asymptotic GP distribution as a special (or boundary) case and can capture the tails and the bulk jointly without losing much flexibility. Spatial dependence is modeled through a latent process, while the data are assumed to be conditionally independent. Focusing on a gamma-gamma model construction, we design penalized complexity priors for crucial model parameters, shrinking our proposed spatial Bayesian hierarchical model toward a simpler reference whose marginal distributions are GP with moderately heavy tails. Our model can be fitted in fairly high dimensions using Markov chain Monte Carlo by exploiting the Metropolis-adjusted Langevin algorithm (MALA), which guarantees fast convergence of Markov chains with efficient block proposals for the latent variables. We also develop an adaptive scheme to calibrate the MALA tuning parameters. Moreover, our model avoids the expensive numerical evaluations of multifold integrals in censored likelihood expressions. We demonstrate our new methodology by simulation and application to a dataset of extreme rainfall events that occurred in Germany. Our fitted gamma-gamma model provides a satisfactory performance and can be successfully used to predict rainfall extremes at unobserved locations.
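
A standard example of a hierarchical representation of the GP distribution is an exponential variable whose rate is gamma distributed: if Y given L is exponential with rate L, and L follows a gamma distribution with shape alpha and rate beta, then Y is GP distributed with shape 1/alpha and scale beta/alpha. The sketch below checks this numerically. It illustrates only this basic rate mixture, not the gamma-gamma model or the MALA sampler of the paper, and the function name and parameter values are assumptions.

```python
import numpy as np
from scipy.stats import genpareto, kstest


def simulate_rate_mixture(n, alpha, beta, rng):
    """Simulate exponentials whose rates are Gamma(shape alpha, rate beta)."""
    # NumPy's gamma uses a scale parameter, so scale = 1 / rate.
    rates = rng.gamma(shape=alpha, scale=1.0 / beta, size=n)
    # NumPy's exponential also uses scale = 1 / rate.
    return rng.exponential(scale=1.0 / rates)


alpha, beta = 4.0, 2.0
rng = np.random.default_rng(4)
y = simulate_rate_mixture(50000, alpha, beta, rng)

# Marginal survival function: E[exp(-L * y)] = (1 + y / beta) ** (-alpha),
# which is a GP distribution with shape 1 / alpha and scale beta / alpha.
gp_limit = genpareto(c=1.0 / alpha, scale=beta / alpha)
ks_result = kstest(y, gp_limit.cdf)
```

A small Kolmogorov-Smirnov statistic in ks_result indicates that the simulated mixture agrees with the stated GP distribution.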