Source author record

Andrew O. Finley

Andrew O. Finley appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Applications Computation Methodology Populations and Evolution

Catalog footprint

What is connected

12works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types

The Global Ecosystem Dynamics Investigation (GEDI) is a spaceborne lidar instrument that collects near-global measurements of forest structure. While expansive in scope, GEDI samples are spatially sparse and cover a small fraction of the land surface. Converting the sparse samples into spatially complete predictive maps is of practical importance for many ecological studies. A complicating factor is that GEDI collects measurements over forested and non-forested land alike, with no automatic labeling of the land type. Such classification is important, as it categorically influences the probability distribution of the spatial process and the ecological interpretation of the observations/predictions. We implement a spatial mixture model, separating the spatial domain into two latent classes. The latent classes are governed by a Bernoulli spatial process and within each class the process is governed by a separate spatial model. Model predictions take the form of scalar predictions as well as discrete labeling of the class membership. Inference is conducted through a Bayesian paradigm, yielding rich quantification of prediction and uncertainty. We demonstrate the method using GEDI data over Wollemi National Park. When compared to a single spatial model, the mixture model achieves much higher posterior predictive densities on the true value. When compared to a random forest model, a common algorithmic approach in the remote sensing community, the random forest achieves better absolute prediction accuracy for prediction locations far from observed training data locations, but at the expense of location-specific assessments of uncertainty. The unsupervised binary classifications of the mixture model appear broadly ecologically interpretable as forest and non-forest when compared to optical imagery, but further comparison to ground-truth data is required.

preprint2022arXiv

Simplifying small area estimation with rFIA: a demonstration of tools and techniques

The United States (US) Forest Service Forest Inventory and Analysis (FIA) program operates the national forest inventory of the US. Traditionally, the FIA program has relied on sample-based approaches -- permanent plot networks and associated design-based estimators -- to estimate forest variables across large geographic areas and long periods of time. These approaches generally offer unbiased inference on large domains but fail to provide reliable estimates for small domains due to low sample sizes. Rising demand for small domain estimates will thus require the FIA program to adopt non-traditional estimation approaches that are capable of delivering defensible estimates of forest variables at increased spatial and temporal resolution, without the expense of collecting additional field data. In light of this challenge, the development of small area estimation (SAE) methods for FIA data has become an active and highly productive area of research. Yet, SAE methods remain difficult to apply to FIA data, due in part to the complex data structures and inventory design used by the FIA program. Thus, we argue that a new suite of estimation tools (i.e., software) will be required to accommodate shifts in demand for inference on large geographic areas and long time periods to inference on small spatial and/or temporal domains. Herein, we present rFIA, an open-source R package designed to increase the accessibility of FIA data, as one such tool. Specifically, we present two case studies chosen to demonstrate rFIA's potential to simplify the application of a broad suite of SAE methods to FIA data: (1) estimation of contemporary county-level forest carbon stocks across the conterminous US using a spatial Fay-Herriot model; and (2) temporally-explicit estimation of multi-decadal trends in merchantable wood volume in Washington County, Maine using a Bayesian mixed-effects model.

preprint2020arXiv

A Bayesian hierarchical model to estimate land surface phenology parameters with harmonized Landsat 8 and Sentinel-2 images

We develop a Bayesian Land Surface Phenology (LSP) model and examine its performance using Enhanced Vegetation Index (EVI) observations derived from the Harmonized Landsat Sentinel-2 (HLS) dataset. Building on previous work, we propose a double logistic function that, once couched within a Bayesian model, yields posterior distributions for all LSP parameters. We assess the efficacy of the Normal, Truncated Normal, and Beta likelihoods to deliver robust LSP parameter estimates. Two case studies are presented and used to explore aspects of the proposed model. The first, conducted over forested pixels within a HLS tile, explores choice of likelihood and space-time varying HLS data availability for long-term average LSP parameter point and uncertainty estimation. The second, conducted on a small area of interest within the HLS tile on an annual time-step, further examines the impact of sample size and choice of likelihood on LSP parameter estimates. Results indicate that while the Truncated Normal and Beta likelihoods are theoretically preferable when the vegetation index is bounded, all three likelihoods performed similarly when the number of index observations is sufficiently large and values are not near the index bounds. Both case studies demonstrate how pixel-level LSP parameter posterior distributions can be used to propagate uncertainty through subsequent analysis. As a companion to this article, we provide an open-source \R package \pkg{rsBayes} and supplementary data and code used to reproduce the analysis results. The proposed model specification and software implementation delivers computationally efficient, statistically robust, and inferentially rich LSP parameter posterior distributions at the pixel-level across massive raster time series datasets.

preprint2020arXiv

Characterizing functional relationships between anthropogenic and biological sounds: A western New York state soundscape case study

Roads are a widespread feature of landscapes worldwide, and road traffic sound potentially makes nearby habitat unsuitable for acoustically communicating organisms. It is important to understand the influence of roads at the soundscape level to mitigate negative impacts of road sound on individual species as well as subsequent effects on the surrounding landscape. We seek to characterize the relationship between anthropogenic and biological sounds in western New York and assess the extent to which available traffic data explains variability in anthropogenic noise. Recordings were obtained in the spring of 2016 at 18 sites throughout western New York. We used the Welch Power Spectral Density (PSD) at low frequencies (0.5-2 kHz) to represent anthropogenic noise and PSD values at higher frequencies (2-11 kHz) to represent biological sound. Relationships were modeled using a novel two-stage hierarchical Bayesian model utilizing beta regression and basis splines. Model results and map predictions illustrate that anthropogenic noise and biological sound have an inverse relationship, and anthropogenic noise is greatest in close proximity to high traffic volume roads. The predictions have large uncertainty, resulting from the temporal coarseness of public road data used as a proxy for traffic sound. Results suggest that finer temporal resolution traffic sound data, such as crowd-sourced time-indexed traffic data from geographic positioning systems, might better account for observed temporal changes in the soundscape. The use of such data, in combination with the proposed modeling framework, could have important implications for the development of sound management policies.

preprint2020arXiv

rFIA: An R package for estimation of forest attributes with the Forest Inventory and Analysis Database

Forest Inventory and Analysis (FIA) is a US Department of Agriculture Forest Service program that aims to monitor changes in forests across the US. FIA hosts one of the largest ecological datasets in the world, though its complexity limits access for many potential users. rFIA is an R package designed to simplify the estimation of forest attributes using data collected by the FIA Program. Specifically, rFIA improves access to the spatio-temporal estimation capacity of the FIA Database via space-time indexed summaries of forest variables within user-defined population boundaries (e.g., geographic, temporal, biophysical). The package implements multiple design-based estimators, and has been validated against official estimates and sampling errors produced by the FIA Program. We demonstrate the utility of rFIA by assessing changes in abundance and mortality rates of ash (Fraxinus spp.) populations in the Lower Peninsula of Michigan following the establishment of emerald ash borer (Agrilus planipennis).

preprint2019arXiv

Bayesian spatially varying coefficient models in the spBayes R package

This paper describes and illustrates new functionality for fitting spatially varying coefficients models in the spBayes (version 0.4-2) R package. The new spSVC function uses a computationally efficient Markov chain Monte Carlo algorithm and extends current spBayes functions, that fit only space-varying intercept regression models, to fit independent or multivariate Gaussian process random effects for any set of columns in the regression design matrix. Newly added OpenMP parallelization options for spSVC are discussed and illustrated, as well as helper functions for joint and point-wise prediction and model fit diagnostics. The utility of the proposed models is illustrated using a PM10 analysis over central Europe.

preprint2016arXiv

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets

Spatial process models for analyzing geostatistical data entail computations that become prohibitive as the number of spatial locations become large. This manuscript develops a class of highly scalable Nearest Neighbor Gaussian Process (NNGP) models to provide fully model-based inference for large geostatistical datasets. We establish that the NNGP is a well-defined spatial process providing legitimate finite-dimensional Gaussian densities with sparse precision matrices. We embed the NNGP as a sparsity-inducing prior within a rich hierarchical modeling framework and outline how computationally efficient Markov chain Monte Carlo (MCMC) algorithms can be executed without storing or decomposing large matrices. The floating point operations (flops) per iteration of this algorithm is linear in the number of spatial locations, thereby rendering substantial scalability. We illustrate the computational and inferential benefits of the NNGP over competing methods using simulation studies and also analyze forest biomass from a massive United States Forest Inventory dataset at a scale that precludes alternative dimension-reducing methods.

preprint2016arXiv

Joint hierarchical models for sparsely sampled high-dimensional LiDAR and forest variables

Recent advancements in remote sensing technology, specifically Light Detection and Ranging (LiDAR) sensors, provide the data needed to quantify forest characteristics at a fine spatial resolution over large geographic domains. From an inferential standpoint, there is interest in prediction and interpolation of the often sparsely sampled and spatially misaligned LiDAR signals and forest variables. We propose a fully process-based Bayesian hierarchical model for above ground biomass (AGB) and LiDAR signals. The process-based framework offers richness in inferential capabilities, e.g., inference on the entire underlying processes instead of estimates only at pre-specified points. Key challenges we obviate include misalignment between the AGB observations and LiDAR signals and the high-dimensionality in the model emerging from LiDAR signals in conjunction with the large number of spatial locations. We offer simulation experiments to evaluate our proposed models and also apply them to a challenging dataset comprising LiDAR and spatially coinciding forest inventory variables collected on the Penobscot Experimental Forest (PEF), Maine. Our key substantive contributions include AGB data products with associated measures of uncertainty for the PEF and, more broadly, a methodology that should find use in a variety of current and upcoming forest variable mapping efforts using sparsely sampled remotely sensed high-dimensional data.

preprint2016arXiv

Non-separable Dynamic Nearest-Neighbor Gaussian Process Models for Large spatio-temporal Data With an Application to Particulate Matter Analysis

Particulate matter (PM) is a class of malicious environmental pollutants known to be detrimental to human health. Regulatory efforts aimed at curbing PM levels in different countries often require high resolution space-time maps that can identify red-flag regions exceeding statutory concentration limits. Continuous spatio-temporal Gaussian Process (GP) models can deliver maps depicting predicted PM levels and quantify predictive uncertainty. However, GP based approaches are usually thwarted by computational challenges posed by large datasets. We construct a novel class of scalable Dynamic Nearest Neighbor Gaussian Process (DNNGP) models that can provide a sparse approximation to any spatio-temporal GP (e.g., with non-separable covariance structures). The DNNGP we develop here can be used as a sparsity-inducing prior for spatio-temporal random effects in any Bayesian hierarchical model to deliver full posterior inference. Storage and memory requirements for a DNNGP model are linear in the size of the dataset thereby delivering massive scalability without sacrificing inferential richness. Extensive numerical studies reveal that the DNNGP provides substantially superior approximations to the underlying process than low rank approximations. Finally, we use the DNNGP to analyze a massive air quality dataset to substantially improve predictions of PM levels across Europe in conjunction with the LOTOS-EUROS chemistry transport models (CTMs).

preprint2016arXiv

Variable Effects of Climate on Forest Growth in Relation to Climate Extremes, Disturbance, and Forest Stand Dynamics

Changes in the frequency, duration, and severity of climate extremes are forecast to occur under global climate change. The impacts of climate extremes on forest productivity and health are complicated by potential interactions with disturbance events and stand dynamics. The effects of stand dynamics on forest responses to climate and disturbance are particularly important given forest characteristics driven by stand dynamics can be modified through forest management with the goal of increasing forest resistance and resilience to climate change. We develop a hierarchical Bayesian state-space model allowing climate effects on tree growth to vary over time and in relation to climate extremes, disturbance events, and stand dynamics. We apply the model to a dendrochronology dataset comprising measurements from forest stands of varying composition, structure, and development stage in northeastern Minnesota. Results indicate average forest growth was most sensitive to variables describing climatic water deficit. Forest growth responses to water deficit were partitioned into responses driven by climatic threshold exceedances and interactions with forest tent caterpillar defoliation. Forest growth was both resistant and resilient to climate extremes with the majority of forest growth responses occurring after multiple climatic threshold exceedances or insect defoliation events. Forest growth was most sensitive to water deficit during periods of high stem density following major regeneration events when average inter-tree competition was high. Results suggest that forest growth resistance and resilience to interactions between climate extremes and insect defoliation can be increased through management steps such as thinning to reduce competition during early stages of stand development and small-group selection harvests to maintain forest structures characteristic of older, mature stands.

preprint2014arXiv

Dynamic spatial regression models for space-varying forest stand tables

Many forest management planning decisions are based on information about the number of trees by species and diameter per unit area. This information is commonly summarized in a stand table, where a stand is defined as a group of forest trees of sufficiently uniform species composition, age, condition, or productivity to be considered a homogeneous unit for planning purposes. Typically information used to construct stand tables is gleaned from observed subsets of the forest selected using a probability-based sampling design. Such sampling campaigns are expensive and hence only a small number of sample units are typically observed. This data paucity means that stand tables can only be estimated for relatively large areal units. Contemporary forest management planning and spatially explicit ecosystem models require stand table input at higher spatial resolution than can be affordably provided using traditional approaches. We propose a dynamic multivariate Poisson spatial regression model that accommodates both spatial correlation between observed diameter distributions and also correlation between tree counts across diameter classes within each location. To improve fit and prediction at unobserved locations, diameter specific intensities can be estimated using auxiliary data such as management history or remotely sensed information. The proposed model is used to analyze a diverse forest inventory dataset collected on the United States Forest Service Penobscot Experimental Forest in Bradley, Maine. Results demonstrate that explicitly modeling the residual spatial structure via a multivariate Gaussian process and incorporating information about forest structure from LiDAR covariates improve model fit and can provide high spatial resolution stand table maps with associated estimates of uncertainty.

preprint2013arXiv

spBayes for large univariate and multivariate point-referenced spatio-temporal data models

In this paper we detail the reformulation and rewrite of core functions in the spBayes R package. These efforts have focused on improving computational efficiency, flexibility, and usability for point-referenced data models. Attention is given to algorithm and computing developments that result in improved sampler convergence rate and efficiency by reducing parameter space; decreased sampler run-time by avoiding expensive matrix computations, and; increased scalability to large datasets by implementing a class of predictive process models that attempt to overcome computational hurdles by representing spatial processes in terms of lower-dimensional realizations. Beyond these general computational improvements for existing model functions, we detail new functions for modeling data indexed in both space and time. These new functions implement a class of dynamic spatio-temporal models for settings where space is viewed as continuous and time is taken as discrete.

Andrew O. Finley

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types

Simplifying small area estimation with rFIA: a demonstration of tools and techniques

A Bayesian hierarchical model to estimate land surface phenology parameters with harmonized Landsat 8 and Sentinel-2 images

Characterizing functional relationships between anthropogenic and biological sounds: A western New York state soundscape case study

rFIA: An R package for estimation of forest attributes with the Forest Inventory and Analysis Database

Bayesian spatially varying coefficient models in the spBayes R package

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets

Joint hierarchical models for sparsely sampled high-dimensional LiDAR and forest variables

Non-separable Dynamic Nearest-Neighbor Gaussian Process Models for Large spatio-temporal Data With an Application to Particulate Matter Analysis

Variable Effects of Climate on Forest Growth in Relation to Climate Extremes, Disturbance, and Forest Stand Dynamics

Dynamic spatial regression models for space-varying forest stand tables

spBayes for large univariate and multivariate point-referenced spatio-temporal data models