Researcher profile

Andrew O. Finley

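Andrew O. Finley's listed work covers Bayesian spatial and spatio-temporal modeling, forest inventory estimation, and remote sensing, including papers on the spBayes and rFIA R packages.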

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
8 works
0 followers
3 topics
4 close collaborators

Published work

8 published items

preprint · 2024 · arXiv

A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types

The Global Ecosystem Dynamics Investigation (GEDI) is a spaceborne lidar instrument that collects near-global measurements of forest structure. While expansive in scope, GEDI samples are spatially sparse and cover a small fraction of the land surface. Converting the sparse samples into spatially complete predictive maps is of practical importance for many ecological studies. A complicating factor is that GEDI collects measurements over forested and non-forested land alike, with no automatic labeling of the land type. Such classification is important, as it categorically influences the probability distribution of the spatial process and the ecological interpretation of the observations/predictions. We implement a spatial mixture model, separating the spatial domain into two latent classes. The latent classes are governed by a Bernoulli spatial process and within each class the process is governed by a separate spatial model. Model predictions take the form of scalar predictions as well as discrete labeling of the class membership. Inference is conducted through a Bayesian paradigm, yielding rich quantification of prediction and uncertainty. We demonstrate the method using GEDI data over Wollemi National Park. When compared to a single spatial model, the mixture model achieves much higher posterior predictive densities on the true value. When compared to a random forest model, a common algorithmic approach in the remote sensing community, the random forest achieves better absolute prediction accuracy for prediction locations far from observed training data locations, but at the expense of location-specific assessments of uncertainty. The unsupervised binary classifications of the mixture model appear broadly ecologically interpretable as forest and non-forest when compared to optical imagery, but further comparison to ground-truth data is required.
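The two-class idea in this abstract can be illustrated with a minimal, non-spatial sketch. It assumes each class has a Gaussian observation distribution and a constant prior class probability. The paper instead governs class membership with a Bernoulli spatial process and gives each class its own spatial model, so this is a simplification and not the authors' method. The function names and all parameter values are hypothetical.

```python
import math

def normal_pdf(y, mean, sd):
    """Density of a Normal(mean, sd) distribution at y."""
    z = (y - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def class_probability(y, p_forest, forest, non_forest):
    """Posterior probability that observation y belongs to the 'forest' class.

    forest and non_forest are (mean, sd) pairs for each class.
    p_forest is a constant prior probability (assumption: no spatial dependence).
    """
    w_forest = p_forest * normal_pdf(y, *forest)
    w_other = (1.0 - p_forest) * normal_pdf(y, *non_forest)
    return w_forest / (w_forest + w_other)

# Hypothetical canopy-height-like values in metres.
tall = class_probability(25.0, 0.5, forest=(25.0, 5.0), non_forest=(2.0, 2.0))
short = class_probability(2.0, 0.5, forest=(25.0, 5.0), non_forest=(2.0, 2.0))
```

Thresholding the returned probability gives a discrete class label, which loosely mirrors the abstract's pairing of scalar predictions with class membership.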

preprint · 2022 · arXiv

Simplifying small area estimation with rFIA: a demonstration of tools and techniques

The United States (US) Forest Service Forest Inventory and Analysis (FIA) program operates the national forest inventory of the US. Traditionally, the FIA program has relied on sample-based approaches -- permanent plot networks and associated design-based estimators -- to estimate forest variables across large geographic areas and long periods of time. These approaches generally offer unbiased inference on large domains but fail to provide reliable estimates for small domains due to low sample sizes. Rising demand for small domain estimates will thus require the FIA program to adopt non-traditional estimation approaches that are capable of delivering defensible estimates of forest variables at increased spatial and temporal resolution, without the expense of collecting additional field data. In light of this challenge, the development of small area estimation (SAE) methods for FIA data has become an active and highly productive area of research. Yet, SAE methods remain difficult to apply to FIA data, due in part to the complex data structures and inventory design used by the FIA program. Thus, we argue that a new suite of estimation tools (i.e., software) will be required to accommodate shifts in demand from inference on large geographic areas and long time periods to inference on small spatial and/or temporal domains. Herein, we present rFIA, an open-source R package designed to increase the accessibility of FIA data, as one such tool. Specifically, we present two case studies chosen to demonstrate rFIA's potential to simplify the application of a broad suite of SAE methods to FIA data: (1) estimation of contemporary county-level forest carbon stocks across the conterminous US using a spatial Fay-Herriot model; and (2) temporally-explicit estimation of multi-decadal trends in merchantable wood volume in Washington County, Maine using a Bayesian mixed-effects model.
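As a rough illustration of the area-level idea behind a Fay-Herriot model, the sketch below blends a direct survey estimate with a model-based (synthetic) prediction. It uses the basic non-spatial form and assumes the model variance and the regression prediction are already known, whereas the paper's first case study uses a spatial Fay-Herriot model. The function name and the numbers are hypothetical, and no rFIA functions are used.

```python
def fay_herriot_blend(direct, sampling_var, synthetic, model_var):
    """Shrink a direct estimate toward a model-based prediction.

    direct:        design-based estimate for one small area
    sampling_var:  its (assumed known) sampling variance
    synthetic:     regression prediction for the same area
    model_var:     variance of the area-level random effect (assumed known)
    """
    # Weight on the direct estimate: high when the survey estimate is precise.
    gamma = model_var / (model_var + sampling_var)
    estimate = gamma * direct + (1.0 - gamma) * synthetic
    return estimate, gamma

# A noisy direct estimate is pulled strongly toward the model prediction.
estimate, gamma = fay_herriot_blend(
    direct=100.0, sampling_var=400.0, synthetic=80.0, model_var=100.0
)
```

Areas with few plots have large sampling variance, so their estimates lean on the model. Areas with many plots keep estimates close to the direct value.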

preprint · 2020 · arXiv

A Bayesian hierarchical model to estimate land surface phenology parameters with harmonized Landsat 8 and Sentinel-2 images

We develop a Bayesian Land Surface Phenology (LSP) model and examine its performance using Enhanced Vegetation Index (EVI) observations derived from the Harmonized Landsat Sentinel-2 (HLS) dataset. Building on previous work, we propose a double logistic function that, once couched within a Bayesian model, yields posterior distributions for all LSP parameters. We assess the efficacy of the Normal, Truncated Normal, and Beta likelihoods to deliver robust LSP parameter estimates. Two case studies are presented and used to explore aspects of the proposed model. The first, conducted over forested pixels within an HLS tile, explores choice of likelihood and space-time varying HLS data availability for long-term average LSP parameter point and uncertainty estimation. The second, conducted on a small area of interest within the HLS tile on an annual time-step, further examines the impact of sample size and choice of likelihood on LSP parameter estimates. Results indicate that while the Truncated Normal and Beta likelihoods are theoretically preferable when the vegetation index is bounded, all three likelihoods performed similarly when the number of index observations is sufficiently large and values are not near the index bounds. Both case studies demonstrate how pixel-level LSP parameter posterior distributions can be used to propagate uncertainty through subsequent analysis. As a companion to this article, we provide an open-source R package rsBayes and supplementary data and code used to reproduce the analysis results. The proposed model specification and software implementation deliver computationally efficient, statistically robust, and inferentially rich LSP parameter posterior distributions at the pixel-level across massive raster time series datasets.
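For readers unfamiliar with double logistic phenology curves, here is a minimal sketch of one generic form: a rising logistic for green-up minus a rising logistic for senescence. The parameterization and parameter names (`sos`, `eos`, `r_up`, `r_down`) are assumptions chosen for illustration and may differ from the function proposed in the paper or implemented in rsBayes.

```python
import math

def double_logistic(t, v_min, v_max, sos, eos, r_up, r_down):
    """Vegetation index on day-of-year t under a generic double logistic curve.

    v_min, v_max: dormant-season and peak-season index values
    sos, eos:     inflection days for green-up and senescence
    r_up, r_down: scale (in days) controlling how sharp each transition is
    """
    green_up = 1.0 / (1.0 + math.exp(-(t - sos) / r_up))
    senescence = 1.0 / (1.0 + math.exp(-(t - eos) / r_down))
    return v_min + (v_max - v_min) * (green_up - senescence)

# Hypothetical EVI-like curve for one pixel over a year.
curve = [double_logistic(day, 0.2, 0.8, 120, 280, 10, 12) for day in range(1, 366)]
```

In a Bayesian treatment, each of these parameters would receive a prior and the observed index values a likelihood (Normal, Truncated Normal, or Beta in the abstract), yielding a posterior distribution for every parameter instead of a single fitted curve.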

preprint · 2020 · arXiv

Characterizing functional relationships between anthropogenic and biological sounds: A western New York state soundscape case study

Roads are a widespread feature of landscapes worldwide, and road traffic sound potentially makes nearby habitat unsuitable for acoustically communicating organisms. It is important to understand the influence of roads at the soundscape level to mitigate negative impacts of road sound on individual species as well as subsequent effects on the surrounding landscape. We seek to characterize the relationship between anthropogenic and biological sounds in western New York and assess the extent to which available traffic data explains variability in anthropogenic noise. Recordings were obtained in the spring of 2016 at 18 sites throughout western New York. We used the Welch Power Spectral Density (PSD) at low frequencies (0.5-2 kHz) to represent anthropogenic noise and PSD values at higher frequencies (2-11 kHz) to represent biological sound. Relationships were modeled using a novel two-stage hierarchical Bayesian model utilizing beta regression and basis splines. Model results and map predictions illustrate that anthropogenic noise and biological sound have an inverse relationship, and anthropogenic noise is greatest in close proximity to high traffic volume roads. The predictions have large uncertainty, resulting from the temporal coarseness of public road data used as a proxy for traffic sound. Results suggest that finer temporal resolution traffic sound data, such as crowd-sourced time-indexed traffic data from geographic positioning systems, might better account for observed temporal changes in the soundscape. The use of such data, in combination with the proposed modeling framework, could have important implications for the development of sound management policies.
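The band-based summary described in this abstract can be sketched as follows: estimate a Welch power spectral density, then average it over a low band (0.5-2 kHz) and a higher band (2-11 kHz). This is a dependency-free illustration with a slow direct DFT, a Hann window, and 50% overlap. Those settings, the synthetic two-tone signal, and the function names are assumptions, not the processing used in the study.

```python
import cmath
import math

def welch_psd(signal, fs, seg_len):
    """Average Hann-windowed periodograms over 50%-overlapping segments.

    Assumes len(signal) >= seg_len. Returns frequencies (Hz) and a two-sided
    density restricted to non-negative frequencies (no one-sided doubling).
    """
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / seg_len) for n in range(seg_len)]
    scale = fs * sum(w * w for w in window)
    starts = range(0, len(signal) - seg_len + 1, seg_len // 2)
    n_bins = seg_len // 2 + 1
    psd = [0.0] * n_bins
    for start in starts:
        seg = [signal[start + n] * window[n] for n in range(seg_len)]
        for k in range(n_bins):
            coef = sum(
                seg[n] * cmath.exp(-2j * math.pi * k * n / seg_len)
                for n in range(seg_len)
            )
            psd[k] += abs(coef) ** 2 / scale
    psd = [p / len(starts) for p in psd]
    freqs = [k * fs / seg_len for k in range(n_bins)]
    return freqs, psd

def band_mean(freqs, psd, low_hz, high_hz):
    """Mean PSD over frequencies in [low_hz, high_hz)."""
    values = [p for f, p in zip(freqs, psd) if low_hz <= f < high_hz]
    return sum(values) / len(values)

# Synthetic recording: a strong low-frequency tone plus a weak high-frequency tone.
fs = 24000
signal = [
    math.sin(2 * math.pi * 1125 * n / fs) + 0.2 * math.sin(2 * math.pi * 6000 * n / fs)
    for n in range(512)
]
freqs, psd = welch_psd(signal, fs, seg_len=64)
anthropogenic = band_mean(freqs, psd, 500, 2000)   # low band, 0.5-2 kHz
biological = band_mean(freqs, psd, 2000, 11000)    # higher band, 2-11 kHz
```

For real recordings, a library routine such as `scipy.signal.welch` would be the practical choice. The hand-rolled version only makes the segment-averaging step explicit.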

preprint · 2020 · arXiv

rFIA: An R package for estimation of forest attributes with the Forest Inventory and Analysis Database

Forest Inventory and Analysis (FIA) is a US Department of Agriculture Forest Service program that aims to monitor changes in forests across the US. FIA hosts one of the largest ecological datasets in the world, though its complexity limits access for many potential users. rFIA is an R package designed to simplify the estimation of forest attributes using data collected by the FIA Program. Specifically, rFIA improves access to the spatio-temporal estimation capacity of the FIA Database via space-time indexed summaries of forest variables within user-defined population boundaries (e.g., geographic, temporal, biophysical). The package implements multiple design-based estimators, and has been validated against official estimates and sampling errors produced by the FIA Program. We demonstrate the utility of rFIA by assessing changes in abundance and mortality rates of ash (Fraxinus spp.) populations in the Lower Peninsula of Michigan following the establishment of emerald ash borer (Agrilus planipennis).
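To make "design-based estimator" concrete, the sketch below computes a population total and its sampling error from plot measurements under simple random sampling. This is the simplest such estimator and is only an assumption for illustration: the estimators FIA uses account for its stratified design, and nothing here calls rFIA. The function name and plot values are hypothetical.

```python
import math
import statistics

def srs_total(plot_values, area):
    """Estimate a population total from per-unit-area plot values.

    Assumes plots are a simple random sample of the area of interest.
    Returns (total, standard error of total, percent sampling error).
    """
    n = len(plot_values)
    mean = statistics.fmean(plot_values)
    # Standard error of the mean from the sample variance (n - 1 denominator).
    se_mean = math.sqrt(statistics.variance(plot_values) / n)
    return area * mean, area * se_mean, 100.0 * se_mean / mean

# Hypothetical per-hectare values on three plots, expanded to 1000 hectares.
total, se_total, pct_error = srs_total([10.0, 20.0, 30.0], area=1000.0)
```

With only three plots the percent sampling error is large, which is the small-sample problem that motivates the small area estimation work listed above.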

preprint · 2019 · arXiv

Bayesian spatially varying coefficient models in the spBayes R package

This paper describes and illustrates new functionality for fitting spatially varying coefficients models in the spBayes (version 0.4-2) R package. The new spSVC function uses a computationally efficient Markov chain Monte Carlo algorithm and extends current spBayes functions, which fit only space-varying intercept regression models, to fit independent or multivariate Gaussian process random effects for any set of columns in the regression design matrix. Newly added OpenMP parallelization options for spSVC are discussed and illustrated, as well as helper functions for joint and point-wise prediction and model fit diagnostics. The utility of the proposed models is illustrated using a PM10 analysis over central Europe.

preprint · 2014 · arXiv

Dynamic spatial regression models for space-varying forest stand tables

Many forest management planning decisions are based on information about the number of trees by species and diameter per unit area. This information is commonly summarized in a stand table, where a stand is defined as a group of forest trees of sufficiently uniform species composition, age, condition, or productivity to be considered a homogeneous unit for planning purposes. Typically information used to construct stand tables is gleaned from observed subsets of the forest selected using a probability-based sampling design. Such sampling campaigns are expensive and hence only a small number of sample units are typically observed. This data paucity means that stand tables can only be estimated for relatively large areal units. Contemporary forest management planning and spatially explicit ecosystem models require stand table input at higher spatial resolution than can be affordably provided using traditional approaches. We propose a dynamic multivariate Poisson spatial regression model that accommodates both spatial correlation between observed diameter distributions and also correlation between tree counts across diameter classes within each location. To improve fit and prediction at unobserved locations, diameter specific intensities can be estimated using auxiliary data such as management history or remotely sensed information. The proposed model is used to analyze a diverse forest inventory dataset collected on the United States Forest Service Penobscot Experimental Forest in Bradley, Maine. Results demonstrate that explicitly modeling the residual spatial structure via a multivariate Gaussian process and incorporating information about forest structure from LiDAR covariates improve model fit and can provide high spatial resolution stand table maps with associated estimates of uncertainty.

preprint · 2013 · arXiv

spBayes for large univariate and multivariate point-referenced spatio-temporal data models

In this paper we detail the reformulation and rewrite of core functions in the spBayes R package. These efforts have focused on improving computational efficiency, flexibility, and usability for point-referenced data models. Attention is given to algorithm and computing developments that result in improved sampler convergence rate and efficiency by reducing parameter space; decreased sampler run-time by avoiding expensive matrix computations; and increased scalability to large datasets by implementing a class of predictive process models that attempt to overcome computational hurdles by representing spatial processes in terms of lower-dimensional realizations. Beyond these general computational improvements for existing model functions, we detail new functions for modeling data indexed in both space and time. These new functions implement a class of dynamic spatio-temporal models for settings where space is viewed as continuous and time is taken as discrete.
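The "lower-dimensional realizations" mentioned in this abstract can be written out in the standard predictive process form. The notation below is assumed for illustration: $w(s)$ is a zero-mean Gaussian process with covariance parameters $\theta$, and $s_1^{*}, \ldots, s_m^{*}$ are $m$ knot locations with $m$ much smaller than the number of observations. Details of the spBayes implementation may differ from this basic form.

```latex
\tilde{w}(s) = c(s; \theta)^{\top} \, C^{*}(\theta)^{-1} \, w^{*},
\qquad
w^{*} = \big( w(s_1^{*}), \ldots, w(s_m^{*}) \big)^{\top}
\sim N\big( 0, \; C^{*}(\theta) \big)
```

Here $c(s; \theta)$ is the $m \times 1$ vector of covariances between $w(s)$ and the process at the knots, and $C^{*}(\theta)$ is the $m \times m$ covariance matrix among the knots. Replacing $w(s)$ with $\tilde{w}(s)$ means the expensive matrix operations involve $m \times m$ matrices instead of $n \times n$ ones.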