Source author record

James V. Zidek

James V. Zidek appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Applications math.ST Statistics Theory Methodology Computation

Catalog footprint

What is connected

9works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Approximately Optimal Spatial Design: How Good is it?

The increasing recognition of the association between adverse human health conditions and many environmental substances as well as processes has led to the need to monitor them. An important problem that arises in environmental statistics is the design of the locations of the monitoring stations for those environmental processes of interest. One particular design criterion for monitoring networks that tries to reduce the uncertainty about predictions of unseen processes is called the maximum-entropy design. However, this design criterion involves a hard optimization problem that is computationally intractable for large data sets. Previous work of Wang et al. (2017) examined a probabilistic model that can be implemented efficiently to approximate the underlying optimization problem. In this paper, we attempt to establish statistically sound tools for assessing the quality of the approximations.

preprint2016arXiv

Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution

Air pollution is a major risk factor for global health, with both ambient and household air pollution contributing substantial components of the overall global disease burden. One of the key drivers of adverse health effects is fine particulate matter ambient pollution (PM$_{2.5}$) to which an estimated 3 million deaths can be attributed annually. The primary source of information for estimating exposures has been measurements from ground monitoring networks but, although coverage is increasing, there remain regions in which monitoring is limited. Ground monitoring data therefore needs to be supplemented with information from other sources, such as satellite retrievals of aerosol optical depth and chemical transport models. A hierarchical modelling approach for integrating data from multiple sources is proposed allowing spatially-varying relationships between ground measurements and other factors that estimate air quality. Set within a Bayesian framework, the resulting Data Integration Model for Air Quality (DIMAQ) is used to estimate exposures, together with associated measures of uncertainty, on a high resolution grid covering the entire world. Bayesian analysis on this scale can be computationally challenging and here approximate Bayesian inference is performed using Integrated Nested Laplace Approximations. Model selection and assessment is performed by cross-validation with the final model offering substantial increases in predictive accuracy, particularly in regions where there is sparse ground monitoring, when compared to current approaches: root mean square error (RMSE) reduced from 17.1 to 10.7, and population weighted RMSE from 23.1 to 12.1 $μ$gm$^{-3}$. Based on summaries of the posterior distributions for each grid cell, it is estimated that 92% of the world's population reside in areas exceeding the World Health Organization's Air Quality Guidelines.

preprint2016arXiv

Spatio-temporal Modelling of Temperature Fields in the Pacific Northwest

The importance of modelling temperature fields goes beyond the need to understand a region's climate and serves too as a starting point for understanding their socioeconomic, and health consequences. The topography of the study region contributes much to the complexity of modelling these fields and demands flexible spatio-temporal models that are able to handle nonstationarity and changes in trend. In this paper, we develop a flexible stochastic spatio-temporal model for daily temperatures in the Pacific Northwest, and describe a methodology for performing Bayesian spatial prediction. A novel aspect of this model, an extension of the spatio-temporal model proposed in Le and Zidek (1992), is its incorporation of site-specific features of a spatio-temporal field in its spatio-temporal mean. Due to the often surprising Pacific Northwestern weather, the analysis reported in the paper shows the need to incorporate spatio-temporal interactions in that mean in order to understand the rapid changes in temperature observed in nearby locations and to get approximately stationary residuals for higher level analysis. No structure is assumed for the spatial covariance matrix of these residuals, thus allowing the model to capture any nonstationary spatial structures remaining in those residuals.

preprint2015arXiv

Comment on Article by Ferreira and Gamerman

Comment on Article by Ferreira and Gamerman [arXiv:1509.03410].

preprint2015arXiv

Hypothesis testing in the presence of multiple samples under density ratio models

This paper presents a hypothesis testing method given independent samples from a number of connected populations. The method is motivated by a forestry project for monitoring change in the strength of lumber. Traditional practice has been built upon nonparametric methods which ignore the fact that these populations are connected. By pooling the information in multiple samples through a density ratio model, the proposed empirical likelihood method leads to a more efficient inference and therefore reduces the cost in applications. The new test has a classical chi-square null limiting distribution. Its power function is obtained under a class of local alternatives. The local power is found increased even when some underlying populations are unrelated to the hypothesis of interest. Simulation studies confirm that this test has better power properties than potential competitors, and is robust to model misspecification. An application example to lumber strength is included.

preprint2014arXiv

Bayesian Melding of the Dead-Reckoned Path and GPS Measurements for an Accurate and High-Resolution Path of Marine Mammals

With the recent advances in electrical engineering, devices attached to free-ranging marine mammals today can collect oceanographic data in remarkable high spatial-temporal resolution. However, those data cannot be fully utilized without a matching high-resolution and accurate path of the animal, which is currently missing in this field. In this paper, we develop a Bayesian melding approach based on a Brownian Bridge process to combine the fine-resolution but seriously biased Dead-Reckoned path and the precise but sparse GPS measurements, which results in an accurate and high-resolution estimated path together with credible bands as quantified uncertainty statements. We also exploit the properties of underlying processes and some approximations to the likelihood to dramatically reduce the computational burden of handling those big high resolution data sets.

preprint2014arXiv

Reducing estimation bias in adaptively changing monitoring networks with preferential site selection

This paper explores the topic of preferential sampling, specifically situations where monitoring sites in environmental networks are preferentially located by the designers. This means the data arising from such networks may not accurately characterize the spatio-temporal field they intend to monitor. Approaches that have been developed to mitigate the effects of preferential sampling in various contexts are reviewed and, building on these approaches, a general framework for dealing with the effects of preferential sampling in environmental monitoring is proposed. Strategies for implementation are proposed, leading to a method for improving the accuracy of official statistics used to report trends and inform regulatory policy. An essential feature of the method is its capacity to learn the preferential selection process over time and hence to reduce bias in these statistics. Simulation studies suggest dramatic reductions in bias are possible. A case study demonstrates use of the method in assessing the levels of air pollution due to black smoke in the UK over an extended period (1970-1996). In particular, dramatic reductions in the estimates of the number of sites out of compliance are observed.

preprint2010arXiv

Predicting phenological events using event-history analysis

This paper presents an approach to phenology, one based on the use of a method developed by the authors for event history data. Of specific interest is the prediction of the so-called "bloom--date" of fruit trees in the agriculture industry and it is this application which we consider, although the method is much more broadly applicable. Our approach provides sensible estimate for a parameter that interests phenologists -- Tbase, the thresholding parameter in the definition of the growing degree days (GDD). Our analysis supports scientists' empirical finding: the timing of a phenological event of a prenniel crop is related the cumulative sum of GDDs. Our prediction of future bloom--dates are quite accurate, but the predictive uncertainty is high, possibly due to our crude climate model for predicting future temperature, the time-dependent covariate in our regression model for phenological events. We found that if we can manage to get accurate prediction of future temperature, our prediction of bloom--date is more accurate and the predictive uncertainty is much lower.