Source author record

Jorge Mateu

Jorge Mateu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology Applications math.ST Statistics Theory math.PR

Catalog footprint

What is connected

12works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Spatio-Temporal Dirichlet Process Mixture Model for Coronavirus Disease-19

Understanding the spatio-temporal patterns of the coronavirus disease 2019 (COVID-19) is essential to construct public health interventions. Spatially referenced data can provide richer opportunities to understand the mechanism of the disease spread compared to the more often encountered aggregated count data. We propose a spatio-temporal Dirichlet process mixture model to analyze confirmed cases of COVID-19 in an urban environment. Our method can detect unobserved cluster centers of the epidemics, and estimate the space-time range of the clusters that are useful to construct a warning system. Furthermore, our model can measure the impact of different types of landmarks in the city, which provides an intuitive explanation of disease spreading sources from different time points. To efficiently capture the temporal dynamics of the disease patterns, we employ a sequential approach that uses the posterior distribution of the parameters for the previous time step as the prior information for the current time step. This approach enables us to incorporate time dependence into our model in a computationally efficient manner without complicating the model structure. We also develop a model assessment by comparing the data with theoretical densities, and outline the goodness-of-fit of our fitted model.

preprint2022arXiv

ANOVA for Data in Metric Spaces, with Applications to Spatial Point Patterns

We give a review of recent ANOVA-like procedures for testing group differences based on data in a metric space and present a new such procedure. Our statistic is based on the classic Levene's test for detecting differences in dispersion. It uses only pairwise distances of data points and and can be computed quickly and precisely in situations where the computation of barycenters ("generalized means") in the data space is slow, only by approximation or even infeasible. We show the asymptotic normality of our test statistic and present simulation studies for spatial point pattern data, in which we compare the various procedures in a 1-way ANOVA setting. As an application, we perform a 2-way ANOVA on a data set of bubbles in a mineral flotation process.

preprint2022arXiv

Clustering constrained on linear networks

An unsupervised classification method for point events occurring on a network of lines is proposed. The idea relies on the distributional flexibility and practicality of random partition models to discover the clustering structure featuring observations from a particular phenomenon taking place on a given set of edges. By incorporating the spatial effect in the random partition distribution, induced by a Dirichlet process, one is able to control the distance between edges and events, thus leading to an appealing clustering method. A Gibbs sampler algorithm is proposed and evaluated with a sensitivity analysis. The proposal is motivated and illustrated by the analysis of crime and violence patterns in Mexico City.

preprint2022arXiv

Generalised functional additive mixed models with compositional covariates for areal Covid-19 incidence curves

We extend the generalised functional additive mixed model to include (functional) compositional covariates carrying relative information of a whole. Relying on the isometric isomorphism of the Bayes Hilbert space of probability densities with a subspace of the $L^2$, we include functional compositions as transformed functional covariates with constrained effect function. The extended model allows for the estimation of linear, nonlinear and time-varying effects of scalar and functional covariates, as well as (correlated) functional random effects, in addition to the compositional effects. We use the model to estimate the effect of the age, sex and smoking (functional) composition of the population on regional Covid-19 incidence data for Spain, while accounting for climatological and socio-demographic covariate effects and spatial correlation.

preprint2022arXiv

Mapping the intensity function of a non-stationary point process in unobserved areas

Seismic networks provide data that are used as basis both for public safety decisions and for scientific research. Their configuration affects the data completeness, which in turn, critically affects several seismological scientific targets (e.g., earthquake prediction, seismic hazard...). In this context, a key aspect is how to map earthquakes density in seismogenic areas from censored data or even in areas that are not covered by the network. We propose to predict the spatial distribution of earthquakes from the knowledge of presence locations and geological relationships, taking into account any interaction between records. Namely, in a more general setting, we aim to estimate the intensity function of a point process, conditional to its censored realization, as in geostatistics for continuous processes. We define a predictor as the best linear unbiased combination of the observed point pattern. We show that the weight function associated to the predictor is the solution of a Fredholm equation of second kind. Both the kernel and the source term of the Fredholm equation are related to the first-and second-order characteristics of the point process through the intensity and the pair correlation function. Results are presented and illustrated on simulated non-stationary point processes and real data for mapping Greek Hellenic seismicity in a region with unreliable and incomplete records.

preprint2022arXiv

Nonparametric testing of the dependence structure among points-marks-covariates in spatial point patterns

We investigate testing of the hypothesis of independence between a covariate and the marks in a marked point process. It would be rather straightforward if the (unmarked) point process were independent of the covariate and the marks. In practice, however, such an assumption is questionable and possible dependence between the point process and the covariate or the marks may lead to incorrect conclusions. Therefore, we propose to investigate the complete dependence structure in the triangle points--marks--covariates together. We take advantage of the recent development of the nonparametric random shift methods, namely the new variance correction approach, and propose tests of the null hypothesis of independence between the marks and the covariate and between the points and the covariate. We present a detailed simulation study showing the performance of the methods and provide two theorems establishing the appropriate form of the correction factors for the variance correction. Finally, we illustrate the use of the proposed methods in two real applications.

preprint2020arXiv

Analyzing Car Thefts and Recoveries with Connections to Modeling Origin-Destination Point Patterns

For a given region, we have a dataset composed of car theft locations along with a linked dataset of recovery locations which, due to partial recovery, is a relatively small subset of the set of theft locations. For an investigator seeking to understand the behavior of car thefts and recoveries in the region, several questions are addressed. Viewing the set of theft locations as a point pattern, can we propose useful models to explain the pattern? What types of predictive models can be built to learn about recovery location given theft location? Can the dependence between theft locations and recovery locations be formalized? Can the flow between theft sites and recovery sites be captured? Origin-destination modeling offers a natural framework for such problems. However, here the data is not for areal units but rather is a pair of point patterns, with the recovery point pattern only partially observed. We offer modeling approaches for investigating the questions above and apply the approaches to two datasets. One is small from the state of Neza in Mexico with areal covariate information regarding population features and crime type. A second, much larger one, is from Belo Horizonte in Brazil but lacks covariates.

preprint2020arXiv

Graphical modelling and partial characteristics for multitype and multivariate-marked spatio-temporal point processes

This paper contributes to the multivariate analysis of marked spatio-temporal point process data by introducing different partial point characteristics and extending the spatial dependence graph model formalism. Our approach yields a unified framework for different types of spatio-temporal data including both, purely qualitatively (multivariate) cases and multivariate cases with additional quantitative marks. The proposed graphical model is defined through partial spectral density characteristics, it is highly computationally efficient and reflects the conditional similarity among sets of spatio-temporal sub-processes of either points or marked points with identical discrete marks. The paper considers three applications, two on crime data and a third one on forestry.

preprint2019arXiv

Revisiting the random shift approach for testing in spatial statistics

We consider the problem of non-parametric testing of independence of two components of a stationary bivariate spatial process. In particular, we revisit the random shift approach that has become a standard method for testing the independent superposition hypothesis in spatial statistics, and it is widely used in a plethora of practical applications. However, this method has a problem of liberality caused by breaking the marginal spatial correlation structure due to the toroidal correction. This indeed causes that the assumption of exchangability, which is essential for the Monte Carlo test to be exact, is not fulfilled. We present a number of permutation strategies and show that the random shift with the variance correction brings a suitable improvement compared to the torus correction in the random field case. It reduces the liberality and achieves the largest power from all investigated variants. To obtain the variance for the variance correction method, several approaches were studied. The best results were achieved, for the sample covariance as the test statistics, with the correction factor $1/n$. This corresponds to the asymptotic order of the variance of the test statistics. In the point process case, the problem of deviations from exchangeability is far more complex and we propose an alternative strategy based on the mean cross nearest-neighbor distance and torus correction. It reduces the liberality but achieves slightly lower power than the usual cross $K$-function. Therefore we recommend it, when the point patterns are clustered, where the cross $K$-function achieves liberality.

preprint2016arXiv

Graphical modelling of multivariate spatial point processes with continuous marks

This paper is the second in a series of papers which combine graphical modelling and marked spatial point patterns. Extending the previous results of \cite Eckardt (2016a), we introduce a marked spatial dependence graph model which depicts the global dependence structure of quantitatively marked multi-type points that occur in space based on the marked conditional partial spectral coherence. Most beneficial, no structural assumption with respect to the characteristics in the data are to be made prior to analysis. This approach presents a computationally efficient method of pattern recognition in highly structured and high dimensional multi-type spatial point processes where also quantitative marks are available. Unlike all previous methods, our new model permits the simultaneous analysis of all multivariate conditional interrelations. The new technique is illustrated analysing the diameter at breast hight of 37 different tree species recorded at 10053 locations in Duke Forest.

preprint2016arXiv

Structured network regression for spatial point patterns

The analysis of spatial point patterns that occur in the network domain have recently gained much attraction and various intensity functions and measures have been proposed. However, the linkage of spatial network statistics to regression models has not been approached so far. This paper presents a new regression approach which treats a generic intensity function of a planar point pattern that occurred on a network as the outcome of a set of different covariates and various graph statistics. Different to all alternative approaches, our model is the first which permits the statistical analysis of complex regression data in the context of network intensity functions for spatial point patterns. The potential of our new technique to model the structural dependencies of network intensity functions on various covariates and graph statistics is illustrated using call-in data on neighbour and community disturbances in an urban context.

preprint2014arXiv

Spatio-temporal càdlàg functional marked point processes: Unifying spatio-temporal frameworks

This paper defines the class of càdlàg functional marked point processes (CFMPPs). These are (spatio-temporal) point processes marked by random elements which take values in a càdlàg function space, i.e. the marks are given by càdlàg stochastic processes. We generalise notions of marked (spatio-temporal) point processes and indicate how this class, in a sensible way, connects the point process framework with the random fields framework. We also show how they can be used to construct a class of spatio-temporal Boolean models, how to construct different classes of these models by choosing specific mark functions, and how càdlàg functional marked Cox processes have a double connection to random fields. We also discuss finite CFMPPs, purely temporally well-defined CFMPPs and Markov CFMPPs. Furthermore, we define characteristics such as product densities, Palm distributions and conditional intensities, in order to develop statistical inference tools such as likelihood estimation schemes.

Jorge Mateu

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

A Spatio-Temporal Dirichlet Process Mixture Model for Coronavirus Disease-19

ANOVA for Data in Metric Spaces, with Applications to Spatial Point Patterns

Clustering constrained on linear networks

Generalised functional additive mixed models with compositional covariates for areal Covid-19 incidence curves

Mapping the intensity function of a non-stationary point process in unobserved areas

Nonparametric testing of the dependence structure among points-marks-covariates in spatial point patterns

Analyzing Car Thefts and Recoveries with Connections to Modeling Origin-Destination Point Patterns

Graphical modelling and partial characteristics for multitype and multivariate-marked spatio-temporal point processes

Revisiting the random shift approach for testing in spatial statistics

Graphical modelling of multivariate spatial point processes with continuous marks

Structured network regression for spatial point patterns

Spatio-temporal càdlàg functional marked point processes: Unifying spatio-temporal frameworks