Researcher profile

Jorge Mateu

Jorge Mateu contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
9 works
0 followers
4 topics
4 close collaborators

Published work

9 published items

preprint, 2022, arXiv

A Spatio-Temporal Dirichlet Process Mixture Model for Coronavirus Disease-19

Understanding the spatio-temporal patterns of the coronavirus disease 2019 (COVID-19) is essential to construct public health interventions. Spatially referenced data can provide richer opportunities to understand the mechanism of the disease spread compared to the more often encountered aggregated count data. We propose a spatio-temporal Dirichlet process mixture model to analyze confirmed cases of COVID-19 in an urban environment. Our method can detect unobserved cluster centers of the epidemics, and estimate the space-time range of the clusters that are useful to construct a warning system. Furthermore, our model can measure the impact of different types of landmarks in the city, which provides an intuitive explanation of disease spreading sources from different time points. To efficiently capture the temporal dynamics of the disease patterns, we employ a sequential approach that uses the posterior distribution of the parameters for the previous time step as the prior information for the current time step. This approach enables us to incorporate time dependence into our model in a computationally efficient manner without complicating the model structure. We also develop a model assessment by comparing the data with theoretical densities, and outline the goodness-of-fit of our fitted model.
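As a rough illustration of the sequential idea in the abstract above, where the posterior from one time step becomes the prior for the next, here is a toy Python sketch. It uses a conjugate Gamma-Poisson model for case counts, not the paper's spatio-temporal Dirichlet process mixture; the function name and the hyperparameters `shape0` and `rate0` are assumptions made only for illustration.

```python
from typing import List, Sequence, Tuple


def sequential_gamma_poisson(
    counts_by_step: Sequence[Sequence[int]],
    shape0: float = 1.0,  # assumed prior shape (hypothetical default)
    rate0: float = 1.0,   # assumed prior rate (hypothetical default)
) -> List[Tuple[float, float]]:
    """Update a Gamma(shape, rate) belief about a Poisson rate step by step.

    After each time step the posterior parameters are stored and then
    reused as the prior for the following step.
    """
    shape, rate = shape0, rate0
    history = []
    for counts in counts_by_step:
        # Conjugate update: add the observed counts to the shape and
        # the number of observations to the rate.
        shape += sum(counts)
        rate += len(counts)
        history.append((shape, rate))
    return history


# Two time steps: counts 3 and 5 in the first, count 4 in the second.
posteriors = sequential_gamma_poisson([[3, 5], [4]])
for step, (shape, rate) in enumerate(posteriors, start=1):
    print(f"step {step}: posterior mean rate = {shape / rate:.2f}")
```

In this toy model the sequential result coincides with a single batch update; the point is only to show the posterior-to-prior hand-off that keeps each step cheap.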

preprint, 2022, arXiv

ANOVA for Data in Metric Spaces, with Applications to Spatial Point Patterns

We give a review of recent ANOVA-like procedures for testing group differences based on data in a metric space and present a new such procedure. Our statistic is based on the classic Levene's test for detecting differences in dispersion. It uses only pairwise distances of data points and can be computed quickly and precisely in situations where the computation of barycenters ("generalized means") in the data space is slow, possible only by approximation, or even infeasible. We show the asymptotic normality of our test statistic and present simulation studies for spatial point pattern data, in which we compare the various procedures in a 1-way ANOVA setting. As an application, we perform a 2-way ANOVA on a data set of bubbles in a mineral flotation process.
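To make the "pairwise distances only" idea concrete, the following Python sketch builds a Levene-style dispersion test that never computes a barycenter. It is not the statistic proposed in the paper: each point is scored by its mean distance to the other points of its group, a one-way ANOVA F statistic is computed on those scores, and the p-value comes from permuting group labels. All function names are hypothetical, and each group is assumed to contain at least two points.

```python
import numpy as np


def dispersion_scores(dist, labels):
    """Mean distance from each point to the other points in its own group."""
    scores = np.empty(len(labels))
    for g in np.unique(labels):
        idx = np.flatnonzero(labels == g)
        sub = dist[np.ix_(idx, idx)]
        # The diagonal is zero, so divide by the number of *other* points.
        scores[idx] = sub.sum(axis=1) / (len(idx) - 1)
    return scores


def f_statistic(values, labels):
    """Classic one-way ANOVA F statistic of `values` across groups."""
    groups = [values[labels == g] for g in np.unique(labels)]
    n, k = len(values), len(groups)
    grand = values.mean()
    between = sum(len(g) * (g.mean() - grand) ** 2 for g in groups) / (k - 1)
    within = sum(((g - g.mean()) ** 2).sum() for g in groups) / (n - k)
    return between / within


def levene_type_permutation_test(dist, labels, n_perm=499, seed=0):
    """Permutation test for equal dispersion using a distance matrix only."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    observed = f_statistic(dispersion_scores(dist, labels), labels)
    count = 0
    for _ in range(n_perm):
        perm = rng.permutation(labels)
        if f_statistic(dispersion_scores(dist, perm), perm) >= observed:
            count += 1
    return observed, (1 + count) / (n_perm + 1)


# Example: two planar point clouds with different spread.
rng = np.random.default_rng(1)
pts = np.vstack([rng.normal(0, 1, (30, 2)), rng.normal(0, 5, (30, 2))])
labels = np.array([0] * 30 + [1] * 30)
dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
stat, p_value = levene_type_permutation_test(dist, labels, n_perm=199)
print(f"F = {stat:.2f}, permutation p-value = {p_value:.3f}")
```

Because only the matrix `dist` enters the computation, the same sketch would apply to any metric, for example a distance between point patterns, once that matrix is available.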

preprint, 2022, arXiv

Clustering constrained on linear networks

An unsupervised classification method for point events occurring on a network of lines is proposed. The idea relies on the distributional flexibility and practicality of random partition models to discover the clustering structure featuring observations from a particular phenomenon taking place on a given set of edges. By incorporating the spatial effect in the random partition distribution, induced by a Dirichlet process, one is able to control the distance between edges and events, thus leading to an appealing clustering method. A Gibbs sampler algorithm is proposed and evaluated with a sensitivity analysis. The proposal is motivated and illustrated by the analysis of crime and violence patterns in Mexico City.

preprint, 2022, arXiv

Generalised functional additive mixed models with compositional covariates for areal Covid-19 incidence curves

We extend the generalised functional additive mixed model to include (functional) compositional covariates carrying relative information of a whole. Relying on the isometric isomorphism of the Bayes Hilbert space of probability densities with a subspace of $L^2$, we include functional compositions as transformed functional covariates with constrained effect function. The extended model allows for the estimation of linear, nonlinear and time-varying effects of scalar and functional covariates, as well as (correlated) functional random effects, in addition to the compositional effects. We use the model to estimate the effect of the age, sex and smoking (functional) composition of the population on regional Covid-19 incidence data for Spain, while accounting for climatological and socio-demographic covariate effects and spatial correlation.

preprint, 2022, arXiv

Mapping the intensity function of a non-stationary point process in unobserved areas

Seismic networks provide data that are used as a basis both for public safety decisions and for scientific research. Their configuration affects the data completeness, which, in turn, critically affects several seismological scientific targets (e.g., earthquake prediction, seismic hazard...). In this context, a key aspect is how to map earthquake density in seismogenic areas from censored data or even in areas that are not covered by the network. We propose to predict the spatial distribution of earthquakes from the knowledge of presence locations and geological relationships, taking into account any interaction between records. Namely, in a more general setting, we aim to estimate the intensity function of a point process, conditional on its censored realization, as in geostatistics for continuous processes. We define a predictor as the best linear unbiased combination of the observed point pattern. We show that the weight function associated with the predictor is the solution of a Fredholm equation of the second kind. Both the kernel and the source term of the Fredholm equation are related to the first- and second-order characteristics of the point process through the intensity and the pair correlation function. Results are presented and illustrated on simulated non-stationary point processes and real data for mapping Greek Hellenic seismicity in a region with unreliable and incomplete records.

preprint, 2022, arXiv

Nonparametric testing of the dependence structure among points-marks-covariates in spatial point patterns

We investigate testing of the hypothesis of independence between a covariate and the marks in a marked point process. This would be rather straightforward if the (unmarked) point process were independent of the covariate and the marks. In practice, however, such an assumption is questionable, and possible dependence between the point process and the covariate or the marks may lead to incorrect conclusions. Therefore, we propose to investigate the complete dependence structure in the triangle points-marks-covariates together. We take advantage of recent developments in nonparametric random shift methods, namely the new variance correction approach, and propose tests of the null hypothesis of independence between the marks and the covariate and between the points and the covariate. We present a detailed simulation study showing the performance of the methods and provide two theorems establishing the appropriate form of the correction factors for the variance correction. Finally, we illustrate the use of the proposed methods in two real applications.

preprint, 2020, arXiv

Analyzing Car Thefts and Recoveries with Connections to Modeling Origin-Destination Point Patterns

For a given region, we have a dataset composed of car theft locations along with a linked dataset of recovery locations which, due to partial recovery, is a relatively small subset of the set of theft locations. For an investigator seeking to understand the behavior of car thefts and recoveries in the region, several questions are addressed. Viewing the set of theft locations as a point pattern, can we propose useful models to explain the pattern? What types of predictive models can be built to learn about recovery location given theft location? Can the dependence between theft locations and recovery locations be formalized? Can the flow between theft sites and recovery sites be captured? Origin-destination modeling offers a natural framework for such problems. However, here the data is not for areal units but rather is a pair of point patterns, with the recovery point pattern only partially observed. We offer modeling approaches for investigating the questions above and apply the approaches to two datasets. One is small from the state of Neza in Mexico with areal covariate information regarding population features and crime type. A second, much larger one, is from Belo Horizonte in Brazil but lacks covariates.

preprint, 2020, arXiv

Graphical modelling and partial characteristics for multitype and multivariate-marked spatio-temporal point processes

This paper contributes to the multivariate analysis of marked spatio-temporal point process data by introducing different partial point characteristics and extending the spatial dependence graph model formalism. Our approach yields a unified framework for different types of spatio-temporal data, including both purely qualitative (multivariate) cases and multivariate cases with additional quantitative marks. The proposed graphical model is defined through partial spectral density characteristics; it is highly computationally efficient and reflects the conditional similarity among sets of spatio-temporal sub-processes of either points or marked points with identical discrete marks. The paper considers three applications, two on crime data and a third one on forestry.

preprint, 2019, arXiv

Revisiting the random shift approach for testing in spatial statistics

We consider the problem of non-parametric testing of independence of two components of a stationary bivariate spatial process. In particular, we revisit the random shift approach, which has become a standard method for testing the independent superposition hypothesis in spatial statistics and is widely used in a plethora of practical applications. However, this method has a problem of liberality caused by breaking the marginal spatial correlation structure due to the toroidal correction. As a result, the assumption of exchangeability, which is essential for the Monte Carlo test to be exact, is not fulfilled. We present a number of permutation strategies and show that the random shift with the variance correction brings a suitable improvement compared to the torus correction in the random field case. It reduces the liberality and achieves the largest power among all investigated variants. To obtain the variance for the variance correction method, several approaches were studied. With the sample covariance as the test statistic, the best results were achieved with the correction factor $1/n$. This corresponds to the asymptotic order of the variance of the test statistic. In the point process case, the problem of deviations from exchangeability is far more complex, and we propose an alternative strategy based on the mean cross nearest-neighbor distance and torus correction. It reduces the liberality but achieves slightly lower power than the usual cross $K$-function. We therefore recommend it when the point patterns are clustered, a case in which the cross $K$-function is liberal.
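For readers unfamiliar with the baseline procedure, the sketch below shows a plain random shift test with torus correction for two random fields observed on the same rectangular grid, using the sample covariance as the test statistic. This is the classical variant that the abstract describes as liberal; the variance correction studied in the paper is not implemented here. Function names and defaults are assumptions for illustration.

```python
import numpy as np


def sample_covariance(x, y):
    """Sample covariance between two fields observed on the same grid."""
    return float(np.mean((x - x.mean()) * (y - y.mean())))


def random_shift_test_torus(x, y, n_shifts=199, seed=0):
    """Monte Carlo p-value for independence of x and y via toroidal shifts.

    The field x is shifted by a random offset and wrapped around the grid
    edges (torus correction); y is kept fixed.
    """
    rng = np.random.default_rng(seed)
    rows, cols = x.shape
    observed = sample_covariance(x, y)
    shifted = np.empty(n_shifts)
    for i in range(n_shifts):
        dr = int(rng.integers(0, rows))
        dc = int(rng.integers(0, cols))
        if dr == 0 and dc == 0:
            dr = 1  # skip the identity shift
        # np.roll wraps values around the edges, i.e. a shift on the torus.
        shifted[i] = sample_covariance(np.roll(x, (dr, dc), axis=(0, 1)), y)
    # Two-sided Monte Carlo p-value that counts the observed value itself.
    exceed = int(np.sum(np.abs(shifted) >= abs(observed)))
    return (1 + exceed) / (n_shifts + 1)


# Example: a field tested against a noisy copy of itself (strong dependence).
rng = np.random.default_rng(2)
x = rng.normal(size=(20, 20))
y = x + 0.1 * rng.normal(size=(20, 20))
print("p-value:", random_shift_test_torus(x, y))
```

The wrap-around performed by `np.roll` is what breaks the marginal correlation structure near the seam for spatially correlated fields, which is the source of the liberality discussed in the abstract.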