Researcher profile

Emilio Porcu

Emilio Porcu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

Data Science: a Natural Ecosystem

This manuscript provides a systemic and data-centric view of what we term essential data science, as a natural ecosystem with challenges and missions stemming from the fusion of data universe with its multiple combinations of the 5D complexities (data structure, domain, cardinality, causality, and ethics) with the phases of the data life cycle. Data agents perform tasks driven by specific goals. The data scientist is an abstract entity that comes from the logical organization of data agents with their actions. Data scientists face challenges that are defined according to the missions. We define specific discipline-induced data science, which in turn allows for the definition of pan-data science, a natural ecosystem that integrates specific disciplines with the essential data science. We semantically split the essential data science into computational, and foundational. By formalizing this ecosystemic view, we contribute a general-purpose, fusion-oriented architecture for integrating heterogeneous knowledge, agents, and workflows-relevant to a wide range of disciplines and high-impact applications.

preprint2023arXiv

Hybrid Parametric Classes of Isotropic Covariance Functions for Spatial Random Fields

Covariance functions are the core of spatial statistics, stochastic processes, machine learning as well as many other theoretical and applied disciplines. The properties of the covariance function at small and large distances determine the geometric attributes of the associated Gaussian random field. Having covariance functions that allow to specify both local and global properties is certainly on demand. This paper provides a method to find new classes of covariance functions having such properties. We term these models hybrid as they are obtained as scale mixtures of piecewise covariance kernels against measures that are also defined as piecewise linear combination of parametric families of measures. In order to illustrate our methodology, we provide new families of covariance functions that are proved to be richer with respect to other well known families that have been proposed by earlier literature. More precisely, we derive a hybrid Cauchy-Matérn model, which allows us to index both long memory and mean square differentiability of the random field, and a hybrid Hole-Effect-Matérn model, which is capable of attaining negative values (hole effect), while preserving the local attributes of the traditional Matérn model. Our findings are illustrated through numerical studies with both simulated and real data.

preprint2022arXiv

A Riemann-Stein Kernel Method

This paper proposes and studies a numerical method for approximation of posterior expectations based on interpolation with a Stein reproducing kernel. Finite-sample-size bounds on the approximation error are established for posterior distributions supported on a compact Riemannian manifold, and we relate these to a kernel Stein discrepancy (KSD). Moreover, we prove in our setting that the KSD is equivalent to Sobolev discrepancy and, in doing so, we completely characterise the convergence-determining properties of KSD. Our contribution is rooted in a novel combination of Stein's method, the theory of reproducing kernels, and existence and regularity results for partial differential equations on a Riemannian manifold.

preprint2022arXiv

Dimension Walks on Generalized Spaces

Let $d,k$ be positive integers. We call generalized spaces the cartesian product of the $d$-dimensional sphere, $\mathbb{S}^d$, with the $k$-dimensional Euclidean space, $\mathbb{R}^k$. We consider the class ${\mathcal P}(\mathbb{S}^d \times \mathbb{R}^k)$ of continuous functions $φ: [-1,1] \times [0,\infty) \to \mathbb{R}$ such that the mapping $C: \left ( \mathbb{S}^d \times\mathbb{R}^k \right )^2 \to \mathbb{R}$, defined as $C \Big ( (x,y),(x^{\prime},y^{\prime})\Big ) = φ\Big ( \cos θ(x,x^{\prime}), \|y-y^{\prime}\| \Big )$, $(x,y), \; (x^{\prime},y^{\prime}) \in \mathbb{S}^d \times \mathbb{R}^k$, is positive definite. We propose linear operators that allow for walks through dimension within generalized spaces while preserving positive definiteness.

preprint2022arXiv

Flexible Validity Conditions for the Multivariate Matérn Covariance in any Spatial Dimension and for any Number of Components

This paper addresses the problem of finding parametric constraints that ensure the validity of the multivariate Mat{é}rn covariance for modeling the spatial correlation structure of coregionalized variables defined in an Euclidean space. To date, much attention has been given to the bivariate setting, while the multivariate setting has been explored to a limited extent only. The existing conditions often imply severe restrictions on the upper bounds for the collocated correlation coefficients, which makes the multivariate Mat{é}rn model appealing for the case of weak spatial cross-dependence only. We provide a collection of sufficient validity conditions for the multivariate Mat{é}rn covariance that allows for more flexible parameterizations than those currently available, and prove that one can attain considerably higher upper bounds for the collocated correlation coefficients in comparison with our competitors. We conclude with an illustration on a trivariate geochemical data set and show that our enlarged parametric space yields better fitting performances.

preprint2022arXiv

Multivariate Gaussian Random Fields over Generalized Product Spaces involving the Hypertorus

The paper deals with multivariate Gaussian random fields defined over generalized product spaces that involve the hypertorus. The assumption of Gaussianity implies the finite dimensional distributions to be completely specified by the covariance functions, being in this case matrix valued mappings. We start by considering the spectral representations that in turn allow for a characterization of such covariance functions. We then provide some methods for the construction of these matrix valued mappings. Finally, we consider strategies to evade radial symmetry (called isotropy in spatial statistics) and provide representation theorems for such a more general case.

preprint2022arXiv

Nonseparable Space-Time Stationary Covariance Functions on Networks cross Time

The advent of data science has provided an increasing number of challenges with high data complexity. This paper addresses the challenge of space-time data where the spatial domain is not a planar surface, a sphere, or a linear network, but a generalized network (termed a graph with Euclidean edges). Additionally, data are repeatedly measured over different temporal instants. We provide new classes of nonseparable space-time stationary covariance functions where {\em space} can be a generalized network, a Euclidean tree, or a linear network, and where time can be linear or circular (seasonal). Because the construction principles are technical, we focus on illustrations that guide the reader through the construction of statistically interpretable examples. A simulation study demonstrates that we can recover the correct model when compared to misspecified models. In addition, our simulation studies show that we effectively recover simulation parameters. In our data analysis, we consider a traffic accident dataset that shows improved model performance based on covariance specifications and network-based metrics.

preprint2022arXiv

Rudin Extension Theorems on Product Spaces, Turning Bands, and Random Fields on Balls cross Time

Characteristic functions that are radially symmetric have a dual interpretation, as they can be used as the isotropic correlation functions of spatial random fields. Extensions of isotropic correlation functions from balls into $d$-dimensional Euclidean spaces, $\R^{d}$, have been understood after Rudin. Yet, extension theorems on product spaces are elusive, and a counterexample provided by Rudin on rectangles suggest that the problem is quite challenging. This paper provides extension theorem for multiradial characteristic functions that are defined in balls embedded in $\R^d$ cross, either $\R^{\dd}$ or the unit sphere $§^{\dd}$ embedded in $\R^{\dd+1}$, for any two positive integers $d$ and $\dd$. We then examine Turning Bands operators that provide bijections between the class of multiradial correlation functions in given product spaces, and multiradial correlations in product spaces having different dimensions. The combination of extension theorems with Turning Bands provides a connection with random fields that are defined in balls cross linear or circular time.

preprint2021arXiv

A deep look into the Dagum family of isotropic covariance functions

The Dagum family of isotropic covariance functions has two parameters that allow for decoupling of the fractal dimension and Hurst effect for Gaussian random fields that are stationary and isotropic over Euclidean spaces. Sufficient conditions that allow for positive definiteness in Rd of the Dagum family have been proposed on the basis of the fact that the Dagum family allows for complete monotonicity under some parameter restrictions. The spectral properties of the Dagum family have been inspected to a very limited extent only, and this paper gives insight into this direction. Specifically, we study finite and asymptotic properties of the isotropic spectral density (intended as the Hankel transform) of the Dagum model. Also, we establish some closed forms expressions for the Dagum spectral density in terms of the Fox{Wright functions. Finally, we provide asymptotic properties for such a class of spectral densities.

preprint2021arXiv

The $\mathcal{F}$-family of covariance functions: A Matérn analogue for modeling random fields on spheres

The Mat{é}rn family of isotropic covariance functions has been central to the theoretical development and application of statistical models for geospatial data. For global data defined over the whole sphere representing planet Earth, the natural distance between any two locations is the great circle distance. In this setting, the Mat{é}rn family of covariance functions has a restriction on the smoothness parameter, making it an unappealing choice to model smooth data. Finding a suitable analogue for modelling data on the sphere is still an open problem. This paper proposes a new family of isotropic covariance functions for random fields defined over the sphere. The proposed family has a parameter that indexes the mean square differentiability of the corresponding Gaussian field, and allows for any admissible range of fractal dimension. Our simulation study mimics the fixed domain asymptotic setting, which is the most natural regime for sampling on a closed and bounded set. As expected, our results support the analogous results (under the same asymptotic scheme) for planar processes that not all parameters can be estimated consistently. We apply the proposed model to a dataset of precipitable water content over a large portion of the Earth, and show that the model gives more precise predictions of the underlying process at unsampled locations than does the Mat{é}rn model using chordal distances.

preprint2020arXiv

Asymptotically Equivalent Prediction in Multivariate Geostatistics

Cokriging is the common method of spatial interpolation (best linear unbiased prediction) in multivariate geostatistics. While best linear prediction has been well understood in univariate spatial statistics, the literature for the multivariate case has been elusive so far. The new challenges provided by modern spatial datasets, being typically multivariate, call for a deeper study of cokriging. In particular, we deal with the problem of misspecified cokriging prediction within the framework of fixed domain asymptotics. Specifically, we provide conditions for equivalence of measures associated with multivariate Gaussian random fields, with index set in a compact set of a d-dimensional Euclidean space. Such conditions have been elusive for over about 50 years of spatial statistics. We then focus on the multivariate Matérn and Generalized Wendland classes of matrix valued covariance functions, that have been very popular for having parameters that are crucial to spatial interpolation, and that control the mean square differentiability of the associated Gaussian process. We provide sufficient conditions, for equivalence of Gaussian measures, relying on the covariance parameters of these two classes. This enables to identify the parameters that are crucial to asymptotically equivalent interpolation in multivariate geostatistics. Our findings are then illustrated through simulation studies.

preprint2019arXiv

Towards a Complete Picture of Stationary Covariance Functions on Spheres Cross Time

With the advent of wide-spread global and continental-scale spatiotemporal datasets, increased attention has been given to covariance functions on spheres over time. This paper provides results for stationary covariance functions of random fields defined over $d$-dimensional spheres cross time. Specifically, we provide a bridge between the characterization in \cite{berg-porcu} for covariance functions on spheres cross time and Gneiting's lemma \citep{gneiting2002} that deals with planar surfaces. We then prove that there is a valid class of covariance functions similar in form to the Gneiting class of space-time covariance functions \citep{gneiting2002} that replaces the squared Euclidean distance with the great circle distance. Notably, the provided class is shown to be positive definite on every $d$-dimensional sphere cross time, while the Gneiting class is positive definite over $\R^d \times \R$ for fixed $d$ only. In this context, we illustrate the value of our adapted Gneiting class by comparing examples from this class to currently established nonseparable covariance classes using out-of-sample predictive criteria. These comparisons are carried out on two climate reanalysis datasets from the National Centers for Environmental Prediction and National Center for Atmospheric Research. For these datasets, we show that examples from our covariance class have better predictive performance than competing models.