Source author record

Jean-François Coeurjolly

Jean-François Coeurjolly appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

28works
6topics
4close collaborators

Actions

Connect this record

Log in to claim

Research graph

See the researcher in context

Open full explorer

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

28 published item(s)

preprint2026arXiv

Simulation of warping processes with applications to temperature data

Curve registration plays a major role in functional data analysis by separating amplitude and phase variation through warping functions and the accurate simulation of warping processes is essential for developing statistical methods that properly account for phase variability in functional data. In this paper, we focus on the simulation of continuous warping processes with a prescribed expectation and a controllable variance. We study and compare three procedures, including two existing methods and a new algorithm based on randomized empirical cumulative distribution functions. For each approach, we provide an operational description and establish theoretical results for the first two moments of the simulated processes. A numerical study illustrates the theoretical findings and highlights the respective merits of the three methods. Finally, we present an application to the analysis of temperature distributions in Montreal based on simulated realizations from a warping process estimated from temperature quantile functions.

preprint2026arXiv

When Imbalance Comes Twice: Active Learning under Simulated Class Imbalance and Label Shift in Binary Semantic Segmentation

The aim of Active Learning is to select the most informative samples from an unlabelled set of data. This is useful in cases where the amount of data is large and labelling is expensive, such as in machine vision or medical imaging. Two particularities of machine vision are first, that most of the images produced are free of defects, and second, that the amount of images produced is so big that we cannot store all acquired images. This results, on the one hand, in a strong class imbalance in defect distribution and, on the other hand, in a potential label shift caused by limited storage. To understand how these two forms of imbalance affect active learning algorithms, we propose a simulation study based on two open-source datasets. We artificially create datasets for which we control the levels of class imbalance and label shift. Three standard active learning selection strategies are compared: random sampling, entropy-based selection, and core-set selection. We demonstrate that active learning strategies, and in particular the entropy-based and core-set selections, remain interesting and efficient even for highly imbalanced datasets. We also illustrate and measure the loss of efficiency that occurs in the situation a strong label shift.

preprint2025arXiv

Marked point processes intensity estimation using sparse group Lasso method applied to locations of lucrative and cooperative banks in mainland France

In this paper, we model the locations of five major banks in mainland France, two lucrative and three cooperative institutions based on socio-economic considerations. Locations of banks are collected using web scrapping and constitute a bivariate spatial point process for which we estimate nonparametrically summary functions (intensity, Ripley and cross-Ripley's K functions). This shows that the pattern is highly inhomogenenous and exhibits a clustering effect especially at small scales, and thus a significant departure to the bivariate (inhomogeneous) Poisson point process is pointed out. We also collect socio-economic datasets (at the living area level) from INSEE and propose a parametric modelling of the intensity function using these covariates. We propose a group-penalized bivariate composite likelihood method to estimate the model parameters, and we establish its asymptotic properties. The application of the methodology to the banking dataset provides new insights into the specificity of the cooperative model within the sector, particularly in relation to the theories of institutional isomorphism.

preprint2023arXiv

Pairwise interaction function estimation of Gibbs point processes using basis expansion

The class of Gibbs point processes (GPP) is a large class of spatial point processes able to model both clustered and repulsive point patterns. They are specified by their conditional intensity, which for a point pattern $\mathbf{x}$ and a location $u$, is roughly speaking the probability that an event occurs in an infinitesimal ball around $u$ given the rest of the configuration is $\mathbf{x}$. The most simple and natural class of models is the class of pairwise interaction point processes where the conditional intensity depends on the number of points and pairwise distances between them. This paper is concerned with the problem of estimating the pairwise interaction function non parametrically. We propose to estimate it using an orthogonal series expansion of its logarithm. Such an approach has numerous advantages compared to existing ones. The estimation procedure is simple, fast and completely data-driven. We provide asymptotic properties such as consistency and asymptotic normality and show the efficiency of the procedure through simulation experiments and illustrate it with several datasets.

preprint2022arXiv

Adaptive lasso and Dantzig selector for spatial point processes intensity estimation

Lasso and Dantzig selector are standard procedures able to perform variable selection and estimation simultaneously. This paper is concerned with extending these procedures to spatial point process intensity estimation. We propose adaptive versions of these procedures, develop efficient computational methodologies and derive asymptotic results for a large class of spatial point processes under an original setting where the number of parameters, i.e. the number of spatial covariates considered, increases with the expected number of data points. Both procedures are compared theoretically, in a simulation study, and in a real data example.

preprint2021arXiv

Inference for possibly high-dimensional inhomogeneous Gibbs point processes

Gibbs point processes (GPPs) constitute a large and flexible class of spatial point processes with explicit dependence between the points. They can model attractive as well as repulsive point patterns. Feature selection procedures are an important topic in high-dimensional statistical modeling. In this paper, composite likelihood approach regularized with convex and non-convex penalty functions is proposed to handle statistical inference for possibly high-dimensional inhomogeneous GPPs. The composite likelihood incorporates both the pseudo-likelihood and the logistic composite likelihood. We particularly investigate the setting where the number of covariates diverges as the domain of observation increases. Under some conditions provided on the spatial GPP and on the penalty functions, we show that the oracle property, the consistency and the asymptotic normality hold. Our results also cover the low-dimensional case which fills a large gap in the literature. Through simulation experiments, we validate our theoretical results and finally, an application to a tropical forestry dataset illustrates the use of the proposed approach.

preprint2020arXiv

Digit analysis for Covid-19 reported data

The coronavirus which appeared in December 2019 in Wuhan has spread out worldwide and caused the death of more than 280,000 people (as of May, 11 2020). Since February 2020, doubts were raised about the numbers of confirmed cases and deaths reported by the Chinese government. In this paper, we examine data available from China at the city and provincial levels and we compare them with Canadian provincial data, US state data and French regional data. We consider cumulative and daily numbers of confirmed cases and deaths and examine these numbers through the lens of their first two digits and in particular we measure departures of these first two digits to the Newcomb-Benford distribution, often used to detect frauds. Our finding is that there is no evidence that cumulative and daily numbers of confirmed cases and deaths for all these countries have different first or second digit distributions. We also show that the Newcomb-Benford distribution cannot be rejected for these data.

preprint2020arXiv

Projections of determinantal point processes

Let $\mathbf x=\{x^{(1)},\dots,x^{(n)}\}$ be a space filling-design of $n$ points defined in $[0{,}1]^d$. In computer experiments, an important property seeked for $\mathbf x$ is a nice coverage of $[0{,}1]^d$. This property could be desirable as well as for any projection of $\mathbf x$ onto $[0{,}1]^ι$ for $ι<d$ . Thus we expect that $\mathbf x_I=\{x_I^{(1)},\dots,x_I^{(n)}\}$, which represents the design $\mathbf x$ with coordinates associated to any index set $I\subseteq\{1,\dots,d\}$, remains regular in $[0{,}1]^ι$ where $ι$ is the cardinality of $I$. This paper examines the conservation of nice coverage by projection using spatial point processes, and more specifically using the class of determinantal point processes. We provide necessary conditions on the kernel defining these processes, ensuring that the projected point process $\mathbf{X}_I$ is repulsive, in the sense that its pair correlation function is uniformly bounded by 1, for all $I\subseteq\{1,\dots,d\}$. We present a few examples, compare them using a new normalized version of Ripley's function. Finally, we illustrate the interest of this research for Monte-Carlo integration.

preprint2016arXiv

A tutorial on Palm distributions for spatial point processes

This tutorial provides an introduction to Palm distributions for spatial point processes. Initially, in the context of finite point processes , we give an explicit definition of Palm distributions in terms of their density functions. Then we review Palm distributions in the general case. Finally we discuss some examples of Palm distributions for specific models and some applications.

preprint2016arXiv

Standard and robust intensity parameter estimation for stationary determinantal point processes

This work is concerned with the estimation of the intensity parameter of a stationary determinantal point process. We consider the standard estimator, corresponding to the number of observed points per unit volume and a recently introduced median-based estimator more robust to outliers. The consistency and asymptotic normality of estimators are obtained under mild assumptions on the determinantal point process. We illustrate the efficiency of the procedures in a simulation study.

preprint2016arXiv

Towards optimal Takacs--Fiksel estimation

The Takacs--Fiksel method is a general approach to estimate the parameters of a spatial Gibbs point process. This method embraces standard procedures such as the pseudolikelihood and is defined via weight functions. In this paper we propose a general procedure to find weight functions which reduce the Godambe information and thus outperform pseudolikelihood in certain situations. The new procedure is applied to a standard dataset and to a recent neuroscience replicated point pattern dataset. Finally, the performance of the new procedure is investigated in a simulation study.

preprint2015arXiv

Median-based estimation of the intensity of a spatial point process

This paper is concerned with a robust estimator of the intensity of a stationary spatial point process. The estimator corresponds to the median of a jittered sample of the number of points, computed from a tessellation of the observation domain. We show that this median-based estimator satisfies a Bahadur representation from which we deduce its consistency and asymptotic normality under mild assumptions on the spatial point process. Through a simulation study, we compare the new estimator with the standard one counting the mean number of points per unit volume. The empirical study verifies the asymptotic properties established and shows that the median-based estimator is more robust to outliers than the standard estimator.

preprint2015arXiv

Parametric estimation of pairwise Gibbs point processes with infinite range interaction

This paper is concerned with statistical inference for infinite range interaction Gibbs point processes and in particular for the large class of Ruelle superstable and lower regular pairwise interaction models. We extend classical statistical methodologies such as the pseudolikelihood and the logistic regression methods, originally defined and studied for finite range models. Then we prove that the associated estimators are strongly consistent and satisfy a central limit theorem, provided the pairwise interaction function tends sufficiently fast to zero. To this end, we introduce a new central limit theorem for almost conditionally centered triangular arrays of random fields.

preprint2015arXiv

Stein estimation of the intensity of a spatial homogeneous Poisson point process

In this paper, we revisit the original ideas of Stein and propose an estimator of the intensity parameter of a homogeneous Poisson point process defined in $\R^d$ and observed in a bounded window. The procedure is based on a new general integration by parts formula for Poisson point processes. We show that our Stein estimator outperforms the maximum likelihood estimator in terms of mean squared error. In particular, we show that in many practical situations we have a gain larger than 30\%.

preprint2014arXiv

Variational approach for spatial point process intensity estimation

We introduce a new variational estimator for the intensity function of an inhomogeneous spatial point process with points in the $d$-dimensional Euclidean space and observed within a bounded region. The variational estimator applies in a simple and general setting when the intensity function is assumed to be of log-linear form $β+{θ}^{\top}z(u)$ where $z$ is a spatial covariate function and the focus is on estimating ${θ}$. The variational estimator is very simple to implement and quicker than alternative estimation procedures. We establish its strong consistency and asymptotic normality. We also discuss its finite-sample properties in comparison with the maximum first order composite likelihood estimator when considering various inhomogeneous spatial point process models and dimensions as well as settings were $z$ is completely or only partially known.

preprint2012arXiv

Basic properties of the Multivariate Fractional Brownian Motion

This paper reviews and extends some recent results on the multivariate fractional Brownian motion (mfBm) and its increment process. A characterization of the mfBm through its covariance function is obtained. Similarly, the correlation and spectral analyses of the increments are investigated. On the other hand we show that (almost) all mfBm's may be reached as the limit of partial sums of (super)linear processes. Finally, an algorithm to perfectly simulate the mfBm is presented and illustrated by some simulations.

preprint2011arXiv

Expectiles for subordinated Gaussian processes with applications

In this paper, we introduce a new class of estimators of the Hurst exponent of the fractional Brownian motion (fBm) process. These estimators are based on sample expectiles of discrete variations of a sample path of the fBm process. In order to derive the statistical properties of the proposed estimators, we establish asymptotic results for sample expectiles of subordinated stationary Gaussian processes with unit variance and correlation function satisfying $ρ(i)\sim κ|i|^{-α}$ ($κ\in \RR$) with $α>0$. Via a simulation study, we demonstrate the relevance of the expectile-based estimation method and show that the suggested estimators are more robust to data rounding than their sample quantile-based counterparts.

preprint2011arXiv

Geodesic Normal distribution on the circle

This paper is concerned with the study of a circular random distribution called geodesic Normal distribution recently proposed for general manifolds. This distribution, parameterized by two real numbers associated to some specific location and dispersion concepts, looks like a standard Gaussian on the real line except that the support of this variable is $[0,2π)$ and that the Euclidean distance is replaced by the geodesic distance on the circle. Some properties are studied and comparisons with the von Mises distribution in terms of intrinsic and extrinsic means and variances are provided. Finally, the problem of estimating the parameters through the maximum likelihood method is investigated and illustrated with some simulations.

preprint2011arXiv

Identification of the Multivariate Fractional Brownian Motion

This paper deals with the identification of the multivariate fractional Brownian motion, a recently developed extension of the fractional Brownian motion to the multivariate case. This process is a $p$-multivariate self-similar Gaussian process parameterized by $p$ different Hurst exponents $H_i$, $p$ scaling coefficients $σ_i$ (of each component) and also by $p(p-1)$ coefficients $ρ_{ij},η_{ij}$ (for $i,j=1,...,p$ with $j>i$) allowing two components to be more or less strongly correlated and allowing the process to be time reversible or not. We investigate the use of discrete filtering techniques to estimate jointly or separately the different parameters and prove the efficiency of the methodology with a simulation study and the derivation of asymptotic results.

preprint2011arXiv

Takacs Fiksel method for stationary marked Gibbs point processes

This paper studies a method to estimate the parameters governing the distribution of a stationary marked Gibbs point process. This procedure, known as the Takacs-Fiksel method, is based on the estimation of the left and right hand sides of the Georgii-Nguyen-Zessin formula and leads to a family of estimators due to the possible choices of test functions. We propose several examples illustrating the interest and flexibility of this procedure. We also provide sufficient conditions based on the model and the test functions to derive asymptotic properties (consistency and asymptotic normality) of the resulting estimator. The different assumptions are discussed for exponential family models and for a large class of test functions. A short simulation study is proposed to assess the correctness of the methodology and the asymptotic results.

preprint2011arXiv

Wavelet analysis of the multivariate fractional Brownian motion

The work developed in the paper concerns the multivariate fractional Brownian motion (mfBm) viewed through the lens of the wavelet transform. After recalling some basic properties on the mfBm, we calculate the correlation structure of its wavelet transform. We particularly study the asymptotic behavior of the correlation, showing that if the analyzing wavelet has a sufficient number of null first order moments, the decomposition eliminates any possible long-range (inter)dependence. The cross-spectral density is also considered in a second part. Its existence is proved and its evaluation is performed using a von Bahr-Essen like representation of the function $\sign(t) |t|^α$. The behavior of the cross-spectral density of the wavelet field at the zero frequency is also developed and confirms the results provided by the asymptotic analysis of the correlation.

preprint2010arXiv

Asymptotic properties of the maximum pseudo-likelihood estimator for stationary Gibbs point processes including the Lennard-Jones model

This paper presents asymptotic properties of the maximum pseudo-likelihood estimator of a vector $\Vectθ$ parameterizing a stationary Gibbs point process. Sufficient conditions, expressed in terms of the local energy function defining a Gibbs point process, to establish strong consistency and asymptotic normality results of this estimator depending on a single realization, are presented.These results are general enough to no longer require the local stability and the linearity in terms of the parameters of the local energy function. We consider characteristic examples of such models, the Lennard-Jones and the finite range Lennard-Jones models. We show that the different assumptions ensuring the consistency are satisfied for both models whereas the assumptions ensuring the asymptotic normality are fulfilled only for the finite range Lennard-Jones model.

preprint2010arXiv

Confidence intervals for the Hurst parameter of a fractional Brownian motion based on finite sample size

In this paper, we show how concentration inequalities for Gaussian quadratic form can be used to propose exact confidence intervals of the Hurst index parametrizing a fractional Brownian motion. Both cases where the scaling parameter of the fractional Brownian motion is known or unknown are investigated. These intervals are obtained by observing a single discretized sample path of a fractional Brownian motion and without any assumption on the parameter $H$.

preprint2010arXiv

Residuals and goodness-of-fit tests for stationary marked Gibbs point processes

The inspection of residuals is a fundamental step to investigate the quality of adjustment of a parametric model to data. For spatial point processes, the concept of residuals has been recently proposed by Baddeley et al. (2005) as an empirical counterpart of the {\it Campbell equilibrium} equation for marked Gibbs point processes. The present paper focuses on stationary marked Gibbs point processes and deals with asymptotic properties of residuals for such processes. In particular, the consistency and the asymptotic normality are obtained for a wide class of residuals including the classical ones (raw residuals, inverse residuals, Pearson residuals). Based on these asymptotic results, we define goodness-of-fit tests with Type-I error theoretically controlled. One of these tests constitutes an extension of the quadrat counting test widely used to test the null hypothesis of a homogeneous Poisson point process.

preprint2006arXiv

Maximum pseudo-likelihood estimator for nearest-neighbours Gibbs point processes

This paper is devoted to the estimation of a vector parametrizing an energy function associated to some "Nearest-Neighbours" Gibbs point process, via the pseudo-likelihood method. We present some convergence results concerning this estimator, that is strong consistency and asymptotic normality, when only a single realization is observed. Sufficient conditions are expressed in terms of the local energy function and are verified on some examples.

preprint2006arXiv

Normalized information-based divergences

This paper is devoted to the mathematical study of some divergences based on the mutual information well-suited to categorical random vectors. These divergences are generalizations of the "entropy distance" and "information distance". Their main characteristic is that they combine a complexity term and the mutual information. We then introduce the notion of (normalized) information-based divergence, propose several examples and discuss their mathematical properties in particular in some prediction framework.