Researcher profile

Frédéric Ouimet

Frédéric Ouimet contributes to research discovery and scholarly infrastructure.

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
23 works
0 followers
7 topics
4 close collaborators

Published work

23 published item(s)

preprint, 2026, arXiv

Stein's method for the matrix normal distribution

This work presents the first systematic development of Stein's method for matrix distributions. We establish the essential ingredients of Stein's method for matrix normal approximation: we derive a generator-based Stein identity from a matrix Ornstein-Uhlenbeck diffusion with two-sided scales, provide an explicit semigroup representation for the solution of the Stein equation, and obtain regularity estimates for the solution. The new methodology is illustrated with three statistical applications: smooth Wasserstein distance bounds to quantify the matrix central limit theorem, a Wasserstein distance bound for the matrix normal approximation of the centered matrix $T$ distribution, and the derivation of Stein's method-of-moments estimators for scale parameters of the matrix normal distribution.

preprint, 2022, arXiv

A comprehensive empirical power comparison of univariate goodness-of-fit tests for the Laplace distribution

In this paper we present the results from an empirical power comparison of 40 goodness-of-fit tests for the univariate Laplace distribution, carried out using Monte Carlo simulations with sample sizes $n = 20, 50, 100, 200$, significance levels $α = 0.01, 0.05, 0.10$, and 400 alternatives consisting of asymmetric and symmetric light/heavy-tailed distributions taken as special cases from 11 models. In addition to the unmatched scope of our study, an interesting contribution is the proposal of an innovative design for the selection of alternatives. The 400 alternatives consist of 20 specific cases of 20 submodels drawn from the main 11 models. For each submodel, the 20 specific cases correspond to parameter values chosen to cover the full power range. An analysis of the results leads to a recommendation of the best tests for five different groupings of the alternative distributions. A real-data example is also presented, where an appropriate test for the goodness-of-fit of the univariate Laplace distribution is applied to weekly log-returns of Amazon stock over a recent four-year period.

preprint, 2022, arXiv

A multivariate normal approximation for the Dirichlet density and some applications

In this short note, we prove an asymptotic expansion for the ratio of the Dirichlet density to the multivariate normal density with the same mean and covariance matrix. The expansion is then used to derive an upper bound on the total variation between the corresponding probability measures and rederive the asymptotic variance of the Dirichlet kernel estimators introduced by Aitchison & Lauder (1985) and studied theoretically in Ouimet (2020). Another potential application related to the asymptotic equivalence between the Gaussian variance regression problem and the Gaussian white noise problem is briefly mentioned but left open for future research.

preprint, 2022, arXiv

Refined normal approximations for the central and noncentral chi-square distributions and some applications

In this paper, we prove a local limit theorem for the chi-square distribution with $r > 0$ degrees of freedom and noncentrality parameter $λ\geq 0$. We use it to develop refined normal approximations for the survival function. Our maximal errors go down to an order of $r^{-2}$, which is significantly smaller than the maximal error bounds of order $r^{-1/2}$ recently found by Horgan & Murphy (2013) and Seri (2015). Our results allow us to drastically reduce the number of observations required to obtain negligible errors in the energy detection problem, from $250$, as recommended in the seminal work of Urkowitz (1967), to only $8$ here with our new approximations. We also obtain an upper bound on several probability metrics between the central and noncentral chi-square distributions and the standard normal distribution, and we obtain an approximation for the median that improves the lower bound previously obtained by Robert (1990).
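For orientation, the baseline that such refinements improve on can be sketched as follows. This is not the refined approximation from the paper: it only compares the exact central chi-square survival function (for even $r$, where a closed-form Poisson sum is available) with the plain CLT-level approximation $\chi^2_r \approx N(r, 2r)$. The function names and the grid used to scan for the maximal error are assumptions made for illustration.

```python
import math


def chi2_survival_even(x: float, r: int) -> float:
    """Exact P(chi2_r > x) for even r, using the Poisson-sum identity
    P(chi2_{2k} > x) = sum_{j=0}^{k-1} exp(-x/2) (x/2)^j / j!."""
    if r <= 0 or r % 2 != 0:
        raise ValueError("this closed form needs a positive even r")
    if x <= 0:
        return 1.0
    half = x / 2.0
    term = math.exp(-half)  # j = 0 term
    total = term
    for j in range(1, r // 2):
        term *= half / j  # builds (x/2)^j / j! incrementally
        total += term
    return total


def normal_survival_approx(x: float, r: int) -> float:
    """Baseline approximation: treat chi2_r as N(r, 2r)."""
    z = (x - r) / math.sqrt(2.0 * r)
    return 0.5 * math.erfc(z / math.sqrt(2.0))


def max_error(r: int, grid: int = 2000) -> float:
    """Largest absolute gap between exact and approximate survival
    functions on an evenly spaced grid (hypothetical scan range)."""
    upper = r + 10.0 * math.sqrt(2.0 * r)
    worst = 0.0
    for i in range(grid + 1):
        x = upper * i / grid
        gap = abs(chi2_survival_even(x, r) - normal_survival_approx(x, r))
        worst = max(worst, gap)
    return worst


if __name__ == "__main__":
    for r in (8, 50, 200):
        print(r, max_error(r))
```

The printed gaps shrink slowly as $r$ grows, which is the behaviour that higher-order corrections are meant to improve.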

preprint, 2022, arXiv

Refined normal approximations for the Student distribution

In this paper, we develop a local limit theorem for the Student distribution. We use it to improve the normal approximation of the Student survival function given in Shafiei & Saberali (2015) and to derive asymptotic bounds for the corresponding maximal errors at four levels of approximation. As a corollary, approximations for the percentage points (or quantiles) of the Student distribution are obtained in terms of the percentage points of the standard normal distribution.

preprint, 2021, arXiv

A precise local limit theorem for the multinomial distribution and some applications

In Siotani & Fujikoshi (1984), a precise local limit theorem for the multinomial distribution is derived by inverting the Fourier transform, where the error terms are explicit up to order $N^{-1}$. In this paper, we give an alternative (conceptually simpler) proof based on Stirling's formula and a careful handling of Taylor expansions, and we show how the result can be used to approximate multinomial probabilities on most subsets of $\mathbb{R}^d$. Furthermore, we discuss a recent application of the result to obtain asymptotic properties of Bernstein estimators on the simplex, we improve the main result in Carter (2002) on the Le Cam distance bound between multinomial and multivariate normal experiments while simultaneously simplifying the proof, and we mention another potential application related to finely tuned continuity corrections.

preprint, 2021, arXiv

A symmetric matrix-variate normal local approximation for the Wishart distribution and some applications

The noncentral Wishart distribution has become more mainstream in statistics as the prevalence of applications involving sample covariances with underlying multivariate Gaussian populations has dramatically increased since the advent of computers. Multiple sources in the literature deal with local approximations of the noncentral Wishart distribution with respect to its central counterpart. However, no source has yet developed explicit local approximations for the (central) Wishart distribution in terms of a normal analogue, which is important since Gaussian distributions are at the heart of the asymptotic theory for many statistical methods. In this paper, we prove a precise asymptotic expansion for the ratio of the Wishart density to the symmetric matrix-variate normal density with the same mean and covariances. The result is then used to derive an upper bound on the total variation between the corresponding probability measures and to find the pointwise variance of a new density estimator on the space of positive definite matrices with a Wishart asymmetric kernel. For the sake of completeness, we also find expressions for the pointwise bias of our new estimator, the pointwise variance as we move towards the boundary of its support, the mean squared error, the mean integrated squared error away from the boundary, and we prove its asymptotic normality.

preprint, 2021, arXiv

Asymptotic properties of Bernstein estimators on the simplex

Bernstein estimators are well known to avoid the boundary bias problem of traditional kernel estimators. The theoretical properties of these estimators have been studied extensively on compact intervals and hypercubes, but never on the simplex, except for the mean squared error of the density estimator in Tenbusch (1994) when $d = 2$. The simplex is an important case as it is the natural domain of compositional data. In this paper, we prove several asymptotic results (bias, variance, mean squared error (MSE), mean integrated squared error (MISE), asymptotic normality, uniform strong consistency) for Bernstein estimators of cumulative distribution functions and density functions on the $d$-dimensional simplex. Our results generalize the ones in Leblanc (2012) and Babu et al. (2002), who treated the case $d = 1$, and significantly extend those found in Tenbusch (1994). In particular, our rates of convergence for the MSE and MISE are optimal.
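To make the object concrete, here is a minimal sketch of the Bernstein c.d.f. estimator in the one-dimensional case $d = 1$ on $[0,1]$, which smooths the empirical c.d.f. $F_n$ with binomial weights: $\hat F_{n,m}(x) = \sum_{k=0}^m F_n(k/m) \binom{m}{k} x^k (1-x)^{m-k}$. The simplex version studied in the paper replaces the binomial weights with multinomial ones; the function names and the sample below are assumptions for illustration only.

```python
import math
from bisect import bisect_right


def empirical_cdf(sample):
    """Return the empirical c.d.f. F_n of the sample as a function."""
    data = sorted(sample)
    n = len(data)

    def F_n(t):
        return bisect_right(data, t) / n

    return F_n


def bernstein_cdf(sample, m):
    """Bernstein c.d.f. estimator of degree m for data in [0, 1]."""
    F_n = empirical_cdf(sample)
    # F_n evaluated on the grid k/m, k = 0, ..., m.
    weights = [F_n(k / m) for k in range(m + 1)]

    def F_hat(x):
        if not 0.0 <= x <= 1.0:
            raise ValueError("x must lie in [0, 1]")
        return sum(
            w * math.comb(m, k) * x**k * (1.0 - x) ** (m - k)
            for k, w in enumerate(weights)
        )

    return F_hat


if __name__ == "__main__":
    sample = [0.1, 0.4, 0.4, 0.7, 0.9]  # hypothetical data
    F_hat = bernstein_cdf(sample, m=10)
    for x in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(x, F_hat(x))
```

Because the coefficients $F_n(k/m)$ are nondecreasing in $k$, the resulting polynomial is itself a nondecreasing function on $[0,1]$, so the estimate is a genuine c.d.f. when the data lie in $(0,1]$.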

preprint, 2021, arXiv

Counterexamples to the classical central limit theorem for triplewise independent random variables having a common arbitrary margin

We present a general methodology to construct triplewise independent sequences of random variables having a common but arbitrary marginal distribution $F$ (satisfying very mild conditions). For two specific sequences, we obtain in closed form the asymptotic distribution of the sample mean. It is non-Gaussian (and depends on the specific choice of $F$). This allows us to illustrate the extent of the 'failure' of the classical central limit theorem (CLT) under triplewise independence. Our methodology is simple and can also be used to create, for any integer $K$, new $K$-tuplewise independent sequences that are not mutually independent. For $K \geq 4$, it appears that the sequences created using our methodology do verify a CLT, and we explain heuristically why this is the case.

preprint, 2021, arXiv

General formulas for the central and non-central moments of the multinomial distribution

We present the first general formulas for the central and non-central moments of the multinomial distribution, using a combinatorial argument and the factorial moments previously obtained in Mosimann (1962). We use the formulas to give explicit expressions for all the non-central moments up to order 8 and all the central moments up to order 4. These results expand significantly on those in Newcomer (2008) and Newcomer et al. (2008), where the non-central moments were calculated up to order 4.
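Closed-form moment formulas of this kind can be sanity-checked by brute force for small $N$. The sketch below does not implement the paper's general formulas; it enumerates every outcome of a small multinomial vector and sums directly, so the result can be compared with well-known low-order identities such as $E[X_i] = N p_i$, $\mathrm{Var}(X_i) = N p_i (1 - p_i)$ and $\mathrm{Cov}(X_i, X_j) = -N p_i p_j$. All function names are hypothetical.

```python
import math


def multinomial_pmf(counts, probs):
    """P(X = counts) for a multinomial vector with N = sum(counts)."""
    coef = math.factorial(sum(counts))
    for c in counts:
        coef //= math.factorial(c)  # stays an integer at every step
    value = float(coef)
    for c, p in zip(counts, probs):
        value *= p**c
    return value


def outcomes(n, d):
    """Yield all d-tuples of nonnegative integers summing to n."""
    if d == 1:
        yield (n,)
        return
    for first in range(n + 1):
        for rest in outcomes(n - first, d - 1):
            yield (first,) + rest


def moment(n, probs, powers, central=False):
    """E[prod_i (X_i - c_i)^{powers_i}] by exhaustive enumeration,
    with c_i = n * p_i if central is True and c_i = 0 otherwise."""
    centers = [n * p if central else 0.0 for p in probs]
    total = 0.0
    for counts in outcomes(n, len(probs)):
        term = multinomial_pmf(counts, probs)
        for c, mu, a in zip(counts, centers, powers):
            term *= (c - mu) ** a
        total += term
    return total


if __name__ == "__main__":
    n, probs = 6, (0.2, 0.3, 0.5)
    print(moment(n, probs, (1, 0, 0)))                # mean of X_1
    print(moment(n, probs, (2, 0, 0), central=True))  # variance of X_1
    print(moment(n, probs, (1, 1, 0), central=True))  # Cov(X_1, X_2)
```

The enumeration cost grows quickly with $N$ and the dimension, which is one reason explicit formulas are useful in practice.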

preprint, 2021, arXiv

Moments of the Riemann zeta function on short intervals of the critical line

We show that as $T\to \infty$, for all $t\in [T,2T]$ outside of a set of measure $\mathrm{o}(T)$, $$ \int_{-(\log T)^θ}^{(\log T)^θ} |ζ(\tfrac 12 + \mathrm{i} t + \mathrm{i} h)|^β \mathrm{d} h = (\log T)^{f_θ(β) + \mathrm{o}(1)}, $$ for some explicit exponent $f_θ(β)$, where $θ> -1$ and $β> 0$. This proves an extended version of a conjecture of Fyodorov and Keating (2014). In particular, it shows that, for all $θ> -1$, the moments exhibit a phase transition at a critical exponent $β_c(θ)$, below which $f_θ(β)$ is quadratic and above which $f_θ(β)$ is linear. The form of the exponent $f_θ$ also differs between mesoscopic intervals ($-1<θ<0$) and macroscopic intervals ($θ>0$), a phenomenon that stems from an approximate tree structure for the correlations of zeta. We also prove that, for all $t\in [T,2T]$ outside a set of measure $\mathrm{o}(T)$, $$ \max_{|h| \leq (\log T)^θ} |ζ(\tfrac{1}{2} + \mathrm{i} t + \mathrm{i} h)| = (\log T)^{m(θ) + \mathrm{o}(1)}, $$ for some explicit $m(θ)$. This generalizes earlier results of Najnudel (2018) and Arguin et al. (2019) for $θ= 0$. The proofs are unconditional, except for the upper bounds when $θ> 3$, where the Riemann hypothesis is assumed.

preprint, 2021, arXiv

On the Le Cam distance between multivariate hypergeometric and multivariate normal experiments

In this short note, we develop a local approximation for the log-ratio of the multivariate hypergeometric probability mass function over the corresponding multinomial probability mass function. In conjunction with the bounds from Carter (2002) and Ouimet (2021) on the total variation between the law of a multinomial vector jittered by a uniform on $(-1/2,1/2)^d$ and the law of the corresponding multivariate normal distribution, the local expansion for the log-ratio is then used to obtain a total variation bound between the law of a multivariate hypergeometric random vector jittered by a uniform on $(-1/2,1/2)^d$ and the law of the corresponding multivariate normal distribution. As a corollary, we find an upper bound on the Le Cam distance between multivariate hypergeometric and multivariate normal experiments.

preprint, 2021, arXiv

On the Le Cam distance between Poisson and Gaussian experiments and the asymptotic properties of Szasz estimators

In this paper, we prove a local limit theorem for the ratio of the Poisson distribution to the Gaussian distribution with the same mean and variance, using only elementary methods (Taylor expansions and Stirling's formula). We then apply the result to derive an upper bound on the Le Cam distance between Poisson and Gaussian experiments, which gives a complete proof of the sketch provided in the unpublished set of lecture notes by Pollard (2010), who uses a different approach. We also use the local limit theorem to derive the asymptotics of the variance for Bernstein c.d.f. and density estimators with Poisson weights on the positive half-line (also called Szasz estimators). The propagation of errors in the literature due to the incorrect estimate in Lemma 2 (iv) of Leblanc (2012) is addressed in the Appendix.
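The quantity under study can be inspected numerically. The sketch below, which is only an illustration and not the expansion proved in the paper, evaluates the ratio of the Poisson($λ$) probability mass function at $k$ to the $N(λ, λ)$ density at the same point, working on the log scale for stability. Function names are assumptions.

```python
import math


def log_poisson_pmf(k: int, lam: float) -> float:
    """log P(K = k) for K ~ Poisson(lam)."""
    return -lam + k * math.log(lam) - math.lgamma(k + 1)


def log_normal_pdf(x: float, mean: float, var: float) -> float:
    """Log-density of N(mean, var) at x."""
    return -0.5 * math.log(2.0 * math.pi * var) - (x - mean) ** 2 / (2.0 * var)


def density_ratio(k: int, lam: float) -> float:
    """Poisson(lam) pmf at k divided by the N(lam, lam) density at k."""
    return math.exp(log_poisson_pmf(k, lam) - log_normal_pdf(k, lam, lam))


if __name__ == "__main__":
    for lam in (10.0, 100.0, 10000.0):
        k = int(lam)
        # One standard deviation above the mean, rounded to an integer.
        k_off = k + int(round(math.sqrt(lam)))
        print(lam, density_ratio(k, lam), density_ratio(k_off, lam))
```

At the mean, Stirling's formula suggests the ratio behaves like $1 - 1/(12λ)$ for large $λ$, so it approaches $1$ as $λ$ grows; away from the mean the deviation from $1$ is larger, which is what a local limit theorem quantifies.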

preprint, 2020, arXiv

A counterexample to the central limit theorem for pairwise independent random variables having a common arbitrary margin

The Central Limit Theorem (CLT) is one of the most fundamental results in statistics. It states that the standardized sample mean of a sequence of $n$ mutually independent and identically distributed random variables with finite first and second moments converges in distribution to a standard Gaussian as $n$ goes to infinity. In particular, pairwise independence of the sequence is generally not sufficient for the theorem to hold. We construct explicitly a sequence of pairwise independent random variables having a common but arbitrary marginal distribution $F$ (satisfying very mild conditions) for which the CLT is not verified. We study the extent of this 'failure' of the CLT by obtaining, in closed form, the asymptotic distribution of the sample mean of our sequence. This is illustrated through several theoretical examples, for which we provide associated computing codes in the R language.

preprint, 2020, arXiv

A study of seven asymmetric kernels for the estimation of cumulative distribution functions

In Mombeni et al. (2019), Birnbaum-Saunders and Weibull kernel estimators were introduced for the estimation of cumulative distribution functions (c.d.f.s) supported on the half-line $[0,\infty)$. They were the first authors to use asymmetric kernels in the context of c.d.f. estimation. Their estimators were shown to perform better numerically than traditional methods such as the basic kernel method and the boundary modified version from Tenreiro (2013). In the present paper, we complement their study by introducing five new asymmetric kernel c.d.f. estimators, namely the Gamma, inverse Gamma, lognormal, inverse Gaussian and reciprocal inverse Gaussian kernel c.d.f. estimators. For these five new estimators, we prove the asymptotic normality and we find asymptotic expressions for the following quantities: bias, variance, mean squared error and mean integrated squared error. A numerical study then compares the performance of the five new c.d.f. estimators against traditional methods and the Birnbaum-Saunders and Weibull kernel c.d.f. estimators from Mombeni et al. (2019). By using the same experimental design, we show that the lognormal and Birnbaum-Saunders kernel c.d.f. estimators perform the best overall, while the other asymmetric kernel estimators are sometimes better but always at least competitive against the boundary kernel method.

preprint, 2019, arXiv

Large deviations and continuity estimates for the derivative of a random model of $\log |ζ|$ on the critical line

In this paper, we study the random field \begin{equation*} X(h) \circeq \sum_{p \leq T} \frac{\text{Re}(U_p \, p^{-i h})}{p^{1/2}}, \quad h\in [0,1], \end{equation*} where $(U_p, \, p ~\text{primes})$ is an i.i.d. sequence of uniform random variables on the unit circle in $\mathbb{C}$. Harper (2013) showed that $(X(h), \, h\in (0,1))$ is a good model for the large values of $(\log |ζ(\frac{1}{2} + i (T + h))|, \, h\in [0,1])$ when $T$ is large, if we assume the Riemann hypothesis. The asymptotics of the maximum were found in Arguin, Belius & Harper (2017) up to the second order, but the tightness of the recentered maximum is still an open problem. As a first step, we provide large deviation estimates and continuity estimates for the field's derivative $X'(h)$. The main result shows that, with probability arbitrarily close to $1$, \begin{equation*} \max_{h\in [0,1]} X(h) - \max_{h\in \mathcal{S}} X(h) = O(1), \end{equation*} where $\mathcal{S}$ is a discrete set containing $O(\log T \sqrt{\log \log T})$ points.

preprint, 2018, arXiv

A uniform $L^1$ law of large numbers for functions of i.i.d. random variables that are translated by a consistent estimator

We develop a new $L^1$ law of large numbers where the $i$-th summand is given by a function $h(\cdot)$ evaluated at $X_i - θ_n$, and where $θ_n \circeq θ_n(X_1,X_2,\ldots,X_n)$ is an estimator converging in probability to some parameter $θ\in \mathbb{R}$. Under broad technical conditions, the convergence is shown to hold uniformly in the set of estimators interpolating between $θ$ and another consistent estimator $θ_n^{\star}$. Our main contribution is the treatment of the case where $|h|$ blows up at $0$, which is not covered by standard uniform laws of large numbers.

preprint, 2018, arXiv

Complete monotonicity of multinomial probabilities and its application to Bernstein estimators on the simplex

Let $d\in \mathbb{N}$ and let $γ_i\in [0,\infty)$, $x_i\in (0,1)$ be such that $\sum_{i=1}^{d+1} γ_i = M\in (0,\infty)$ and $\sum_{i=1}^{d+1} x_i = 1$. We prove that \begin{equation*} a \mapsto \frac{Γ(aM + 1)}{\prod_{i=1}^{d+1} Γ(a γ_i + 1)} \prod_{i=1}^{d+1} x_i^{aγ_i} \end{equation*} is completely monotonic on $(0,\infty)$. This result generalizes the one found by Alzer (2018) for binomial probabilities ($d=1$). As a consequence of the log-convexity, we obtain some combinatorial inequalities for multinomial coefficients. We also show how the main result can be used to derive asymptotic formulas for quantities of interest in the context of statistical density estimation based on Bernstein polynomials on the $d$-dimensional simplex.
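A numerical check can make the statement tangible, though it proves nothing. A completely monotonic function $f$ satisfies $(-1)^k Δ_h^k f(a) \geq 0$ for all forward differences of every order $k$, so the sketch below evaluates the displayed function through log-gamma and tests this sign pattern on a few points. The function names, the step $h$, the maximal order and the test points are all assumptions chosen for illustration.

```python
import math


def multinomial_weight(a, gammas, xs):
    """Evaluate Gamma(aM + 1) / prod Gamma(a g_i + 1) * prod x_i^(a g_i),
    where M = sum(gammas), using log-gamma for numerical stability."""
    M = sum(gammas)
    log_val = math.lgamma(a * M + 1.0)
    for g, x in zip(gammas, xs):
        log_val += a * g * math.log(x) - math.lgamma(a * g + 1.0)
    return math.exp(log_val)


def forward_difference(f, a, h, order):
    """k-th forward difference of f at a with step h."""
    return sum(
        (-1) ** (order - j) * math.comb(order, j) * f(a + j * h)
        for j in range(order + 1)
    )


def alternating_signs_hold(gammas, xs, a_values, h=0.5, max_order=4):
    """Check (-1)^k * Delta_h^k f(a) >= 0 for k = 0, ..., max_order.
    This is a necessary condition for complete monotonicity only."""

    def f(a):
        return multinomial_weight(a, gammas, xs)

    return all(
        (-1) ** k * forward_difference(f, a, h, k) >= 0.0
        for a in a_values
        for k in range(max_order + 1)
    )


if __name__ == "__main__":
    points = [0.5, 1.0, 2.0, 5.0]
    print(alternating_signs_hold((1.0, 1.0), (0.5, 0.5), points))
    print(alternating_signs_hold((1.0, 2.0, 3.0), (0.2, 0.3, 0.5), points))
```

With $γ = (1, 1)$ and $x = (1/2, 1/2)$ the function reduces to $\binom{2a}{a} 4^{-a}$ at integer $a$, the central binomial probability, which gives a convenient value to check by hand.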

preprint, 2018, arXiv

Maxima of branching random walks with piecewise constant variance

This article extends the results of Fang & Zeitouni (2012a) on branching random walks (BRWs) with Gaussian increments in time inhomogeneous environments. We treat the case where the variance of the increments changes a finite number of times at different scales in $[0,1]$ under a slight restriction. We find the asymptotics of the maximum up to an $O_P(1)$ error and show how the profile of the variance influences the leading order and the logarithmic correction term. A more general result was independently obtained by Mallein (2015b) when the law of the increments is not necessarily Gaussian. However, the proof we present here generalizes the approach of Fang & Zeitouni (2012a) instead of using the spinal decomposition of the BRW. As such, the proof is easier to understand and more robust in the presence of an approximate branching structure.

preprint, 2018, arXiv

Poisson-Dirichlet statistics for the extremes of a randomized Riemann zeta function

In Arguin & Tai (2018), the authors prove the convergence of the two-overlap distribution at low temperature for a randomized Riemann zeta function on the critical line. We extend their results to prove the Ghirlanda-Guerra identities. As a consequence, we find the joint law of the overlaps under the limiting mean Gibbs measure in terms of Poisson-Dirichlet variables. It is expected that we can adapt the approach to prove the same result for the Riemann zeta function itself.

preprint, 2017, arXiv

Geometry of the Gibbs measure for the discrete 2D Gaussian free field with scale-dependent variance

We continue our study of the scale-inhomogeneous Gaussian free field introduced in Arguin and Ouimet (2016). Firstly, we compute the limiting free energy on $V_N$ and adapt a technique of Bovier and Kurkova (2004b) to determine the limiting two-overlap distribution. The adaptation was already successfully applied in the simpler case of Arguin and Zindy (2015), where the limiting free energy was computed for the field with two levels (in the center of $V_N$) and the limiting two-overlap distribution was determined in the homogeneous case. Our results agree with the analogous quantities for the Generalized Random Energy Model (GREM); see Capocaccia et al. (1987) and Bovier and Kurkova (2004a), respectively. Secondly, we show that the extended Ghirlanda-Guerra identities hold exactly in the limit. As a corollary, the limiting array of overlaps is ultrametric and the limiting Gibbs measure has the same law as a Ruelle probability cascade.

preprint, 2016, arXiv

Extremes of the two-dimensional Gaussian free field with scale-dependent variance

In this paper, we study a random field constructed from the two-dimensional Gaussian free field (GFF) by modifying the variance along the scales in the neighborhood of each point. The construction can be seen as a local martingale transform and is akin to the time-inhomogeneous branching random walk. In the case where the variance takes finitely many values, we compute the first order of the maximum and the log-number of high points. These quantities were obtained by Bolthausen, Deuschel and Giacomin (2001) and Daviaud (2006) when the variance is constant on all scales. The proof relies on a truncated second moment method proposed by Kistler (2015), which streamlines the proof of the previous results. We also discuss possible extensions of the construction to the continuous GFF.