Researcher profile

Patrice Abry

Patrice Abry contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2024arXiv

On the empirical spectral distribution of large wavelet random matrices based on mixed-Gaussian fractional measurements in moderately high dimensions

In this paper, we characterize the convergence of the (rescaled logarithmic) empirical spectral distribution of wavelet random matrices. We assume a moderately high-dimensional framework where the sample size $n$, the dimension $p(n)$ and, for a fixed integer $j$, the scale $a(n)2^j$ go to infinity in such a way that $\lim_{n \rightarrow \infty}p(n)\cdot a(n)/n = \lim_{n \rightarrow \infty} o(\sqrt{a(n)/n})= 0$. We suppose the underlying measurement process is a random scrambling of a sample of size $n$ of a growing number $p(n)$ of fractional processes. Each of the latter processes is a fractional Brownian motion conditionally on a randomly chosen Hurst exponent. We show that the (rescaled logarithmic) empirical spectral distribution of the wavelet random matrices converges weakly, in probability, to the distribution of Hurst exponents.

preprint2022arXiv

Deep Learning-based Extreme Heatwave Forecast

Because of the impact of extreme heat waves and heat domes on society and biodiversity, their study is a key challenge. We specifically study long-lasting extreme heat waves, which are among the most important for climate impacts. Physics driven weather forecast systems or climate models can be used to forecast their occurrence or predict their probability. The present work explores the use of deep learning architectures, trained using outputs of a climate model, as an alternative strategy to forecast the occurrence of extreme long-lasting heatwaves. This new approach will be useful for several key scientific goals which include the study of climate model statistics, building a quantitative proxy for resampling rare events in climate models, study the impact of climate change, and should eventually be useful for forecasting. Fulfilling these important goals implies addressing issues such as class-size imbalance that is intrinsically associated with rare event prediction, assessing the potential benefits of transfer learning to address the nested nature of extreme events (naturally included in less extreme ones). We train a Convolutional Neural Network, using 1000 years of climate model outputs, with large-class undersampling and transfer learning. From the observed snapshots of the surface temperature and the 500 hPa geopotential height fields, the trained network achieves significant performance in forecasting the occurrence of long-lasting extreme heatwaves. We are able to predict them at three different levels of intensity, and as early as 15 days ahead of the start of the event (30 days ahead of the end of the event).

preprint2022arXiv

Temporal evolution of the Covid19 pandemic reproduction number: Estimations from proximal optimization to Monte Carlo sampling

Monitoring the evolution of the Covid19 pandemic constitutes a critical step in sanitary policy design. Yet, the assessment of the pandemic intensity within the pandemic period remains a challenging task because of the limited quality of data made available by public health authorities (missing data, outliers and pseudoseasonalities, notably), that calls for cumbersome and ad-hoc preprocessing (denoising) prior to estimation. Recently, the estimation of the reproduction number, a measure of the pandemic intensity, was formulated as an inverse problem, combining data-model fidelity and space-time regularity constraints, solved by nonsmooth convex proximal minimizations. Though promising, that formulation lacks robustness against the limited quality of the Covid19 data and confidence assessment. The present work aims to address both limitations: First, it discusses solutions to produce a robust assessment of the pandemic intensity by accounting for the low quality of the data directly within the inverse problem formulation. Second, exploiting a Bayesian interpretation of the inverse problem formulation, it devises a Monte Carlo sampling strategy, tailored to a nonsmooth log-concave a posteriori distribution, to produce relevant credibility intervalbased estimates for the Covid19 reproduction number. Clinical relevance Applied to daily counts of new infections made publicly available by the Health Authorities for around 200 countries, the proposed procedures permit robust assessments of the time evolution of the Covid19 pandemic intensity, updated automatically and on a daily basis.

preprint2022arXiv

Wavelet eigenvalue regression in high dimensions

In this paper, we construct the wavelet eigenvalue regression methodology in high dimensions. We assume that possibly non-Gaussian, finite-variance $p$-variate measurements are made of a low-dimensional $r$-variate ($r \ll p$) fractional stochastic process with non-canonical scaling coordinates and in the presence of additive high-dimensional noise. The measurements are correlated both time-wise and between rows. Building upon the asymptotic and large scale properties of wavelet random matrices in high dimensions, the wavelet eigenvalue regression is shown to be consistent and, under additional assumptions, asymptotically Gaussian in the estimation of the fractal structure of the system. We further construct a consistent estimator of the effective dimension $r$ of the system that significantly increases the robustness of the methodology. The estimation performance over finite samples is studied by means of simulations.

preprint2021arXiv

Nonsmooth convex optimization to estimate the Covid-19 reproduction number space-time evolution with robustness against low quality data

Daily pandemic surveillance, often achieved through the estimation of the reproduction number, constitutes a critical challenge for national health authorities to design countermeasures. In an earlier work, we proposed to formulate the estimation of the reproduction number as an optimization problem, combining data-model fidelity and space-time regularity constraints, solved by nonsmooth convex proximal minimizations. Though promising, that first formulation significantly lacks robustness against the Covid-19 data low quality (irrelevant or missing counts, pseudo-seasonalities,.. .) stemming from the emergency and crisis context, which significantly impairs accurate pandemic evolution assessments. The present work aims to overcome these limitations by carefully crafting a functional permitting to estimate jointly, in a single step, the reproduction number and outliers defined to model low quality data. This functional also enforces epidemiology-driven regularity properties for the reproduction number estimates, while preserving convexity, thus permitting the design of efficient minimization algorithms, based on proximity operators that are derived analytically. The explicit convergence of the proposed algorithm is proven theoretically. Its relevance is quantified on real Covid-19 data, consisting of daily new infection counts for 200+ countries and for the 96 metropolitan France counties, publicly available at Johns Hopkins University and Sant{é}-Publique-France. The procedure permits automated daily updates of these estimates, reported via animated and interactive maps. Open-source estimation procedures will be made publicly available.

preprint2020arXiv

Automated data-driven selection of the hyperparameters for Total-Variation based texture segmentation

Penalized Least Squares are widely used in signal and image processing. Yet, it suffers from a major limitation since it requires fine-tuning of the regularization parameters. Under assumptions on the noise probability distribution, Stein-based approaches provide unbiased estimator of the quadratic risk. The Generalized Stein Unbiased Risk Estimator is revisited to handle correlated Gaussian noise without requiring to invert the covariance matrix. Then, in order to avoid expansive grid search, it is necessary to design algorithmic scheme minimizing the quadratic risk with respect to regularization parameters. This work extends the Stein's Unbiased GrAdient estimator of the Risk of Deledalle et al. to the case of correlated Gaussian noise, deriving a general automatic tuning of regularization parameters. First, the theoretical asymptotic unbiasedness of the gradient estimator is demonstrated in the case of general correlated Gaussian noise. Then, the proposed parameter selection strategy is particularized to fractal texture segmentation, where problem formulation naturally entails inter-scale and spatially correlated noise. Numerical assessment is provided, as well as discussion of the practical issues.

preprint2020arXiv

Parameter-free and fast nonlinear piecewise filtering. Application to experimental physics

Numerous fields of nonlinear physics, very different in nature, produce signals and images, that share the common feature of being essentially constituted of piecewise homogeneous phases. Analyzing signals and images from corresponding experiments to construct relevant physical interpretations thus often requires detecting such phases and estimating accurately their characteristics (borders, feature differences, ...). However, situations of physical relevance often comes with low to very low signal to noise ratio precluding the standard use of classical linear filtering for analysis and denoising and thus calling for the design of advanced nonlinear signal/image filtering techniques. Additionally, when dealing with experimental physics signals/images, a second limitation is the large amount of data that need to be analyzed to yield accurate and relevant conclusions requiring the design of fast algorithms. The present work proposes a unified signal/image nonlinear filtering procedure, with fast algorithms and a data-driven automated hyperparameter tuning, based on proximal algorithms and Stein unbiased estimator principles. The interest and potential of these tools are illustrated at work on low-confinement solid friction signals and porous media multiphase flows.

preprint2019arXiv

Graph-based era segmentation of international financial integration

Assessing world-wide financial integration constitutes a recurrent challenge in macroeconometrics, often addressed by visual inspections searching for data patterns. Econophysics literature enables us to build complementary, data-driven measures of financial integration using graphs. The present contribution investigates the potential and interests of a novel 3-step approach that combines several state-of-the-art procedures to i) compute graph-based representations of the multivariate dependence structure of asset prices time series representing the financial states of 32 countries world-wide (1955-2015); ii) compute time series of 5 graph-based indices that characterize the time evolution of the topologies of the graph; iii) segment these time evolutions in piece-wise constant eras, using an optimization framework constructed on a multivariate multi-norm total variation penalized functional. The method shows first that it is possible to find endogenous stable eras of world-wide financial integration. Then, our results suggest that the most relevant globalization eras would be based on the historical patterns of global capital flows, while the major regulatory events of the 1970s would only appear as a cause of sub-segmentation.

preprint2014arXiv

Large deviations for correlated random variables described by a matrix product ansatz

We study the large deviations of sums of correlated random variables described by a matrix product ansatz, which generalizes the product structure of independent random variables to matrices whose non-commutativity is the source of correlations. We show with specific examples that different large deviation behaviors can be found with this ansatz. In particular, it is possible to construct sums of correlated random variables that violate the Law of Large Numbers, the Central Limit Theorem, as well as sums that have nonconvex rate functions or rate functions with linear parts or plateaux.

preprint2014arXiv

Statistics of sums of correlated variables described by a matrix product ansatz

We determine the asymptotic distribution of the sum of correlated variables described by a matrix product ansatz with finite matrices, considering variables with finite variances. In cases when the correlation length is finite, the law of large numbers is obeyed, and the rescaled sum converges to a Gaussian distribution. In constrast, when correlation extends over system size, we observe either a breaking of the law of large numbers, with the onset of giant fluctuations, or a generalization of the central limit theorem with a family of nonstandard limit distributions. The corresponding distributions are found as mixtures of delta functions for the generalized law of large numbers, and as mixtures of Gaussian distributions for the generalized central limit theorem. Connections with statistical physics models are emphasized.

preprint2013arXiv

On the existence of a glass transition in a Random Energy Model

We consider a generalized version of the Random Energy Model in which the energy of each configuration is given by the sum of $N$ independent contributions ("local energies") with finite variances but otherwise arbitrary statistics. Using the large deviation formalism, we find that the glass transition generically exists when local energies have a smooth distribution. In contrast, if the distribution of the local energies has a {Dirac mass} at the minimal energy (e.g., if local energies take discrete values), the glass transition ceases to exist if the number of energy levels grows sufficiently fast with system size. This shows that statistical independence of energy levels does not imply the existence of a glass transition.

preprint2012arXiv

Hyperbolic wavelet transform: an efficient tool for multifractal analysis of anisotropic textures

Global and local regularities of functions are analyzed in anisotropic function spaces, under a common framework, that of hyperbolic wavelet bases. Local and directional regularity features are characterized by means of global quantities constructed upon the coefficients of hyperbolic wavelet decompositions. A multifractal analysis is introduced, that jointly accounts for scale invariance and anisotropy. Its properties are studied in depth.

preprint2012arXiv

Matrix product representation and synthesis for random vectors: Insight from statistical physics

Inspired from modern out-of-equilibrium statistical physics models, a matrix product based framework permits the formal definition of random vectors (and random time series) whose desired joint distributions are a priori prescribed. Its key feature consists of preserving the writing of the joint distribution as the simple product structure it has under independence, while inputing controlled dependencies amongst components: This is obtained by replacing the product of distributions by a product of matrices of distributions. The statistical properties stemming from this construction are studied theoretically: The landscape of the attainable dependence structure is thoroughly depicted and a stationarity condition for time series is notably obtained. The remapping of this framework onto that of Hidden Markov Models enables us to devise an efficient and accurate practical synthesis procedure. A design procedure is also described permitting the tuning of model parameters to attain targeted properties. Pedagogical well-chosen examples of times series and multivariate vectors aim at illustrating the power and versatility of the proposed approach and at showing how targeted statistical properties can be actually prescribed.

preprint2012arXiv

Renormalization flow for extreme value statistics of random variables raised to a varying power

Using a renormalization approach, we study the asymptotic limit distribution of the maximum value in a set of independent and identically distributed random variables raised to a power q(n) that varies monotonically with the sample size n. Under these conditions, a non-standard class of max-stable limit distributions, which mirror the classical ones, emerges. Furthermore a transition mechanism between the classical and the non-standard limit distributions is brought to light. If q(n) grows slower than a characteristic function q*(n), the standard limit distributions are recovered, while if q(n) behaves asymptotically as k.q*(n), non-standard limit distributions emerge.

preprint2011arXiv

Critical moment definition and estimation, for finite size observation of log-exponential-power law random variables

This contribution aims at studying the behaviour of the classical sample moment estimator, $S(n,q)= \sum_{k=1}^n X_k^{q}/n $, as a function of the number of available samples $n$, in the case where the random variables $X$ are positive, have finite moments at all orders and are naturally of the form $X= \exp Y$ with the tail of $Y$ behaving like $e^{-y^ρ}$. This class of laws encompasses and generalizes the classical example of the log-normal law. This form is motivated by a number of applications stemming from modern statistical physics or multifractal analysis. Borrowing heuristic and analytical results from the analysis of the Random Energy Model in statistical physics, a critical moment $q_c(n)$ is defined as the largest statistical order $q$ up to which the sample mean estimator $S(n,q)$ correctly accounts for the ensemble average $\E X^q$, for a given $n$. A practical estimator for the critical moment $q_c(n)$ is then proposed. Its statistical performance are studied analytically and illustrated numerically in the case of \emph{i.i.d.} samples. A simple modification is proposed to explicitly account for correlation amongst the observed samples. Estimation performance are then carefully evaluated by means of Monte-Carlo simulations in the practical case of correlated time series.

preprint2011arXiv

Linearization effect in multifractal analysis: Insights from the Random Energy Model

The analysis of the linearization effect in multifractal analysis, and hence of the estimation of moments for multifractal processes, is revisited borrowing concepts from the statistical physics of disordered systems, notably from the analysis of the so-called Random Energy Model. Considering a standard multifractal process (compound Poisson motion), chosen as a simple representative example, we show: i) the existence of a critical order $q^*$ beyond which moments, though finite, cannot be estimated through empirical averages, irrespective of the sample size of the observation; ii) that multifractal exponents necessarily behave linearly in $q$, for $q > q^*$. Tayloring the analysis conducted for the Random Energy Model to that of Compound Poisson motion, we provide explicative and quantitative predictions for the values of $q^*$ and for the slope controlling the linear behavior of the multifractal exponents. These quantities are shown to be related only to the definition of the multifractal process and not to depend on the sample size of the observation. Monte-Carlo simulations, conducted over a large number of large sample size realizations of compound Poisson motion, comfort and extend these analyses.

preprint2007arXiv

Bounds for the covariance of functions of infinite variance stable random variables with applications to central limit theorems and wavelet-based estimation

We establish bounds for the covariance of a large class of functions of infinite variance stable random variables, including unbounded functions such as the power function and the logarithm. These bounds involve measures of dependence between the stable variables, some of which are new. The bounds are also used to deduce the central limit theorem for unbounded functions of stable moving average time series. This result extends the earlier results of Tailen Hsing and the authors on central limit theorems for bounded functions of stable moving averages. It can be used to show asymptotic normality of wavelet-based estimators of the self-similarity parameter in fractional stable motions.