Researcher profile

Walid Hachem

Walid Hachem contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2022arXiv

Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

This paper studies the asymptotic behavior of the constant step Stochastic Gradient Descent for the minimization of an unknown function F , defined as the expectation of a non convex, non smooth, locally Lipschitz random function. As the gradient may not exist, it is replaced by a certain operator: a reasonable choice is to use an element of the Clarke subdifferential of the random function; an other choice is the output of the celebrated backpropagation algorithm, which is popular amongst practionners, and whose properties have recently been studied by Bolte and Pauwels [7]. Since the expectation of the chosen operator is not in general an element of the Clarke subdifferential BF of the mean function, it has been assumed in the literature that an oracle of BF is available. As a first result, it is shown in this paper that such an oracle is not needed for almost all initialization points of the algorithm. Next, in the small step size regime, it is shown that the interpolated trajectory of the algorithm converges in probability (in the compact convergence sense) towards the set of solutions of the differential inclusion. Finally, viewing the iterates as a Markov chain whose transition kernel is indexed by the step size, it is shown that the invariant distribution of the kernel converge weakly to the set of invariant distribution of this differential inclusion as the step size tends to zero. These results show that when the step size is small, with large probability, the iterates eventually lie in a neighborhood of the critical points of the mean function F .

preprint2022arXiv

Spectral measure of empirical autocovariance matrices of high dimensional Gaussian stationary processes

Consider the empirical autocovariance matrix at a given non-zero time lag based on observations from a multivariate complex Gaussian stationary time series. The spectral analysis of these autocovariance matrices can be useful in certain statistical problems, such as those related to testing for white noise. We study the behavior of their spectral measures in the asymptotic regime where the time series dimension and the observation window length both grow to infinity, and at the same rate. Following a general framework in the field of the spectral analysis of large random non-Hermitian matrices, at first the probabilistic behavior of the small singular values of the shifted versions of the autocovariance matrix are obtained. This is then used to infer about the large sample behaviour of the empirical spectral measure of the autocovariance matrices at any lag. Matrix orthogonal polynomials on the unit circle play a crucial role in our study.

preprint2020arXiv

A Fully Stochastic Primal-Dual Algorithm

A new stochastic primal--dual algorithm for solving a composite optimization problem is proposed. It is assumed that all the functions/operators that enter the optimization problem are given as statistical expectations. These expectations are unknown but revealed across time through i.i.d. realizations. The proposed algorithm is proven to converge to a saddle point of the Lagrangian function. In the framework of the monotone operator theory, the convergence proof relies on recent results on the stochastic Forward Backward algorithm involving random monotone operators. An example of convex optimization under stochastic linear constraints is considered.

preprint2020arXiv

Non-Hermitian random matrices with a variance profile (I): Deterministic equivalents and limiting ESDs

For each $n$, let $A_n=(σ_{ij})$ be an $n\times n$ deterministic matrix and let $X_n=(X_{ij})$ be an $n\times n$ random matrix with i.i.d. centered entries of unit variance. We study the asymptotic behavior of the empirical spectral distribution $μ_n^Y$ of the rescaled entry-wise product \[ Y_n = \left(\frac1{\sqrt{n}} σ_{ij}X_{ij}\right). \] For our main result we provide a deterministic sequence of probability measures $μ_n$, each described by a family of Master Equations, such that the difference $μ^Y_n - μ_n$ converges weakly in probability to the zero measure. A key feature of our results is to allow some of the entries $σ_{ij}$ to vanish, provided that the standard deviation profiles $A_n$ satisfy a certain quantitative irreducibility property. An important step is to obtain quantitative bounds on the solutions to an associate system of Schwinger--Dyson equations, which we accomplish in the general sparse setting using a novel graphical bootstrap argument.

preprint2020arXiv

Non-Hermitian random matrices with a variance profile (II): properties and examples

For each $n$, let $A_n=(σ_{ij})$ be an $n\times n$ deterministic matrix and let $X_n=(X_{ij})$ be an $n\times n$ random matrix with i.i.d. centered entries of unit variance. In the companion article Cook et al., we considered the empirical spectral distribution $μ_n^Y$ of the rescaled entry-wise product \[ Y_n = \frac 1{\sqrt{n}} A_n\odot X_n = \left(\frac1{\sqrt{n}} σ_{ij}X_{ij}\right) \] and provided a deterministic sequence of probability measures $μ_n$ such that the difference $μ^Y_n - μ_n$ converges weakly in probability to the zero measure. A key feature in Cook et al. was to allow some of the entries $σ_{ij}$ to vanish, provided that the standard deviation profiles $A_n$ satisfy a certain quantitative irreducibility property. In the present article, we provide more information on the sequence $(μ_n)$, described by a family of Master Equations. We consider these equations in important special cases such as separable variance profiles $σ^2_{ij}=d_i \widetilde d_j$ and sampled variance profiles $σ^2_{ij} = σ^2\left(\frac in, \frac jn \right)$ where $(x,y)\mapsto σ^2(x,y)$ is a given function on $[0,1]^2$. Associate examples are provided where $μ_n^Y$ converges to a genuine limit. We study $μ_n$'s behavior at zero and provide examples where $μ_n$'s density is bounded, blows up, or vanishes while an atom appears. As a consequence, we identify the profiles that yield the circular law. Finally, building upon recent results from Alt et al., we prove that except maybe in zero, $μ_n$ admits a positive density on the centered disc of radius $\sqrt{ρ(V_n)}$, where $V_n=(\frac 1n σ_{ij}^2)$ and $ρ(V_n)$ is its spectral radius.