Researcher profile

Soumendu Sundar Mukherjee

Soumendu Sundar Mukherjee contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2022arXiv

Concentration inequalities for correlated network-valued processes with applications to community estimation and changepoint analysis

Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time step. In this paper, we study the concentration properties of the aggregated adjacency matrix and the corresponding Laplacian matrix associated with network sequences generated from lazy network-valued stochastic processes, where edges update asynchronously, and each edge follows a lazy stochastic process for its updates independent of the other edges. We demonstrate the usefulness of these concentration results in proving consistency of standard estimators in community estimation and changepoint estimation problems. We also conduct a simulation study to demonstrate the effect of the laziness parameter, which controls the extent of temporal correlation, on the accuracy of community and changepoint estimation.

preprint2022arXiv

Learning with latent group sparsity via heat flow dynamics on networks

Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to learning under such group structure, that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an underlying network with a related community structure, and proceeds by directly incorporating this into a penalty that is effectively computed via a heat flow-based local network dynamics. In fact, we demonstrate a procedure to construct such a network based on the available data. Notably, we dispense with computationally intensive pre-processing involving clustering of variables, spectral or otherwise. Our technique is underpinned by rigorous theorems that guarantee its effective performance and provide bounds on its sample complexity. In particular, in a wide range of settings, it provably suffices to run the heat flow dynamics for time that is only logarithmic in the problem dimensions. We explore in detail the interfaces of our approach with key statistical physics models in network science, such as the Gaussian Free Field and the Stochastic Block Model. We validate our approach by successful applications to real-world data from a wide array of application domains, including computer science, genetics, climatology and economics. Our work raises the possibility of applying similar diffusion-based techniques to classical learning tasks, exploiting the interplay between geometric, dynamical and stochastic structures underlying the data.

preprint2022arXiv

On $*$-Convergence of Schur-Hadamard Products of Independent Nonsymmetric Random Matrices

Let $\{x_α\}_{α\in \mathbb{Z}}$ and $\{y_α\}_{α\in \mathbb{Z}}$ be two independent collections of zero mean, unit variance random variables with uniformly bounded moments of all orders. Consider a nonsymmetric Toeplitz matrix $X_n = ((x_{i - j}))_{1 \le i, j \le n}$ and a Hankel matrix $Y_n = ((y_{i + j}))_{1 \le i, j \le n}$, and let $M_n = X_n \odot Y_n$ be their elementwise/Schur-Hadamard product. In this article, we show that almost surely, $n^{-1/2}M_n$, as an element of the $*$-probability space $(\mathcal{M}_n(\mathbb{C}), \frac{1}{n}\mathrm{tr})$, converges in $*$-distribution to a circular variable. With i.i.d. Rademacher entries, this construction gives a matrix model for circular variables with only $O(n)$ bits of randomness. We also consider a dependent setup where $\{x_α\}$ and $\{y_β\}$ are independent strongly multiplicative systems (à la Gaposhkin [7]) satisfying an additional \emph{admissibility} condition, and have uniformly bounded moments of all orders -- a nontrivial example of such a system being $\{\sqrt{2}\sin(2^n πU)\}_{n \in \mathbb{Z}_+}$, where $U \sim \mathrm{Uniform}(0, 1)$. In this case, we show in-expectation and in-probability convergence of the $*$-moments of $n^{-1/2}M_n$ to those of a circular variable. Finally, we generalise our results to Schur-Hadamard products of structured random matrices of the form $X_n = ((x_{L_X(i, j)}))_{1 \le i, j \le n}$ and $Y_n = ((y_{L_Y(i, j)}))_{1 \le i, j \le n}$, under certain assumptions on the \emph{link-functions} $L_X$ and $L_Y$, most notably the injectivity of the map $(i, j) \mapsto (L_X(i, j), L_Y(i, j))$. Based on numerical evidence, we conjecture that the circular law $μ_{\mathrm{circ}}$, i.e. the uniform measure on the unit disk of $\mathbb{C}$, which is also the Brown measure of a circular variable, is in fact the limiting spectral measure of $n^{-1/2}M_n$.

preprint2022arXiv

Some characterization results on classical and free Poisson thinning

Poisson thinning is an elementary result in probability, which is of great importance in the theory of Poisson point processes. In this article, we record a couple of characterization results on Poisson thinning. We also consider several free probability analogues of Poisson thinning, which we collectively dub as \emph{free Poisson thinning}, and prove characterization results for them, similar to the classical case. One of these free Poisson thinning procedures arises naturally as a high-dimensional asymptotic analogue of Cochran's theorem from multivariate statistics on the "Wishart-ness" of quadratic functions of Gaussian random matrices. We note the implications of our characterization results in the context of Cochran's theorem. We also prove a free probability analogue of Craig's theorem, another well-known result in multivariate statistics on the independence of quadratic functions of Gaussian random matrices.

preprint2021arXiv

Distribution of Eigenvalues of Matrix Ensembles arising from Wigner and Palindromic Toeplitz Blocks

Random Matrix Theory (RMT) has successfully modeled diverse systems, from energy levels of heavy nuclei to zeros of $L$-functions; this correspondence has allowed RMT to successfully predict many number theoretic behaviors. However there are some operations which to date have no RMT analogue. Our motivation is to find an RMT analogue of Rankin-Selberg convolution, which constructs a new $L$-functions from an input pair. We report one such attempt; while it does not appear to model convolution, it does create new ensembles with properties hybridizing those of its constituents. For definiteness we concentrate on the ensemble of palindromic real symmetric Toeplitz (PST) matrices and the ensemble of real symmetric matrices, whose limiting spectral measures are the Gaussian and semi-circular distributions, respectively; these were chosen as they are the two extreme cases in terms of moment calculations. For a PST matrix $A$ and a real symmetric matrix $B$, we construct an ensemble of random real symmetric block matrices whose first row is $\lbrace A, B \rbrace$ and whose second row is $\lbrace B, A \rbrace$. By Markov's Method of Moments and the use of free probability, we show this ensemble converges weakly and almost surely to a new, universal distribution with a hybrid of Gaussian and semi-circular behaviors. We extend this construction by considering an iterated concatenation of matrices from an arbitrary pair of random real symmetric sub-ensembles with different limiting spectral measures. We prove that finite iterations converge to new, universal distributions with hybrid behavior, and that infinite iterations converge to the limiting spectral measure of the dominant component matrix.

preprint2020arXiv

Consistent detection and optimal localization of all detectable change points in piecewise stationary arbitrarily sparse network-sequences

We consider the offline change point detection and localization problem in the context of piecewise stationary networks, where the observable is a finite sequence of networks. We develop algorithms involving some suitably modified CUSUM statistics based on adaptively trimmed adjacency matrices of the observed networks for both detection and localization of single or multiple change points present in the input data. We provide rigorous theoretical analysis and finite sample estimates evaluating the performance of the proposed methods when the input (finite sequence of networks) is generated from an inhomogeneous random graph model, where the change points are characterized by the change in the mean adjacency matrix. We show that the proposed algorithms can detect (resp. localize) all change points, where the change in the expected adjacency matrix is above the minimax detectability (resp. localizability) threshold, consistently without any a priori assumption about (a) a lower bound for the sparsity of the underlying networks, (b) an upper bound for the number of change points, and (c) a lower bound for the separation between successive change points, provided either the minimum separation between successive pairs of change points or the average degree of the underlying networks goes to infinity arbitrarily slowly. We also prove that the above condition is necessary to have consistency.

preprint2020arXiv

Exact Tests for Offline Changepoint Detection in Multichannel Binary and Count Data with Application to Networks

We consider offline detection of a single changepoint in binary and count time-series. We compare exact tests based on the cumulative sum (CUSUM) and the likelihood ratio (LR) statistics, and a new proposal that combines exact two-sample conditional tests with multiplicity correction, against standard asymptotic tests based on the Brownian bridge approximation to the CUSUM statistic. We see empirically that the exact tests are much more powerful in situations where normal approximations driving asymptotic tests are not trustworthy: (i) small sample settings; (ii) sparse parametric settings; (iii) time-series with changepoint near the boundary. We also consider a multichannel version of the problem, where channels can have different changepoints. Controlling the False Discovery Rate (FDR), we simultaneously detect changes in multiple channels. This "local" approach is shown to be more advantageous than multivariate global testing approaches when the number of channels with changepoints is much smaller than the total number of channels. As a natural application, we consider network-valued time-series and use our approach with (a) edges as binary channels and (b) node-degrees or other local subgraph statistics as count channels. The local testing approach is seen to be much more informative than global network changepoint algorithms.