Source author record

Soumendu Sundar Mukherjee

Soumendu Sundar Mukherjee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

math.PR, Machine Learning, math.ST, Methodology, Statistics Theory, math.OA, Applications, astro-ph.GA, astro-ph.SR, Computation, econ.EM, math.CA, quant-ph, Social and Information Networks

Catalog footprint

What is connected

13 works

14 topics

4 close collaborators

preprint, 2022, arXiv

Concentration inequalities for correlated network-valued processes with applications to community estimation and changepoint analysis

Network-valued time series are currently a common form of network data. However, the study of the aggregate behavior of network sequences generated from network-valued stochastic processes is relatively rare. Most of the existing research focuses on the simple setup where the networks are independent (or conditionally independent) across time, and all edges are updated synchronously at each time step. In this paper, we study the concentration properties of the aggregated adjacency matrix and the corresponding Laplacian matrix associated with network sequences generated from lazy network-valued stochastic processes, where edges update asynchronously, and each edge follows a lazy stochastic process for its updates independent of the other edges. We demonstrate the usefulness of these concentration results in proving consistency of standard estimators in community estimation and changepoint estimation problems. We also conduct a simulation study to demonstrate the effect of the laziness parameter, which controls the extent of temporal correlation, on the accuracy of community and changepoint estimation.
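The lazy edge dynamics described above can be illustrated with a small simulation. This is only a sketch under stated assumptions: the abstract does not give the exact update rule, so the code assumes that at each time step every edge independently keeps its current state with probability `laziness` and is otherwise redrawn as a Bernoulli variable with its entry of a mean matrix `P`. The function name and all parameter names are hypothetical.

```python
import numpy as np

def simulate_lazy_network_sequence(P, T, laziness, rng):
    """Simulate T symmetric 0/1 adjacency matrices under an ASSUMED lazy rule.

    Assumption (not taken from the paper): each edge independently keeps its
    state with probability `laziness`, else is redrawn as Bernoulli(P[i, j]).
    """
    n = P.shape[0]
    upper = np.triu_indices(n, k=1)          # edges (i, j) with i < j, no self-loops
    probs = P[upper]
    edges = rng.random(probs.size) < probs   # initial edge states
    networks = []
    for _ in range(T):
        refresh = rng.random(probs.size) >= laziness  # edges that update now
        redraw = rng.random(probs.size) < probs       # fresh Bernoulli draws
        edges = np.where(refresh, redraw, edges)      # asynchronous update
        A = np.zeros((n, n), dtype=int)
        A[upper] = edges
        networks.append(A + A.T)                      # symmetrize
    return networks

def aggregated_adjacency(networks):
    """Time-averaged adjacency matrix of a network sequence."""
    return np.mean(networks, axis=0)
```

With `laziness=0` the networks are independent across time, and with `laziness=1` every network equals the initial one, so the parameter interpolates between no temporal correlation and complete temporal correlation in this toy model.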

preprint, 2022, arXiv

Learning with latent group sparsity via heat flow dynamics on networks

Group or cluster structure on explanatory variables in machine learning problems is a very general phenomenon, which has attracted broad interest from practitioners and theoreticians alike. In this work we contribute an approach to learning under such group structure, that does not require prior information on the group identities. Our paradigm is motivated by the Laplacian geometry of an underlying network with a related community structure, and proceeds by directly incorporating this into a penalty that is effectively computed via a heat flow-based local network dynamics. In fact, we demonstrate a procedure to construct such a network based on the available data. Notably, we dispense with computationally intensive pre-processing involving clustering of variables, spectral or otherwise. Our technique is underpinned by rigorous theorems that guarantee its effective performance and provide bounds on its sample complexity. In particular, in a wide range of settings, it provably suffices to run the heat flow dynamics for time that is only logarithmic in the problem dimensions. We explore in detail the interfaces of our approach with key statistical physics models in network science, such as the Gaussian Free Field and the Stochastic Block Model. We validate our approach by successful applications to real-world data from a wide array of application domains, including computer science, genetics, climatology and economics. Our work raises the possibility of applying similar diffusion-based techniques to classical learning tasks, exploiting the interplay between geometric, dynamical and stochastic structures underlying the data.

preprint, 2022, arXiv

On $*$-Convergence of Schur-Hadamard Products of Independent Nonsymmetric Random Matrices

Let $\{x_\alpha\}_{\alpha \in \mathbb{Z}}$ and $\{y_\alpha\}_{\alpha \in \mathbb{Z}}$ be two independent collections of zero mean, unit variance random variables with uniformly bounded moments of all orders. Consider a nonsymmetric Toeplitz matrix $X_n = ((x_{i - j}))_{1 \le i, j \le n}$ and a Hankel matrix $Y_n = ((y_{i + j}))_{1 \le i, j \le n}$, and let $M_n = X_n \odot Y_n$ be their elementwise/Schur-Hadamard product. In this article, we show that almost surely, $n^{-1/2}M_n$, as an element of the $*$-probability space $(\mathcal{M}_n(\mathbb{C}), \frac{1}{n}\mathrm{tr})$, converges in $*$-distribution to a circular variable. With i.i.d. Rademacher entries, this construction gives a matrix model for circular variables with only $O(n)$ bits of randomness. We also consider a dependent setup where $\{x_\alpha\}$ and $\{y_\beta\}$ are independent strongly multiplicative systems (à la Gaposhkin [7]) satisfying an additional \emph{admissibility} condition, and have uniformly bounded moments of all orders -- a nontrivial example of such a system being $\{\sqrt{2}\sin(2^n \pi U)\}_{n \in \mathbb{Z}_+}$, where $U \sim \mathrm{Uniform}(0, 1)$. In this case, we show in-expectation and in-probability convergence of the $*$-moments of $n^{-1/2}M_n$ to those of a circular variable. Finally, we generalise our results to Schur-Hadamard products of structured random matrices of the form $X_n = ((x_{L_X(i, j)}))_{1 \le i, j \le n}$ and $Y_n = ((y_{L_Y(i, j)}))_{1 \le i, j \le n}$, under certain assumptions on the \emph{link-functions} $L_X$ and $L_Y$, most notably the injectivity of the map $(i, j) \mapsto (L_X(i, j), L_Y(i, j))$. Based on numerical evidence, we conjecture that the circular law $\mu_{\mathrm{circ}}$, i.e. the uniform measure on the unit disk of $\mathbb{C}$, which is also the Brown measure of a circular variable, is in fact the limiting spectral measure of $n^{-1/2}M_n$.
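The Rademacher construction in this abstract is easy to reproduce numerically. The following is a minimal sketch, assuming NumPy and zero-based indexing; the function name is hypothetical. It draws $2n - 1$ signs for the Toeplitz input and $2n - 1$ signs for the Hankel input, so the whole matrix uses $O(n)$ random bits.

```python
import numpy as np

def schur_hadamard_toeplitz_hankel(n, rng):
    """Return n^{-1/2} (X_n ⊙ Y_n) with i.i.d. Rademacher inputs.

    X_n[i, j] = x_{i - j} (nonsymmetric Toeplitz), Y_n[i, j] = y_{i + j} (Hankel).
    """
    x = rng.choice([-1.0, 1.0], size=2 * n - 1)  # x_{-(n-1)}, ..., x_{n-1}
    y = rng.choice([-1.0, 1.0], size=2 * n - 1)  # y indexed by i + j (zero-based)
    i, j = np.indices((n, n))
    X = x[i - j + n - 1]   # shift so that the index i - j is non-negative
    Y = y[i + j]
    return (X * Y) / np.sqrt(n)  # elementwise product, scaled by n^{-1/2}
```

Because every entry has modulus $n^{-1/2}$, the normalized trace $\frac{1}{n}\mathrm{tr}(M M^*)$ equals 1 exactly, matching the second $*$-moment of a circular variable. The eigenvalues from `np.linalg.eigvals` can be plotted against the unit disk to explore the conjectured circular law informally.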

preprint, 2022, arXiv

Some characterization results on classical and free Poisson thinning

Poisson thinning is an elementary result in probability, which is of great importance in the theory of Poisson point processes. In this article, we record a couple of characterization results on Poisson thinning. We also consider several free probability analogues of Poisson thinning, which we collectively dub \emph{free Poisson thinning}, and prove characterization results for them, similar to the classical case. One of these free Poisson thinning procedures arises naturally as a high-dimensional asymptotic analogue of Cochran's theorem from multivariate statistics on the "Wishart-ness" of quadratic functions of Gaussian random matrices. We note the implications of our characterization results in the context of Cochran's theorem. We also prove a free probability analogue of Craig's theorem, another well-known result in multivariate statistics on the independence of quadratic functions of Gaussian random matrices.
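For readers unfamiliar with the classical result, a short simulation shows what Poisson thinning says: if a Poisson($\lambda$) count is split by keeping each point independently with probability $p$, the kept and dropped counts are independent Poisson variables with means $\lambda p$ and $\lambda(1 - p)$. The sketch below covers only this classical statement, not the free analogues; the function name is hypothetical.

```python
import numpy as np

def thin_poisson_counts(lam, p, size, rng):
    """Draw Poisson(lam) counts and thin each one with retention probability p."""
    total = rng.poisson(lam, size=size)
    kept = rng.binomial(total, p)   # each point is kept independently with prob. p
    dropped = total - kept
    return kept, dropped
```

In a large sample, the empirical means of `kept` and `dropped` should be close to $\lambda p$ and $\lambda(1 - p)$, and their empirical covariance close to zero.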

preprint, 2021, arXiv

Distribution of Eigenvalues of Matrix Ensembles arising from Wigner and Palindromic Toeplitz Blocks

Random Matrix Theory (RMT) has successfully modeled diverse systems, from energy levels of heavy nuclei to zeros of $L$-functions; this correspondence has allowed RMT to successfully predict many number theoretic behaviors. However, there are some operations which to date have no RMT analogue. Our motivation is to find an RMT analogue of Rankin-Selberg convolution, which constructs a new $L$-function from an input pair. We report one such attempt; while it does not appear to model convolution, it does create new ensembles with properties hybridizing those of its constituents. For definiteness we concentrate on the ensemble of palindromic real symmetric Toeplitz (PST) matrices and the ensemble of real symmetric matrices, whose limiting spectral measures are the Gaussian and semi-circular distributions, respectively; these were chosen as they are the two extreme cases in terms of moment calculations. For a PST matrix $A$ and a real symmetric matrix $B$, we construct an ensemble of random real symmetric block matrices whose first row is $\lbrace A, B \rbrace$ and whose second row is $\lbrace B, A \rbrace$. By Markov's Method of Moments and the use of free probability, we show this ensemble converges weakly and almost surely to a new, universal distribution with a hybrid of Gaussian and semi-circular behaviors. We extend this construction by considering an iterated concatenation of matrices from an arbitrary pair of random real symmetric sub-ensembles with different limiting spectral measures. We prove that finite iterations converge to new, universal distributions with hybrid behavior, and that infinite iterations converge to the limiting spectral measure of the dominant component matrix.
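The block construction can be sketched in a few lines. The code below is illustrative only and rests on assumptions: it takes a PST matrix to be a real symmetric Toeplitz matrix whose first row is a palindrome (with even dimension), uses Gaussian entries, and applies no spectral normalization. All function names are hypothetical.

```python
import numpy as np

def palindromic_symmetric_toeplitz(n, rng):
    """Real symmetric Toeplitz matrix whose first row is a palindrome (n even).

    Assumed definition: first row is b_0, ..., b_{n/2-1}, b_{n/2-1}, ..., b_0.
    """
    half = rng.standard_normal(n // 2)
    first_row = np.concatenate([half, half[::-1]])
    i, j = np.indices((n, n))
    return first_row[np.abs(i - j)]   # entry (i, j) depends only on |i - j|

def real_symmetric(n, rng):
    """Real symmetric matrix with Gaussian entries (off-diagonal variance 1)."""
    G = rng.standard_normal((n, n))
    return (G + G.T) / np.sqrt(2)

def block_ensemble(A, B):
    """Block matrix with first row {A, B} and second row {B, A}."""
    return np.block([[A, B], [B, A]])
```

A useful sanity check is that the spectrum of this block matrix is the union of the spectra of $A + B$ and $A - B$, which holds for any pair of square matrices arranged this way.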

preprint, 2020, arXiv

Consistent detection and optimal localization of all detectable change points in piecewise stationary arbitrarily sparse network-sequences

We consider the offline change point detection and localization problem in the context of piecewise stationary networks, where the observable is a finite sequence of networks. We develop algorithms involving some suitably modified CUSUM statistics based on adaptively trimmed adjacency matrices of the observed networks for both detection and localization of single or multiple change points present in the input data. We provide rigorous theoretical analysis and finite sample estimates evaluating the performance of the proposed methods when the input (finite sequence of networks) is generated from an inhomogeneous random graph model, where the change points are characterized by the change in the mean adjacency matrix. We show that the proposed algorithms can detect (resp. localize) all change points, where the change in the expected adjacency matrix is above the minimax detectability (resp. localizability) threshold, consistently without any a priori assumption about (a) a lower bound for the sparsity of the underlying networks, (b) an upper bound for the number of change points, and (c) a lower bound for the separation between successive change points, provided either the minimum separation between successive pairs of change points or the average degree of the underlying networks goes to infinity arbitrarily slowly. We also prove that the above condition is necessary to have consistency.

preprint, 2020, arXiv

Exact Tests for Offline Changepoint Detection in Multichannel Binary and Count Data with Application to Networks

We consider offline detection of a single changepoint in binary and count time-series. We compare exact tests based on the cumulative sum (CUSUM) and the likelihood ratio (LR) statistics, and a new proposal that combines exact two-sample conditional tests with multiplicity correction, against standard asymptotic tests based on the Brownian bridge approximation to the CUSUM statistic. We see empirically that the exact tests are much more powerful in situations where normal approximations driving asymptotic tests are not trustworthy: (i) small sample settings; (ii) sparse parametric settings; (iii) time-series with changepoint near the boundary. We also consider a multichannel version of the problem, where channels can have different changepoints. Controlling the False Discovery Rate (FDR), we simultaneously detect changes in multiple channels. This "local" approach is shown to be more advantageous than multivariate global testing approaches when the number of channels with changepoints is much smaller than the total number of channels. As a natural application, we consider network-valued time-series and use our approach with (a) edges as binary channels and (b) node-degrees or other local subgraph statistics as count channels. The local testing approach is seen to be much more informative than global network changepoint algorithms.
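As background for the CUSUM statistic mentioned above, here is a minimal sketch of the standard CUSUM scan for a single channel. It illustrates the statistic only; it does not implement the exact tests, the conditional two-sample tests, or the FDR procedure from the abstract, and the function names are hypothetical.

```python
import numpy as np

def cusum_statistics(x):
    """Standard CUSUM statistic at every candidate split t = 1, ..., n - 1.

    C_t = sqrt(t (n - t) / n) * |mean(x[:t]) - mean(x[t:])|
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    t = np.arange(1, n)
    left_sums = np.cumsum(x)[:-1]                 # sum of the first t observations
    left_mean = left_sums / t
    right_mean = (x.sum() - left_sums) / (n - t)
    return np.sqrt(t * (n - t) / n) * np.abs(left_mean - right_mean)

def estimate_changepoint(x):
    """Number of observations before the estimated change (argmax of CUSUM)."""
    return int(np.argmax(cusum_statistics(x))) + 1
```

For a network sequence, the same scan could be run on each edge indicator or node degree as a separate channel, which is the "local" viewpoint the abstract describes.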

preprint, 2014, arXiv

Bulk behaviour of Schur-Hadamard products of symmetric random matrices

We develop a general method for establishing the existence of the Limiting Spectral Distributions (LSD) of Schur-Hadamard products of independent symmetric patterned random matrices. We apply this method to show that the LSDs of Schur-Hadamard products of some common patterned matrices exist and identify the limits. In particular, the Schur-Hadamard product of independent Toeplitz and Hankel matrices has the semi-circular LSD. We also prove an invariance theorem that may be used to find the LSD in many examples.

preprint, 2014, arXiv

Bulk behaviour of skew-symmetric patterned random matrices

Limiting Spectral Distributions (LSD) of real symmetric patterned matrices have been well-studied. In this article, we consider skew-symmetric/anti-symmetric patterned random matrices and establish the LSDs of several common matrices. For the skew-symmetric Wigner, skew-symmetric Toeplitz and the skew-symmetric Circulant, the LSDs (on the imaginary axis) are the same as those in the symmetric cases. For the skew-symmetric Hankel and the skew-symmetric Reverse Circulant however, we obtain new LSDs. We also show the existence of the LSDs for the triangular versions of these matrices. We then introduce a related modification of the symmetric matrices by changing the sign of the lower triangle part of the matrices. In this case, the modified Wigner, modified Hankel and the modified Reverse Circulants have the same LSDs as their usual symmetric counterparts while new LSDs are obtained for the modified Toeplitz and the modified Symmetric Circulant.

preprint, 2014, arXiv

Limiting spectral distribution of a class of Hankel type random matrices

We consider an indexed class of real symmetric random matrices which generalize the symmetric Hankel and Reverse Circulant matrices. We show that the limiting spectral distributions of these matrices exist almost surely and the limit is continuous in the index. We also study other properties of the limit.

preprint, 2014, arXiv

Minimum Distance Estimation of Milky Way Model Parameters and Related Inference

We propose a method based on the Hellinger distance to estimate the location of the Sun in the disk of the Milky Way, and we construct confidence sets for this estimate using a bootstrap-based method. Assuming the Galactic disk to be two-dimensional, the sought solar location reduces to the radial distance separating the Sun from the Galactic center and the angular separation of the line joining the Galactic center to the Sun from a pre-fixed line on the disk. On astronomical scales, the unknown solar location is equivalent to the location of us earthlings who observe the velocities of a sample of stars in the neighborhood of the Sun. This unknown location is estimated by undertaking pairwise comparisons of the estimated density of the observed set of velocities of the sampled stars, with densities estimated using synthetic stellar velocity data sets generated at chosen locations in the Milky Way disk according to four base astrophysical models. The "match" between the pair of estimated densities is parameterized by the affinity measure based on the familiar Hellinger distance. We perform a novel cross-validation procedure to establish a desirable "consistency" property of the proposed method.
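The affinity measure used for the "match" can be illustrated with a simple histogram version. This is a simplified one-dimensional sketch, not the paper's density estimator: it bins two samples on a common grid and computes the Hellinger affinity $\sum_k \sqrt{p_k q_k}$, with the Hellinger distance given by $\sqrt{1 - \text{affinity}}$. The function names, the bin count and the use of histograms are assumptions.

```python
import numpy as np

def hellinger_affinity(sample_a, sample_b, bins=20, value_range=None):
    """Hellinger affinity between histogram estimates of two 1-D samples."""
    if value_range is None:
        lo = min(np.min(sample_a), np.min(sample_b))
        hi = max(np.max(sample_a), np.max(sample_b))
        value_range = (lo, hi)
    p, _ = np.histogram(sample_a, bins=bins, range=value_range)
    q, _ = np.histogram(sample_b, bins=bins, range=value_range)
    p = p / p.sum()   # normalize counts to probabilities
    q = q / q.sum()
    return float(np.sum(np.sqrt(p * q)))

def hellinger_distance(sample_a, sample_b, bins=20, value_range=None):
    """Hellinger distance H = sqrt(1 - affinity), a value in [0, 1]."""
    affinity = hellinger_affinity(sample_a, sample_b, bins, value_range)
    return float(np.sqrt(max(0.0, 1.0 - affinity)))
```

A minimum distance estimate in this spirit would pick, among candidate locations, the one whose synthetic sample has the largest affinity with the observed sample.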

preprint, 2014, arXiv

On Pseudo-Hermitian Hamiltonians

We investigate some questions on the construction of $\eta$ operators for pseudo-Hermitian Hamiltonians. We give a sufficient condition which can be exploited to systematically generate a sequence of $\eta$ operators starting from a known one, thereby proving the non-uniqueness of $\eta$ for a particular pseudo-Hermitian Hamiltonian. We also study perturbed Hamiltonians for which $\eta$'s corresponding to the original Hamiltonian still work.

preprint, 2013, arXiv

An Approximation Inequality for Continued Radicals and Power Forms

In this article we derive an approximation inequality for continued radicals, generalizing an inequality of Herschfeld for continued square roots to arbitrary radicals, which is useful in exploring convergence issues and obtaining convergence rates. In fact, we generalize this inequality further to encompass the more general continued power forms. We demonstrate the use of this inequality by obtaining estimates for the convergence rates of several continued radicals including the famous Ramanujan radical.
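The convergence behaviour discussed here can be observed numerically for the Ramanujan radical $\sqrt{1 + 2\sqrt{1 + 3\sqrt{1 + \cdots}}} = 3$. The sketch below evaluates finite truncations from the inside out. It illustrates convergence only and does not implement the paper's inequality; the function name and the truncation convention (innermost radicand set to 1) are assumptions.

```python
import math

def ramanujan_radical(depth):
    """Truncation of sqrt(1 + 2 sqrt(1 + 3 sqrt(1 + ...))) with `depth` radicals.

    The innermost nested value is replaced by 1 (an assumed convention).
    """
    value = 1.0
    for k in range(depth + 1, 1, -1):   # k = depth + 1, ..., 3, 2 (inside out)
        value = math.sqrt(1.0 + k * value)
    return value
```

The truncations increase with `depth` and approach 3 from below, with the gap shrinking roughly geometrically as the depth grows.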
