Researcher profile

Somabha Mukherjee

Somabha Mukherjee contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2022arXiv

Phase Transitions of the Maximum Likelihood Estimates in the $p$-Spin Curie-Weiss Model

In this paper we consider the problem of parameter estimation in the $p$-spin Curie-Weiss model, for $p \geq 3$. We provide a complete description of the limiting properties of the maximum likelihood (ML) estimates of the inverse temperature and the magnetic field given a single realization from the $p$-spin Curie-Weiss model, complementing the well-known results in the 2-spin case (Comets and Gidas (1991)). Our results unearth various new phase transitions and surprising limit theorems, such as the existence of a 'critical' curve in the parameter space, where the limiting distribution of the ML estimates is a mixture with both continuous and discrete components. The number of mixture components is either two or three, depending on, among other things, the sign of one of the parameters and the parity of $p$. Another interesting revelation is the existence of certain 'special' points in the parameter space where the ML estimates exhibit a superefficiency phenomenon, converging to a non-Gaussian limiting distribution at rate $N^{\frac{3}{4}}$. Using these results we can obtain asymptotically valid confidence intervals for the inverse temperature and the magnetic field at all points in the parameter space where consistent estimation is possible.

preprint2020arXiv

Asymptotic bounds on graphical partitions and partition comparability

An integer partition is called graphical if it is the degree sequence of a simple graph. We prove that the probability that a uniformly chosen partition of size $n$ is graphical decreases to zero faster than $n^{-.003}$, answering a question of Pittel. A lower bound of $n^{-1/2}$ was proven by Erdős and Richmond, and so this demonstrates that the probability decreases polynomially. Key to our argument is an asymptotic result of Pittel characterizing the joint distribution of the first rows and columns of a uniformly random partition, combined with a characterization of graphical partitions due to Erdős and Gallai. Our proof also implies a polynomial upper bound for the probability that two randomly chosen partitions are comparable in the dominance order.

preprint2020arXiv

Estimation in Tensor Ising Models

The $p$-tensor Ising model is a one-parameter discrete exponential family for modeling dependent binary data, where the sufficient statistic is a multi-linear form of degree $p \geq 2$. This is a natural generalization of the matrix Ising model, that provides a convenient mathematical framework for capturing higher-order dependencies in complex relational data. In this paper, we consider the problem of estimating the natural parameter of the $p$-tensor Ising model given a single sample from the distribution on $N$ nodes. Our estimate is based on the maximum pseudo-likelihood (MPL) method, which provides a computationally efficient algorithm for estimating the parameter that avoids computing the intractable partition function. We derive general conditions under which the MPL estimate is $\sqrt N$-consistent, that is, it converges to the true parameter at rate $1/\sqrt N$. In particular, we show the $\sqrt N$-consistency of the MPL estimate in the $p$-spin Sherrington-Kirkpatrick (SK) model, spin systems on general $p$-uniform hypergraphs, and Ising models on the hypergraph stochastic block model (HSBM). In fact, for the HSBM we pin down the exact location of the phase transition threshold, which is determined by the positivity of a certain mean-field variational problem, such that above this threshold the MPL estimate is $\sqrt N$-consistent, while below the threshold no estimator is consistent. Finally, we derive the precise fluctuations of the MPL estimate in the special case of the $p$-tensor Curie-Weiss model. An interesting consequence of our results is that the MPL estimate in the Curie-Weiss model saturates the Cramer-Rao lower bound at all points above the estimation threshold, that is, the MPL estimate incurs no loss in asymptotic efficiency, even though it is obtained by minimizing only an approximation of the true likelihood function for computational tractability.

preprint2020arXiv

The Second Moment Phenomenon for Monochromatic Subgraphs

What is the chance that among a group of $n$ friends, there are $s$ friends all of whom have the same birthday? This is the celebrated birthday problem which can be formulated as the existence of a monochromatic $s$-clique $K_s$ ($s$-matching birthdays) in the complete graph $K_n$, where every vertex of $K_n$ is uniformly colored with $365$ colors (corresponding to birthdays). More generally, for a general connected graph $H$, let $T(H, G_n)$ be the number of monochromatic copies of $H$ in a uniformly random coloring of the vertices of the graph $G_n$ with $c_n$ colors. In this paper we show that $T(H, G_n)$ converges to $\mathrm{Pois}(λ)$ whenever $\mathbb E T(H, G_n) \rightarrow λ$ and $\mathrm{Var} T(H, G_n) \rightarrow λ$, that is, the asymptotic Poisson distribution of $T(H, G_n)$ is determined just by the convergence of its mean and variance. Moreover, this condition is necessary if and only if $H$ is a star-graph. In fact, the second-moment phenomenon is a consequence of a more general theorem about the convergence of $T(H,G_n)$ to a finite linear combination of independent Poisson random variables. As an application, we derive the limiting distribution of $T(H, G_n)$, when $G_n\sim G(n, p)$ is the Erd\H os-Rényi random graph. Multiple phase-transitions emerge as $p$ varies from 0 to 1, depending on whether the graph $H$ is balanced or unbalanced.