Researcher profile

Mehtaab Sawhney

Mehtaab Sawhney contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
21 works
0 followers
14 topics
4 close collaborators

Published work

21 published item(s)

preprint | 2022 | arXiv

Enumerating coprime permutations

Define a permutation $σ$ to be coprime if $\gcd(m,σ(m)) = 1$ for all $m\in[n]$. In this note we prove a recent conjecture of Pomerance: the number of coprime permutations on $[n]$ is $n!\cdot (c+o(1))^n$, where \[c = \prod_{p\text{ prime }}\frac{(p-1)^{2(1-1/p)}}{p\cdot (p-2)^{(1-2/p)}}.\] The techniques involve entropy maximization for the upper bound, and a mixture of number-theoretic bounds, permanent estimates, and the absorbing method for the lower bound.
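To make the definition concrete, here is a minimal brute-force sketch that counts coprime permutations for very small $n$. It only illustrates the definition and is unrelated to the entropy and absorption techniques of the paper; the function name `count_coprime_permutations` is a hypothetical choice.

```python
from itertools import permutations
from math import gcd


def count_coprime_permutations(n: int) -> int:
    """Count permutations sigma of [n] with gcd(m, sigma(m)) = 1 for all m."""
    count = 0
    for sigma in permutations(range(1, n + 1)):
        # sigma[m - 1] is the image of m under the permutation.
        if all(gcd(m, sigma[m - 1]) == 1 for m in range(1, n + 1)):
            count += 1
    return count


# Exhaustive search is only feasible for tiny n (n! permutations are checked).
counts = [count_coprime_permutations(n) for n in range(1, 6)]
print(counts)
```

For $n = 4$ the count is 4: the even numbers 2 and 4 must map to the odd numbers 1 and 3 and vice versa, and every such assignment is coprime.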

preprint | 2022 | arXiv

Optimal minimization of the covariance loss

Let $X$ be a random vector valued in $\mathbb{R}^{m}$ such that $\|X\|_{2} \le 1$ almost surely. For every $k\ge 3$, we show that there exists a sigma algebra $\mathcal{F}$ generated by a partition of $\mathbb{R}^{m}$ into $k$ sets such that \[\|\operatorname{Cov}(X) - \operatorname{Cov}(\mathbb{E}[X\mid\mathcal{F}]) \|_{\mathrm{F}} \lesssim \frac{1}{\sqrt{\log{k}}}.\] This is optimal up to the implicit constant and improves on a previous bound due to Boedihardjo, Strohmer, and Vershynin. Our proof provides an efficient algorithm for constructing $\mathcal{F}$ and leads to improved accuracy guarantees for $k$-anonymous or differentially private synthetic data. We also establish a connection between the above problem of minimizing the covariance loss and the pinning lemma from statistical physics, providing an alternate (and much simpler) algorithmic proof in the important case when $X \in \{\pm 1\}^m/\sqrt{m}$ almost surely.

preprint | 2022 | arXiv

Spencer's theorem in nearly input-sparsity time

A celebrated theorem of Spencer states that for every set system $S_1,\dots, S_m \subseteq [n]$, there is a coloring of the ground set with $\{\pm 1\}$ with discrepancy $O(\sqrt{n\log(m/n+2)})$. We provide an algorithm to find such a coloring in near input-sparsity time $\tilde{O}(n+\sum_{i=1}^{m}|S_i|)$. A key ingredient in our work, which may be of independent interest, is a novel width reduction technique for solving linear programs, not of covering/packing type, in near input-sparsity time using the multiplicative weights update method.
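As a small illustration of the quantity being bounded, the sketch below computes the discrepancy of a given $\{\pm 1\}$ coloring, that is, the largest absolute imbalance over the sets of the system. It is not the paper's algorithm; the helper name `discrepancy` and the toy set system are assumptions made for the example.

```python
def discrepancy(sets, coloring):
    """Return the maximum over sets S of |sum of coloring[j] for j in S|."""
    return max((abs(sum(coloring[j] for j in s)) for s in sets), default=0)


# Toy set system on the ground set {0, 1, 2, 3} (illustrative only).
sets = [{0, 1}, {1, 2}, {0, 1, 2, 3}]
alternating = [1, -1, 1, -1]
constant = [1, 1, 1, 1]

print(discrepancy(sets, alternating))  # every set is perfectly balanced
print(discrepancy(sets, constant))     # the full set has imbalance 4
```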

preprint | 2022 | arXiv

Substructures in Latin squares

We prove several results about substructures in Latin squares. First, we explain how to adapt our recent work on high-girth Steiner triple systems to the setting of Latin squares, resolving a conjecture of Linial that there exist Latin squares with arbitrarily high girth. As a consequence, we see that the number of order-$n$ Latin squares with no intercalate (i.e., no $2\times2$ Latin subsquare) is at least $(e^{-9/4}n-o(n))^{n^{2}}$. Equivalently, $\mathbb{P}\left[\mathbf{N}=0\right]\ge e^{-n^{2}/4-o(n^{2})}=e^{-(1+o(1))\mathbb{E}\mathbf{N}}$, where $\mathbf{N}$ is the number of intercalates in a uniformly random order-$n$ Latin square. In fact, extending recent work of Kwan, Sah, and Sawhney, we resolve the general large-deviation problem for intercalates in random Latin squares, up to constant factors in the exponent: for any constant $0<δ\le1$ we have $\mathbb{P}[\mathbf{N}\le(1-δ)\mathbb{E}\mathbf{N}]=\exp(-Θ(n^{2}))$ and for any constant $δ>0$ we have $\mathbb{P}[\mathbf{N}\ge(1+δ)\mathbb{E}\mathbf{N}]=\exp(-Θ(n^{4/3}\log n))$. Finally, as an application of some new general tools for studying substructures in random Latin squares, we show that in almost all order-$n$ Latin squares, the number of cuboctahedra (i.e., the number of pairs of possibly degenerate $2\times2$ submatrices with the same arrangement of symbols) is of order $n^{4}$, which is the minimum possible. As observed by Gowers and Long, this number can be interpreted as measuring ``how associative'' the quasigroup associated with the Latin square is.
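The notion of an intercalate can be checked directly on small examples. The following sketch counts $2\times2$ Latin subsquares in a Latin square given as a list of rows; the function name and the two example squares (the addition tables of $\mathbb{Z}/5\mathbb{Z}$ and of $(\mathbb{Z}/2\mathbb{Z})^2$) are illustrative assumptions, not taken from the paper.

```python
from itertools import combinations


def count_intercalates(square):
    """Count 2x2 Latin subsquares: row pairs and column pairs whose four
    cells carry two symbols arranged as a Latin square of order 2."""
    n = len(square)
    total = 0
    for r1, r2 in combinations(range(n), 2):
        for c1, c2 in combinations(range(n), 2):
            if (square[r1][c1] == square[r2][c2]
                    and square[r1][c2] == square[r2][c1]):
                total += 1
    return total


# Addition table of Z/5Z: odd order cyclic tables contain no intercalates.
cyclic5 = [[(r + c) % 5 for c in range(5)] for r in range(5)]
# Addition table of (Z/2Z)^2, written with XOR: rich in intercalates.
klein4 = [[r ^ c for c in range(4)] for r in range(4)]

print(count_intercalates(cyclic5), count_intercalates(klein4))
```

In the cyclic table, an intercalate would force $2(c_1 - c_2) \equiv 0 \pmod 5$, which is impossible for distinct columns, so the count is zero.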

preprint | 2022 | arXiv

Threshold for Steiner triple systems

We prove that with high probability $\mathbb{G}^{(3)}(n,n^{-1+o(1)})$ contains a spanning Steiner triple system for $n\equiv 1,3\pmod{6}$, establishing the exponent for the threshold probability for existence of a Steiner triple system. We also prove the analogous theorem for Latin squares. Our result follows from a novel bootstrapping scheme that utilizes iterative absorption as well as the connection between thresholds and fractional expectation-thresholds established by Frankston, Kahn, Narayanan, and Park.
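For readers unfamiliar with the object, a Steiner triple system on $[n]$ is a family of triples covering every pair of points exactly once. The sketch below verifies this property and tests it on the Fano plane ($n = 7 \equiv 1 \pmod 6$); the function name and the labeling of the Fano plane are assumptions chosen for illustration.

```python
from collections import Counter
from itertools import combinations


def is_steiner_triple_system(n, triples):
    """Check that every pair from {1, ..., n} lies in exactly one triple."""
    cover = Counter()
    for triple in triples:
        points = set(triple)
        if len(points) != 3 or not all(1 <= x <= n for x in points):
            return False
        for pair in combinations(sorted(points), 2):
            cover[pair] += 1
    return all(cover[pair] == 1 for pair in combinations(range(1, n + 1), 2))


# One standard labeling of the Fano plane.
fano = [(1, 2, 3), (1, 4, 5), (1, 6, 7), (2, 4, 6),
        (2, 5, 7), (3, 4, 7), (3, 5, 6)]

print(is_steiner_triple_system(7, fano))
print(is_steiner_triple_system(7, fano[:-1]))  # a missing triple breaks it
```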

preprint | 2021 | arXiv

Popular differences for matrix patterns

The following combinatorial conjecture arises naturally from recent ergodic-theoretic work of Ackelsberg, Bergelson, and Best. Let $M_1$, $M_2$ be $k\times k$ integer matrices, $G$ be a finite abelian group of order $N$, and $A\subseteq G^k$ with $|A|\ge αN^k$. If $M_1$, $M_2$, $M_1-M_2$, and $M_1+M_2$ are automorphisms of $G^k$, is it true that there exists a popular difference $d \in G^k\setminus\{0\}$ such that \[\#\{x \in G^k: x, x+M_1d, x+M_2d, x+(M_1+M_2)d \in A\} \ge (α^4-o(1))N^k?\] We show that this conjecture is false in general, but holds for $G = \mathbb{F}_p^n$ with $p$ an odd prime given the additional spectral condition that no pair of eigenvalues of $M_1M_2^{-1}$ (over $\overline{\mathbb{F}}_p$) are negatives of each other. In particular, the "rotated squares" pattern does not satisfy this eigenvalue condition, and we give a construction of a set of positive density in $(\mathbb{F}_5^n)^2$ for which that pattern has no nonzero popular difference. This is in surprising contrast to three-point patterns, which we handle over all compact abelian groups and which do not require an additional spectral condition.

preprint | 2020 | arXiv

A reverse Sidorenko inequality

Let $H$ be a graph allowing loops as well as vertex and edge weights. We prove that, for every triangle-free graph $G$ without isolated vertices, the weighted number of graph homomorphisms $\hom(G, H)$ satisfies the inequality \[ \hom(G, H ) \le \prod_{uv \in E(G)} \hom(K_{d_u,d_v}, H )^{1/(d_ud_v)}, \] where $d_u$ denotes the degree of vertex $u$ in $G$. In particular, one has \[ \hom(G, H )^{1/|E(G)|} \le \hom(K_{d,d}, H )^{1/d^2} \] for every $d$-regular triangle-free $G$. The triangle-free hypothesis on $G$ is best possible. More generally, we prove a graphical Brascamp-Lieb type inequality, where every edge of $G$ is assigned some two-variable function. These inequalities imply tight upper bounds on the partition function of various statistical models such as the Ising and Potts models, which includes independent sets and graph colorings. For graph colorings, corresponding to $H = K_q$, we show that the triangle-free hypothesis on $G$ may be dropped; this is also valid if some of the vertices of $K_q$ are looped. A corollary is that among $d$-regular graphs, $G = K_{d,d}$ maximizes the quantity $c_q(G)^{1/|V(G)|}$ for every $q$ and $d$, where $c_q(G)$ counts proper $q$-colorings of $G$. Finally, we show that if the edge-weight matrix of $H$ is positive semidefinite, then \[ \hom(G, H) \le \prod_{v \in V(G)} \hom(K_{d_v+1}, H )^{1/(d_v+1)}. \] This implies that among $d$-regular graphs, $G = K_{d+1}$ maximizes $\hom(G, H)^{1/|V(G)|}$. For 2-spin Ising models, our results give a complete characterization of extremal graphs: complete bipartite graphs maximize the partition function of 2-spin antiferromagnetic models and cliques maximize the partition function of ferromagnetic models. These results settle a number of conjectures by Galvin-Tetali, Galvin, and Cohen-Csikvári-Perkins-Tetali, and provide an alternate proof to a conjecture by Kahn.
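The regular case of the inequality can be sanity-checked numerically on a small example. The sketch below counts homomorphisms by brute force for unweighted $H$ (a special case of the weighted setting), taking $G = C_6$, which is 2-regular and triangle-free, and comparing against $K_{2,2} = C_4$. The helper names and the choice of target graphs (the independent-set graph and $K_3$) are assumptions made for illustration.

```python
from itertools import product


def hom_count(g_vertices, g_edges, h_adj):
    """Count maps V(G) -> V(H) sending every edge of G to an edge or loop of H."""
    total = 0
    for image in product(range(len(h_adj)), repeat=len(g_vertices)):
        phi = dict(zip(g_vertices, image))
        if all(h_adj[phi[u]][phi[v]] for u, v in g_edges):
            total += 1
    return total


def cycle(n):
    """Vertex list and edge list of the cycle C_n."""
    return list(range(n)), [(i, (i + 1) % n) for i in range(n)]


# H_IND: vertex 0 looped, vertex 1 unlooped; homomorphisms = independent sets.
H_IND = [[1, 1], [1, 0]]
# K3: homomorphisms = proper 3-colorings.
K3 = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]

results = {}
for name, h in [("independent sets", H_IND), ("3-colorings", K3)]:
    hom_g = hom_count(*cycle(6), h)   # G = C_6, |E(G)| = 6, d = 2
    hom_k = hom_count(*cycle(4), h)   # K_{2,2} = C_4, d^2 = 4
    results[name] = (hom_g, hom_k, hom_g ** (1 / 6) <= hom_k ** (1 / 4))
    print(name, results[name])
```

Both checks compare $\hom(C_6, H)^{1/6}$ with $\hom(C_4, H)^{1/4}$, matching the displayed inequality with $d = 2$.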

preprint | 2020 | arXiv

Character Values of Stanley Sequences

Stanley and Odlyzko proposed a method for greedily constructing sets with no 3-term arithmetic progressions. It is conjectured that such sequences exhibit a dichotomy: some have a periodic structure, in that the sequence satisfies certain recurrence relations, while others appear to be chaotic. One large class of sequences with this periodic behavior is the class of independent sequences, which have two parameters: a character and a growth factor. Rolnick conjectured that all but a finite set of integers can be achieved as characters of independent sequences. Previously, the only large class of integers known to be characters were those with base-3 representations consisting solely of the digits 0 and 2. This paper dramatically improves this result by demonstrating that all even integers not congruent to 244 mod 486 can be achieved as characters, thereby demonstrating that the set of all characters has positive lower density.
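The greedy construction mentioned at the start of the abstract can be sketched as follows: starting from a finite set free of 3-term arithmetic progressions, repeatedly append the smallest larger integer that creates no such progression. The function name and the starting set $\{0, 1\}$ are illustrative assumptions; the sketch does not compute characters or growth factors.

```python
def stanley_sequence(initial, length):
    """Greedily extend a 3-AP-free starting set to the requested length."""
    seq = sorted(initial)
    members = set(seq)
    candidate = seq[-1] + 1
    while len(seq) < length:
        # The candidate would be the largest term, so any new progression
        # has the form (2*b - candidate, b, candidate) with b already present.
        if not any((2 * b - candidate) in members for b in seq):
            seq.append(candidate)
            members.add(candidate)
        candidate += 1
    return seq


terms = stanley_sequence([0, 1], 9)
print(terms)
```

Starting from $\{0, 1\}$, the greedy rule yields exactly the nonnegative integers whose base-3 expansion avoids the digit 2.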

preprint | 2020 | arXiv

Discrepancy Minimization via a Self-Balancing Walk

We study discrepancy minimization for vectors in $\mathbb{R}^n$ under various settings. The main result is the analysis of a new simple random process in multiple dimensions through a comparison argument. As corollaries, we obtain bounds which are tight up to logarithmic factors for several problems in online vector balancing posed by Bansal, Jiang, Singla, and Sinha (STOC 2020), as well as linear time algorithms for logarithmic bounds for the Komlós conjecture.

preprint | 2020 | arXiv

Fast and memory-optimal dimension reduction using Kac's walk

In this work, we analyze dimension reduction algorithms based on the Kac walk and discrete variants. (1) For $n$ points in $\mathbb{R}^{d}$, we design an optimal Johnson-Lindenstrauss (JL) transform based on the Kac walk which can be applied to any vector in time $O(d\log{d})$ for essentially the same restriction on $n$ as in the best-known transforms due to Ailon and Liberty [SODA, 2008], and Bamberger and Krahmer [arXiv, 2017]. Our algorithm is memory-optimal, and outperforms existing algorithms in regimes when $n$ is sufficiently large and the distortion parameter is sufficiently small. In particular, this confirms a conjecture of Ailon and Chazelle [STOC, 2006] in a stronger form. (2) The same construction gives a simple transform with optimal Restricted Isometry Property (RIP) which can be applied in time $O(d\log{d})$ for essentially the same range of sparsity as in the best-known such transform due to Ailon and Rauhut [Discrete Comput. Geom., 2014]. (3) We show that by fixing the angle in the Kac walk to be $π/4$ throughout, one obtains optimal JL and RIP transforms with almost the same running time, thereby confirming -- up to a $\log\log{d}$ factor -- a conjecture of Avron, Maymounkov, and Toledo [SIAM J. Sci. Comput., 2010]. Our moment-based analysis of this modification of the Kac walk may also be of independent interest.
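The abstract does not spell out the walk, so the sketch below uses the standard definition of the Kac walk as an assumption: at each step, pick two distinct coordinates at random and rotate the vector in that plane by a random angle. Passing `fixed_angle=math.pi / 4` mimics the fixed-angle variant mentioned in item (3). This shows only the walk itself, not the paper's transforms or their analysis; all names are hypothetical.

```python
import math
import random


def kac_walk(vector, steps, rng, fixed_angle=None):
    """Apply `steps` random planar rotations to a copy of `vector`."""
    x = list(vector)
    d = len(x)
    for _ in range(steps):
        i, j = rng.sample(range(d), 2)  # two distinct coordinates
        theta = fixed_angle if fixed_angle is not None else rng.uniform(0.0, 2.0 * math.pi)
        c, s = math.cos(theta), math.sin(theta)
        # Rotate the (i, j) coordinate plane; the right side uses old values.
        x[i], x[j] = c * x[i] - s * x[j], s * x[i] + c * x[j]
    return x


def norm(v):
    return math.sqrt(sum(t * t for t in v))


start = [1.0] + [0.0] * 7
mixed = kac_walk(start, 200, random.Random(0))
print(norm(start), norm(mixed))
```

Each step is an orthogonal map touching only two coordinates, so the Euclidean norm is preserved up to floating-point error and the walk needs no storage beyond the vector itself.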

preprint | 2020 | arXiv

Local limit theorems for subgraph counts

We introduce a general framework for studying anticoncentration and local limit theorems for random variables, including graph statistics. Our methods involve an interplay between Fourier analysis, decoupling, hypercontractivity of Boolean functions, and transference between ``fixed-size'' and ``independent'' models. We also adapt a notion of ``graph factors'' due to Janson. As a consequence, we derive a local central limit theorem for connected subgraph counts in the Erdős-Rényi random graph $G(n,p)$, building on work of Gilmer and Kopparty and of Berkowitz. These results improve an anticoncentration result of Fox, Kwan, and Sauermann and partially answer a question of Fox, Kwan, and Sauermann. We also derive a local central limit theorem for induced subgraph counts, as long as $p$ is bounded away from a set of ``problematic'' densities, partially answering a question of Fox, Kwan, and Sauermann. We then prove these restrictions are necessary by exhibiting a disconnected graph for which anticoncentration for subgraph counts at the optimal scale fails for all constant $p$, and finding a graph $H$ for which anticoncentration for induced subgraph counts fails in $G(n,1/2)$. These counterexamples resolve anticoncentration conjectures of Fox, Kwan, and Sauermann in the negative. Finally, we also examine the behavior of counts of $k$-term arithmetic progressions in subsets of $\mathbb{Z}/n\mathbb{Z}$ and deduce a local limit theorem wherein the behavior is Gaussian at a global scale but has nontrivial local oscillations (according to a Ramanujan theta function). These results improve on results of and answer questions of the authors and Berkowitz, and answer a question of Fox, Kwan, and Sauermann.

preprint | 2020 | arXiv

Number of arithmetic progressions in dense random subsets of $\mathbb{Z}/n\mathbb{Z}$

We examine the behavior of the number of $k$-term arithmetic progressions in a random subset of $\mathbb{Z}/n\mathbb{Z}$. We prove that if a set is chosen by including each element of $\mathbb{Z}/n\mathbb{Z}$ independently with constant probability $p$, then the resulting distribution of $k$-term arithmetic progressions in that set, while obeying a central limit theorem, does not obey a local central limit theorem. The methods involve decomposing the random variable into homogeneous degree $d$ polynomials with respect to the Walsh/Fourier basis. Proving a suitable multivariate central limit theorem for each component of the expansion gives the desired result.
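The random variable under study is easy to simulate. The sketch below samples a subset of $\mathbb{Z}/n\mathbb{Z}$ by including each element independently with probability $p$ and counts its $k$-term arithmetic progressions. The counting convention (ordered pairs $(a, d)$ with $d \ne 0$, so a progression and its reversal are counted separately) is an assumption of this example and may differ from the paper's normalization.

```python
import random


def count_k_aps(subset, n, k):
    """Count pairs (a, d), d != 0, with a, a+d, ..., a+(k-1)d all in subset mod n."""
    members = set(subset)
    total = 0
    for a in range(n):
        for d in range(1, n):
            if all((a + i * d) % n in members for i in range(k)):
                total += 1
    return total


def random_subset(n, p, rng):
    """Include each element of Z/nZ independently with probability p."""
    return {x for x in range(n) if rng.random() < p}


rng = random.Random(0)
samples = [count_k_aps(random_subset(31, 0.5, rng), 31, 3) for _ in range(5)]
print(samples)
```

With this convention the full set $\mathbb{Z}/7\mathbb{Z}$ contains $7 \cdot 6 = 42$ three-term progressions, while $\{0, 1, 2\}$ contains only two: $(0,1,2)$ and its reversal.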

preprint | 2020 | arXiv

On the real Davies' conjecture

We show that every matrix $A \in \mathbb{R}^{n\times n}$ is at least $δ\|A\|$-close to a real matrix $A+E \in \mathbb{R}^{n\times n}$ whose eigenvectors have condition number at most $\tilde{O}_{n}(δ^{-1})$. In fact, we prove that, with high probability, taking $E$ to be a sufficiently small multiple of an i.i.d. real sub-Gaussian matrix of bounded density suffices. This essentially confirms a speculation of Davies, and of Banks, Kulkarni, Mukherjee, and Srivastava, who recently proved such a result for i.i.d. complex Gaussian matrices. Along the way, we also prove non-asymptotic estimates on the minimum possible distance between any two eigenvalues of a random matrix whose entries have arbitrary means; this part of our paper may be of independent interest.

preprint | 2020 | arXiv

On the smoothed analysis of the smallest singular value with discrete noise

Let $A$ be an $n\times n$ real matrix, and let $M$ be an $n\times n$ random matrix whose entries are i.i.d. sub-Gaussian random variables with mean $0$ and variance $1$. We make two contributions to the study of $s_n(A+M)$, the smallest singular value of $A+M$. (1) We show that for all $ε\geq 0$, $$\mathbb{P}[s_n(A + M) \leq ε] = O(ε\sqrt{n}) + 2e^{-Ω(n)},$$ provided only that $A$ has $Ω(n)$ singular values which are $O(\sqrt{n})$. This extends a well-known result of Rudelson and Vershynin, which requires all singular values of $A$ to be $O(\sqrt{n})$. (2) We show that any bound of the form $$\sup_{\|{A}\|\leq n^{C_1}}\mathbb{P}[s_n(A+M)\leq n^{-C_3}] \leq n^{-C_2}$$ must have $C_3 = Ω(C_1 \sqrt{C_2})$. This complements a result of Tao and Vu, who proved such a bound with $C_3 = O(C_1C_2 + C_1 + 1)$, and counters their speculation of possibly taking $C_3 = O(C_1 + C_2)$.

preprint | 2020 | arXiv

Perfectly Sampling $k\geq (8/3 +o(1))Δ$-Colorings in Graphs

We present a randomized algorithm which takes as input an undirected graph $G$ on $n$ vertices with maximum degree $Δ$, and a number of colors $k \geq (8/3 + o_Δ(1))Δ$, and returns -- in expected time $\tilde{O}(nΔ^{2}\log{k})$ -- a proper $k$-coloring of $G$ distributed perfectly uniformly on the set of all proper $k$-colorings of $G$. Notably, our sampler breaks the barrier at $k = 3Δ$ encountered in recent work of Bhandari and Chakraborty [STOC 2020]. We also sketch how to modify our methods to relax the restriction on $k$ to $k \geq (8/3 - ε_0)Δ$ for an absolute constant $ε_0 > 0$. As in the work of Bhandari and Chakraborty, and the pioneering work of Huber [STOC 1998], our sampler is based on Coupling from the Past [Propp & Wilson, Random Struct. Algorithms, 1995] and the bounding chain method [Huber, STOC 1998; Häggström & Nelander, Scand. J. Statist., 1999]. Our innovations include a novel bounding chain routine inspired by Jerrum's analysis of the Glauber dynamics [Random Struct. Algorithms, 1995], as well as a preconditioning routine for bounding chains which uses the algorithmic Lovász Local Lemma [Moser & Tardos, J. ACM, 2010].

preprint | 2020 | arXiv

The smallest singular value of dense random regular digraphs

Let $A$ be the adjacency matrix of a uniformly random $d$-regular digraph on $n$ vertices, and suppose that $\min(d,n-d)\geq λn$. We show that for any $κ\geq 0$, \[\mathbb{P}[s_n(A)\leq κ]\leq C_λ κ\sqrt{n}+2e^{-c_λ n}.\] Up to the constants $C_λ, c_λ > 0$, our bound matches optimal bounds for $n\times n$ random matrices, each of whose entries is an i.i.d. $\text{Ber}(d/n)$ random variable. The special case $κ = 0$ of our result confirms a conjecture of Cook regarding the probability of singularity of dense random regular digraphs.

preprint | 2019 | arXiv

Triforce and Corners

May the $\mathit{triforce}$ be the 3-uniform hypergraph on six vertices with edges $\{123',12'3,1'23\}$. We show that the minimum triforce density in a 3-uniform hypergraph of edge density $δ$ is $δ^{4-o(1)}$ but not $O(δ^4)$. Let $M(δ)$ be the maximum number such that the following holds: for every $ε > 0$ and $G = \mathbb{F}_2^n$ with $n$ sufficiently large, if $A \subseteq G \times G$ with $|A| \ge δ|G|^2$, then there exists a nonzero "popular difference" $d \in G$ such that the number of "corners" $(x,y), (x+d,y), (x,y+d) \in A$ is at least $(M(δ) - ε)|G|^2$. As a corollary via a recent result of Mandache, we conclude that $M(δ) = δ^{4-o(1)}$ and $M(δ) = ω(δ^4)$. On the other hand, for $0 < δ < 1/2$ and sufficiently large $N$, there exists $A \subseteq [N]^3$ with $|A|\ge δN^3$ such that for every $d \ne 0$, the number of corners $(x,y,z), (x+d,y,z),(x,y+d,z),(x,y,z+d) \in A$ is at most $δ^{c \log (1/δ)} N^3$. A similar bound holds in higher dimensions, or for any configuration with at least 5 points or affine dimension at least 3.