Researcher profile

Kunal Dutta

Kunal Dutta contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Constructive l2-Discrepancy Minimization with Additive Deviations

The \emph{signed series} problem in the $\ell_2$ norm asks, given set of vectors $v_1,\ldots,v_n\in \mathbf{R}^d$ having at most unit $\ell_2$ norm, does there always exist a series $(\varepsilon_i)_{i\in [n]}$ of $\pm 1$ signs such that for all $i\in [n]$, $\max_{i\in [n]} \|\sum_{j=1}^i \varepsilon_i v_i\|_2 = O(\sqrt{d})$. A result of Banaszczyk [2012, \emph{Rand. Struct. Alg.}] states that there exist signs $\varepsilon_i\in \{-1,1\},\; i\in [n]$ such that $\max_{i\in [n]} \|\sum_{j=1}^i \varepsilon_i v_i\|_2 = O(\sqrt{d+\log n})$. The best constructive bound known so far is of $O(\sqrt{d\log n})$, by Bansal and Garg [2017, \emph{STOC.}, 2019, \emph{SIAM J. Comput.}]. We give a polynomial-time randomized algorithm to find signs $x(i) \in \{-1,1\},\; i\in [n]$ such that \[ \max_{i\in [n]} \|\sum_{j=1}^i x(i)v_i\|_2 = O(\sqrt{d + \log^2 n}) = O(\sqrt{d}+\log n).\] By the constructive reduction of Harvey and Samadi [\emph{COLT}, 2014], this also yields a constructive bound of $O(\sqrt{d}+\log n)$ for the Steinitz problem in the $\ell_2$-norm. Thus, we algorithmically achieve Banaszczyk's bounds for both problems when $d \geq \log^2n$, which also matches the conjectured bounds. Our algorithm is based on the framework on Bansal and Garg, together with a new analysis involving $(i)$ additional linear and spectral orthogonality constraints during the construction of the covariance matrix of the random walk steps, which allow us to control the quadratic variation in the linear as well as the quadratic components of the discrepancy increment vector, alongwith $(ii)$ a ``Freedman-like" version of the Hanson-Wright concentration inequality, for filtration-dependent sums of subgaussian chaoses.

preprint2023arXiv

Dimensionality Reduction for Persistent Homology with Gaussian Kernels

Computing persistent homology using Gaussian kernels is useful in the domains of topological data analysis and machine learning as shown by Phillips, Wang and Zheng [SoCG 2015]. However, contrary to the case of computing persistent homology using the Euclidean distance or even the $k$-distance, it is not known how to compute the persistent homology of high dimensional data using Gaussian kernels. In this paper, we consider a power distance version of the Gaussian kernel distance (GKPD) given by Phillips, Wang and Zheng, and show that the persistent homology of the Čech filtration of $P$ computed using the GKPD is approximately preserved. For datasets in $d$-dimensional Euclidean space, under a relative error bound of $\varepsilon \in [0,1]$, we obtain a dimensionality of $(i)$ $O(\varepsilon^{-2}\log^2 n)$ for $n$-point datasets and $(ii)$ $O(D\varepsilon^{-2}\log (Dr/\varepsilon))$ for datasets having diameter $r$ (up to a scaling factor). We use two main ingredients. The first one is a new decomposition of the squared radii of Čech simplices using the kernel power distance, in terms of the pairwise GKPDs between the vertices, which we state and prove. The second one is the Random Fourier Features (RFF) map of Rahimi and Recht [NeurIPS 2007], as used by Chen and Phillips [ALT 2017].

preprint2023arXiv

Sparse Geometric Set Systems and the Beck-Fiala Conjecture

We investigate the combinatorial discrepancy of geometric set systems having bounded shallow cell complexity in the \emph{Beck-Fiala} setting, where each point belongs to at most $t$ ranges. For set systems with shallow cell complexity $ψ(m,k)=g(m)k^{c}$, where $(i)$ $g(m) = o(m^{\varepsilon})$ for any $\varepsilon\in (0,1],$ $(ii)$ $ψ$ is non-decreasing in $m$, and $(iii)$ $c>0$ is independent of $m$ and $k$, we get a discrepancy bound of \[ O\left(\sqrt{\left(\log n+\left(t^{c}g(n)\right)^{\frac{1}{1+c}}\right)\log n}\right).\] For $t=ω(\log^2 n)$, in several cases, such as for set systems of points and half-planes / disks / pseudo-disks in $\mathbb{R}^2$, points and orthants in $\mathbb{R}^3$ etc., these bounds are $o(\sqrt{t})$, which verifies (and improves upon) the conjectured bound of Beck and Fiala~\emph{(Disc. Appl. Math., 1981)}. Our bounds are obtained by showing the existence of \emph{matchings with low crossing number}, using the multiplicative weights update method of Welzl \emph{(SoCG, 1988)}, together with the recent bound of Mustafa \emph{(Disc. Comp. Geom., 2015)} on \emph{shallow packings} of set systems in terms of their shallow cell complexity. For set systems of shallow cell complexity $ψ(m,k)=m^{c_1}g(m)k^{c}$, we obtain matchings with crossing number at most \[ O\left(\left(n^{c_1}g(n)t^{c}\right)^{\frac{1}{1+c_1+c}}\right).\] These are of independent interest.

preprint2023arXiv

Strong Collapse of Random Simplicial Complexes

The \emph{strong collapse} of a simplicial complex, proposed by Barmak and Minian (\emph{Disc. Comp. Geom. 2012}), is a combinatorial collapse of a complex onto its sub-complex. Recently, it has received attention from computational topology researchers, owing to its empirically observed usefulness in simplification and size-reduction of the size of simplicial complexes while preserving the homotopy class. We consider the strong collapse process on random simplicial complexes. For the Erdős-Rényi random clique complex $X(n,c/n)$ on $n$ vertices with edge probability $c/n$ with $c>1$, we show that after any maximal sequence of strong collapses the remaining subcomplex, or \emph{core} must have $(1-γ)(1-cγ) n+o(n)$ vertices asymptotically almost surely (a.a.s.), where $γ$ is the least non-negative fixed point of the function $f(x) = \exp\left(-c(1-x)\right)$ in the range $(0,1)$. These are the first theoretical results proved for strong collapses on random (or non-random) simplicial complexes.

preprint2022arXiv

On Shallow Packings and Tusnády's Problem

Tusnády's problem asks to bound the discrepancy of points and axis-parallel boxes in $\mathbb{R}^d$. Algorithmic bounds on Tusnády's problem use a canonical decomposition of Matoušek for the system of points and axis-parallel boxes, together with other techniques like partial coloring and / or random-walk based methods. We use the notion of \emph{shallow cell complexity} and the \emph{shallow packing lemma}, together with the chaining technique, to obtain an improved decomposition of the set system. Coupled with an algorithmic technique of Bansal and Garg for discrepancy minimization, which we also slightly extend, this yields improved algorithmic bounds on Tusnády's problem. For $d\geq 5$, our bound matches the lower bound of $Ω(\log^{d-1}n)$ given by Matoušek, Nikolov and Talwar [IMRN, 2020] -- settling Tusnády's problem, upto constant factors. For $d=2,3,4$, we obtain improved algorithmic bounds of $O(\log^{7/4}n)$, $O(\log^{5/2}n)$ and $O(\log^{13/4}n)$ respectively, which match or improve upon the non-constructive bounds of Nikolov for $d\geq 3$. Further, we also give improved bounds for the discrepancy of set systems of points and polytopes in $\mathbb{R}^d$ generated via translations of a fixed set of hyperplanes. As an application, we also get a bound for the geometric discrepancy of anchored boxes in $\mathbb{R}^d$ with respect to an arbitrary measure, matching the upper bound for the Lebesgue measure, which improves on a result of Aistleitner, Bilyk, and Nikolov [MC and QMC methods, \emph{Springer, Proc. Math. Stat.}, 2018] for $d\geq 4$.

preprint2020arXiv

On Limit Constants in Last Passage Percolation in Transitive Tournaments

We investigate the \emph{last passage percolation} problem on transitive tournaments, in the case when the edge weights are independent Bernoulli random variables. Given a transitive tournament on $n$ nodes with random weights on its edges, the last passage percolation problem seeks to find the weight $X_n$ of the heaviest path, where the weight of a path is the sum of the weights on its edges. We give a recurrence relation and use it to obtain a (bivariate) generating function for the probability generating function of $X_n$. This also gives exact combinatorial expressions for $\mathbb{E}[X_n]$, which was stated as an open problem by Yuster [\emph{Disc. Appl. Math.}, 2017]. We further determine scaling constants in the limit laws for $X_n$. Define $β_{tr}(p) := \lim_{n\to \infty} \frac{\mathbb{E}[X_n]}{n-1}$. Using singularity analysis, we show \[ β_{tr}(p) = \left(\sum_{n\geq 1}(1-p)^{n\choose 2}\right)^{-1}. \] In particular, $β_{tr}(0.5) = \left(\sum_{n\geq 1} 2^{-{n\choose 2}}\right)^{-1} = 0.60914971106...$. This settles the question of determining the value of $β_{tr}(0.5)$, initiated by Yuster. $β_{tr}(p)$ is also the limiting value in the strong law of large numbers for $X_n$, given by Foss, Martin, and Schmidt [\emph{Ann. Appl. Probab.}, 2014]. We also derive the scaling constants in the functional central limit theorem for $X_n$ proved by Foss et al.