Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
24works
0followers
18topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

24 published item(s)

preprint2026arXiv

Learning Multinomial Logits in $O(n \log n)$ time

A Multinomial Logit (MNL) model is composed of a finite universe of items $[n]=\{1,..., n\}$, each assigned a positive weight. A query specifies an admissible subset -- called a slate -- and the model chooses one item from that slate with probability proportional to its weight. This query model is also known as the Plackett-Luce model or conditional sampling oracle in the literature. Although MNLs have been studied extensively, a basic computational question remains open: given query access to slates, how efficiently can we learn weights so that, for every slate, the induced choice distribution is within total variation distance $\varepsilon$ of the ground truth? This question is central to MNL learning and has direct implications for modern recommender system interfaces. We provide two algorithms for this task, one with adaptive queries and one with non-adaptive queries. Each algorithm outputs an MNL $M'$ that induces, for each slate $S$, a distribution $M'_S$ on $S$ that is within $\varepsilon$ total variation distance of the true distribution. Our adaptive algorithm makes $O\left(\frac{n}{\varepsilon^{3}}\log n\right)$ queries, while our non-adaptive algorithm makes $O\left(\frac{n^{2}}{\varepsilon^{3}}\log n \log\frac{n}{\varepsilon}\right)$ queries. Both algorithms query only slates of size two and run in time proportional to their query complexity. We complement these upper bounds with lower bounds of $Ω\left(\frac{n}{\varepsilon^{2}}\log n\right)$ for adaptive queries and $Ω\left(\frac{n^{2}}{\varepsilon^{2}}\log n\right)$ for non-adaptive queries, thus proving that our adaptive algorithm is optimal in its dependence on the support size $n$, while the non-adaptive one is tight within a $\log n$ factor.

preprint2026arXiv

On the LSH Distortion of Ulam and Cayley Similarities

Locality-sensitive hashing (LSH) has found widespread use as a fundamental primitive, particularly to accelerate nearest neighbor search. An LSH scheme for a similarity function $S:\mathcal{X} \times \mathcal{X} \to [0,1]$ is a distribution over hash functions on $\mathcal{X}$ with the property that the probability of collision of any two elements $x,y\in \mathcal{X}$ is exactly equal to $S(x,y)$. However, not all similarity functions admit exact LSH schemes. The notion of LSH distortion measures how multiplicatively close a similarity function is to having an LSH scheme. In this work, we study the LSH distortion of the Ulam and Cayley similarities, which are popular similarity measures on permutations of $n$ elements. We show that the Ulam similarity admits a sublinear LSH distortion of $O(n / \sqrt{\log n})$; we also prove a lower bound of $Ω(n^{0.12})$ on the best LSH distortion achievable. On the other hand, we show that the LSH distortion of the Cayley similarity is $Θ(n)$.

preprint2022arXiv

Algorithms with More Granular Differential Privacy Guarantees

Differential privacy is often applied with a privacy parameter that is larger than the theory suggests is ideal; various informal justifications for tolerating large privacy parameters have been proposed. In this work, we consider partial differential privacy (DP), which allows quantifying the privacy guarantee on a per-attribute basis. In this framework, we study several basic data analysis and learning tasks, and design algorithms whose per-attribute privacy parameter is smaller that the best possible privacy parameter for the entire record of a person (i.e., all the attributes).

preprint2022arXiv

Connect the Dots: Tighter Discrete Approximations of Privacy Loss Distributions

The privacy loss distribution (PLD) provides a tight characterization of the privacy loss of a mechanism in the context of differential privacy (DP). Recent work has shown that PLD-based accounting allows for tighter $(\varepsilon, δ)$-DP guarantees for many popular mechanisms compared to other known methods. A key question in PLD-based accounting is how to approximate any (potentially continuous) PLD with a PLD over any specified discrete support. We present a novel approach to this problem. Our approach supports both pessimistic estimation, which overestimates the hockey-stick divergence (i.e., $δ$) for any value of $\varepsilon$, and optimistic estimation, which underestimates the hockey-stick divergence. Moreover, we show that our pessimistic estimate is the best possible among all pessimistic estimates. Experimental evaluation shows that our approach can work with much larger discretization intervals while keeping a similar error bound compared to previous approaches and yet give a better approximation than existing methods.

preprint2022arXiv

Decay Processes in Cationic Alkali Metals in Microsolvated Clusters: A Complex Absorbing Potential Based Equation-of-Motion Coupled Cluster Investigation

We have employed the highly accurate complex absorbing potential based ionization potential equation-of-motion coupled cluster singles and doubles (CAP-IP-EOM-CCSD) method to study the various intermolecular decay processes in ionized metals (Li$^{+}$, Na$^{+}$, K$^{+}$) microsolvated by water molecules. For the Li atom, the electron is ionized from the 1s subshell. However, for Na and K atoms, the electron is ionized from 2s and both 2s and 2p subshells, respectively. We have investigated decay processes for the Li$^{+}$-(H$_{2}$O)$_{n}$; (n=1-3) systems as well as Na$^{+}$-(H$_{2}$O)$_{n}$; (n=1,2), and K$^{+}$-H$_{2}$O. The Lithium cation in water can decay only via electron transfer mediated decay (ETMD) as there are no valence electrons in Lithium. We have investigated how the various decay processes change in the presence of different alkali metal atoms and how the increasing number of water molecules play a significant role in the decay of microsolvated systems. To see the effect of the environment, we have studied the Li$^{+}$-NH$_{3}$ (in comparison to Li$^{+}$-H$_{2}$O). In the case of Na$^{+}$-H$_{2}$O, we have studied the impact of bond distance on the decay width. The effect of polarization on decay width is checked for the X$^{+}$-H$_{2}$O; X=Li, Na. We have used the PCM model to study the polarization effect. We have compared our results with the existing theoretical and experimental results wherever available in the literature.

preprint2022arXiv

Differentially Private All-Pairs Shortest Path Distances: Improved Algorithms and Lower Bounds

We study the problem of releasing the weights of all-pair shortest paths in a weighted undirected graph with differential privacy (DP). In this setting, the underlying graph is fixed and two graphs are neighbors if their edge weights differ by at most $1$ in the $\ell_1$-distance. We give an $ε$-DP algorithm with additive error $\tilde{O}(n^{2/3} / ε)$ and an $(ε, δ)$-DP algorithm with additive error $\tilde{O}(\sqrt{n} / ε)$ where $n$ denotes the number of vertices. This positively answers a question of Sealfon (PODS'16), who asked whether a $o(n)$-error algorithm exists. We also show that an additive error of $Ω(n^{1/6})$ is necessary for any sufficiently small $ε, δ> 0$. Finally, we consider a relaxed setting where a multiplicative approximation is allowed. We show that, with a multiplicative approximation factor $k$, %$2k - 1$, the additive error can be reduced to $\tilde{O}\left(n^{1/2 + O(1/k)} / ε\right)$ in the $ε$-DP case and $\tilde{O}(n^{1/3 + O(1/k)} / ε)$ in the $(ε, δ)$-DP case, respectively.

preprint2022arXiv

Effect of Protonation and Deprotonation on Electron Transfer Mediated Decay and Interatomic Coulombic Decay

Electronically excited atoms or molecules in an environment are often subject to interatomic/intermolecular Coulombic decay (ICD) and/or electron transfer mediated decay (ETMD) mechanisms. A few of the numerous variables that can impact these non-radiative decay mechanisms include bond distance, the number of nearby atoms or molecules, and the polarisation effect. In this paper, we have studied the effect of protonation and deprotonation on the ionization potential (IP), double ionization potential (DIP), and lifetime (or decay width) of the temporary bound state in these non-radiative decay processes. We have chosen LiH-NH$_3$ and LiH-H$_2$O as test systems. The equation of motion coupled cluster singles and doubles method augmented by complex absorbing potential (CAP-EOM-CCSD) has been used in calculating the energetic position of the decaying state and the system's decay rate. Deprotonation of LiH-NH$_3$/LiH-H$_2$O either from the metal center (LiH) or from ammonia/water lowers the IP and DIP compared to the neutral systems. In contrast, protonation increases these quantities compared to neutral systems. The protonation closes the inner valence state relaxation channels for ICD/ETMD. For example, the decay of the O-2s/N-2s state stops in protonated systems (LiH$_2^+$-H$_2$O, LiH$_2^+$-NH$_3$, and LiH-NH$_4^+$). Our study also shows that the efficiency, i.e., the rate of ICD/ETMD, can be altered by protonation and deprotonation. It is expected to have implications for chemical and biological systems

preprint2022arXiv

Enhancement to Training of Bidirectional GAN : An Approach to Demystify Tax Fraud

Outlier detection is a challenging activity. Several machine learning techniques are proposed in the literature for outlier detection. In this article, we propose a new training approach for bidirectional GAN (BiGAN) to detect outliers. To validate the proposed approach, we train a BiGAN with the proposed training approach to detect taxpayers, who are manipulating their tax returns. For each taxpayer, we derive six correlation parameters and three ratio parameters from tax returns submitted by him/her. We train a BiGAN with the proposed training approach on this nine-dimensional derived ground-truth data set. Next, we generate the latent representation of this data set using the $encoder$ (encode this data set using the $encoder$) and regenerate this data set using the $generator$ (decode back using the $generator$) by giving this latent representation as the input. For each taxpayer, compute the cosine similarity between his/her ground-truth data and regenerated data. Taxpayers with lower cosine similarity measures are potential return manipulators. We applied our method to analyze the iron and steel taxpayers data set provided by the Commercial Taxes Department, Government of Telangana, India.

preprint2022arXiv

Faster Privacy Accounting via Evolving Discretization

We introduce a new algorithm for numerical composition of privacy random variables, useful for computing the accurate differential privacy parameters for composition of mechanisms. Our algorithm achieves a running time and memory usage of $\mathrm{polylog}(k)$ for the task of self-composing a mechanism, from a broad class of mechanisms, $k$ times; this class, e.g., includes the sub-sampled Gaussian mechanism, that appears in the analysis of differentially private stochastic gradient descent. By comparison, recent work by Gopi et al. (NeurIPS 2021) has obtained a running time of $\widetilde{O}(\sqrt{k})$ for the same task. Our approach extends to the case of composing $k$ different mechanisms in the same class, improving upon their running time and memory usage from $\widetilde{O}(k^{1.5})$ to $\widetilde{O}(k)$.

preprint2022arXiv

Geometric quenches in quasi-disordered lattice system

While global quantum quench has been extensively used in the literature to understand the localization-delocalization transition for the one-dimensional quantum spin chain, the effect of geometric quench (which corresponds to a sudden change of the geometry of the chain) in the context of such transitions is yet to be well understood. In this work, we investigate the effect of geometric quench in the Aubry-Andre model, which supports localization-delocalization transition even in one dimension. We study the spreading of the entanglement and the site-occupation with time and find many interesting features that can be used to characterize localization-delocalization transition. We observe that geometric quench causes a power-law type growth of the entanglement entropy in the delocalized phase in contrast to the linear growth which is found in the global quench studies. Remarkably, we also find that the saturation values in the Many-body localized (MBL) phase obey Area law in contrast to the usual volume law which is a signature feature of the MBL phase in the context of global quench.

preprint2022arXiv

Parsimonious Learning-Augmented Caching

Learning-augmented algorithms -- in which, traditional algorithms are augmented with machine-learned predictions -- have emerged as a framework to go beyond worst-case analysis. The overarching goal is to design algorithms that perform near-optimally when the predictions are accurate yet retain certain worst-case guarantees irrespective of the accuracy of the predictions. This framework has been successfully applied to online problems such as caching where the predictions can be used to alleviate uncertainties. In this paper we introduce and study the setting in which the learning-augmented algorithm can utilize the predictions parsimoniously. We consider the caching problem -- which has been extensively studied in the learning-augmented setting -- and show that one can achieve quantitatively similar results but only using a sublinear number of predictions.

preprint2021arXiv

Fock-space relativistic coupled-cluster calculation of hyperfine induced $\bf {^1S_0 \rightarrow {^3P^o_0}}$ clock transition in Al$^+$

We have developed an all-particle Fock-space relativistic coupled-cluster method to calculate the properties of two-valence atoms and ions. Using the method we compute the properties associated with hyperfine induced $^1S_0 - ^3P^o_0$ clock transition in Al$^+$. Our result of the $^3P^o_0$ metastable state life time, $20.20 \pm 0.91$ s, is in excellent agreement with the experimental value, $20.60 \pm 1.4$ s [Phys. Rev. Lett. {\bf 98}, 220801 (2007)]. Our studies show that the contributions from the triple excitations, and the corrections from the Breit interaction and QED effects are essential to obtain accurate clock properties in Al$^+$.

preprint2021arXiv

Observation of ballistic upstream modes at fractional quantum Hall edges of graphene

The structure of edge modes at the boundary of quantum Hall (QH) phases forms the basis for understanding low energy transport properties. In particular, the presence of ``upstream'' modes, moving against the direction of charge current flow, is critical for the emergence of renormalized modes with exotic quantum statistics. Detection of excess noise at the edge is a smoking gun for the presence of upstream modes. Here we report on noise measurements at the edges of fractional QH (FQH) phases realized in dual graphite-gated bilayer graphene devices. A noiseless dc current is injected at one of the edge contacts, and the noise generated at contacts at $L= 4\,μ$m or $10\,μ$m away along the upstream direction is studied. For integer and particle-like FQH states, no detectable noise is measured. By contrast, for ``hole-conjugate'' FQH states, we detect a strong noise proportional to the injected current, unambiguously proving the existence of upstream modes. The noise magnitude remaining independent of length together with a remarkable agreement with our theoretical analysis demonstrates the ballistic nature of upstream energy transport, quite distinct from the diffusive propagation reported earlier in GaAs-based systems. Our investigation opens the door to the study of upstream transport in more complex geometries and in edges of non-Abelian phases in graphene.

preprint2021arXiv

Performance Dependency of LSTM and NAR Beamformers With Respect to Sensor Array Properties in V2I Scenario

Prediction and nullifying the interference is a challenging problem in vehicle to infrastructure scenarios . The implementation of practical V2I network is limited because of inevitability of interference due to random nature of the wireless channel. The interference introduces angle ambiguity between the road side units mounted base station and user equipment. This paper proposes an adaptive beamforming technique for mitigation of interference in V2I networks, especially in multiuser environment. In this work , Long short term based (LSTM) based deep learning and non linear auto regresive technique based regressor have been employed to predict the angles between the road side units and user equipment .Advance prediction of transmit and receive signals enables reliable vehicle to infrastructure communication. Instead of predicting the beamforming matrix directly, we predict the main features using LSTM for learning dependencies in the input time series ,where complex variables were taken as input states and final beamformed signal was the output. simulation results have confirmed that the proposed LSTM model achieves comparable performance in terms of system throughput when compared with the non linear auto regressive method implemented as an artificial neural network.

preprint2020arXiv

Differentially Private Clustering: Tight Approximation Ratios

We study the task of differentially private clustering. For several basic clustering problems, including Euclidean DensestBall, 1-Cluster, k-means, and k-median, we give efficient differentially private algorithms that achieve essentially the same approximation ratios as those that can be obtained by any non-private algorithm, while incurring only small additive errors. This improves upon existing efficient algorithms that only achieve some large constant approximation factors. Our results also imply an improved algorithm for the Sample and Aggregate privacy framework. Furthermore, we show that one of the tools used in our 1-Cluster algorithm can be employed to get a faster quantum algorithm for ClosestPair in a moderate number of dimensions.

preprint2020arXiv

Fair Correlation Clustering

In this paper, we study correlation clustering under fairness constraints. Fair variants of $k$-median and $k$-center clustering have been studied recently, and approximation algorithms using a notion called fairlet decomposition have been proposed. We obtain approximation algorithms for fair correlation clustering under several important types of fairness constraints. Our results hinge on obtaining a fairlet decomposition for correlation clustering by introducing a novel combinatorial optimization problem. We define a fairlet decomposition with cost similar to the $k$-median cost and this allows us to obtain approximation algorithms for a wide range of fairness constraints. We complement our theoretical results with an in-depth analysis of our algorithms on real graphs where we show that fair solutions to correlation clustering can be obtained with limited increase in cost compared to the state-of-the-art (unfair) algorithms.

preprint2020arXiv

Fair Hierarchical Clustering

As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates over-representation in traditional clustering. In this paper we extend this notion to hierarchical clustering, where the goal is to recursively partition the data to optimize a specific objective. For various natural objectives, we obtain simple, efficient algorithms to find a provably good fair hierarchical clustering. Empirically, we show that our algorithms can find a fair hierarchical clustering, with only a negligible loss in the objective.

preprint2020arXiv

Hall Effect for Dirac Electrons in Graphene Exposed to an Abrikosov Flux Lattice

The proposals for realizing exotic particles through coupling of quantum Hall effect to superconductivity involve spatially non-uniform magnetic fields. As a step toward that goal, we study, both theoretically and experimentally, a system of Dirac electrons exposed to an Abrikosov flux lattice. We theoretically find that non-uniform magnetic field causes a carrier-density dependent reduction of the Hall conductivity. Our studies show that this reduction originates from a rather subtle effect: a levitation of the Berry curvature within Landau levels broadened by the non-uniform magnetic field. Experimentally, we measure the magneto-transport in a monolayer graphene-hexagonal boron nitride - niobium diselenide (NbSe$_2$) heterostructure, and find a density-dependent reduction of the Hall resistivity of graphene as the temperature is lowered from above the superconducting critical temperature of NbSe$_2$, when the magnetic field is uniform, to below, where the magnetic field bunches into an Abrikosov flux lattice.

preprint2020arXiv

Near-tight closure bounds for Littlestone and threshold dimensions

We study closure properties for the Littlestone and threshold dimensions of binary hypothesis classes. Given classes $\mathcal{H}_1, \ldots, \mathcal{H}_k$ of Boolean functions with bounded Littlestone (respectively, threshold) dimension, we establish an upper bound on the Littlestone (respectively, threshold) dimension of the class defined by applying an arbitrary binary aggregation rule to $\mathcal{H}_1, \ldots, \mathcal{H}_k$. We also show that our upper bounds are nearly tight. Our upper bounds give an exponential (in $k$) improvement upon analogous bounds shown by Alon et al. (COLT 2020), thus answering a question posed by their work.

preprint2020arXiv

On Distributed Differential Privacy and Counting Distinct Elements

We study the setup where each of $n$ users holds an element from a discrete set, and the goal is to count the number of distinct elements across all users, under the constraint of $(ε, δ)$-differentially privacy: - In the non-interactive local setting, we prove that the additive error of any protocol is $Ω(n)$ for any constant $ε$ and for any $δ$ inverse polynomial in $n$. - In the single-message shuffle setting, we prove a lower bound of $Ω(n)$ on the error for any constant $ε$ and for some $δ$ inverse quasi-polynomial in $n$. We do so by building on the moment-matching method from the literature on distribution estimation. - In the multi-message shuffle setting, we give a protocol with at most one message per user in expectation and with an error of $\tilde{O}(\sqrt(n))$ for any constant $ε$ and for any $δ$ inverse polynomial in $n$. Our protocol is also robustly shuffle private, and our error of $\sqrt(n)$ matches a known lower bound for such protocols. Our proof technique relies on a new notion, that we call dominated protocols, and which can also be used to obtain the first non-trivial lower bounds against multi-message shuffle protocols for the well-studied problems of selection and learning parity. Our first lower bound for estimating the number of distinct elements provides the first $ω(\sqrt(n))$ separation between global sensitivity and error in local differential privacy, thus answering an open question of Vadhan (2017). We also provide a simple construction that gives $\tildeΩ(n)$ separation between global sensitivity and error in two-party differential privacy, thereby answering an open question of McGregor et al. (2011).

preprint2020arXiv

On the Power of Multiple Anonymous Messages

An exciting new development in differential privacy is the shuffled model, in which an anonymous channel enables non-interactive, differentially private protocols with error much smaller than what is possible in the local model, while relying on weaker trust assumptions than in the central model. In this paper, we study basic counting problems in the shuffled model and establish separations between the error that can be achieved in the single-message shuffled model and in the shuffled model with multiple messages per user. For the problem of frequency estimation for $n$ users and a domain of size $B$, we obtain: - A nearly tight lower bound of $\tildeΩ( \min(\sqrt[4]{n}, \sqrt{B}))$ on the error in the single-message shuffled model. This implies that the protocols obtained from the amplification via shuffling work of Erlingsson et al. (SODA 2019) and Balle et al. (Crypto 2019) are essentially optimal for single-message protocols. A key ingredient in the proof is a lower bound on the error of locally-private frequency estimation in the low-privacy (aka high $ε$) regime. - Protocols in the multi-message shuffled model with $poly(\log{B}, \log{n})$ bits of communication per user and $poly\log{B}$ error, which provide an exponential improvement on the error compared to what is possible with single-message algorithms. For the related selection problem on a domain of size $B$, we prove: - A nearly tight lower bound of $Ω(B)$ on the number of users in the single-message shuffled model. This significantly improves on the $Ω(B^{1/17})$ lower bound obtained by Cheu et al. (Eurocrypt 2019), and when combined with their $\tilde{O}(\sqrt{B})$-error multi-message protocol, implies the first separation between single-message and multi-message protocols for this problem.

preprint2020arXiv

Pure Differentially Private Summation from Anonymous Messages

The shuffled (aka anonymous) model has recently generated significant interest as a candidate distributed privacy framework with trust assumptions better than the central model but with achievable errors smaller than the local model. We study pure differentially private (DP) protocols in the shuffled model for summation, a basic and widely used primitive: - For binary summation where each of n users holds a bit as an input, we give a pure $ε$-DP protocol for estimating the number of ones held by the users up to an error of $O_ε(1)$, and each user sends $O_ε(\log n)$ messages each of 1 bit. This is the first pure protocol in the shuffled model with error $o(\sqrt{n})$ for constant $ε$. Using this protocol, we give a pure $ε$-DP protocol that performs summation of real numbers in $[0, 1]$ up to an error of $O_ε(1)$, and where each user sends $O_ε(\log^3 n)$ messages each of $O(\log\log n)$ bits. - In contrast, we show that for any pure $ε$-DP protocol for binary summation in the shuffled model having absolute error $n^{0.5-Ω(1)}$, the per user communication has to be at least $Ω_ε(\sqrt{\log n})$ bits. This implies the first separation between the (bounded-communication) multi-message shuffled model and the central model, and the first separation between pure and approximate DP protocols in the shuffled model. To prove our lower bound, we consider (a generalization of) the following question: given $γ$ in $(0, 1)$, what is the smallest m for which there are two random variables $X^0, X^1$ supported on $\{0, \dots ,m\}$ such that (i) the total variation distance between $X^0$ and $X^1$ is at least $1-γ$, and (ii) the moment generating functions of $X^0$ and $X^1$ are within a constant factor of each other everywhere? We show that the answer is $m = Θ(\sqrt{\log(1/γ)})$.

preprint2019arXiv

Electric dipole polarizability of group-IIIA ions using PRCC: Large correlation effects from nonlinear terms

We compute the ground-state electric dipole polarizability of group-IIIA ions using the perturbed relativistic coupled-cluster (PRCC) theory. To account for the relativistic effects and QED corrections, we use the Dirac-Coulomb-Breit Hamiltonian with the corrections from the Uehling potential and the self-energy. The effects of triple excitations are considered perturbatively in the PRCC. Our PRCC results for $α$ are good in agreement with the previous theoretical results for all the ions. From our computations we find that the nonlinear terms in PRCC have significant contributions and must be included to obtain the accurate value of $α$ for group-IIIA ions. For the correction from the Breit interaction, we find that it is largest for Al$^+$ and decreases as we go towards the heavier ions. The corrections from the vacuum polarization and the self-energy increase from lighter to heavier ions.