Source author record

Shubham Gupta

Shubham Gupta appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Artificial Intelligence math.FA Computation and Language Information Retrieval math.NT Multiagent Systems Cryptography and Security Data Structures and Algorithms Distributed, Parallel, and Cluster Computing Emerging Technologies Human-Computer Interaction Information Theory math.IT math.SP Methodology Social and Information Networks

Catalog footprint

What is connected

19works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

A Transfer Framework for Enhancing Temporal Graph Learning in Data-Scarce Settings

Dynamic interactions between entities are prevalent in domains like social platforms, financial systems, healthcare, and e-commerce. These interactions can be effectively represented as time-evolving graphs, where predicting future connections is a key task in applications such as recommendation systems. Temporal Graph Neural Networks (TGNNs) have achieved strong results for such predictive tasks but typically require extensive training data, which is often limited in real-world scenarios. One approach to mitigating data scarcity is leveraging pre-trained models from related datasets. However, direct knowledge transfer between TGNNs is challenging due to their reliance on node-specific memory structures, making them inherently difficult to adapt across datasets. To address this, we introduce a novel transfer approach that disentangles node representations from their associated features through a structured bipartite encoding mechanism. This decoupling enables more effective transfer of memory components and other learned inductive patterns from one dataset to another. Empirical evaluations on real-world benchmarks demonstrate that our method significantly enhances TGNN performance in low-data regimes, outperforming non-transfer baselines by up to 56\% and surpassing existing transfer strategies by 36\%

preprint2023arXiv

Hardy and Rellich inequality on lattices

In this paper, we study the asymptotic behaviour of the sharp constant in discrete Hardy and Rellich inequality on the lattice $\mathbb{Z}^d$ as $d \rightarrow \infty$. In the process, we proved some Hardy-type inequalities for the operators $Δ^m$ and $\nabla(Δ^m)$ for non-negative integers $m$ on a $d$ dimensional torus. It turns out that the sharp constant in discrete Hardy and Rellich inequality grows as $d$ and $d^2$ respectively as $ d \rightarrow \infty$.

preprint2022arXiv

A Survey on Temporal Graph Representation Learning and Generative Modeling

Temporal graphs represent the dynamic relationships among entities and occur in many real life application like social networks, e commerce, communication, road networks, biological systems, and many more. They necessitate research beyond the work related to static graphs in terms of their generative modeling and representation learning. In this survey, we comprehensively review the neural time dependent graph representation learning and generative modeling approaches proposed in recent times for handling temporal graphs. Finally, we identify the weaknesses of existing approaches and discuss the research proposal of our recently published paper TIGGER[24].

preprint2022arXiv

Dependency Structure for News Document Summarization

In this work, we develop a neural network based model which leverages dependency parsing to capture cross-positional dependencies and grammatical structures. With the help of linguistic signals, sentence-level relations can be correctly captured, thus improving news documents summarization performance. Empirical studies demonstrate that this simple but effective method outperforms existing works on the benchmark dataset. Extensive analyses examine different settings and configurations of the proposed model which provide a good reference to the community.

preprint2022arXiv

Differentiable Rule Induction with Learned Relational Features

Rule-based decision models are attractive due to their interpretability. However, existing rule induction methods often result in long and consequently less interpretable rule models. This problem can often be attributed to the lack of appropriately expressive vocabulary, i.e., relevant predicates used as literals in the decision model. Most existing rule induction algorithms presume pre-defined literals, naturally decoupling the definition of the literals from the rule learning phase. In contrast, we propose the Relational Rule Network (R2N), a neural architecture that learns literals that represent a linear relationship among numerical input features along with the rules that use them. This approach opens the door to increasing the expressiveness of induced decision models by coupling literal learning directly with rule learning in an end-to-end differentiable fashion. On benchmark tasks, we show that these learned literals are simple enough to retain interpretability, yet improve prediction accuracy and provide sets of rules that are more concise compared to state-of-the-art rule induction algorithms.

preprint2022arXiv

Diophantine triples with the property $D(n)$ for distinct $n$

We prove that for every integer $n$, there exist infinitely many $D(n)$-triples which are also $D(t)$-triples for $t\in\mathbb{Z}$ with $n\ne t$. We also prove that there are infinitely many triples with the property $D(-1)$ in $\mathbb{Z}[i]$ which are also $D(n)$-triple in $\mathbb{Z}[i]$ for two distinct $n$'s other than $n = -1$ and these triples are not equivalent to any triple with the property $D(1)$.

preprint2022arXiv

Discrete weighted Hardy Inequality in 1-D

In this paper we consider a weighted version of one dimensional discrete Hardy's Inequality on half-line with power weights of the form $n^α$. Namely we consider: \begin{equation} \sum_{n=1}^\infty |u(n)-u(n-1)|^2 n^α\geq c(α) \sum_{n=1}^\infty \frac{|u(n)|^2}{n^2}n^α\end{equation} We prove the above inequality when $α\in [0,1) \cup [5,\infty)$ with the sharp constant $c(α)$. Furthermore when $α\in [1/3,1) \cup \{0\}$ we prove an improved version of the above inequality. More precisely we prove \begin{equation} \sum_{n=1}^\infty |u(n)-u(n-1)|^2 n^α\geq c(α) \sum_{n=1}^\infty \frac{|u(n)|^2}{n^2} n^α+ \sum_{k=3}^\infty b_k(α) \sum_{n=2}^\infty \frac{|u(n)|^2}{n^k}n^α. \end{equation} for non-negative constants $b_k(α)$.

preprint2022arXiv

On consistency of constrained spectral clustering under representation-aware stochastic block model

Spectral clustering is widely used in practice due to its flexibility, computational efficiency, and well-understood theoretical performance guarantees. Recently, spectral clustering has been studied to find balanced clusters under population-level constraints. These constraints are specified by additional information available in the form of auxiliary categorical node attributes. In this paper, we consider a scenario where these attributes may not be observable, but manifest as latent features of an auxiliary graph. Motivated by this, we study constrained spectral clustering with the aim of finding balanced clusters in a given \textit{similarity graph} $\mathcal{G}$, such that each individual is adequately represented with respect to an auxiliary graph $\mathcal{R}$ (we refer to this as representation graph). We propose an individual-level balancing constraint that formalizes this idea. Our work leads to an interesting stochastic block model that not only plants the given partitions in $\mathcal{G}$ but also plants the auxiliary information encoded in the representation graph $\mathcal{R}$. We develop unnormalized and normalized variants of spectral clustering in this setting. These algorithms use $\mathcal{R}$ to find clusters in $\mathcal{G}$ that approximately satisfy the proposed constraint. We also establish the first statistical consistency result for constrained spectral clustering under individual-level constraints for graphs sampled from the above-mentioned variant of the stochastic block model. Our experimental results corroborate our theoretical findings.

preprint2022arXiv

Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits

We study the problem of \emph{dynamic regret minimization} in $K$-armed Dueling Bandits under non-stationary or time varying preferences. This is an online learning setup where the agent chooses a pair of items at each round and observes only a relative binary `win-loss' feedback for this pair, sampled from an underlying preference matrix at that round. We first study the problem of static-regret minimization for adversarial preference sequences and design an efficient algorithm with $O(\sqrt{KT})$ high probability regret. We next use similar algorithmic ideas to propose an efficient and provably optimal algorithm for dynamic-regret minimization under two notions of non-stationarities. In particular, we establish $\tO(\sqrt{SKT})$ and $\tO({V_T^{1/3}K^{1/3}T^{2/3}})$ dynamic-regret guarantees, $S$ being the total number of `effective-switches' in the underlying preference relations and $V_T$ being a measure of `continuous-variation' non-stationarity. The complexity of these problems have not been studied prior to this work despite the practicability of non-stationary environments in real world systems. We justify the optimality of our algorithms by proving matching lower bound guarantees under both the above-mentioned notions of non-stationarities. Finally, we corroborate our results with extensive simulations and compare the efficacy of our algorithms over state-of-the-art baselines.

preprint2022arXiv

Symmetrization inequalities on one-dimensional integer lattice

In this paper, we develop a theory of symmetrization on the one dimensional integer lattice. More precisely, we associate a radially decreasing function $u^*$ with a function $u$ defined on the integers and prove the corresponding Polya-Szegö inequality. Along the way we also prove the weighted Polya-Szegö inequality for the decreasing rearrangement on the half-line, i.e., non-negative integers. As a consequence, we prove the discrete weighted Hardy's inequality with the weight $n^α$ for $1 < α\leq 2$.

preprint2022arXiv

TIGGER: Scalable Generative Modelling for Temporal Interaction Graphs

There has been a recent surge in learning generative models for graphs. While impressive progress has been made on static graphs, work on generative modeling of temporal graphs is at a nascent stage with significant scope for improvement. First, existing generative models do not scale with either the time horizon or the number of nodes. Second, existing techniques are transductive in nature and thus do not facilitate knowledge transfer. Finally, due to relying on one-to-one node mapping from source to the generated graph, existing models leak node identity information and do not allow up-scaling/down-scaling the source graph size. In this paper, we bridge these gaps with a novel generative model called TIGGER. TIGGER derives its power through a combination of temporal point processes with auto-regressive modeling enabling both transductive and inductive variants. Through extensive experiments on real datasets, we establish TIGGER generates graphs of superior fidelity, while also being up to 3 orders of magnitude faster than the state-of-the-art.

preprint2020arXiv

A Large-Scale Deep Architecture for Personalized Grocery Basket Recommendations

With growing consumer adoption of online grocery shopping through platforms such as Amazon Fresh, Instacart, and Walmart Grocery, there is a pressing business need to provide relevant recommendations throughout the customer journey. In this paper, we introduce a production within-basket grocery recommendation system, RTT2Vec, which generates real-time personalized product recommendations to supplement the user's current grocery basket. We conduct extensive offline evaluation of our system and demonstrate a 9.4% uplift in prediction metrics over baseline state-of-the-art within-basket recommendation models. We also propose an approximate inference technique 11.6x times faster than exact inference approaches. In production, our system has resulted in an increase in average basket size, improved product discovery, and enabled faster user check-out

preprint2020arXiv

Certain Diophantine Tuples in Imaginary Quadratic Fields

Let $K$ be an imaginary quadratic field and $ \mathcal{O}_K$ be its ring of integers. A set $\{a_1, a_2, \cdots,a_m\} \subset \mathcal{O}_K\setminus\{0\}$ is called a Diophantine $m$-tuple in $\mathcal{O}_K$ with $D(-1)$ if $a_ia_j -1 = x_{ij}^2$, where $x_{ij} \in \mathcal{O}_K$ for all $i,j$ such that $1 \leq i < j \leq m$. Here we prove the non-existence of Diophantine $m$-tuples in $\mathcal{O}_K$ with $D(-1)$ for $m > 36$.

preprint2020arXiv

Networked Multi-Agent Reinforcement Learning with Emergent Communication

Multi-Agent Reinforcement Learning (MARL) methods find optimal policies for agents that operate in the presence of other learning agents. Central to achieving this is how the agents coordinate. One way to coordinate is by learning to communicate with each other. Can the agents develop a language while learning to perform a common task? In this paper, we formulate and study a MARL problem where cooperative agents are connected to each other via a fixed underlying network. These agents can communicate along the edges of this network by exchanging discrete symbols. However, the semantics of these symbols are not predefined and, during training, the agents are required to develop a language that helps them in accomplishing their goals. We propose a method for training these agents using emergent communication. We demonstrate the applicability of the proposed framework by applying it to the problem of managing traffic controllers, where we achieve state-of-the-art performance as compared to a number of strong baselines. More importantly, we perform a detailed analysis of the emergent communication to show, for instance, that the developed language is grounded and demonstrate its relationship with the underlying network topology. To the best of our knowledge, this is the only work that performs an in depth analysis of emergent communication in a networked MARL setting while being applicable to a broad class of problems.

preprint2020arXiv

Winning an Election: On Emergent Strategic Communication in Multi-Agent Networks

Humans use language to collectively execute abstract strategies besides using it as a referential tool for identifying physical entities. Recently, multiple attempts at replicating the process of emergence of language in artificial agents have been made. While existing approaches study emergent languages as referential tools, in this paper, we study their role in discovering and implementing strategies. We formulate the problem using a voting game where two candidate agents contest in an election with the goal of convincing population members (other agents), that are connected to each other via an underlying network, to vote for them. To achieve this goal, agents are only allowed to exchange messages in the form of sequences of discrete symbols to spread their propaganda. We use neural networks with Gumbel-Softmax relaxation for sampling categorical random variables to parameterize the policies followed by all agents. Using our proposed framework, we provide concrete answers to the following questions: (i) Do the agents learn to communicate in a meaningful way and does the emergent communication play a role in deciding the winner? (ii) Does the system evolve as expected under various reward structures? (iii) How is the emergent language affected by the community structure in the network? To the best of our knowledge, we are the first to explore emergence of communication for discovering and implementing strategies in a setting where agents communicate over a network.

preprint2020arXiv

WorkerRep: Immutable Reputation System For Crowdsourcing Platform Based on Blockchain

Crowdsourcing is a process wherein an individual or an organisation utilizes the talent pool present over the Internet to accomplish their task. The existing crowdsourcing platforms and their reputation computation are centralised and hence prone to various attacks or malicious manipulation of the data by the central entity. A few distributed crowdsourcing platforms have been proposed but they lack a robust reputation mechanism. So we propose a decentralised crowdsourcing platform having an immutable reputation mechanism to tackle these problems. It is built on top of Ethereum network and does not require the user to trust a third party for a non malicious experience. It also utilizes IOTAs consensus mechanism which reduces the cost for task evaluation significantly.

preprint2015arXiv

Synthesis of Sequential Reversible Circuits through Finite State Machine

Reversible computing has attracted the attention of researchers due to its low power consumption and less heat dissipation compared to conventional computing. A number of reversible gates have been proposed by different researchers and various combinational circuits based on reversible gates have been developed. However the realization of sequential circuit in reversible logic is still at premature stage. Sequential circuits were not available because of feedback was not allowed in reversible circuit. However allowing feedback in space (not in time), some sequential reversible gates and circuits have been reported in the literature. In this dissertation, we have addressed the problem from two sides. One side is to propose a low cost reversible gate suitable for sequential building block i.e. T flip-flop and hence designing low cost synchronous and asynchronous counters. Another side is to generate the circuit from its behavioral description described in FSM form. Our propose designs of reversible counters are significantly better in optimization parameters such as gate counts, garbage outputs and constant inputs available in literature. We have also proposed a procedure for obtaining reversible circuit from behavioral description through FSM. A very few attempts have been reported in the literature for the conversion FSM to reversible FSM.Because of non-availability of generated sequential reversible circuit in literature, our results cannot be compared with any other circuits. We expect that the sequential reversible circuits will help in debugging the reversible circuits, handling the ambiguous state of an FSM and generating the original input in reverse direction by reversing the original output.

preprint2014arXiv

Approximation algorithms for Capacitated Facility Location Problem with Penalties

In this paper, we address the problem of capacitated facility location problem with penalties (CapFLPP) paid per unit of unserved demand. In case of uncapacitated FLP with penalties demands of a client are either entirely met or are entirely rejected and penalty is paid. In the uncapacitated case, there is no reason to serve a client partially. Whereas, in case of CapFLPP, it may be beneficial to serve a client partially instead of not serving at all and, pay the penalty for the unmet demand. Charikar et. al. \cite{charikar2001algorithms}, Jain et. al. \cite{jain2003greedy} and Xu- Xu \cite{xu2009improved} gave $3$, $2$ and $1.8526$ approximation, respectively, for the uncapacitated case . We present $(5.83 + ε)$ factor for the case of uniform capacities and $(8.532 + ε)$ factor for non-uniform capacities.

preprint2013arXiv

CPU and/or GPU: Revisiting the GPU Vs. CPU Myth

Parallel computing using accelerators has gained widespread research attention in the past few years. In particular, using GPUs for general purpose computing has brought forth several success stories with respect to time taken, cost, power, and other metrics. However, accelerator based computing has signifi- cantly relegated the role of CPUs in computation. As CPUs evolve and also offer matching computational resources, it is important to also include CPUs in the computation. We call this the hybrid computing model. Indeed, most computer systems of the present age offer a degree of heterogeneity and therefore such a model is quite natural. We reevaluate the claim of a recent paper by Lee et al.(ISCA 2010). We argue that the right question arising out of Lee et al. (ISCA 2010) should be how to use a CPU+GPU platform efficiently, instead of whether one should use a CPU or a GPU exclusively. To this end, we experiment with a set of 13 diverse workloads ranging from databases, image processing, sparse matrix kernels, and graphs. We experiment with two different hybrid platforms: one consisting of a 6-core Intel i7-980X CPU and an NVidia Tesla T10 GPU, and another consisting of an Intel E7400 dual core CPU with an NVidia GT520 GPU. On both these platforms, we show that hybrid solutions offer good advantage over CPU or GPU alone solutions. On both these platforms, we also show that our solutions are 90% resource efficient on average. Our work therefore suggests that hybrid computing can offer tremendous advantages at not only research-scale platforms but also the more realistic scale systems with significant performance gains and resource efficiency to the large scale user community.

Shubham Gupta

What is connected

Connect this record

See the researcher in context

Building this map preview

19 published item(s)

A Transfer Framework for Enhancing Temporal Graph Learning in Data-Scarce Settings

Hardy and Rellich inequality on lattices

A Survey on Temporal Graph Representation Learning and Generative Modeling

Dependency Structure for News Document Summarization

Differentiable Rule Induction with Learned Relational Features

Diophantine triples with the property $D(n)$ for distinct $n$

Discrete weighted Hardy Inequality in 1-D

On consistency of constrained spectral clustering under representation-aware stochastic block model

Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits

Symmetrization inequalities on one-dimensional integer lattice

TIGGER: Scalable Generative Modelling for Temporal Interaction Graphs

A Large-Scale Deep Architecture for Personalized Grocery Basket Recommendations

Certain Diophantine Tuples in Imaginary Quadratic Fields

Networked Multi-Agent Reinforcement Learning with Emergent Communication

Winning an Election: On Emergent Strategic Communication in Multi-Agent Networks

WorkerRep: Immutable Reputation System For Crowdsourcing Platform Based on Blockchain

Synthesis of Sequential Reversible Circuits through Finite State Machine

Approximation algorithms for Capacitated Facility Location Problem with Penalties

CPU and/or GPU: Revisiting the GPU Vs. CPU Myth