Researcher profile

Sudipta Mondal

Sudipta Mondal contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

RL post-training of frontier language models is increasingly bottlenecked by autoregressive rollout generation, making rollout acceleration a central systems challenge. Many existing efficiency methods improve throughput by changing the rollout or optimization regime, for example, through off-policy execution, replay, or lower-precision generation. We study speculative decoding as a lossless acceleration primitive for RL rollouts that preserves the target model's output distribution. We implement speculative decoding in NeMo-RL with a vLLM backend, supporting both synchronous and asynchronous pipelines and enabling speculation during RL rollouts. This benefit is realizable across speculation mechanisms, such as pretrained MTP heads, small external draft models or even techniques such as Eagle3, which are traditionally applied after RL phase. This yields a deployment path for state-of-the-art speculative decoding inside RL training. In a reasoning post-training workload at 8B scale under synchronous RL, speculative decoding improves rollout throughput by 1.8x. Using a high-fidelity performance simulator, we project that combining speculative decoding with asynchronous RL yields up to 2.5x end-to-end training speedup at 235B scale.

preprint2026arXiv

Classical capacities under physical constraints: More capacity with less entanglement

Current advancements in communication equipment demand the investigation of classical information transfer over quantum channels, by encompassing realistic scenarios in finite dimensions. To address this issue, we develop a framework for analyzing classical capacities of quantum channels where the set of states used for encoding information is restricted based on various physical properties. Specifically, we provide expressions for the classical capacities of noiseless and noisy quantum channels when the average energy of the encoded ensemble or the energy of each of the constituent states in the ensemble is bounded. In the case of qubit energy-preserving dephasing channels, we demonstrate that a nonuniform probability distribution based on the energy constraint maximizes capacity, while we derive the compact form of the capacity for equiprobable messages. We suggest an energy-constrained dense coding (DC) protocol that we prove to be optimal in the two-qubit situation and obtain a closed-form expression for the DC capacity. Additionally, we demonstrate a no-go result, which states that when the dimension of the sender and the receiver is two, no energy-preserving operation can offer any quantum advantage for energy-constrained entanglement-assisted capacity. We exhibit that, in the energy-constrained situation, classical-quantum noisy channels can show improved capabilities under entanglement assistance, a phenomenon that is unattainable in the unrestricted scenario.

preprint2026arXiv

Hierarchies among genuine multipartite entangling capabilities of quantum gates

We classify quantum gates according to their capability to generate genuine multipartite entanglement (GME), using a hierarchy based on multipartite separable states. In particular, when a fixed unitary operator acts on the set of k-separable states, the maximal genuine multipartite entanglement content produced via that particular unitary operator is determined after maximizing over the set of k-separable input states. We identify unitary operators that are beneficial for generating high GME when the input states are entangled in some bipartition, although the picture can also be reversed, where such initial entanglement offers no advantage. We investigate the maximum entangling power of a broad range of unitary operators, encompassing special classes of quantum gates, as well as diagonal, permutation, and Haar-uniformly generated unitaries by computing generalized geometric measure (GGM) as a GME quantifier. Additionally, we observe a notable distinction in entangling power based on the nature of the input states: when maximization is restricted to separable states with real coefficients, the entangling power is lower than when the optimization is carried out over arbitrary separable states with complex coefficients, thereby highlighting the role of complex amplitudes in entanglement creation. Furthermore, we determine which unitary operators, along with their corresponding optimal inputs, yield output states with the highest achievable GGM.