Researcher profile

Uday V. Shanbhag

Uday V. Shanbhag contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
1topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2022arXiv

Complexity guarantees for an implicit smoothing-enabled method for stochastic MPECs

Stochastic MPECs have found increasing relevance for modeling a broad range of settings in engineering and statistics. Yet, there seem to be no efficient first/zeroth-order schemes equipped with non-asymptotic rate guarantees for resolving even deterministic variants of such problems. We consider SMPECs where the parametrized lower-level equilibrium problem is given by a deterministic/stochastic VI problem whose mapping is strongly monotone. We develop a zeroth-order implicit algorithmic framework by leveraging a locally randomized spherical smoothing scheme. We present schemes for single-stage and two-stage stochastic MPECs when the upper-level problem is either convex or nonconvex. (I). Single-stage SMPECs: In convex regimes, our proposed inexact schemes are characterized by a complexity in upper-level projections, upper-level samples, and lower-level projections of $\mathcal{O}(\tfrac{1}{ε^2})$, $\mathcal{O}(\tfrac{1}{ε^2})$, and $\mathcal{O}(\tfrac{1}{ε^2}\ln(\tfrac{1}ε))$ , respectively. Analogous bounds for the nonconvex regime are $\mathcal{O}(\tfrac{1}ε)$, $\mathcal{O}(\tfrac{1}{ε^2})$, and $\mathcal{O}(\tfrac{1}{ε^3})$, respectively . (II). Two-stage SMPECs: In convex regimes, our proposed inexact schemes have a complexity in upper-level projections, upper-level samples, and lower-level projections of $\mathcal{O}(\tfrac{1}{ε^2}),\mathcal{O}(\tfrac{1}{ε^2})$, and $\mathcal{O}(\tfrac{1}{ε^2}\ln(\tfrac{1}ε))$ while the corresponding bounds in the nonconvex regime are $\mathcal{O}(\tfrac{1}ε)$, $\mathcal{O}(\tfrac{1}{ε^2})$, and $\mathcal{O}(\tfrac{1}{ε^2}\ln(\tfrac{1}ε))$ , respectively . In addition, we derive statements for exact as well as accelerated counterparts. We also provide a comprehensive set of numerical results for validating the theoretical findings.

preprint2022arXiv

Probability Maximization via Minkowski Functionals: Convex Representations and Tractable Resolution

In this paper, we consider the maximization of a probability $\mathbb{P}\{ ζ\mid ζ\in \mathbf{K}(\mathbf x)\}$ over a closed and convex set $\mathcal X$, a special case of the chance-constrained optimization problem. We define $\mathbf{K}(\mathbf x)$ as $\mathbf{K}(\mathbf x) \triangleq \{ ζ\in \mathcal{K} \mid c(\mathbf{x},ζ) \geq 0 \}$ where $ζ$ is uniformly distributed on a convex and compact set $\mathcal{K}$ and $c(\mathbf{x},ζ)$ is defined as either {$c(\mathbf{x},ζ) \triangleq 1-|ζ^T\mathbf{x}|^m$, $m\geq 0$} (Setting A) or $c(\mathbf{x},ζ) \triangleq T\mathbf{x} -ζ$ (Setting B). We show that in either setting, $\mathbb{P}\{ ζ\mid ζ\in \mathbf{K(x)}\}$ can be expressed as the expectation of a suitably defined function $F(\mathbf{x},ξ)$ with respect to an appropriately defined Gaussian density (or its variant), i.e. $\mathbb{E}_{\tilde p} [F(\mathbf x,ξ)]$. We then develop a convex representation of the original problem requiring the minimization of ${g(\mathbb{E}[F(\mathbf{x},ξ)])}$ over $\mathcal X$ where $g$ is an appropriately defined smooth convex function. Traditional stochastic approximation schemes cannot contend with the minimization of ${g(\mathbb{E}[F(\cdot,ξ)])}$ over $\mathcal X$, since conditionally unbiased sampled gradients are unavailable. We then develop a regularized variance-reduced stochastic approximation (r-VRSA) scheme that obviates the need for such unbiasedness by combining iterative regularization with variance-reduction. Notably, (r-VRSA) is characterized by both almost-sure convergence guarantees, a convergence rate of $\mathcal{O}(1/k^{1/2-a})$ in expected sub-optimality where $a > 0$, and a sample complexity of $\mathcal{O}(1/ε^{6+δ})$ where $δ> 0$.

preprint2021arXiv

Stochastic Relaxed Inertial Forward-Backward-Forward splitting for Monotone Inclusions in Hilbert spaces

We consider monotone inclusions defined on a Hilbert space where the operator is given by the sum of a maximal monotone operator $T$ and a single-valued monotone, Lipschitz continuous, and expectation-valued operator $V$. We draw motivation from the seminal work by Attouch and Cabot on relaxed inertial methods for monotone inclusions and present a stochastic extension of the relaxed inertial forward-backward-forward (RISFBF) method. Facilitated by an online variance reduction strategy via a mini-batch approach, we show that (RISFBF) produces a sequence that weakly converges to the solution set. Moreover, it is possible to estimate the rate at which the discrete velocity of the stochastic process vanishes. Under strong monotonicity, we demonstrate strong convergence, and give a detailed assessment of the iteration and oracle complexity of the scheme. When the mini-batch is raised at a geometric (polynomial) rate, the rate statement can be strengthened to a linear (suitable polynomial) rate while the oracle complexity of computing an $ε$-solution improves to $O(1/ε)$. Importantly, the latter claim allows for possibly biased oracles, a key theoretical advancement allowing for far broader applicability. By defining a restricted gap function based on the Fitzpatrick function, we prove that the expected gap of an averaged sequence diminishes at a sublinear rate of $O(1/k)$ while the oracle complexity of computing a suitably defined $ε$-solution is $O(1/ε^{1+a})$ where $a>1$. Numerical results on two-stage games and an overlapping group Lasso problem illustrate the advantages of our method compared to stochastic forward-backward-forward (SFBF) and SA schemes.

preprint2020arXiv

Asynchronous Variance-reduced Block Schemes for Composite Nonconvex Stochastic Optimization: Block-specific Steplengths and Adapted Batch-sizes

We consider the minimization of a sum of an expectation-valued coordinate-wise $L_i$-smooth nonconvex function and a nonsmooth block-separable convex regularizer. We propose an asynchronous variance-reduced algorithm, where in each iteration, a single block is randomly chosen to update its estimates by a proximal variable sample-size stochastic gradient scheme, while the remaining blocks are kept invariant. Notably, each block employs a steplength that is in accordance with its block-specific Lipschitz constant while block-specific batch-sizes are random variables updated at a rate that grows either at a geometric or polynomial rate with the (random) number of times that block is selected. We show that every limit point for almost every sample path is a stationary point and establish the ergodic non-asymptotic rate $\mathcal{O}(1/K) $. Iteration and oracle complexity to obtain an $ε$-stationary point are shown to be $\mathcal{O}(1/ε)$ and $\mathcal{O}(1/ε^2)$, respectively. Furthermore, under a $ μ$-proximal Polyak-Łojasiewicz (PL) condition with the batch size increasing at a geometric rate, we prove that the suboptimality diminishes at a {\em geometric} rate, the {\em optimal} deterministic rate while iteration and oracle complexity to obtain an $ε$-optimal solution are proven to be $\mathcal{O}( (L_{\rm max}/μ) \ln(1/ε))$ and $\mathcal{O}\left((L_{\rm ave}/μ) (1/ε)^{1+c} \right)$ with $c\geq 0$, respectively. In pursuit of less aggressive sampling rates, when the batch sizes increase at a polynomial rate of degree $v \geq 1$, suboptimality decays at a corresponding polynomial rate while the iteration and oracle complexity to obtain an $ε-$optimal solution are provably $\mathcal{O} ( v(1/ε)^{1/v})$ and $\mathcal{O} \left(e^v v^{2v+1}(1/ε)^{1+1/v}\right)$, respectively.

preprint2020arXiv

Distributed Variable Sample-Size Gradient-response and Best-response Schemes for Stochastic Nash Equilibrium Problems over Graphs

This paper considers a stochastic Nash game in which each player minimizes an expectation valued composite objective. We make the following contributions. (I) Under suitable monotonicity assumptions on the concatenated gradient map, we derive optimal rate statements and oracle complexity bounds for the proposed variable sample-size proximal stochastic gradient-response (VS-PGR) scheme when the sample-size increases at a geometric rate. If the sample-size increases at a polynomial rate of degree $v > 0$, the mean-squared errordecays at a corresponding polynomial rate while the iteration and oracle complexities to obtain an $ε$-NE are $\mathcal{O}(1/ε^{1/v})$ and $\mathcal{O}(1/ε^{1+1/v})$, respectively. (II) We then overlay (VS-PGR) with a consensus phase with a view towards developing distributed protocols for aggregative stochastic Nash games. In the resulting scheme, when the sample-size and the consensus steps grow at a geometric and linear rate, computing an $ε$-NE requires similar iteration and oracle complexities to (VS-PGR) with a communication complexity of $\mathcal{O}(\ln^2(1/ε))$; (III) Under a suitable contractive property associated with the proximal best-response (BR) map, we design a variable sample-size proximal BR (VS-PBR) scheme, where each player solves a sample-average BR problem. Akin to (I), we also give the rate statements, oracle and iteration complexity bounds. (IV) Akin to (II), the distributed variant achieves similar iteration and oracle complexities to the centralized (VS-PBR) with a communication complexity of $\mathcal{O}(\ln^2(1/ε))$ when the communication rounds per iteration increase at a linear rate. Finally, we present some preliminary numerics to provide empirical support for the rate and complexity statements.