Source author record

Sourabh Bhattacharya

Sourabh Bhattacharya appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory Methodology Applications math.OC Computation Computer Science and Game Theory Systems and Control Information Theory math.IT Robotics astro-ph.GA astro-ph.SR Computational Geometry Cryptography and Security Machine Learning math.PR

Catalog footprint

What is connected

39works

17topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

The Bayesian Reflex: Online Learning as the Autonomic Nervous System of Modern and Future AI

This chapter introduces the Bayesian reflex -- an analogy with the autonomic nervous system -- as a unifying framework for online learning in AI. Bayesian online algorithms automatically maintain equilibrium in dynamic environments via three mechanisms: belief maintenance through probabilistic representations, sequential updating via Bayes' theorem, and uncertainty-driven action balancing exploration and exploitation. We survey online Bayesian methods, highlighting two computational principles: the look-up table principle for sequential inference in function space, and the ellipsoidal decomposition framework for nearly exact i.i.d. sampling from arbitrary posteriors. These principles are generalized across dynamic emulation, nonparametric state-space models, circular time series, inverse regression for climate model evaluation, and deep architectures via Recursive Gaussian Processes. Decision-making is explored via Thompson sampling and restless bandits. We extend the framework to assess infinite series convergence (applied to climate dynamics and the Riemann Hypothesis), model prime number distributions leading to the discovery of 184 strong Mersenne prime candidates, detect stationarity, and characterize point processes. The Bayesian reflex provides a foundational infrastructure for adaptive AI that continuously learns in a complex world.

preprint2022arXiv

Additive Security Games: Structure and Optimization

In this work, we provide a structural characterization of the possible Nash equilibria in the well-studied class of security games with additive utility. Our analysis yields a classification of possible equilibria into seven types and we provide closed-form feasibility conditions for each type as well as closed-form expressions for the expected outcomes to the players at equilibrium. We provide uniqueness and multiplicity results for each type and utilize our structural approach to propose a novel algorithm to compute equilibria of each type when they exist. We then consider the special cases of security games with fully protective resources and zero-sum games. Under the assumption that the defender can perturb the payoffs to the attacker, we study the problem of optimizing the defender expected outcome at equilibrium. We show that this problem is weakly NP- hard in the case of Stackelberg equilibria and multiple attacker resources and present a pseudopolynomial time procedure to solve this problem for the case of Nash equilibria under mild assumptions. Finally, to address non-additive security games, we propose a notion of nearest additive game and demonstrate the existence and uniqueness of a such a nearest additive game for any non-additive game.

preprint2022arXiv

IID Sampling from Posterior Dirichlet Process Mixtures

The influence of Dirichlet process mixture is ubiquitous in the Bayesian nonparametrics literature. But sampling from its posterior distribution remains a challenge, despite the advent of various Markov chain Monte Carlo methods. The primary challenge is the infinite-dimensional setup, and even if the infinite-dimensional random measure is integrated out, high-dimensionality and discreteness still remain difficult issues to deal with. In this article, exploiting the key ideas proposed in Bhattacharya (2021b), we propose a novel methodology for drawing iid realizations from posteriors of Dirichlet process mixtures. We focus in particular on the more general and flexible model of Bhattacharya (2008), so that the methods developed here are simply applicable to the traditional Dirichlet process mixture. We illustrate our ideas on the well-known enzyme, acidity and the galaxy datasets, which are usually considered benchmark datasets for mixture applications. Generating 10, 000 iid realizations from the Dirichlet process mixture posterior of Bhattacharya (2008) given these datasets took 19 minutes, 8 minutes and 5 minutes, respectively, in our parallel implementation.

preprint2020arXiv

A Bayesian Multiple Testing Paradigm for Model Selection in Inverse Regression Problems

In this article, we propose a novel Bayesian multiple testing formulation for model and variable selection in inverse setups, judiciously embedding the idea of inverse reference distributions proposed by Bhattacharya (2013) in a mixture framework consisting of the competing models. We develop the theory and methods in the general context encompassing parametric and nonparametric competing models, dependent data, as well as misspecifications. Our investigation shows that asymptotically the multiple testing procedure almost surely selects the best possible inverse model that minimizes the minimum Kullback-Leibler divergence from the true model. We also show that the error rates, namely, versions of the false discovery rate and the false non-discovery rate converge to zero almost surely as the sample size goes to infinity. Asymptotic α-control of versions of the false discovery rate and its impact on the convergence of false non-discovery rate versions, are also investigated. Our simulation experiments involve small sample based selection among inverse Poisson log regression and inverse geometric logit and probit regression, where the regressions are either linear or based on Gaussian processes. Additionally, variable selection is also considered. Our multiple testing results turn out to be very encouraging in the sense of selecting the best models in all the non-misspecified and misspecified cases.

preprint2020arXiv

A Fully Bayesian Approach to Assessment of Model Adequacy in Inverse Problems

We consider the problem of assessing goodness of fit of a single Bayesian model to the observed data in the inverse problem context. A novel procedure of goodness of fit test is proposed, based on construction of reference distributions using the `inverse' part of the given model. This is motivated by an example from palaeoclimatology in which it is of interest to reconstruct past climates using information obtained from fossils deposited in lake sediment. Technically, given a model $f(Y\mid X,θ)$, where $Y$ is the observed data and $X$ is a set of (non-random) covariates, we obtain reference distributions based on the posterior $π(\tilde X\mid Y)$, where $\tilde X$ must be interpreted as the {\it unobserved} random vector corresponding to the {\it observed} covariates $X$. Put simply, if the posterior distribution $π(\tilde X\mid Y)$ gives high density to the observed covariates $X$, or equivalently, if the posterior distribution of $T(\tilde X)$ gives high density to $T(X)$, where $T$ is any appropriate statistic, then we say that the model fits the data. Otherwise the model in question is not adequate. We provide decision-theoretic justification of our proposed approach and discuss other theoretical and computational advantages. We demonstrate our methodology with many simulated examples and three complex, high-dimensional, realistic palaeoclimate problems, including the motivating palaeoclimate problem.

preprint2020arXiv

A Non-Gaussian, Nonparametric Structure for Gene-Gene and Gene-Environment Interactions in Case-Control Studies Based on Hierarchies of Dirichlet Processes

It is becoming increasingly clear that complex interactions among genes and environmental factors play crucial roles in triggering complex diseases. Thus, understanding such interactions is vital, which is possible only through statistical models that adequately account for such intricate, albeit unknown, dependence structures. Bhattacharya & Bhattacharya (2016b) attempt such modeling, relating finite mixtures composed of Dirichlet processes that represent unknown number of genetic sub-populations through a hierarchical matrix-normal structure that incorporates gene-gene interactions, and possible mutations, induced by environmental variables. However, the product dependence structure implied by their matrix-normal model seems to be too simple to be appropriate for general complex, realistic situations. In this article, we propose and develop a novel nonparametric Bayesian model for case-control genotype data using hierarchies of Dirichlet processes that offers a more realistic and nonparametric dependence structure between the genes, induced by the environmental variables. In this regard, we propose a novel and highly parallelisable MCMC algorithm that is rendered quite efficient by the combination of modern parallel computing technology, effective Gibbs sampling steps, retrospective sampling and Transformation based Markov Chain Monte Carlo (TMCMC). We use appropriate Bayesian hypothesis testing procedures to detect the roles of genes and environment in case-control studies. We apply our ideas to 5 biologically realistic case-control genotype datasets simulated under distinct set-ups, and obtain encouraging results in each case. We finally apply our ideas to a real, myocardial infarction dataset, and obtain interesting results on gene-gene and gene-environment interaction, while broadly agreeing with the results reported in the literature.

preprint2020arXiv

Asymptotic Theory of Dependent Bayesian Multiple Testing Procedures Under Possible Model Misspecification

We study asymptotic properties of Bayesian multiple testing procedures and provide sufficient conditions for strong consistency under general dependence structure. We also consider a novel Bayesian multiple testing procedure and associated error measures that coherently accounts for the dependence structure present in the model. We advocate posterior versions of FDR and FNR as appropriate error rates and show that their asymptotic convergence rates are directly associated with the Kullback-Leibler divergence from the true model. Our results hold even when the class of postulated models is misspecified. We illustrate our results in a variable selection problem with autoregressive response variables, and compare the new Bayesian procedure with some existing methods through extensive simulation studies in the variable selection problem. Superior performance of the new procedure compared to the others vindicate that proper exploitation of the dependence structure by multiple testing methods is indeed important. Moreover, we obtain encouraging results in a real, maize data context, where we select influential marker variables.

preprint2020arXiv

Bayesian Appraisal of Random Series Convergence with Application to Climate Change

Roy and Bhattacharya (2020) provided Bayesian characterization of infinite series, and their most important application, namely, to the Dirichlet series characterizing the (in)famous Riemann Hypothesis, revealed insights that are not in support of the most celebrated conjecture for over 150 years. In contrast with deterministic series considered by Roy and Bhattacharya (2020), in this article we take up random infinite series for our investigation. Remarkably, our method does not require any simplifying assumption. Albeit the Bayesian characterization theory for random series is no different from that for the deterministic setup, construction of effective upper bounds for partial sums, required for implementation, turns out to be a challenging undertaking in the random setup. In this article, we construct parametric and nonparametric upper bound forms for the partial sums of random infinite series and demonstrate the generality of the latter in comparison to the former. Simulation studies exhibit high accuracy and efficiency of the nonparametric bound in all the setups that we consider. Finally, exploiting the property that the summands tend to zero in the case of series convergence, we consider application of our nonparametric bound driven Bayesian method to global climate change analysis. Specifically, analyzing the global average temperature record over the years 1850--2016 and Holocene global average temperature reconstruction data 12,000 years before present, we conclude, in spite of the current global warming situation, that global climate dynamics is subject to temporary variability only, the current global warming being an instance, and long term global warming or cooling either in the past or in the future, are highly unlikely.

preprint2020arXiv

Bayesian Characterizations of Properties of Stochastic Processes with Applications

In this article, we primarily propose a novel Bayesian characterization of stationary and nonstationary stochastic processes. In practice, this theory aims to distinguish between global stationarity and nonstationarity for both parametric and nonparametric stochastic processes. Interestingly, our theory builds on our previous work on Bayesian characterization of infinite series, which was applied to verification of the (in)famous Riemann Hypothesis. Thus, there seems to be interesting and important connections between pure mathematics and Bayesian statistics, with respect to our proposed ideas. We validate our proposed method with simulation and real data experiments associated with different setups. In particular, applications of our method include stationarity and nonstationarity determination in various time series models, spatial and spatio-temporal setups, and convergence diagnostics of Markov Chain Monte Carlo. Our results demonstrate very encouraging performance, even in very subtle situations. Using similar principles, we also provide a novel Bayesian characterization of mutual independence among any number of random variables, using which we characterize the properties of point processes, including characterizations of Poisson point processes, complete spatial randomness, stationarity and nonstationarity. Applications to simulation experiments with ample Poisson and non-Poisson point process models again indicate quite encouraging performance of our proposed ideas. We further propose a novel recursive Bayesian method for determination of frequencies of oscillatory stochastic processes, based on our general principle. Simulation studies and real data experiments with varieties of time series models consisting of single and multiple frequencies bring out the worth of our method.

preprint2020arXiv

Convergence of Pseudo-Bayes Factors in Forward and Inverse Regression Problems

In the Bayesian literature on model comparison, Bayes factors play the leading role. In the classical statistical literature, model selection criteria are often devised used cross-validation ideas. Amalgamating the ideas of Bayes factor and cross-validation Geisser and Eddy (1979) created the pseudo-Bayes factor. The usage of cross-validation inculcates several theoretical advantages, computational simplicity and numerical stability in Bayes factors as the marginal density of the entire dataset is replaced with products of cross-validation densities of individual data points. However, the popularity of pseudo-Bayes factors is still negligible in comparison with Bayes factors, with respect to both theoretical investigations and practical applications. In this article, we establish almost sure exponential convergence of pseudo-Bayes factors for large samples under a general setup consisting of dependent data and model misspecifications. We particularly focus on general parametric and nonparametric regression setups in both forward and inverse contexts. We illustrate our theoretical results with various examples, providing explicit calculations. We also supplement our asymptotic theory with simulation experiments in small sample situations of Poisson log regression and geometric logit and probit regression, additionally addressing the variable selection problem. We consider both linear and nonparametric regression modeled by Gaussian processes for our purposes. Our simulation results provide quite interesting insights into the usage of pseudo-Bayes factors in forward and inverse setups.

preprint2020arXiv

High-dimensional Asymptotic Theory of Bayesian Multiple Testing Procedures Under General Dependent Setup and Possible Misspecification

In this article, we investigate the asymptotic properties of Bayesian multiple testing procedures under general dependent setup, when the sample size and the number of hypotheses both tend to infinity. Specifically, we investigate strong consistency of the procedures and asymptotic properties of different versions of false discovery and false non-discovery rates under the high dimensional setup. We particularly focus on a novel Bayesian non-marginal multiple testing procedure and its associated error rates in this regard. Our results show that the asymptotic convergence rates of the error rates are directly associated with the Kullback-Leibler divergence from the true model, and the results hold even when the postulated class of models is misspecified. For illustration of our high-dimensional asymptotic theory, we consider a Bayesian variable selection problem in a time-varying covariate selection framework, with autoregressive response variables. We particularly focus on the setup where the number of hypotheses increases at a faster rate compared to the sample size, which is the so-called ultra-high dimensional situation.

preprint2020arXiv

Nonstationary, Nonparametric, Nonseparable Bayesian Spatio-Temporal Modeling Using Kernel Convolution of Order Based Dependent Dirichlet Process

In this article, using kernel convolution of order based dependent Dirichlet process (Griffin and Steel (2006)) we construct a nonstationary, nonseparable, nonparametric space-time process, which, as we show, satisfies desirable properties, and includes the stationary, separable, parametric processes as special cases. We also investigate the smoothness properties of our proposed model. Since our model entails an infinite random series, for Bayesian model fitting purpose we must either truncate the series or more appropriately consider a random number of summands, which renders the model dimension a random variable. We attack the variable dimensionality problem using Transdimensional Transformation based Markov Chain Monte Carlo introduced by Das and Bhattacharya (2019b), which can update all the variables and also change dimensions in a single block using essentially a single random variable drawn from some arbitrary density defined on a relevant support. For the sake of completeness we also address the problem of truncating the infinite series by providing a uniform bound on the error incurred by truncating the infinite series. We illustrate the effectiveness of our model and methodologies on a simulated data set and demonstrate that our approach significantly outperforms that of Fuentes and Reich (2013) which is based on principles somewhat similar to ours. We also fit two real, spatial and spatio-temporal datasets with our approach and obtain quite encouraging results in both the cases.

preprint2020arXiv

On Classical and Bayesian Asymptotics in Stochastic Differential Equations with Random Effects having Mixture Normal Distributions

Delattre et al. (2013) considered a system of stochastic differential equations (SDEs) in a random effects setup. Under the independent and identical (iid) situation, and assuming normal distribution of the random effects, they established weak consistency of the maximum likelihood estimators (M LEs) of the population parameters of the random effects. In this article, respecting the increasing importance and versatility of normal mixtures and their ability to approximate any standard distribution, we consider the random effects having mixture of normal distributions and prove asymptotic results associated with the MLEs in both independent and identical (iid) and independent but not identical (non-iid) situations. Besides, we consider iid and non-iid setups under the Bayesian paradigm and establish posterior consistency and asymptotic normality of the posterior distribution of the population parameters, even when the number of mixture components is unknown and treated as a random variable. Although ours is an independent work, we later noted that Delattre et al. (2016) also assumed the SDE setup with normal mixture distribution of the random effect parameters but considered only the iid case and proved only weak consistency of the M LE under an extra, strong assumption as opposed to strong consistency that we are able to prove without the extra assumption. Furthermore, they did not deal with asymptotic normality of M LE or the Bayesian asymptotics counterpart which we investigate in details. Ample simulation experiments and application to a real, stock market data set reveal the importance and usefulness of our methods even for small samples.

preprint2020arXiv

On the Characterization of Saddle Point Equilibrium for Security Games with Additive Utility

In this work, we investigate a security game between an attacker and a defender, originally proposed in \cite{emadi2019security}. As is well known, the combinatorial nature of security games leads to a large cost matrix. Therefore, computing the value and optimal strategy for the players becomes computationally expensive. In this work, we analyze a special class of zero-sum games in which the payoff matrix has a special structure which results from the {\it additive property} of the utility function. Based on variational principles, we present structural properties of optimal attacker as well as defender's strategy. We propose a linear-time algorithm to compute the value based on the structural properties, which is an improvement from our previous result in \cite{emadi2019security}, especially in the context of large-scale zero-sum games.

preprint2020arXiv

Posterior Consistency of Bayesian Inverse Regression and Inverse Reference Distributions

We consider Bayesian inference in inverse regression problems where the objective is to infer about unobserved covariates from observed responses and covariates. We establish posterior consistency of such unobserved covariates in Bayesian inverse regression problemsunder appropriate priors in a leave-one-out cross-validation setup. We relate this to posterior consistency of inverse reference distributions (Bhattacharya (2013)) for assessing model adequacy. We illustrate our theory and methods with various examples of Bayesian inverse regression, along with adequate simulation experiments.

preprint2020arXiv

Posterior Convergence of Gaussian and General Stochastic Process Regression Under Possible Misspecifications

In this article, we investigate posterior convergence in nonparametric regression models where the unknown regression function is modeled by some appropriate stochastic process. In this regard, we consider two setups. The first setup is based on Gaussian processes, where the covariates are either random or non-random and the noise may be either normally or double-exponentially distributed. In the second setup, we assume that the underlying regression function is modeled by some reasonably smooth, but unspecified stochastic process satisfying reasonable conditions. The distribution of the noise is also left unspecified, but assumed to be thick-tailed. As in the previous studies regarding the same problems, we do not assume that the truth lies in the postulated parameter space, thus explicitly allowing the possibilities of misspecification. We exploit the general results of Shalizi (2009) for our purpose and establish not only posterior consistency, but also the rates at which the posterior probabilities converge, which turns out to be the Kullback-Leibler divergence rate. We also investigate the more familiar posterior convergence rates. Interestingly, we show that the posterior predictive distribution can accurately approximate the best possible predictive distribution in the sense that the Hellinger distance, as well as the total variation distance between the two distributions can tend to zero, in spite of misspecifications.

preprint2020arXiv

Posterior Convergence of Nonparametric Binary and Poisson Regression Under Possible Misspecifications

In this article, we investigate posterior convergence of nonparametric binary and Poisson regression under possible model misspecification, assuming general stochastic process prior with appropriate properties. Our model setup and objective for binary regression is similar to that of Ghosal and Roy (2006) where the authors have used the approach of entropy bound and exponentially consistent tests with the sieve method to achieve consistency with respect to their Gaussian process prior. In contrast, for both binary and Poisson regression, using general stochastic process prior, our approach involves verification of asymptotic equipartition property along with the method of sieve, which is a manoeuvre of the general results of Shalizi (2009), useful even for misspecified models. Moreover, we will establish not only posterior consistency but also the rates at which the posterior probabilities converge, which turns out to be the Kullback-Leibler divergence rate. We also investgate the traditional posterior convergence rates. Interestingly, from subjective Bayesian viewpoint we will show that the posterior predictive distribution can accurately approximate the best possible predictive distribution in the sense that the Hellinger distance, as well as the total variation distance between the two distributions can tend to zero, in spite of misspecifications.

preprint2016arXiv

On Asymptotics Related to Classical Inference in Stochastic Differential Equations with Random Effects

Delattre et al. (2013) considered n independent stochastic differential equations (SDEs), where in each case the drift term is associated with a random effect, the distribution of which depends upon unknown parameters. Assuming the independent and identical (iid) situation the authors provide independent proofs of weak consistency and asymptotic normality of the maximum likelihood estimators (MLEs) of the hyper-parameters of their random effects parameters. In this article, as an alternative route to proving consistency and asymptotic normality in the SDE set-up involving random effects, we verify the regularity conditions required by existing relevant theorems. In particular, this approach allowed us to prove strong consistency under weaker assumption. But much more importantly, we further consider the independent, but non-identical set-up associated with the random effects based SDE framework, and prove asymptotic results associated with the MLEs.

preprint2016arXiv

On Bayesian Asymptotics in Stochastic Differential Equations with Random Effects

Delattre et al. (2013) investigated asymptotic properties of the maximum likelihood estimator of the population parameters of the random effects associated with n independent stochastic differential equations (SDEs) assuming that the SDEs are independent and identical (iid). In this article, we consider the Bayesian approach to learning about the population parameters, and prove consistency and asymptotic normality of the corresponding posterior distribution in the iid set-up as well as when the SDEs are independent but non-identical.

preprint2016arXiv

On the Optimal Policies for Visibility-Based Target Tracking

In this paper, we investigate a pursuit-evasion game in which a mobile observer tries to track a target in an environment containing obstacles. We formulate the game as an optimal control problem with state inequality constraint in a simple environment. We show that for some initial conditions, there are two different regimes in the optimal strategy of the pursuer depending on whether the state-constraint is activated. We derive the equations that characterize the switching time between the two regimes. The pursuer's optimal tracking strategy in a simple environment is further extended to a general environment with multiple polygonal obstacles. We propose techniques to construct a "pursuit field" based on the optimal solutions to guide the motion of the observer in a general environment.

preprint2016arXiv

Partitioning Strategies and Task Allocation for Target-tracking with Multiple Guards in Polygonal Environments

This paper presents an algorithm to deploy a team of {\it free} guards equipped with omni-directional cameras for tracking a bounded speed intruder inside a simply-connected polygonal environment. The proposed algorithm partitions the environment into smaller polygons, and assigns a guard to each partition so that the intruder is visible to at least one guard at all times. Based on the concept of {\it dynamic zones} introduced in this paper, we propose event-triggered strategies for the guards to track the intruder. We show that the number of guards deployed by the algorithm for tracking is strictly less than $\lfloor {\frac{n}{3}} \rfloor$ which is sufficient and sometimes necessary for coverage. We derive an upper bound on the speed of the mobile guard required for successful tracking which depends on the intruder's speed, the road map of the mobile guards, and geometry of the environment. Finally, we extend the aforementioned analysis to orthogonal polygons, and show that the upper bound on the number of guards deployed for tracking is strictly less than $\lfloor {\frac{n}{4}} \rfloor$ which is sufficient and sometimes necessary for the coverage problem.

preprint2016arXiv

Towards a Framework for Tracking Multiple Targets: Hybrid Systems meets Computational Geometry

We investigate a variation of the art gallery problem in which a team of mobile guards tries to track an unpredictable intruder in a simply-connected polygonal environment. In this work, we use the deployment strategy for diagonal guards originally proposed in [1]. The guards are confined to move along the diagonals of a polygon and the intruder can move freely within the environment. We define critical regions to generate event-triggered strategies for the guards. We design a hybrid automaton based on the critical regions to model the tracking problem. Based on reachability analysis, we provide necessary and sufficient conditions for tracking in terms of the maximal controlled invariant set of the hybrid system. We express these conditions in terms of the critical curves to find sufficient conditions for n/4 guards to track the mobile intruder using the reachability analysis.

preprint2015arXiv

Bayesian Nonparametric Dynamic State Space Modeling with Circular Latent States

State space models are well-known for their versatility in modeling dynamic systems that arise in various scientific disciplines. Although parametric state space models are well studied, nonparametric approaches are much less explored in comparison. In this article we propose a novel Bayesian nonparametric approach to state space modeling assuming that both the observational and evolutionary functions are unknown and are varying with time; crucially, we assume that the unknown evolutionary equation describes dynamic evolution of some latent circular random variable. Based on appropriate kernel convolution of the standard Wiener process we model the time-varying observational and evolutionary functions as suitable Gaussian processes that take both linear and circular variables as arguments. Additionally, for the time-varying evolutionary function, we wrap the Gaussian process thus constructed around the unit circle to form an appropriate circular Gaussian process. We show that our process thus created satisfies desirable properties. For the purpose of inference we develop an MCMC based methodology combining Gibbs sampling and Metropolis-Hastings algorithms. Applications to a simulated data set, a real wind speed data set and a real ozone data set demonstrated quite encouraging performances of our model and methodologies.

preprint2015arXiv

Bayesian Nonparametric Estimation of Milky Way Model Parameters Using a New Matrix-Variate Gaussian Process Based Method

In this paper we develop an inverse Bayesian approach to find the value of the unknown model parameter vector that supports the real (or test) data, where the data comprises measurements of a matrix-variate variable. The method is illustrated via the estimation of the unknown Milky Way feature parameter vector, using available test and simulated (training) stellar velocity data matrices. The data is represented as an unknown function of the model parameters, where this high-dimensional function is modelled using a high-dimensional Gaussian Process (${\cal GP}$). The model for this function is trained using available training data and inverted by Bayesian means, to estimate the sought value of the model parameter vector at which the test data is realised. We achieve a closed-form expression for the posterior of the unknown parameter vector and the parameters of the invoked ${\cal GP}$, given test and training data. We perform model fitting by comparing the observed data with predictions made at different summaries of the posterior probability of the model parameter vector. As a supplement, we undertake a leave-one-out cross validation of our method.

preprint2015arXiv

Particle Swarm Optimization Based Source Seeking

Signal source seeking using autonomous vehicles is a complex problem. The complexity increases manifold when signal intensities captured by physical sensors onboard are noisy and unreliable. Added to the fact that signal strength decays with distance, noisy environments make it extremely difficult to describe and model a decay function. This paper addresses our work with seeking maximum signal strength in a continuous electromagnetic signal source with mobile robots, using Particle Swarm Optimization (PSO). A one to one correspondence with swarm members in a PSO and physical Mobile robots is established and the positions of the robots are iteratively updated as the PSO algorithm proceeds forward. Since physical robots are responsive to swarm position updates, modifications were required to implement the interaction between real robots and the PSO algorithm. The development of modifications necessary to implement PSO on mobile robots, and strategies to adapt to real life environments such as obstacles and collision objects are presented in this paper. Our findings are also validated using experimental testbeds.

preprint2014arXiv

A Note on the Misuse of the Variance Test in Meteorological Studies

The erroneous assumption "for all distributions for which the theoretical variance can be computed independently from parameters estimated by any method different from the method of moments" has been used in the case of fitting the gamma distribution to a rainfall data by Mooley (1973) which was followed by several researchers. We show that the asymptotic distribution of the test statistic is generally not even comparable to any central chi-square distribution. We also describe a method for checking the validity of the asymptotic distribution for a class of distributions.

preprint2014arXiv

Bayesian Inference in Nonparametric Dynamic State-Space Models

We introduce state-space models where the functionals of the observational and the evolutionary equations are unknown, and treated as random functions evolving with time. Thus, our model is nonparametric and generalizes the traditional parametric state-space models. This random function approach also frees us from the restrictive assumption that the functional forms, although time-dependent, are of fixed forms. The traditional approach of assuming known, parametric functional forms is questionable, particularly in state-space models, since the validation of the assumptions require data on both the observed time series and the latent states; however, data on the latter are not available in state-space models. We specify Gaussian processes as priors of the random functions and exploit the "look-up table approach" of \ctn{Bhattacharya07} to efficiently handle the dynamic structure of the model. We consider both univariate and multivariate situations, using the Markov chain Monte Carlo (MCMC) approach for studying the posterior distributions of interest. In the case of challenging multivariate situations we demonstrate that the newly developed Transformation-based MCMC (TMCMC) of \ctn{Dutta11} provides interesting and efficient alternatives to the usual proposal distributions. We illustrate our methods with a challenging multivariate simulated data set, where the true observational and the evolutionary equations are highly non-linear, and treated as unknown. The results we obtain are quite encouraging. Moreover, using our Gaussian process approach we analysed a real data set, which has also been analysed by \ctn{Shumway82} and \ctn{Carlin92} using the linearity assumption. Our analyses show that towards the end of the time series, the linearity assumption of the previous authors breaks down.

preprint2014arXiv

Minimum Distance Estimation of Milky Way Model Parameters and Related Inference

We propose a method to estimate the location of the Sun in the disk of the Milky Way using a method based on the Hellinger distance and construct confidence sets on our estimate of the unknown location using a bootstrap based method. Assuming the Galactic disk to be two-dimensional, the sought solar location then reduces to the radial distance separating the Sun from the Galactic center and the angular separation of the Galactic center to Sun line, from a pre-fixed line on the disk. On astronomical scales, the unknown solar location is equivalent to the location of us earthlings who observe the velocities of a sample of stars in the neighborhood of the Sun. This unknown location is estimated by undertaking pairwise comparisons of the estimated density of the observed set of velocities of the sampled stars, with densities estimated using synthetic stellar velocity data sets generated at chosen locations in the Milky Way disk according to four base astrophysical models. The "match" between the pair of estimated densities is parameterized by the affinity measure based on the familiar Hellinger distance. We perform a novel cross-validation procedure to establish a desirable "consistency" property of the proposed method.

preprint2014arXiv

On Single Variable Transformation Approach to Markov Chain Monte Carlo

Random Walk Metropolis Hastings (RWMH) algorithm, is quite inefficient in high dimensions because of its abysmally slow acceptance rate. The slow acceptance rate results from the fact that RWMH separately updates each coordinate of the chain at every step. Dutta and Bhattacharya (2013) proposed a new technique called Transformation based Markov Chain Monte Carlo (TMCMC) aimed at overcoming these problems. This method updates all co-ordinates at a time- ensuring stable acceptance in all dimensions. We have shown here that geometric ergodicity is achieved for sub-exponential targets for two versions of TMCMC- the additive and the additive-multiplicative hybrid TMCMC schemes. Also, we obtain the optimal scaling by maximizing the diffusion speed of the limiting time-scaled diffusion process for TMCMC. We show that the optimal acceptance rate is 0.439 for TMCMC which is almost twice as large as RWMH (0.234). We observe that convergence to stationarity for TMCMC is faster than RWMH but the mixing property in RWMH is relatively better. However TMCMC is more robust with respect to scaling and dimensionality. This is attested by simulation runs on Gaussian and nearest neighbor models.

preprint2013arXiv

An Improved Bayesian Semiparametric Model for Palaeoclimate Reconstruction: Cross-validation Based Model Assessment

Fossil-based palaeoclimate reconstruction is an important area of ecological science that has gained momentum in the backdrop of the global climate change debate. The hierarchical Bayesian paradigm provides an interesting platform for studying such important scientific issue. However, our cross-validation based assessment of the existing Bayesian hierarchical models with respect to two modern proxy data sets based on chironomid and pollen, respectively, revealed that the models are inadequate for the data sets. In this paper, we model the species assemblages (compositional data) by the zero-inflated multinomial distribution, while modelling the species response functions using Dirichlet process based Gaussian mixtures. This modelling strategy yielded significantly improved performances, and a formal Bayesian test of model adequacy, developed recently, showed that our new model is adequate for both the modern data sets. Furthermore, combining together the zero-inflated assumption, Importance Resampling Markov Chain Monte Carlo (IRMCMC) and the recently developed Transformation-based Markov Chain Monte Carlo (TMCMC), we develop a powerful and efficient computational methodology.

preprint2013arXiv

Clustering Categorical Time Series into Unknown Number of Clusters: A Perfect Simulation based Approach

Pamminger and Fruwirth-Schnatter (2010) considered a Bayesian approach to model-based clustering of categorical time series assuming a fixed number of clusters. But the popular methods for selecting the number of clusters, for example, the Bayes Information Criterion (BIC), turned out to have severe problems in the categorical time series context. In this paper, we circumvent the difficulties of choosing the number of clusters by adopting the Bayesian semiparametric mixture model approach introduced by Bhattacharya (2008), who assume that the number of clusters is a random quantity, but is bounded above by a (possibly large) number of clusters. We adopt the perfect simulation approach of Mukhopadhyay and Bhattacharya (2012) for posterior simulation for completely solving the problems of convergence of the underlying Markov chain Monte Carlo (MCMC) approach. Importantly, within our main perfect simulation algorithm, there arose the necessity to simulate perfectly from the joint distribution of a set of continuous random variables with log-concave full conditional densities. We propose and develop a novel and efficient perfect simulation methodology for joint distributions with log-concave full conditionals. This perfect sampling methodology is of independent interest as well since in a very large and important class of Bayesian applications the full conditionals turn out to be log-concave. We will consider application of our model and methodology to the Austrian wage mobility data, also analysed by Pamminger and Fruwirth-Schnatter (2010), and adopting the methods developed in Mukhopadhyay et al. (2011), Mukhopadhyay et al. (2012), will obtain the posterior modes of clusterings and also the desired highest posterior distribution credible regions of the posterior distribution of clusterings.

preprint2013arXiv

Markov Chain Monte Carlo Based on Deterministic Transformations

In this article we propose a novel MCMC method based on deterministic transformations T: X x D --> X where X is the state-space and D is some set which may or may not be a subset of X. We refer to our new methodology as Transformation-based Markov chain Monte Carlo (TMCMC). One of the remarkable advantages of our proposal is that even if the underlying target distribution is very high-dimensional, deterministic transformation of a one-dimensional random variable is sufficient to generate an appropriate Markov chain that is guaranteed to converge to the high-dimensional target distribution. Apart from clearly leading to massive computational savings, this idea of deterministically transforming a single random variable very generally leads to excellent acceptance rates, even though all the random variables associated with the high-dimensional target distribution are updated in a single block. Since it is well-known that joint updating of many random variables using Metropolis-Hastings (MH) algorithm generally leads to poor acceptance rates, TMCMC, in this regard, seems to provide a significant advance. We validate our proposal theoretically, establishing the convergence properties. Furthermore, we show that TMCMC can be very effectively adopted for simulating from doubly intractable distributions. TMCMC is compared with MH using the well-known Challenger data, demonstrating the effectiveness of of the former in the case of highly correlated variables. Moreover, we apply our methodology to a challenging posterior simulation problem associated with the geostatistical model of Diggle et al. (1998), updating 160 unknown parameters jointly, using a deterministic transformation of a one-dimensional random variable. Remarkable computational savings as well as good convergence properties and acceptance rates are the results.

preprint2013arXiv

Supplement to "Markov Chain Monte Carlo Based on Deterministic Transformations"

This is a supplement to the article "Markov Chain Monte Carlo Based on Deterministic Transformations" available at http://arxiv.org/abs/1106.5850

preprint2012arXiv

Perfect Simulation for Mixtures with Known and Unknown Number of components

We propose and develop a novel and effective perfect sampling methodology for simulating from posteriors corresponding to mixtures with either known (fixed) or unknown number of components. For the latter we consider the Dirichlet process-based mixture model developed by these authors, and show that our ideas are applicable to conjugate, and importantly, to non-conjugate cases. As to be expected, and, as we show, perfect sampling for mixtures with known number of components can be achieved with much less effort with a simplified version of our general methodology, whether or not conjugate or non-conjugate priors are used. While no special assumption is necessary in the conjugate set-up for our theory to work, we require the assumption of bounded parameter space in the non-conjugate set-up. However, we argue, with appropriate analytical, simulation, and real data studies as support, that such boundedness assumption is not unrealistic and is not an impediment in practice. Not only do we validate our ideas theoretically and with simulation studies, but we also consider application of our proposal to three real data sets used by several authors in the past in connection with mixture models. The results we achieved in each of our experiments with either simulation study or real data application, are quite encouraging.

preprint2011arXiv

A nonstationary nonparametric Bayesian approach to dynamically modeling effective connectivity in functional magnetic resonance imaging experiments

Effective connectivity analysis provides an understanding of the functional organization of the brain by studying how activated regions influence one other. We propose a nonparametric Bayesian approach to model effective connectivity assuming a dynamic nonstationary neuronal system. Our approach uses the Dirichlet process to specify an appropriate (most plausible according to our prior beliefs) dynamic model as the "expectation" of a set of plausible models upon which we assign a probability distribution. This addresses model uncertainty associated with dynamic effective connectivity. We derive a Gibbs sampling approach to sample from the joint (and marginal) posterior distributions of the unknowns. Results on simulation experiments demonstrate our model to be flexible and a better candidate in many situations. We also used our approach to analyzing functional Magnetic Resonance Imaging (fMRI) data on a Stroop task: our analysis provided new insight into the mechanism by which an individual brain distinguishes and learns about shapes of objects.

preprint2011arXiv

Adaptive Resource Allocation in Jamming Teams Using Game Theory

In this work, we study the problem of power allocation and adaptive modulation in teams of decision makers. We consider the special case of two teams with each team consisting of two mobile agents. Agents belonging to the same team communicate over wireless ad hoc networks, and they try to split their available power between the tasks of communication and jamming the nodes of the other team. The agents have constraints on their total energy and instantaneous power usage. The cost function adopted is the difference between the rates of erroneously transmitted bits of each team. We model the adaptive modulation problem as a zero-sum matrix game which in turn gives rise to a a continuous kernel game to handle power control. Based on the communications model, we present sufficient conditions on the physical parameters of the agents for the existence of a pure strategy saddle-point equilibrium (PSSPE).

preprint2011arXiv

On Bayesian "central clustering": Application to landscape classification of Western Ghats

Landscape classification of the well-known biodiversity hotspot, Western Ghats (mountains), on the west coast of India, is an important part of a world-wide program of monitoring biodiversity. To this end, a massive vegetation data set, consisting of 51,834 4-variate observations has been clustered into different landscapes by Nagendra and Gadgil [Current Sci. 75 (1998) 264--271]. But a study of such importance may be affected by nonuniqueness of cluster analysis and the lack of methods for quantifying uncertainty of the clusterings obtained. Motivated by this applied problem of much scientific importance, we propose a new methodology for obtaining the global, as well as the local modes of the posterior distribution of clustering, along with the desired credible and "highest posterior density" regions in a nonparametric Bayesian framework. To meet the need of an appropriate metric for computing the distance between any two clusterings, we adopt and provide a much simpler, but accurate modification of the metric proposed in [In Felicitation Volume in Honour of Prof. B. K. Kale (2009) MacMillan]. A very fast and efficient Bayesian methodology, based on [Sankhyā Ser. B 70 (2008) 133--155], has been utilized to solve the computational problems associated with the massive data and to obtain samples from the posterior distribution of clustering on which our proposed methods of summarization are illustrated.

preprint2011arXiv

Power Allocation in Team Jamming Games in Wireless Ad Hoc Networks

In this work, we study the problem of power allocation in teams. Each team consists of two agents who try to split their available power between the tasks of communication and jamming the nodes of the other team. The agents have constraints on their total energy and instantaneous power usage. The cost function is the difference between the rates of erroneously transmitted bits of each team. We model the problem as a zero-sum differential game between the two teams and use {\it{Isaacs'}} approach to obtain the necessary conditions for the optimal trajectories. This leads to a continuous-kernel power allocation game among the players. Based on the communications model, we present sufficient conditions on the physical parameters of the agents for the existence of a pure strategy Nash equilibrium (PSNE). Finally, we present simulation results for the case when the agents are holonomic.

preprint2011arXiv

Switching Strategies for Linear Feedback Stabilization with Sparsified State Measurements

In this paper, we address the problem of stabilization in continuous time linear dynamical systems using state feedback when compressive sampling techniques are used for state measurement and reconstruction. In [5], we had introduced the concept of using l1 reconstruction technique, commonly used in sparse data reconstruction, for state measurement and estimation in a discrete time linear system. In this work, we extend the previous scenario to analyse continuous time linear systems. We investigate the effect of switching within a set of sparsifiers, introduced in [5], on the stability of a linear plant in continuous time settings. Initially, we analyze the problem of stabilization in low dimensional systems, following which we generalize the results to address the problem of stabilization in systems of arbitrary dimensions.

Sourabh Bhattacharya

What is connected

Connect this record

See the researcher in context

Building this map preview

39 published item(s)

The Bayesian Reflex: Online Learning as the Autonomic Nervous System of Modern and Future AI

Additive Security Games: Structure and Optimization

IID Sampling from Posterior Dirichlet Process Mixtures

A Bayesian Multiple Testing Paradigm for Model Selection in Inverse Regression Problems

A Fully Bayesian Approach to Assessment of Model Adequacy in Inverse Problems

A Non-Gaussian, Nonparametric Structure for Gene-Gene and Gene-Environment Interactions in Case-Control Studies Based on Hierarchies of Dirichlet Processes

Asymptotic Theory of Dependent Bayesian Multiple Testing Procedures Under Possible Model Misspecification

Bayesian Appraisal of Random Series Convergence with Application to Climate Change

Bayesian Characterizations of Properties of Stochastic Processes with Applications

Convergence of Pseudo-Bayes Factors in Forward and Inverse Regression Problems

High-dimensional Asymptotic Theory of Bayesian Multiple Testing Procedures Under General Dependent Setup and Possible Misspecification

Nonstationary, Nonparametric, Nonseparable Bayesian Spatio-Temporal Modeling Using Kernel Convolution of Order Based Dependent Dirichlet Process

On Classical and Bayesian Asymptotics in Stochastic Differential Equations with Random Effects having Mixture Normal Distributions

On the Characterization of Saddle Point Equilibrium for Security Games with Additive Utility

Posterior Consistency of Bayesian Inverse Regression and Inverse Reference Distributions

Posterior Convergence of Gaussian and General Stochastic Process Regression Under Possible Misspecifications

Posterior Convergence of Nonparametric Binary and Poisson Regression Under Possible Misspecifications

On Asymptotics Related to Classical Inference in Stochastic Differential Equations with Random Effects

On Bayesian Asymptotics in Stochastic Differential Equations with Random Effects

On the Optimal Policies for Visibility-Based Target Tracking

Partitioning Strategies and Task Allocation for Target-tracking with Multiple Guards in Polygonal Environments

Towards a Framework for Tracking Multiple Targets: Hybrid Systems meets Computational Geometry

Bayesian Nonparametric Dynamic State Space Modeling with Circular Latent States

Bayesian Nonparametric Estimation of Milky Way Model Parameters Using a New Matrix-Variate Gaussian Process Based Method

Particle Swarm Optimization Based Source Seeking

A Note on the Misuse of the Variance Test in Meteorological Studies

Bayesian Inference in Nonparametric Dynamic State-Space Models

Minimum Distance Estimation of Milky Way Model Parameters and Related Inference

On Single Variable Transformation Approach to Markov Chain Monte Carlo

An Improved Bayesian Semiparametric Model for Palaeoclimate Reconstruction: Cross-validation Based Model Assessment

Clustering Categorical Time Series into Unknown Number of Clusters: A Perfect Simulation based Approach

Markov Chain Monte Carlo Based on Deterministic Transformations

Supplement to "Markov Chain Monte Carlo Based on Deterministic Transformations"

Perfect Simulation for Mixtures with Known and Unknown Number of components

A nonstationary nonparametric Bayesian approach to dynamically modeling effective connectivity in functional magnetic resonance imaging experiments

Adaptive Resource Allocation in Jamming Teams Using Game Theory

On Bayesian "central clustering": Application to landscape classification of Western Ghats

Power Allocation in Team Jamming Games in Wireless Ad Hoc Networks

Switching Strategies for Linear Feedback Stabilization with Sparsified State Measurements