Source author record

Ankit Gupta

Ankit Gupta appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

30works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Video Active Perception: Effective Inference-Time Long-Form Video Understanding with Vision-Language Models

Large vision-language models (VLMs) have advanced multimodal tasks such as video question answering (QA). However, VLMs face the challenge of selecting frames effectively and efficiently, as standard uniform sampling is expensive and performance may plateau. Inspired by active perception theory, which posits that models gain information by acquiring data that differs from their expectations, we introduce Video Active Perception (VAP), a training-free method to enhance long-form video QA using VLMs. Our approach treats keyframe selection as data acquisition in active perception and leverages a lightweight text-conditioned video generation model to represent prior world knowledge. Empirically, VAP achieves state-of-the-art zero-shot results on long-form or reasoning video QA datasets such as EgoSchema, NExT-QA, ActivityNet-QA, IntentQA, and CLEVRER, achieving an increase of up to 5.6 x frame efficiency by frames per question over standard GPT-4o, Gemini 1.5 Pro, and LLaVA-OV. Moreover, VAP shows stronger reasoning abilities than previous methods and effectively selects keyframes relevant to questions. These findings highlight the potential of leveraging active perception to improve the frame effectiveness and efficiency of long-form video QA.

preprint2022arXiv

Adaptive Traffic Signal Control for Developing Countries Using Fused Parameters Derived from Crowd-Source Data

Advancement of mobile technologies has enabled economical collection, storage, processing, and sharing of traffic data. These data are made accessible to intended users through various application program interfaces (API) and can be used to recognize and mitigate congestion in real time. In this paper, quantitative (time of arrival) and qualitative (color-coded congestion levels) data were acquired from the Google traffic APIs. New parameters that reflect heterogeneous traffic conditions were defined and utilized for real-time control of traffic signals while maintaining the green-to-red time ratio. The proposed method utilizes a congestion-avoiding principle commonly used in computer networking. Adaptive congestion levels were observed on three different intersections of Delhi (India), in peak hours. It showed good variation, hence sensitive for the control algorithm to act efficiently. Also, simulation study establishes that proposed control algorithm decreases waiting time and congestion. The proposed method provides an inexpensive alternative for traffic sensing and tracking technologies.

preprint2022arXiv

Braitenberg Vehicles as Developmental Neurosimulation

Connecting brain and behavior is a longstanding issue in the areas of behavioral science, artificial intelligence, and neurobiology. As is standard among models of artificial and biological neural networks, an analogue of the fully mature brain is presented as a blank slate. However, this does not consider the realities of biological development and developmental learning. Our purpose is to model the development of an artificial organism that exhibits complex behaviors. We introduce three alternate approaches to demonstrate how developmental embodied agents can be implemented. The resulting developmental BVs (dBVs) will generate behaviors ranging from stimulus responses to group behavior that resembles collective motion. We will situate this work in the domain of artificial brain networks along with broader themes such as embodied cognition, feedback, and emergence. Our perspective is exemplified by three software instantiations that demonstrate how a BV-genetic algorithm hybrid model, multisensory Hebbian learning model, and multi-agent approaches can be used to approach BV development. We introduce use cases such as optimized spatial cognition (vehicle-genetic algorithm hybrid model), hinges connecting behavioral and neural models (multisensory Hebbian learning model), and cumulative classification (multi-agent approaches). In conclusion, we consider future applications of the developmental neurosimulation approach.

preprint2022arXiv

Diagonal State Spaces are as Effective as Structured State Spaces

Modeling long range dependencies in sequential data is a fundamental step towards attaining human-level performance in many modalities such as text, vision, audio and video. While attention-based models are a popular and effective choice in modeling short-range interactions, their performance on tasks requiring long range reasoning has been largely inadequate. In an exciting result, Gu et al. (ICLR 2022) proposed the $\textit{Structured State Space}$ (S4) architecture delivering large gains over state-of-the-art models on several long-range tasks across various modalities. The core proposition of S4 is the parameterization of state matrices via a diagonal plus low rank structure, allowing efficient computation. In this work, we show that one can match the performance of S4 even without the low rank correction and thus assuming the state matrices to be diagonal. Our $\textit{Diagonal State Space}$ (DSS) model matches the performance of S4 on Long Range Arena tasks, speech classification on Speech Commands dataset, while being conceptually simpler and straightforward to implement.

preprint2022arXiv

Long Range Language Modeling via Gated State Spaces

State space models have shown to be effective at modeling long range dependencies, specially on sequence classification tasks. In this work we focus on autoregressive sequence modeling over English books, Github source code and ArXiv mathematics articles. Based on recent developments around the effectiveness of gated activation functions, we propose a new layer named Gated State Space (GSS) and show that it trains significantly faster than the diagonal version of S4 (i.e. DSS) on TPUs, is fairly competitive with several well-tuned Transformer-based baselines and exhibits zero-shot generalization to longer inputs while being straightforward to implement. Finally, we show that leveraging self-attention to model local dependencies improves the performance of GSS even further.

preprint2022arXiv

Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements

Large bandwidth at mm-wave is crucial for 5G and beyond but the high path loss (PL) requires highly accurate PL prediction for network planning and optimization. Statistical models with slope-intercept fit fall short in capturing large variations seen in urban canyons, whereas ray-tracing, capable of characterizing site-specific features, faces challenges in describing foliage and street clutter and associated reflection/diffraction ray calculation. Machine learning (ML) is promising but faces three key challenges in PL prediction: 1) insufficient measurement data; 2) lack of extrapolation to new streets; 3) overwhelmingly complex features/models. We propose an ML-based urban canyon PL prediction model based on extensive 28 GHz measurements from Manhattan where street clutters are modeled via a LiDAR point cloud dataset and buildings by a mesh-grid building dataset. We extract expert knowledge-driven street clutter features from the point cloud and aggressively compress 3D-building information using convolutional-autoencoder. Using a new street-by-street training and testing procedure to improve generalizability, the proposed model using both clutter and building features achieves a prediction error (RMSE) of $4.8 \pm 1.1$ dB compared to $10.6 \pm 4.4$ dB and $6.5 \pm 2.0$ dB for 3GPP LOS and slope-intercept prediction, respectively, where the standard deviation indicates street-by-street variation. By only using four most influential clutter features, RMSE of $5.5\pm 1.1$ dB is achieved.

preprint2022arXiv

On the Parameterization and Initialization of Diagonal State Space Models

State space models (SSM) have recently been shown to be very effective as a deep learning layer as a promising alternative to sequence models such as RNNs, CNNs, or Transformers. The first version to show this potential was the S4 model, which is particularly effective on tasks involving long-range dependencies by using a prescribed state matrix called the HiPPO matrix. While this has an interpretable mathematical mechanism for modeling long dependencies, it introduces a custom representation and algorithm that can be difficult to implement. On the other hand, a recent variant of S4 called DSS showed that restricting the state matrix to be fully diagonal can still preserve the performance of the original model when using a specific initialization based on approximating S4's matrix. This work seeks to systematically understand how to parameterize and initialize such diagonal state space models. While it follows from classical results that almost all SSMs have an equivalent diagonal form, we show that the initialization is critical for performance. We explain why DSS works mathematically, by showing that the diagonal restriction of S4's matrix surprisingly recovers the same kernel in the limit of infinite state dimension. We also systematically describe various design choices in parameterizing and computing diagonal SSMs, and perform a controlled empirical study ablating the effects of these choices. Our final model S4D is a simple diagonal version of S4 whose kernel computation requires just 2 lines of code and performs comparably to S4 in almost all settings, with state-of-the-art results for image, audio, and medical time-series domains, and averaging 85\% on the Long Range Arena benchmark.

preprint2022arXiv

Stochastic filtering for multiscale stochastic reaction networks based on hybrid approximations

In the past few decades, the development of fluorescent technologies and microscopic techniques has greatly improved scientists' ability to observe real-time single-cell activities. In this paper, we consider the filtering problem associate with these advanced technologies, i.e., how to estimate latent dynamic states of an intracellular multiscale stochastic reaction network from time-course measurements of fluorescent reporters. A good solution to this problem can further improve scientists' ability to extract information about intracellular systems from time-course experiments. A straightforward approach to this filtering problem is to use a particle filter where particles are generated by simulation of the full model and weighted according to observations. However, the exact simulation of the full dynamic model usually takes an impractical amount of computational time and prevents this type of particle filters from being used for real-time applications, such as transcription regulation networks. Inspired by the recent development of hybrid approximations to multiscale chemical reaction networks, we approach the filtering problem in an alternative way. We first prove that accurate solutions to the filtering problem can be constructed by solving the filtering problem for a reduced model that represents the dynamics as a hybrid process. The model reduction is based on exploiting the time-scale separations in the original network and, therefore, can greatly reduce the computational effort required to simulate the dynamics. As a result, we are able to develop efficient particle filters to solve the filtering problem for the original model by applying particle filters to the reduced model. We illustrate the accuracy and the computational efficiency of our approach using several numerical examples.

preprint2020arXiv

Break It Down: A Question Understanding Benchmark

Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, showing that quality QDMRs can be annotated at scale, and release the Break dataset, containing over 83K pairs of questions and their QDMRs. We demonstrate the utility of QDMR by showing that (a) it can be used to improve open-domain question answering on the HotpotQA dataset, (b) it can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications. Last, we use Break to train a sequence-to-sequence model with copying that parses questions into QDMR structures, and show that it substantially outperforms several natural baselines.

preprint2020arXiv

GMAT: Global Memory Augmentation for Transformers

Transformer-based models have become ubiquitous in natural language processing thanks to their large capacity, innate parallelism and high performance. The contextualizing component of a Transformer block is the $\textit{pairwise dot-product}$ attention that has a large $Ω(L^2)$ memory requirement for length $L$ sequences, limiting its ability to process long documents. This has been the subject of substantial interest recently, where multiple approximations were proposed to reduce the quadratic memory requirement using sparse attention matrices. In this work, we propose to augment sparse Transformer blocks with a dense attention-based $\textit{global memory}$ of length $M$ ($\ll L$) which provides an aggregate global view of the entire input sequence to each position. Our augmentation has a manageable $O(M\cdot(L+M))$ memory overhead, and can be seamlessly integrated with prior sparse solutions. Moreover, global memory can also be used for sequence compression, by representing a long input sequence with the memory representations only. We empirically show that our method leads to substantial improvement on a range of tasks, including (a) synthetic tasks that require global reasoning, (b) masked language modeling, and (c) reading comprehension.

preprint2020arXiv

HeartFit: An Accurate Platform for Heart Murmur Diagnosis Utilizing Deep Learning

Cardiovascular disease (CD) is the number one leading cause of death worldwide, accounting for more than 17 million deaths in 2015. Critical indicators of CD include heart murmurs, intense sounds emitted by the heart during periods of irregular blood flow. Current diagnosis of heart murmurs relies on echocardiography (ECHO), which costs thousands of dollars and medical professionals to analyze the results, making it very unsuitable for areas with inadequate medical facilities. Thus, there is a need for an accessible alternative. Based on a simple interface and deep learning, HeartFit allows users to administer diagnoses themselves. An inexpensive, custom designed stethoscope in conjunction with a mobile application allows users to record and upload audio of their heart to a database. Using a deep learning network architecture, the database classifies the audio and returns the diagnosis to the user. The model consists of a deep recurrent convolutional neural network trained on 300 prelabeled heartbeat audio samples. After the model was validated on a previously unseen set of 100 heartbeat audio samples, it achieved a f beta score of 0.9545 and an accuracy of 95.5 percent. This value exceeds that of clinical examination accuracy, which is around 83 percent to 91 percent and costs orders of magnitude less than ECHO, demonstrating the effectiveness of the HeartFit platform. Through the platform, users can obtain immediate, accurate diagnosis of heart murmurs without any professional medical assistance, revolutionizing how we combat CD.

preprint2020arXiv

Injecting Numerical Reasoning Skills into Language Models

Large pre-trained language models (LMs) are known to encode substantial amounts of linguistic information. However, high-level reasoning skills, such as numerical reasoning, are difficult to learn from a language-modeling objective only. Consequently, existing models for numerical reasoning have used specialized architectures with limited flexibility. In this work, we show that numerical reasoning is amenable to automatic data generation, and thus one can inject this skill into pre-trained LMs, by generating large amounts of data, and training in a multi-task setup. We show that pre-training our model, GenBERT, on this data, dramatically improves performance on DROP (49.3 $\rightarrow$ 72.3 F1), reaching performance that matches state-of-the-art models of comparable size, while using a simple and general-purpose encoder-decoder architecture. Moreover, GenBERT generalizes well to math word problem datasets, while maintaining high performance on standard RC tasks. Our approach provides a general recipe for injecting skills into large pre-trained LMs, whenever the skill is amenable to automatic data augmentation.

preprint2020arXiv

Stochastic filters based on hybrid approximations of multiscale stochastic reaction networks

We consider the problem of estimating the dynamic latent states of an intracellular multiscale stochastic reaction network from time-course measurements of fluorescent reporters. We first prove that accurate solutions to the filtering problem can be constructed by solving the filtering problem for a reduced model that represents the dynamics as a hybrid process. The model reduction is based on exploiting the time-scale separations in the original network, and it can greatly reduce the computational effort required to simulate the dynamics. This enables us to develop efficient particle filters to solve the filtering problem for the original model by applying particle filters to the reduced model. We illustrate the accuracy and the computational efficiency of our approach using a numerical example.

preprint2019arXiv

A study of topological structures on equi-continuous mappings

Function space topologies are developed for EC(Y,Z), the class of equi-continuous mappings from a topological space Y to a uniform space Z. Properties such as splittingness, admissibility etc. are defined for such spaces. The net theoretic investigations are carried out to provide characterizations of splittingness and admissibility of function spaces on EC(Y,Z). The open-entourage topology and point-transitive-entourage topology are shown to be admissible and splitting respectively. Dual topologies are defined. A topology on EC(Y,Z) is found to be admissible (resp. splitting) if and only if its dual is so.

preprint2019arXiv

User-Interactive Machine Learning Model for Identifying Structural Relationships of Code Features

Traditional machine learning based intelligent systems assist users by learning patterns in data and making recommendations. However, these systems are limited in that the user has little means of understanding the rationale behind the systems suggestions, communicating their own understanding of patterns, or correcting system behavior. In this project, we outline a model for intelligent software based on a human computer feedback loop. The Machine Learning (ML) systems recommendations are reviewed by the user, and in turn, this information shapes the systems decision making. Our model was applied to developing an HTML editor that integrates ML with user interaction to ascertain structural relationships between HTML document features and apply them for code completion. The editor utilizes the ID3 algorithm to build decision trees, sequences of rules for predicting code the user will type. The editor displays the decision trees rules in the Interactive Rules Interface System (IRIS), which allows developers to prioritize, modify, or delete them. These interactions alter the data processed by ID3, providing the developer some control over the autocomplete system. Validation indicates that, absent user interaction, the ML model is able to predict tags with 78.4 percent accuracy, attributes with 62.9 percent accuracy, and values with 12.8 percent accuracy. Based off of the results of the user study, user interaction with the rules interface corrects feature relationships missed or mistaken by the automated process, enhancing autocomplete accuracy and developer productivity. Additionally, interaction is proven to help developers work with greater awareness of code patterns. Our research demonstrates the viability of a software integration of machine intelligence with human feedback.

preprint2016arXiv

Antithetic Integral Feedback ensures robust perfect adaptation in noisy biomolecular networks

Homeostasis is a running theme in biology. Often achieved through feedback regulation strategies, homeostasis allows living cells to control their internal environment as a means for surviving changing and unfavourable environments. While many endogenous homeostatic motifs have been studied in living cells, some other motifs may remain under-explored or even undiscovered. At the same time, known regulatory motifs have been mostly analyzed at the deterministic level, and the effect of noise on their regulatory function has received low attention. Here we lay the foundation for a regulation theory at the molecular level that explicitly takes into account the noisy nature of biochemical reactions and provides novel tools for the analysis and design of robust homeostatic circuits. Using these ideas, we propose a new regulation motif, which we refer to as {\em antithetic integral feedback, and demonstrate its effectiveness as a strategy for generically regulating a wide class of reaction networks. By combining tools from probability and control theory, we show that the proposed motif preserves the stability of the overall network, steers the population of any regulated species to a desired set point, and achieves robust perfect adaptation -- all with low prior knowledge of reaction rates. Moreover, our proposed regulatory motif can be implemented using a very small number of molecules and hence has a negligible metabolic load. Strikingly, the regulatory motif exploits stochastic noise, leading to enhanced regulation in scenarios where noise-free implementations result in dysregulation. Finally, we discuss the possible manifestation of the proposed antithetic integral feedback motif in endogenous biological circuits and its realization in synthetic circuits.

preprint2015arXiv

A Comparative Analysis of Tensor Decomposition Models Using Hyper Spectral Image

Hyper spectral imaging is a remote sensing technology, providing variety of applications such as material identification, space object identification, planetary exploitation etc. It deals with capturing continuum of images of the earth surface from different angles. Due to the multidimensional nature of the image, multi-way arrays are one of the possible solutions for analyzing hyper spectral data. This multi-way array is called tensor. Our approach deals with implementing three decomposition models LMLRA, BTD and CPD to the sample data for choosing the best decomposition of the data set. The results have proved that Block Term Decomposition (BTD) is the best tensor model for decomposing the hyper spectral image in to resultant factor matrices.

preprint2015arXiv

Adaptive Hybrid Simulations for Multiscale Stochastic Reaction Networks

The probability distribution describing the state of a Stochastic Reaction Network evolves according to the Chemical Master Equation (CME). It is common to estimated its solution using Monte Carlo methods such as the Stochastic Simulation Algorithm (SSA). In many cases these simulations can take an impractical amount of computational time. Therefore many methods have been developed that approximate the Stochastic Process underlying the Chemical Master Equation. Prominent strategies are Hybrid Models that regard the firing of some reaction channels as being continuous and applying the quasi-stationary assumption to approximate the dynamics of fast subnetworks. However as the dynamics of a Stochastic Reaction Network changes with time these approximations might have to be adapted during the simulation. We develop a method that approximates the solution of a CME by automatically partitioning the reaction dynamics into discrete/continuous components and applying the quasi-stationary assumption on identifiable fast subnetworks. Our method does not require user intervention and it adapts to exploit the changing timescale separation between reactions and/or changing magnitudes of copy numbers of constituent species. We demonstrate the efficiency of the proposed method by considering examples from Systems Biology and showing that very good approximations to the exact probability distributions can be achieved in significantly less computational time.

preprint2014arXiv

A scalable computational framework for establishing long-term behavior of stochastic reaction networks

Reaction networks are systems in which the populations of a finite number of species evolve through predefined interactions. Such networks are found as modeling tools in many biological disciplines such as biochemistry, ecology, epidemiology, immunology, systems biology and synthetic biology. It is now well-established that, for small population sizes, stochastic models for biochemical reaction networks are necessary to capture randomness in the interactions. The tools for analyzing such models, however, still lag far behind their deterministic counterparts. In this paper, we bridge this gap by developing a constructive framework for examining the long-term behavior and stability properties of the reaction dynamics in a stochastic setting. In particular, we address the problems of determining ergodicity of the reaction dynamics, which is analogous to having a globally attracting fixed point for deterministic dynamics. We also examine when the statistical moments of the underlying process remain bounded with time and when they converge to their steady state values. The framework we develop relies on a blend of ideas from probability theory, linear algebra and optimization theory. We demonstrate that the stability properties of a wide class of biological networks can be assessed from our sufficient theoretical conditions that can be recast as efficient and scalable linear programs, well-known for their tractability. It is notably shown that the computational complexity is often linear in the number of species. We illustrate the validity, the efficiency and the wide applicability of our results on several reaction networks arising in biochemistry, systems biology, epidemiology and ecology. The biological implications of the results as well as an example of a non-ergodic biological network are also discussed.

preprint2014arXiv

An efficient and unbiased method for sensitivity analysis of stochastic reaction networks

We consider the problem of estimating parameter sensitivity for Markovian models of reaction networks. Sensitivity values measure the responsiveness of an output to the model parameters. They help in analyzing the network, understanding its robustness properties and identifying the important reactions for a specific output. Sensitivity values are commonly estimated using methods that perform finite-difference computations along with Monte Carlo simulations of the reaction dynamics. These methods are computationally efficient and easy to implement, but they produce a biased estimate which can be unreliable for certain applications. Moreover the size of the bias is generally unknown and hence the accuracy of these methods cannot be easily determined. There also exist unbiased schemes for sensitivity estimation but these schemes can be computationally infeasible, even for simple networks. Our goal in this paper is to present a new method for sensitivity estimation, which combines the computational efficiency of finite-difference methods with the accuracy of unbiased schemes. Our method is easy to implement and it relies on an exact representation of parameter sensitivity that we recently proved in an earlier paper. Through examples we demonstrate that the proposed method can outperform the existing methods, both biased and unbiased, in many situations.

preprint2014arXiv

Critical Regression Analysis of Real Time Industrial Web Data Set Using Data Mining Tool

In todays fast pacing, highly competing,volatile and challenging world, companies highly rely on data analysis obtained from both offline as well as online way to make their future strategy, to sustain in the market. This paper reviews the regression technique analysis on a real time web data to analyse different attributes of interest and to predict possible growth factors for the company, so as to enable the company to make possible strategic decisions for the growth of the company.

preprint2014arXiv

Unbiased estimation of second-order parameter sensitivities for stochastic reaction networks

This paper deals with the problem of estimating second-order parameter sensitivities for stochastic reaction networks, where the reaction dynamics is modeled as a continuous time Markov chain over a discrete state space. Estimation of such second-order sensitivities (the Hessian) is necessary for implementing the Newton-Raphson scheme for optimization over the parameter space. To perform this estimation, Wolf and Anderson have proposed an efficient finite-difference method, that uses a coupling of perturbed processes to reduce the estimator variance. The aim of this paper is to illustrate that the same coupling can be exploited to derive an exact representation for second-order parameter sensitivity. Furthermore with this representation one can construct an unbiased estimator which is easy to implement. The ideas contained in this paper are extensions of the ideas presented in our recent papers on first-order parameter sensitivity estimation.

preprint2013arXiv

Determining the long-term behavior of cell populations: A new procedure for detecting ergodicity in large stochastic reaction networks

A reaction network consists of a finite number of species, which interact through predefined reaction channels. Traditionally such networks were modeled deterministically, but it is now well-established that when reactant copy numbers are small, the random timing of the reactions create internal noise that can significantly affect the macroscopic properties of the system. To understand the role of noise and quantify its effects, stochastic models are necessary. In the stochastic setting, the population is described by a probability distribution, which evolves according to a set of ordinary differential equations known as the Chemical Master Equation (CME). This set is infinite in most cases making the CME practically unsolvable. In many applications, it is important to determine if the solution of a CME has a globally attracting fixed point. This property is called ergodicity and its presence leads to several important insights about the underlying dynamics. The goal of this paper is to present a simple procedure to verify ergodicity in stochastic reaction networks. We provide a set of simple linear-algebraic conditions which are sufficient for the network to be ergodic. In particular, our main condition can be cast as a Linear Feasibility Problem (LFP) which is essentially the problem of determining the existence of a vector satisfying certain linear constraints. The inherent scalability of LFPs make our approach efficient, even for very large networks. We illustrate our procedure through an example from systems biology.

preprint2013arXiv

Sensitivity analysis for stochastic chemical reaction networks with multiple time-scales

Stochastic models for chemical reaction networks have become very popular in recent years. For such models, the estimation of parameter sensitivities is an important and challenging problem. Sensitivity values help in analyzing the network, understanding its robustness properties and also in identifying the key reactions for a given outcome. Most of the methods that exist in the literature for the estimation of parameter sensitivities, rely on Monte Carlo simulations using Gillespie's stochastic simulation algorithm or its variants. It is well-known that such simulation methods can be prohibitively expensive when the network contains reactions firing at different time-scales, which is a feature of many important biochemical networks. For such networks, it is often possible to exploit the time-scale separation and approximately capture the original dynamics by simulating a "reduced" model, which is obtained by eliminating the fast reactions in a certain way. The aim of this paper is to tie these model reduction techniques with sensitivity analysis. We prove that under some conditions, the sensitivity values of the reduced model can be used to approximately recover the sensitivity values for the original model. Through an example we illustrate how our result can help in sharply reducing the computational costs for the estimation of parameter sensitivities for reaction networks with multiple time-scales. To prove our result, we use coupling arguments based on the random time change representation of Kurtz. We also exploit certain connections between the distributions of the occupation times of Markov chains and multi-dimensional wave equations.

preprint2012arXiv

A new proof for the convergence of an individual based model to the Trait substitution sequence

We consider a continuous time stochastic individual based model for a population structured only by an inherited vector trait and with logistic interactions. We consider its limit in a context from adaptive dynamics: the population is large, the mutations are rare and we view the process in the timescale of mutations. Using averaging techniques due to Kurtz (1992), we give a new proof of the convergence of the individual based model to the trait substitution sequence of Metz et al. (1992) first worked out by Dieckman and Law (1996) and rigorously proved by Champagnat (2006): rigging the model such that "invasion implies substitution", we obtain in the limit a process that jumps from one population equilibrium to another when mutations occur and invade the population.

preprint2012arXiv

NLOS UV Channel Modeling Using Numerical Integration and an Approximate Closed-Form Path Loss Model

In this paper we propose a simulation method using numerical integration, and develop a closed-form link loss model for physical layer channel characterization for non-line of sight (NLOS) ultraviolet (UV) communication systems. The impulse response of the channel is calculated by assuming both uniform and Gaussian profiles for transmitted beams and different geometries. The results are compared with previously published results. The accuracy of the integration approach is compared to the Monte Carlo simulation. Then the path loss using the simulation method and the suggested closed-form expression are presented for different link geometries. The accuracies are evaluated and compared to the results obtained using other methods.

preprint2012arXiv

Stochastic model for cell polarity

Cell polarity refers to the spatial asymmetry of molecules on the cell membrane. Altschuler, Angenent, Wang and Wu have proposed a stochastic model for studying the emergence of polarity in the presence of feedback between molecules. We analyze their model further by representing it as a model of an evolving population with interacting individuals. Under a suitable scaling of parameters, we show that in the infinite population limit we get a Fleming--Viot process. Using well-known results for such processes, we establish that cell polarity is exhibited by the model and also study its dependence on the biological parameters of the model.

preprint2012arXiv

The Fleming-Viot limit of an interacting spatial population with fast density regulation

We consider population models in which the individuals reproduce, die and also migrate in space. The population size scales according to some parameter $N$, which can have different interpretations depending on the context. Each individual is assigned a mass of 1/N and the total mass in the system is called \emph{population density}. The dynamics has an intrinsic density regulation mechanism that drives the population density towards an equilibrium. We show that under a timescale separation between the \emph{slow} migration mechanism and the \emph{fast} density regulation mechanism, the population dynamics converges to a Fleming-Viot process as the scaling parameter $N$ approaches $\infty$. We first prove this result for a basic model in which the birth and death rates can only depend on the population density. In this case we obtain a \emph{neutral} Fleming-Viot process. We then extend this model by including position-dependence in the birth and death rates, as well as, offspring dispersal and immigration mechanisms. We show how these extensions add \emph{mutation} and \emph{selection} to the limiting Fleming-Viot process. All the results are proved in a multi-type setting, where there are $q$ types of individuals interacting with each other. We illustrate the usefulness of our convergence result by discussing applications in population genetics and cell biology.

preprint2012arXiv

Unbiased estimation of parameter sensitivities for stochastic chemical reaction networks

Estimation of parameter sensitivities for stochastic chemical reaction networks is an important and challenging problem. Sensitivity values are important in the analysis, modeling and design of chemical networks. They help in understanding the robustness properties of the system and also in identifying the key reactions for a given outcome. In a discrete setting, most of the methods that exist in the literature for the estimation of parameter sensitivities rely on Monte Carlo simulations along with finite difference computations. However these methods introduce a bias in the sensitivity estimate and in most cases the size or direction of the bias remains unknown, potentially damaging the accuracy of the analysis. In this paper, we use the random time change representation of Kurtz to derive an exact formula for parameter sensitivity. This formula allows us to construct an unbiased estimator for parameter sensitivity, which can be efficiently evaluated using a suitably devised Monte Carlo scheme. The existing literature contains only one method to produce such an unbiased estimator. This method was proposed by Plyasunov and Arkin and it is based on the Girsanov measure transformation. By taking a couple of examples we compare our method to this existing method. Our results indicate that our method can be much faster than the existing method while computing sensitivity with respect to a reaction rate constant which is small in magnitude. This rate constant could correspond to a reaction which is slow in the reference time-scale of the system. Since many biological systems have such slow reactions, our method can be a useful tool for sensitivity analysis.

preprint2010arXiv

A Decentralized Approach for Service Discovery & Availability in P-Grids

The widespread emergence of the Internet as a platform for electronic data distribution and the advent of structured information have revolutionized our ability to deliver information to any corner of the world. Although Service Oriented Architecture (SOA) is a paradigm for organizing and utilizing distributed capabilities that may be under the control of different ownership domains and implemented using various technology stacks and every organization may not be geared up for this. To harness the various software / service resources placed on various systems, we have proposed and implemented a model that is able to establish discovery and sharing in load balanced P-grid environment. The experimental results show that the proposed approach has dramatically lowered the network traffic (nearly negligible), while achieving load balancing in P2P grid systems. Our model is able to support discovery and sharing of resources also.

Ankit Gupta

What is connected

Connect this record

See the researcher in context

Building this map preview

30 published item(s)

Video Active Perception: Effective Inference-Time Long-Form Video Understanding with Vision-Language Models

Adaptive Traffic Signal Control for Developing Countries Using Fused Parameters Derived from Crowd-Source Data

Braitenberg Vehicles as Developmental Neurosimulation

Diagonal State Spaces are as Effective as Structured State Spaces

Long Range Language Modeling via Gated State Spaces

Machine Learning-based Urban Canyon Path Loss Prediction using 28 GHz Manhattan Measurements

On the Parameterization and Initialization of Diagonal State Space Models

Stochastic filtering for multiscale stochastic reaction networks based on hybrid approximations

Break It Down: A Question Understanding Benchmark

GMAT: Global Memory Augmentation for Transformers

HeartFit: An Accurate Platform for Heart Murmur Diagnosis Utilizing Deep Learning

Injecting Numerical Reasoning Skills into Language Models

Stochastic filters based on hybrid approximations of multiscale stochastic reaction networks

A study of topological structures on equi-continuous mappings

User-Interactive Machine Learning Model for Identifying Structural Relationships of Code Features

Antithetic Integral Feedback ensures robust perfect adaptation in noisy biomolecular networks

A Comparative Analysis of Tensor Decomposition Models Using Hyper Spectral Image

Adaptive Hybrid Simulations for Multiscale Stochastic Reaction Networks

A scalable computational framework for establishing long-term behavior of stochastic reaction networks

An efficient and unbiased method for sensitivity analysis of stochastic reaction networks

Critical Regression Analysis of Real Time Industrial Web Data Set Using Data Mining Tool

Unbiased estimation of second-order parameter sensitivities for stochastic reaction networks

Determining the long-term behavior of cell populations: A new procedure for detecting ergodicity in large stochastic reaction networks

Sensitivity analysis for stochastic chemical reaction networks with multiple time-scales

A new proof for the convergence of an individual based model to the Trait substitution sequence

NLOS UV Channel Modeling Using Numerical Integration and an Approximate Closed-Form Path Loss Model

Stochastic model for cell polarity

The Fleming-Viot limit of an interacting spatial population with fast density regulation

Unbiased estimation of parameter sensitivities for stochastic chemical reaction networks

A Decentralized Approach for Service Discovery & Availability in P-Grids