Researcher profile

Prateek Bansal

Prateek Bansal contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2025arXiv

Scalable Variational Inference for Multinomial Probit Models under Large Choice Sets and Sample Sizes

The multinomial probit (MNP) model is widely used to analyze categorical outcomes due to its ability to capture flexible substitution patterns among alternatives. Conventional likelihood based and Markov chain Monte Carlo (MCMC) estimators become computationally prohibitive in high dimensional choice settings. This study introduces a fast and accurate conditional variational inference (CVI) approach to calibrate MNP model parameters, which is scalable to large samples and large choice sets. A flexible variational distribution on correlated latent utilities is defined using neural embeddings, and a reparameterization trick is used to ensure the positive definiteness of the resulting covariance matrix. The resulting CVI estimator is similar to a variational autoencoder, with the variational model being the encoder and the MNP's data generating process being the decoder. Straight through estimation and Gumbel SoftMax approximation are adopted for the argmax operation to select an alternative with the highest latent utility. This eliminates the need to sample from high dimensional truncated Gaussian distributions, significantly reducing computational costs as the number of alternatives grows. The proposed method achieves parameter recovery comparable to MCMC. It can calibrate MNP parameters with 20 alternatives and one million observations in approximately 28 minutes roughly 36 times faster and more accurate than the existing benchmarks in recovering model parameters.

preprint2022arXiv

A Deep Generative Model for Feasible and Diverse Population Synthesis

An ideal synthetic population, a key input to activity-based models, mimics the distribution of the individual- and household-level attributes in the actual population. Since the entire population's attributes are generally unavailable, household travel survey (HTS) samples are used for population synthesis. Synthesizing population by directly sampling from HTS ignores the attribute combinations that are unobserved in the HTS samples but exist in the population, called 'sampling zeros'. A deep generative model (DGM) can potentially synthesize the sampling zeros but at the expense of generating 'structural zeros' (i.e., the infeasible attribute combinations that do not exist in the population). This study proposes a novel method to minimize structural zeros while preserving sampling zeros. Two regularizations are devised to customize the training of the DGM and applied to a generative adversarial network (GAN) and a variational autoencoder (VAE). The adopted metrics for feasibility and diversity of the synthetic population indicate the capability of generating sampling and structural zeros -- lower structural zeros and lower sampling zeros indicate the higher feasibility and the lower diversity, respectively. Results show that the proposed regularizations achieve considerable performance improvement in feasibility and diversity of the synthesized population over traditional models. The proposed VAE additionally generated 23.5% of the population ignored by the sample with 79.2% precision (i.e., 20.8% structural zeros rates), while the proposed GAN generated 18.3% of the ignored population with 89.0% precision. The proposed improvement in DGM generates a more feasible and diverse synthetic population, which is critical for the accuracy of an activity-based model.

preprint2022arXiv

DT2I: Dense Text-to-Image Generation from Region Descriptions

Despite astonishing progress, generating realistic images of complex scenes remains a challenging problem. Recently, layout-to-image synthesis approaches have attracted much interest by conditioning the generator on a list of bounding boxes and corresponding class labels. However, previous approaches are very restrictive because the set of labels is fixed a priori. Meanwhile, text-to-image synthesis methods have substantially improved and provide a flexible way for conditional image generation. In this work, we introduce dense text-to-image (DT2I) synthesis as a new task to pave the way toward more intuitive image generation. Furthermore, we propose DTC-GAN, a novel method to generate images from semantically rich region descriptions, and a multi-modal region feature matching loss to encourage semantic image-text matching. Our results demonstrate the capability of our approach to generate plausible images of complex scenes using region captions.

preprint2022arXiv

Fuel consumption elasticities, rebound effect and feebate effectiveness in the Indian and Chinese new car markets

China and India, the world's two most populous developing economies, are also among the world's largest automotive markets and carbon emitters. To reduce carbon emissions from the passenger car sector, both countries have considered various policy levers affecting fuel prices, car prices and fuel economy. This study estimates the responsiveness of new car buyers in China and India to such policy levers and drivers including income. Furthermore, we estimate the potential for rebound effect and the effectiveness of a feebate policy. To accomplish this, we developed a joint discrete-continuous model of car choice and usage based on revealed preference survey data from approximately 8000 new car buyers from India and China who purchased cars in 2016-17. Conditional on buying a new car, the fuel consumption in both markets is found to be relatively unresponsive to fuel price and income, with magnitudes of elasticity estimates ranging from 0.12 to 0.15. For both markets, the mean segment-level direct elasticities of fuel consumption relative to car price and fuel economy range from 0.57 to 0.65. The rebound effect on fuel savings due to cost-free fuel economy improvement is found to be 17.1% for India and 18.8% for China. A revenue-neutral feebate policy, with average rebates and fees of up to around 15% of the retail price, resulted in fuel savings of around 0.7% for both markets. While the feebate policy's rebound effect is low - 7.3% for India and 1.6% for China - it does not appear to be an effective fuel conservation policy.

preprint2022arXiv

Optimal congestion control strategies for near-capacity urban metros: informing intervention via fundamental diagrams

Congestion; operational delays due to a vicious circle of passenger-congestion and train-queuing; is an escalating problem for metro systems because it has negative consequences from passenger discomfort to eventual mode-shifts. Congestion arises due to large volumes of passenger boardings and alightings at bottleneck stations, which may lead to increased stopping times at stations and consequent queuing of trains upstream, further reducing line throughput and implying an even greater accumulation of passengers at stations. Alleviating congestion requires control strategies such as regulating the inflow of passengers entering bottleneck stations. The availability of large-scale smartcard and train movement data from day-to-day operations facilitates the development of models that can inform such strategies in a data-driven way. In this paper, we propose to model station-level passenger-congestion via empirical passenger boarding-alightings and train flow relationships, henceforth, fundamental diagrams (FDs). We emphasise that estimating FDs using station-level data is empirically challenging due to confounding biases arising from the interdependence of operations at different stations, which obscures the true sources of congestion in the network. We thus adopt a causal statistical modelling approach to produce FDs that are robust to confounding and as such suitable to properly inform control strategies. The closest antecedent to the proposed model is the FD for road traffic networks, which informs traffic management strategies, for instance, via locating the optimum operation point. Our analysis of data from the Mass Transit Railway, Hong Kong indicates the existence of concave FDs at identified bottleneck stations, and an associated critical level of boarding-alightings above which congestion sets-in unless there is an intervention.

preprint2020arXiv

A Dynamic Choice Model with Heterogeneous Decision Rules: Application in Estimating the User Cost of Rail Crowding

Crowding valuation of subway riders is an important input to various supply-side decisions of transit operators. The crowding cost perceived by a transit rider is generally estimated by capturing the trade-off that the rider makes between crowding and travel time while choosing a route. However, existing studies rely on static compensatory choice models and fail to account for inertia and the learning behaviour of riders. To address these challenges, we propose a new dynamic latent class model (DLCM) which (i) assigns riders to latent compensatory and inertia/habit classes based on different decision rules, (ii) enables transitions between these classes over time, and (iii) adopts instance-based learning theory to account for the learning behaviour of riders. We use the expectation-maximisation algorithm to estimate DLCM, and the most probable sequence of latent classes for each rider is retrieved using the Viterbi algorithm. The proposed DLCM can be applied in any choice context to capture the dynamics of decision rules used by a decision-maker. We demonstrate its practical advantages in estimating the crowding valuation of an Asian metro's riders. To calibrate the model, we recover the daily route preferences and in-vehicle crowding experiences of regular metro riders using a two-month-long smart card and vehicle location data. The results indicate that the average rider follows the compensatory rule on only 25.5% of route choice occasions. DLCM estimates also show an increase of 47% in metro riders' valuation of travel time under extremely crowded conditions relative to that under uncrowded conditions.

preprint2020arXiv

A Generalized Continuous-Multinomial Response Model with a t-distributed Error Kernel

In multinomial response models, idiosyncratic variations in the indirect utility are generally modeled using Gumbel or normal distributions. This study makes a strong case to substitute these thin-tailed distributions with a t-distribution. First, we demonstrate that a model with a t-distributed error kernel better estimates and predicts preferences, especially in class-imbalanced datasets. Our proposed specification also implicitly accounts for decision-uncertainty behavior, i.e. the degree of certainty that decision-makers hold in their choices relative to the variation in the indirect utility of any alternative. Second, after applying a t-distributed error kernel in a multinomial response model for the first time, we extend this specification to a generalized continuous-multinomial (GCM) model and derive its full-information maximum likelihood estimator. The likelihood involves an open-form expression of the cumulative density function of the multivariate t-distribution, which we propose to compute using a combination of the composite marginal likelihood method and the separation-of-variables approach. Third, we establish finite sample properties of the GCM model with a t-distributed error kernel (GCM-t) and highlight its superiority over the GCM model with a normally-distributed error kernel (GCM-N) in a Monte Carlo study. Finally, we compare GCM-t and GCM-N in an empirical setting related to preferences for electric vehicles (EVs). We observe that accounting for decision-uncertainty behavior in GCM-t results in lower elasticity estimates and a higher willingness to pay for improving the EV attributes than those of the GCM-N model. These differences are relevant in making policies to expedite the adoption of EVs.

preprint2020arXiv

A New Spatial Count Data Model with Bayesian Additive Regression Trees for Accident Hot Spot Identification

The identification of accident hot spots is a central task of road safety management. Bayesian count data models have emerged as the workhorse method for producing probabilistic rankings of hazardous sites in road networks. Typically, these methods assume simple linear link function specifications, which, however, limit the predictive power of a model. Furthermore, extensive specification searches are precluded by complex model structures arising from the need to account for unobserved heterogeneity and spatial correlations. Modern machine learning (ML) methods offer ways to automate the specification of the link function. However, these methods do not capture estimation uncertainty, and it is also difficult to incorporate spatial correlations. In light of these gaps in the literature, this paper proposes a new spatial negative binomial model, which uses Bayesian additive regression trees to endogenously select the specification of the link function. Posterior inference in the proposed model is made feasible with the help of the Polya-Gamma data augmentation technique. We test the performance of this new model on a crash count data set from a metropolitan highway network. The empirical results show that the proposed model performs at least as well as a baseline spatial count data model with random parameters in terms of goodness of fit and site ranking ability.

preprint2020arXiv

A New Spatial Count Data Model with Time-varying Parameters

Recent crash frequency studies incorporate spatiotemporal correlations, but these studies have two key limitations: i) none of these studies accounts for temporal variation in model parameters; and ii) Gibbs sampler suffers from convergence issues due to non-conjugacy. To address the first limitation, we propose a new count data model that identifies the underlying temporal patterns of the regression parameters while simultaneously allowing for time-varying spatial correlation. The model is also extended to incorporate heterogeneity in non-temporal parameters across spatial units. We tackle the second shortcoming by deriving a Gibbs sampler that ensures conditionally conjugate posterior updates for all model parameters. To this end, we take the advantages of Pólya-Gamma data augmentation and forward filtering backward sampling (FFBS) algorithm. After validating the properties of the Gibbs sampler in a Monte Carlo study, the advantages of the proposed specification are demonstrated in an empirical application to uncover relationships between crash frequency spanning across nine years and pavement characteristics. Model parameters exhibit practically significant temporal patterns (i.e., temporal instability). For example, the safety benefits of better pavement ride quality are estimated to increase over time.

preprint2020arXiv

Biogeography-Based Optimization and Support Vector Regression for Freeway Travel Time Prediction and Feature Selection

As travelers make their choices based on travel time, its prior information can be helpful for them in making more informed travel decisions. To achieve this goal, travel time prediction models have been proposed in literature, but identification of important predictors has not received much attention. Identification of important predictors reduces dimensions of input data, which not only lessens computational load, but also provides better understanding of underlying relationship between important predictors and travel time. Moreover, collection of only important predictors can lead to a significant equipment savings in data collection. Therefore, this study proposes a hybrid approach for feature selection (identifying important predictors) along with developing a robust freeway travel time prediction model. A framework integrating biogeography-based optimization (BBO) and support vector regression (SVR) has been developed. It was validated by predicting travel time at 36.1 km long segment of National Taiwan Freeway No. 1. The proposed hybrid approach is able to develop a prediction model with only six predictors, which is found to have accuracy equivalent to a stand-alone SVR prediction model developed with all forty three predictors.

preprint2020arXiv

Matlab-Vissim Interface for online optimization of Green Time splits

VISSIM is a widely used microscopic traffic simulator, which not only provides a graphical user interface to simulate simple static controls (pre-timed or fixed-time) but also offers flexibility to dynamically control simulation through versatile programming languages (C++ and Java etc.). However, to implement various traffic control techniques, integration of computational tools may save lots of effort and time as compared to standard programming platforms. MATLAB falls in the category of a widely used computational tool and also fulfills the primary requirements to control VISSIM simulation dynamically. Therefore, this study proposes and develops a direct interface between MATLAB and VISSIM to extensively harness the computational power of MATLAB. The significance of developed interface is demonstrated on a practical scenario, by conducting an online optimization of green time splits on a study network.

preprint2020arXiv

Variational Bayesian Inference for Mixed Logit Models with Unobserved Inter- and Intra-Individual Heterogeneity

Variational Bayes (VB), a method originating from machine learning, enables fast and scalable estimation of complex probabilistic models. Thus far, applications of VB in discrete choice analysis have been limited to mixed logit models with unobserved inter-individual taste heterogeneity. However, such a model formulation may be too restrictive in panel data settings, since tastes may vary both between individuals as well as across choice tasks encountered by the same individual. In this paper, we derive a VB method for posterior inference in mixed logit models with unobserved inter- and intra-individual heterogeneity. In a simulation study, we benchmark the performance of the proposed VB method against maximum simulated likelihood (MSL) and Markov chain Monte Carlo (MCMC) methods in terms of parameter recovery, predictive accuracy and computational efficiency. The simulation study shows that VB can be a fast, scalable and accurate alternative to MSL and MCMC estimation, especially in applications in which fast predictions are paramount. VB is observed to be between 2.8 and 17.7 times faster than the two competing methods, while affording comparable or superior accuracy. Besides, the simulation study demonstrates that a parallelised implementation of the MSL estimator with analytical gradients is a viable alternative to MCMC in terms of both estimation accuracy and computational efficiency, as the MSL estimator is observed to be between 0.9 and 2.1 times faster than MCMC.