Researcher profile

Vikas Kumar

Vikas Kumar contributes to research discovery and scholarly infrastructure.

Researcher - Affiliation not imported - Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging - Verification L1 - Unclaimed author
10 works
0 followers
8 topics
4 close collaborators

Published work

10 published items

preprint - 2026 - arXiv

What Matters in Data Curation for Multimodal Reasoning? Insights from the DCVLR Challenge

We study data curation for multimodal reasoning through the NeurIPS 2025 Data Curation for Vision-Language Reasoning (DCVLR) challenge, which isolates dataset selection by fixing the model and training protocol. Using a compact curated dataset derived primarily from Walton Multimodal Cold Start, our submission placed first in the challenge. Through post-competition ablations, we show that difficulty-based example selection on an aligned base dataset is the dominant driver of performance gains. Increasing dataset size does not reliably improve mean accuracy under the fixed training recipe, but mainly reduces run-to-run variance, while commonly used diversity and synthetic augmentation heuristics provide no additional benefit and often degrade performance. These results characterize DCVLR as a saturation-regime evaluation and highlight the central role of alignment and difficulty in data-efficient multimodal reasoning.
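A minimal sketch of difficulty-based example selection, assuming difficulty is measured as the base model's pass rate over several sampled attempts per example (lower pass rate means harder). The abstract does not say how difficulty was scored, so that measure, the function name `select_by_difficulty`, and the toy ids are all hypothetical.

```python
from typing import Dict, List


def select_by_difficulty(pass_rates: Dict[str, float], budget: int) -> List[str]:
    """Return up to `budget` example ids, hardest first.

    `pass_rates` maps an example id to the (assumed) fraction of base-model
    attempts that solved it. Ties are broken by id so the result is stable.
    """
    ranked = sorted(pass_rates, key=lambda ex_id: (pass_rates[ex_id], ex_id))
    return ranked[:budget]


# Toy data: hypothetical pass rates for four candidate examples.
toy_pass_rates = {"ex1": 0.9, "ex2": 0.1, "ex3": 0.5, "ex4": 0.3}
selected = select_by_difficulty(toy_pass_rates, budget=2)
print(selected)
```

Ranking by a single scalar keeps the selection rule easy to ablate against dataset size, which is the comparison the abstract describes.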

preprint - 2022 - arXiv

Inductive Conformal Recommender System

Traditional recommendation algorithms develop techniques that help people choose desirable items. However, in many real-world applications it is also essential to quantify the (un)certainty of each recommendation in the set. The conformal recommender system uses the experience of a user to output a set of recommendations, each associated with a precise confidence value. Given a significance level $\varepsilon$, it bounds the probability of making a wrong recommendation by $\varepsilon$. The conformal framework relies on a key concept called the nonconformity measure, which measures the strangeness of an item with respect to other items. One of the significant design challenges of any conformal recommendation framework is integrating nonconformity measures with the recommendation algorithm. This paper introduces an inductive variant of a conformal recommender system. We propose and analyze different nonconformity measures in the inductive setting. We also provide theoretical proofs of the error bound and the time complexity. Extensive empirical analysis on ten benchmark datasets demonstrates that the inductive variant substantially reduces computation time while preserving accuracy.
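The role of the significance level can be illustrated with a generic inductive conformal sketch. It assumes nonconformity scores have already been computed on a held-out calibration set; a candidate's p-value is the fraction of scores at least as strange as its own, and the candidate is kept when that p-value exceeds $\varepsilon$. The function names and toy scores are hypothetical, and the nonconformity measures proposed in the paper are not reproduced here.

```python
from typing import Dict, List


def conformal_p_value(calibration_scores: List[float], candidate_score: float) -> float:
    """Fraction of scores (calibration plus the candidate itself) that are
    at least as large, i.e. at least as strange, as the candidate's score."""
    at_least_as_strange = sum(1 for s in calibration_scores if s >= candidate_score)
    return (at_least_as_strange + 1) / (len(calibration_scores) + 1)


def recommend(calibration_scores: List[float],
              candidate_scores: Dict[str, float],
              epsilon: float) -> Dict[str, float]:
    """Keep every candidate whose p-value exceeds epsilon; return item -> p-value."""
    kept = {}
    for item, score in candidate_scores.items():
        p = conformal_p_value(calibration_scores, score)
        if p > epsilon:
            kept[item] = p
    return kept


# Toy data: nine hypothetical calibration scores and two candidate items.
calibration = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
candidates = {"item_a": 0.25, "item_b": 0.95}
print(recommend(calibration, candidates, epsilon=0.2))
```

Because the calibration scores are computed once and reused for every candidate, each new p-value costs only a pass over the calibration set, which is the usual source of the inductive variant's speed advantage.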

preprint - 2022 - arXiv

Transfer of codebook latent factors for cross-domain recommendation with non-overlapping data

Recommender systems based on collaborative filtering play a vital role in many e-commerce applications, as they guide users to items of interest based on their past transactions and the feedback of similar customers. Data sparsity, which arises from the small number of transactions and the limited feedback data, is one of the major drawbacks of collaborative filtering. To reduce the sparsity problem, transfer learning (cross-domain recommendation) techniques have emerged. In transfer learning methods, data from other dense domain(s) (the source) is used to predict the missing ratings in the sparse domain (the target). In this paper, we propose a novel transfer learning approach for cross-domain recommendation, wherein the cluster-level rating pattern (codebook) of the source domain is obtained via a co-clustering technique. We then apply the Maximum Margin Matrix Factorization (MMMF) technique to the codebook in order to learn its user and item latent features. The target rating matrix is predicted by introducing these latent features in a novel way into the optimisation function. In the experiments, we demonstrate that our model improves the prediction accuracy of the target matrix on benchmark datasets.

preprint - 2021 - arXiv

Assessing Fairness in Classification Parity of Machine Learning Models in Healthcare

Fairness in AI and machine learning systems has become a fundamental problem in the accountability of AI systems. While the need for accountability of AI models is near ubiquitous, healthcare in particular is a challenging field where accountability of such systems takes on additional importance, as decisions in healthcare can have life-altering consequences. In this paper we present preliminary results on fairness in the context of classification parity in healthcare. We also present some exploratory methods for improving fairness and for choosing appropriate classification algorithms in the context of healthcare.
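One common way to make classification parity concrete is to compute the same classification metric for each group and compare the values. The sketch below assumes binary labels and a single group attribute, and uses the true positive rate as the metric; the metric choice, function names, and toy data are illustrative assumptions rather than the paper's method.

```python
from collections import defaultdict
from typing import Dict, Sequence


def true_positive_rate_by_group(y_true: Sequence[int],
                                y_pred: Sequence[int],
                                groups: Sequence[str]) -> Dict[str, float]:
    """Per-group TPR: among examples whose true label is 1, the share predicted 1.
    Groups with no positive examples are omitted."""
    positives = defaultdict(int)
    true_positives = defaultdict(int)
    for truth, pred, group in zip(y_true, y_pred, groups):
        if truth == 1:
            positives[group] += 1
            if pred == 1:
                true_positives[group] += 1
    return {g: true_positives[g] / positives[g] for g in positives}


def parity_gap(rates: Dict[str, float]) -> float:
    """Largest difference in the metric between any two groups (0 means parity)."""
    return max(rates.values()) - min(rates.values())


# Toy data: six hypothetical patients in two groups, "a" and "b".
y_true = [1, 1, 0, 1, 1, 0]
y_pred = [1, 1, 0, 1, 0, 1]
groups = ["a", "a", "a", "b", "b", "b"]

rates = true_positive_rate_by_group(y_true, y_pred, groups)
print(rates, parity_gap(rates))
```

The same pattern applies to other metrics, such as the false positive rate or the positive prediction rate, by changing which examples are counted.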

preprint - 2021 - arXiv

Emergency Department Optimization and Load Prediction in Hospitals

Over the past several years, the number of people seeking care in emergency departments (EDs) has increased across the globe. ED resources, including nurse staffing, are strained by such increases in patient volume. Accurate forecasting of incoming patient volume in EDs is crucial for efficient utilization and allocation of ED resources. Working with a suburban ED in the Pacific Northwest, we developed a tool powered by machine learning models to forecast ED arrivals and ED patient volume and to assist end users, such as ED nurses, in resource allocation. In this paper, we discuss the results from our predictive models, the challenges, and the lessons learned from users' experiences with the tool in active clinical deployment in a real-world setting.

preprint - 2021 - arXiv

Noisy Student Training using Body Language Dataset Improves Facial Expression Recognition

Facial expression recognition from videos in the wild is a challenging task due to the lack of abundant labelled training data. Large DNN (deep neural network) architectures and ensemble methods have improved performance, but they soon saturate because of insufficient data. In this paper, we use a self-training method that utilizes a combination of a labelled dataset and an unlabelled dataset (Body Language Dataset - BoLD). Experimental analysis shows that training a noisy student network iteratively helps in achieving significantly better results. Additionally, our model isolates different regions of the face and processes them independently using a multi-level attention mechanism, which further boosts performance. Our results show that the proposed method achieves state-of-the-art performance on the benchmark datasets CK+ and AFEW 8.0 when compared to other single models.

preprint - 2021 - arXiv

Pattern Formation Study of an Eco-epidemiological Model with Cannibalism and Disease in Predator Population

Pattern formation in eco-epidemiological models with cannibalism and disease has been little explored in the literature. Motivated by this, we propose a diffusive eco-epidemiological model and perform a pattern formation analysis of the model system. Sufficient conditions for local and global asymptotic stability of the constant positive steady state are obtained by linearization and the Lyapunov function technique. An a priori estimate for the positive steady state, needed to establish the nonexistence of nonconstant positive solutions, is obtained using the Cauchy and Poincaré inequalities. The existence of nonconstant positive steady states is studied using Leray-Schauder degree theory. The importance of the diffusion coefficients, which are responsible for the appearance of stationary patterns, is observed. Pattern formation is studied by numerical simulation. Further, the effects of cannibalism and disease on the dynamics of the proposed model system are observed. The movements of prey and susceptible predators play a significant role in pattern formation: they cause both stationary and non-stationary patterns. An increase in the movement of the susceptible predator, as well as in the cannibalistic attack rate, converts non-Turing patterns to Turing patterns. The Lyapunov spectrum is calculated to quantify stable and unstable dynamics. Non-Turing patterns obtained with a parameter set having an unstable limit cycle are more interesting and realistic than stationary patterns. Both stationary and non-stationary non-Turing patterns are obtained.

preprint - 2020 - arXiv

Committee Selection with Attribute Level Preferences

We consider the problem of committee selection from a fixed set of candidates where each candidate has multiple quantifiable attributes. To select the best possible committee, instead of voting for a candidate, a voter is allowed to approve the preferred attributes of a given candidate. Although attribute-based preferences have been addressed in several contexts, the committee selection problem with attribute approval by voters has not been attempted before. A committee formed on attribute preferences is more likely to be a better representative of the qualities desired by the voters and is less likely to be susceptible to collusion or manipulation. In this work, we provide a formal study of the different aspects of this problem and define the properties of weak unanimity, strong unanimity, simple justified representation and compound justified representation, which the selected committee is required to satisfy. We show that none of the existing vote/approval aggregation rules satisfies these new properties for attribute aggregation. We describe a greedy approach for attribute aggregation that satisfies the first three properties, but not the fourth, i.e., compound justified representation, which we prove to be NP-complete. Furthermore, we prove that finding a committee with justified representation and the highest approval voting score is NP-complete.

preprint - 2020 - arXiv

Gamow-Teller transition strengths for selected ${fp}$ shell nuclei

We report a systematic shell model description of the experimental Gamow-Teller transition strengths for the $^{44}$Sc $\rightarrow$ $^{44}$Ca, $^{45}$Ti $\rightarrow$ $^{45}$Sc, $^{48}$Ti $\rightarrow$ $^{48}$V, $^{66}$Co $\rightarrow$ $^{66}$Ni, and $^{66}$Fe $\rightarrow$ $^{66}$Co transitions using the KB3G and GXPF1A interactions in the $fp$ model space. To assess the importance of higher orbitals for the $^{66}$Co $\rightarrow$ $^{66}$Ni and $^{66}$Fe $\rightarrow$ $^{66}$Co transitions, we also report shell model results in the $fpg_{9/2}$ space using the GXPF1Br+$V_{MU}$ interaction. We obtain qualitative agreement for the individual transitions, while the calculated summed transition strengths closely reproduce the observed ones.

preprint - 2020 - arXiv

Shell model study of high-spin states and band terminations in $^{67}$As

In the present work, recently available experimental data for different bands of $^{67}$As [Phys. Rev. C 98, 024313 (2018)] have been interpreted within the framework of the shell model in full $f_{5/2}pg_{9/2}$ model space using JUN45 and jj44b effective interactions. The variation of $E$ - $E_{rot}$ energy versus spin for different bands is shown to obtain useful information about band termination. We have also reported the electromagnetic transition probabilities, quadrupole and magnetic moments of $^{67}$As. Except for some tentative high-spin states, the results are in good agreement with the available experimental data. The states $29/2^+$-$45/2^+$ are described by three particles in $g_{9/2}$, while $47/2^+$ and $51/2^+$ states are described by five particles in $g_{9/2}$. The shell model results corresponding to nine tentative states such as $41/2_1^+$ and $45/2_1^+$ in band-1b; $41/2_2^+$ and $45/2_2^+$ in band-1a; $43/2_2^+$, $47/2_2^+$ and $51/2_1^+$ in band-2a; $43/2_1^+$ and $47/2_1^+$ in band-2b are reported and discussed.