Researcher profile

Anuj Kumar

Anuj Kumar contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Sharpness of the Osgood Criterion for the Continuity Equation with Divergence-free Vector Fields

For any modulus of continuity $ω$ that fails the Osgood condition, we construct a divergence-free velocity field $v \in C_t C^ω_x$ for which the associated ODE admits at least two distinct flow maps. In other words, non-uniqueness does not occur merely for a single or even finitely many trajectories, but instead on a set of initial conditions $E$ of positive Lebesgue measure. In fact, the set $E$ has full measure inside a cube where the construction is supported. Moreover, we also construct a divergence-free velocity field $v \in C_{t}C^ω_x$ for which the associated continuity equation admits two distinct solutions $μ^1$ and $μ^2$ which are absolutely continuous with respect to Lebesgue measure for almost every time, and start from the same initial datum $\bar μ\ll \mathscr{L}^{d}$. Our construction introduces two novel ideas: (i) We introduce the notion of "parallelization", where at each time, the velocity field consists of simultaneous motion across multiple nested spatial scales. This differs from most explicit constructions in the literature on mixing or anomalous dissipation, where the velocity on different scales acts at separate times. This is crucial to cover the whole class of non-Osgood moduli of continuity. (ii) Inspired by a recent work of Bruè, Colombo and Kumar, we develop a new fixed-point framework that naturally incorporates the parallelization mechanism. This framework allows us to construct anomalous solutions of the continuity equation that belong to $L^1(\mathbb{R}^d)$ a.e. in time.

preprint2025arXiv

WearVox: An Egocentric Multichannel Voice Assistant Benchmark for Wearables

Wearable devices such as AI glasses are transforming voice assistants into always-available, hands-free collaborators that integrate seamlessly with daily life, but they also introduce challenges like egocentric audio affected by motion and noise, rapid micro-interactions, and the need to distinguish device-directed speech from background conversations. Existing benchmarks largely overlook these complexities, focusing instead on clean or generic conversational audio. To bridge this gap, we present WearVox, the first benchmark designed to rigorously evaluate voice assistants in realistic wearable scenarios. WearVox comprises 3,842 multi-channel, egocentric audio recordings collected via AI glasses across five diverse tasks including Search-Grounded QA, Closed-Book QA, Side-Talk Rejection, Tool Calling, and Speech Translation, spanning a wide range of indoor and outdoor environments and acoustic conditions. Each recording is accompanied by rich metadata, enabling nuanced analysis of model performance under real-world constraints. We benchmark leading proprietary and open-source speech Large Language Models (SLLMs) and find that most real-time SLLMs achieve accuracies on WearVox ranging from 29% to 59%, with substantial performance degradation on noisy outdoor audio, underscoring the difficulty and realism of the benchmark. Additionally, we conduct a case study with two new SLLMs that perform inference with single-channel and multi-channel audio, demonstrating that multi-channel audio inputs significantly enhance model robustness to environmental noise and improve discrimination between device-directed and background speech. Our results highlight the critical importance of spatial audio cues for context-aware voice assistants and establish WearVox as a comprehensive testbed for advancing wearable voice AI research.

preprint2023arXiv

Extended Feature Space-Based Automatic Melanoma Detection System

Melanoma is the deadliest form of skin cancer. Uncontrollable growth of melanocytes leads to melanoma. Melanoma has been growing wildly in the last few decades. In recent years, the detection of melanoma using image processing techniques has become a dominant research field. The Automatic Melanoma Detection System (AMDS) helps to detect melanoma based on image processing techniques by accepting infected skin area images as input. A single lesion image is a source of multiple features. Therefore, It is crucial to select the appropriate features from the image of the lesion in order to increase the accuracy of AMDS. For melanoma detection, all extracted features are not important. Some of the extracted features are complex and require more computation tasks, which impacts the classification accuracy of AMDS. The feature extraction phase of AMDS exhibits more variability, therefore it is important to study the behaviour of AMDS using individual and extended feature extraction approaches. A novel algorithm ExtFvAMDS is proposed for the calculation of Extended Feature Vector Space. The six models proposed in the comparative study revealed that the HSV feature vector space for automatic detection of melanoma using Ensemble Bagged Tree classifier on Med-Node Dataset provided 99% AUC, 95.30% accuracy, 94.23% sensitivity, and 96.96% specificity.

preprint2022arXiv

An Indian Roads Dataset for Supported and Suspended Traffic Lights Detection

Autonomous vehicles are growing rapidly, in well-developed nations like America, Europe, and China. Tech giants like Google, Tesla, Audi, BMW, and Mercedes are building highly efficient self-driving vehicles. However, the technology is still not mainstream for developing nations like India, Thailand, Africa, etc., In this paper, we present a thorough comparison of the existing datasets based on well-developed nations as well as Indian roads. We then developed a new dataset "Indian Roads Dataset" (IRD) having more than 8000 annotations extracted from 3000+ images shot using a 64 (megapixel) camera. All the annotations are manually labelled adhering to the strict rules of annotations. Real-time video sequences have been captured from two different cities in India namely New Delhi and Chandigarh during the day and night-light conditions. Our dataset exceeds previous Indian traffic light datasets in size, annotations, and variance. We prove the amelioration of our dataset by providing an extensive comparison with existing Indian datasets. Various dataset criteria like size, capturing device, a number of cities, and variations of traffic light orientations are considered. The dataset can be downloaded from here https://sites.google.com/view/ird-dataset/home

preprint2022arXiv

Optimal bounds in Taylor--Couette flow

This paper is concerned with the optimal upper bound on mean quantities (torque, dissipation and the Nusselt number) obtained in the framework of the background method for the Taylor--Couette flow with a stationary outer cylinder. Along the way, we perform the energy stability analysis of the laminar flow, and demonstrate that below radius ratio 0.0556, the marginally stable perturbations are not the axisymmetric Taylor vortices but rather a fully three-dimensional flow. The main result of the paper is an analytical expression of the optimal bound as a function of the radius ratio. To obtain this bound, we begin by deriving a suboptimal analytical bound using analysis techniques. We use a definition of the background flow with two boundary layers, whose relative thicknesses are optimized to obtain the bound. In the limit of high Reynolds number, the dependence of this suboptimal bound on the radius ratio (the geometrical scaling) turns out to be the same as that of numerically computed optimal bounds in three different cases: (1) where the perturbed flow only satisfies the homogeneous boundary conditions but need not be incompressible, (2) the perturbed flow is three dimensional and incompressible, (3) the perturbed flow is two dimensional and incompressible. We compare the geometrical scaling with the observations from the turbulent Taylor--Couette flow, and find that the analytical result indeed agrees well with the available DNS data. In this paper, we also dismiss the applicability of the background method to certain flow problems and therefore establish the limitation of this method.

preprint2022arXiv

Three dimensional branching pipe flows for optimal scalar transport between walls

We consider the problem of "wall-to-wall optimal transport" in which we attempt to maximize the transport of a passive temperature field between hot and cold plates. Specifically, we optimize the choice of the divergence-free velocity field in the advection-diffusion equation subject to an enstrophy constraint (which can be understood as a constraint on the power required to generate the flow). Previous work established an a priori upper bound on the transport, scaling as the 1/3-power of the flow's enstrophy. Recently, Tobasco & Doering (Phys. Rev. Lett. vol.118, 2017, p.264502}) and Doering & Tobasco (Comm. Pure Appl. Math. vol.72, 2019, p.2385--2448}) constructed self-similar two-dimensional steady branching flows saturating this bound up to a logarithmic correction. This logarithmic correction appears to arise due to a topological obstruction inherent to two-dimensional steady branching flows. We present a construction of three-dimensional "branching pipe flows" that eliminates the possibility of this logarithmic correction and therefore identifies the optimal scaling as a clean 1/3-power law. Our flows resemble previous numerical studies of the three-dimensional wall-to-wall problem by Motoki, Kawahara & Shimizu (J. Fluid Mech. vol.851, 2018, p.R4}). We also discuss the implications of our result to the heat transfer problem in Rayleigh--Bénard convection and the problem of anomalous dissipation in a passive scalar.

preprint2021arXiv

Analytical bounds on the heat transport in internally heated convection

We obtain an analytical bound on the mean vertical convective heat flux $\langle w T \rangle$ between two parallel boundaries driven by uniform internal heating. We consider two configurations, one with both boundaries held at the same constant temperature, and the other one with a top boundary held at constant temperature and a perfectly insulating bottom boundary. For the first configuration, Arslan et al. (J. Fluid Mech. 919:A15, 2021) recently provided numerical evidence that Rayleigh-number-dependent corrections to the only known rigorous bound $\langle w T \rangle \leq 1/2$ may be provable if the classical background method is augmented with a minimum principle stating that the fluid's temperature is no smaller than that of the top boundary. Here, we confirm this fact rigorously for both configurations by proving bounds on $\langle wT \rangle$ that approach $1/2$ exponentially from below as the Rayleigh number is increased. The key to obtaining these bounds are inner boundary layers in the background fields with a particular inverse-power scaling, which can be controlled in the spectral constraint using Hardy and Rellich inequalities. These allow for qualitative improvements in the analysis not available to standard constructions.

preprint2021arXiv

El Volumen Louder Por Favor: Code-switching in Task-oriented Semantic Parsing

Being able to parse code-switched (CS) utterances, such as Spanish+English or Hindi+English, is essential to democratize task-oriented semantic parsing systems for certain locales. In this work, we focus on Spanglish (Spanish+English) and release a dataset, CSTOP, containing 5800 CS utterances alongside their semantic parses. We examine the CS generalizability of various Cross-lingual (XL) models and exhibit the advantage of pre-trained XL language models when data for only one language is present. As such, we focus on improving the pre-trained models for the case when only English corpus alongside either zero or a few CS training instances are available. We propose two data augmentation methods for the zero-shot and the few-shot settings: fine-tune using translate-and-align and augment using a generation model followed by match-and-filter. Combining the few-shot setting with the above improvements decreases the initial 30-point accuracy gap between the zero-shot and the full-data settings by two thirds.

preprint2020arXiv

Information Extraction of Clinical Trial Eligibility Criteria

Clinical trials predicate subject eligibility on a diversity of criteria ranging from patient demographics to food allergies. Trials post their requirements as semantically complex, unstructured free-text. Formalizing trial criteria to a computer-interpretable syntax would facilitate eligibility determination. In this paper, we investigate an information extraction (IE) approach for grounding criteria from trials in ClinicalTrials(dot)gov to a shared knowledge base. We frame the problem as a novel knowledge base population task, and implement a solution combining machine learning and context free grammar. To our knowledge, this work is the first criteria extraction system to apply attention-based conditional random field architecture for named entity recognition (NER), and word2vec embedding clustering for named entity linking (NEL). We release the resources and core components of our system on GitHub at https://github.com/facebookresearch/Clinical-Trial-Parser. Finally, we report our per module and end to end performances; we conclude that our system is competitive with Criteria2Query, which we view as the current state-of-the-art in criteria extraction.