Source author record

Vivek Natarajan

Vivek Natarajan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning Artificial Intelligence math.OC Computation and Language eess.IV eess.SY physics.flu-dyn Systems and Control

Catalog footprint

What is connected

9works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Towards Conversational Medical AI with Eyes, Ears and a Voice

The practice of medicine relies not only upon skillful dialogue but also on the nuanced exchange and interpretation of rich auditory and visual cues between doctors and patients. Building on the low-latency voice and video processing capabilities of Gemini, we introduce AI co-clinician, a first-of-its-kind conversational AI system utilizing continuous streams of audio-visual data from live patient conversations to inform real-time clinical decisions. Its dual-agent architecture balances deep clinical reasoning with the low latency required for natural dialogue. To assess this system, we implemented a video-based interface emulating telemedicine consultations. We crafted 20 standardized outpatient scenarios requiring proactive real-time auditory and visual reasoning and designed "TelePACES" evaluation criteria alongside case-specific rubrics. In a randomized, interface-blinded, crossover simulation study (n = 120 encounters) with 10 internal medicine residents as patient actors, we compared AI co-clinician with primary care physicians (PCPs), GPT-Realtime, and a baseline agent. AI co-clinician approached PCPs in key TelePACES dimensions, including management plans and differential diagnosis, while significantly outperforming GPT-Realtime across all general criteria. While our agent demonstrated parity with PCPs in case-specific triage measures, physicians maintained superior overall performance in case-specific assessments. Although AI co-clinician marks a significant advance in real-time telemedical AI, gaps remain in physical examination and disease-specific reasoning. Our work shows that text-only approaches fail to capture the true challenges of medical consultation and suggests that high-stakes real-time diagnostic AI is most safely advanced in collaborative, triadic models where AI can be a supportive co-clinician for doctors and patients.

preprint2022arXiv

Robust and Efficient Medical Imaging with Self-Supervision

Recent progress in Medical Artificial Intelligence (AI) has delivered systems that can reach clinical expert level performance. However, such systems tend to demonstrate sub-optimal "out-of-distribution" performance when evaluated in clinical settings different from the training environment. A common mitigation strategy is to develop separate systems for each clinical setting using site-specific data [1]. However, this quickly becomes impractical as medical data is time-consuming to acquire and expensive to annotate [2]. Thus, the problem of "data-efficient generalization" presents an ongoing difficulty for Medical AI development. Although progress in representation learning shows promise, their benefits have not been rigorously studied, specifically for out-of-distribution settings. To meet these challenges, we present REMEDIS, a unified representation learning strategy to improve robustness and data-efficiency of medical imaging AI. REMEDIS uses a generic combination of large-scale supervised transfer learning with self-supervised learning and requires little task-specific customization. We study a diverse range of medical imaging tasks and simulate three realistic application scenarios using retrospective data. REMEDIS exhibits significantly improved in-distribution performance with up to 11.5% relative improvement in diagnostic accuracy over a strong supervised baseline. More importantly, our strategy leads to strong data-efficient generalization of medical imaging AI, matching strong supervised baselines using between 1% to 33% of retraining data across tasks. These results suggest that REMEDIS can significantly accelerate the life-cycle of medical imaging AI development thereby presenting an important step forward for medical imaging AI to deliver broad impact.

preprint2021arXiv

Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

We develop and rigorously evaluate a deep learning based system that can accurately classify skin conditions while detecting rare conditions for which there is not enough data available for training a confident classifier. We frame this task as an out-of-distribution (OOD) detection problem. Our novel approach, hierarchical outlier detection (HOD) assigns multiple abstention classes for each training outlier class and jointly performs a coarse classification of inliers vs. outliers, along with fine-grained classification of the individual classes. We demonstrate the effectiveness of the HOD loss in conjunction with modern representation learning approaches (BiT, SimCLR, MICLe) and explore different ensembling strategies for further improving the results. We perform an extensive subgroup analysis over conditions of varying risk levels and different skin types to investigate how the OOD detection performance changes over each subgroup and demonstrate the gains of our framework in comparison to baselines. Finally, we introduce a cost metric to approximate downstream clinical impact. We use this cost metric to compare the proposed method against a baseline system, thereby making a stronger case for the overall system effectiveness in a real-world deployment scenario.

preprint2021arXiv

Stability of the integral control of stable nonlinear systems

PI controllers are the most widespread type of controllers and there is an intuitive understanding that if their gains are sufficiently small and of the correct sign, then they always work. In this paper we try to give some rigorous backing to this claim, under specific assumptions. Let $\bf P$ be a nonlinear system described by $\dot x=f(x,u)$, $y=g(x)$, where the state trajectory $x$ takes values in $R^n$, $u$ and $y$ are scalar and $f,g$ are of class $C^1$. We assume that there is a Lipschitz function $Ξ:[u_{min},u_{max}]\rightarrow R^n$ such that for every constant input $u_0\in[u_{min},u_{max}]$, $Ξ(u_0)$ is an exponentially stable equilibrium point of $\bf P$. We also assume that $G(u)=g(Ξ(u))$, which is the steady state input-output map of $\bf P$, is strictly increasing. Denoting $y_{min}=G(u_{min})$ and $y_{max}=G(u_{max})$, we assume that the reference value $r$ is in $(y_{min},y_{max})$. Our aim is that $y$ should track $r$, i.e., $y\rightarrow r$ as $t\rightarrow\infty$, while the input of $P$ is only allowed to be in $[u_{min},u_{max}]$. For this, we introduce a variation of the integrator, called the saturating integrator, and connect it in feedback with $\bf P$ in the standard way, with gain $k>0$. We show that for any small enough $k$, the closed-loop system is (locally) exponentially stable around an equilibrium point $(Xi(u_r),u_r)$, with a large region of attraction $X_T\subset R^n\times[u_{min},u_{max}]$. When the state $(x(t),u(t))$ of the closed-loop system converges to $(Ξ(u_r),u_r)$, then the tracking error $r-y$ tends to zero. The compact set $X_T$ can be made larger by choosing a larger parameter $T>0$, resulting in smaller $k$.

preprint2021arXiv

Supervised Transfer Learning at Scale for Medical Imaging

Transfer learning is a standard technique to improve performance on tasks with limited data. However, for medical imaging, the value of transfer learning is less clear. This is likely due to the large domain mismatch between the usual natural-image pre-training (e.g. ImageNet) and medical images. However, recent advances in transfer learning have shown substantial improvements from scale. We investigate whether modern methods can change the fortune of transfer learning for medical imaging. For this, we study the class of large-scale pre-trained networks presented by Kolesnikov et al. on three diverse imaging tasks: chest radiography, mammography, and dermatology. We study both transfer performance and critical properties for the deployment in the medical domain, including: out-of-distribution generalization, data-efficiency, sub-group fairness, and uncertainty estimation. Interestingly, we find that for some of these properties transfer from natural to medical images is indeed extremely effective, but only when performed at sufficient scale.

preprint2020arXiv

Contrastive Training for Improved Out-of-Distribution Detection

Reliable detection of out-of-distribution (OOD) inputs is increasingly understood to be a precondition for deployment of machine learning systems. This paper proposes and investigates the use of contrastive training to boost OOD detection performance. Unlike leading methods for OOD detection, our approach does not require access to examples labeled explicitly as OOD, which can be difficult to collect in practice. We show in extensive experiments that contrastive training significantly helps OOD detection performance on a number of common benchmarks. By introducing and employing the Confusion Log Probability (CLP) score, which quantifies the difficulty of the OOD detection task by capturing the similarity of inlier and outlier datasets, we show that our method especially improves performance in the `near OOD' classes -- a particularly challenging setting for previous methods.

preprint2020arXiv

Modelling, Controllability and Gait Design for a Spherical Flexible Swimmer

This paper discusses modelling, controllability and gait design for a spherical flexible swimmer. We first present a kinematic model of a low Reynolds number spherical flexible swimming mechanism with periodic surface deformations in the radial and azimuthal directions. The model is then converted to a finite dimensional driftless, affine-in-control principal kinematic form by representing the surface deformations as a linear combination of finitely many Legendre polynomials. A controllability analysis is then done for this swimmer to conclude that the swimmer is locally controllable on $\mathbb{R}^3$ for certain combinations of the Legendre polynomials. The rates of the coefficients of the polynomials are considered as the control inputs for surface deformation. Finally, the Abelian nature of the structure group of the swimmer's configuration space is exploited to synthesize a curvature based gait for the spherical flexile swimmer and a rigid-link swimmer.

preprint2019arXiv

A deep learning system for differential diagnosis of skin diseases

Skin conditions affect an estimated 1.9 billion people worldwide. A shortage of dermatologists causes long wait times and leads patients to seek dermatologic care from general practitioners. However, the diagnostic accuracy of general practitioners has been reported to be only 0.24-0.70 (compared to 0.77-0.96 for dermatologists), resulting in referral errors, delays in care, and errors in diagnosis and treatment. In this paper, we developed a deep learning system (DLS) to provide a differential diagnosis of skin conditions for clinical cases (skin photographs and associated medical histories). The DLS distinguishes between 26 skin conditions that represent roughly 80% of the volume of skin conditions seen in primary care. The DLS was developed and validated using de-identified cases from a teledermatology practice serving 17 clinical sites via a temporal split: the first 14,021 cases for development and the last 3,756 cases for validation. On the validation set, where a panel of three board-certified dermatologists defined the reference standard for every case, the DLS achieved 0.71 and 0.93 top-1 and top-3 accuracies respectively. For a random subset of the validation set (n=963 cases), 18 clinicians reviewed the cases for comparison. On this subset, the DLS achieved a 0.67 top-1 accuracy, non-inferior to board-certified dermatologists (0.63, p<0.001), and higher than primary care physicians (PCPs, 0.45) and nurse practitioners (NPs, 0.41). The top-3 accuracy showed a similar trend: 0.90 DLS, 0.75 dermatologists, 0.60 PCPs, and 0.55 NPs. These results highlight the potential of the DLS to augment general practitioners to accurately diagnose skin conditions by suggesting differential diagnoses that may not have been considered. Future work will be needed to prospectively assess the clinical impact of using this tool in actual clinical workflows.

preprint2016arXiv

Almost global asymptotic stability of a grid-connected synchronous generator

We study the global asymptotic behavior of a grid-connected constant field current synchronous generator (SG). The grid is regarded as an "infinite bus", i.e. a three-phase AC voltage source. The generator does not include any controller other than the frequency droop loop. This means that the mechanical torque applied to this generator is an affine function of its angular velocity. The negative slope of this function is the frequency droop constant. We derive sufficient conditions on the SG parameters under which there exist exactly two periodic state trajectories for the SG, one stable and another unstable, and for almost all initial states, the state trajectory of the SG converges to the stable periodic trajectory (all the angles are measured modulo $2π$). Along both periodic state trajectories, the angular velocity of the SG is equal to the grid frequency. Our sufficient conditions are easy to check computationally. An important tool in our analysis is an integro-differential equation called the {\em exact swing equation}, which resembles a forced pendulum equation and is equivalent to our fourth order model of the grid-connected SG. Apart from our objective of providing an analytical proof for a global asymptotic behavior observed in a classical dynamical system, a key motivation for this work is the development of synchronverters which are inverters that mimic the behavior of SGs. Understanding the global dynamics of SGs can guide the choice of synchronverter parameters and operation. As an application we find a set of stable nominal parameters for a 500 kW synchronverter.

Vivek Natarajan

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Towards Conversational Medical AI with Eyes, Ears and a Voice

Robust and Efficient Medical Imaging with Self-Supervision

Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

Stability of the integral control of stable nonlinear systems

Supervised Transfer Learning at Scale for Medical Imaging

Contrastive Training for Improved Out-of-Distribution Detection

Modelling, Controllability and Gait Design for a Spherical Flexible Swimmer

A deep learning system for differential diagnosis of skin diseases

Almost global asymptotic stability of a grid-connected synchronous generator