Source author record

Vivek Singh

Vivek Singh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning physics.atom-ph Artificial Intelligence Computation and Language Computer Vision eess.IV Other Computer Science Robotics

Catalog footprint

What is connected

8works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

An Empirical Study of Automating Agent Evaluation

Agent evaluation requires assessing complex multi-step behaviors involving tool use and intermediate reasoning, making it costly and expertise-intensive. A natural question arises: can frontier coding assistants reliably automate this evaluation process? Our study shows that simply prompting coding assistants is insufficient for this task. Without domain-specific evaluation knowledge, frontier coding assistants achieve only a 30% execution success rate and produce over-engineered evaluations averaging 12+ metrics per agent, indicating that strong coding ability does not automatically translate to reliable agent evaluation. We introduce EvalAgent, an AI assistant that automates the end-to-end agent evaluation pipeline. EvalAgent encodes evaluation domain expertise as evaluation skills (procedural instructions, reusable code and templates, and dynamically retrieved API documentation) that compose into a trace-based pipeline producing complete evaluation artifacts including metrics, executable code, and reports. To systematically assess generated evaluations, we introduce a meta-evaluation framework alongside AgentEvalBench, a benchmark comprising 20 agents, each paired with evaluation requirements and test scenarios. We further propose the Eval@1 metric to measure whether generated evaluation code both executes and yields meaningful results on the first run. Our experiments show that EvalAgent produces focused evaluations, improving Eval@1 from 17.5% to 65%, and achieving 79.5% human expert preference over baseline approaches. Further ablation studies show that evaluation skills are critical for handling complex evaluation: removing them causes Eval@1 to drop significantly from 65% to 30%.

preprint2026arXiv

AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization

Purpose: Central venous catheterization (CVC) is a critical medical procedure for vascular access, hemodynamic monitoring, and life-saving interventions. Its success remains challenging due to the need for continuous ultrasound-guided visualization of a target vessel and approaching needle, which is further complicated by anatomical variability and operator dependency. Errors in needle placement can lead to life-threatening complications. While robotic systems offer a potential solution, achieving full autonomy remains challenging. In this work, we propose an end-to-end robotic-ultrasound-guided CVC pipeline, from scan initialization to needle insertion. Methods: We introduce a deep-learning model to identify clinically relevant anatomical landmarks from a depth image of the patient's neck, obtained using RGB-D camera, to autonomously define the scanning region and paths. Then, a robot motion planning framework is proposed to scan, segment, reconstruct, and localize vessels (veins and arteries), followed by the identification of the optimal insertion zone. Finally, a needle guidance module plans the insertion under ultrasound guidance with operator's feedback. This pipeline was validated on a high-fidelity commercial phantom across 10 simulated clinical scenarios. Results: The proposed pipeline achieved 10 out of 10 successful needle placements on the first attempt. Vessels were reconstructed with a mean error of 2.15 \textit{mm}, and autonomous needle insertion was performed with an error less than or close to 1 \textit{mm}. Conclusion: To our knowledge, this is the first robotic CVC system demonstrated on a high-fidelity phantom with integrated planning, scanning, and insertion. Experimental results show its potential for clinical translation.

preprint2022arXiv

Efficient quantum state preparation using Stern-Gerlach effect on cold atoms

The Zeeman hyperfine state dependent force in a Stern-Gerlach (SG) experiment has been exploited to separate and detect atoms having different Zeeman hyperfine states in a cold atom cloud. Utilizing this SG technique, we have made the quantitative estimate of atoms in different Zeeman hyperfine states in an atom cloud, which has been helpful in optimizing the optical pumping of atoms for efficient preparation of atomic state. Employing an optimized optical pumping, nearly $92 \% $ of cold $^{87}Rb$ atoms from a grey magneto-optical trap (G-MOT) on atom-chip have been optically pumped to the trappable Zeeman hyperfine state $\ket{F=2, \, m_{F} = +2}$. These optically pumped atoms have been trapped in an Ioffe-Pritchard magnetic trap near the atom-chip surface.

preprint2022arXiv

Generic Lithography Modeling with Dual-band Optics-Inspired Neural Networks

Lithography simulation is a critical step in VLSI design and optimization for manufacturability. Existing solutions for highly accurate lithography simulation with rigorous models are computationally expensive and slow, even when equipped with various approximation techniques. Recently, machine learning has provided alternative solutions for lithography simulation tasks such as coarse-grained edge placement error regression and complete contour prediction. However, the impact of these learning-based methods has been limited due to restrictive usage scenarios or low simulation accuracy. To tackle these concerns, we introduce an dual-band optics-inspired neural network design that considers the optical physics underlying lithography. To the best of our knowledge, our approach yields the first published via/metal layer contour simulation at 1nm^2/pixel resolution with any tile size. Compared to previous machine learning based solutions, we demonstrate that our framework can be trained much faster and offers a significant improvement on efficiency and image quality with 20X smaller model size. We also achieve 85X simulation speedup over traditional lithography simulator with 1% accuracy loss.

preprint2022arXiv

Large Scale Mask Optimization Via Convolutional Fourier Neural Operator and Litho-Guided Self Training

Machine learning techniques have been extensively studied for mask optimization problems, aiming at better mask printability, shorter turnaround time, better mask manufacturability, and so on. However, most of these researches are focusing on the initial solution generation of small design regions. To further realize the potential of machine learning techniques on mask optimization tasks, we present a Convolutional Fourier Neural Operator (CFNO) that can efficiently learn layout tile dependencies and hence promise stitch-less large-scale mask optimization with the limited intervention of legacy tools. We discover the possibility of litho-guided self-training (LGST) through a trained machine learning model when solving non-convex optimization problems, which allows iterative model and dataset update and brings significant model performance improvement. Experimental results show that, for the first time, our machine learning-based framework outperforms state-of-the-art academic numerical mask optimizers with an order of magnitude speedup.

preprint2020arXiv

View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors

Point cloud based methods have produced promising results in areas such as 3D object detection in autonomous driving. However, most of the recent point cloud work focuses on single depth sensor data, whereas less work has been done on indoor monitoring applications, such as operation room monitoring in hospitals or indoor surveillance. In these scenarios multiple cameras are often used to tackle occlusion problems. We propose an end-to-end multi-person 3D pose estimation network, Point R-CNN, using multiple point cloud sources. We conduct extensive experiments to simulate challenging real world cases, such as individual camera failures, various target appearances, and complex cluttered scenes with the CMU panoptic dataset and the MVOR operation room dataset. Unlike most of the previous methods that attempt to use multiple sensor information by building complex fusion models, which often lead to poor generalization, we take advantage of the efficiency of concatenating point clouds to fuse the information at the input level. In the meantime, we show our end-to-end network greatly outperforms cascaded state-of-the-art models.

preprint2019arXiv

On continuous loading of a U-magneto-optical trap (U-MOT) on atom-chip in ultra high vacuum

Here, we report our studies on the continuous loading of a U-magneto-optical trap (U-MOT) on atom-chip from background rubidium (Rb) vapor generated using a dispenser source in ultra high vacuum (UHV) environment. Using the U-MOT loading curves, the partial pressure due to Rb vapor and pressure due to background gas have been estimated near the MOT cloud position. The estimated pressure due to Rb vapor increased from $\sim \; 1.4 \times 10^{-10}$ Torr to $ \sim \; 4.1 \times 10^{-9} $ Torr as Rb-dispenser current was increased from 2.8 to 3.4 A. The increase in dispenser current also resulted in decrease in loading as well as lifetime of the MOT cloud. This study is useful for magnetic trapping experiments where accurate information of pressure in chamber is important for the lifetime of the magnetic trap.

preprint2016arXiv

A tunable Doppler-free dichroic lock for laser frequency stabilization

We propose and demonstrate a laser frequency stabilization scheme which generates a dispersion-like tunable Doppler-free dichroic lock (TDFDL) signal. This signal offers a wide tuning range for lock point (i.e. zero-crossing) without compromising on the slope of the locking signal. The method involves measurement of magnetically induced dichroism in an atomic vapour for a weak probe laser beam in presence of a counter propagating strong pump laser beam. A simple model is presented to explain the basic principles of this method to generate the TDFDL signal. The spectral shift in the locking signal is achieved by tuning the frequency of the pump beam. The TDFDL signal is shown to be useful for locking the frequency of a cooling laser used for magneto-optcal trap (MOT) for $^{87}Rb$ atoms.

Vivek Singh

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

An Empirical Study of Automating Agent Evaluation

AURA-CVC: Autonomous Ultrasound-guided Robotic Assistance for Central Venous Catheterization

Efficient quantum state preparation using Stern-Gerlach effect on cold atoms

Generic Lithography Modeling with Dual-band Optics-Inspired Neural Networks

Large Scale Mask Optimization Via Convolutional Fourier Neural Operator and Litho-Guided Self Training

View Invariant Human Body Detection and Pose Estimation from Multiple Depth Sensors

On continuous loading of a U-magneto-optical trap (U-MOT) on atom-chip in ultra high vacuum

A tunable Doppler-free dichroic lock for laser frequency stabilization