Source author record

Vijay Kumar

Vijay Kumar appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

47 works

21 topics

4 close collaborators

preprint, 2026, arXiv

LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration

Traditional autonomous UAV search missions rely on geometric coverage patterns that ignore the semantic context of the target, leading to significant time waste in large-scale environments. In this paper we present LMPath, a pipeline for generating language-mediated exploration priors for Unmanned Aerial Vehicle (UAV) search missions that leverages semantics. Given a basic geofence and an object of interest prompt, LMPath uses generative language models to determine what regions of the environment should contain that object and a foundation vision model run over satellite imagery to segment sub-regions that form the exploration prior. This prior can then be used to generate UAV paths with various objectives, such as minimizing the expected time to locate the object of interest, maximizing the probability that the object is found given a limited travel distance, or narrowing down the search space to sub-regions that are most likely to contain the object. To demonstrate its capabilities, we used LMPath to generate various UAV paths and ran them using a real UAV over large-scale environments. We also ran simulations to demonstrate how paths generated using LMPath outperform traditional path planning approaches for search missions.

preprint, 2026, arXiv

Retrieval Mechanisms Surpass Long-Context Scaling in Time Series Forecasting

Time Series Foundation Models (TSFMs) have borrowed the long context paradigm from natural language processing under the premise that feeding more history into the model improves forecast quality. But in stochastic domains, distant history is often just high-frequency noise, not signal. Hence, the proposed work tests whether this premise actually holds by running continuous context architectures (PatchTST included) through the ETTh1 benchmark. The obtained results contradict the premise: an inverse scaling law shows up clearly, with forecasting error rising as context gets longer. A 3,000-step window causes performance to drop by over 68%, evidence that attention mechanisms are poor at ignoring irrelevant historical volatility. Retrieval-Augmented Forecasting (RAFT) is evaluated as an alternative. RAFT achieves a mean squared error (MSE) of 0.379 with a fixed 720-step window and selective retrieval, outperforming both long-context configurations and zero-shot foundation models (Chronos, Moirai) despite requiring far less computation. In addition, the retrieval step injects only the most relevant historical segments as dynamic exogenous variables, which gives the model a context-informed inductive bias it cannot build on its own from raw sequences. Therefore, foundation models going forward need to shift architecturally toward selective retrieval.
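
The retrieval step described in this abstract can be illustrated with a small nearest-neighbour search over past windows. This is only a sketch under stated assumptions, not the RAFT implementation: the function name `retrieve_segments`, the mean-removed Euclidean distance, and the toy series are all hypothetical choices made for illustration.

```python
import math


def _centered(window):
    """Remove the mean so that windows are compared by shape, not level."""
    mean = sum(window) / len(window)
    return [v - mean for v in window]


def retrieve_segments(history, query_len, horizon, k=1):
    """Return up to k (distance, start, continuation) triples.

    The query is the most recent `query_len` values of `history`. Every
    earlier window of the same length is scored against it, and the k
    closest windows are returned together with the `horizon` values that
    followed each of them (the part a forecaster could reuse).
    """
    query = _centered(history[-query_len:])
    last_start = len(history) - query_len - horizon
    candidates = []
    for start in range(last_start + 1):
        window = _centered(history[start:start + query_len])
        dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(window, query)))
        continuation = history[start + query_len:start + query_len + horizon]
        candidates.append((dist, start, continuation))
    # Closest match first; ties are broken by the earliest start index.
    candidates.sort(key=lambda c: (c[0], c[1]))
    return candidates[:k]


if __name__ == "__main__":
    series = [0, 1, 2, 3] * 5  # toy periodic history (assumed data)
    for dist, start, continuation in retrieve_segments(series, 4, 2, k=2):
        print(dist, start, continuation)
```

The retrieved continuations play the role of the "most relevant historical segments"; how they are fed to the forecasting model as exogenous inputs is outside the scope of this sketch.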

preprint, 2025, arXiv

ADMM-MCBF-LCA: A Layered Control Architecture for Safe Real-Time Navigation

We consider the problem of safe real-time navigation of a robot in a dynamic environment with moving obstacles of arbitrary smooth geometries and input saturation constraints. We assume that the robot detects and models nearby obstacle boundaries with a short-range sensor and that this detection is error-free. This problem presents three main challenges: i) input constraints, ii) safety, and iii) real-time computation. To tackle all three challenges, we present a layered control architecture (LCA) consisting of an offline path library generation layer, and an online path selection and safety layer. To overcome the limitations of reactive methods, our offline path library consists of feasible controllers, feedback gains, and reference trajectories. To handle computational burden and safety, we solve online path selection and generate safe inputs that run at 100 Hz. Through simulations on Gazebo and Fetch hardware in an indoor environment, we evaluate our approach against baselines that are layered, end-to-end, or reactive. Our experiments demonstrate that among all algorithms, only our proposed LCA is able to complete tasks such as reaching a goal, safely. When comparing metrics such as safety, input error, and success rate, we show that our approach generates safe and feasible inputs throughout the robot execution.

preprint, 2022, arXiv

Adaptive Sampling of Latent Phenomena using Heterogeneous Robot Teams (ASLaP-HR)

In this paper, we present an online adaptive planning strategy for a team of robots with heterogeneous sensors to sample from a latent spatial field using a learned model for decision making. Current robotic sampling methods seek to gather information about an observable spatial field. However, many applications, such as environmental monitoring and precision agriculture, involve phenomena that are not directly observable or are costly to measure, called latent phenomena. In our approach, we seek to reason about the latent phenomenon in real-time by effectively sampling the observable spatial fields using a team of robots with heterogeneous sensors, where each robot has a distinct sensor to measure a different observable field. The information gain is estimated using a learned model that maps from the observable spatial fields to the latent phenomenon. This model captures aleatoric uncertainty in the relationship to allow for information theoretic measures. Additionally, we explicitly consider the correlations among the observable spatial fields, capturing the relationship between sensor types whose observations are not independent. We show it is possible to learn these correlations, and investigate the impact of the learned correlation models on the performance of our sampling approach. Through our qualitative and quantitative results, we illustrate that empirically learned correlations improve the overall sampling efficiency of the team. We simulate our approach using a data set of sensor measurements collected on Lac Hertel, in Quebec, which we make publicly available.

preprint, 2022, arXiv

Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR

Currently, GPS is by far the most popular global localization method. However, it is not always reliable or accurate in all environments. SLAM methods enable local state estimation but provide no means of registering the local map to a global one, which can be important for inter-robot collaboration or human interaction. In this work, we present a real-time method for utilizing semantics to globally localize a robot using only egocentric 3D semantically labelled LiDAR and IMU as well as top-down RGB images obtained from satellites or aerial robots. Additionally, as it runs, our method builds a globally registered, semantic map of the environment. We validate our method on KITTI as well as our own challenging datasets, and show better than 10 meter accuracy, a high degree of robustness, and the ability to estimate the scale of a top-down map on the fly if it is initially unknown.

preprint, 2022, arXiv

Graph Neural Networks for Decentralized Multi-Robot Submodular Action Selection

The problem of decentralized multi-robot target tracking asks for jointly selecting actions, e.g., motion primitives, for the robots to maximize target tracking performance with local communications. One major challenge for practical implementations is to make target tracking approaches scalable for large-scale problem instances. In this work, we propose a general-purpose learning architecture toward collaborative target tracking at scale, with decentralized communications. In particular, our learning architecture leverages a graph neural network (GNN) to capture local interactions of the robots and learns decentralized decision-making for the robots. We train the learning model by imitating an expert solution and implement the resulting model for decentralized action selection involving local observations and communications only. We demonstrate the performance of our GNN-based learning approach in a scenario of active target tracking with large networks of robots. The simulation results show our approach nearly matches the tracking performance of the expert algorithm, and yet runs several orders of magnitude faster with up to 100 robots. Moreover, it slightly outperforms a decentralized greedy algorithm while also running faster (especially with more than 20 robots). The results also exhibit our approach's generalization capability in previously unseen scenarios, e.g., larger environments and larger networks of robots.
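
To make the action-selection problem concrete, the sketch below shows a generic sequential greedy rule on a coverage objective, where each robot in turn picks the action that adds the most newly covered targets. It is not the paper's GNN, its expert, or necessarily the greedy baseline it compares against. The coverage objective stands in for tracking performance, and the names `sequential_greedy`, `r1`, `r2`, `left`, and `right` are hypothetical.

```python
from typing import Dict, FrozenSet, List

# Hypothetical input: for each robot, the set of targets that each of its
# candidate actions (e.g., motion primitives) would cover.
ActionSets = Dict[str, Dict[str, FrozenSet[int]]]


def sequential_greedy(action_sets: ActionSets, order: List[str]) -> Dict[str, str]:
    """Each robot, in the given order, picks its best marginal-gain action."""
    covered = set()
    choice = {}
    for robot in order:
        best_action, best_gain = None, -1
        # Sorting makes tie-breaking deterministic (alphabetical by action name).
        for action, targets in sorted(action_sets[robot].items()):
            gain = len(targets - covered)  # targets not yet covered by earlier robots
            if gain > best_gain:
                best_action, best_gain = action, gain
        choice[robot] = best_action
        covered |= action_sets[robot][best_action]
    return choice


if __name__ == "__main__":
    actions = {
        "r1": {"left": frozenset({1, 2}), "right": frozenset({3})},
        "r2": {"left": frozenset({1, 2}), "right": frozenset({3, 4})},
    }
    print(sequential_greedy(actions, ["r1", "r2"]))
```

Because each robot only needs the choices of the robots before it, a rule of this kind can run over local communication, which is the property that makes greedy selection a natural decentralized baseline.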

preprint, 2022, arXiv

Learning Connectivity-Maximizing Network Configurations

In this letter we propose a data-driven approach to optimizing the algebraic connectivity of a team of robots. While a considerable amount of research has been devoted to this problem, we lack a method that scales in a manner suitable for online applications for more than a handful of agents. To that end, we propose a supervised learning approach with a convolutional neural network (CNN) that learns to place communication agents from an expert that uses an optimization-based strategy. We demonstrate the performance of our CNN on canonical line and ring topologies, 105k randomly generated test cases, and larger teams not seen during training. We also show how our system can be applied to dynamic robot teams through a Unity-based simulation. After training, our system produces connected configurations over an order of magnitude faster than the optimization-based scheme for teams of 10-20 agents.
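
For readers unfamiliar with the objective, algebraic connectivity is the second-smallest eigenvalue of the graph Laplacian, and it is positive exactly when the graph is connected. The sketch below only evaluates that quantity for small line and ring topologies; it is not the paper's CNN or its optimization-based expert. The function names and the power-iteration approach are choices made for this illustration.

```python
import math


def laplacian(n, edges):
    """Dense Laplacian L = D - A of an undirected graph on n nodes."""
    L = [[0.0] * n for _ in range(n)]
    for i, j in edges:
        L[i][j] -= 1.0
        L[j][i] -= 1.0
        L[i][i] += 1.0
        L[j][j] += 1.0
    return L


def _unit_orthogonal_to_ones(x):
    """Project out the all-ones direction and normalize to unit length."""
    mean = sum(x) / len(x)
    x = [v - mean for v in x]
    norm = math.sqrt(sum(v * v for v in x))
    return [v / norm for v in x]


def algebraic_connectivity(n, edges, iters=2000):
    """Second-smallest Laplacian eigenvalue, estimated by power iteration.

    Power iteration is run on M = c*I - L restricted to vectors orthogonal
    to the all-ones vector; the dominant eigenvalue there is c - lambda_2.
    """
    L = laplacian(n, edges)
    # All Laplacian eigenvalues are at most 2 * max degree, so M is positive definite.
    c = 2.0 * max(L[i][i] for i in range(n)) + 1.0
    x = [math.sin(i + 1.0) for i in range(n)]  # deterministic, generic start vector
    for _ in range(iters):
        x = _unit_orthogonal_to_ones(x)
        x = [c * x[i] - sum(L[i][j] * x[j] for j in range(n)) for i in range(n)]
    x = _unit_orthogonal_to_ones(x)
    # Rayleigh quotient x^T L x of the converged unit vector.
    return sum(x[i] * L[i][j] * x[j] for i in range(n) for j in range(n))


if __name__ == "__main__":
    line = [(i, i + 1) for i in range(4)]
    ring = line + [(4, 0)]
    print("line:", algebraic_connectivity(5, line))
    print("ring:", algebraic_connectivity(5, ring))
```

For a path on n nodes the exact value is 2 - 2cos(pi/n) and for a cycle it is 2 - 2cos(2*pi/n), so closing the line into a ring raises the connectivity of the same five agents.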

preprint, 2022, arXiv

Resilient Edge: Can we achieve Network Resiliency at the IoT Edge using LPWAN and WiFi?

Edge computing has gained attention in recent years due to the adoption of many Internet of Things (IoT) applications in domestic, industrial and wild settings. The resiliency and reliability requirements of these applications vary from non-critical (best-effort delivery) to safety-critical with time-bounded guarantees. The network connectivity of IoT edge devices remains the central critical component that needs to meet the time-bounded Quality of Service (QoS) and fault-tolerance guarantees of the applications. Therefore, in this work, we systematically investigate how to meet the mixed-criticality QoS requirements of IoT applications in multi-communication networks. We (i) present the network resiliency requirements of IoT applications by defining a system model, (ii) analyse and evaluate the bandwidth, latency, throughput, and maximum packet size of many state-of-the-art LPWAN technologies, such as Sigfox, LoRa, and LTE (CAT-M1/NB-IoT), as well as Wi-Fi, and (iii) implement and evaluate an adaptive system, Resilient Edge, and a Criticality-Aware Best Fit (CABF) resource allocation algorithm to meet the application resiliency requirements using a Raspberry Pi 4 and a Pycom FiPy development board with five multi-communication networks. We present our findings on how to achieve 100% of the best-effort high criticality level message delivery using multi-communication networks.

preprint, 2022, arXiv

Stronger Together: Air-Ground Robotic Collaboration Using Semantics

In this work, we present an end-to-end heterogeneous multi-robot system framework where ground robots are able to localize, plan, and navigate in a semantic map created in real time by a high-altitude quadrotor. The ground robots choose and deconflict their targets independently, without any external intervention. Moreover, they perform cross-view localization by matching their local maps with the overhead map using semantics. The communication backbone is opportunistic and distributed, allowing the entire system to operate with no external infrastructure aside from GPS for the quadrotor. We extensively tested our system by performing different missions on top of our framework over multiple experiments in different environments. Our ground robots travelled over 6 km autonomously with minimal intervention in the real world and over 96 km in simulation without interventions.

preprint, 2021, arXiv

Belief Space Planning for Mobile Robots with Range Sensors using iLQG

In this work, we use iterative Linear Quadratic Gaussian (iLQG) to plan motions for a mobile robot with range sensors in belief space. We address two limitations that prevent applications of iLQG to the considered robotic system. First, iLQG assumes a differentiable measurement model, which is not true for range sensors. We show that iLQG only requires the differentiability of the belief dynamics. We propose to use a derivative-free filter to approximate the belief dynamics, which does not require explicit differentiability of the measurement model. Second, informative measurements from a range sensor are sparse. Uninformative measurements produce trivial gradient information, which prevents iLQG optimization from converging to a local minimum. We densify the informative measurements by introducing additional parameters in the measurement model. The parameters are iteratively updated in the optimization to ensure convergence to the true measurement model of a range sensor. We show the effectiveness of the proposed modifications through an ablation study. We also apply the proposed method in simulations of large scale real world environments, which show superior performance compared to the state-of-the-art methods that either assume the separation principle or maximum likelihood measurements.

preprint, 2021, arXiv

Fair Robust Assignment using Redundancy

We study the consideration of fairness in redundant assignment for multi-agent task allocation. It has recently been shown that redundant assignment of agents to tasks provides robustness to uncertainty in task performance. However, the question of how to fairly assign these redundant resources across tasks remains unaddressed. In this paper, we present a novel problem formulation for fair redundant task allocation, which we cast as the optimization of worst-case task costs under a cardinality constraint. Solving this problem optimally is NP-hard. We exploit properties of supermodularity to propose a polynomial-time, near-optimal solution. In supermodular redundant assignment, the use of additional agents always improves task costs. Therefore, we provide a solution set that is $α$ times larger than the cardinality constraint. This constraint relaxation enables our approach to achieve a super-optimal cost by using a sub-optimal assignment size. We derive the sub-optimality bound on this cardinality relaxation, $α$. Additionally, we demonstrate that our algorithm performs near-optimally without the cardinality relaxation. We show simulations of redundant assignments of robots to goal nodes on transport networks with uncertain travel times. Empirically, our algorithm outperforms benchmarks, scales to large problems, and provides improvements in both fairness and average utility.

preprint, 2021, arXiv

Large Scale Distributed Collaborative Unlabeled Motion Planning with Graph Policy Gradients

In this paper, we present a learning method to solve the unlabelled motion problem with motion constraints and space constraints in 2D space for a large number of robots. To solve the problem of arbitrary dynamics and constraints we propose formulating the problem as a multi-agent problem. We are able to demonstrate the scalability of our methods for a large number of robots by employing a graph neural network (GNN) to parameterize policies for the robots. The GNN reduces the dimensionality of the problem by learning filters that aggregate information among robots locally, similar to how a convolutional neural network is able to learn local features in an image. Additionally, by employing a GNN we are also able to overcome the computational overhead of training policies for a large number of robots by first training graph filters for a small number of robots followed by zero-shot policy transfer to a larger number of robots. We demonstrate the effectiveness of our framework through various simulations.

preprint, 2021, arXiv

Resilient Task Allocation in Heterogeneous Multi-Robot Systems

For a multi-robot system equipped with heterogeneous capabilities, this paper presents a mechanism to allocate robots to tasks in a resilient manner when anomalous environmental conditions such as weather events or adversarial attacks affect the performance of robots within the tasks. Our primary objective is to ensure that each task is assigned the requisite level of resources, measured as the aggregated capabilities of the robots allocated to the task. By keeping track of task performance deviations under external perturbations, our framework quantifies the extent to which robot capabilities (e.g., visual sensing or aerial mobility) are affected by environmental conditions. This enables an optimization-based framework to flexibly reallocate robots to tasks based on the most degraded capabilities within each task. In the face of resource limitations and adverse environmental conditions, our algorithm minimally relaxes the resource constraints corresponding to some tasks, thus exhibiting a graceful degradation of performance. Simulated experiments in a multi-robot coverage and target tracking scenario demonstrate the efficacy of the proposed approach.

preprint, 2021, arXiv

ROS-NetSim: A Framework for the Integration of Robotic and Network Simulators

Multi-agent systems play an important role in modern robotics. Due to the nature of these systems, coordination among agents via communication is frequently necessary. Indeed, Perception-Action-Communication (PAC) loops, or Perception-Action loops closed over a communication channel, are a critical component of multi-robot systems. However, we lack appropriate tools for simulating PAC loops. To that end, in this paper, we introduce ROS-NetSim, a ROS package that acts as an interface between robotic and network simulators. With ROS-NetSim, we can attain high-fidelity representations of both robotic and network interactions by accurately simulating the PAC loop. Our proposed approach is lightweight, modular and adaptive. Furthermore, it can be used with many available network and physics simulators by making use of our proposed interface. In summary, ROS-NetSim is (i) Transparent to the ROS target application, (ii) Agnostic to the specific network and physics simulator being used, and (iii) Tunable in fidelity and complexity. As part of our contribution, we have made available an open-source implementation of ROS-NetSim to the community.

preprint, 2020, arXiv

Feedback Enhanced Motion Planning for Autonomous Vehicles

In this work, we address the motion planning problem for autonomous vehicles through a new lattice planning approach, called Feedback Enhanced Lattice Planner (FELP). Existing lattice planners have two major limitations, namely the high dimensionality of the lattice and the lack of modeling of agent vehicle behaviors. We propose to apply the Intelligent Driver Model (IDM) as a speed feedback policy to address both of these limitations. IDM both enables the responsive behavior of the agents, and uniquely determines the acceleration and speed profile of the ego vehicle on a given path. Therefore, only a spatial lattice is needed, while discretization of higher order dimensions is no longer required. Additionally, we propose a directed-graph map representation to support the implementation and execution of lattice planners. The map can reflect local geometric structure, embed the traffic rules adhering to the road, and is efficient to construct and update. We show that FELP is more efficient compared to other existing lattice planners through runtime complexity analysis, and we propose two variants of FELP to further reduce the complexity to polynomial time. We demonstrate the improvement by comparing FELP with an existing spatiotemporal lattice planner using simulations of a merging scenario and continuous highway traffic. We also study the performance of FELP under different traffic densities.
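
The claim that a speed feedback policy fixes the speed profile along a given path can be illustrated with the standard Intelligent Driver Model equation. The sketch below is not FELP and does not reproduce the paper's planner. The parameter values, the clamped desired-gap variant, and the names `IDMParams`, `idm_acceleration`, and `speed_profile` are assumptions made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class IDMParams:
    # Illustrative values only; not taken from the paper.
    desired_speed: float = 30.0  # v0, m/s
    time_headway: float = 1.5    # T, s
    max_accel: float = 1.0       # a, m/s^2
    comfort_decel: float = 1.5   # b, m/s^2
    min_gap: float = 2.0         # s0, m
    exponent: float = 4.0        # delta


def idm_acceleration(speed, gap, lead_speed, p=IDMParams()):
    """IDM acceleration of a vehicle following a leader at distance `gap`."""
    closing_speed = speed - lead_speed
    # Desired gap s*; the dynamic part is clamped at zero (a common variant).
    desired_gap = p.min_gap + max(
        0.0,
        speed * p.time_headway
        + speed * closing_speed / (2.0 * math.sqrt(p.max_accel * p.comfort_decel)),
    )
    free_road = 1.0 - (speed / p.desired_speed) ** p.exponent
    interaction = (desired_gap / gap) ** 2
    return p.max_accel * (free_road - interaction)


def speed_profile(initial_speed, initial_gap, lead_speed, dt=0.1, steps=600):
    """Euler rollout behind a leader that drives at constant speed."""
    speed, gap = initial_speed, initial_gap
    profile = [speed]
    for _ in range(steps):
        accel = idm_acceleration(speed, gap, lead_speed)
        speed = max(0.0, speed + accel * dt)  # vehicles do not reverse
        gap += (lead_speed - speed) * dt
        profile.append(speed)
    return profile


if __name__ == "__main__":
    profile = speed_profile(initial_speed=25.0, initial_gap=50.0, lead_speed=20.0)
    print(profile[0], profile[-1])
```

Once the initial state and the leader's motion are fixed, the rollout is deterministic. This is why a planner built on such a policy only has to search over spatial paths and not over speed or acceleration.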

preprint, 2020, arXiv

Mine Tunnel Exploration using Multiple Quadrupedal Robots

Robotic exploration of underground environments is a particularly challenging problem due to communication, endurance, and traversability constraints which necessitate high degrees of autonomy and agility. These challenges are further exacerbated by the need to minimize human intervention for practical applications. While legged robots have the ability to traverse extremely challenging terrain, they also engender new challenges for planning, estimation, and control. In this work, we describe a fully autonomous system for multi-robot mine exploration and mapping using legged quadrupeds, as well as a distributed database mesh networking system for reporting data. In addition, we show results from the DARPA Subterranean Challenge (SubT) Tunnel Circuit demonstrating localization of artifacts after traversals of hundreds of meters. These experiments describe fully autonomous exploration of an unknown Global Navigation Satellite System (GNSS)-denied environment undertaken by legged robots.

preprint, 2020, arXiv

Mobile Wireless Network Infrastructure on Demand

In this work, we introduce Mobile Wireless Infrastructure on Demand: a framework for providing wireless connectivity to multi-robot teams via autonomously reconfiguring ad-hoc networks. In many cases, previous multi-agent systems either assumed the availability of existing communication infrastructure or were required to create a network in addition to completing their objective. Instead, our system explicitly assumes the responsibility of creating and sustaining a wireless network capable of satisfying end-to-end communication requirements of a team of agents, called the task team, performing an arbitrary objective. To accomplish this goal, we propose a joint optimization framework that alternates between finding optimal network routes to support data flows between the task agents and improving the performance of the network by repositioning a collection of mobile relay nodes referred to as the network team. We demonstrate our approach with simulations and experiments wherein wireless connectivity is provided to patrolling task agents.

preprint, 2020, arXiv

Perimeter-defense Game between Aerial Defender and Ground Intruder

We study a variant of the pursuit-evasion game in the context of perimeter defense. In this problem, the intruder aims to reach the base plane of a hemisphere without being captured by the defender, while the defender tries to capture the intruder. The perimeter-defense game was previously studied under the assumption that the defender moves on a circle. We extend the problem to the case where the defender moves on a hemisphere. To solve this problem, we analyze the strategies based on the breaching point at which the intruder tries to reach the target and predict the goal position, defined as the optimal breaching point, that is achieved by the optimal strategies of both players. We provide the barrier that divides the state space into defender-winning and intruder-winning regions and prove that the optimal strategies for both players are to move towards the optimal breaching point. Simulation results are presented to demonstrate that the optimality of the game is given as a Nash equilibrium.

preprint, 2020, arXiv

Robust, Perception Based Control with Quadrotors

Traditionally, controllers and state estimators in robotic systems are designed independently. Controllers are often designed assuming perfect state estimation. However, state estimation methods such as Visual Inertial Odometry (VIO) drift over time and can cause the system to misbehave. While state estimation error can be corrected with the aid of GPS or motion capture, these complementary sensors are not always available or reliable. Recent work has shown that this issue can be dealt with by synthesizing robust controllers using a data-driven characterization of the perception error, and can bound the system's response to state estimation error using a robustness constraint. We investigate the application of this robust perception-based approach to a quadrotor model using VIO for state estimation and demonstrate the benefits and drawbacks of using this technique in simulation and hardware. Additionally, to make tuning easier, we introduce a new cost function to use in the control synthesis which allows one to take an existing controller and "robustify" it. To the best of our knowledge, this is the first robust perception-based controller implemented in real hardware, as well as one utilizing a data-driven perception model. We believe this is an important step towards safe, robust robots that explicitly account for the inherent dependence between perception and control.

preprint, 2020, arXiv

TLIO: Tight Learned Inertial Odometry

In this work we propose a tightly-coupled Extended Kalman Filter framework for IMU-only state estimation. Strap-down IMU measurements provide relative state estimates based on the IMU kinematic motion model. However, the integration of measurements is sensitive to sensor bias and noise, causing significant drift within seconds. Recent research by Yan et al. (RoNIN) and Chen et al. (IONet) showed the capability of using trained neural networks to obtain accurate 2D displacement estimates from segments of IMU data and obtained good position estimates from concatenating them. This paper demonstrates a network that regresses 3D displacement estimates and their uncertainty, giving us the ability to tightly fuse the relative state measurement into a stochastic cloning EKF to solve for pose, velocity and sensor biases. We show that our network, trained with pedestrian data from a headset, can produce statistically consistent measurements and uncertainties to be used in the update step of the filter, and that the tightly-coupled system outperforms velocity integration approaches in position estimates and an AHRS attitude filter in orientation estimates.

preprint, 2020, arXiv

Vision-based Multi-MAV Localization with Anonymous Relative Measurements Using Coupled Probabilistic Data Association Filter

We address the localization of robots in a multi-MAV system where external infrastructure like GPS or motion capture systems may not be available. Our approach lends itself to implementation on platforms with several constraints on size, weight, and power (SWaP). In particular, our framework fuses the onboard VIO with anonymous, vision-based robot-to-robot detection to estimate all robot poses in one common frame, addressing three main challenges: 1) the initial configuration of the robot team is unknown, 2) the data association between each vision-based detection and robot targets is unknown, and 3) the vision-based detection yields false negatives and false positives and provides inaccurate, noisy bearing and distance measurements of other robots. Our approach extends the Coupled Probabilistic Data Association Filter (CPDAF)[1] to cope with nonlinear measurements. We demonstrate the superior performance of our approach over a simple VIO-based method in a simulation with the measurement models statistically modeled using the real experimental data. We also show how onboard sensing, estimation, and control can be used for formation flight.

preprint, 2019, arXiv

Cooperative Team Strategies for Multi-player Perimeter-Defense Games

This paper studies a variant of the multi-player reach-avoid game played between intruders and defenders with applications to perimeter defense. The intruder team tries to score by sending as many intruders as possible to the target area, while the defender team tries to minimize this score by intercepting them. Finding the optimal strategies of the game is challenging due to the high dimensionality of the joint state space, and the existing works have proposed approximation methods to reduce the design of the defense strategy into assignment problems. However, they suffer from either suboptimal defender performance or computational complexity. Based on a novel decomposition method, this paper proposes a scalable (polynomial-time) assignment algorithm that accommodates cooperative behaviors and outperforms the existing defense strategies. For a certain class of initial configurations, we derive the exact score by showing that the lower bound provided by the intruder team matches the upper bound provided by the defender team, which also proves the optimality of the team strategies.

preprint, 2019, arXiv

MAMPS: Safe Multi-Agent Reinforcement Learning via Model Predictive Shielding

Reinforcement learning is a promising approach to learning control policies for performing complex multi-agent robotics tasks. However, a policy learned in simulation often fails to guarantee even simple safety properties such as obstacle avoidance. To ensure safety, we propose multi-agent model predictive shielding (MAMPS), an algorithm that provably guarantees safety for an arbitrary learned policy. In particular, it operates by using the learned policy as often as possible, but instead uses a backup policy in cases where it cannot guarantee the safety of the learned policy. Using a multi-agent simulation environment, we show how MAMPS can achieve good performance while ensuring safety.
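
The shielding rule described here (use the learned policy whenever a backup policy could still recover afterwards, otherwise fall back to the backup) can be shown on a toy problem. The sketch below is a single-agent, one-dimensional illustration of that idea only. It is not the MAMPS algorithm and does not model multiple agents; the dynamics, the constants, and all function names are assumptions.

```python
from typing import Callable, Tuple

State = Tuple[float, float]  # (position, velocity) of a robot on a line

DT = 0.1         # time step in seconds (assumed)
MAX_BRAKE = 2.0  # deceleration used by the backup policy, m/s^2 (assumed)
WALL = 10.0      # obstacle position in metres (assumed)


def step(state: State, accel: float) -> State:
    """Toy dynamics; velocity is clamped at zero so the robot never reverses."""
    pos, vel = state
    vel = max(0.0, vel + accel * DT)
    return pos + vel * DT, vel


def backup_policy(state: State) -> float:
    """Backup controller: always brake as hard as possible."""
    return -MAX_BRAKE


def is_recoverable(state: State) -> bool:
    """True if following the backup policy from `state` stops before the wall."""
    while True:
        if state[0] >= WALL:
            return False
        if state[1] == 0.0:
            return True
        state = step(state, backup_policy(state))


def shielded_action(state: State, learned_policy: Callable[[State], float]) -> float:
    """Keep the learned action only if the next state is still recoverable."""
    proposed = learned_policy(state)
    if is_recoverable(step(state, proposed)):
        return proposed
    return backup_policy(state)


if __name__ == "__main__":
    reckless = lambda s: 1.0  # stand-in for a learned policy that always accelerates
    state = (0.0, 0.0)
    for _ in range(500):
        state = step(state, shielded_action(state, reckless))
    print("final position:", state[0], "wall at:", WALL)
```

The safety argument is an invariant: if the current state is recoverable, then either the proposed action leads to a recoverable state, or the backup action moves one step along a trajectory that is already known to stop in time.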

preprint, 2019, arXiv

Nuclear Environments Inspection with Micro Aerial Vehicles: Algorithms and Experiments

In this work, we address the estimation, planning, control and mapping problems to allow a small quadrotor to autonomously inspect the interior of hazardous damaged nuclear sites. These algorithms run onboard on a computationally limited CPU. We investigate the effect of varying illumination on the system performance. To the best of our knowledge, this is the first fully autonomous system of this size and scale applied to inspect the interior of a full scale mock-up of a Primary Containment Vessel (PCV). The proposed solution opens up new ways to inspect nuclear reactors and to support nuclear decommissioning, which is well known to be a dangerous, long and tedious process. Experimental results with varying illumination conditions show the ability to navigate a full scale mock-up PCV pedestal and create a map of the environment, while concurrently avoiding obstacles.

preprint, 2019, arXiv

SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

This paper describes an end-to-end pipeline for tree diameter estimation based on semantic segmentation and lidar odometry and mapping. Accurate mapping of this type of environment is challenging since the ground and the trees are surrounded by leaves, thorns and vines, and the sensor typically experiences extreme motion. We propose a semantic feature based pose optimization that simultaneously refines the tree models while estimating the robot pose. The pipeline utilizes a custom virtual reality tool for labeling 3D scans that is used to train a semantic segmentation network. The masked point cloud is used to compute a trellis graph that identifies individual instances and extracts relevant features that are used by the SLAM module. We show that traditional lidar and image based methods fail in the forest environment on both Unmanned Aerial Vehicle (UAV) and hand-carry systems, while our method is more robust, scalable, and automatically generates tree diameter estimations.

preprint, 2016, arXiv

Algorithms for Visibility-Based Monitoring with Robot Teams

We study the problem of planning paths for a team of robots for visually monitoring an environment. Our work is motivated by surveillance and persistent monitoring applications. We are given a set of target points in a polygonal environment that must be monitored using robots with cameras. The goal is to compute paths for all robots such that every target is visible from at least one path. In its general form, this problem is NP-hard as it generalizes the Art Gallery Problem and the Watchman Route Problem. We study two versions: (i) a geometric version in street polygons for which we give a polynomial time 4-approximation algorithm; and (ii) a general version for which we present a practical solution that finds the optimal solution in possibly exponential time. In addition to theoretical proofs, we also present results from simulation studies.

preprint2016arXiv

DimensionApp : android app to estimate object dimensions

In this project, we develop an Android app that uses computer vision techniques to estimate the dimensions of an object in the field of view. The app, while compact in size, is accurate up to +/- 5 mm and robust to touch inputs. We use single-view metrology to compute accurate measurements. Unlike previous approaches, our technique does not rely on line detection and can be generalized easily to any object shape.

preprint2016arXiv

Next Generation Robotics

The National Robotics Initiative (NRI) was launched in 2011 and is about to celebrate its 5-year anniversary. In parallel with the NRI, the robotics community, with support from the Computing Community Consortium, engaged in a series of road-mapping exercises. The first version of the roadmap appeared in September 2009; a second updated version appeared in 2013. While not directly aligned with the NRI, these road-mapping documents have provided both a useful charting of the robotics research space and a metric by which to measure progress. This report sets forth a perspective of progress in robotics over the past five years, and provides a set of recommendations for the future. The NRI has in its formulation a strong emphasis on co-robots, i.e., robots that work directly with people. An obvious question is whether this should continue to be the focus going forward. To assess the main trends, what has happened over the last 5 years, and what may be promising directions for the future, a small CCC-sponsored study was launched with two workshops, one in Washington DC (March 5th, 2016) and another in San Francisco, CA (March 11th, 2016). In this report we briefly summarize some of the main discussions and observations from those workshops. We will present a variety of background information in Section 2, and outline various issues related to progress over the last 5 years in Section 3. In Section 4 we will outline a number of opportunities for moving forward. Finally, we will summarize the main points in Section 5.

preprint2016arXiv

Toward a Science of Autonomy for Physical Systems

Our lives have been immensely improved by decades of automation research -- we are more comfortable, more productive and safer than ever before. Just imagine a world where familiar automation technologies have failed. In that world, thermostats don't work -- you have to monitor your home heating system manually. Cruise control for your car doesn't exist. Every elevator has to have a human operator to hit the right floor, most manufactured products are assembled by hand, and you have to wash your own dishes. Who would willingly adopt that world -- the world of last century -- today? Physical systems -- elevators, cars, home appliances, manufacturing equipment -- were more troublesome, more time consuming, less safe, and far less convenient. Now, suppose we put ourselves in the place of someone 20 years in the future, a future of autonomous systems. A future where transportation is largely autonomous, more efficient, and far safer; a future where dangerous occupations like mining or disaster response are performed by autonomous systems supervised remotely by humans; a future where manufacturing and healthcare are twice as productive per person-hour by having smart monitoring and readily re-tasked autonomous physical agents; a future where the elderly and infirm have 24 hour in-home autonomous support for the basic activities, both physical and social, of daily life. In a future world where these capabilities are commonplace, why would someone come back to today's world, where someone has to put their life at risk to do a menial job, where we lose time to mindless activities that have no intrinsic value, or where we are consumed with worry that a loved one is at risk in their own home? In what follows, and in a series of associated essays, we expand on these ideas, and frame both the opportunities and challenges posed by autonomous physical systems.

preprint2016arXiv

Toward a Science of Autonomy for Physical Systems: Aerial Earth Science

Unmanned Aerial Vehicles (UAVs) equipped with LiDAR, electro-optical and infrared cameras, SAR and atmospheric sensors have transformed the way we acquire high spatio-temporal resolution data. For example, UAVs equipped with these sensors have been able to obtain topography at resolutions of less than one meter, revolutionizing earth sciences. Surface processes act at spatial scales on the order of a meter to produce intricate landforms, and UAVs equipped with these sensors are able to measure the three-dimensional spatio-temporal geometry of the Earth's surface and overlying anthropogenic features and vegetation at resolutions appropriate to document these processes. In addition, surface changes due to erosion, transport and sedimentation, as well as earthquakes, landslides, and volcanoes, can be quantified with this data.

preprint2015arXiv

Clustering by transitive propagation

We present a global optimization algorithm for clustering data given the ratio of likelihoods that each pair of data points is in the same cluster or in different clusters. To define a clustering solution in terms of pairwise relationships, a necessary and sufficient condition is that belonging to the same cluster satisfies transitivity. We define a global objective function based on pairwise likelihood ratios and a transitivity constraint over all triples, assigning an equal prior probability to all clustering solutions. We maximize the objective function by implementing max-sum message passing on the corresponding factor graph to arrive at an O(N^3) algorithm. Lastly, we demonstrate an application inspired by mutational sequencing for decoding random binary words transmitted through a noisy channel.

preprint2015arXiv

Ecosystem: A Characteristic Of Crowdsourced Environments

The phenomenal success of certain crowdsourced online platforms, such as Wikipedia, is credited to their ability to tap the crowd's potential to collaboratively build knowledge. While it is well known that the crowd's collective wisdom surpasses the cumulative individual expertise, little is understood about the dynamics of knowledge building in a crowdsourced environment. A proper understanding of these dynamics would enable better design of such environments to solicit knowledge from the crowd. Our experiment on crowdsourced systems based on annotations shows that an important reason for the rapid knowledge building in such environments is variance in expertise. First, we used as our test bed a customized Crowdsourced Annotation System (CAS), which lets a group of users annotate a given document while trying to understand it. Our results showed the presence of different genres of proficiency amongst the users of an annotation system. We observed that the ecosystem in a crowdsourced annotation system comprised mainly four categories of contributors, namely: Probers, Solvers, Articulators and Explorers. We inferred from our experiment that the knowledge garnering mainly happens due to the synergetic interaction across these categories. Further, we conducted an analysis on the datasets of Wikipedia and Stack Overflow and noticed the ecosystem's presence in these portals as well. From this study, we claim that the ecosystem is a universal characteristic of all crowdsourced portals.

preprint2015arXiv

Experimental Characterization of a Bearing-only Sensor for Use With the PHD Filter

This report outlines the procedure and results of an experiment to characterize a bearing-only sensor for use with the PHD filter. The resulting detection, measurement, and clutter models are used for hardware and simulated experiments with a team of mobile robots autonomously seeking an unknown number of objects of interest in an office environment.

preprint2014arXiv

Generation of isolated asymmetric umbilics in light's polarization

Polarization-singularity C-points, a form of line singularities, are the vectorial counterparts of the optical vortices of spatial modes and fundamental optical features of polarization-spatial modes. Their generation in tailored beams has been limited to lemon and star C-points that contain symmetric dislocations in state-of-polarization patterns. In this article we present the theory and laboratory measurements of two complementary methods to generate isolated asymmetric C-points in tailored beams, of which symmetric lemons and stars are limiting cases; and we report on the generation of monstars, an asymmetric C-point with characteristics of both lemons and stars.

preprint2014arXiv

New Methods for Characterizing Phases of 2D Supersymmetric Gauge Theories

We study the physics of two-dimensional N=(2,2) gauged linear sigma models (GLSMs) via the two-sphere partition function. We show that the classical phase boundaries separating distinct GLSM phases, which are described by the secondary fan construction for abelian GLSMs, are completely encoded in the analytic structure of the partition function. The partition function of a non-abelian GLSM can be obtained as a limit from an abelian theory; we utilize this fact to show that the phases of non-abelian GLSMs can be obtained from the secondary fan of the associated abelian GLSM. We prove that the partition function of any abelian GLSM satisfies a set of linear differential equations; these reduce to the familiar A-hypergeometric system of Gel'fand, Kapranov, and Zelevinski for GLSMs describing complete intersections in toric varieties. We develop a set of conditions that are necessary for a GLSM phase to admit an interpretation as the low-energy limit of a non-linear sigma model with a Calabi-Yau threefold target space. Through the application of these criteria we discover a class of GLSMs with novel geometric phases corresponding to Calabi-Yau manifolds that are branched double-covers of Fano threefolds. These criteria provide a promising approach for constructing new Calabi-Yau geometries.

preprint2013arXiv

Technical Report: Cooperative Multi-Target Localization With Noisy Sensors

This technical report is an extended version of the paper 'Cooperative Multi-Target Localization With Noisy Sensors' accepted to the 2013 IEEE International Conference on Robotics and Automation (ICRA). This paper addresses the task of searching for an unknown number of static targets within a known obstacle map using a team of mobile robots equipped with noisy, limited field-of-view sensors. Such sensors may fail to detect a subset of the visible targets or return false positive detections. These measurement sets are used to localize the targets using the Probability Hypothesis Density, or PHD, filter. Robots communicate with each other on a local peer-to-peer basis and with a server or the cloud via access points, exchanging measurements and poses to update their belief about the targets and plan future actions. The server provides a mechanism to collect and synthesize information from all robots and to share the global, albeit time-delayed, belief state to robots near access points. We design a decentralized control scheme that exploits this communication architecture and the PHD representation of the belief state. Specifically, robots move to maximize mutual information between the target set and measurements, both self-collected and those available by accessing the server, balancing local exploration with sharing knowledge across the team. Furthermore, robots coordinate their actions with other robots exploring the same local region of the environment.

preprint2013arXiv

Two-Sphere Partition Functions and Gromov-Witten Invariants

Many N=(2,2) two-dimensional nonlinear sigma models with Calabi-Yau target spaces admit ultraviolet descriptions as N=(2,2) gauge theories (gauged linear sigma models). We conjecture that the two-sphere partition function of such ultraviolet gauge theories -- recently computed via localization by Benini et al. and Doroud et al. -- yields the exact Kähler potential on the quantum Kähler moduli space for Calabi-Yau threefold target spaces. In particular, this allows one to compute the genus zero Gromov-Witten invariants for any such Calabi-Yau threefold without the use of mirror symmetry. More generally, when the infrared superconformal fixed point is used to compactify string theory, this provides a direct method to compute the spacetime Kähler potential of certain moduli (e.g., vector multiplet moduli in type IIA), exactly in α'. We compute these quantities for the quintic and for Rødland's Pfaffian Calabi-Yau threefold and find agreement with existing results in the literature. We then apply our methods to a codimension four determinantal Calabi-Yau threefold in P^7, recently given a nonabelian gauge theory description by the present authors, for which no mirror Calabi-Yau is currently known. We derive predictions for its Gromov-Witten invariants and verify that our predictions satisfy nontrivial geometric checks.

preprint2012arXiv

Freedom and Constraints in the K3 Landscape

We consider "magnetized brane" compactifications of the type I/heterotic string on K3 with U(1) background fluxes. The nonabelian gauge group and matter content of the resulting six-dimensional vacua are parameterized by a matrix encoding a lattice contained within the even, self-dual lattice Gamma^{3,19}. Mathematical results of Nikulin on lattice embeddings make possible a simple classification of such solutions. This approach makes it possible to explicitly and efficiently construct models in this class with a particular allowed gauge group and matter content, so that one can immediately "dial-a-model" with desired properties.

preprint2012arXiv

Invariants for Homology Classes with Application to Optimal Search and Planning Problem in Robotics

We consider planning problems on punctured Euclidean spaces, $\mathbb{R}^D - \widetilde{\mathcal{O}}$, where $\widetilde{\mathcal{O}}$ is a collection of obstacles. Such spaces occur frequently as configuration spaces of robots, where $\widetilde{\mathcal{O}}$ represents either physical obstacles that the robots need to avoid (e.g., walls, other robots, etc.) or illegal states (e.g., all legs off-the-ground). As state-planning is translated to path-planning on a configuration space, we collate equivalent plannings via topologically-equivalent paths. This prompts finding or exploring the different homology classes in such environments and finding representative optimal trajectories in each such class. In this paper we start by considering the problem of finding a complete set of easily computable homology class invariants for $(N-1)$-cycles in $(\mathbb{R}^D - \widetilde{\mathcal{O}})$. We achieve this by finding explicit generators of the $(N-1)^{st}$ de Rham cohomology group of this punctured Euclidean space, and using their integrals to define cocycles. The action of those dual cocycles on $(N-1)$-cycles gives the desired complete set of invariants. We illustrate the computation through examples. We further show that, due to the integral approach, this complete set of invariants is well-suited for efficient search-based planning of optimal robot trajectories with topological constraints. Finally, we extend this approach to the computation of invariants in spaces derived from $(\mathbb{R}^D - \widetilde{\mathcal{O}})$ by collapsing a subspace, thereby permitting application to a wider class of non-Euclidean ambient spaces.

preprint2012arXiv

Mapping 6D N = 1 supergravities to F-theory

We develop a systematic framework for realizing general anomaly-free chiral 6D supergravity theories in F-theory. We focus on 6D (1, 0) models with one tensor multiplet whose gauge group is a product of simple factors (modulo a finite abelian group) with matter in arbitrary representations. Such theories can be decomposed into blocks associated with the simple factors in the gauge group; each block depends only on the group factor and the matter charged under it. All 6D chiral supergravity models can be constructed by gluing such blocks together in accordance with constraints from anomalies. Associating a geometric structure to each block gives a dictionary for translating a supergravity model into a set of topological data for an F-theory construction. We construct the dictionary of F-theory divisors explicitly for some simple gauge group factors and associated matter representations. Using these building blocks we analyze a variety of models. We identify some 6D supergravity models which do not map to integral F-theory divisors, possibly indicating quantum inconsistency of these 6D theories.

preprint2012arXiv

Nonabelian 2D Gauge Theories for Determinantal Calabi-Yau Varieties

The two-dimensional supersymmetric gauged linear sigma model (GLSM) with abelian gauge groups and matter fields has provided many insights into string theory on Calabi-Yau manifolds of a certain type: complete intersections in toric varieties. In this paper, we consider two GLSM constructions with nonabelian gauge groups and charged matter whose infrared CFTs correspond to string propagation on determinantal Calabi-Yau varieties, furnishing another broad class of Calabi-Yau geometries in addition to complete intersections. We show that these two models -- which we refer to as the PAX and the PAXY model -- are dual descriptions of the same low-energy physics. Using GLSM techniques, we determine the quantum Kähler moduli space of these varieties and find no disagreement with existing results in the literature.

preprint2011arXiv

6D supergravity without tensor multiplets

We systematically investigate the finite set of possible gauge groups and matter content for N = 1 supergravity theories in six dimensions with no tensor multiplets, focusing on nonabelian gauge groups which are a product of SU(N) factors. We identify a number of models which obey all known low-energy consistency conditions, but which have no known string theory realization. Many of these models contain novel matter representations, suggesting possible new string theory constructions. Many of the most exotic matter structures arise in models which precisely saturate the gravitational anomaly bound on the number of hypermultiplets. Such models have a rigid symmetry structure, in the sense that there are no moduli which leave the full gauge group unbroken.

preprint2011arXiv

A Homotopy-like Class Invariant for Sub-manifolds of Punctured Euclidean Spaces

We consider the $D$-dimensional Euclidean space, $\mathbb{R}^D$, with certain $(D-N)$-dimensional compact, closed and orientable sub-manifolds (which we call \emph{singularity manifolds} and represent by $\widetilde{\mathcal{S}}$) removed from it. We define and investigate the problem of finding a homotopy-like class invariant ($χ$-homotopy) for certain $(N-1)$-dimensional compact, closed and orientable sub-manifolds (which we call \emph{candidate manifolds} and represent by $ω$) of $\mathbb{R}^D \setminus \widetilde{\mathcal{S}}$, with special emphasis on computational aspects of the problem. We determine a differential $(N-1)$-form, $ψ_{\widetilde{\mathcal{S}}}$, such that $χ_{\widetilde{\mathcal{S}}}(ω) = \int_ω ψ_{\widetilde{\mathcal{S}}}$ is a class invariant for such candidate manifolds. We show that the formula agrees with formulae from the Cauchy integral theorem and the residue theorem of complex analysis (when $D=2,N=2$), the Biot-Savart law and Ampere's law of the theory of electromagnetism (when $D=3,N=2$), and the Gauss divergence theorem (when $D=3,N=3$), and discover that the underlying equivalence relation suggested by each of these well-known theorems is the $χ$-homotopy of sub-manifolds of these low dimensional punctured Euclidean spaces. We describe numerical techniques for computing $ψ_{\widetilde{\mathcal{S}}}$ and its integral on $ω$, and give numerical validations of the proposed theory for a problem in a 5-dimensional Euclidean space. We also discuss a specific application from the \emph{robot path planning problem}, when $N=2$, and describe a method for computing least cost paths with homotopy class constraints using \emph{graph search techniques}.

preprint2011arXiv

N=1 Sigma Models in AdS_4

We study sigma models in AdS_4 with global N=1 supersymmetry and find that they differ significantly from their flat-space cousins -- the target space is constrained to be a Kähler manifold with an exact Kähler form, the superpotential transforms under Kähler transformations, the space of supersymmetric vacua is generically a set of isolated points even when the superpotential vanishes, and the R-symmetry is classically broken by the cosmological constant. Remarkably, the exactness of the Kähler class is also required for the sigma model to arise as a decoupling limit of N=1 supergravity, and ensures the vanishing of gravitational anomalies. As simple applications of these results, we argue that fields with AdS_4 scale masses are ubiquitous in, for example, type IIB N=1 AdS_4 vacua stabilized near large volume; we also show that the Affleck-Dine-Seiberg runaway of N_f < N_c SQCD is regulated by considering the theory in AdS_4.

preprint2010arXiv

Global aspects of the space of 6D N = 1 supergravities

We perform a global analysis of the space of consistent 6D quantum gravity theories with N = 1 supersymmetry, including models with multiple tensor multiplets. We prove that for theories with fewer than T = 9 tensor multiplets, a finite number of distinct gauge groups and matter content are possible. We find infinite families of field combinations satisfying anomaly cancellation and admitting physical gauge kinetic terms for T > 8. We find an integral lattice associated with each apparently-consistent supergravity theory; this lattice is determined by the form of the anomaly polynomial. For models which can be realized in F-theory, this anomaly lattice is related to the intersection form on the base of the F-theory elliptic fibration. The condition that a supergravity model have an F-theory realization imposes constraints which can be expressed in terms of this lattice. The analysis of models which satisfy known low-energy consistency conditions and yet violate F-theory constraints suggests possible novel constraints on low-energy supergravity theories.

preprint2009arXiv

A bound on 6D N=1 supergravities

We prove that there are only finitely many distinct semi-simple gauge groups and matter representations possible in consistent 6D chiral (1,0) supergravity theories with one tensor multiplet. The proof relies only on features of the low-energy theory; the consistency conditions we impose are that anomalies should be cancelled by the Green-Schwarz mechanism, and that the kinetic terms for all fields should be positive in some region of moduli space. This result does not apply to the case of the non-chiral (1,1) supergravities, which are not constrained by anomaly cancellation.

preprint2009arXiv

Comprehensive study of sodium, copper, and silver clusters over a wide range of sizes 2=<N=<75

The geometric and electronic structures of Na$_N$, Cu$_N$, and Ag$_N$ metal clusters are systematically studied based on density functional theory over a wide range of cluster sizes $2 \le N \le 75$. A remarkable similarity is observed between the optimized geometric structures of alkali and noble metal clusters over all of the calculated cluster sizes $N$. The most stable structures are the same for the three different metal clusters for approximately half the cluster sizes $N$ considered in this study. Even if the most stable structures are different, the same types of structures are obtained when the meta-stable structures are also considered. For all three metal clusters, the cluster shapes change in the order of linear, planar, opened, and closed structures with increasing $N$. This structural type transition leads to a deviation from the monotonic increase in the volume with $N$. A remarkable similarity is also observed for the $N$ dependence of the cluster energy $E(N)$ for the most stable geometric structures. The amplitude of this energy difference is larger in the two noble metal clusters than in the alkali metal cluster. This is attributed to the contribution of $d$ electrons to the bonds. The magic number is defined in the framework of total energy calculations for the first time. In the case of Na$_N$, a semi-quantitative comparison between the experimental abundance spectra (Knight et al., Phys. Rev. Lett., 52, 2141 (1984)) and the total energy calculations is carried out. The changing aspect of the Kohn-Sham eigenvalues from $N=2$ to $N=75$ is presented for the three different metal clusters. The feature of the bulk density of states already appears at $N=75$ for all three clusters. With increasing $N$, the HOMO-LUMO gap clearly exhibits an odd-even alternation and converges to 0.

47 published item(s)

LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration

Retrieval Mechanisms Surpass Long-Context Scaling in Time Series Forecasting

ADMM-MCBF-LCA: A Layered Control Architecture for Safe Real-Time Navigation

Adaptive Sampling of Latent Phenomena using Heterogeneous Robot Teams (ASLaP-HR)

Any Way You Look At It: Semantic Crossview Localization and Mapping with LiDAR

Graph Neural Networks for Decentralized Multi-Robot Submodular Action Selection

Learning Connectivity-Maximizing Network Configurations

Resilient Edge: Can we achieve Network Resiliency at the IoT Edge using LPWAN and WiFi?

Stronger Together: Air-Ground Robotic Collaboration Using Semantics

Belief Space Planning for Mobile Robots with Range Sensors using iLQG

Fair Robust Assignment using Redundancy

Large Scale Distributed Collaborative Unlabeled Motion Planning with Graph Policy Gradients

Resilient Task Allocation in Heterogeneous Multi-Robot Systems

ROS-NetSim: A Framework for the Integration of Robotic and Network Simulators

Feedback Enhanced Motion Planning for Autonomous Vehicles

Mine Tunnel Exploration using Multiple Quadrupedal Robots

Mobile Wireless Network Infrastructure on Demand

Perimeter-defense Game between Aerial Defender and Ground Intruder

Robust, Perception Based Control with Quadrotors

TLIO: Tight Learned Inertial Odometry

Vision-based Multi-MAV Localization with Anonymous Relative Measurements Using Coupled Probabilistic Data Association Filter

Cooperative Team Strategies for Multi-player Perimeter-Defense Games

MAMPS: Safe Multi-Agent Reinforcement Learning via Model Predictive Shielding

Nuclear Environments Inspection with Micro Aerial Vehicles: Algorithms and Experiments

SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

Algorithms for Visibility-Based Monitoring with Robot Teams

DimensionApp : android app to estimate object dimensions

Next Generation Robotics

Toward a Science of Autonomy for Physical Systems

Toward a Science of Autonomy for Physical Systems: Aerial Earth Science

Clustering by transitive propagation

Ecosystem: A Characteristic Of Crowdsourced Environments

Experimental Characterization of a Bearing-only Sensor for Use With the PHD Filter

Generation of isolated asymmetric umbilics in light's polarization

New Methods for Characterizing Phases of 2D Supersymmetric Gauge Theories

Technical Report: Cooperative Multi-Target Localization With Noisy Sensors

Two-Sphere Partition Functions and Gromov-Witten Invariants

Freedom and Constraints in the K3 Landscape

Invariants for Homology Classes with Application to Optimal Search and Planning Problem in Robotics

Mapping 6D N = 1 supergravities to F-theory

Nonabelian 2D Gauge Theories for Determinantal Calabi-Yau Varieties

6D supergravity without tensor multiplets

A Homotopy-like Class Invariant for Sub-manifolds of Punctured Euclidean Spaces

N=1 Sigma Models in AdS_4

Global aspects of the space of 6D N = 1 supergravities

A bound on 6D N=1 supergravities

Comprehensive study of sodium, copper, and silver clusters over a wide range of sizes 2=<N=<75