Source author record

Fumio Kanehiro

Fumio Kanehiro appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Robotics

Catalog footprint

What is connected

3works

2topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Instance-specific 6-DoF Object Pose Estimation from Minimal Annotations

In many robotic applications, the environment setting in which the 6-DoF pose estimation of a known, rigid object and its subsequent grasping is to be performed, remains nearly unchanging and might even be known to the robot in advance. In this paper, we refer to this problem as instance-specific pose estimation: the robot is expected to estimate the pose with a high degree of accuracy in only a limited set of familiar scenarios. Minor changes in the scene, including variations in lighting conditions and background appearance, are acceptable but drastic alterations are not anticipated. To this end, we present a method to rapidly train and deploy a pipeline for estimating the continuous 6-DoF pose of an object from a single RGB image. The key idea is to leverage known camera poses and rigid body geometry to partially automate the generation of a large labeled dataset. The dataset, along with sufficient domain randomization, is then used to supervise the training of deep neural networks for predicting semantic keypoints. Experimentally, we demonstrate the convenience and effectiveness of our proposed method to accurately estimate object pose requiring only a very small amount of manual annotation for training.

preprint2022arXiv

mc-mujoco: Simulating Articulated Robots with FSM Controllers in MuJoCo

For safe and reliable deployment of any robot controller on the real hardware platform, it is generally a necessary practice to comprehensively assess the performance of the controller with the specific robot in a realistic simulation environment beforehand. While there exist several software solutions that can provide the core physics engine for this purpose, it is often a cumbersome and error-prone effort to interface the simulation environment with the robot controller being evaluated. The controller may have a complex structure consisting of multiple states and transitions within a finite-state machine (FSM), and may even require input through a GUI. In this work, we present mc-mujoco -- an open-source software framework that forms an interface between the MuJoCo physics simulator and the mc-rtc robot control framework. We provide implementation details and describe the process for adding support for essentially any new robot. We also demonstrate and publish a sample FSM controller for bipedal locomotion and stable grasping of a rigid object by the HRP-5P humanoid robot in MuJoCo. The code and usage instructions for mc-mujoco, the developed robot modules, and the FSM controller are available online.

preprint2020arXiv

Rapid Pose Label Generation through Sparse Representation of Unknown Objects

Deep Convolutional Neural Networks (CNNs) have been successfully deployed on robots for 6-DoF object pose estimation through visual perception. However, obtaining labeled data on a scale required for the supervised training of CNNs is a difficult task - exacerbated if the object is novel and a 3D model is unavailable. To this end, this work presents an approach for rapidly generating real-world, pose-annotated RGB-D data for unknown objects. Our method not only circumvents the need for a prior 3D object model (textured or otherwise) but also bypasses complicated setups of fiducial markers, turntables, and sensors. With the help of a human user, we first source minimalistic labelings of an ordered set of arbitrarily chosen keypoints over a set of RGB-D videos. Then, by solving an optimization problem, we combine these labels under a world frame to recover a sparse, keypoint-based representation of the object. The sparse representation leads to the development of a dense model and the pose labels for each image frame in the set of scenes. We show that the sparse model can also be efficiently used for scaling to a large number of new scenes. We demonstrate the practicality of the generated labeled dataset by training a pipeline for 6-DoF object pose estimation and a pixel-wise segmentation network.