Source author record

Kentaro Wada

Kentaro Wada appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Computer Vision, Robotics, math.RT, math.QA, Artificial Intelligence, Machine Learning, math.CO, math.AG

Catalog footprint

What is connected

15 works

8 topics

4 close collaborators

preprint, 2022, arXiv

Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation

We present a coarse-to-fine discretisation method that enables the use of discrete reinforcement learning approaches in place of unstable and data-inefficient actor-critic methods in continuous robotics domains. This approach builds on the recently released ARM algorithm, which replaces the continuous next-best pose agent with a discrete one, with coarse-to-fine Q-attention. Given a voxelised scene, coarse-to-fine Q-attention learns what part of the scene to 'zoom' into. When this 'zooming' behaviour is applied iteratively, it results in a near-lossless discretisation of the translation space, and allows the use of a discrete action, deep Q-learning method. We show that our new coarse-to-fine algorithm achieves state-of-the-art performance on several difficult sparsely rewarded RLBench vision-based robotics tasks, and can train real-world policies, tabula rasa, in a matter of minutes, with as little as 3 demonstrations.
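The iterative 'zooming' described in this abstract can be illustrated with a small sketch. This is not the authors' implementation: the paper learns Q-values from voxelised observations, whereas here `toy_q` is a hypothetical hand-written score, and the function name, `voxels_per_axis`, `depth` and the target point are all assumptions chosen for illustration.

```python
from itertools import product


def coarse_to_fine(q_value, centre, side, voxels_per_axis=4, depth=3):
    """Repeatedly zoom into the highest-scoring voxel of a cube.

    q_value is a hypothetical stand-in for a learned Q-function: it maps
    a voxel centre (x, y, z) to a score. Returns the centre and side
    length of the voxel selected at the finest level.
    """
    for _ in range(depth):
        voxel = side / voxels_per_axis
        # Lowest corner of the cube currently being discretised.
        origin = [c - side / 2 for c in centre]
        best_score, best_centre = None, None
        for idx in product(range(voxels_per_axis), repeat=3):
            candidate = tuple(o + (i + 0.5) * voxel for o, i in zip(origin, idx))
            score = q_value(candidate)
            if best_score is None or score > best_score:
                best_score, best_centre = score, candidate
        # Zoom: the chosen voxel becomes the cube for the next level.
        centre, side = best_centre, voxel
    return centre, side


# Toy score (assumption): prefer voxels close to a made-up target point.
target = (0.31, -0.12, 0.47)


def toy_q(point):
    return -sum((a - b) ** 2 for a, b in zip(point, target))


centre, side = coarse_to_fine(toy_q, centre=(0.0, 0.0, 0.0), side=2.0)
```

With these illustrative settings, three levels of 4 x 4 x 4 voxels score 3 x 64 = 192 candidates, while a single flat grid at the same final resolution would need 64^3 = 262144 voxels.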

preprint, 2022, arXiv

ReorientBot: Learning Object Reorientation for Specific-Posed Placement

Robots need the capability of placing objects in arbitrary, specific poses to rearrange the world and achieve various valuable tasks. Object reorientation plays a crucial role in this as objects may not initially be oriented such that the robot can grasp and then immediately place them in a specific goal pose. In this work, we present a vision-based manipulation system, ReorientBot, which consists of 1) visual scene understanding with pose estimation and volumetric reconstruction using an onboard RGB-D camera; 2) learned waypoint selection for successful and efficient motion generation for reorientation; 3) traditional motion planning to generate a collision-free trajectory from the selected waypoints. We evaluate our method using the YCB objects in both simulation and the real world, achieving 93% overall success, 81% improvement in success rate, and 22% improvement in execution time compared to a heuristic approach. We demonstrate extended multi-object rearrangement showing the general capability of the system.

preprint, 2022, arXiv

SafePicking: Learning Safe Object Extraction via Object-Level Mapping

Robots need object-level scene understanding to manipulate objects while reasoning about contact, support, and occlusion among objects. Given a pile of objects, object recognition and reconstruction can identify the boundary of object instances, giving important cues as to how the objects form and support the pile. In this work, we present a system, SafePicking, that integrates object-level mapping and learning-based motion planning to generate a motion that safely extracts occluded target objects from a pile. Planning is done by learning a deep Q-network that receives observations of predicted poses and a depth-based heightmap to output a motion trajectory, trained to maximize a safety metric reward. Our results show that the observation fusion of poses and depth-sensing gives both better performance and robustness to the model. We evaluate our methods using the YCB objects in both simulation and the real world, achieving safe object extraction from piles.

preprint, 2020, arXiv

3D Object Segmentation for Shelf Bin Picking by Humanoid with Deep Learning and Occupancy Voxel Grid Map

Picking objects in a narrow space such as shelf bins is an important task for a humanoid that must extract a target object from its environment. In those situations, however, there are many occlusions between the camera and the objects, which makes it difficult to segment the target object in three dimensions because of the lack of three-dimensional sensor inputs. We address this problem by accumulating segmentation results from multiple camera angles and generating a voxel model of the target object. Our approach consists of two components: the first is object probability prediction for the input image with convolutional networks, and the second is the generation of a voxel grid map designed for object segmentation. We evaluated the method with a picking task experiment for target objects in narrow shelf bins. Our method generates dense 3D object segments even with occlusions, and the real robot successfully picked target objects from the narrow space.
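One common way to accumulate per-view object probabilities into a voxel grid is a log-odds update, sketched below. The abstract does not state which fusion rule the paper uses, so the log-odds rule, the class name `VoxelProbabilityMap`, the voxel size, the clamping bounds and the sample observations are all assumptions made for illustration.

```python
import math


def logit(p):
    """Convert a probability to log-odds."""
    return math.log(p / (1.0 - p))


class VoxelProbabilityMap:
    """Hypothetical sparse voxel grid fusing per-view object probabilities."""

    def __init__(self, voxel_size=0.01):
        self.voxel_size = voxel_size
        self.log_odds = {}  # voxel index -> accumulated log-odds

    def key(self, point):
        # Index of the voxel that contains a 3D point (metres).
        return tuple(math.floor(c / self.voxel_size) for c in point)

    def update(self, point, prob):
        # Clamp so a single view can never make a voxel fully certain.
        prob = min(max(prob, 0.01), 0.99)
        k = self.key(point)
        self.log_odds[k] = self.log_odds.get(k, 0.0) + logit(prob)

    def probability(self, point):
        # Unobserved voxels default to log-odds 0, i.e. probability 0.5.
        value = self.log_odds.get(self.key(point), 0.0)
        return 1.0 / (1.0 + math.exp(-value))

    def segment(self, threshold=0.5):
        # Voxels whose fused probability exceeds the threshold.
        return {k for k, v in self.log_odds.items() if v > logit(threshold)}


grid = VoxelProbabilityMap(voxel_size=0.01)
on_object = (0.105, 0.205, 0.305)
off_object = (0.505, 0.205, 0.305)

# Three camera views give made-up target probabilities for the same point.
for p in (0.7, 0.8, 0.6):
    grid.update(on_object, p)
# Two views consider the second point unlikely to be the target.
for p in (0.3, 0.4):
    grid.update(off_object, p)

fused = grid.probability(on_object)
segment = grid.segment()
```

Under this rule, agreeing views reinforce each other: the three observations of the first point multiply to odds of 14, a fused probability of 14/15, while the second point falls below the threshold and is excluded from the segment.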

preprint, 2020, arXiv

Finite dimensional simple modules of $(q, \mathbf{Q})$-current algebras

The $(q, \mathbf{Q})$-current algebra associated with the general linear Lie algebra was introduced by the second author in the study of representation theory of cyclotomic $q$-Schur algebras. In this paper, we study the $(q, \mathbf{Q})$-current algebra $U_q(\mathfrak{sl}_n^{\langle \mathbf{Q} \rangle}[x])$ associated with the special linear Lie algebra $\mathfrak{sl}_n$. In particular, we classify finite dimensional simple $U_q(\mathfrak{sl}_n^{\langle \mathbf{Q} \rangle}[x])$-modules.

preprint, 2020, arXiv

Instance Segmentation of Visible and Occluded Regions for Finding and Picking Target from a Pile of Objects

We present a robotic system for picking a target from a pile of objects that is capable of finding and grasping the target object by removing obstacles in the appropriate order. The fundamental idea is to segment instances with both visible and occluded masks, which we call 'instance occlusion segmentation'. To achieve this, we extend an existing instance segmentation model with a novel 'relook' architecture, in which the model explicitly learns the inter-instance relationship. Also, by using image synthesis, we make the system capable of handling new objects without human annotations. The experimental results show the effectiveness of the relook architecture when compared with a conventional model and of the image synthesis when compared to a human-annotated dataset. We also demonstrate the capability of our system to achieve picking a target in a cluttered environment with a real robot.

preprint, 2020, arXiv

Joint Learning of Instance and Semantic Segmentation for Robotic Pick-and-Place with Heavy Occlusions in Clutter

We present joint learning of instance and semantic segmentation for visible and occluded region masks. Sharing the feature extractor with instance occlusion segmentation, we introduce semantic occlusion segmentation into the instance segmentation model. This joint learning fuses the instance- and image-level reasoning of the mask prediction on the different segmentation tasks, which was missing in the previous work of learning instance segmentation only (instance-only). In the experiments, we evaluated the proposed joint learning by comparing it with instance-only learning on the test dataset. We also applied the joint learning model to 2 different types of robotic pick-and-place tasks (random and target picking) and evaluated its effectiveness in achieving real-world robotic tasks.

preprint, 2020, arXiv

MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion

Robots and other smart devices need efficient object-based scene representations from their on-board vision systems to reason about contact, physics and occlusion. Recognized precise object models will play an important role alongside non-parametric reconstructions of unrecognized structures. We present a system which can estimate the accurate poses of multiple known objects in contact and occlusion from real-time, embodied multi-view vision. Our approach makes 3D object pose proposals from single RGB-D views, accumulates pose estimates and non-parametric occupancy information from multiple views as the camera moves, and performs joint optimization to estimate consistent, non-intersecting poses for multiple objects in contact. We verify the accuracy and robustness of our approach experimentally on 2 object datasets: YCB-Video, and our own challenging Cluttered YCB-Video. We demonstrate a real-time robotics application where a robot arm precisely and orderly disassembles complicated piles of objects, using only on-board RGB-D vision.

preprint, 2020, arXiv

Probabilistic 3D Multilabel Real-time Mapping for Multi-object Manipulation

Probabilistic 3D maps have been applied to object segmentation with multiple camera viewpoints; however, conventional methods lack real-time efficiency and the functionality of multilabel object mapping. In this paper, we propose a method to generate a three-dimensional map with multilabel occupancy in real time. Extending our previous work, in which only target label occupancy is mapped, we achieve multilabel object segmentation in a single looking-around action. We evaluate our method by testing segmentation accuracy with 39 different objects, and by applying it to a manipulation task of multiple objects in the experiments. Our mapping-based method outperforms the conventional projection-based method by 40 - 96\% relative (12.6 mean $IU_{3d}$), and the robot successfully recognizes (86.9\%) and manipulates multiple objects (60.7\%) in an environment with heavy occlusions.

preprint, 2015, arXiv

New realization of cyclotomic $q$-Schur algebras I

We introduce a Lie algebra $\mathfrak{g}_{\mathbf{Q}}(\mathbf{m})$ and an associative algebra $\mathcal{U}_{q,\mathbf{Q}}(\mathbf{m})$ associated with the Cartan data of $\mathfrak{gl}_m$ which is separated into $r$ parts with respect to $\mathbf{m}=(m_1, \dots, m_r)$ such that $m_1+ \dots + m_r =m$. We show that the Lie algebra $\mathfrak{g}_{\mathbf{Q}} (\mathbf{m})$ is a filtered deformation of the current Lie algebra of $\mathfrak{gl}_m$, and we can regard the algebra $\mathcal{U}_{q, \mathbf{Q}}(\mathbf{m})$ as a "$q$-analogue" of $U(\mathfrak{g}_{\mathbf{Q}}(\mathbf{m}))$. Then, we realize a cyclotomic $q$-Schur algebra as a quotient algebra of $\mathcal{U}_{q, \mathbf{Q}}(\mathbf{m})$ under a certain mild condition. We also study the representation theory for $\mathfrak{g}_{\mathbf{Q}}(\mathbf{m})$ and $\mathcal{U}_{q,\mathbf{Q}}(\mathbf{m})$, and we apply them to the representations of the cyclotomic $q$-Schur algebras.

preprint, 2014, arXiv

On Weyl Groups and Artin Groups Associated to Orbifold Projective Lines

We associate a generalized root system in the sense of Kyoji Saito to an orbifold projective line via the derived category of finite dimensional representations of a certain bound quiver algebra. We generalize results by Saito--Takebayashi and Yamada for elliptic Weyl groups and elliptic Artin groups to the Weyl groups and the fundamental groups of the regular orbit spaces associated to the generalized root systems. Moreover, we study the relation between this fundamental group and a certain subgroup of the autoequivalence group of a triangulated subcategory of the derived category of the $2$-Calabi--Yau completion of the bound quiver algebra.

preprint, 2011, arXiv

Induction and Restriction Functors for Cyclotomic q-Schur Algebras

We define the induction and restriction functors for cyclotomic q-Schur algebras, and study some properties of them. As an application, we categorify a higher level Fock space by using the module categories of cyclotomic q-Schur algebras.

preprint, 2011, arXiv

On Weyl modules of cyclotomic $q$-Schur algebras

We study Weyl modules of cyclotomic $q$-Schur algebras. In particular, we give the character formula of the Weyl modules by using the Kostka numbers and some numbers which are computed by a generalization of the Littlewood-Richardson rule. We also study the corresponding symmetric functions. Finally, we give some simple applications to modular representations of cyclotomic $q$-Schur algebras.

preprint, 2010, arXiv

Blocks of category $\mathcal{O}$ for rational Cherednik algebras and of cyclotomic Hecke algebras of type G(r,p,n)

We classify blocks of category $\mathcal{O}$ for rational Cherednik algebras and of cyclotomic Hecke algebras of type G(r,p,n) by using the "residue equivalence" for multi-partitions.

preprint, 2010, arXiv

The representation type of Ariki-Koike algebras and cyclotomic q-Schur algebras

We give a necessary and sufficient condition on parameters for Ariki-Koike algebras (resp. cyclotomic q-Schur algebras) to be of finite representation type.
