Source author record

Hussein A. Abbass

Hussein A. Abbass appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence cs.CY Human-Computer Interaction Machine Learning Multiagent Systems Robotics

Catalog footprint

What is connected

4works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Continuous Deep Hierarchical Reinforcement Learning for Ground-Air Swarm Shepherding

The control and guidance of multi-robots (swarm) is a non-trivial problem due to the complexity inherent in the coupled interaction among the group. Whether the swarm is cooperative or non-cooperative, lessons can be learnt from sheepdogs herding sheep. Biomimicry of shepherding offers computational methods for swarm control with the potential to generalize and scale in different environments. However, learning to shepherd is complex due to the large search space that a machine learner is faced with. We present a deep hierarchical reinforcement learning approach for shepherding, whereby an unmanned aerial vehicle (UAV) learns to act as an aerial sheepdog to control and guide a swarm of unmanned ground vehicles (UGVs). The approach extends our previous work on machine education to decompose the search space into a hierarchically organized curriculum. Each lesson in the curriculum is learnt by a deep reinforcement learning model. The hierarchy is formed by fusing the outputs of the model. The approach is demonstrated first in a high-fidelity robotic-operating-system (ROS)-based simulation environment, then with physical UGVs and a UAV in an in-door testing facility. We investigate the ability of the method to generalize as the models move from simulation to the real-world and as the models move from one scale to another.

preprint2020arXiv

Machine Education: Designing semantically ordered and ontologically guided modular neural networks

The literature on machine teaching, machine education, and curriculum design for machines is in its infancy with sparse papers on the topic primarily focusing on data and model engineering factors to improve machine learning. In this paper, we first discuss selected attempts to date on machine teaching and education. We then bring theories and methodologies together from human education to structure and mathematically define the core problems in lesson design for machine education and the modelling approaches required to support the steps for machine education. Last, but not least, we offer an ontology-based methodology to guide the development of lesson plans to produce transparent and explainable modular learning machines, including neural networks.

preprint2020arXiv

Q-Learning with Differential Entropy of Q-Tables

It is well-known that information loss can occur in the classic and simple Q-learning algorithm. Entropy-based policy search methods were introduced to replace Q-learning and to design algorithms that are more robust against information loss. We conjecture that the reduction in performance during prolonged training sessions of Q-learning is caused by a loss of information, which is non-transparent when only examining the cumulative reward without changing the Q-learning algorithm itself. We introduce Differential Entropy of Q-tables (DE-QT) as an external information loss detector to the Q-learning algorithm. The behaviour of DE-QT over training episodes is analyzed to find an appropriate stopping criterion during training. The results reveal that DE-QT can detect the most appropriate stopping point, where a balance between a high success rate and a high efficiency is met for classic Q-Learning algorithm.

preprint2014arXiv

Visualizing Cognitive Moves for Assessing Information Perception Biases in Decision Making

In decision making a key source of uncertainty is people's perception of information which is influenced by their attitudes toward risk. Both, perception of information and risk attitude, affect the interpretation of information and hence the choice of suitable courses of action in a variety of contexts ranging from project planning to military operations. Visualization associated with the dynamics of cognitive states of people processing information and making decision is therefore not only important for analysis but has also significant practical applications, in particular in the military command and control domain. In this paper, we focus on a major concept that affect human cognition in this context: reliability of information. We introduce Cognitive Move Diagrams (CMD)---a simple visualization tool---to represent and evaluate the impact of this concept on decision making. We demonstrate through both a hypothetical example and a subject matter expert based experiment that CMD are effective in visualizing, detecting and qualifying human biases.

Hussein A. Abbass

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

Continuous Deep Hierarchical Reinforcement Learning for Ground-Air Swarm Shepherding

Machine Education: Designing semantically ordered and ontologically guided modular neural networks

Q-Learning with Differential Entropy of Q-Tables

Visualizing Cognitive Moves for Assessing Information Perception Biases in Decision Making