Source author record

Richard P. Mann

Richard P. Mann appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Quantitative Methods Artificial Intelligence Machine Learning Multiagent Systems nlin.AO physics.soc-ph

Catalog footprint

What is connected

3works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability

Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and decision-making either lacks a systematic way to describe this source of uncertainty or puts the focus on obtaining optimal policies using complex models of the world that would impose an unrealistically high cognitive demand on real agents. In this work we aim to efficiently describe the emergent behavior of biologically plausible and parsimonious learning agents faced with partially observable worlds. Therefore we derive and present deterministic reinforcement learning dynamics where the agents observe the true state of the environment only partially. We showcase the broad applicability of our dynamics across different classes of partially observable agent-environment systems. We find that partial observability creates unintuitive benefits in a number of specific contexts, pointing the way to further research on a general understanding of such effects. For instance, partially observant agents can learn better outcomes faster, in a more stable way and even overcome social dilemmas. Furthermore, our method allows the application of dynamical systems theory to partially observable multiagent leaning. In this regard we find the emergence of catastrophic limit cycles, a critical slowing down of the learning processes between reward regimes and the separation of the learning dynamics into fast and slow directions, all caused by partial observability. Therefore, the presented dynamics have the potential to become a formal, yet practical, lightweight and robust tool for researchers in biology, social science and machine learning to systematically investigate the effects of interacting partially observant agents.

preprint2016arXiv

Towards a fully predictive model of flight paths in pigeons navigating in the familiar area: prediction across differing individuals

This paper will detail the basis of our previously developed predictive model for pigeon flight paths based on observations of the specific individual being predicted. We will then describe how this model can be adapted to predict the flight of a new, unobserved bird, based on observations of other individuals from the same release site. We will test the accuracy of these predictions relative to naive models with no previous flight information and those trained on the focal bird's own previous flights, and discuss the implications of these results for the nature of navigational cue use in the familiar area. Finally we will discuss how visual cues may be explicitly encoded in the model in future work.

preprint2015arXiv

Escape path complexity and its context dependency in Pacific blue-eyes (Pseudomugil signifer)

The escape trajectories animals take following a predatory attack appear to show high degrees of apparent 'randomness' - a property that has been described as 'protean behaviour'. Here we present a method of quantifying the escape trajectories of individual animals using a path complexity approach. When fish (Pseudomugil signifer) were attacked either on their own or in groups, we find that an individual's path rapidly increases in entropy (our measure of complexity) following the attack. For individuals on their own, this entropy remains elevated (indicating a more random path) for a sustained period (10 seconds) after the attack, whilst it falls more quickly for individuals in groups. The entropy of the path is context dependent. When attacks towards single fish come from greater distances, a fish's path shows less complexity compared to attacks that come from short range. This context dependency effect did not exist, however, when individuals were in groups. Nor did the path complexity of individuals in groups depend on a fish's local density of neighbours. We separate out the components of speed and direction changes to determine which of these components contributes to the overall increase in path complexity following an attack. We found that both speed and direction measures contribute similarly to an individual's path's complexity in absolute terms. Our work highlights the adaptive behavioural tactics that animals use to avoid predators and also provides a novel method for quantifying the escape trajectories of animals.