Researcher profile

Richard P. Mann

Richard P. Mann contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability

Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and decision-making either lacks a systematic way to describe this source of uncertainty or puts the focus on obtaining optimal policies using complex models of the world that would impose an unrealistically high cognitive demand on real agents. In this work we aim to efficiently describe the emergent behavior of biologically plausible and parsimonious learning agents faced with partially observable worlds. Therefore we derive and present deterministic reinforcement learning dynamics where the agents observe the true state of the environment only partially. We showcase the broad applicability of our dynamics across different classes of partially observable agent-environment systems. We find that partial observability creates unintuitive benefits in a number of specific contexts, pointing the way to further research on a general understanding of such effects. For instance, partially observant agents can learn better outcomes faster, in a more stable way and even overcome social dilemmas. Furthermore, our method allows the application of dynamical systems theory to partially observable multiagent leaning. In this regard we find the emergence of catastrophic limit cycles, a critical slowing down of the learning processes between reward regimes and the separation of the learning dynamics into fast and slow directions, all caused by partial observability. Therefore, the presented dynamics have the potential to become a formal, yet practical, lightweight and robust tool for researchers in biology, social science and machine learning to systematically investigate the effects of interacting partially observant agents.

preprint2015arXiv

Escape path complexity and its context dependency in Pacific blue-eyes (Pseudomugil signifer)

The escape trajectories animals take following a predatory attack appear to show high degrees of apparent 'randomness' - a property that has been described as 'protean behaviour'. Here we present a method of quantifying the escape trajectories of individual animals using a path complexity approach. When fish (Pseudomugil signifer) were attacked either on their own or in groups, we find that an individual's path rapidly increases in entropy (our measure of complexity) following the attack. For individuals on their own, this entropy remains elevated (indicating a more random path) for a sustained period (10 seconds) after the attack, whilst it falls more quickly for individuals in groups. The entropy of the path is context dependent. When attacks towards single fish come from greater distances, a fish's path shows less complexity compared to attacks that come from short range. This context dependency effect did not exist, however, when individuals were in groups. Nor did the path complexity of individuals in groups depend on a fish's local density of neighbours. We separate out the components of speed and direction changes to determine which of these components contributes to the overall increase in path complexity following an attack. We found that both speed and direction measures contribute similarly to an individual's path's complexity in absolute terms. Our work highlights the adaptive behavioural tactics that animals use to avoid predators and also provides a novel method for quantifying the escape trajectories of animals.