Discovering Agents

Causal models of agents have been used to analyse the safety aspects of machine learning systems. But identifying agents is non-trivial -- often the causal model is just assumed by the modeler without much justification -- and modelling failures can lead to mistakes in the safety analysis. This paper proposes the first formal causal definition of agents -- roughly that agents are systems that would adapt their policy if their actions influenced the world in a different way. From this we derive the first causal discovery algorithm for discovering agents from empirical data, and give algorithms for translating between causal models and game-theoretic influence diagrams. We demonstrate our approach by resolving some previous confusions caused by incorrect causal modelling of agents.
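The rough criterion in the abstract (an agent would adapt its policy if its actions influenced the world differently) can be illustrated with a toy check. The sketch below is not the paper's discovery algorithm. It only compares a system's chosen action before and after a single change to the action-to-outcome mechanism. All names (`fixed_system`, `optimising_system`, `adapts_to_mechanism`) and the two-action setup are hypothetical, introduced purely for illustration.

```python
from typing import Callable, Dict

Action = int
# Hypothetical stand-in for "how actions influence the world":
# maps each available action to the outcome value it causes.
Mechanism = Dict[Action, float]
System = Callable[[Mechanism], Action]


def fixed_system(mechanism: Mechanism) -> Action:
    """Ignores the mechanism entirely and always picks action 0."""
    return 0


def optimising_system(mechanism: Mechanism) -> Action:
    """Picks the action whose outcome is largest under the given mechanism."""
    return max(mechanism, key=mechanism.get)


def adapts_to_mechanism(system: System,
                        baseline: Mechanism,
                        intervened: Mechanism) -> bool:
    """Return True if the chosen action changes when the mechanism is changed."""
    return system(baseline) != system(intervened)


# Baseline world: action 0 leads to the better outcome.
baseline = {0: 1.0, 1: 0.0}
# Intervened world: the effects of the two actions are swapped.
intervened = {0: 0.0, 1: 1.0}

print(adapts_to_mechanism(fixed_system, baseline, intervened))
print(adapts_to_mechanism(optimising_system, baseline, intervened))
```

Under these assumptions, only the optimising system changes its action when the mechanism is swapped, which is the behaviour the rough definition associates with agents. A single intervention like this is a simplification: a system that happens not to change its action under one particular intervention could still adapt under others.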

Authors: Zachary Kenton, Ramana Kumar, Sebastian Farquhar, Jonathan Richens, Matt MacDermott, Tom Everitt
Topics: Machine Learning, Artificial Intelligence

preprint / 2022
