On the utility of dreaming: A general model for how learning in artificial agents can benefit from data hallucination
Source
Adaptive Behavior, 29, 3, (2021), pp. 267-280
Publication type
Article / Letter to editor
Organization
SW OZ DCC AI
Journal title
Adaptive Behavior
Volume
vol. 29
Issue
iss. 3
Languages used
English (eng)
Page start
p. 267
Page end
p. 280
Subject
Cognitive artificial intelligence
Abstract
We consider the benefits of dream mechanisms - that is, the ability to simulate new experiences based on past ones - in a machine learning context. Specifically, we are interested in learning for artificial agents that act in the world, and operationalize "dreaming" as a mechanism by which such an agent can use its own model of the learning environment to generate new hypotheses and training data. We first show that it is not necessarily a given that such a data-hallucination process is useful, since it can easily lead to a training set dominated by spurious imagined data until an ill-defined convergence point is reached. We then analyse a notably successful implementation of a machine learning-based dreaming mechanism by Ha and Schmidhuber (Ha, D., & Schmidhuber, J. (2018). World models. arXiv e-prints, arXiv:1803.10122). On that basis, we then develop a general framework by which an agent can generate simulated data to learn from in a manner that is beneficial to the agent. This, we argue, then forms a general method for an operationalized dream-like mechanism. We finish by demonstrating the general conditions under which such mechanisms can be useful in machine learning, wherein the implicit simulator inference and extrapolation involved in dreaming act without reinforcing inference error even when inference is incomplete.
This item appears in the following Collection(s)
- Academic publications [242560]
- Electronic publications [129511]
- Faculty of Social Sciences [29963]
- Open Access publications [104127]