Optimization of neuroprosthetic vision via end-to-end deep reinforcement learning
Publication year
2022Author(s)
Number of pages
16 p.
Source
International Journal of Neural Systems, 32, 11, (2022), article 2250052ISSN
Publication type
Article / Letter to editor
Display more detailsDisplay less details
Organization
SW OZ DCC AI
Neurobiology
Journal title
International Journal of Neural Systems
Volume
vol. 32
Issue
iss. 11
Languages used
English (eng)
Subject
Biophysics; Cognitive artificial intelligenceAbstract
Visual neuroprostheses are a promising approach to restore basic sight in visually impaired people. A major challenge is to condense the sensory information contained in a complex environment into meaningful stimulation patterns at low spatial and temporal resolution. Previous approaches considered task-agnostic feature extractors such as edge detectors or semantic segmentation, which are likely suboptimal for specific tasks in complex dynamic environments. As an alternative approach, we propose to optimize stimulation patterns by end-to-end training of a feature extractor using deep reinforcement learning agents in virtual environments. We present a task-oriented evaluation framework to compare different stimulus generation mechanisms, such as static edge-based and adaptive end-to-end approaches like the one introduced here. Our experiments in Atari games show that stimulation patterns obtained via task-dependent end-to-end optimized reinforcement learning result in equivalent or improved performance compared to fixed feature extractors on high difficulty levels. These findings signify the relevance of adaptive reinforcement learning for neuroprosthetic vision in complex environments.
This item appears in the following Collection(s)
- Academic publications [246764]
- Electronic publications [134215]
- Faculty of Science [38035]
- Faculty of Social Sciences [30508]
- Open Access publications [107738]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.