Capsule networks as recurrent models of grouping and segmentation
Publication year
2020Number of pages
19 p.
Source
Plos Computational Biology, 16, 7, (2020), article e1008017ISSN
Publication type
Article / Letter to editor
Display more detailsDisplay less details
Organization
SW OZ DCC AI
Journal title
Plos Computational Biology
Volume
vol. 16
Issue
iss. 7
Languages used
English (eng)
Subject
Cognitive artificial intelligenceAbstract
Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations.
This item appears in the following Collection(s)
- Academic publications [246625]
- Electronic publications [134196]
- Faculty of Social Sciences [30504]
- Open Access publications [107719]
Upload full text
Use your RU credentials (u/z-number and password) to log in with SURFconext to upload a file for processing by the repository team.