The developmental trajectory of object recognition robustness: Children are like small adults but unlike big deep neural networks.

Huber, Lukas S; Geirhos, Robert; Wichmann, Felix A (2023). The developmental trajectory of object recognition robustness: Children are like small adults but unlike big deep neural networks. Journal of vision, 23(7), pp. 1-30. ARVO 10.1167/jov.23.7.4

[img]
Preview
Text
i1534-7362-23-7-4_1688636908.85308.pdf - Published Version
Available under License Creative Commons: Attribution (CC-BY).

Download (3MB) | Preview

In laboratory object recognition tasks based on undistorted photographs, both adult humans and deep neural networks (DNNs) perform close to ceiling. Unlike adults', whose object recognition performance is robust against a wide range of image distortions, DNNs trained on standard ImageNet (1.3M images) perform poorly on distorted images. However, the last 2 years have seen impressive gains in DNN distortion robustness, predominantly achieved through ever-increasing large-scale datasets-orders of magnitude larger than ImageNet. Although this simple brute-force approach is very effective in achieving human-level robustness in DNNs, it raises the question of whether human robustness, too, is simply due to extensive experience with (distorted) visual input during childhood and beyond. Here we investigate this question by comparing the core object recognition performance of 146 children (aged 4-15 years) against adults and against DNNs. We find, first, that already 4- to 6-year-olds show remarkable robustness to image distortions and outperform DNNs trained on ImageNet. Second, we estimated the number of images children had been exposed to during their lifetime. Compared with various DNNs, children's high robustness requires relatively little data. Third, when recognizing objects, children-like adults but unlike DNNs-rely heavily on shape but not on texture cues. Together our results suggest that the remarkable robustness to distortions emerges early in the developmental trajectory of human object recognition and is unlikely the result of a mere accumulation of experience with distorted visual input. Even though current DNNs match human performance regarding robustness, they seem to rely on different and more data-hungry strategies to do so.

Item Type:

Journal Article (Original Article)

Division/Institute:

07 Faculty of Human Sciences > Institute of Psychology
07 Faculty of Human Sciences > Institute of Psychology > Cognitive Psychology, Perception and Methodology

UniBE Contributor:

Huber, Lukas Sebastian

Subjects:

000 Computer science, knowledge & systems
100 Philosophy > 150 Psychology
600 Technology

ISSN:

1534-7362

Publisher:

ARVO

Language:

English

Submitter:

Pubmed Import

Date Deposited:

10 Jul 2023 13:15

Last Modified:

31 Aug 2023 13:35

Publisher DOI:

10.1167/jov.23.7.4

PubMed ID:

37410494

BORIS DOI:

10.48350/184557

URI:

https://boris.unibe.ch/id/eprint/184557

Actions (login required)

Edit item Edit item
Provide Feedback