Research Library

open-access-imgOpen AccessDistinguishing mirror from glass: A “big data” approach to material perception
Author(s)
Hideki Tamura,
Konrad E Prokott,
Roland W. Fleming
Publication year2022
Publication title
journal of vision
Resource typeJournals
PublisherAssociation for Research in Vision and Ophthalmology
Visually identifying materials is crucial for many tasks, yet materialperception remains poorly understood. Distinguishing mirror from glass isparticularly challenging as both materials derive their appearance from theirsurroundings, yet we rarely experience difficulties telling them apart. Here wetook a 'big data' approach to uncovering the underlying visual cues andprocesses, leveraging recent advances in neural network models of vision. Wetrained thousands of convolutional neural networks on >750,000 simulated mirrorand glass objects, and compared their performance with human judgments, as wellas alternative classifiers based on 'hand-engineered' image features. Forrandomly chosen images, all classifiers and humans performed with highaccuracy, and therefore correlated highly with one another. To tease the modelsapart, we then painstakingly assembled a diagnostic image set for which humansmake highly systematic errors, allowing us to decouple accuracy from human-likeperformance. A large-scale, systematic search through feedforward neuralarchitectures revealed that relatively shallow networks predicted humanjudgments better than any other models. However, surprisingly, no networkcorrelated better than 0.6 with humans (below inter-human correlations). Thus,although the model sets new standards for simulating human vision in achallenging material perception task, the results cast doubt on recent claimsthat such architectures are generally good models of human vision.
Subject(s)artificial intelligence , artificial neural network , computer science , computer vision , convolutional neural network , deep learning , economics , human visual system model , image (mathematics) , inference , machine learning , management , neuroscience , pattern recognition (psychology) , perception , programming language , psychology , set (abstract data type) , task (project management) , visual perception , visual processing
Language(s)English
SCImago Journal Rank1.126
H-Index113
ISSN1534-7362
DOI10.1167/jov.22.4.4

Seeing content that should not be on Zendy? Contact us.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here