Modeling naturalistic face processing in humans with deep convolutional neural networks

Author:

Jiahui Guo1ORCID,Feilong Ma1ORCID,Visconti di Oleggio Castello Matteo2ORCID,Nastase Samuel A.3ORCID,Haxby James V.1ORCID,Gobbini M. Ida45ORCID

Affiliation:

1. Center for Cognitive Neuroscience, Dartmouth College, Hanover, NH 03755

2. Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94720

3. Princeton Neuroscience Institute, Princeton University, Princeton, NJ 08544

4. Department of Medical and Surgical Sciences, University of Bologna, Bologna 40138, Italy

5. Istituti di Ricovero e Cura a Carattere Scientifico, Istituto delle Scienze Neurologiche di Bologna, Bologna 40139, Italia

Abstract

Deep convolutional neural networks (DCNNs) trained for face identification can rival and even exceed human-level performance. The ways in which the internal face representations in DCNNs relate to human cognitive representations and brain activity are not well understood. Nearly all previous studies focused on static face image processing with rapid display times and ignored the processing of naturalistic, dynamic information. To address this gap, we developed the largest naturalistic dynamic face stimulus set in human neuroimaging research (700+ naturalistic video clips of unfamiliar faces). We used this naturalistic dataset to compare representational geometries estimated from DCNNs, behavioral responses, and brain responses. We found that DCNN representational geometries were consistent across architectures, cognitive representational geometries were consistent across raters in a behavioral arrangement task, and neural representational geometries in face areas were consistent across brains. Representational geometries in late, fully connected DCNN layers, which are optimized for individuation, were much more weakly correlated with cognitive and neural geometries than were geometries in late-intermediate layers. The late-intermediate face-DCNN layers successfully matched cognitive representational geometries, as measured with a behavioral arrangement task that primarily reflected categorical attributes, and correlated with neural representational geometries in known face-selective topographies. Our study suggests that current DCNNs successfully capture neural cognitive processes for categorical attributes of faces but less accurately capture individuation and dynamic features.

Funder

National Science Foundation

HHS | NIH | National Institute of Mental Health

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference81 articles.

1. O. M. Parkhi, A. Vedaldi, A. Zisserman, Deep face recognition (British Machine Vision Association, 2015), pp. 41.1–41.12.

2. Face recognition accuracy of forensic examiners, superrecognizers, and face recognition algorithms

3. Y. Taigman, M. Yang, M. Ranzato, L. Wolf, “DeepFace: Closing the gap to human-level performance” in IEEE Conference on Computer Vision and Pattern Recognition (Columbus, OH, USA, 2014), pp. 1701–1708.

4. Face Space Representations in Deep Convolutional Neural Networks

5. Convolutional neural networks explain tuning properties of anterior, but not middle, face-processing areas in macaque inferotemporal cortex

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3