
      Generic decoding of seen and imagined objects using hierarchical visual features

      research-article
      Tomoyasu Horikawa 1, a and Yukiyasu Kamitani 1, 2
      Nature Communications
      Nature Publishing Group


          Abstract

          Object recognition is a key function in both human and machine vision. While brain decoding of seen and imagined objects has been achieved, the prediction is limited to training examples. We present a decoding approach for arbitrary objects using the machine vision principle that an object category is represented by a set of features rendered invariant through hierarchical processing. We show that visual features, including those derived from a deep convolutional neural network, can be predicted from fMRI patterns, and that greater accuracy is achieved for low-/high-level features with lower-/higher-level visual areas, respectively. Predicted features are used to identify seen/imagined object categories (extending beyond decoder training) from a set of computed features for numerous object images. Furthermore, decoding of imagined objects reveals progressive recruitment of higher-to-lower visual representations. Our results demonstrate a homology between human and machine vision and its utility for brain-based information retrieval.

          Editor's summary

          Machine learning algorithms can decode objects that people see or imagine from their brain activity. Here the authors present a predictive decoder combined with deep neural network representations that generalizes beyond the training set and correctly identifies novel objects that it has never been trained on.
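          As a rough illustration of the two-stage approach summarized above, the sketch below uses synthetic data and hypothetical variable names: a regularized linear regression (scikit-learn's Ridge, standing in for whatever feature decoder one might train) predicts CNN feature values from fMRI voxel patterns, and the predicted feature vector is then matched by correlation against precomputed category-average features, including categories never used for decoder training. This is an assumption-laden sketch, not the authors' implementation.

```python
# Minimal sketch of the "generic decoding" idea described in the abstract:
# (1) predict hierarchical visual features (e.g., CNN unit activations) from
# fMRI voxel patterns, (2) identify the object category by comparing the
# predicted feature vector against precomputed category-average features.
# All names, shapes, and data are illustrative, not the authors' actual code.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)

# --- Hypothetical training data ---
n_train, n_voxels, n_features = 1000, 500, 100
X_train = rng.standard_normal((n_train, n_voxels))    # fMRI patterns
Y_train = rng.standard_normal((n_train, n_features))  # CNN features of the seen images

# Stage 1: regularized linear regression from voxels to features
# (Ridge handles the multi-output case directly).
decoder = Ridge(alpha=1.0)
decoder.fit(X_train, Y_train)

# --- Category identification on a new trial ---
x_test = rng.standard_normal((1, n_voxels))           # pattern for a new (possibly untrained) object
pred_features = decoder.predict(x_test)[0]

# Precomputed category-average feature vectors for a large candidate set,
# including categories absent from decoder training.
n_categories = 10000
category_features = rng.standard_normal((n_categories, n_features))

# Stage 2: pick the category whose features correlate best with the prediction.
def corr_to_each_row(vec, mat):
    vec = (vec - vec.mean()) / vec.std()
    mat = (mat - mat.mean(axis=1, keepdims=True)) / mat.std(axis=1, keepdims=True)
    return mat @ vec / len(vec)

scores = corr_to_each_row(pred_features, category_features)
print("identified category index:", int(np.argmax(scores)))
```

          Because identification only requires computing features for candidate images, the candidate set can be far larger than, and disjoint from, the decoder's training categories, which is what allows generalization to novel objects.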

          Most cited references (48)


          ImageNet: A large-scale hierarchical image database


            Distributed and overlapping representations of faces and objects in ventral temporal cortex.

            The functional architecture of the object vision pathway in the human brain was investigated using functional magnetic resonance imaging to measure patterns of response in ventral temporal cortex while subjects viewed faces, cats, five categories of man-made objects, and nonsense pictures. A distinct pattern of response was found for each stimulus category. The distinctiveness of the response to a given category was not due simply to the regions that responded maximally to that category, because the category being viewed also could be identified on the basis of the pattern of response when those regions were excluded from the analysis. Patterns of response that discriminated among all categories were found even within cortical regions that responded maximally to only one category. These results indicate that the representations of faces and objects in ventral temporal cortex are widely distributed and overlapping.
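            The pattern-based identification described in this reference is commonly implemented as a split-half correlation analysis: a category is counted as identified when its response pattern in one half of the data matches its own pattern in the other half better than any other category's pattern. The sketch below uses invented data and shapes purely to illustrate that logic; it is not the original analysis code.

```python
# Rough sketch of correlation-based pattern classification in the spirit of the
# analysis described above. Data and shapes are synthetic, for illustration only.
import numpy as np

rng = np.random.default_rng(1)
n_categories, n_voxels = 8, 300

# Mean response patterns per category, estimated from two independent halves of the data.
half1 = rng.standard_normal((n_categories, n_voxels))
half2 = half1 + 0.5 * rng.standard_normal((n_categories, n_voxels))  # noisy repeat

# Correlation matrix between halves (rows: half1 categories, cols: half2 categories).
z1 = (half1 - half1.mean(axis=1, keepdims=True)) / half1.std(axis=1, keepdims=True)
z2 = (half2 - half2.mean(axis=1, keepdims=True)) / half2.std(axis=1, keepdims=True)
corr = z1 @ z2.T / n_voxels

# A category is correctly identified when the diagonal entry is the row maximum.
accuracy = np.mean(np.argmax(corr, axis=1) == np.arange(n_categories))
print(f"identification accuracy: {accuracy:.2f}")
```

            Excluding a cortical region from the voxel set, as the study describes, simply means dropping those columns before computing the correlations; above-chance accuracy after exclusion is what indicates that the representation is distributed.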

              The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception

              Using functional magnetic resonance imaging (fMRI), we found an area in the fusiform gyrus in 12 of the 15 subjects tested that was significantly more active when the subjects viewed faces than when they viewed assorted common objects. This face activation was used to define a specific region of interest individually for each subject, within which several new tests of face specificity were run. In each of five subjects tested, the predefined candidate “face area” also responded significantly more strongly to passive viewing of (1) intact than scrambled two-tone faces, (2) full front-view face photos than front-view photos of houses, and (in a different set of five subjects) (3) three-quarter-view face photos (with hair concealed) than photos of human hands; it also responded more strongly during (4) a consecutive matching task performed on three-quarter-view faces versus hands. Our technique of running multiple tests applied to the same region defined functionally within individual subjects provides a solution to two common problems in functional imaging: (1) the requirement to correct for multiple statistical comparisons and (2) the inevitable ambiguity in the interpretation of any study in which only two or three conditions are compared. Our data allow us to reject alternative accounts of the function of the fusiform face area (area “FF”) that appeal to visual attention, subordinate-level classification, or general processing of any animate or human forms, demonstrating that this region is selectively involved in the perception of faces.
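              The region-of-interest logic described above (define a candidate "face area" from an independent localizer contrast, then test new conditions only within it) can be illustrated with the following sketch. The threshold, shapes, and data are all assumptions chosen for illustration; this is not the authors' analysis code.

```python
# Illustrative sketch of functional ROI definition and independent testing:
# voxels responding more to faces than objects in a localizer define the ROI,
# and responses to new conditions are then measured only inside it.
# All data are synthetic.
import numpy as np

rng = np.random.default_rng(2)
n_voxels = 1000

# Localizer contrast (e.g., t-values for faces > objects), invented here.
t_faces_vs_objects = rng.standard_normal(n_voxels) + 0.2
roi_mask = t_faces_vs_objects > 2.0          # threshold defines the candidate ROI

# Independent test data: mean response per voxel for two new conditions.
resp_intact_faces = rng.standard_normal(n_voxels) + roi_mask * 0.8
resp_scrambled_faces = rng.standard_normal(n_voxels)

# Compare mean ROI responses between conditions (a paired statistical test
# across subjects or runs would follow in practice).
print("intact faces (ROI mean):   ", resp_intact_faces[roi_mask].mean())
print("scrambled faces (ROI mean):", resp_scrambled_faces[roi_mask].mean())
```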

                Author and article information

                Journal: Nature Communications (Nat Commun)
                Publisher: Nature Publishing Group
                ISSN: 2041-1723
                Publication date: 22 May 2017
                Volume: 8
                Article number: 15037
                Affiliations
                [1] ATR Computational Neuroscience Laboratories, 2-2-2 Hikaridai, Seika, Soraku, Kyoto 619-0288, Japan
                [2] Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto 606-8501, Japan
                Author information
                ORCID: http://orcid.org/0000-0002-9300-8268
                Article
                Article ID: ncomms15037
                DOI: 10.1038/ncomms15037
                PMC: PMC5458127
                PMID: 28530228
                Copyright © 2017, The Author(s)

                This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

                History
                Received: 15 October 2015
                Accepted: 21 February 2017
                Categories
                Article

