65
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Why vision is not both hierarchical and feedforward

      review-article
      ,
      Frontiers in Computational Neuroscience
      Frontiers Media S.A.
      feedback, object recognition, crowding, Verniers, Gestalt

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In classical models of object recognition, first, basic features (e.g., edges and lines) are analyzed by independent filters that mimic the receptive field profiles of V1 neurons. In a feedforward fashion, the outputs of these filters are fed to filters at the next processing stage, pooling information across several filters from the previous level, and so forth at subsequent processing stages. Low-level processing determines high-level processing. Information lost on lower stages is irretrievably lost. Models of this type have proven to be very successful in many fields of vision, but have failed to explain object recognition in general. Here, we present experiments that, first, show that, similar to demonstrations from the Gestaltists, figural aspects determine low-level processing (as much as the other way around). Second, performance on a single element depends on all the other elements in the visual scene. Small changes in the overall configuration can lead to large changes in performance. Third, grouping of elements is key. Only if we know how elements group across the entire visual field, can we determine performance on individual elements, i.e., challenging the classical stereotypical filtering approach, which is at the very heart of most vision models.

          Related collections

          Most cited references46

          • Record: found
          • Abstract: not found
          • Article: not found

          Normalized cuts and image segmentation

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            One-shot learning of object categories.

            Learning visual models of object categories notoriously requires hundreds or thousands of training examples. We show that it is possible to learn much information about a category from just one, or a handful, of images. The key insight is that, rather than learning from scratch, one can take advantage of knowledge coming from previously learned categories, no matter how different these categories might be. We explore a Bayesian implementation of this idea. Object categories are represented by probabilistic models. Prior knowledge is represented as a probability density function on the parameters of these models. The posterior model for an object category is obtained by updating the prior in the light of one or more observations. We test a simple implementation of our algorithm on a database of 101 diverse object categories. We compare category models learned by an implementation of our Bayesian approach to models learned from by Maximum Likelihood (ML) and Maximum A Posteriori (MAP) methods. We find that on a database of more than 100 categories, the Bayesian approach produces informative models when the number of training examples is too small for other methods to operate successfully.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Robust object recognition with cortex-like mechanisms.

              We introduce a new general framework for the recognition of complex visual scenes, which is motivated by biology: We describe a hierarchical system that closely follows the organization of visual cortex and builds an increasingly complex and invariant feature representation by alternating between a template matching and a maximum pooling operation. We demonstrate the strength of the approach on a range of recognition tasks: From invariant single object recognition in clutter to multiclass categorization problems and complex scene understanding tasks that rely on the recognition of both shape-based as well as texture-based objects. Given the biological constraints that the system had to satisfy, the approach performs surprisingly well: It has the capability of learning from only a few training examples and competes with state-of-the-art systems. We also discuss the existence of a universal, redundant dictionary of features that could handle the recognition of most object categories. In addition to its relevance for computer vision, the success of this approach suggests a plausibility proof for a class of feedforward models of object recognition in cortex.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Comput Neurosci
                Front Comput Neurosci
                Front. Comput. Neurosci.
                Frontiers in Computational Neuroscience
                Frontiers Media S.A.
                1662-5188
                22 October 2014
                2014
                : 8
                : 135
                Affiliations
                Laboratory of Psychophysics, Brain, Mind Institute, École Polytechnique Fédérale de Lausanne Lausanne, Switzerland
                Author notes

                Edited by: Antonio J. Rodriguez-Sanchez, University of Innsbruck, Austria

                Reviewed by: Michael Zillich, Vienna University of Technology, Austria; Norbert Küger, The Maersk Mc-Kinney Moller Institute, Denmark

                *Correspondence: Michael H. Herzog, Laboratory of Psychophysics, Brain, Mind Institute, École Polytechnique Fédérale de Lausanne, EPFL SV BMI LPSY SV-2807, Station 19, CH-1015 Lausanne, Switzerland e-mail: michael.herzog@ 123456epfl.ch

                This article was submitted to the journal Frontiers in Computational Neuroscience.

                Article
                10.3389/fncom.2014.00135
                4205941
                38810539-a3ea-4388-96cd-c37c97cb7a19
                Copyright © 2014 Herzog and Clarke.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 15 April 2014
                : 03 October 2014
                Page count
                Figures: 2, Tables: 0, Equations: 0, References: 61, Pages: 5, Words: 4037
                Categories
                Neuroscience
                Mini Review Article

                Neurosciences
                feedback,object recognition,crowding,verniers,gestalt
                Neurosciences
                feedback, object recognition, crowding, verniers, gestalt

                Comments

                Comment on this article