14
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An Image Dataset of Text Patches in Everyday Scenes

      Preprint
      Published

      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper describes a dataset containing small images of text from everyday scenes. The purpose of the dataset is to support the development of new automated systems that can detect and analyze text. Although much research has been devoted to text detection and recognition in scanned documents, relatively little attention has been given to text detection in other types of images, such as photographs that are posted on social-media sites. This new dataset, known as COCO-Text-Patch, contains approximately 354,000 small images that are each labeled as "text" or "non-text". This dataset particularly addresses the problem of text verification, which is an essential stage in the end-to-end text detection and recognition pipeline. In order to evaluate the utility of this dataset, it has been used to train two deep convolution neural networks to distinguish text from non-text. One network is inspired by the GoogLeNet architecture, and the second one is based on CaffeNet. Accuracy levels of 90.2% and 90.9% were obtained using the two networks, respectively. All of the images, source code, and deep-learning trained models described in this paper will be publicly available

          Related collections

          Most cited references 6

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          ICDAR 2015 competition on Robust Reading

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            ICDAR 2011 Robust Reading Competition - Challenge 1: Reading Text in Born-Digital Images (Web and Email)

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Accurate video text detection through classification of low and high contrast images

                Bookmark

                Author and article information

                Journal
                2016-10-20
                Article
                1610.06494

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                Custom metadata
                Accepted in the 12th International Symposium on Visual Computing (ISVC'16)
                cs.CV

                Computer vision & Pattern recognition

                Comments

                Comment on this article