7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Book Chapter: not found
      Computer Vision – ECCV 2018 

      TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

      other

      Read this book at

      Publisher
      Buy book Bookmark
          There is no author summary for this book yet. Authors can add summaries to their books on ScienceOpen to make them more accessible to a non-specialist audience.

          Related collections

          Most cited references30

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Feature Pyramid Networks for Object Detection

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Learning Deconvolution Network for Semantic Segmentation

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition

              Image-based sequence recognition has been a long-standing research topic in computer vision. In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition. A novel neural network architecture, which integrates feature extraction, sequence modeling and transcription into a unified framework, is proposed. Compared with previous systems for scene text recognition, the proposed architecture possesses four distinctive properties: (1) It is end-to-end trainable, in contrast to most of the existing algorithms whose components are separately trained and tuned. (2) It naturally handles sequences in arbitrary lengths, involving no character segmentation or horizontal scale normalization. (3) It is not confined to any predefined lexicon and achieves remarkable performances in both lexicon-free and lexicon-based scene text recognition tasks. (4) It generates an effective yet much smaller model, which is more practical for real-world application scenarios. The experiments on standard benchmarks, including the IIIT-5K, Street View Text and ICDAR datasets, demonstrate the superiority of the proposed algorithm over the prior arts. Moreover, the proposed algorithm performs well in the task of image-based music score recognition, which evidently verifies the generality of it.
                Bookmark

                Author and book information

                Book Chapter
                2018
                October 09 2018
                : 19-35
                10.1007/978-3-030-01216-8_2
                66d0e58c-c4d2-4a89-859f-8731d6b74437
                History

                Comments

                Comment on this book

                Book chapters

                Similar content2,223

                Cited by16