3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Multi-View Features and Hybrid Reward Strategies for Vatex Video Captioning Challenge 2019

      Preprint
      , , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This document describes our solution for the VATEX Captioning Challenge 2019, which requires generating descriptions for the videos in both English and Chinese languages. We identified three crucial factors that improve the performance, namely: multi-view features, hybrid reward, and diverse ensemble. Our method achieves the 2nd and the 3rd places on the Chinese and English video captioning tracks, respectively.

          Related collections

          Most cited references3

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            A Closer Look at Spatiotemporal Convolutions for Action Recognition

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Self-Critical Sequence Training for Image Captioning

                Bookmark

                Author and article information

                Journal
                17 October 2019
                Article
                1910.11102
                1b719d7f-8a0b-48a8-808f-90b4d7983254

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                3 pages,1 figure
                cs.CV cs.MM

                Computer vision & Pattern recognition,Graphics & Multimedia design
                Computer vision & Pattern recognition, Graphics & Multimedia design

                Comments

                Comment on this article