0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Multimodal Abstractive Summarization for How2 Videos

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper, we study abstractive summarization for open-domain videos. Unlike the traditional text news summarization, the goal is less to "compress" text information but rather to provide a fluent textual summary of information that has been collected and fused from different source modalities, in our case video and audio transcripts (or text). We show how a multi-source sequence-to-sequence model with hierarchical attention can integrate information from different modalities into a coherent output, compare various models trained with different modalities and present pilot experiments on the How2 corpus of instructional videos. We also propose a new evaluation metric (Content F1) for abstractive summarization task that measures semantic adequacy rather than fluency of the summaries, which is covered by metrics like ROUGE and BLEU.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Meteor Universal: Language Specific Translation Evaluation for Any Target Language

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Get To The Point: Summarization with Pointer-Generator Networks

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?

                Bookmark

                Author and article information

                Journal
                18 June 2019
                Article
                1906.07901
                c424fb08-8dcc-4cf4-9306-647bcbefec07

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                To appear in ACL 2019
                cs.CL cs.CV cs.LG cs.MM

                Computer vision & Pattern recognition,Theoretical computer science,Artificial intelligence,Graphics & Multimedia design

                Comments

                Comment on this article