Is Open Access

Multimodal Abstractive Summarization for How2 Videos

Preprint

Author(s): Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze

Publication date (created): 18 June 2019

Abstract

In this paper, we study abstractive summarization for open-domain videos. Unlike traditional text news summarization, the goal is less to "compress" text information than to provide a fluent textual summary of information that has been collected and fused from different source modalities, in our case video and audio transcripts (or text). We show how a multi-source sequence-to-sequence model with hierarchical attention can integrate information from different modalities into a coherent output, compare various models trained with different modalities, and present pilot experiments on the How2 corpus of instructional videos. We also propose a new evaluation metric (Content F1) for the abstractive summarization task that measures the semantic adequacy of the summaries rather than their fluency, which is covered by metrics like ROUGE and BLEU.
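
The abstract names Content F1 but does not define it on this page. As a rough illustration only, the sketch below assumes the metric can be approximated as an F1 score over content words shared by a generated summary and a reference, after stop words are removed. The stop-word list, the whitespace tokenization, and the exact-match overlap are all assumptions made for this example; the paper's actual definition may differ, for instance by aligning words rather than requiring exact matches.

```python
from collections import Counter

# Hypothetical stop-word list, for illustration only; not the list used in the paper.
STOP_WORDS = {"a", "an", "the", "to", "of", "in", "and", "is",
              "how", "this", "for", "on", "with", "you"}


def content_words(text):
    """Lowercase, split on whitespace, and drop stop words."""
    return [w for w in text.lower().split() if w not in STOP_WORDS]


def content_f1(hypothesis, reference):
    """F1 over exact-match content-word overlap (assumed simplification)."""
    hyp = Counter(content_words(hypothesis))
    ref = Counter(content_words(reference))
    # Counter intersection clips each word's count to the smaller of the two.
    overlap = sum((hyp & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    score = content_f1("learn how to tie a fishing knot",
                       "how to tie a knot for fishing")
    print(f"content F1 (sketch): {score:.3f}")
```

Because word order is ignored and function words are discarded, a score of this kind rewards summaries that mention the right content regardless of how fluently they are phrased, which is the contrast with ROUGE and BLEU that the abstract draws.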

Author and article information


Article

arXiv ID: 1906.07901

SO-VID: c424fb08-8dcc-4cf4-9306-647bcbefec07

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Custom metadata

Comments: To appear in ACL 2019

Categories: cs.CL, cs.CV, cs.LG, cs.MM

ScienceOpen disciplines: Computer vision & Pattern recognition, Theoretical computer science, Artificial intelligence, Graphics & Multimedia design
