
      A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

      Preprint


          Abstract

          Neural abstractive summarization models have led to promising results in summarizing relatively short documents. We propose the first model for abstractive summarization of single, longer-form documents (e.g., research papers). Our approach consists of a new hierarchical encoder that models the discourse structure of a document, and an attentive discourse-aware decoder to generate the summary. Empirical results on two large-scale datasets of scientific papers show that our model significantly outperforms state-of-the-art models.
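As a rough illustration of the architecture described above (an illustrative PyTorch sketch, not the authors' implementation; the class names, dimensions, and exact renormalization are assumptions), a word-level RNN encodes each section, a section-level RNN encodes the sequence of section vectors, and the decoder's word-level attention is rescaled by its attention over the corresponding sections:

import torch
import torch.nn as nn

class HierarchicalEncoder(nn.Module):
    """Word-level LSTM run over each section, plus a section-level LSTM
    over the resulting section vectors (the discourse structure)."""
    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.word_rnn = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.sect_rnn = nn.LSTM(hid_dim, hid_dim, batch_first=True)

    def forward(self, sections):
        # sections: list of (1, num_words) LongTensors, one per section
        word_states, sect_vecs = [], []
        for sec in sections:
            out, (h, _) = self.word_rnn(self.embed(sec))
            word_states.append(out[0])   # (num_words, hid_dim)
            sect_vecs.append(h[-1, 0])   # final hidden state, (hid_dim,)
        sect_out, _ = self.sect_rnn(torch.stack(sect_vecs).unsqueeze(0))
        return word_states, sect_out[0]  # per-word and per-section states

def discourse_aware_context(dec_state, word_states, sect_states):
    # Section-level attention rescales word-level attention so that words
    # in salient sections dominate the decoder's context vector.
    sect_attn = torch.softmax(sect_states @ dec_state, dim=0)
    weighted = [sect_attn[i] * torch.softmax(w @ dec_state, dim=0)
                for i, w in enumerate(word_states)]
    word_attn = torch.cat(weighted)
    word_attn = word_attn / word_attn.sum()    # renormalize over document
    return word_attn @ torch.cat(word_states)  # context vector, (hid_dim,)

The decoder would consume this context vector at each step alongside its own hidden state when predicting the next summary token.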


Most cited references (10)


          Speech Recognition with Deep Recurrent Neural Networks

Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end training methods such as Connectionist Temporal Classification make it possible to train RNNs for sequence labelling problems where the input-output alignment is unknown. The combination of these methods with the Long Short-term Memory RNN architecture has proved particularly fruitful, delivering state-of-the-art results in cursive handwriting recognition. However, RNN performance in speech recognition has so far been disappointing, with better results returned by deep feedforward networks. This paper investigates deep recurrent neural networks, which combine the multiple levels of representation that have proved so effective in deep networks with the flexible use of long-range context that empowers RNNs. When trained end-to-end with suitable regularisation, we find that deep Long Short-term Memory RNNs achieve a test set error of 17.7% on the TIMIT phoneme recognition benchmark, which to our knowledge is the best recorded score.
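For readers unfamiliar with the stacked architecture, a minimal PyTorch sketch of a deep bidirectional LSTM for frame-level labelling follows; the feature and class dimensions are placeholders, and the paper's CTC training objective is omitted (PyTorch's nn.CTCLoss could supply it):

import torch
import torch.nn as nn

class DeepLSTMTagger(nn.Module):
    """Stacked ("deep") bidirectional LSTM: each layer's hidden sequence
    feeds the next, combining depth with long-range temporal context."""
    def __init__(self, num_features, hid_dim, num_classes, depth=3):
        super().__init__()
        self.rnn = nn.LSTM(num_features, hid_dim, num_layers=depth,
                           bidirectional=True, batch_first=True)
        self.out = nn.Linear(2 * hid_dim, num_classes)

    def forward(self, frames):
        # frames: (batch, time, num_features) acoustic feature vectors
        hidden, _ = self.rnn(frames)
        return self.out(hidden)  # per-frame class scores

# Illustrative dimensions: 40-dim filterbank features, 61 phone classes
model = DeepLSTMTagger(num_features=40, hid_dim=128, num_classes=61)
scores = model(torch.randn(2, 100, 40))  # -> (2, 100, 61)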

            An Improved Non-monotonic Transition System for Dependency Parsing


              Get To The Point: Summarization with Pointer-Generator Networks

              Neural sequence-to-sequence models have provided a viable new approach for abstractive text summarization (meaning they are not restricted to simply selecting and rearranging passages from the original text). However, these models have two shortcomings: they are liable to reproduce factual details inaccurately, and they tend to repeat themselves. In this work we propose a novel architecture that augments the standard sequence-to-sequence attentional model in two orthogonal ways. First, we use a hybrid pointer-generator network that can copy words from the source text via pointing, which aids accurate reproduction of information, while retaining the ability to produce novel words through the generator. Second, we use coverage to keep track of what has been summarized, which discourages repetition. We apply our model to the CNN / Daily Mail summarization task, outperforming the current abstractive state-of-the-art by at least 2 ROUGE points.
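A condensed sketch of one decoding step of this mechanism (illustrative PyTorch, not the released implementation; in the paper coverage is fed into the attention function and trained with an extra loss, whereas here it simply penalizes already-attended positions):

import torch
import torch.nn as nn

class PointerGeneratorStep(nn.Module):
    """One decoding step mixing generation and copying, with coverage."""
    def __init__(self, hid_dim, vocab_size):
        super().__init__()
        self.p_gen = nn.Linear(2 * hid_dim, 1)  # from [context; state]
        self.vocab = nn.Linear(2 * hid_dim, vocab_size)

    def forward(self, dec_state, enc_states, src_ids, coverage):
        # attention over source positions, penalized by coverage so far
        scores = enc_states @ dec_state - coverage
        attn = torch.softmax(scores, dim=0)
        context = attn @ enc_states
        feats = torch.cat([context, dec_state])
        p_gen = torch.sigmoid(self.p_gen(feats))  # generate vs. copy
        vocab_dist = p_gen * torch.softmax(self.vocab(feats), dim=0)
        # scatter copy probabilities onto the source tokens' vocab ids
        vocab_dist = vocab_dist.index_add(0, src_ids, (1 - p_gen) * attn)
        return vocab_dist, coverage + attn  # updated coverage

The returned distribution covers both the fixed vocabulary and the source tokens, which is what lets the model reproduce rare or out-of-vocabulary names accurately.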

                Author and article information

Published: 16 April 2018
arXiv: 1804.05685
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Venue: NAACL HLT 2018
Subject: cs.CL
Discipline: Theoretical computer science
