12
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Scientific document summarization via citation contextualization and scientific discourse

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The rapid growth of scientific literature has made it difficult for the researchers to quickly learn about the developments in their respective fields. Scientific document summarization addresses this challenge by providing summaries of the important contributions of scientific papers. We present a framework for scientific summarization which takes advantage of the citations and the scientific discourse structure. Citation texts often lack the evidence and context to support the content of the cited paper and are even sometimes inaccurate. We first address the problem of inaccuracy of the citation texts by finding the relevant context from the cited paper. We propose three approaches for contextualizing citations which are based on query reformulation, word embeddings, and supervised learning. We then train a model to identify the discourse facets for each citation. We finally propose a method for summarizing scientific papers by leveraging the faceted citations and their corresponding contexts. We evaluate our proposed method on two scientific summarization datasets in the biomedical and computational linguistics domains. Extensive evaluation results show that our methods can improve over the state of the art by large margins.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: not found
          • Article: not found

          A STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            A Neural Attention Model for Abstractive Sentence Summarization

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              A language modeling approach to information retrieval

                Bookmark

                Author and article information

                Journal
                2017-06-11
                Article
                10.1007/s00799-017-0216-8
                1706.03449
                12f4cf21-52d2-4bfa-92e6-d6fecdf3a515

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Preprint. The final publication is available at Springer via http://dx.doi.org/10.1007/s00799-017-0216-8, International Journal on Digital Libraries (IJDL) 2017
                cs.CL cs.DL

                Theoretical computer science,Information & Library science
                Theoretical computer science, Information & Library science

                Comments

                Comment on this article