7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Towards Better Text Understanding and Retrieval through Kernel Entity Salience Modeling

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper presents a Kernel Entity Salience Model (KESM) that improves text understanding and retrieval by better estimating entity salience (importance) in documents. KESM represents entities by knowledge enriched distributed representations, models the interactions between entities and words by kernels, and combines the kernel scores to estimate entity salience. The whole model is learned end-to-end using entity salience labels. The salience model also improves ad hoc search accuracy, providing effective ranking features by modeling the salience of query entities in candidate documents. Our experiments on two entity salience corpora and two TREC ad hoc search datasets demonstrate the effectiveness of KESM over frequency-based and feature-based methods. We also provide examples showing how KESM conveys its text understanding ability learned from entity salience to search.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          The Stanford CoreNLP Natural Language Processing Toolkit

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Learning to Rank for Information Retrieval

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Boilerplate detection using shallow text features

                Bookmark

                Author and article information

                Journal
                03 May 2018
                Article
                10.1145/3209978.3209982
                1805.01334
                7678aa42-b8d7-4f45-9128-23a07c695669

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                In proceedings of SIGIR 2018
                cs.IR cs.CL

                Theoretical computer science,Information & Library science
                Theoretical computer science, Information & Library science

                Comments

                Comment on this article