6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

      1 , 2 , 3 , 3
      Transactions of the Association for Computational Linguistics
      MIT Press - Journals

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          How to model a pair of sentences is a critical issue in many NLP tasks such as answer selection (AS), paraphrase identification (PI) and textual entailment (TE). Most prior work (i) deals with one individual task by fine-tuning a specific system; (ii) models each sentence’s representation separately, rarely considering the impact of the other sentence; or (iii) relies fully on manually designed, task-specific linguistic features. This work presents a general Attention Based Convolutional Neural Network (ABCNN) for modeling a pair of sentences. We make three contributions. (i) The ABCNN can be applied to a wide variety of tasks that require modeling of sentence pairs. (ii) We propose three attention schemes that integrate mutual influence between sentences into CNNs; thus, the representation of each sentence takes into consideration its counterpart. These interdependent sentence pair representations are more powerful than isolated sentence representations. (iii) ABCNNs achieve state-of-the-art performance on AS, PI and TE tasks. We release code at: https://github.com/yinwenpeng/Answer_Selection .

          Related collections

          Most cited references1

          • Record: found
          • Abstract: not found
          • Article: not found

          Cogex: A semantically and contextually enriched logic prover for question answering

            Bookmark

            Author and article information

            Journal
            Transactions of the Association for Computational Linguistics
            Transactions of the Association for Computational Linguistics
            MIT Press - Journals
            2307-387X
            December 2016
            December 2016
            : 4
            : 259-272
            Affiliations
            [1 ]Center for Information and Language Processing, LMU Munich, Germany,
            [2 ]Center for Information and Language Processing, LMU Munich, Germany
            [3 ]IBM Watson, Yorktown Heights, NY, USA,
            Article
            10.1162/tacl_a_00097
            44bf3781-0aee-4847-8e70-8e6351cf286a
            © 2016
            History

            Comments

            Comment on this article