7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure that has been explored so far is a linear chain. However, natural language exhibits syntactic properties that would naturally combine words to phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).

          Related collections

          Most cited references7

          • Record: found
          • Abstract: not found
          • Article: not found

          The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            A Fast and Accurate Dependency Parser using Neural Networks

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Hybrid speech recognition with Deep Bidirectional LSTM

                Bookmark

                Author and article information

                Journal
                2015-02-28
                2015-05-30
                Article
                1503.00075
                d6b649ec-0a65-487a-96ec-a78fba41ef3a

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Accepted for publication at ACL 2015
                cs.CL cs.AI cs.LG

                Theoretical computer science,Artificial intelligence
                Theoretical computer science, Artificial intelligence

                Comments

                Comment on this article