957
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder-Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Statistical phrase-based translation

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            Supervised Sequence Labelling

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Continuous space language models

                Bookmark

                Author and article information

                Journal
                2014-06-03
                2014-09-02
                Article
                1406.1078
                03afd088-8671-4ad8-b5bc-296a1d24b0b0

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                EMNLP 2014
                cs.CL cs.LG cs.NE stat.ML

                Theoretical computer science,Machine learning,Neural & Evolutionary computing,Artificial intelligence

                Comments

                Comment on this article