
      The Effect of Translationese in Machine Translation Test Sets

      Preprint

          Abstract

          The effect of translationese has been studied in the field of machine translation (MT), mostly with respect to training data. We study in depth the effect of translationese on test data, using the test sets from the last three editions of WMT's news shared task, containing 17 translation directions. We show evidence that (i) the use of translationese in test sets results in inflated human evaluation scores for MT systems; (ii) in some cases system rankings do change; and (iii) the impact translationese has on a translation direction is inversely correlated with the translation quality attainable by state-of-the-art MT systems for that direction.



                Author and article information

                Date: 19 June 2019
                arXiv: 1906.08069
                License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
                Comments: 9 pages, 10 pages appendix, 3 figures, 20 tables, accepted in WMT19
                Subjects: cs.CL, Theoretical computer science
