2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Simple Automatic Post-editing for Arabic-Japanese Machine Translation

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A common bottleneck for developing machine translation (MT) systems for some language pairs is the lack of direct parallel translation data sets, in general and in certain domains. Alternative solutions such as zero-shot models or pivoting techniques are successful in getting a strong baseline, but are often below the more supported language-pair systems. In this paper, we focus on Arabic-Japanese machine translation, a less studied language pair; and we work with a unique parallel corpus of Arabic news articles that were manually translated to Japanese. We use this parallel corpus to adapt a state-of-the-art domain/genre agnostic neural MT system via a simple automatic post-editing technique. Our results and detailed analysis suggest that this approach is quite viable for less supported language pairs in specific domains.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Moses

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Experiments in domain adaptation for statistical machine translation

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              On the features of translationese

                Bookmark

                Author and article information

                Journal
                14 July 2019
                Article
                1907.06210
                2f44710e-3fd6-4b43-bfbd-2b954877a666

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Machine translation, Automatic Post editing, Arabic, Japanese
                cs.CL

                Theoretical computer science
                Theoretical computer science

                Comments

                Comment on this article