22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We explore how to improve machine translation systems by adding more translation data in situations where we already have substantial resources. The main challenge is how to buck the trend of diminishing returns that is commonly encountered. We present an active learning-style data solicitation algorithm to meet this challenge. We test it, gathering annotations via Amazon Mechanical Turk, and find that we get an order of magnitude increase in performance rates of improvement.

          Related collections

          Author and article information

          Journal
          21 October 2014
          Article
          1410.5877
          ae5a59d1-fd1e-45a1-9860-339d00f87fc0

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 854-864, Uppsala, Sweden, July 2010. Association for Computational Linguistics
          11 pages, 14 figures; appeared in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, July 2010
          cs.CL cs.LG stat.ML

          Comments

          Comment on this article