
      A Relationship: Word Alignment, Phrase Table, and Translation Quality

      research-article


          Abstract

          In recent years, researchers have conducted several studies that evaluate machine translation quality in terms of the relationship between word alignments and the phrase table. However, existing methods usually employ ad hoc heuristics without theoretical support. So far, no work has provided a formula that describes the relationship among word alignments, the phrase table, and machine translation performance. In this paper, on the one hand, we formulate such a relationship for estimating the number of extracted phrase pairs given one or more word alignment points. On the other hand, we propose a corpus-motivated pruning technique to prune the default large phrase table. Experiments show that the deduced formula is feasible: it can be used not only to predict the size of the phrase table but also as a valuable reference for investigating the relationship between translation performance and phrase tables built from different word alignment links. The corpus-motivated pruning results show that nearly 98% of phrases can be pruned without any significant loss in translation quality.
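
          As a rough illustration of why the number of word alignment points affects phrase table size, the sketch below enumerates the phrase pairs that are consistent with a word alignment, using the standard consistency criterion from phrase-based statistical machine translation. It is not the formula derived in the paper; the function name `extract_phrase_pairs`, the `max_len` cap, and the toy alignments are assumptions made here for illustration only.

```python
def extract_phrase_pairs(src_len, tgt_len, alignment, max_len=7):
    """Enumerate phrase pairs consistent with a word alignment.

    `alignment` is a collection of (source_index, target_index) links.
    A source span [i1, i2] and a target span [j1, j2] form a consistent
    pair when at least one link lies inside the box they define and no
    link has exactly one of its two ends inside the box.
    """
    pairs = []
    for i1 in range(src_len):
        for i2 in range(i1, min(i1 + max_len, src_len)):
            for j1 in range(tgt_len):
                for j2 in range(j1, min(j1 + max_len, tgt_len)):
                    has_link_inside = False
                    consistent = True
                    for i, j in alignment:
                        src_in = i1 <= i <= i2
                        tgt_in = j1 <= j <= j2
                        if src_in and tgt_in:
                            has_link_inside = True
                        elif src_in != tgt_in:
                            # The link crosses the box boundary.
                            consistent = False
                            break
                    if has_link_inside and consistent:
                        pairs.append(((i1, i2), (j1, j2)))
    return pairs


# Toy 3-word sentence pair with two hypothetical alignments.
diagonal = {(0, 0), (1, 1), (2, 2)}   # every word aligned one-to-one
single_link = {(1, 1)}                # only the middle words aligned

n_diagonal = len(extract_phrase_pairs(3, 3, diagonal))
n_single = len(extract_phrase_pairs(3, 3, single_link))
print(n_diagonal, n_single)
```

          In this toy setting the fully aligned sentence pair yields 6 phrase pairs, while the single-link alignment yields 16, because unaligned words can attach freely to neighbouring spans. Sparser alignments can therefore produce larger phrase tables, which is the kind of dependency the abstract refers to.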


                Author and article information

                Journal
                The Scientific World Journal (TSWJ)
                Hindawi Publishing Corporation
                ISSN: 2356-6140, 1537-744X
                Published: 16 April 2014
                Volume: 2014
                Article ID: 438106
                Affiliations
                Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory, Department of Computer and Information Science, University of Macau, Macau
                Author notes

                Academic Editors: J. Shu and F. Yu

                Author information
                http://orcid.org/0000-0002-6744-8294
                http://orcid.org/0000-0002-5307-7322
                http://orcid.org/0000-0001-6395-3325
                Article
                DOI: 10.1155/2014/438106
                PMC ID: 4030493
                Record ID: 22a15002-6f7f-4678-b107-76e860badf49
                Copyright © 2014 Liang Tian et al.

                This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                Received: 30 August 2013
                Accepted: 10 March 2014
                Funding
                Funded by: Science and Technology Development Fund of Macau
                Funded by: University of Macau
                Award ID: MYRG076 (Y1-L2)-FST13-WF
                Funded by: University of Macau
                Award ID: MYRG070(Y1-L2)-FST12-CS
                Categories
                Research Article

