19
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Examining the Validity of Cross-Lingual Word Sense Disambiguation

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper describes a set of experiments in which the viability of a classification-based Word Sense Disambiguation system that uses evidence from multiple languages is investigated. Instead of using a predefined monolingual sense-inventory such as WordNet, we use a language-independent framework and start from a manually constructed gold standard in which the word senses are made up by the translations that result from word alignments on a parallel corpus. To train and test the classifier, we used English as an input language and we incorporated the translations of our target words in five languages (viz. Spanish, Italian, French, Dutch and German) as features in the feature vectors. Our results show that the multilingual approach outperforms the classification experiments where no additional evidence from other languages is used. These results confirm our initial hypothesis that each language adds evidence to further refine the senses of a given word. This allows us to develop a proof of concept for a multilingual approach to Word Sense Disambiguation.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: not found
          • Article: not found

          A Systematic Comparison of Various Statistical Alignment Models

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Word sense disambiguation

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              C4.5:programs for machine learning

                Bookmark

                Author and article information

                Journal
                poli
                Polibits
                Polibits
                Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo (México, DF, Mexico )
                1870-9044
                June 2011
                : 43
                : 29-35
                Affiliations
                [01] Ghent orgnameUniversity College Ghent orgdiv1Dpt. of Applied Mathematics and Computer Science Belgium Els.Lefeverhogent.be, **Veronique.Hoste@ 123456hogent.be
                Article
                S1870-90442011000100004 S1870-9044(11)00004300004
                cd3f1d19-5f70-4ea7-acff-e75870722d4a

                This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

                History
                : 06 November 2010
                : 12 January 2011
                Page count
                Figures: 0, Tables: 0, Equations: 0, References: 22, Pages: 7
                Product

                SciELO Mexico


                multilingual,cross-lingual,Word Sense Disambiguation

                Comments

                Comment on this article