Blog
About

0
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      NORMALIZATION OF NOISY TEXTS IN MALAYSIAN ONLINE REVIEWS

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The process of gathering useful information from online messages has increased as more and more people use the Internet and other online applications such as Facebook and Twitter to communicate with each other. One of the problems in processing online messages is the high number of noisy texts that exist in these messages. Few studies have shown that the noisy texts decreased the result of text mining activities. On the other hand, very few works have investigated on the patterns of noisy texts that are created by Malaysians. In this study, a common noisy terms list and an artificial abbreviations list were created using specific rules and were utilized to select candidates of correct words for a noisy term. Later, the correct term was selected based on a bi-gram words index. The experiments used online messages that were created by the Malaysians. The result shows that normalization of noisy texts using artificial abbreviations list compliments the use of common noisy texts list.  

          Related collections

          Author and article information

          Contributors
          Malaysia
          Malaysia
          Malaysia
          Malaysia
          Journal
          Journal of Information and Communication Technology
          UUM Press
          April 23 2013
          : 12
          : 147-159
          Affiliations
          [1 ]Faculty of Computer and Mathematical Science, Universiti Teknologi MARA Terengganu, Dungun, 23000, Terengganu, Malaysia
          Article
          10.32890/jict.12.2013.8141

          All content is freely available without charge to users or their institutions. Users are allowed to read, download, copy, distribute, print, search, or link to the full texts of the articles in this journal without asking prior permission of the publisher or the author. Articles published in the journal are distributed under a http://creativecommons.org/licenses/by/4.0/.

          Comments

          Comment on this article