Blog
About

23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found
      Is Open Access

      Chinese Microblog Topic Detection through POS-Based Semantic Expansion

      , ,

      Information

      MDPI AG

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A microblog is a new type of social media for information publishing, acquiring, and spreading. Finding the significant topics of a microblog is necessary for popularity tracing and public opinion following. This paper puts forward a method to detect topics from Chinese microblogs. Since traditional methods showed low performance on a short text from a microblog, we put forward a topic detection method based on the semantic description of the microblog post. The semantic expansion of the post supplies more information and clues for topic detection. First, semantic features are extracted from a microblog post. Second, the semantic features are expanded according to a thesaurus. Here TongYiCi CiLin is used as the lexical resource to find words with the same meaning. To overcome the polysemy problem, several semantic expansion strategies based on part-of-speech are introduced and compared. Third, an approach to detect topics based on semantic descriptions and an improved incremental clustering algorithm is introduced. A dataset from Sina Weibo is employed to evaluate our method. Experimental results show that our method can bring about better results both for post clustering and topic detection in Chinese microblogs. We also found that the semantic expansion of nouns is far more efficient than for other parts of speech. The potential mechanism of the phenomenon is also analyzed and discussed.

          Related collections

          Most cited references 10

          • Record: found
          • Abstract: not found
          • Article: not found

          Short text similarity based on probabilistic topics

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Weakness Finder: Find product weakness from Chinese reviews by using aspects based sentiment analysis

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A cross-media public sentiment analysis system for microblog

                Bookmark

                Author and article information

                Journal
                INFOGG
                Information
                Information
                MDPI AG
                2078-2489
                August 2018
                August 10 2018
                : 9
                : 8
                : 203
                Article
                10.3390/info9080203
                © 2018
                Product
                Self URI (article page): http://www.mdpi.com/2078-2489/9/8/203

                Comments

                Comment on this article