17
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Detecting Hotspot Information Using Multi-Attribute Based Topic Model

      research-article
      , * , , ,
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Microblogging as a kind of social network has become more and more important in our daily lives. Enormous amounts of information are produced and shared on a daily basis. Detecting hot topics in the mountains of information can help people get to the essential information more quickly. However, due to short and sparse features, a large number of meaningless tweets and other characteristics of microblogs, traditional topic detection methods are often ineffective in detecting hot topics. In this paper, we propose a new topic model named multi-attribute latent dirichlet allocation (MA-LDA), in which the time and hashtag attributes of microblogs are incorporated into LDA model. By introducing time attribute, MA-LDA model can decide whether a word should appear in hot topics or not. Meanwhile, compared with the traditional LDA model, applying hashtag attribute in MA-LDA model gives the core words an artificially high ranking in results meaning the expressiveness of outcomes can be improved. Empirical evaluations on real data sets demonstrate that our method is able to detect hot topics more accurately and efficiently compared with several baselines. Our method provides strong evidence of the importance of the temporal factor in extracting hot topics.

          Related collections

          Author and article information

          Contributors
          Role: Editor
          Journal
          PLoS One
          PLoS ONE
          plos
          plosone
          PLoS ONE
          Public Library of Science (San Francisco, CA USA )
          1932-6203
          2015
          23 October 2015
          : 10
          : 10
          : e0140539
          Affiliations
          [001]School of Computer and Information Science, Southwest University, Chongqing, China
          Tianjin University of Technology, CHINA
          Author notes

          Competing Interests: The authors have declared that no competing interests exist.

          Conceived and designed the experiments: JW LL. Performed the experiments: JW. Analyzed the data: FT YZ. Contributed reagents/materials/analysis tools: JW WF. Wrote the paper: JW LL.

          Article
          PONE-D-15-10202
          10.1371/journal.pone.0140539
          4619720
          26496635
          db346f7c-8f7e-4635-9453-542d124e2df6
          Copyright @ 2015

          This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

          History
          : 27 May 2015
          : 28 September 2015
          Page count
          Figures: 6, Tables: 6, Pages: 16
          Funding
          This work was supported by National Natural Science Foundation of China (No.61170192) and National High-tech R&D Program (No. 2013AA013801).
          Categories
          Research Article
          Custom metadata
          The Tencent data are mined from Tencent Microblogs ( http:// blog.qq.com/). To query the Tencent API, please refer to http://wiki.open.qq.com/wiki/API%E6%96%87%E6%A1%A3. Twitter data are from a previously published study “Large Scale Microblog Mining Using Distributed MB-LDA” whose authors may be contacted at zhangchenyi.zju@ 123456gmail.com .

          Uncategorized
          Uncategorized

          Comments

          Comment on this article