7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      Knowledge discovery through directed probabilistic topic models: a survey

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Related collections

          Most cited references30

          • Record: found
          • Abstract: found
          • Article: not found

          Finding scientific topics.

          A first step in identifying the content of a document is determining which topics that document addresses. We describe a generative model for documents, introduced by Blei, Ng, and Jordan [Blei, D. M., Ng, A. Y. & Jordan, M. I. (2003) J. Machine Learn. Res. 3, 993-1022], in which each document is generated by choosing a distribution over topics and then choosing each word in the document from a topic selected according to this distribution. We then present a Markov chain Monte Carlo algorithm for inference in this model. We use this algorithm to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics. We show that the extracted topics capture meaningful structure in the data, consistent with the class designations provided by the authors of the articles, and outline further applications of this analysis, including identifying "hot topics" by examining temporal dynamics and tagging abstracts to illustrate semantic content.
            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Dynamic topic models

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              ArnetMiner

                Bookmark

                Author and article information

                Journal
                Frontiers of Computer Science in China
                Front. Comput. Sci. China
                Springer Science and Business Media LLC
                1673-7350
                1673-7466
                June 2010
                January 27 2010
                June 2010
                : 4
                : 2
                : 280-301
                Article
                10.1007/s11704-009-0062-y
                f096bddf-4ba4-4838-814d-869910192f60
                © 2010

                http://www.springer.com/tdm


                Comments

                Comment on this article