5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Finding Theme Communities from Database Networks: from Mining to Indexing and Query Answering

      Preprint
      , , , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Given a database network where each vertex is associated with a transaction database, we are interested in finding theme communities. Here, a theme community is a cohesive subgraph such that a common pattern is frequent in all transaction databases associated with the vertices in the subgraph. Finding all theme communities from a database network enjoys many novel applications. However, it is challenging since even counting the number of all theme communities in a database network is #P-hard. Inspired by the observation that a theme community shrinks when the length of the pattern increases, we investigate several properties of theme communities and develop TCFI, a scalable algorithm that uses these properties to effectively prune the patterns that cannot form any theme community. We also design TC-Tree, a scalable algorithm that decomposes and indexes theme communities efficiently. Retrieving 1 million theme communities from a TC-Tree takes only 1 second. Extensive experiments and a case study demonstrate the effectiveness and scalability of TCFI and TC-Tree in discovering and querying meaningful theme communities from large database networks.

          Related collections

          Most cited references12

          • Record: found
          • Abstract: not found
          • Article: not found

          Network structure and minimum degree

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Graph clustering based on structural/attribute similarities

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Graph Twiddling in a MapReduce World

              J. Cohen (2009)
                Bookmark

                Author and article information

                Journal
                23 September 2017
                Article
                1709.08083
                da50b270-3cf4-44d1-8fe1-809c42b9f1e3

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.DB cs.IR

                Comments

                Comment on this article