42
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      80 New Packages to Mine Database Query Logs

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The query log of a DBMS is a powerful resource. It enables many practical applications, including query optimization and user experience enhancement. And yet, mining SQL queries is a difficult task. The fundamental problem is that queries are symbolic objects, not vectors of numbers. Therefore, many popular statistical concepts, such as means, regression, or decision trees do not apply. Most authors limit themselves to ad hoc algorithms or approaches based on neighborhoods, such as k Nearest Neighbors. Our project is to challenge this limitation. We introduce methods to manipulate SQL queries as if they were vectors, thereby unlocking the whole statistical toolbox. We present three families of methods: feature maps, kernel methods, and Bayesian models. The first technique directly encodes queries into vectors. The second one transforms the queries implicitly. The last one exploits probabilistic graphical models as an alternative to vector spaces. We present the benefits and drawbacks of each solution, highlight how they relate to each other, and make the case for future investigation.

          Related collections

          Most cited references10

          • Record: found
          • Abstract: not found
          • Book Chapter: not found

          A Comprehensive Survey of Neighborhood-based Recommendation Methods

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Mining Query Logs: Turning Search Usage Data into Knowledge

              Bookmark
              • Record: found
              • Abstract: not found
              • Book Chapter: not found

              PROMISE: Predicting Query Behavior to Enable Predictive Caching Strategies for OLAP Systems

                Bookmark

                Author and article information

                Journal
                2017-03-25
                Article
                1703.08732
                f9ac6017-864f-4a19-bf81-53540ce81d5a

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Vision Paper
                cs.DB

                Databases
                Databases

                Comments

                Comment on this article