5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Analyzing Web Archives Through Topic and Event Focused Sub-collections

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Web archives capture the history of the Web and are therefore an important source to study how societal developments have been reflected on the Web. However, the large size of Web archives and their temporal nature pose many challenges to researchers interested in working with these collections. In this work, we describe the challenges of working with Web archives and propose the research methodology of extracting and studying sub-collections of the archive focused on specific topics and events. We discuss the opportunities and challenges of this approach and suggest a framework for creating sub-collections.

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Novelty and diversity in information retrieval evaluation

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Learning temporal-dependent ranking models

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              The SHARC framework for data quality in Web archiving

                Bookmark

                Author and article information

                Journal
                2016-12-16
                Article
                10.1145/2908131.2908175
                1612.05413
                1ac5794e-2bc1-4b8b-aba2-472592c65d8f

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Proceedings of the 8th ACM Conference on Web Science (p./pp. 291--295)
                Published in the proceedings of the 8th ACM Conference on Web Science 2016
                cs.DL cs.IR

                Information & Library science
                Information & Library science

                Comments

                Comment on this article