142
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      YFCC100M: The New Data in Multimedia Research

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We present the Yahoo Flickr Creative Commons 100 Million Dataset (YFCC100M), the largest public multimedia collection that has ever been released. The dataset contains a total of 100 million media objects, of which approximately 99.2 million are photos and 0.8 million are videos, all of which carry a Creative Commons license. Each media object in the dataset is represented by several pieces of metadata, e.g. Flickr identifier, owner name, camera, title, tags, geo, media source. The collection provides a comprehensive snapshot of how photos and videos were taken, described, and shared over the years, from the inception of Flickr in 2004 until early 2014. In this article we explain the rationale behind its creation, as well as the implications the dataset has for science, research, engineering, and development. We further present several new challenges in multimedia research that can now be expanded upon with our dataset.

          Related collections

          Most cited references4

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Mapping the world's photos

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Digital photography: communication, identity, memory

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Tweets from Justin Bieber's heart

                Bookmark

                Author and article information

                Journal
                2015-03-05
                2016-04-25
                Article
                10.1145/2812802
                1503.01817
                92b2c6cd-a14e-4d13-80a2-6bac4d06902b

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Communications of the ACM, 59(2), pp. 64-73, 2016
                cs.MM cs.CY

                Applied computer science,Graphics & Multimedia design
                Applied computer science, Graphics & Multimedia design

                Comments

                Comment on this article