7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MegaDepth: Learning Single-View Depth Prediction from Internet Photos

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Single-view depth prediction is a fundamental problem in computer vision. Recently, deep learning methods have led to significant progress, but such methods are limited by the available training data. Current datasets based on 3D sensors have key limitations, including indoor-only images (NYU), small numbers of training examples (Make3D), and sparse sampling (KITTI). We propose to use multi-view Internet photo collections, a virtually unlimited data source, to generate training data via modern structure-from-motion and multi-view stereo (MVS) methods, and present a large depth dataset called MegaDepth based on this idea. Data derived from MVS comes with its own challenges, including noise and unreconstructable objects. We address these challenges with new data cleaning methods, as well as automatically augmenting our data with ordinal depth relations generated using semantic segmentation. We validate the use of large amounts of Internet data by showing that models trained on MegaDepth exhibit strong generalization-not only to novel scenes, but also to other diverse datasets including Make3D, KITTI, and DIW, even when no images from those datasets are seen during training.

          Related collections

          Most cited references19

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Pyramid Scene Parsing Network

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Are we ready for autonomous driving? The KITTI vision benchmark suite

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Object scene flow for autonomous vehicles

                Bookmark

                Author and article information

                Journal
                02 April 2018
                Article
                1804.00607
                234a301b-f92b-49b7-ade1-65e3487ad259

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                CVPR, 2018
                cs.CV

                Comments

                Comment on this article