Blog
About

0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motion has shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming. Recent works directly leverage the motion vectors and residuals readily available in the compressed video to represent motion at no cost. While this avoids flow computation, it also hurts accuracy since the motion vector is noisy and has substantially reduced resolution, which makes it a less discriminative motion representation. To remedy these issues, we propose a lightweight generator network, which reduces noises in motion vectors and captures fine motion details, achieving a more Discriminative Motion Cue (DMC) representation. Since optical flow is a more accurate motion representation, we train the DMC generator to approximate flow using a reconstruction loss and a generative adversarial loss, jointly with the downstream action classification task. Extensive evaluations on three action recognition benchmarks (HMDB-51, UCF-101, and a subset of Kinetics) confirm the effectiveness of our method. Our full system, consisting of the generator and the classifier, is coined as DMC-Net which obtains high accuracy close to that of using flow and runs two orders of magnitude faster than using optical flow at inference time.

          Related collections

          Most cited references 17

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Learning realistic human actions from movies

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            HMDB: A large video database for human motion recognition

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

                Bookmark

                Author and article information

                Journal
                10 January 2019
                Article
                1901.03460

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                Custom metadata
                cs.CV

                Computer vision & Pattern recognition

                Comments

                Comment on this article