10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Computer Vision has been improved significantly in the past few decades. It has enabled machine to do many human tasks. However, the real challenge is in enabling machine to carry out tasks that an average human does not have the skills for. One such challenge that we have tackled in this paper is providing accessibility for deaf individual by providing means of communication with others with the aid of computer vision. Unlike other frequent works focusing on multiple camera, depth camera, electrical glove or visual gloves, we focused on the sole use of RGB which allows everybody to communicate with a deaf individual through their personal devices. This is not a new approach but the lack of realistic large-scale data set prevented recent computer vision trends on video classification in this filed. In this paper, we propose the first large scale ASL data set that covers over 200 signers, signer independent sets, challenging and unconstrained recording conditions and a large class count of 1000 signs. We evaluate baselines from action recognition techniques on the data set. We propose I3D, known from video classifications, as a powerful and suitable architecture for sign language recognition. We also propose new pre-trained model more appropriate for sign language recognition. Finally, We estimate the effect of number of classes and number of training samples on the recognition accuracy.

          Related collections

          Most cited references25

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          FaceNet: A unified embedding for face recognition and clustering

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Hierarchical recurrent neural network for skeleton based action recognition

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks

                Bookmark

                Author and article information

                Journal
                03 December 2018
                Article
                1812.01053
                f8c6dd76-4873-418f-b75f-302e916d9f4f

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.CV

                Computer vision & Pattern recognition
                Computer vision & Pattern recognition

                Comments

                Comment on this article