6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition

      Preprint
      , , , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We investigate the use of generative adversarial networks (GANs) in speech dereverberation for robust speech recognition. GANs have been recently studied for speech enhancement to remove additive noises, but there still lacks of a work to examine their ability in speech dereverberation and the advantages of using GANs have not been fully established. In this paper, we provide deep investigations in the use of GAN-based dereverberation front-end in ASR. First, we study the effectiveness of different dereverberation networks (the generator in GAN) and find that LSTM leads a significant improvement as compared with feed-forward DNN and CNN in our dataset. Second, further adding residual connections in the deep LSTMs can boost the performance as well. Finally, we find that, for the success of GAN, it is important to update the generator and the discriminator using the same mini-batch data during training. Moreover, using reverberant spectrogram as a condition to discriminator, as suggested in previous studies, may degrade the performance. In summary, our GAN-based dereverberation front-end achieves 14%-19% relative CER reduction as compared to the baseline DNN dereverberation network when tested on a strong multi-condition training acoustic model.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: not found
          • Article: not found

          Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            An Experimental Study on Speech Enhancement Based on Deep Neural Networks

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Invertibility of a room impulse response

                Bookmark

                Author and article information

                Journal
                27 March 2018
                Article
                1803.10132
                965c2532-3782-4c13-8722-96a714e03c3a

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.SD cs.CL eess.AS

                Comments

                Comment on this article