7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      DNN Filter Bank Cepstral Coefficients for Spoofing Detection

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          With the development of speech synthesis techniques, automatic speaker verification systems face the serious challenge of spoofing attack. In order to improve the reliability of speaker verification systems, we develop a new filter bank based cepstral feature, deep neural network filter bank cepstral coefficients (DNN-FBCC), to distinguish between natural and spoofed speech. The deep neural network filter bank is automatically generated by training a filter bank neural network (FBNN) using natural and synthetic speech. By adding restrictions on the training rules, the learned weight matrix of FBNN is band-limited and sorted by frequency, similar to the normal filter bank. Unlike the manually designed filter bank, the learned filter bank has different filter shapes in different channels, which can capture the differences between natural and synthetic speech more effectively. The experimental results on the ASVspoof {2015} database show that the Gaussian mixture model maximum-likelihood (GMM-ML) classifier trained by the new feature performs better than the state-of-the-art linear frequency cepstral coefficients (LFCC) based classifier, especially on detecting unknown attacks.

          Related collections

          Most cited references16

          • Record: found
          • Abstract: not found
          • Article: not found

          An Experimental Study on Speech Enhancement Based on Deep Neural Networks

            Bookmark
            • Record: found
            • Abstract: not found
            • Conference Proceedings: not found

            Deep neural networks for small footprint text-dependent speaker verification

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification

                Bookmark

                Author and article information

                Journal
                2017-02-13
                Article
                1702.03791
                d90a959f-d944-4bfb-bf31-5cbd839dde58

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                cs.SD cs.CR cs.LG

                Security & Cryptology,Artificial intelligence,Graphics & Multimedia design
                Security & Cryptology, Artificial intelligence, Graphics & Multimedia design

                Comments

                Comment on this article