22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Emotion Recognition from Chinese Speech for Smart Affective Services Using a Combination of SVM and DBN

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Accurate emotion recognition from speech is important for applications like smart health care, smart entertainment, and other smart services. High accuracy emotion recognition from Chinese speech is challenging due to the complexities of the Chinese language. In this paper, we explore how to improve the accuracy of speech emotion recognition, including speech signal feature extraction and emotion classification methods. Five types of features are extracted from a speech sample: mel frequency cepstrum coefficient (MFCC), pitch, formant, short-term zero-crossing rate and short-term energy. By comparing statistical features with deep features extracted by a Deep Belief Network (DBN), we attempt to find the best features to identify the emotion status for speech. We propose a novel classification method that combines DBN and SVM (support vector machine) instead of using only one of them. In addition, a conjugate gradient method is applied to train DBN in order to speed up the training process. Gender-dependent experiments are conducted using an emotional speech database created by the Chinese Academy of Sciences. The results show that DBN features can reflect emotion status better than artificial features, and our new classification approach achieves an accuracy of 95.8%, which is higher than using either DBN or SVM separately. Results also show that DBN can work very well for small training databases if it is properly designed.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: not found
          • Article: not found

          IEMOCAP: interactive emotional dyadic motion capture database

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Failure diagnosis using deep belief learning based health state classification

              Bookmark
              • Record: found
              • Abstract: not found
              • Book: not found

              The Nature of Statistical Learning Theory

                Bookmark

                Author and article information

                Journal
                Sensors (Basel)
                Sensors (Basel)
                sensors
                Sensors (Basel, Switzerland)
                MDPI
                1424-8220
                24 July 2017
                July 2017
                : 17
                : 7
                : 1694
                Affiliations
                [1 ]Department of Software Engineering, China University of Petroleum, No. 66 Changjiang West Road, Qingdao 266031, China; chenleiming1930@ 123456sina.com (L.C.); zhaodh.upc@ 123456gmail.com (D.Z.)
                [2 ]Department of Information Processing Science, University of Oulu, Oulu FI-91004, Finland; jiehan.zhou@ 123456oulu.fi
                Author notes
                [* ]Correspondence: zhulz@ 123456upc.edu.cn (L.Z.); zhangws@ 123456upc.edu.cn (W.Z.); Tel.: +86-532-8698-3556 (W.Z.)
                Author information
                https://orcid.org/0000-0002-4026-1649
                https://orcid.org/0000-0001-9800-1068
                Article
                sensors-17-01694
                10.3390/s17071694
                5539696
                28737705
                e1caa0ab-e7de-40ce-8c59-560d32b02455
                © 2017 by the authors.

                Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

                History
                : 08 March 2017
                : 14 July 2017
                Categories
                Article

                Biomedical engineering
                speech emotion recognition,speech features,support vector machine,deep belief networks

                Comments

                Comment on this article