19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database.

          Related collections

          Most cited references46

          • Record: found
          • Abstract: not found
          • Article: not found

          An Algorithm for Vector Quantizer Design

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Perceptual linear predictive (PLP) analysis of speech.

            A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, is presented and examined. This technique uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum: (1) the critical-band spectral resolution, (2) the equal-loudness curve, and (3) the intensity-loudness power law. The auditory spectrum is then approximated by an autoregressive all-pole model. A 5th-order all-pole model is effective in suppressing speaker-dependent details of the auditory spectrum. In comparison with conventional linear predictive (LP) analysis, PLP analysis is more consistent with human hearing. The effective second formant F2' and the 3.5-Bark spectral-peak integration theories of vowel perception are well accounted for. PLP analysis is computationally efficient and yields a low-dimensional representation of speech. These properties are found to be useful in speaker-independent automatic-speech recognition.
              Bookmark
              • Record: found
              • Abstract: not found
              • Book: not found

              Pattern recognition and machine learning

                Bookmark

                Author and article information

                Contributors
                Journal
                J Healthc Eng
                J Healthc Eng
                JHE
                Journal of Healthcare Engineering
                Hindawi
                2040-2295
                2040-2309
                2017
                19 October 2017
                : 2017
                : 8783751
                Affiliations
                1ENT Department, College of Medicine, King Saud University, Riyadh, Saudi Arabia
                2Digital Speech Processing Group, Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
                Author notes

                Academic Editor: Tiago H. Falk

                Author information
                http://orcid.org/0000-0002-9073-2357
                http://orcid.org/0000-0002-1599-1287
                http://orcid.org/0000-0002-9781-3969
                Article
                10.1155/2017/8783751
                5672151
                29201333
                0ba20012-029a-4e41-9a48-5192baa377a6
                Copyright © 2017 Tamer A. Mesallam et al.

                This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 14 December 2016
                : 4 April 2017
                : 2 May 2017
                Funding
                Funded by: National Plan for Science, Technology and Innovation
                Award ID: 12-MED-2474-02
                Categories
                Research Article

                Comments

                Comment on this article