5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Children learn and develop their abilities at their own pace. One of the most basic skills that they acquire is reading. However, some children struggle with reading longer than their friends, and in such a case, it is possible that they have a learning disorder known as dyslexia. The paper aims to use neural networks, namely generative neural networks, for generating raw audio data of two- or three-letter Hindi words. Using the generated data, a system will be built that will pronounce generated words for children recuperating from dyslexia. The system aims to be an effective helping tool for teachers to speed up the recuperation process by making the child repeat the correct pronunciation of the word. The system uses advance Mel-generative adversarial network neural network for working with Mel-spectrograms of the raw audio, by which the system will model its own audio iteratively, until a satisfactory result is achieved. Generated audio sample contains the Hindi words which will be taught to children. Mel-generative adversarial network will be used to generate audio samples since it provides better results compared to other existing models. 300 basic two- or three-letter Hindi words are taken as an input for assisting 5- to 8-year children. Minimum opinion score is calculated for comparison.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: not found
          • Article: not found

          A Scale for the Measurement of the Psychological Magnitude Pitch

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Measuring speech quality for text-to-speech systems: development and assessment of a modified mean opinion score (MOS) scale

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Deep convolutional neural networks: structure, feature extraction and training

                Bookmark

                Author and article information

                Contributors
                geeta.atkar2016@vitstudent.ac.in
                Priyadarshini.j@vit.ac.in
                Journal
                Neural Comput Appl
                Neural Comput Appl
                Neural Computing & Applications
                Springer London (London )
                0941-0643
                1433-3058
                15 February 2021
                : 1-10
                Affiliations
                GRID grid.412813.d, ISNI 0000 0001 0687 4946, School of Computer Science and Engineering, , Vellore Institute of Technology, ; Chennai, 600127 India
                Author information
                http://orcid.org/0000-0002-5328-484X
                Article
                5695
                10.1007/s00521-021-05695-3
                7883547
                33612979
                201f2e86-3e8b-4988-b1f4-5c880dd559f9
                © The Author(s), under exclusive licence to Springer-Verlag London Ltd. part of Springer Nature 2021

                This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.

                History
                : 1 September 2020
                : 5 January 2021
                Categories
                Original Article

                Neural & Evolutionary computing
                dyslexia,generative adversarial network,minimum opinion score,melgan,wavegan

                Comments

                Comment on this article