8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Feature reinforcement with word embedding and parsing information in neural TTS

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          In this paper, we propose a feature reinforcement method under the sequence-to-sequence neural text-to-speech (TTS) synthesis framework. The proposed method utilizes the multiple input encoder to take three levels of text information, i.e., phoneme sequence, pre-trained word embedding, and grammatical structure of sentences from parser as the input feature for the neural TTS system. The added word and sentence level information can be viewed as the feature based pre-training strategy, which clearly enhances the model generalization ability. The proposed method not only improves the system robustness significantly but also improves the synthesized speech to near recording quality in our experiments for out-of-domain text.

          Related collections

          Most cited references1

          • Record: found
          • Abstract: not found
          • Article: not found

          Signal estimation from modified short-time Fourier transform

            Bookmark

            Author and article information

            Journal
            03 January 2019
            Article
            1901.00707
            796bcd01-5626-40d1-a3e7-eb0907b7f909

            http://arxiv.org/licenses/nonexclusive-distrib/1.0/

            History
            Custom metadata
            Submitted to ICASSP 2019
            cs.SD cs.CL eess.AS

            Theoretical computer science,Electrical engineering,Graphics & Multimedia design

            Comments

            Comment on this article