46
views
0
recommends
+1 Recommend
1 collections
    6
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      New Methods for Prosodic Transcription: Capturing Variability as a Source of Information

      research-article

      Read this article at

      ScienceOpenPublisher
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Understanding the role of prosody in encoding linguistic meaning and in shaping phonetic form requires the analysis of prosodically annotated speech drawn from a wide variety of speech materials. Yet obtaining accurate and reliable prosodic annotations for even small datasets is challenging due to the time and expertise required. We discuss several factors that make prosodic annotation difficult and impact its reliability, all of which relate to variability: in the patterning of prosodic elements (features and structures) as they relate to the linguistic and discourse context, in the acoustic cues for those prosodic elements, and in the parameter values of the cues. We propose two novel methods for prosodic transcription that capture variability as a source of information relevant to the linguistic analysis of prosody. The first is Rapid Prosody Transcription (RPT), which can be performed by non-experts using a simple set of unary labels to mark prominence and boundaries based on immediate auditory impression. Inter-transcriber variability is used to calculate continuous-valued prosody ‘scores’ that are assigned to each word and represent the perceptual salience of its prosodic features or structure. RPT can be used to model the relative influence of top-down factors and acoustic cues in prosody perception, and to model prosodic variation across many dimensions, including language variety, speech style, or speaker’s affect. The second proposed method is the identification of individual cues to the contrastive prosodic elements of an utterance. Cue specification provides a link between the contrastive symbolic categories of prosodic structures and the continuous-valued parameters in the acoustic signal, and offers a framework for investigating how factors related to the grammatical and situational context influence the phonetic form of spoken words and phrases. While cue specification as a transcription tool has not yet been explored as RPT has, it has the potential to provide a level of detail that will be useful in modelling systematic context-governed variation in the implementation of prosodic categories, with applications in automatic speech synthesis and recognition, as well as modelling human speech production and perception. We discuss how RPT and cue specification, particularly when combined, can improve the efficiency and reliability of prosodic transcription and how they can be integrated with expert phonological transcription.

          Related collections

          Most cited references110

          • Record: found
          • Abstract: not found
          • Book: not found

          Intonational Phonology

          D. Ladd (2008)
            Bookmark
            • Record: found
            • Abstract: not found
            • Book: not found

            Phonology and Language Use

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Intonational structure in Japanese and English

                Bookmark

                Author and article information

                Contributors
                Journal
                1868-6354
                Laboratory Phonology: Journal of the Association for Laboratory Phonology
                Ubiquity Press
                1868-6354
                30 June 2016
                : 7
                : 1
                : 8
                Affiliations
                [-1]University of Illinois, US
                [-2]Massachusetts Institute of Technology, US
                Article
                10.5334/labphon.29
                7bf185f6-eccd-4da7-a92e-633d63714b9c
                Copyright: © 2016 The Author(s)

                This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. See http://creativecommons.org/licenses/by/4.0/.

                History
                Categories
                Journal article

                Applied linguistics,General linguistics,Linguistics & Semiotics
                Applied linguistics, General linguistics, Linguistics & Semiotics

                Comments

                Comment on this article