54
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Audio segmentation using Flattened Local Trimmed Range for ecological acoustic space analysis

      research-article

      Read this article at

      ScienceOpenPublisher
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The acoustic space in a given environment is filled with footprints arising from three processes: biophony, geophony and anthrophony. Bioacoustic research using passive acoustic sensors can result in thousands of recordings. An important component of processing these recordings is to automate signal detection. In this paper, we describe a new spectrogram-based approach for extracting individual audio events. Spectrogram-based audio event detection (AED) relies on separating the spectrogram into background (i.e., noise) and foreground (i.e., signal) classes using a threshold such as a global threshold, a per-band threshold, or one given by a classifier. These methods are either too sensitive to noise, designed for an individual species, or require prior training data. Our goal is to develop an algorithm that is not sensitive to noise, does not need any prior training data and works with any type of audio event. To do this, we propose: (1) a spectrogram filtering method, the Flattened Local Trimmed Range (FLTR) method, which models the spectrogram as a mixture of stationary and non-stationary energy processes and mitigates the effect of the stationary processes, and (2) an unsupervised algorithm that uses the filter to detect audio events. We measured the performance of the algorithm using a set of six thoroughly validated audio recordings and obtained a sensitivity of 94% and a positive predictive value of 89%. These sensitivity and positive predictive values are very high, given that the validated recordings are diverse and obtained from field conditions. The algorithm was then used to extract audio events in three datasets. Features of these audio events were plotted and showed the unique aspects of the three acoustic communities.

          Most cited references24

          • Record: found
          • Abstract: not found
          • Article: not found

          Survey over image thresholding techniques and quantitative performance evaluation

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Soundscape Ecology: The Science of Sound in the Landscape

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Acoustic monitoring in terrestrial environments using microphone arrays: applications, technological considerations and prospectus

                Bookmark

                Author and article information

                Contributors
                Journal
                peerj-cs
                peerj-cs
                PeerJ Comput. Sci.
                PeerJ Computer Science
                PeerJ Comput. Sci.
                PeerJ Inc. (San Francisco, USA )
                2376-5992
                27 June 2016
                : 2
                : e70
                Affiliations
                [1 ]Department of Mathematics, University of Puerto Rico-Rio Piedras , San Juan, Puerto Rico, United States
                [2 ]Department of Computer Science, University of Puerto Rico-Rio Piedras , San Juan, Puerto Rico, United States
                [3 ]Department of Biology, University of Puerto Rico-Rio Piedras , San Juan, Puerto Rico, United States
                Article
                cs-70
                10.7717/peerj-cs.70
                88ae849f-8a9f-457f-92eb-ea6760fc7620
                ©2016 Vega et al.

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.

                History
                : 28 January 2016
                : 21 May 2016
                Funding
                The authors received no funding for this work.
                Categories
                Bioinformatics
                Computational Biology

                Computer science
                Audio event detection,Flattened Local Trimmed Range,Bioacoustic
                Computer science
                Audio event detection, Flattened Local Trimmed Range, Bioacoustic

                Comments

                Comment on this article