45
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper addresses the problem of binaural localization of a single speech source in noisy and reverberant environments. The binaural response corresponding to the direct-path propagation of a single source is a function of the source direction. In practice, this response is contaminated by noise and reverberations. The direct-path relative transfer function (DP-RTF) is defined as the ratio between the direct-path acoustic transfer function of the two channels. We propose a method to estimate the DP-RTF from the noisy and reverberant microphone signals in the short-time Fourier transform domain. First, the convolutive transfer function approximation is adopted to accurately represent the impulse response of the sensors in the STFT domain. Second, the DP-RTF is estimated by using the auto- and cross-power spectral densities at each frequency and over multiple frames. In the presence of stationary noise, an inter-frame spectral subtraction algorithm is proposed, which enables to achieve the estimation of noise-free auto- and cross-power spectral densities. Finally, the estimated DP-RTFs are concatenated across frequencies and used as a feature vector for the localization of speech. Experiments with both simulated and real data show that the proposed localization method performs well, even under severe adverse acoustic conditions, and outperforms state-of-the-art localization methods under most of the acoustic conditions.

          Related collections

          Author and article information

          Journal
          2015-09-10
          2015-12-30
          Article
          1509.03205
          06ae2f52-0793-4fb1-a218-53757ba6e434

          http://arxiv.org/licenses/nonexclusive-distrib/1.0/

          History
          Custom metadata
          12 pages, 7 figures, 5 tables
          cs.SD

          Graphics & Multimedia design
          Graphics & Multimedia design

          Comments

          Comment on this article