Dimensional feature weighting utilizing multiple kernel learning for single-channel talker location discrimination using the acoustic transfer function

J Acoust Soc Am. 2013 Feb;133(2):891-901. doi: 10.1121/1.4773255.

Abstract

This paper presents a method for discriminating the location of the sound source (talker) using only a single microphone. In a previous work, the single-channel approach for discriminating the location of the sound source was discussed, where the acoustic transfer function from a user's position is estimated by using a hidden Markov model of clean speech in the cepstral domain. In this paper, each cepstral dimension of the acoustic transfer function is newly weighted, in order to obtain the cepstral dimensions having information that is useful for classifying the user's position. Then, this paper proposes a feature-weighting method for the cepstral parameter using multiple kernel learning, defining the base kernels for each cepstral dimension of the acoustic transfer function. The user's position is trained and classified by support vector machine. The effectiveness of this method has been confirmed by sound source (talker) localization experiments performed in different room environments.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustics* / instrumentation
  • Algorithms
  • Artificial Intelligence*
  • Computer Simulation
  • Discriminant Analysis*
  • Facility Design and Construction
  • Female
  • Humans
  • Likelihood Functions
  • Male
  • Motion
  • Noise / adverse effects
  • Pattern Recognition, Automated
  • Signal Processing, Computer-Assisted*
  • Sound*
  • Speech Acoustics*
  • Time Factors
  • Transducers
  • Vibration