Automated speech analysis applied to laryngeal disease categorization

Comput Methods Programs Biomed. 2008 Jul;91(1):36-47. doi: 10.1016/j.cmpb.2008.01.008. Epub 2008 Mar 17.

Abstract

The long-term goal of the work is a decision support system for diagnostics of laryngeal diseases. Colour images of vocal folds, a voice signal, and questionnaire data are the information sources to be used in the analysis. This paper is concerned with automated analysis of a voice signal applied to screening of laryngeal diseases. The effectiveness of 11 different feature sets in classification of voice recordings of the sustained phonation of the vowel sound /a/ into a healthy and two pathological classes, diffuse and nodular, is investigated. A k-NN classifier, SVM, and a committee build using various aggregation options are used for the classification. The study was made using the mixed gender database containing 312 voice recordings. The correct classification rate of 84.6% was achieved when using an SVM committee consisting of four members. The pitch and amplitude perturbation measures, cepstral energy features, autocorrelation features as well as linear prediction cosine transform coefficients were amongst the feature sets providing the best performance. In the case of two class classification, using recordings from 79 subjects representing the pathological and 69 the healthy class, the correct classification rate of 95.5% was obtained from a five member committee. Again the pitch and amplitude perturbation measures provided the best performance.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Diagnosis, Computer-Assisted / methods*
  • Humans
  • Laryngeal Diseases / complications
  • Laryngeal Diseases / diagnosis*
  • Laryngeal Diseases / physiopathology
  • Pattern Recognition, Automated / methods*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Speech Disorders / diagnosis*
  • Speech Disorders / etiology
  • Speech Disorders / physiopathology
  • Speech Production Measurement / methods*