Identification of noisy speech signals using bispectrum-based 2D-MFCC and its optimization through genetic algorithm as a feature extraction subsystem

Benyamin Kusumo Putro, Agus Buono, And Lina

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Power-spectrum-based Mel-Frequency Cepstrum Coefficients (MFCC) is usually used as a feature extractor in a speaker identification system. This one-dimensional feature extraction subsystem, however, shows low recognition rates for identifying utterance speech signals under harsh noise conditions. In this paper, we have developed a speaker identification system based on Bispectrum data that is more robust to the addition of Gaussian noise. As one-dimensional MFCC method could not be directly used to process the two-dimensional Bispectrum data, we proposed a two-dimensional MFCC method and its optimization using Genetic Algorithm (GA). Experiments using the two-dimensional MFCC method as the feature extractor and a Hidden Markov Model as the pattern classifier on utterance speeches contained with various levels of Gaussian noise are conducted. Results showed that the developed system performed higher recognition rates compare with that of 1D-MFCC method, especially when the 2D-MFCC with GA optimization method is utilized.

Original languageEnglish
Pages (from-to)241-251
Number of pages11
JournalWSEAS Transactions on Computers
Volume11
Issue number8
Publication statusPublished - 1 Aug 2012

Keywords

  • 2D Mel-Frequency Cepstrum Coefficients
  • Bispectrum
  • Genetics Algorithms
  • Hidden Markov Model
  • Speaker Identification System

Fingerprint Dive into the research topics of 'Identification of noisy speech signals using bispectrum-based 2D-MFCC and its optimization through genetic algorithm as a feature extraction subsystem'. Together they form a unique fingerprint.

Cite this