TY - JOUR
T1 - Bispectrum analysis for speaker identification in noisy environment with Karhunen-Loeve transformation technique
AU - Putro, Benyamin Kusumo
AU - Fanany, Mohamad Ivan
AU - Indrawati, Dian
PY - 2000
Y1 - 2000
N2 - The work described in this paper addresses the problem for extracting bispectrum feature of speech data. Very often the bispectrum feature extraction and data reduction are complicated due to some limiting constraints, i.e., no prior knowledge of feature's distribution and higher dimensionality of bispectrum data. In this article we developed an adaptive feature extraction mechanism based on cascade neural network in conjunction with feature's dimensionality reduction based on Karhunen-Loeve transformation technique. An adaptive codebook generation algorithm which is a cascade configuration of SOFM (Self Organizing Feature Map) and LVQ (Learning Vector Quantization) was used before the K-L transformation. The transformation was experimentally shown as an effective procedure for orthogonalization and dimensionality reduction of bispectrum feature. Performance of our speaker identification system was perceived to be significantly increased eventhough using limited number of channels in noisy environment. We also tried to improve the capability of adaptive codebook generation algorithm by applying simplified differential competitive learning (SDCL) network.
AB - The work described in this paper addresses the problem for extracting bispectrum feature of speech data. Very often the bispectrum feature extraction and data reduction are complicated due to some limiting constraints, i.e., no prior knowledge of feature's distribution and higher dimensionality of bispectrum data. In this article we developed an adaptive feature extraction mechanism based on cascade neural network in conjunction with feature's dimensionality reduction based on Karhunen-Loeve transformation technique. An adaptive codebook generation algorithm which is a cascade configuration of SOFM (Self Organizing Feature Map) and LVQ (Learning Vector Quantization) was used before the K-L transformation. The transformation was experimentally shown as an effective procedure for orthogonalization and dimensionality reduction of bispectrum feature. Performance of our speaker identification system was perceived to be significantly increased eventhough using limited number of channels in noisy environment. We also tried to improve the capability of adaptive codebook generation algorithm by applying simplified differential competitive learning (SDCL) network.
UR - http://www.scopus.com/inward/record.url?scp=0033680830&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:0033680830
SN - 0277-786X
VL - 4044
SP - 143
EP - 149
JO - Proceedings of SPIE - The International Society for Optical Engineering
JF - Proceedings of SPIE - The International Society for Optical Engineering
T2 - Hybrid Image and Signal Processing VII
Y2 - 25 April 2000 through 25 April 2000
ER -