TY - GEN
T1 - Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network
AU - Putro, Benyamin Kusumo
AU - Triyanto, A.
AU - Fanany, Mohamad Ivan
AU - Jatmiko, Wisnu
N1 - Publisher Copyright:
© 2001 IEEE.
PY - 2001
Y1 - 2001
N2 - The paper describes the application of a neural processing for extracting bispectrum feature of speech data, and the use of probabilistic neural network as a classifier in an automatic speech recognition system. The usually used feature extraction paradigm in the early development of the speech recognition system is power spectrum analysis, however, the recognition rate of this system is not high enough, especially when a Gaussian noise is added to the utterance speech data. In this paper, we developed a speaker identification system using bispectrum feature analysis. To analyse the distribution of the bispectrum data along its two dimensional representation, we developed an adaptive feature extraction mechanism of the bispectrum speech data based on cascade neural network. A cascade configuration of SOFM (Self-Organizing Feature Map) and LVQ (Learning Vector Quantization) is used as an adaptive codebook generation algorithm for determining the feature distribution of the bispectrum speech data. The K-L transformation (K-LT) technique is then used as a preprocessing element before the neural classifier is utilized. This K-LT has shown as an effective procedure for orthogonalization and dimensionality reduction of the codebook vectors generated from bispectrum data. Experimental results show that our system could perform with high recognition rate on the undirected utterance speech, especially when a higher number of codebook vectors are utilized. It is also shown that the use of PNN could increase the recognition rate significantly, even using speech data with additional Gaussian noise.
AB - The paper describes the application of a neural processing for extracting bispectrum feature of speech data, and the use of probabilistic neural network as a classifier in an automatic speech recognition system. The usually used feature extraction paradigm in the early development of the speech recognition system is power spectrum analysis, however, the recognition rate of this system is not high enough, especially when a Gaussian noise is added to the utterance speech data. In this paper, we developed a speaker identification system using bispectrum feature analysis. To analyse the distribution of the bispectrum data along its two dimensional representation, we developed an adaptive feature extraction mechanism of the bispectrum speech data based on cascade neural network. A cascade configuration of SOFM (Self-Organizing Feature Map) and LVQ (Learning Vector Quantization) is used as an adaptive codebook generation algorithm for determining the feature distribution of the bispectrum speech data. The K-L transformation (K-LT) technique is then used as a preprocessing element before the neural classifier is utilized. This K-LT has shown as an effective procedure for orthogonalization and dimensionality reduction of the codebook vectors generated from bispectrum data. Experimental results show that our system could perform with high recognition rate on the undirected utterance speech, especially when a higher number of codebook vectors are utilized. It is also shown that the use of PNN could increase the recognition rate significantly, even using speech data with additional Gaussian noise.
UR - http://www.scopus.com/inward/record.url?scp=84861655255&partnerID=8YFLogxK
U2 - 10.1109/ICCIMA.2001.970480
DO - 10.1109/ICCIMA.2001.970480
M3 - Conference contribution
AN - SCOPUS:84861655255
T3 - Proceedings - 4th International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2001
SP - 282
EP - 287
BT - Proceedings - 4th International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2001
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2001
Y2 - 30 October 2001 through 1 November 2001
ER -