TY - GEN
T1 - Audio Feature Extraction on SIBI Dataset for Speech Recognition
AU - Shoalihin, Ruhush
AU - Rakun, Erdefi
N1 - Publisher Copyright: © 2020 IEEE. Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2020/11/19
Y1 - 2020/11/19
N2 - Mel Frequency Cepstral Coefficients (MFCC) have been regarded as the standard method of feature extraction for Automatic Speech Recognition (ASR) systems for the last few years. Their performance may be affected by multiple variables, such as the number of features, the audio channels, the filter width, and the type of filter bank used. In this paper, several comparisons were made to find the combination of variables that gives the best results on the SIBI (Indonesian Sign Language) dataset, which consists of sentences uttered by both Deaf and Hard of Hearing (DHH) and non-DHH people. Based on this experiment, although ASR performance on the DHH dataset is generally lower than on the non-DHH dataset, the results are still relatively good: around 4.71% WER and 10.30% SER, compared with 0.15% WER and 0.40% SER on the non-DHH dataset.
AB - Mel Frequency Cepstral Coefficients (MFCC) have been regarded as the standard method of feature extraction for Automatic Speech Recognition (ASR) systems for the last few years. Their performance may be affected by multiple variables, such as the number of features, the audio channels, the filter width, and the type of filter bank used. In this paper, several comparisons were made to find the combination of variables that gives the best results on the SIBI (Indonesian Sign Language) dataset, which consists of sentences uttered by both Deaf and Hard of Hearing (DHH) and non-DHH people. Based on this experiment, although ASR performance on the DHH dataset is generally lower than on the non-DHH dataset, the results are still relatively good: around 4.71% WER and 10.30% SER, compared with 0.15% WER and 0.40% SER on the non-DHH dataset.
KW - ASR
KW - Automatic Speech Recognition
KW - DHH
KW - Mel Frequency Cepstral Coefficients
KW - MFCC
KW - SIBI
UR - http://www.scopus.com/inward/record.url?scp=85102197532&partnerID=8YFLogxK
U2 - 10.1109/ICIMCIS51567.2020.9354290
DO - 10.1109/ICIMCIS51567.2020.9354290
M3 - Conference contribution
AN - SCOPUS:85102197532
T3 - Proceedings - 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020
SP - 70
EP - 74
BT - Proceedings - 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020
Y2 - 19 November 2020 through 20 November 2020
ER -