TY - JOUR
T1 - Voice Biometrics for Indonesian Language Users using Algorithm of Deep Learning CNN Residual and Hybrid of DWT-MFCC Extraction Features
AU - Isyanto, Haris
AU - Arifin, Ajib Setyo
AU - Suryanegara, Muhammad
N1 - Funding Information:
ACKNOWLEDGMENT Haris Isyanto is in PhD program funded by Beasiswa Pendidikan Pascasarjana Dalam Negeri (BPPDN) Ministry of Education and Culture Republic of Indonesia. Dr. Muhammad Suryanegara is main supervisor, and Dr. Ajib Setyo Arifin is co-supervisor as well as the corresponding author. The voice data set is built by the support of Electrical Engineering - Faculty of Engineering, Universitas Muhammadiyah Jakarta. This publication is supported by Research Grant Universitas Indonesia.
Funding Information:
Haris Isyanto is in PhD program funded by Beasiswa Pendidikan Pascasarjana Dalam Negeri (BPPDN) Ministry of Education and Culture Republic of Indonesia. Dr. Muhammad Suryanegara is main supervisor, and Dr. Ajib Setyo Arifin is co-supervisor as well as the corresponding author. The voice data set is built by the support of Electrical Engineering - Faculty of Engineering, Universitas Muhammadiyah Jakarta. This publication is supported by Research Grant Universitas Indonesia.
Publisher Copyright:
© 2022. International Journal of Advanced Computer Science and Applications. All Rights Reserved.
PY - 2022
Y1 - 2022
N2 - This research develops a Voice Biometrics model for the Indonesian language users by using deep learning algorithm of CNN Residual and Hybrid of DWT-MFCC Feature Extraction. The voice dataset of Indonesian speakers were created with a duration of 5, 10, 15, 20, and 25 minutes. The testing phase of speaker recognition and speech recognition were carried out by comparing the model of CNN Residual with CNN Standard. In the phase of speaker recognition, CNN Residual model has obtained the best results with the highest precision percentage of 99.91% and the highest accuracy of 99.47% at 25 minutes voice samples, compared to the CNN Standard obtaining precision of 96.83% and accuracy of 99.00%. In the phase of speech recognition, CNN Residual model has reached the best performance at 100% accuracy during 20 trials, while CNN Standard only gave 95% accuracy. CNN Residual Model provides a better performance for its accuracy and precision, but it is slightly slower than the CNN Standard, with a time difference of 0.03 – 1.28 seconds.
AB - This research develops a Voice Biometrics model for the Indonesian language users by using deep learning algorithm of CNN Residual and Hybrid of DWT-MFCC Feature Extraction. The voice dataset of Indonesian speakers were created with a duration of 5, 10, 15, 20, and 25 minutes. The testing phase of speaker recognition and speech recognition were carried out by comparing the model of CNN Residual with CNN Standard. In the phase of speaker recognition, CNN Residual model has obtained the best results with the highest precision percentage of 99.91% and the highest accuracy of 99.47% at 25 minutes voice samples, compared to the CNN Standard obtaining precision of 96.83% and accuracy of 99.00%. In the phase of speech recognition, CNN Residual model has reached the best performance at 100% accuracy during 20 trials, while CNN Standard only gave 95% accuracy. CNN Residual Model provides a better performance for its accuracy and precision, but it is slightly slower than the CNN Standard, with a time difference of 0.03 – 1.28 seconds.
KW - Cnn
KW - Deep learning
KW - Dwt-mfcc
KW - Security
KW - Voice biometric
UR - http://www.scopus.com/inward/record.url?scp=85131400570&partnerID=8YFLogxK
U2 - 10.14569/IJACSA.2022.0130574
DO - 10.14569/IJACSA.2022.0130574
M3 - Article
AN - SCOPUS:85131400570
SN - 2158-107X
VL - 13
SP - 622
EP - 634
JO - International Journal of Advanced Computer Science and Applications
JF - International Journal of Advanced Computer Science and Applications
IS - 5
ER -