TY - JOUR
T1 - IMPLEMENTATION of MACHINE LEARNING for HUMAN ASPECT in INFORMATION SECURITY AWARENESS
AU - Saridewi, Valentina Siwi
AU - Sari, Riri Fitri
N1 - Funding Information:
We thank the Ministry of Education and Culture of Republic of Indonesia for financial support for this research under the PTUPT Research Grant number NKB-356/ UN2.RST/HKP.05.00/2020.
Funding Information:
We thank the Ministry of Education and Culture of Re - public of Indonesia for financial support for this research under the PTUPT Research Grant number NKB-356/ UN2.RST/HK .05.00/2020.
Publisher Copyright:
© 2021 Institut za Istrazivanja. All rights reserved.
PY - 2021
Y1 - 2021
N2 - This research discussed our experience in implementing machine learning algorithms on the human aspect of information security awareness. The implementation of the classification and clustering approach have been conducted by creating a questionnaire, creating dataset, importing data, handling incompleted and imbalanced data, compiling datasets, feature scaling, building models, and subsequently evaluating machine learning models. Datasets are generated based on the collection of questionnaire result of the distributed questionnaire related to the Human Aspects of Information Security Questionnaire (HAIS-Q) to the stakeholder of an Indonesian institution. Models as results of algorithms implementation through the classification approach has been evaluated by several methods, such as: k-fold Cross Validation analysis, Confusion Matrix, Receiver Operating Characteristics, and score calculation for each model. A model of the Support Vector implementation in the classification has an accuracy of 99.7% and an error rate of 0.3%. Models of clustering implementation are used to determine the number of clusters that can optimally divide the dataset. The model of the DBSCAN algorithm on the clustering approach has an adjusted rand index value of always close to 0.
AB - This research discussed our experience in implementing machine learning algorithms on the human aspect of information security awareness. The implementation of the classification and clustering approach have been conducted by creating a questionnaire, creating dataset, importing data, handling incompleted and imbalanced data, compiling datasets, feature scaling, building models, and subsequently evaluating machine learning models. Datasets are generated based on the collection of questionnaire result of the distributed questionnaire related to the Human Aspects of Information Security Questionnaire (HAIS-Q) to the stakeholder of an Indonesian institution. Models as results of algorithms implementation through the classification approach has been evaluated by several methods, such as: k-fold Cross Validation analysis, Confusion Matrix, Receiver Operating Characteristics, and score calculation for each model. A model of the Support Vector implementation in the classification has an accuracy of 99.7% and an error rate of 0.3%. Models of clustering implementation are used to determine the number of clusters that can optimally divide the dataset. The model of the DBSCAN algorithm on the clustering approach has an adjusted rand index value of always close to 0.
KW - Classification
KW - Clustering
KW - Information security awareness
KW - Python
UR - http://www.scopus.com/inward/record.url?scp=85121059424&partnerID=8YFLogxK
U2 - 10.5937/jaes0-28530
DO - 10.5937/jaes0-28530
M3 - Article
AN - SCOPUS:85121059424
SN - 1451-4117
VL - 19
SP - 1126
EP - 1142
JO - Journal of Applied Engineering Science
JF - Journal of Applied Engineering Science
IS - 4
ER -