Internet addiction and mental health prediction using ensemble learning based on web browsing history

Betty Purwandari, Wayan Surya Wibawa, Nilam Fitriah, Mellia Christia, Dini Rahma Bintari

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Citations (Scopus)

Abstract

The prevalence of Web browsing may lead to Internet Addiction Disorder (IAD), which negatively affects Web users' general health. Young people who are very active online are prone to IAD, which harms their academic performance and social lives. Earlier detection allows better treatment, so this pilot study aimed to predict IAD among young people to encourage early treatment. The sample included 30 undergraduate students at Universitas Indonesia (UI). Their Web browsing histories were recorded from their laptops for five weeks and analyzed using a support vector machine (SVM) with a radial basis function (RBF) kernel as the machine learning method for prediction. The results were subsequently compared with those of ensemble learning methods such as random forest (RF) and gradient boosting (GB). The predictions were then matched against respondents' answers to the Internet Addiction Test (IAT) questionnaire, which measures IAD levels. Respondents' general health data were collected with the 12-item General Health Questionnaire (GHQ-12). Features were extracted from the Web browsing histories to classify activities into five types: information retrieval (IR), instant messaging (IM), social networking services (SNS), leisure, and online shopping (OS). The extracted features served as input for classifying participants' IAD status, and the results were compared with their results from the IAT questionnaire. Machine learning was also employed to classify the input into respondents' general health (GH) status, which was matched with their responses to the GHQ-12 questionnaire. With SVM, the prediction accuracies were 66.67% for IAD status and 65.17% for GH status. With RF, the precisions for predicting IAD and GH were 63.33% and 44.33%. With GB, the accuracies were 63.33% and 67.17%. These results indicate that RF lowered prediction accuracy, whereas GB differed only slightly from SVM. For each classifier, IAD status was predicted more accurately than GH status. One way to improve the outcomes is to obtain data from the Internet firewall instead of the Web browsing history on users' laptops. Firewall data can provide richer and more realistic records of Web access, collected from any device connected to the university's computer networks. However, this requires consent from the participants and from the authority managing the infrastructure. If each class has a balanced number of examples, we plan to add more features and employ other types of ensemble learning for higher accuracy. Furthermore, multiclass prediction could distinguish specific IAD severity levels and classes of mental health status, i.e., anxiety and depression.
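The feature-extraction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how URLs were assigned to the five activity types, so the domain lookup table, the function names, and the choice of per-category visit proportions as features are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical domain-to-category lookup; the paper does not publish its mapping.
CATEGORY_BY_DOMAIN = {
    "scholar.google.com": "IR",
    "web.whatsapp.com": "IM",
    "facebook.com": "SNS",
    "youtube.com": "leisure",
    "tokopedia.com": "OS",
}
# The five activity types named in the abstract.
CATEGORIES = ("IR", "IM", "SNS", "leisure", "OS")


def categorize(url):
    """Return the activity type for a URL, or None if the domain is unknown."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return CATEGORY_BY_DOMAIN.get(host)


def activity_proportions(urls):
    """Share of categorized visits per activity type, in CATEGORIES order."""
    counts = Counter(c for c in map(categorize, urls) if c is not None)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(CATEGORIES)
    return [counts[c] / total for c in CATEGORIES]


# Made-up browsing history for one participant (illustrative only).
history = [
    "https://www.facebook.com/feed",
    "https://www.youtube.com/watch?v=abc",
    "https://scholar.google.com/scholar?q=svm",
    "https://facebook.com/groups",
    "https://example.org/unknown",
]
features = activity_proportions(history)
print(dict(zip(CATEGORIES, features)))
```

A vector like this, computed per participant, is one plausible form of input for the SVM, RF, and GB classifiers the abstract mentions. Visits to unmapped domains are simply ignored here, which is a design choice of the sketch rather than something the source specifies.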

Original language: English
Title of host publication: Proceedings of the 2020 3rd International Conference on Software Engineering and Information Management, ICSIM 2020 - Workshop 2020 the 3rd International Conference on Big Data and Smart Computing, ICBDSC 2020
Publisher: Association for Computing Machinery
Pages: 155-159
Number of pages: 5
ISBN (Electronic): 9781450376907
Publication status: Published - 12 Jan 2020
Event: 3rd International Conference on Software Engineering and Information Management, ICSIM 2020 - and its Workshop 2020 the 3rd International Conference on Big Data and Smart Computing, ICBDSC 2020 - Sydney, Australia
Duration: 12 Jan 2020 to 15 Jan 2020

Publication series

Name: ACM International Conference Proceeding Series

Conference

Conference: 3rd International Conference on Software Engineering and Information Management, ICSIM 2020 - and its Workshop 2020 the 3rd International Conference on Big Data and Smart Computing, ICBDSC 2020
Country/Territory: Australia
City: Sydney
Period: 12/01/20 to 15/01/20

Keywords

  • Data mining
  • Ensemble learning
  • Internet addiction
  • Mental health
  • Support Vector Machine
  • Web behavior
