TY - JOUR
T1 - Analyzing Kinship in Severe Acute Respiratory Syndrome Coronavirus 2 DNA Sequences Based on Hierarchical and K-Means Clustering Methods Using Multiple Encoding Vector
AU - Banjarnahor, Evander
AU - Bustamam, Alhadi
AU - Siswantining, Titin
AU - Tampubolon, Patuan
N1 - Funding Information:
ACKNOWLEDGMENT This study was funded by PUTI KI2Q2 2020 grants NKB-778/UN2.RST/HKP.05.00/2020. We would also like to thank BRIN/DIKTI for their significant thoughts and experience, which helped to strengthen our research in numerous ways.
Funding Information:
This study was funded by PUTI KI2Q2 2020 grants NKB-778/UN2.RST/HKP.05.00/2020. We would also like to thank BRIN/DIKTI for their significant thoughts and experience, which helped to strengthen our research in numerous ways.
Publisher Copyright:
© IJASEIT is licensed under a Creative Commons Attribution-Share Alike 4.0 International License.
PY - 2022
Y1 - 2022
N2 - Based on the World Health Organization data obtained in mid-April 2021, Coronavirus or Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has already infected more than 134.9 million people worldwide. The virus attacks human breathing, which can cause lung infections and even death. More than 2.9 million people worldwide have died due to coronavirus infection. Meanwhile, more than 1.5 million people in Indonesia have been infected, and 42.5 thousand died because of this coronavirus. Based on this data, carrying out a kinship analysis of the coronavirus is important to reduce its spread. Identifying the kinship of the COVID-19 virus and its spread can be done by forming a phylogenetic tree and clustering. This study uses the Multiple Encoding Vector method in analyzing the sequences and Euclidean distance to determine the distance matrix. This research will then use the Hierarchical clustering method to determine the number of initial centroids, which will be used later by the K-Means clustering method kinship in the SARS-CoV-2 DNA sequence. This study took samples of DNA sequences of SARS-CoV-2 from several infected countries. From the simulation results, the ancestors of SARS-CoV-2 came from China. The analysis results also show that the closest ancestors of COVID-19 to Indonesia came from India. The SARS-CoV-2 DNA sequence also consisted of nine clusters, and the sixth cluster has the greatest number of members.
AB - Based on the World Health Organization data obtained in mid-April 2021, Coronavirus or Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has already infected more than 134.9 million people worldwide. The virus attacks human breathing, which can cause lung infections and even death. More than 2.9 million people worldwide have died due to coronavirus infection. Meanwhile, more than 1.5 million people in Indonesia have been infected, and 42.5 thousand died because of this coronavirus. Based on this data, carrying out a kinship analysis of the coronavirus is important to reduce its spread. Identifying the kinship of the COVID-19 virus and its spread can be done by forming a phylogenetic tree and clustering. This study uses the Multiple Encoding Vector method in analyzing the sequences and Euclidean distance to determine the distance matrix. This research will then use the Hierarchical clustering method to determine the number of initial centroids, which will be used later by the K-Means clustering method kinship in the SARS-CoV-2 DNA sequence. This study took samples of DNA sequences of SARS-CoV-2 from several infected countries. From the simulation results, the ancestors of SARS-CoV-2 came from China. The analysis results also show that the closest ancestors of COVID-19 to Indonesia came from India. The SARS-CoV-2 DNA sequence also consisted of nine clusters, and the sixth cluster has the greatest number of members.
KW - Bioinformatics
KW - Clustering
KW - Dna kinship
KW - Phylogenetic analysis
KW - Sequence alignment
UR - http://www.scopus.com/inward/record.url?scp=85144676917&partnerID=8YFLogxK
U2 - 10.18517/ijaseit.12.6.15582
DO - 10.18517/ijaseit.12.6.15582
M3 - Article
AN - SCOPUS:85144676917
SN - 2088-5334
VL - 12
SP - 2237
EP - 2247
JO - International Journal on Advanced Science, Engineering and Information Technology
JF - International Journal on Advanced Science, Engineering and Information Technology
IS - 6
ER -