TY - GEN
T1 - Collaboration and implementation of self organizing maps (SOM) partitioning algorithm in HOPACH clustering method
AU - Siswantining, Titin
AU - Wulandari, Septian
AU - Bustamam, Alhadi
N1 - Publisher Copyright:
© 2018 Author(s).
PY - 2018/9/21
Y1 - 2018/9/21
N2 - Since the discovery of DNA structure in the form of the double helix, there is a development of the complex interaction required, DNA clustering into clusters which have the same features or functions. Two clustering methods can be combined by doing the partitioning and hierarchical stage alternately known as HOPACH clustering. The partitioning stage can be done by using SOM Algorithm, PAM, and K-Means. SOM algorithm because it uses unsupervised learning method and efficient to be used for extensive data. The partitioning process is continued by ordering process and then performed collapsing with an agglomerative process so that the clustering result obtained will be more accurate. The determination of the main cluster done by calculating the homogeneous value of the clustering result uses MSS (Mean Split Silhouette). The determination criteria of the main cluster are choosing the smallest MSS value. 136 sequences of DNA EVD (Ebola Virus Disease) are obtained from NCBI GenBank by extraction of DNA sequence, normalization, and then calculating the genetic distance with Euclidean Distance. The extraction of DNA sequence, normalization, and the implementation of SOM partitioning algorithm in HOPACH clustering method use open source program R. On the result of implementation SOM partitioning algorithm in HOPACH clustering method retrieved 9 clusters with MSS value of 0.50280. The cluster obtained can be identified according to species and the first year of becoming an epidemic.
AB - Since the discovery of DNA structure in the form of the double helix, there is a development of the complex interaction required, DNA clustering into clusters which have the same features or functions. Two clustering methods can be combined by doing the partitioning and hierarchical stage alternately known as HOPACH clustering. The partitioning stage can be done by using SOM Algorithm, PAM, and K-Means. SOM algorithm because it uses unsupervised learning method and efficient to be used for extensive data. The partitioning process is continued by ordering process and then performed collapsing with an agglomerative process so that the clustering result obtained will be more accurate. The determination of the main cluster done by calculating the homogeneous value of the clustering result uses MSS (Mean Split Silhouette). The determination criteria of the main cluster are choosing the smallest MSS value. 136 sequences of DNA EVD (Ebola Virus Disease) are obtained from NCBI GenBank by extraction of DNA sequence, normalization, and then calculating the genetic distance with Euclidean Distance. The extraction of DNA sequence, normalization, and the implementation of SOM partitioning algorithm in HOPACH clustering method use open source program R. On the result of implementation SOM partitioning algorithm in HOPACH clustering method retrieved 9 clusters with MSS value of 0.50280. The cluster obtained can be identified according to species and the first year of becoming an epidemic.
UR - http://www.scopus.com/inward/record.url?scp=85054169344&partnerID=8YFLogxK
U2 - 10.1063/1.5054538
DO - 10.1063/1.5054538
M3 - Conference contribution
AN - SCOPUS:85054169344
SN - 9780735417304
T3 - AIP Conference Proceedings
BT - International Conference on Science and Applied Science, ICSAS 2018
A2 - Suparmi, M.A.
A2 - Nugraha, Dewanta Arya
PB - American Institute of Physics Inc.
T2 - International Conference on Science and Applied Science, ICSAS 2018
Y2 - 12 May 2018
ER -