Application of hierarchical clustering ordered partitioning and collapsing hybrid in Ebola Virus phylogenetic analysis

Hengki Muradi, Alhadi B., Dian Lestari

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

17 Citations (Scopus)

Abstract

Gene clustering can be achieved through hierarchical or partitioning methods. The two approaches can be combined by alternating the partitioning and hierarchical phases. This combination is known as the hierarchical clustering ordered partitioning and collapsing hybrid (HOPACH) method. The partitioning phase can be carried out with PAM, SOM, or K-means. Each partitioning step is followed by an ordering step, and the result is then corrected by an agglomerative step to obtain more accurate clusters. The main clusters are determined using the median split silhouette (MSS): we select the clustering result that minimizes the MSS value. In this work, we cluster 136 Ebola virus DNA sequences from GenBank. The sequences are first globally aligned, and genetic distances are then calculated using the Jukes-Cantor correction. The clustering uses the HOPACH-PAM combination, implemented in the open-source R programming environment. The maximum genetic distance obtained is 0.6153407 and the minimum is 0. The genetic distance matrix then serves as the basis for sequence clustering and phylogenetic analysis. The HOPACH-PAM clustering yields 10 main clusters with an MSS value of 0.8873843. The Ebola virus clusters can be identified by species and by epidemic year.
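The distance step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' R implementation. It assumes the sequences are already globally aligned to equal length, and it skips any aligned position containing a gap or ambiguous base, which is one common convention and may differ from the paper's. The function names `p_distance`, `jukes_cantor`, and `distance_matrix` are hypothetical. The Jukes-Cantor correction used is d = -(3/4) ln(1 - (4/3) p), where p is the proportion of differing sites.

```python
import math

VALID_BASES = set("ACGT")


def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two aligned sequences.

    Assumption: positions where either sequence has a gap or an
    ambiguous base are skipped rather than counted as mismatches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in VALID_BASES or b not in VALID_BASES:
            continue
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return mismatches / compared


def jukes_cantor(p):
    """Jukes-Cantor corrected distance for an observed proportion p."""
    if p == 0:
        return 0.0
    if p >= 0.75:
        # The correction is undefined at or beyond saturation.
        return math.inf
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def distance_matrix(aligned_seqs):
    """Symmetric matrix of pairwise Jukes-Cantor distances."""
    n = len(aligned_seqs)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = jukes_cantor(p_distance(aligned_seqs[i], aligned_seqs[j]))
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix
```

A matrix built this way has zeros on the diagonal and for identical sequences, which is consistent with the minimum distance of 0 reported above. It could then be passed to any clustering routine that accepts a precomputed dissimilarity matrix.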

Original language: English
Title of host publication: ICACSIS 2015 - 2015 International Conference on Advanced Computer Science and Information Systems, Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 317-323
Number of pages: 7
ISBN (Electronic): 9781509003624
Publication status: Published - 19 Feb 2016
Event: International Conference on Advanced Computer Science and Information Systems, ICACSIS 2015 - Depok, Indonesia
Duration: 10 Oct 2015 - 11 Oct 2015

Publication series

Name: ICACSIS 2015 - 2015 International Conference on Advanced Computer Science and Information Systems, Proceedings

Conference

Conference: International Conference on Advanced Computer Science and Information Systems, ICACSIS 2015
Country/Territory: Indonesia
City: Depok
Period: 10/10/15 - 11/10/15

Keywords

  • Agglomerative R
  • DNA/Protein
  • Gene
  • Global alignment
  • HOPACH-PAM
  • Hierarchy
  • Jukes-Cantor
  • MSS
  • Partition
