Analysis of Diabetes Mellitus Gene Expression Data using Two-Phase Biclustering Method

Rahmat Al Kafi, Alhadi B., Wibowo Mangunwardoyo

Research output: Contribution to journalArticlepeer-review

Abstract

The purpose of this research is to find bicluster from Type 2 Diabetes Mellitus genes expression data which samples are obese and lean people using two-phase biclustering. The first step is to use Singular Value Decomposition to decompose matrix gene expression data into gene and condition based matrices. The second step is to use K-means to cluster gene and condition based matrices, forming several clusters from each matrix. Furthermore, the silhouette method is applied to determine the number of optimum clusters and measure the accuracy of grouping results. Based on the experimental results, Type 2 Diabetes Mellitus dataset with 668 selected genes produced optimal biclusters, with six biclusters. The obtained biclusters consist of 2 clusters on the gene-based matrix and 3 clusters on the sample-based matrix with silhouette values, respectively, are 0.7361615 and 0.7050163.
Original languageEnglish
JournalJurnal Ilmiah Matematika
Volume8
Issue number2
DOIs
Publication statusPublished - 2021

Fingerprint

Dive into the research topics of 'Analysis of Diabetes Mellitus Gene Expression Data using Two-Phase Biclustering Method'. Together they form a unique fingerprint.

Cite this