Mining Biological Information from 3D Medulloblastoma Cancerous Gene Expression Data Using TimesVector Triclustering Method

Ika Marta Sari, Saskya Mary Soemartojo, Titin Siswantining, Devvi Sarwinda

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Triclustering analysis is the development of clustering analysis and biclustering analysis. The purpose of triclustering study is to group three-dimensional data simultaneously. The three-dimensional data can be in the form of observations, attributes, and context. One of the approaches used in tricluster analysis, namely an approach based on sample patterns, is the TimesVector method. The TimesVector method aims to group data matrices that show the same or different patterns in three-dimensional data. The TimesVector method has a work step that starts with reducing the three-dimensional data matrix to a two-dimensional data matrix to minimize complexity in the grouping. In this method, the Spherical K-means algorithm will be used in cluster it. The next step is to identify the pattern of the groups generated in the Spherical K-means. The pattern referred to consists of three types, namely DEP (Differentiated Patterns), ODEP (Differentiated Patterns), and SEP (Differentiated Patterns). The TimesVector method was applied on gene expression data, namely medulloblastoma cancerous data carried out in 6 scenarios. Each scenario uses the same many clusters but different threshold values. The six scenarios' results will be validated using the coverage value and the tricluster diffusion (TD) value. The application of the TimesVector method shows that using a threshold of 1.5 gives the most optimal results because it has a high coverage value and a low TD value. High-value coverage indicates the method's ability to extract data, and a low TD value suggests that the resulting tricluster has a large volume and high coherence. The best tricluster results can be used by medical experts to perform further actions on medulloblastoma cancerous patients.

Original languageEnglish
Title of host publicationICICoS 2020 - Proceeding
Subtitle of host publication4th International Conference on Informatics and Computational Sciences
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728195261
DOIs
Publication statusPublished - 10 Nov 2020
Event4th International Conference on Informatics and Computational Sciences, ICICoS 2020 - Semarang, Indonesia
Duration: 10 Nov 202011 Nov 2020

Publication series

NameICICoS 2020 - Proceeding: 4th International Conference on Informatics and Computational Sciences

Conference

Conference4th International Conference on Informatics and Computational Sciences, ICICoS 2020
Country/TerritoryIndonesia
CitySemarang
Period10/11/2011/11/20

Keywords

  • gene expression data
  • pattern-based
  • TimesVector
  • triclustering

Fingerprint

Dive into the research topics of 'Mining Biological Information from 3D Medulloblastoma Cancerous Gene Expression Data Using TimesVector Triclustering Method'. Together they form a unique fingerprint.

Cite this