TY - GEN
T1 - Mining Biological Information from 3D Medulloblastoma Cancerous Gene Expression Data Using TimesVector Triclustering Method
AU - Sari, Ika Marta
AU - Soemartojo, Saskya Mary
AU - Siswantining, Titin
AU - Sarwinda, Devvi
N1 - Publisher Copyright:
© 2020 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2020/11/10
Y1 - 2020/11/10
N2 - Triclustering analysis is the development of clustering analysis and biclustering analysis. The purpose of triclustering study is to group three-dimensional data simultaneously. The three-dimensional data can be in the form of observations, attributes, and context. One of the approaches used in tricluster analysis, namely an approach based on sample patterns, is the TimesVector method. The TimesVector method aims to group data matrices that show the same or different patterns in three-dimensional data. The TimesVector method has a work step that starts with reducing the three-dimensional data matrix to a two-dimensional data matrix to minimize complexity in the grouping. In this method, the Spherical K-means algorithm will be used in cluster it. The next step is to identify the pattern of the groups generated in the Spherical K-means. The pattern referred to consists of three types, namely DEP (Differentiated Patterns), ODEP (Differentiated Patterns), and SEP (Differentiated Patterns). The TimesVector method was applied on gene expression data, namely medulloblastoma cancerous data carried out in 6 scenarios. Each scenario uses the same many clusters but different threshold values. The six scenarios' results will be validated using the coverage value and the tricluster diffusion (TD) value. The application of the TimesVector method shows that using a threshold of 1.5 gives the most optimal results because it has a high coverage value and a low TD value. High-value coverage indicates the method's ability to extract data, and a low TD value suggests that the resulting tricluster has a large volume and high coherence. The best tricluster results can be used by medical experts to perform further actions on medulloblastoma cancerous patients.
AB - Triclustering analysis is the development of clustering analysis and biclustering analysis. The purpose of triclustering study is to group three-dimensional data simultaneously. The three-dimensional data can be in the form of observations, attributes, and context. One of the approaches used in tricluster analysis, namely an approach based on sample patterns, is the TimesVector method. The TimesVector method aims to group data matrices that show the same or different patterns in three-dimensional data. The TimesVector method has a work step that starts with reducing the three-dimensional data matrix to a two-dimensional data matrix to minimize complexity in the grouping. In this method, the Spherical K-means algorithm will be used in cluster it. The next step is to identify the pattern of the groups generated in the Spherical K-means. The pattern referred to consists of three types, namely DEP (Differentiated Patterns), ODEP (Differentiated Patterns), and SEP (Differentiated Patterns). The TimesVector method was applied on gene expression data, namely medulloblastoma cancerous data carried out in 6 scenarios. Each scenario uses the same many clusters but different threshold values. The six scenarios' results will be validated using the coverage value and the tricluster diffusion (TD) value. The application of the TimesVector method shows that using a threshold of 1.5 gives the most optimal results because it has a high coverage value and a low TD value. High-value coverage indicates the method's ability to extract data, and a low TD value suggests that the resulting tricluster has a large volume and high coherence. The best tricluster results can be used by medical experts to perform further actions on medulloblastoma cancerous patients.
KW - gene expression data
KW - pattern-based
KW - TimesVector
KW - triclustering
UR - http://www.scopus.com/inward/record.url?scp=85099443405&partnerID=8YFLogxK
U2 - 10.1109/ICICoS51170.2020.9299108
DO - 10.1109/ICICoS51170.2020.9299108
M3 - Conference contribution
AN - SCOPUS:85099443405
T3 - ICICoS 2020 - Proceeding: 4th International Conference on Informatics and Computational Sciences
BT - ICICoS 2020 - Proceeding
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Informatics and Computational Sciences, ICICoS 2020
Y2 - 10 November 2020 through 11 November 2020
ER -