TY - GEN
T1 - Triclustering Algorithm for 3D Gene Expression Data Analysis using Order Preserving Triclustering (OPTricluster)
AU - Siska, Dea
AU - Sarwinda, Devvi
AU - Siswantining, Titin
AU - Soemartojo, Saskya Mary
N1 - Publisher Copyright:
© 2020 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2020/11/10
Y1 - 2020/11/10
N2 - Triclustering is the expansion of clustering and biclustering methods that works on three-dimensional (3D) data. This method is generally implemented in the analysis of 3D gene expression data to find gene expression profiles. This data consists of three dimensions: genes, experimental conditions, and time points. Triclustering can group these dimensions simultaneously and form a 3D cluster called a tricluster. Order Preserving Triclustering (OPTricluster) is a triclustering algorithm that uses a pattern-based approach and is used to analyze short time-series data (3-8 time points). The OPTricluster forms the tricluster by identifying genes with the same pattern of change in expression across time points under several experimental conditions. In contrast to most triclustering algorithms that only focus on similarities between experimental conditions, OPTricluster considers the similarities and differences between them. In this study, OPTricluster was implemented with several scenarios in gene expression data of yellow fever patients after vaccination. The lowest average Tricluster Diffusion (TD) score indicates the scenario with the best triclustering result. For this case, we found that the scenario with threshold of 1.6 is the scenario that produced triclusters with better quality (lowest average TD score) than the other scenarios. These triclusters represent gene expression profiles that show the biological relationship among those patients, including anomalies found in patients.
AB - Triclustering is the expansion of clustering and biclustering methods that works on three-dimensional (3D) data. This method is generally implemented in the analysis of 3D gene expression data to find gene expression profiles. This data consists of three dimensions: genes, experimental conditions, and time points. Triclustering can group these dimensions simultaneously and form a 3D cluster called a tricluster. Order Preserving Triclustering (OPTricluster) is a triclustering algorithm that uses a pattern-based approach and is used to analyze short time-series data (3-8 time points). The OPTricluster forms the tricluster by identifying genes with the same pattern of change in expression across time points under several experimental conditions. In contrast to most triclustering algorithms that only focus on similarities between experimental conditions, OPTricluster considers the similarities and differences between them. In this study, OPTricluster was implemented with several scenarios in gene expression data of yellow fever patients after vaccination. The lowest average Tricluster Diffusion (TD) score indicates the scenario with the best triclustering result. For this case, we found that the scenario with threshold of 1.6 is the scenario that produced triclusters with better quality (lowest average TD score) than the other scenarios. These triclusters represent gene expression profiles that show the biological relationship among those patients, including anomalies found in patients.
KW - 3D gene expression data
KW - order preserving
KW - short time series
KW - tricluster diffusion
KW - triclustering
UR - http://www.scopus.com/inward/record.url?scp=85099439100&partnerID=8YFLogxK
U2 - 10.1109/ICICoS51170.2020.9299101
DO - 10.1109/ICICoS51170.2020.9299101
M3 - Conference contribution
AN - SCOPUS:85099439100
T3 - ICICoS 2020 - Proceeding: 4th International Conference on Informatics and Computational Sciences
BT - ICICoS 2020 - Proceeding
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Informatics and Computational Sciences, ICICoS 2020
Y2 - 10 November 2020 through 11 November 2020
ER -