Performance of multivariate mutual information and autocorrelation encoding methods for the prediction of protein-protein interactions

Alhadi Bustamam, Mohamad Irlin Sunggawa, Titin Siswantining

Research output: Contribution to journal › Article › peer-review

Abstract

Protein interactions play an essential role in the study of how an organism becomes infected with a disease and of the disease's effects. One of the challenges in the computational prediction of protein-protein interactions is how to represent a sequence of amino acids as a vector, so that machine learning can build a model that predicts whether or not the two proteins in a pair interact. This paper examines a qualitative feature encoding method for amino acid sequences, namely multivariate mutual information (MMI), and a quantitative feature encoding method, namely autocorrelation. We develop new designs for the MMI and autocorrelation feature encoding methods, which give better results than previous research. We tested four ways of building the MMI method and six ways of building the autocorrelation method. We also built four types of MMI-autocorrelation (mixed) method and looked for the best form of each of the MMI, autocorrelation, and mixed methods. We combine these feature encoding methods with a support vector machine (SVM) as the machine learning method. We also test the proposed encoding methods with several other machine learning classifiers, such as random forest (RF), k-nearest neighbor (KNN), and gradient boosting.
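
To make the idea of a quantitative encoding concrete, the following is a minimal sketch of one common autocorrelation-style descriptor (auto covariance over a single physicochemical scale). It is an illustration only, not the paper's design: the abstract does not state which autocorrelation variants, lags, or amino acid properties were used. The Kyte-Doolittle hydropathy scale, the maximum lag of 3, and the function names `auto_covariance` and `encode_pair` are all assumptions made for this example.

```python
# Kyte-Doolittle hydropathy index, used here only as an example scale.
# Assumption: the abstract does not say which properties the authors used.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def auto_covariance(sequence, max_lag=3, scale=HYDROPATHY):
    """Return one auto covariance value per lag in 1..max_lag.

    Hypothetical helper: maps each residue to a number, centers the
    values on the sequence mean, and averages the products of values
    that lie `lag` positions apart.
    """
    values = [scale[residue] for residue in sequence]
    n = len(values)
    if n <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    mean = sum(values) / n
    centered = [v - mean for v in values]
    features = []
    for lag in range(1, max_lag + 1):
        total = sum(centered[i] * centered[i + lag] for i in range(n - lag))
        features.append(total / (n - lag))
    return features


def encode_pair(seq_a, seq_b, max_lag=3):
    """Concatenate both proteins' descriptors into one fixed-length vector."""
    return auto_covariance(seq_a, max_lag) + auto_covariance(seq_b, max_lag)


if __name__ == "__main__":
    # Two made-up sequences of different lengths give a vector of the same size.
    vector = encode_pair("MKTAYIAKQR", "GAVLIMFWPSTC")
    print(len(vector), vector)
```

Because the output length depends only on the number of lags and not on the sequence length, vectors like this can be passed to a classifier such as an SVM, which is the role the encoding step plays in the abstract.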

Original language: English
Pages (from-to): 773-786
Number of pages: 14
Journal: IAES International Journal of Artificial Intelligence
Volume: 11
Issue number: 2
Publication status: Published - Jun 2022

Keywords

  • Autocorrelation
  • Machine learning
  • Multivariate mutual information
  • Protein-protein interactions
  • Support vector machine
