Multi-View Kernel Learning for Identification of Disease Genes
Article Type
Research Article
Publication Title
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Abstract
Gene expression data sets and protein-protein interaction (PPI) networks are two heterogeneous data sources that have been extensively studied, due to their ability to capture the coexpression patterns among genes and their topological connections. Although they depict different traits of the data, both of them tend to group co-functional genes together. This phenomenon agrees with the basic assumption of multi-view kernel learning, according to which different views of the data contain a similar inherent cluster structure. Based on this inference, a new multi-view kernel learning based disease gene identification algorithm, termed as DiGId, is put forward. A novel multi-view kernel learning approach is proposed that aims to learn a consensus kernel, which efficiently captures the heterogeneous information of individual views as well as depicts the underlying inherent cluster structure. Some low-rank constraints are imposed on the learned multi-view kernel, so that it can effectively be partitioned into k or fewer clusters. The learned joint cluster structure is used to curate a set of potential disease genes. Moreover, a novel approach is put forward to quantify the importance of each view. In order to demonstrate the effectiveness of the proposed approach in capturing the relevant information depicted by individual views, an extensive analysis is performed on four different cancer-related gene expression data sets and PPI network, considering different similarity measures.
First Page
2278
Last Page
2290
DOI
https://10.1109/TCBB.2023.3247033
Publication Date
5-1-2023
Recommended Citation
Shah, Ekta and Maji, Pradipta, "Multi-View Kernel Learning for Identification of Disease Genes" (2023). Journal Articles. 3751.
https://digitalcommons.isical.ac.in/journal-articles/3751