Colored Network Motif Analysis by Dynamic Programming Approach: An Application in Host Pathogen Interaction Network

Article Type

Research Article

Publication Title

IEEE/ACM Transactions on Computational Biology and Bioinformatics

Abstract

Network motifs are subgraphs of a network which are found with significantly higher frequency than that expected in similar random networks. Motifs are small building blocks of a network and they have emerged as a way to uncover topological properties of complex networks. A special yet not much explored type of motif is the 'colored motif' where color (type) of each node, and hence the edges, in the motif is distinguishable from each other. A traditional motif is defined as a recurring structure in a network, whereas colored motif introduces detailed information about the color of the nodes. G-Trie is a data structure to efficiently store a given set of subgraphs by exploiting the topological overlaps within them. In this article we have implemented a modified g-Trie to store colored subgraphs and developed a method to discover colored motifs. Our method uses an approximate enumeration for counting the subgraphs to reduce the runtime. We have applied our method to find colored motifs of size three in a host pathogen protein-protein interaction network having two types of proteins namely HIV-1 and human proteins, and four types of edges. Here, we have discovered eight motifs, six of which contain both HIV-1 and human proteins, while the remaining two contain only human proteins.

First Page

550

Last Page

561

DOI

10.1109/TCBB.2019.2923173

Publication Date

3-1-2021

Comments

Open Access, Bronze

This document is currently not available here.

Share

COinS