Weighted mutual information analysis substantially improves domain-based functional network models

Bioinformatics. 2016 Sep 15;32(18):2824-30. doi: 10.1093/bioinformatics/btw320. Epub 2016 May 20.

Abstract

Motivation: Functional protein-protein interaction (PPI) networks elucidate molecular pathways underlying complex phenotypes, including those of human diseases. Extrapolation of domain-domain interactions (DDIs) from known PPIs is a major domain-based method for inferring functional PPI networks. However, because the protein domain is itself a functional unit of the protein, we should also be able to effectively infer functional interactions between proteins based on the co-occurrence of domains.

Results: Here, we present a method for inferring accurate functional PPIs based on the similarity of domain composition between proteins, measured by weighted mutual information (MI) that assigns different weights to the domains based on their genome-wide frequencies. Weighted MI outperforms other domain-based network inference methods and is highly predictive for pathways as well as phenotypes. A genome-scale human functional network determined by our method reveals numerous communities that are significantly associated with known pathways and diseases. Domain-based functional networks may, therefore, have potential applications in mapping domain-to-pathway or domain-to-phenotype associations.
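The abstract does not give the weighting formula, so the following is only a minimal sketch of how a frequency-weighted MI between two proteins' domain profiles could be computed. It assumes a binary domain profile matrix (rows are proteins, columns are domains), an inverse-frequency weight of log(N / n_d) for a domain found in n_d of N proteins, and a joint distribution built from weighted rather than raw counts. The function names `domain_weights` and `weighted_mi`, the weighting scheme, and the toy data are all illustrative assumptions, not the authors' implementation.

```python
import math


def domain_weights(profiles):
    """Assumed IDF-style weights: rarer domains get larger weights."""
    n_proteins = len(profiles)
    n_domains = len(profiles[0])
    weights = []
    for d in range(n_domains):
        count = sum(row[d] for row in profiles)
        # A domain seen in no protein carries no information here.
        weights.append(math.log(n_proteins / count) if count else 0.0)
    return weights


def weighted_mi(x, y, weights):
    """Mutual information (in bits) between two binary domain profiles,
    with each domain contributing its weight instead of a count of 1."""
    total = sum(weights)
    if total == 0:
        return 0.0
    # Weighted joint distribution over (presence in x, presence in y).
    joint = {(a, b): 0.0 for a in (0, 1) for b in (0, 1)}
    for xd, yd, w in zip(x, y, weights):
        joint[(xd, yd)] += w / total
    px = {a: joint[(a, 0)] + joint[(a, 1)] for a in (0, 1)}
    py = {b: joint[(0, b)] + joint[(1, b)] for b in (0, 1)}
    mi = 0.0
    for (a, b), p in joint.items():
        if p > 0:
            mi += p * math.log2(p / (px[a] * py[b]))
    return mi


# Toy domain profile matrix (hypothetical data): 4 proteins x 4 domains.
profiles = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [1, 0, 1, 0],
]
w = domain_weights(profiles)
mi_same = weighted_mi(profiles[0], profiles[1], w)     # identical composition
mi_partial = weighted_mi(profiles[0], profiles[3], w)  # partial overlap
print(mi_same, mi_partial)
```

In this toy example the two proteins with identical domain composition score higher than the pair sharing a single common domain. Note that plain MI also rewards strictly complementary profiles, so a practical pipeline would likely need additional handling for that case.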

Availability and implementation: Source code for calculating weighted mutual information based on the domain profile matrix is available from www.netbiolab.org/w/WMI

Contact: Insuklee@yonsei.ac.kr

Supplementary information: Supplementary data are available at Bioinformatics online.

MeSH terms

  • Algorithms
  • Computational Biology
  • Databases, Protein
  • Humans
  • Molecular Sequence Annotation
  • Protein Interaction Mapping*
  • Protein Interaction Maps*
  • Proteins

Substances

  • Proteins