The use of edge-betweenness clustering to investigate biological function in protein interaction networks

BMC Bioinformatics. 2005 Mar 1:6:39. doi: 10.1186/1471-2105-6-39.

Abstract

Background: This paper describes an automated method for finding clusters of interconnected proteins in protein interaction networks and retrieving protein annotations associated with these clusters.

Results: Protein interaction graphs were separated into subgraphs of interconnected proteins, using the JUNG implementation of Girvan and Newman's Edge-Betweenness algorithm. Functions were sought for these subgraphs by detecting significant correlations with the distribution of Gene Ontology terms which had been used to annotate the proteins within each cluster. The method was implemented using freely available software (JUNG and the R statistical package). Protein clusters with significant correlations to functional annotations could be identified and included groups of proteins know to cooperate in cell metabolism. The method appears to be resilient against the presence of false positive interactions.

Conclusion: This method provides a useful tool for rapid screening of small to medium size protein interaction datasets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology / methods*
  • Databases, Protein
  • False Positive Reactions
  • Genome, Human
  • Genomics
  • Humans
  • Models, Biological
  • Programming Languages
  • Protein Interaction Mapping / methods*
  • Proteins / chemistry*
  • Proteomics
  • Software
  • Transcription, Genetic

Substances

  • Proteins