Network neighborhood analysis with the multi-node topological overlap measure

Bioinformatics. 2007 Jan 15;23(2):222-31. doi: 10.1093/bioinformatics/btl581. Epub 2006 Nov 16.

Abstract

Motivation: The goal of neighborhood analysis is to find a set of genes (the neighborhood) that is similar to an initial 'seed' set of genes. Neighborhood analysis methods for network data are important in systems biology. If individual network connections are susceptible to noise, it can be advantageous to define neighborhoods on the basis of a robust interconnectedness measure, e.g. the topological overlap measure. Since the use of multiple nodes in the seed set may lead to more informative neighborhoods, it can be advantageous to define multi-node similarity measures.

Results: The pairwise topological overlap measure is generalized to multiple network nodes and subsequently used in a recursive neighborhood construction method. A local permutation scheme is used to determine the neighborhood size. Using four network applications and a simulated example, we provide empirical evidence that the resulting neighborhoods are biologically meaningful, e.g. we use neighborhood analysis to identify brain cancer related genes.

Availability: An executable Windows program and tutorial for multi-node topological overlap measure (MTOM) based analysis can be downloaded from the webpage (http://www.genetics.ucla.edu/labs/horvath/MTOM/).

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Biomarkers, Tumor / analysis*
  • Brain Neoplasms / diagnosis*
  • Brain Neoplasms / metabolism*
  • Computer Simulation
  • Diagnosis, Computer-Assisted / methods*
  • Gene Expression Profiling / methods
  • Humans
  • Models, Biological
  • Neoplasm Proteins / analysis*
  • Oligonucleotide Array Sequence Analysis / methods*
  • Signal Transduction

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins