Reverse engineering and analysis of genome-wide gene regulatory networks from gene expression profiles using high-performance computing

IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):668-78. doi: 10.1109/TCBB.2011.60.

Abstract

Regulation of gene expression is a carefully regulated phenomenon in the cell. “Reverse-engineering” algorithms try to reconstruct the regulatory interactions among genes from genome-scale measurements of gene expression profiles (microarrays). Mammalian cells express tens of thousands of genes; hence, hundreds of gene expression profiles are necessary in order to have acceptable statistical evidence of interactions between genes. As the number of profiles to be analyzed increases, so do computational costs and memory requirements. In this work, we designed and developed a parallel computing algorithm to reverse-engineer genome-scale gene regulatory networks from thousands of gene expression profiles. The algorithm is based on computing pairwise Mutual Information between each gene-pair. We successfully tested it to reverse engineer the Mus Musculus (mouse) gene regulatory network in liver from gene expression profiles collected from a public repository. A parallel hierarchical clustering algorithm was implemented to discover “communities” within the gene network. Network communities are enriched for genes involved in the same biological functions. The inferred network was used to identify two mitochondrial proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computing Methodologies*
  • Gene Expression Profiling / methods*
  • Gene Regulatory Networks*
  • Genome*
  • Mice
  • Oligonucleotide Array Sequence Analysis
  • RNA, Messenger / metabolism

Substances

  • RNA, Messenger