A modular approach for integrative analysis of large-scale gene-expression and drug-response data

Nat Biotechnol. 2008 May;26(5):531-9. doi: 10.1038/nbt1397.

Abstract

High-throughput technologies are now used to generate more than one type of data from the same biological samples. To integrate such data properly, we propose co-modules, which describe coherent patterns across paired data sets, and develop several modular methods to identify them. We first test these methods on in silico data, demonstrating that the integrative scheme of our Ping-Pong Algorithm uncovers drug-gene associations more accurately when the data are noisy or complex. Second, we provide an extensive comparative study using the gene-expression and drug-response data from the NCI-60 cell lines. Using information from the DrugBank and Connectivity Map databases, we show that the Ping-Pong Algorithm predicts drug-gene associations significantly better than other methods. Co-modules provide insights into possible mechanisms of action for a wide range of drugs and suggest new targets for therapy.
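
The abstract does not spell out the steps of the Ping-Pong Algorithm, so the following is only a minimal sketch of one plausible reading of an alternating co-module search over two matrices that share the same cell lines. Everything in it is an assumption made for illustration: the function name ping_pong_sketch, the one-sided z-score thresholds, the convergence test, and the toy matrices are hypothetical and are not taken from the paper.

```python
from math import sqrt


def zscore(values):
    """Standardize a list of scores; return zeros if there is no variation."""
    n = len(values)
    mean = sum(values) / n
    std = sqrt(sum((v - mean) ** 2 for v in values) / n)
    if std == 0.0:
        return [0.0] * n
    return [(v - mean) / std for v in values]


def threshold(values, t):
    """Keep z-scores above t, set the rest to zero (assumed one-sided rule)."""
    return [z if z > t else 0.0 for z in zscore(values)]


def project_rows(matrix, col_weights):
    """Score each row as the weighted sum of its columns."""
    return [sum(x * w for x, w in zip(row, col_weights)) for row in matrix]


def project_cols(matrix, row_weights):
    """Score each column as the weighted sum of its rows."""
    return [
        sum(matrix[i][j] * row_weights[i] for i in range(len(matrix)))
        for j in range(len(matrix[0]))
    ]


def support(weights):
    """Indices that survived thresholding."""
    return {i for i, w in enumerate(weights) if w != 0.0}


def ping_pong_sketch(expr, resp, seed_genes,
                     t_gene=1.0, t_cell=0.5, t_drug=1.0, max_iter=50):
    """Alternate between expr (genes x cell lines) and resp (drugs x cell lines).

    Hypothetical illustration only, not the authors' implementation.
    """
    genes = [1.0 if i in seed_genes else 0.0 for i in range(len(expr))]
    drugs = [0.0] * len(resp)
    cells = []
    for _ in range(max_iter):
        # genes -> cell lines (expression), then cell lines -> drugs (response)
        cells = threshold(project_cols(expr, genes), t_cell)
        new_drugs = threshold(project_rows(resp, cells), t_drug)
        # drugs -> cell lines (response), then cell lines -> genes (expression)
        cells = threshold(project_cols(resp, new_drugs), t_cell)
        new_genes = threshold(project_rows(expr, cells), t_gene)
        converged = (support(new_genes) == support(genes)
                     and support(new_drugs) == support(drugs))
        genes, drugs = new_genes, new_drugs
        if converged:
            break
    return support(genes), support(cells), support(drugs)


# Toy data (made up): genes 0-1 and drugs 0-1 are both active in cell lines 0-2.
expr = [[1.0 if g < 2 and c < 3 else 0.0 for c in range(6)] for g in range(6)]
resp = [[1.0 if d < 2 and c < 3 else 0.0 for c in range(6)] for d in range(5)]

genes, cells, drugs = ping_pong_sketch(expr, resp, seed_genes={0})
print(genes, cells, drugs)
```

In this sketch the shared cell-line dimension is what links the two data sets: scores computed from one matrix are passed through the cell lines to the other matrix and back, and the loop stops once the selected genes and drugs no longer change. The threshold values are arbitrary choices for the toy data and would need tuning on real measurements.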

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Biological Assay / methods*
  • Computer Simulation
  • Drug Evaluation, Preclinical / methods*
  • Gene Expression Profiling / methods*
  • Models, Biological*
  • Pharmaceutical Preparations / administration & dosage*
  • Signal Transduction / drug effects*
  • Systems Integration

Substances

  • Pharmaceutical Preparations