SGDI: system for genomic data integration

Pac Symp Biocomput. 2008:141-52. doi: 10.1142/9789812776136_0016.

Abstract

This paper describes a framework for collecting, annotating, and archiving high-throughput assays from multiple experiments conducted on one or more series of samples. Specific applications include support for large-scale surveys of related transcriptional profiling studies, for investigations of the genetics of gene expression and for joint analysis of copy number variation and mRNA abundance. Our approach consists of data capture and modeling processes rooted in R/Bioconductor, sample annotation and sequence constituent ontology management based in R, secure data archiving in PostgreSQL, and browser-based workspace creation and management rooted in Zope. This effort has generated a completely transparent, extensible, and customizable interface to large archives of high-throughput assays. Sources and prototype interfaces are accessible at www.sgdi.org/software.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Breast Neoplasms / genetics
  • Breast Neoplasms / pathology
  • Computational Biology
  • Database Management Systems*
  • Female
  • Gene Expression Profiling / statistics & numerical data
  • Genomics / statistics & numerical data*
  • Humans
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Software*
  • Systems Biology