A predictive model for regulatory sequences directing liver-specific transcription

Genome Res. 2001 Sep;11(9):1559-66. doi: 10.1101/gr.180601.

Abstract

The identification and interpretation of the regulatory signals within the human genome remain among the greatest goals and most difficult challenges in genome analysis. The ability to predict the temporal and spatial control of transcription is likely to require a combination of methods to address the contribution of sequence-specific signals, protein-protein interactions and chromatin structure. We present here a new procedure to identify clusters of transcription factor binding sites characteristic of sequence modules experimentally verified to direct transcription selectively to liver cells. This algorithm is sufficiently specific to identify known regulatory sequences in genes selectively expressed in liver, promising acceleration of experimental promoter analysis. In combination with phylogenetic footprinting, this improvement in the specificity of predictions is sufficient to motivate a scan of the human genome. Potential regulatory modules were identified in orthologous human and rodent genomic sequences containing both known and uncharacterized genes.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Computational Biology / methods
  • DNA Footprinting / methods
  • Humans
  • Liver / metabolism*
  • Mice
  • Models, Genetic*
  • Organ Specificity / genetics
  • Phylogeny
  • Rats
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Transcription, Genetic / genetics*