Utility and distribution of conserved noncoding sequences in the grasses

Proc Natl Acad Sci U S A. 2002 Apr 30;99(9):6147-51. doi: 10.1073/pnas.052139599. Epub 2002 Apr 23.

Abstract

Control of gene expression requires cis-acting regulatory DNA sequences. Historically, these sequences have been difficult to identify. Conserved noncoding sequences (CNSs) have recently been identified in mammalian genes through cross-species genomic DNA comparisons, and some have been shown to be regulatory sequences. Using sequence alignment algorithms, we compared genomic noncoding DNA sequences of the liguleless1 (lg1) genes in two grasses, maize and rice, and found several CNSs in lg1. These CNSs are present in multiple grass species that represent phylogenetically disparate lineages. Six other maize/rice genes were compared, and five contained CNSs. Nucleotide substitution rates indicate that these CNSs exist because they have biological functions. Our analysis suggests that grass CNSs are smaller and far less frequent than those identified in mammalian genes and that mammalian gene regulation may be more complex than that of grasses. CNSs make excellent pan-grass PCR-based genetic mapping tools. They should also be useful as characters in phylogenetic studies and as monitors of gene regulatory complexity.
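The abstract does not state which alignment algorithms or conservation thresholds were used. As a rough illustration only of how conserved stretches can be flagged in a pairwise comparison, the sketch below slides a fixed window along two already-aligned noncoding sequences and merges windows whose identity meets a cutoff. The function name `conserved_windows` and the defaults `window=15` and `min_identity=0.7` are hypothetical placeholders, not the criteria of the study.

```python
def conserved_windows(seq_a, seq_b, window=15, min_identity=0.7):
    """Return merged half-open (start, end) alignment intervals in which
    windows of length `window` reach at least `min_identity` identity.

    Assumes seq_a and seq_b are rows of one pairwise alignment (equal
    length, '-' for gaps). The defaults are illustrative placeholders.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")

    # A column counts as a match only if both bases agree and neither is a gap.
    matches = [
        x == y and x != "-"
        for x, y in zip(seq_a.upper(), seq_b.upper())
    ]

    hits = []
    count = sum(matches[:window])  # matches in the first window
    for start in range(len(matches) - window + 1):
        if start > 0:
            # Slide by one column: add the entering column, drop the leaving one.
            count += matches[start + window - 1] - matches[start - 1]
        if count / window >= min_identity:
            if hits and start <= hits[-1][1]:
                # Overlaps or touches the previous hit: extend it.
                hits[-1] = (hits[-1][0], start + window)
            else:
                hits.append((start, start + window))
    return hits


if __name__ == "__main__":
    # Toy sequences, not real maize or rice data.
    a = "A" * 10 + "C" * 10
    b = "A" * 10 + "G" * 10
    print(conserved_windows(a, b, window=5, min_identity=1.0))
```

In practice the input rows would come from an alignment tool, and coding regions would be masked beforehand so that only noncoding conservation is reported.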

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Blotting, Southern
  • Chromosome Mapping
  • Conserved Sequence
  • Exons
  • Models, Genetic
  • Molecular Sequence Data
  • Oryza / genetics*
  • Phylogeny
  • Polymerase Chain Reaction
  • RNA, Untranslated
  • Sequence Homology, Nucleic Acid
  • Species Specificity
  • Zea mays / genetics*

Substances

  • RNA, Untranslated

Associated data

  • GENBANK/AF451892
  • GENBANK/AF451893
  • GENBANK/AF451894
  • GENBANK/AF451895
  • GENBANK/AF479591
  • GENBANK/AF479592
  • GENBANK/AF480431
  • GENBANK/AF488772