Transcription factor IID in the Archaea: sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

Proc Natl Acad Sci U S A. 1994 May 10;91(10):4180-4. doi: 10.1073/pnas.91.10.4180.

Abstract

The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer), it appears that major features of the eukaryotic transcription apparatus were well established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
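
The rate argument in the abstract rests on comparing how far the two halves of the repeat have diverged within each protein. If the duplication predates the split between the archaeal and eukaryotic lineages, the halves in both lineages have had the same time to diverge, so a smaller divergence between halves implies a lower average rate of change. The following is a minimal sketch of that comparison using a simple proportion-of-differences measure. The function name and the short sequences are hypothetical placeholders, not the measure or the data used in the paper.

```python
def p_distance(a: str, b: str) -> float:
    """Fraction of aligned, ungapped positions at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Skip any column in which either sequence has a gap character.
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped positions to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


# Made-up aligned repeat halves for illustration only (NOT real TBP sequences).
archaeal_n_half = "MKIVNVVAST"
archaeal_c_half = "MKIENIVAST"
eukaryotic_n_half = "MKIVNVVAST"
eukaryotic_c_half = "LRIQNMVGSC"

archaeal_divergence = p_distance(archaeal_n_half, archaeal_c_half)
eukaryotic_divergence = p_distance(eukaryotic_n_half, eukaryotic_c_half)

# Same elapsed time since the duplication, so the lineage whose halves
# differ less has changed at the lower average rate.
slower_lineage = (
    "archaeal" if archaeal_divergence < eukaryotic_divergence else "eukaryotic"
)
print(archaeal_divergence, eukaryotic_divergence, slower_lineage)
```

With real data, the inputs would be the aligned N-terminal and C-terminal repeat halves of each protein, and a distance corrected for multiple substitutions would likely be preferable to the raw proportion used here.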

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Archaea / genetics*
  • Archaea / metabolism
  • Biological Evolution*
  • Cloning, Molecular
  • Conserved Sequence*
  • DNA, Bacterial / genetics
  • DNA, Bacterial / metabolism
  • Genome, Bacterial*
  • Humans
  • Molecular Sequence Data
  • Phylogeny
  • Polymerase Chain Reaction
  • Probability
  • Restriction Mapping
  • Sequence Homology, Amino Acid
  • Transcription Factor TFIID
  • Transcription Factors / biosynthesis
  • Transcription Factors / chemistry
  • Transcription Factors / genetics*

Substances

  • DNA, Bacterial
  • Transcription Factor TFIID
  • Transcription Factors

Associated data

  • GENBANK/L16957
  • GENBANK/M64861
  • GENBANK/U04932