Analysis of the Escherichia coli genome: DNA sequence of the region from 84.5 to 86.5 minutes

Science. 1992 Aug 7;257(5071):771-8. doi: 10.1126/science.1379743.

Abstract

The DNA sequence of 91.4 kilobases of the Escherichia coli K-12 genome, spanning the region between rrnC at 84.5 minutes and rrnA at 86.5 minutes on the genetic map (85 to 87 percent on the physical map), is described. Analysis of this sequence identified 82 potential coding regions (open reading frames) covering 84 percent of the sequenced interval. The arrangement of these open reading frames, together with the consensus promoter sequences and terminator-like sequences found by computer searches, made it possible to assign them to proposed transcriptional units. More than half the open reading frames correlated with known genes or functions suggested by similarity to other sequences. Those remaining encode still unidentified proteins. The sequenced region also contains several RNA genes and two types of repeated sequence elements were found. Intergenic regions include three "gray holes," 0.6 to 0.8 kilobases, with no recognizable functions.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Bacterial Proteins / genetics
  • Base Sequence
  • Codon
  • DNA, Bacterial / genetics*
  • Escherichia coli / genetics*
  • Genes, Bacterial*
  • Genome, Bacterial*
  • RNA, Bacterial / genetics
  • Restriction Mapping

Substances

  • Bacterial Proteins
  • Codon
  • DNA, Bacterial
  • RNA, Bacterial