Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes

Nucleic Acids Res. 1994 Jul 11;22(13):2576-86. doi: 10.1093/nar/22.13.2576.

Abstract

The DNA sequence of a 225.4 kilobase segment of the Escherichia coli K-12 genome is described here, from 76.0 to 81.5 minutes on the genetic map. This brings the total of contiguous sequence from the E.coli genome project to 725.1 kb (76.0 to 92.8 minutes). We found 191 putative coding genes (ORFs) of which 72 genes were previously known, and 110 of which remain unidentified despite literature and similarity searches. Seven new genes--arsE, arsF, arsG, treF, xylR, xylG, and xylH--were identified as well as the previously mapped pit and dctA genes. The arrangement of proposed genes relative to possible promoters and terminators suggests 90 potential transcription units. Other features include 19 REP elements, 95 computer-predicted bends, 50 Chi sites, and one grey hole. Thirty-one putative signal peptides were found, including those of thirteen known membrane or periplasmic proteins. One tRNA gene (proK) and two insertion sequences (IS5 and IS150) are located in this segment. The genes in this region are organized with equal numbers oriented with or against replication.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alcohol Dehydrogenase / genetics
  • Aldehyde Oxidoreductases / genetics
  • Chromosome Mapping*
  • Chromosomes, Bacterial
  • Codon
  • DNA, Bacterial / genetics*
  • Escherichia coli / genetics*
  • Gene Transfer Techniques
  • Genome, Bacterial*
  • Molecular Sequence Data
  • Open Reading Frames
  • Promoter Regions, Genetic
  • Protein Sorting Signals
  • Saccharomyces cerevisiae / enzymology
  • Saccharomyces cerevisiae / genetics
  • Terminator Regions, Genetic
  • Transcription, Genetic
  • Zymomonas / enzymology
  • Zymomonas / genetics

Substances

  • Codon
  • DNA, Bacterial
  • Protein Sorting Signals
  • Alcohol Dehydrogenase
  • Aldehyde Oxidoreductases
  • aldehyde dehydrogenase (NAD(P)+)

Associated data

  • GENBANK/U00039