Analysis of the Escherichia coli genome VI: DNA sequence of the region from 92.8 through 100 minutes

Nucleic Acids Res. 1995 Jun 25;23(12):2105-19. doi: 10.1093/nar/23.12.2105.

Abstract

The 338.5 kb of the Escherichia coli genome described here together with previously described segments bring the total of contiguous finished sequence of this genome to > 1 Mb. Of 319 open reading frames (ORFs) found in this 338.5 kb segment, 147 (46%) are potential new genes. The positions of several genes which had been previously located here by mapping or partial sequencing have been confirmed. Several ORFs have functions suggested by similarities to other characterised genes but cannot be assigned with certainty. Fifteen of the ORFs of unknown function had been previously sequenced. Eight transfer RNAs are encoded in the region and there are two grey holes in which no features were found. The attachment site for phage P4 and three insertion sequences were located. The region was also analysed for chi sites, bend sites, REP elements and other repeats. A computer search identified potential promoters and tentative transcription units were assigned. The occurrence of the rare tetramer CTAG was analysed in 1.6 Mb of contiguous E.coli sequence. Hypotheses addressing the rarity and distribution of CTAG are discussed.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Base Sequence
  • Chromosome Mapping
  • DNA, Bacterial / chemistry*
  • Escherichia coli / genetics*
  • Genes, Bacterial*
  • Molecular Sequence Data
  • Oligonucleotides / chemistry
  • Open Reading Frames
  • Operon
  • Promoter Regions, Genetic
  • Protein Sorting Signals
  • Repetitive Sequences, Nucleic Acid
  • Restriction Mapping
  • Sequence Alignment
  • Sequence Analysis*

Substances

  • DNA, Bacterial
  • Oligonucleotides
  • Protein Sorting Signals

Associated data

  • GENBANK/U14003