Relationship between codon usage and sequence-dependent curvature of genomes

Microb Comp Genomics. 1998;3(4):243-53.

Abstract

Static DNA curvature distributions of full-sequenced genomes and large DNA contigs from different organisms were calculated. Very distinctive differences among histogram profiles coming from archaebacteria, eubacteria, and eukaryotes were observed. Eubacterial profiles were, on average, more curved than were archaeal and eukaryotic profiles. A comparative analysis between real and randomized DNA sequences revealed that eubacterial genomes presented, overall, higher curvature values than random sequences. An opposite portrait was exhibited by archaeal and eukaryotic genomes. They displayed a lower frequency of curved regions than their corresponding randomized sequences. The contributions of coding and intergenic regions to the curvature profile were also analyzed. Intergenic regions, on average, were found to be more curved than the overall genomic sequences, especially in prokaryotic organisms. Nevertheless, because of their small size with respect to coding regions, the contribution of intergenic sequences to the overall curvature profile tended to be minor. A clear relationship between codon usage and DNA curvature was demonstrated, and a proposal of the possible coevolution of both systems is discussed. Finally, we present a procedure to quantify the deviation of a curvature profile from randomness through a formal statistical analysis.

MeSH terms

  • Archaea / genetics
  • Codon / genetics*
  • Computational Biology
  • DNA / chemistry*
  • Genome*
  • Genome, Bacterial
  • Genome, Human
  • Humans
  • Nucleic Acid Conformation*
  • Sequence Analysis, DNA

Substances

  • Codon
  • DNA