Neural network model of the genetic code is strongly correlated to the GES scale of amino acid transfer free energies

N Tolstrup; J Toftgård; J Engelbrecht; S Brunak

doi:10.1006/jmbi.1994.1683

Neural network model of the genetic code is strongly correlated to the GES scale of amino acid transfer free energies

J Mol Biol. 1994 Nov 11;243(5):816-20. doi: 10.1006/jmbi.1994.1683.

Authors

N Tolstrup¹, J Toftgård, J Engelbrecht, S Brunak

Affiliation

¹ Department of Physical Chemistry, Technical University of Denmark, Lyngby.

PMID: 7966302
DOI: 10.1006/jmbi.1994.1683

Abstract

A neural network trained to classify the 61 nucleotide triplets of the genetic code into 20 amino acid categories develops in its internal representation a pattern matching the relative cost of transferring amino acids with satisfied backbone hydrogen bonds from water to an environment of dielectric constant of roughly 2.0. Such environments are typically found in lipid membranes or in the interior of proteins. In learning the mapping between the codons and the categories, the network groups the amino acids according to the scale of transfer free energies developed by Engelman, Goldman and Steitz. Several other scales based on internal preference statistics also agree reasonably well with the network grouping. The network is able to relate the structure of the genetic code to quantifications of amino acid hydrophobicity-hydrophilicity more systematically than the numerous attempts made earlier. Due to its inherent non-linearity, the code is also shown to impose decisive constraints on algorithmic analysis of the protein coding potential of DNA.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Sequence
Amino Acids / chemistry*
Base Sequence
Energy Transfer / genetics*
Genetic Code
Models, Genetic
Molecular Sequence Data
Neural Networks, Computer*

Substances

Amino Acids