Understanding human disease mutations through the use of interspecific genetic variation

Hum Mol Genet. 2001 Oct 1;10(21):2319-28. doi: 10.1093/hmg/10.21.2319.

Abstract

Data on replacement mutations in genes of disease patients exist in a variety of online resources. In addition, genome sequencing projects and individual gene sequencing efforts have led to the identification of disease gene homologs in diverse metazoan species. The availability of these two types of information provides unique opportunities to investigate factors that are important in the development of genetically based disease by contrasting long and short-term molecular evolutionary patterns. Therefore, we conducted an analysis of disease-associated human genetic variation for seven disease genes: the cystic fibrosis transmembrane conductance regulator, glucose-6-phosphate dehydrogenase, the neural cell adhesion molecule L1, phenylalanine hydroxylase, paired box 6, the X-linked retinoschisis gene and TSC2/tuberin. Our analyses indicate that disease mutations show definite patterns when examined from an evolutionary perspective. Human replacement mutations resulting in disease are overabundant at amino acid positions most conserved throughout the long-term history of metazoans. In contrast, human polymorphic replacement mutations and silent mutations are randomly distributed across sites with respect to the level of conservation of amino acid sites within genes. Furthermore, disease-causing amino acid changes are of types usually not observed among species. Using Grantham's chemical difference matrix, we find that amino acid changes observed in disease patients are far more radical than the variation found among species and in non-diseased humans. Overall, our results demonstrate the usefulness of evolutionary analyses for understanding patterns of human disease mutations and underscore the biomedical significance of sequence data currently being generated from various model organism genome sequencing projects.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acids / genetics
  • Animals
  • Cattle
  • Cricetinae
  • Cystic Fibrosis Transmembrane Conductance Regulator / genetics
  • Databases, Nucleic Acid
  • Evolution, Molecular*
  • Eye Proteins / genetics
  • Gene Frequency
  • Genetic Predisposition to Disease / genetics*
  • Genetic Variation
  • Glucosephosphate Dehydrogenase / genetics
  • Homeodomain Proteins / genetics
  • Humans
  • Leukocyte L1 Antigen Complex
  • Membrane Glycoproteins / genetics
  • Mice
  • Mutation
  • Neural Cell Adhesion Molecules / genetics
  • PAX6 Transcription Factor
  • Paired Box Transcription Factors
  • Phenylalanine Hydroxylase / genetics
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • Rats
  • Repressor Proteins / genetics
  • Species Specificity
  • Tuberous Sclerosis Complex 2 Protein
  • Tumor Suppressor Proteins

Substances

  • Amino Acids
  • CFTR protein, human
  • Eye Proteins
  • Homeodomain Proteins
  • Leukocyte L1 Antigen Complex
  • Membrane Glycoproteins
  • Neural Cell Adhesion Molecules
  • PAX6 Transcription Factor
  • PAX6 protein, human
  • Paired Box Transcription Factors
  • Pax6 protein, mouse
  • Pax6 protein, rat
  • RS1 protein, human
  • Repressor Proteins
  • TSC2 protein, human
  • Tsc2 protein, mouse
  • Tsc2 protein, rat
  • Tuberous Sclerosis Complex 2 Protein
  • Tumor Suppressor Proteins
  • Cystic Fibrosis Transmembrane Conductance Regulator
  • Glucosephosphate Dehydrogenase
  • Phenylalanine Hydroxylase