Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs

Proc Natl Acad Sci U S A. 1997 May 27;94(11):5831-6. doi: 10.1073/pnas.94.11.5831.

Abstract

Positional cloning has already produced the sequences of more than 70 human genes associated with specific diseases. In addition to their medical importance, these genes are of interest as a set of human genes isolated solely on the basis of the phenotypic effect of the respective mutations. We analyzed the protein sequences encoded by the positionally cloned disease genes using an iterative strategy combining several sensitive computer methods. Comparisons to complete sequence databases and to separate databases of nematode, yeast, and bacterial proteins showed that for most of the disease gene products, statistically significant sequence similarities are detectable in each of the model organisms. Only the nematode genome encodes apparent orthologs with conserved domain architecture for the majority of the disease genes. In yeast and bacterial homologs, domain organization is typically not conserved, and sequence similarity is limited to individual domains. Generally, human genes complement mutations only in orthologous yeast genes. Most of the positionally cloned genes encode large proteins with several globular and nonglobular domains, the functions of some or all of which are not known. We detected conserved domains and motifs not described previously in a number of proteins encoded by disease genes and predicted functions for some of them. These predictions include an ATP-binding domain in the product of hereditary nonpolyposis colon cancer gene (a MutL homolog), which is conserved in the HS90 family of chaperone proteins, type II DNA topoisomerases, and histidine kinases, and a nuclease domain homologous to bacterial RNase D and the 3'-5' exonuclease domain of DNA polymerase I in the Werner syndrome gene product.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Bacteria / genetics
  • Biological Evolution*
  • Cloning, Molecular*
  • Colorectal Neoplasms, Hereditary Nonpolyposis / genetics
  • Conserved Sequence
  • DNA Polymerase I / chemistry
  • DNA Polymerase I / genetics
  • DNA Topoisomerases, Type II / chemistry
  • DNA Topoisomerases, Type II / genetics
  • Genetic Diseases, Inborn / genetics*
  • HSP90 Heat-Shock Proteins / chemistry
  • HSP90 Heat-Shock Proteins / genetics
  • Histidine Kinase
  • Humans
  • Information Systems
  • Molecular Sequence Data
  • Mutation
  • Nematoda / genetics
  • Protein Biosynthesis
  • Protein Kinases / chemistry
  • Protein Kinases / genetics
  • Proteins / chemistry
  • Proteins / genetics
  • Saccharomyces cerevisiae / genetics
  • Sequence Homology, Amino Acid

Substances

  • HSP90 Heat-Shock Proteins
  • Proteins
  • Protein Kinases
  • Histidine Kinase
  • DNA Polymerase I
  • DNA Topoisomerases, Type II