The genetic algorithm applied to haplotype data at the LDL receptor locus

Comput Methods Programs Biomed. 2000 Jan;61(1):1-9. doi: 10.1016/s0169-2607(99)00025-5.

Abstract

Conventional statistical methods based upon single restriction fragment length polymorphisms often prove inadequate in studies of genetic variation. Cladistic analysis has been suggested as an alternative, but requires basic assumptions that usually cannot be met. We wanted to test whether it could be a workable approach to apply the genetic algorithm, an artificial intelligence method, to haplotype data. The genetic algorithm creates in-computer artificial 'individuals', all having 'genes' coding for solutions to a problem. The individuals are allowed to compete and 'mate', individuals with genes coding for better solutions mating more often. Genes coding for good solutions survive through generations of the genetic algorithm. At the end of the run, the best solutions can be extracted. We applied the genetic algorithm to data consisting of cholesterol values and haplotypes made up of seven restriction sites at the LDL receptor locus. The persons included were 114 FH (familial hypercholesterolemia) patients and 61 normals. The genetic algorithm found the restriction sites 1 (Sph1 in intron 6), 2 (StuI in exon 8), and 7 (ApaLI site in the 3' flanking region) were associated with high cholesterol levels. As a validity check we used runs of the genetic algorithm applied to 'artificial patients', i.e. artificially generated haplotypes linked to artificially generated cholesterol values. This demonstrated the genetic algorithm consistently found the appropriate haplotype. We conclude that the genetic algorithm may be a useful tool for studying genetic variation.

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Cholesterol / blood
  • Evaluation Studies as Topic
  • Female
  • Genetic Variation
  • Haplotypes*
  • Humans
  • Hyperlipoproteinemia Type II / blood
  • Hyperlipoproteinemia Type II / genetics
  • Male
  • Models, Genetic*
  • Polymorphism, Restriction Fragment Length
  • Receptors, LDL / genetics*

Substances

  • Receptors, LDL
  • Cholesterol