An integrated analysis tool for analyzing hybridization intensities and genotypes using new-generation population-optimized human arrays

BMC Genomics. 2016 Mar 31:17:266. doi: 10.1186/s12864-016-2478-8.

Abstract

Background: Affymetrix Axiom single nucleotide polymorphism (SNP) arrays provide a cost-effective, high-density, and high-throughput genotyping solution for population-optimized analyses. However, no public software is available for the integrated genomic analysis of hybridization intensities and genotypes for this new-generation population-optimized genotyping platform.

Results: A set of statistical methods was developed for an integrated analysis of allele frequency (AF), allelic imbalance (AI), loss of heterozygosity (LOH), long contiguous stretch of homozygosity (LCSH), and copy number variation or alteration (CNV/CNA) on the basis of SNP probe hybridization intensities and genotypes. This study analyzed 3,236 samples that were genotyped using different SNP platforms. The proposed AF adjustment method considerably increased the accuracy of AF estimation. The proposed quick circular binary segmentation algorithm for segmenting copy number reduced the computation time of the original segmentation method by 30-67 %. The proposed CNV/CNA detection, which integrates AI and LOH/LCSH detection, had a promising true positive rate and well-controlled false positive rate in simulation studies. Moreover, our real-time quantitative polymerase chain reaction experiments successfully validated the CNVs/CNAs that were identified in the Axiom data analyses using the proposed methods; some of the validated CNVs/CNAs were not detected in the Affymetrix Array 6.0 data analysis using the Affymetrix Genotyping Console. All the analysis functions are packaged into the ALICE (AF/LOH/LCSH/AI/CNV/CNA Enterprise) software.

Conclusions: ALICE and the used genomic reference databases, which can be downloaded from http://hcyang.stat.sinica.edu.tw/software/ALICE.html , are useful resources for analyzing genomic data from the Axiom and other SNP arrays.

Keywords: AF/LOH/LCSH/AI/CNV/CNA Enterprise (ALICE); Allele frequency (AF); Allelic imbalance (AI); Circular binary segmentation (CBS); Copy number variation or alteration (CNV/CNA); Fluorescence intensity; Long contiguous stretch of homozygosity (LCSH); Loss of heterozygosity (LOH); Microarray; Single-nucleotide polymorphism (SNP).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Allelic Imbalance
  • DNA Copy Number Variations
  • Gene Frequency
  • Genetics, Population / methods*
  • Genotype*
  • Homozygote
  • Humans
  • Hybridization, Genetic*
  • Loss of Heterozygosity
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods*
  • Polymorphism, Single Nucleotide
  • Software*