A large-scale survey of genetic copy number variations among Han Chinese residing in Taiwan

BMC Genet. 2008 Dec 24:9:92. doi: 10.1186/1471-2156-9-92.

Abstract

Background: Copy number variations (CNVs) have recently been recognized as important structural variations in the human genome. CNVs can affect gene expression and thus may contribute to phenotypic differences. The copy number inferring tool (CNIT) is an effective hidden Markov model-based algorithm for estimating allele-specific copy number and predicting chromosomal alterations from single nucleotide polymorphism microarrays. The CNIT algorithm, which was constructed using data from 270 HapMap multi-ethnic individuals, was applied to identify CNVs from 300 unrelated Han Chinese individuals in Taiwan.

Results: Using stringent selection criteria, 230 regions with variable copy numbers were identified in the Han Chinese population; 133 (57.83%) had been reported previously, 64 displayed greater than 1% CNV allele frequency. The average size of the CNV regions was 322 kb (ranging from 1.48 kb to 5.68 Mb) and covered a total of 2.47% of the human genome. A total of 196 of the CNV regions were simple deletions and 27 were simple amplifications. There were 449 genes and 5 microRNAs within these CNV regions; some of these genes are known to be associated with diseases.

Conclusion: The identified CNVs are characteristic of the Han Chinese population and should be considered when genetic studies are conducted. The CNV distribution in the human genome is still poorly characterized, and there is much diversity among different ethnic populations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alleles
  • Asian People / genetics*
  • Gene Dosage*
  • Gene Frequency
  • Genetics, Population
  • Genome, Human*
  • Humans
  • Markov Chains
  • MicroRNAs / genetics
  • Models, Genetic
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide
  • Software
  • Taiwan

Substances

  • MicroRNAs