Gene-based comparative analysis of tools for estimating copy number alterations using whole-exome sequencing data

Oncotarget. 2017 Apr 18;8(16):27277-27285. doi: 10.18632/oncotarget.15932.

Abstract

Accurate detection of copy number alterations (CNAs) using next-generation sequencing technology is essential for the development and application of more precise medical treatments for human cancer. Here, we evaluated seven CNA estimation tools (ExomeCNV, CoNIFER, VarScan2, CODEX, ngCGH, saasCNV, and falcon) using whole-exome sequencing data from 419 breast cancer tumor-normal sample pairs from The Cancer Genome Atlas. Estimations generated using each tool were converted into gene-based copy numbers; concordance for gains and losses and the sensitivity and specificity of each tool were compared to validated copy numbers from a single nucleotide polymorphism reference array. The concordance and sensitivity of the tumor-normal pair methods for estimating CNAs (saasCNV, ExomeCNV, and VarScan2) were better than those of the tumor batch methods (CoNIFER and CODEX). SaasCNV had the highest gain and loss concordances (65.0%), sensitivity (69.4%), and specificity (89.1%) for estimating copy number gains or losses. These findings indicate that improved CNA detection algorithms are needed to more accurately interpret whole-exome sequencing results in human cancer.

Keywords: CNA estimation; NGS; WES; cancer CNV; copy number.

MeSH terms

  • Algorithms
  • Computational Biology* / methods
  • DNA Copy Number Variations*
  • Databases, Nucleic Acid
  • Exome Sequencing
  • Genome-Wide Association Study* / methods
  • Humans
  • Neoplasms / genetics*
  • Sensitivity and Specificity
  • Sequence Analysis, DNA
  • Software