Systematic discovery of complex insertions and deletions in human cancers

Nat Med. 2016 Jan;22(1):97-104. doi: 10.1038/nm.4002. Epub 2015 Dec 14.

Abstract

Complex insertions and deletions (indels) are formed by simultaneously deleting and inserting DNA fragments of different sizes at a common genomic location. Here we present a systematic analysis of somatic complex indels in the coding sequences of samples from over 8,000 cancer cases using Pindel-C. We discovered 285 complex indels in cancer-associated genes (such as PIK3R1, TP53, ARID1A, GATA3 and KMT2D) in approximately 3.5% of cases analyzed; nearly all instances of complex indels were overlooked (81.1%) or misannotated (17.6%) in previous reports of 2,199 samples. In-frame complex indels are enriched in PIK3R1 and EGFR, whereas frameshifts are prevalent in VHL, GATA3, TP53, ARID1A, PTEN and ATRX. Furthermore, complex indels display strong tissue specificity (such as VHL in kidney cancer samples and GATA3 in breast cancer samples). Finally, structural analyses support findings of previously missed, but potentially druggable, mutations in the EGFR, MET and KIT oncogenes. This study indicates the critical importance of improving complex indel discovery and interpretation in medical research.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Cell Line, Tumor
  • Class Ia Phosphatidylinositol 3-Kinase
  • DNA Helicases / genetics
  • DNA-Binding Proteins / genetics
  • Data Mining / methods*
  • ErbB Receptors / genetics
  • GATA3 Transcription Factor / genetics
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • INDEL Mutation / genetics*
  • Neoplasm Proteins / genetics
  • Neoplasms / genetics*
  • Nuclear Proteins / genetics
  • PTEN Phosphohydrolase / genetics
  • Phosphatidylinositol 3-Kinases / genetics
  • Proto-Oncogene Proteins c-kit / genetics
  • Proto-Oncogene Proteins c-met / genetics
  • Transcription Factors / genetics
  • Tumor Suppressor Protein p53 / genetics
  • Von Hippel-Lindau Tumor Suppressor Protein / genetics
  • X-linked Nuclear Protein

Substances

  • ARID1A protein, human
  • DNA-Binding Proteins
  • GATA3 Transcription Factor
  • GATA3 protein, human
  • KMT2D protein, human
  • Neoplasm Proteins
  • Nuclear Proteins
  • TP53 protein, human
  • Transcription Factors
  • Tumor Suppressor Protein p53
  • Von Hippel-Lindau Tumor Suppressor Protein
  • PIK3R1 protein, human
  • Class Ia Phosphatidylinositol 3-Kinase
  • EGFR protein, human
  • ErbB Receptors
  • MET protein, human
  • Proto-Oncogene Proteins c-kit
  • Proto-Oncogene Proteins c-met
  • PTEN Phosphohydrolase
  • PTEN protein, human
  • DNA Helicases
  • ATRX protein, human
  • X-linked Nuclear Protein
  • VHL protein, human