Resolving genomic disorder-associated breakpoints within segmental DNA duplications using massively parallel sequencing

Nat Protoc. 2014;9(6):1496-513. doi: 10.1038/nprot.2014.096. Epub 2014 May 29.

Abstract

The most common recurrent copy-number variants associated with autism, developmental delay and epilepsy are flanked by segmental duplications. Complete genetic characterization of these events is challenging because their breakpoints often occur within high-identity, copy-number polymorphic paralogous sequences that cannot be specifically assayed using hybridization-based methods. Here we provide a protocol for breakpoint resolution with sequence-level precision. Massively parallel sequencing is performed on libraries generated from haplotype-resolved chromosomes, genomic DNA or molecular inversion probe (MIP)-captured breakpoint-informative regions harboring paralog-distinguishing variants. Quantification of sequencing depth over informative sites enables breakpoint localization, typically within several kilobases to tens of kilobases. Depending on the approach used, the sequencing platform, and the accuracy and completeness of the reference genome sequence, this protocol takes from a few days to several months to complete. Once established for a specific genomic disorder, it is possible to process thousands of DNA samples within as little as 3-4 weeks.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Chromosome Breakpoints*
  • Computational Biology / methods
  • Genome / genetics*
  • Genomic Library
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Segmental Duplications, Genomic / genetics*