Triple-helix potential of the mouse genome

Proc Natl Acad Sci U S A. 2022 May 10;119(19):e2203967119. doi: 10.1073/pnas.2203967119. Epub 2022 May 3.

Abstract

Certain DNA sequences, including mirror-symmetric polypyrimidine•polypurine runs, are capable of folding into a triple-helix–containing non–B-form DNA structure called H-DNA. Such H-DNA–forming sequences occur frequently in many eukaryotic genomes, including in mammals, and multiple lines of evidence indicate that these motifs are mutagenic and can impinge on DNA replication, transcription, and other aspects of genome function. In this study, we show that the triplex-forming potential of H-DNA motifs in the mouse genome can be evaluated using S1-sequencing (S1-seq), which uses the single-stranded DNA (ssDNA)–specific nuclease S1 to generate deep-sequencing libraries that report on the position of ssDNA throughout the genome. When S1-seq was applied to genomic DNA isolated from mouse testis cells and splenic B cells, we observed prominent clusters of S1-seq reads that appeared to be independent of endogenous double-strand breaks, that coincided with H-DNA motifs, and that correlated strongly with the triplex-forming potential of the motifs. Fine-scale patterns of S1-seq reads, including a pronounced strand asymmetry in favor of centrally positioned reads on the pyrimidine-containing strand, suggested that this S1-seq signal is specific for one of the four possible isomers of H-DNA (H-y5). By leveraging the abundance and complexity of naturally occurring H-DNA motifs across the mouse genome, we further defined how polypyrimidine repeat length and the presence of repeat-interrupting substitutions modify the structure of H-DNA. This study provides an approach for studying DNA secondary structure genome-wide at high spatial resolution.

Keywords: DNA topology; H-DNA; genomics; non-B DNA; triplex DNA.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Base Sequence
  • Genome* / genetics
  • Mice
  • Nucleic Acid Conformation
  • Nucleotide Motifs*