Construction of a trio-based structural variation panel utilizing activated T lymphocytes and long-read sequencing technology

Commun Biol. 2022 Sep 20;5(1):991. doi: 10.1038/s42003-022-03953-1.

Abstract

Long-read sequencing technology enable better characterization of structural variants (SVs). To adapt the technology to population-scale analyses, one critical issue is to obtain sufficient amount of high-molecular-weight genomic DNA. Here, we propose utilizing activated T lymphocytes, which can be established efficiently in a biobank to stably supply high-grade genomic DNA sufficiently. We conducted nanopore sequencing of 333 individuals constituting 111 trios with high-coverage long-read sequencing data (depth 22.2x, N50 of 25.8 kb) and identified 74,201 SVs. Our trio-based analysis revealed that more than 95% of the SVs were concordant with Mendelian inheritance. We also identified SVs associated with clinical phenotypes, all of which appear to be stably transmitted from parents to offspring. Our data provide a catalog of SVs in the general Japanese population, and the applied approach using the activated T-lymphocyte resource will contribute to biobank-based human genetic studies focusing on SVs at the population scale.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Sequence Analysis, DNA
  • T-Lymphocytes
  • Technology

Substances

  • DNA