Transcript Profiling Using Long-Read Sequencing Technologies

Methods Mol Biol. 2018:1783:121-147. doi: 10.1007/978-1-4939-7834-2_6.

Abstract

RNA sequencing using next-generation sequencing (NGS, RNA-Seq) technologies is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise short-read RNA-Seq (dominated by Illumina) and long-read RNA-Seq technologies provided by Pacific Bioscience (PacBio) and Oxford Nanopore Technologies (ONT). Although short-read sequencing technologies are the most widely used, long-read technologies are increasingly becoming the standard approach for de novo transcriptome assembly and isoform expression quantification due to the complex nature of the transcriptome which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes. In this chapter, we describe experimental procedures for library preparation, sequencing, and associated data analysis approaches for PacBio and ONT with a major focus on full length cDNA synthesis, de novo transcriptome assembly, and isoform quantification.

Keywords: Long read; Nanopore; Next-generation sequencing; PacBio; RNA-Seq; Transcriptome.

MeSH terms

  • Alternative Splicing*
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Gene Library
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Protein Isoforms
  • Sequence Analysis, RNA / methods*
  • Transcriptome*

Substances

  • Protein Isoforms