Reconstructing viral quasispecies from NGS amplicon reads

In Silico Biol. 2011;11(5-6):237-49. doi: 10.3233/ISB-2012-0458.

Abstract

This paper addresses the problem of reconstructing viral quasispecies from next-generation sequencing reads obtained from amplicons (i.e., reads generated from predefined, overlapping amplified regions). We compare the parsimonious and likelihood models for this problem and propose several novel assembly algorithms. The proposed methods were validated on simulated error-free HCV amplicon reads and real HBV amplicon reads. The new algorithms outperform the method of Prosperi et al. Our experiments also show that, in most cases, viral quasispecies can be reconstructed more accurately from amplicon reads than from shotgun reads. All algorithms have been implemented and are available at https://bitbucket.org/nmancuso/bioa/.

MeSH terms

  • Algorithms*
  • Hepacivirus / genetics
  • Hepatitis B virus / genetics
  • High-Throughput Nucleotide Sequencing / methods*
  • Sequence Analysis, DNA
  • Software