Identifying small interfering RNA loci from high-throughput sequencing data

Bioinformatics. 2012 Feb 15;28(4):457-63. doi: 10.1093/bioinformatics/btr687. Epub 2011 Dec 9.

Abstract

Motivation: Small interfering RNAs (siRNAs) are produced from much longer sequences of double-stranded RNA precursors through cleavage by Dicer or a Dicer-like protein. These small RNAs play a key role in genetic and epigenetic regulation; however, a full understanding of the mechanisms by which they operate depends on the characterization of the precursors from which they are derived.

Results: High-throughput sequencing of small RNA populations allows the locations of the double-stranded RNA precursors to be inferred. We have developed methods to analyse small RNA sequencing data from multiple biological sources, taking into account replicate information, to identify robust sets of siRNA precursors. Our methods show good performance on both a set of small RNA sequencing data in Arabidopsis thaliana and simulated datasets.

Availability: Our methods are available as the Bioconductor (www.bioconductor.org) package segmentSeq (version 1.5.6 and above).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Computer Simulation
  • Genome, Plant
  • High-Throughput Nucleotide Sequencing
  • RNA, Plant / genetics*
  • RNA, Small Interfering / genetics*
  • Sequence Analysis, RNA

Substances

  • RNA, Plant
  • RNA, Small Interfering