sRNAfrag: a pipeline and suite of tools to analyze fragmentation in small RNA sequencing data

Brief Bioinform. 2023 Nov 22;25(1):bbad515. doi: 10.1093/bib/bbad515.

Abstract

Fragments derived from small RNAs such as small nucleolar RNAs are biologically relevant but remain poorly understood. To address this gap, we developed sRNAfrag, a modular and interoperable tool designed to standardize the quantification and analysis of small RNA fragmentation across various biotypes. The tool outputs a set of tables forming a relational database, allowing for an in-depth exploration of biologically complex events such as multi-mapping and RNA fragment stability across different cell types. In a benchmark test, sRNAfrag was able to identify established loci of mature microRNAs solely based on sequencing data. Furthermore, the 5' seed sequence could be rediscovered by utilizing a visualization approach primarily applied in multi-sequence-alignments. Utilizing the relational database outputs, we detected 1411 snoRNA fragment conservation events between two out of four eukaryotic species, providing an opportunity to explore motifs through evolutionary time and conserved fragmentation patterns. Additionally, the tool's interoperability with other bioinformatics tools like ViennaRNA amplifies its utility for customized analyses. We also introduce a novel loci-level variance-score which provides insights into the noise around peaks and demonstrates biological relevance by distinctly separating breast cancer and neuroblastoma cell lines after dimension reduction when applied to small nucleolar RNAs. Overall, sRNAfrag serves as a versatile foundation for advancing our understanding of small RNA fragments and offers a functional foundation to further small RNA research. Availability: https://github.com/kenminsoo/sRNAfrag.

Keywords: Fragments; conservation; peaks; sRNA; snoRNA.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology / methods
  • MicroRNAs* / genetics
  • RNA, Small Nucleolar / genetics
  • Sequence Alignment
  • Sequence Analysis, RNA / methods

Substances

  • MicroRNAs
  • RNA, Small Nucleolar