NanoMUD: Profiling of pseudouridine and N1-methylpseudouridine using Oxford Nanopore direct RNA sequencing

Yuxin Zhang; Huayuan Yan; Zhen Wei; Haifeng Hong; Daiyun Huang; Guopeng Liu; Qianshan Qin; Rong Rong; Peng Gao; Jia Meng; Bo Ying

doi:10.1016/j.ijbiomac.2024.132433

NanoMUD: Profiling of pseudouridine and N1-methylpseudouridine using Oxford Nanopore direct RNA sequencing

Int J Biol Macromol. 2024 May 15;270(Pt 2):132433. doi: 10.1016/j.ijbiomac.2024.132433. Online ahead of print.

Authors

Yuxin Zhang¹, Huayuan Yan², Zhen Wei¹, Haifeng Hong², Daiyun Huang³, Guopeng Liu², Qianshan Qin², Rong Rong⁴, Peng Gao⁵, Jia Meng⁶, Bo Ying⁷

Affiliations

¹ Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom.
² Suzhou Abogen Biosciences Co., Ltd., Suzhou 215123, China.
³ Wisdom Lake Academy of Pharmacy, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China.
⁴ Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China.
⁵ Suzhou Abogen Biosciences Co., Ltd., Suzhou 215123, China. Electronic address: peng.gao@abogenbio.com.
⁶ Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China; Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L69 7ZB Liverpool, United Kingdom. Electronic address: jia.meng@xjtlu.edu.cn.
⁷ Suzhou Abogen Biosciences Co., Ltd., Suzhou 215123, China. Electronic address: bo.ying@abogenbio.com.

PMID: 38759861
DOI: 10.1016/j.ijbiomac.2024.132433

Abstract

Nanopore direct RNA sequencing provided a promising solution for unraveling the landscapes of modifications on single RNA molecules. Here, we proposed NanoMUD, a computational framework for predicting the RNA pseudouridine modification (Ψ) and its methylated analog N1-methylpseudouridine (m1Ψ), which have critical application in mRNA vaccination, at single-base and single-molecule resolution from direct RNA sequencing data. Electric signal features were fed into a bidirectional LSTM neural network to achieve improved accuracy and predictive capabilities. Motif-specific models (NNUNN, N = A, C, U or G) were trained based on features extracted from designed dataset and achieved superior performance on molecule-level modification prediction (Ψ models: min AUC = 0.86, max AUC = 0.99; m1Ψ models: min AUC = 0.87, max AUC = 0.99). We then aggregated read-level predictions for site stoichiometry estimation. Given the observed sequence-dependent bias in model performance, we trained regression models based on the distribution of modification probabilities for sites with known stoichiometry. The distribution-based site stoichiometry estimation method allows unbiased comparison between different contexts. To demonstrate the feasibility of our work, three case studies on both in vitro and in vivo transcribed RNAs were presented. NanoMUD will make a powerful tool to facilitate the research on modified therapeutic IVT RNAs and provides useful insight to the landscape and stoichiometry of pseudouridine and N1-pseudouridine on in vivo transcribed RNA species.

Keywords: Deep learning; Epi-transcriptome; N1-methylpseudouridine; Nanopore direct RNA sequencing; Pseudouridine; mRNA vaccines.