A Bayesian Change Point Model for Dynamic Alternative Transcription Start Site Usage During Cellular Differentiation

J Comput Biol. 2024 May;31(5):445-457. doi: 10.1089/cmb.2023.0174. Epub 2024 May 14.

Abstract

ABSTRACT An alternative transcription start site (ATSS) is a major driving force for increasing the complexity of transcripts in human tissues. As a transcriptional regulatory mechanism, ATSS has biological significance. Many studies have confirmed that ATSS plays an important role in diseases and cell development and differentiation. However, exploration of its dynamic mechanisms remains insufficient. Identifying ATSS change points during cell differentiation is critical for elucidating potential dynamic mechanisms. For relative ATSS usage as percentage data, the existing methods lack sensitivity to detect the change point for ATSS longitudinal data. In addition, some methods have strict requirements for data distribution and cannot be applied to deal with this problem. In this study, the Bayesian change point detection model was first constructed using reparameterization techniques for two parameters of a beta distribution for the percentage data type, and the posterior distributions of parameters and change points were obtained using Markov Chain Monte Carlo (MCMC) sampling. With comprehensive simulation studies, the performance of the Bayesian change point detection model is found to be consistently powerful and robust across most scenarios with different sample sizes and beta distributions. Second, differential ATSS events in the real data, whose change points were identified using our method, were clustered according to their change points. Last, for each change point, pathway and transcription factor motif analyses were performed on its differential ATSS events. The results of our analyses demonstrated the effectiveness of the Bayesian change point detection model and provided biological insights into cell differentiation.

Keywords: Bayesian change point detection model; alternative transcription start site; cell differentiation; dynamic mechanisms; percentage data type.

MeSH terms

  • Algorithms
  • Bayes Theorem*
  • Cell Differentiation* / genetics
  • Computer Simulation
  • Humans
  • Markov Chains
  • Models, Genetic
  • Monte Carlo Method
  • Transcription Initiation Site*