Computational deconvolution of DNA methylation data from mixed DNA samples

Brief Bioinform. 2024 Mar 27;25(3):bbae234. doi: 10.1093/bib/bbae234.

Abstract

In this review, we provide a comprehensive overview of the different computational tools that have been published for the deconvolution of bulk DNA methylation (DNAm) data. Here, deconvolution refers to the estimation of cell-type proportions that constitute a mixed sample. The paper reviews and compares 25 deconvolution methods (supervised, unsupervised or hybrid) developed between 2012 and 2023 and compares the strengths and limitations of each approach. Moreover, in this study, we describe the impact of the platform used for the generation of methylation data (including microarrays and sequencing), the applied data pre-processing steps and the used reference dataset on the deconvolution performance. Next to reference-based methods, we also examine methods that require only partial reference datasets or require no reference set at all. In this review, we provide guidelines for the use of specific methods dependent on the DNA methylation data type and data availability.

Keywords: DNA methylation profiling; computational deconvolution; tool comparison.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Computational Biology* / methods
  • DNA / genetics
  • DNA Methylation*
  • Humans