Analysis of ChIP-seq Data in R/Bioconductor

Methods Mol Biol. 2018:1689:195-226. doi: 10.1007/978-1-4939-7380-4_17.

Abstract

The development of novel high-throughput sequencing methods for ChIP (chromatin immunoprecipitation) has provided a very powerful tool to study gene regulation in multiple conditions at unprecedented resolution and scale. Proactive quality-control and appropriate data analysis techniques are of critical importance to extract the most meaningful results from the data. Over the last years, an array of R/Bioconductor tools has been developed allowing researchers to process and analyze ChIP-seq data. This chapter provides an overview of the methods available to analyze ChIP-seq data based primarily on software packages from the open-source Bioconductor project. Protocols described in this chapter cover basic steps including data alignment, peak calling, quality control and data visualization, as well as more complex methods such as the identification of differentially bound regions and functional analyses to annotate regulatory regions. The steps in the data analysis process were demonstrated on publicly available data sets and will serve as a demonstration of the computational procedures routinely used for the analysis of ChIP-seq data in R/Bioconductor, from which readers can construct their own analysis pipelines.

Keywords: Bioconductor; ChIP-seq; Computational analysis; Data analysis; Sequencing; Workflow.

MeSH terms

  • Animals
  • Chromatin Immunoprecipitation* / methods
  • Computational Biology* / methods
  • Data Interpretation, Statistical
  • Databases, Nucleic Acid
  • High-Throughput Nucleotide Sequencing* / methods
  • Mice
  • Sequence Analysis, DNA
  • Software*
  • Workflow