Design and analysis of quantitative differential proteomics investigations using LC-MS technology

J Bioinform Comput Biol. 2008 Feb;6(1):107-23. doi: 10.1142/s0219720008003321.

Abstract

Liquid chromatography-mass spectrometry (LC-MS)-based proteomics is an increasingly important tool for characterizing the abundance of proteins in biological samples of various types and across conditions. Effects of disease or drug treatments on protein abundance are of particular interest for the characterization of biological processes and the identification of biomarkers. Although state-of-the-art instrumentation can make high-quality measurements and commercial software is available to process the data, the complexity of the technology and data presents challenges for bioinformaticians and statisticians. Here, we describe a pipeline for the analysis of quantitative LC-MS data. Key components of this pipeline include experimental design (sample pooling, blocking, and randomization) as well as deconvolution and alignment of mass chromatograms to generate a matrix of molecular abundance profiles. An important challenge in LC-MS-based quantitation is accurately identifying the members of protein families and assigning abundance measurements to them. To address this issue, we implement a novel statistical method for inferring the relative abundance of related members of protein families from tryptic peptide intensities. This pipeline has been used to analyze quantitative LC-MS data from multiple biomarker discovery projects. We illustrate our pipeline here with examples from two of these studies, and show that the pipeline constitutes a complete, workable framework for LC-MS-based differential quantitation. Supplementary material is available at http://iec01.mie.utoronto.ca/~thodoros/Bukhman/.
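
The experimental design step named above (blocking and randomization) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each sample carries a condition label and a block label (for example, a sample-preparation batch), that blocks are run contiguously in a shuffled order, and that samples are shuffled within each block. The function name `blocked_run_order`, the sample identifiers, and the block labels are hypothetical.

```python
import random
from collections import defaultdict


def blocked_run_order(samples, seed=0):
    """Return a randomized LC-MS run order that keeps blocks together.

    samples: list of (sample_id, condition, block) tuples (hypothetical layout).
    Returns a list of (block, sample_id, condition) tuples in run order.
    Blocks are run in shuffled order; samples are shuffled within each block.
    """
    rng = random.Random(seed)  # fixed seed so the design is reproducible

    # Group samples by their block label.
    by_block = defaultdict(list)
    for sample_id, condition, block in samples:
        by_block[block].append((sample_id, condition))

    # Randomize the order in which blocks are run.
    blocks = sorted(by_block)
    rng.shuffle(blocks)

    # Randomize sample order within each block.
    order = []
    for block in blocks:
        members = list(by_block[block])
        rng.shuffle(members)
        order.extend((block, sid, cond) for sid, cond in members)
    return order


# Hypothetical design: three preparation batches, one sample per condition each.
samples = [
    ("S1", "control", "batch1"), ("S2", "treated", "batch1"),
    ("S3", "control", "batch2"), ("S4", "treated", "batch2"),
    ("S5", "control", "batch3"), ("S6", "treated", "batch3"),
]

if __name__ == "__main__":
    for block, sample_id, condition in blocked_run_order(samples, seed=1):
        print(block, sample_id, condition)
```

Keeping each block contiguous means instrument drift over the run is partly absorbed by the block term rather than being confounded with condition, while the within-block shuffle prevents one condition from always being injected first.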

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Biotechnology / methods
  • Chromatography, Liquid / methods*
  • Mass Spectrometry / methods*
  • Molecular Sequence Data
  • Peptide Mapping / methods*
  • Proteome / chemistry*
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods*
  • Software
  • Software Design

Substances

  • Proteome