Peptide binding at class I major histocompatibility complex scored with linear functions and support vector machines

Genome Inform. 2004;15(1):198-212.

Abstract

We explore two different methods to predict the binding ability of nonapeptides at the class I major histocompatibility complex using a general linear scoring function that defines a separating hyperplane in the feature space of sequences. In absence of suitable data on non-binding nonapeptides we generated sequences randomly from a selected set of proteins from the protein data bank. The parameters of the scoring function were determined by a generalized least square optimization (LSM) and alternatively by the support vector machine (SVM). With the generalized LSM impaired data for learning with a small set of binding peptides and a large set of non-binding peptides can be treated in a balanced way rendering LSM more successful than SVM, while for symmetric data sets SVM has a slight advantage compared to LSM.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Computer Simulation
  • Databases, Protein*
  • Genes, MHC Class I*
  • Histocompatibility Antigens Class I / genetics*
  • Least-Squares Analysis
  • Major Histocompatibility Complex
  • Peptides / chemistry
  • Peptides / immunology

Substances

  • Histocompatibility Antigens Class I
  • Peptides