Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins

Bioinformatics. 2005 Jan 1;21(1):39-50. doi: 10.1093/bioinformatics/bth477. Epub 2004 Aug 19.

Abstract

Motivation: Identification of potentially allergenic proteins is needed for the safety assessment of genetically modified foods, certain pharmaceuticals and various other products on the consumer market. Current methods in bioinformatic allergology exploit features common among allergens to detect amino acid sequences of potentially allergenic proteins. One class of features not yet explored is motifs that occur commonly in allergens but rarely in ordinary proteins. In this paper, we present an algorithm that identifies such motifs for the biocomputational detection of amino acid sequences of potential allergens.

Results: Identification of allergen-representative peptides (ARPs) with low or no occurrence in proteins lacking allergenic properties is the essential component of our new method, designated DASARP (Detection based on Automated Selection of Allergen-Representative Peptide). This approach consistently outperforms the criterion based on identical peptide match for predicting allergenicity recommended by ILSI/IFBC and FAO/WHO and shows results comparable to the alignment-based criterion as outlined by FAO/WHO.
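The general idea of selecting peptides that recur in allergens but are absent, or nearly absent, from non-allergens can be illustrated with a minimal sketch. This is not the DASARP implementation: the abstract does not state the peptide length, the selection thresholds or the scoring rule, so the function names (`select_arps`, `count_arp_hits`), the parameters (`k`, `min_allergen_hits`, `max_background_hits`) and the toy sequences below are all assumptions made only for illustration.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of distinct length-k peptides contained in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def select_arps(allergens, non_allergens, k=4,
                min_allergen_hits=2, max_background_hits=0):
    """Pick peptides seen in many allergens and few non-allergens.

    k and both thresholds are hypothetical; the abstract gives no values.
    Counts are per sequence (a peptide counts once per protein).
    """
    allergen_counts = Counter()
    for seq in allergens:
        allergen_counts.update(kmers(seq, k))

    background_counts = Counter()
    for seq in non_allergens:
        background_counts.update(kmers(seq, k))

    return {
        peptide
        for peptide, n in allergen_counts.items()
        if n >= min_allergen_hits
        and background_counts[peptide] <= max_background_hits
    }


def count_arp_hits(query, arps, k=4):
    """Count how many selected peptides occur in the query sequence."""
    return len(kmers(query, k) & arps)


# Toy, made-up sequences (not real allergens).
toy_allergens = ["MKVLAAGIC", "TTVLAAGPW", "QQVLAAGRR"]
toy_non_allergens = ["MKTTPWQQR", "GGICRRSSA"]

arps = select_arps(toy_allergens, toy_non_allergens)
print(sorted(arps))
print(count_arp_hits("SSVLAAGTT", arps))
print(count_arp_hits("MKTTPWGG", arps))
```

In a sketch like this, a query would be flagged when its hit count exceeds some cutoff; how such a cutoff would be chosen and validated is not described in the abstract.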

Availability: The detection software and the ARP set needed for the analysis of a query protein reported here are the property of the Swedish National Food Agency and are available upon request. The protein sequence sets used in this work are publicly available at http://www.slv.se/templatesSLV/SLV_Page____9343.asp. Allergenicity assessment of specific protein sequences of interest can also be requested via ulfh@slv.se.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • Allergens / analysis
  • Allergens / chemistry*
  • Allergens / classification*
  • Amino Acid Motifs
  • Artificial Intelligence
  • Internet
  • Pattern Recognition, Automated / methods
  • Peptides / analysis
  • Peptides / chemistry
  • Peptides / classification
  • Proteins / analysis
  • Proteins / chemistry*
  • Proteins / classification*
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid
  • Software*
  • Structure-Activity Relationship

Substances

  • Allergens
  • Peptides
  • Proteins