Computer-based annotation of putative AraC/XylS-family transcription factors of known structure but unknown function

J Biomed Biotechnol. 2012:2012:103132. doi: 10.1155/2012/103132. Epub 2012 Mar 13.

Abstract

Currently, about 20 crystal structures per day are released and deposited in the Protein Data Bank. A significant fraction of these structures is produced by research groups associated with the structural genomics consortium. The biological function of many of these proteins is unknown or has not been validated by experiment. Therefore, a growing need for functional prediction of protein structures has emerged. Here we present an integrated bioinformatics method that combines sequence-based relationships and three-dimensional (3D) structural similarity of transcriptional regulators with computer prediction of their cognate DNA binding sequences. We applied this method to the AraC/XylS family of transcription factors, a large family of transcriptional regulators that is found in many bacteria and controls the expression of genes involved in diverse biological functions. We identified three putative new members of this family with known 3D structure but unknown function and provide a probable functional classification for each. Our bioinformatics analyses suggest that they could be involved in plant cell wall degradation (Lin2118 protein from Listeria innocua, PDB code 3oou), symbiotic nitrogen fixation (protein from Chromobacterium violaceum, PDB code 3oio), and either metabolism of plant-derived biomass or nitrogen fixation (protein from Rhodopseudomonas palustris, PDB code 3mn2).
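The abstract does not say how the cognate DNA binding sequences were predicted. As a generic illustration only, one common way to predict binding sites computationally is to build a log-odds position weight matrix (PWM) from aligned known sites and scan a candidate promoter with it. The sketch below assumes this approach; the aligned sites, the query sequence, and the function names `build_pwm`, `scan`, and `best_hit` are hypothetical and are not taken from the paper.

```python
import math

# Hypothetical aligned binding sites (toy data, NOT from the paper).
SITES = ["TGGCAT", "TGGCAA", "TGGAAT", "TGCCAT"]
BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds PWM (one dict per column) from equal-length sites.

    Assumes a uniform background base frequency, which is a simplification.
    """
    length = len(sites[0])
    total = len(sites) + pseudocount * len(BASES)
    pwm = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / background)
            for base in BASES
        })
    return pwm


def scan(sequence, pwm):
    """Score every window of the sequence; return (offset, score) pairs."""
    width = len(pwm)
    return [
        (i, sum(pwm[j][sequence[i + j]] for j in range(width)))
        for i in range(len(sequence) - width + 1)
    ]


def best_hit(sequence, pwm):
    """Return the (offset, score) pair of the highest-scoring window."""
    return max(scan(sequence, pwm), key=lambda hit: hit[1])


if __name__ == "__main__":
    pwm = build_pwm(SITES)
    offset, score = best_hit("AAATGGCATCCC", pwm)
    print(f"best window starts at offset {offset}, score {score:.2f}")
```

In practice such a score would be only one line of evidence, to be weighed alongside the sequence-based and structural similarity that the abstract describes.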

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • AraC Transcription Factor / chemistry
  • AraC Transcription Factor / classification*
  • Binding Sites
  • Cluster Analysis
  • Computational Biology / methods*
  • Databases, Protein
  • Models, Molecular
  • Models, Statistical
  • Molecular Sequence Annotation / methods*
  • Molecular Sequence Data
  • Sequence Alignment
  • Transcription Factors / chemistry
  • Transcription Factors / classification*

Substances

  • AraC Transcription Factor
  • Transcription Factors

Associated data

  • PDB/3MN2
  • PDB/3OIO
  • PDB/3OOU