The multiple common point set problem and its application to molecule binding pattern detection

J Comput Biol. 2006 Mar;13(2):407-28. doi: 10.1089/cmb.2006.13.407.

Abstract

Recognition of binding patterns common to a set of protein structures is important for recognition of function, prediction of binding, and drug design. We consider protein binding sites represented by a set of 3D points with assigned physico-chemical and geometrical properties important for protein-ligand interactions. We formulate the multiple binding site alignment problem as detection of the largest common set of such 3D points. We discuss the computational problem of multiple common point set detection and, particularly, the matching problem in K-partite-epsilon graphs, where K partitions are associated with K structures and edges are defined between epsilon-close points. We show that the K-partite-epsilon matching problem is NP-hard in the Euclidean space with dimension larger than one. Consequently, we show that the largest common point set problem between three point sets is NP-hard. On the practical side, we present a novel computational method, MultiBind, for recognition of binding patterns common to a set of protein structures. It performs a multiple alignment between protein binding sites in the absence of overall sequence, fold, or binding partner similarity. Despite the NP-hardness results, in our applications, we practically overcome the exponential number of multiple alignment combinations by applying an efficient branch-and-bound filtering procedure. We show applications of MultiBind to several biological targets. The method recognizes patterns which are responsible for binding small molecules, such as estradiol, ATP/ANP, and transition state analogues.
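
The K-partite-epsilon matching idea can be illustrated with a small sketch. This is not the MultiBind implementation: it assumes the K point sets are already superimposed in a common frame (no search over rigid transformations), it ignores physico-chemical labels, and it assumes that a K-tuple counts as a match only when all of its points are pairwise within epsilon. The function names `epsilon_tuples` and `largest_matching` are hypothetical, and the pruning rule is a generic branch-and-bound bound, not the filtering procedure of the paper. The search is exponential in the worst case, in line with the NP-hardness result.

```python
from itertools import product
from math import dist


def epsilon_tuples(point_sets, eps):
    """Return all index tuples (one index per set) whose points are
    pairwise within eps of each other (assumed match criterion)."""
    tuples = []
    for idx in product(*(range(len(s)) for s in point_sets)):
        pts = [point_sets[k][i] for k, i in enumerate(idx)]
        if all(dist(p, q) <= eps
               for a, p in enumerate(pts) for q in pts[a + 1:]):
            tuples.append(idx)
    return tuples


def largest_matching(point_sets, eps):
    """Find a largest set of disjoint epsilon-close K-tuples by
    exhaustive search with a simple branch-and-bound cutoff."""
    cands = epsilon_tuples(point_sets, eps)
    best = []

    def search(start, chosen, used):
        nonlocal best
        if len(chosen) > len(best):
            best = list(chosen)
        for j in range(start, len(cands)):
            # Bound: even taking every remaining candidate cannot beat best.
            if len(chosen) + (len(cands) - j) <= len(best):
                return
            t = cands[j]
            # Each point may belong to at most one tuple.
            if any(i in used[k] for k, i in enumerate(t)):
                continue
            for k, i in enumerate(t):
                used[k].add(i)
            chosen.append(t)
            search(j + 1, chosen, used)
            chosen.pop()
            for k, i in enumerate(t):
                used[k].discard(i)

    search(0, [], [set() for _ in point_sets])
    return best
```

For example, calling `largest_matching([A, B, C], eps=0.5)` on three small lists of coordinate tuples returns a list of index triples, one triple per common point. Enumerating all K-tuples up front is only feasible for very small inputs; it is done here to keep the sketch short.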

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural

MeSH terms

  • Adenosine Triphosphate / metabolism*
  • Algorithms*
  • Binding Sites
  • Catalytic Domain
  • Estradiol / metabolism*
  • Ligands
  • Models, Molecular
  • Protein Conformation
  • Protein Kinases / chemistry
  • Protein Kinases / metabolism*
  • Proteins / chemistry*
  • Proteins / metabolism*

Substances

  • Ligands
  • Proteins
  • Estradiol
  • Adenosine Triphosphate
  • Protein Kinases