De novo design of protein interactions with learned surface fingerprints

Nature. 2023 May;617(7959):176-184. doi: 10.1038/s41586-023-05993-x. Epub 2023 Apr 26.

Abstract

Physical interactions between proteins are essential for most biological processes governing life1. However, the molecular determinants of such interactions have been challenging to understand, even as genomic, proteomic and structural data increase. This knowledge gap has been a major obstacle for the comprehensive understanding of cellular protein-protein interaction networks and for the de novo design of protein binders that are crucial for synthetic biology and translational applications2-9. Here we use a geometric deep-learning framework operating on protein surfaces that generates fingerprints to describe geometric and chemical features that are critical to drive protein-protein interactions10. We hypothesized that these fingerprints capture the key aspects of molecular recognition that represent a new paradigm in the computational design of novel protein interactions. As a proof of principle, we computationally designed several de novo protein binders to engage four protein targets: SARS-CoV-2 spike, PD-1, PD-L1 and CTLA-4. Several designs were experimentally optimized, whereas others were generated purely in silico, reaching nanomolar affinity with structural and mutational characterization showing highly accurate predictions. Overall, our surface-centric approach captures the physical and chemical determinants of molecular recognition, enabling an approach for the de novo design of protein interactions and, more broadly, of artificial proteins with function.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Computer Simulation*
  • Deep Learning*
  • Humans
  • Protein Binding*
  • Protein Interaction Maps
  • Proteins* / chemistry
  • Proteins* / metabolism
  • Proteomics
  • Synthetic Biology

Substances

  • Proteins
  • spike protein, SARS-CoV-2
  • CTLA4 protein, human
  • CD274 protein, human
  • PDCD1 protein, human