Machine learning predicts the functional composition of the protein corona and the cellular recognition of nanoparticles

Proc Natl Acad Sci U S A. 2020 May 12;117(19):10492-10499. doi: 10.1073/pnas.1919755117. Epub 2020 Apr 24.

Abstract

Protein corona formation is critical for the design of ideal and safe nanoparticles (NPs) for nanomedicine, biosensing, organ targeting, and other applications, but methods to quantitatively predict the formation of the protein corona, especially for functional compositions, remain unavailable. The traditional linear regression model performs poorly for the protein corona, as measured by R2 (less than 0.40). Here, the performance with R2 over 0.75 in the prediction of the protein corona was achieved by integrating a machine learning model and meta-analysis. NPs without modification and surface modification were identified as the two most important factors determining protein corona formation. According to experimental verification, the functional protein compositions (e.g., immune proteins, complement proteins, and apolipoproteins) in complex coronas were precisely predicted with good R2 (most over 0.80). Moreover, the method successfully predicted the cellular recognition (e.g., cellular uptake by macrophages and cytokine release) mediated by functional corona proteins. This workflow provides a method to accurately and quantitatively predict the functional composition of the protein corona that determines cellular recognition and nanotoxicity to guide the synthesis and applications of a wide range of NPs by overcoming limitations and uncertainty.

Keywords: cellular recognition; machine learning; nano-bio interface; nanotoxicity; protein corona.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Forecasting / methods*
  • Humans
  • Machine Learning
  • Macrophages
  • Mice
  • Models, Theoretical
  • Nanoparticles / metabolism*
  • Protein Corona / metabolism*
  • Proteins
  • RAW 264.7 Cells

Substances

  • Protein Corona
  • Proteins