Machine-Learning-Assisted Descriptors Identification for Indoor Formaldehyde Oxidation Catalysts

Environ Sci Technol. 2024 May 14;58(19):8372-8379. doi: 10.1021/acs.est.4c01691. Epub 2024 May 1.

Abstract

The development of highly efficient catalysts for formaldehyde (HCHO) oxidation is of significant interest for the improvement of indoor air quality. Up to 400 works relating to the catalytic oxidation of HCHO have been published to date; however, their analysis for collective inference through conventional literature search is still a challenging task. A machine learning (ML) framework was presented to predict catalyst performance from experimental descriptors based on an HCHO oxidation catalysts database. MnOx, CeO2, Co3O4, TiO2, FeOx, ZrO2, Al2O3, SiO2, and carbon-based catalysts with different promoters were compiled from the literature. Notably, 20 descriptors including reaction catalyst composition, reaction conditions, and catalyst physical properties were collected for data mining (2263 data points). Furthermore, the eXtreme Gradient Boosting algorithm was employed, which successfully predicted the conversion efficiency of HCHO with an R-square value of 0.81. Shapley additive analysis suggested Pt/MnO2 and Ag/Ce-Co3O4 exhibited excellent catalytic performance of HCHO oxidation based on the analysis of the entire database. Validated by experimental tests and theoretical simulations, the key descriptor identified by ML, i.e., the first promoter, was further described as metal-support interactions. This study highlights ML as a useful tool for database establishment and the catalyst rational design strategy based on the importance of analysis between experimental descriptors and the performance of complex catalytic systems.

Keywords: SHAP analysis; catalyst; descriptors; formaldehyde; machine learning.

MeSH terms

  • Air Pollution, Indoor*
  • Catalysis
  • Formaldehyde* / chemistry
  • Machine Learning*
  • Oxidation-Reduction*