Ensemble-GNN: federated ensemble learning with graph neural networks for disease module discovery and classification

Bastian Pfeifer; Hryhorii Chereda; Roman Martin; Anna Saranti; Sandra Clemens; Anne-Christin Hauschild; Tim Beißbarth; Andreas Holzinger; Dominik Heider

doi:10.1093/bioinformatics/btad703

Ensemble-GNN: federated ensemble learning with graph neural networks for disease module discovery and classification

Bioinformatics. 2023 Nov 1;39(11):btad703. doi: 10.1093/bioinformatics/btad703.

Authors

Affiliations

¹ Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz 8036, Austria.
² Medical Bioinformatics, University Medical Center Göttingen, Göttingen 37077, Germany.
³ Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Marburg 35043, Germany.
⁴ Human-Centered AI Lab, University of Natural Resources and Life Sciences, Vienna 1190, Austria.
⁵ Institute for Medical Informatics, University Medical Center Göttingen, Göttingen 37075, Germany.

Abstract

Summary: Federated learning enables collaboration in medicine, where data is scattered across multiple centers without the need to aggregate the data in a central cloud. While, in general, machine learning models can be applied to a wide range of data types, graph neural networks (GNNs) are particularly developed for graphs, which are very common in the biomedical domain. For instance, a patient can be represented by a protein-protein interaction (PPI) network where the nodes contain the patient-specific omics features. Here, we present our Ensemble-GNN software package, which can be used to deploy federated, ensemble-based GNNs in Python. Ensemble-GNN allows to quickly build predictive models utilizing PPI networks consisting of various node features such as gene expression and/or DNA methylation. We exemplary show the results from a public dataset of 981 patients and 8469 genes from the Cancer Genome Atlas (TCGA).

Availability and implementation: The source code is available at https://github.com/pievos101/Ensemble-GNN, and the data at Zenodo (DOI: 10.5281/zenodo.8305122).

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

DNA Methylation*
Humans
Machine Learning*
Neural Networks, Computer
Protein Interaction Maps
Software

Grants and funding

826078/European Union's Horizon 2020 research and innovation programme