Statistical integration of multi-omics and drug screening data from cell lines

PLoS Comput Biol. 2024 Jan 31;20(1):e1011809. doi: 10.1371/journal.pcbi.1011809. eCollection 2024 Jan.

Abstract

Data integration methods are used to obtain a unified summary of multiple datasets. For multi-modal data, we propose a computational workflow to jointly analyze datasets from cell lines. The workflow comprises a novel probabilistic data integration method, named POPLS-DA, for multi-omics data. The workflow is motivated by a study on synucleinopathies where transcriptomics, proteomics, and drug screening data are measured in affected LUHMES cell lines and controls. The aim is to highlight potentially druggable pathways and genes involved in synucleinopathies. First, POPLS-DA is used to prioritize genes and proteins that best distinguish cases and controls. For these genes, an integrated interaction network is constructed where the drug screen data is incorporated to highlight druggable genes and pathways in the network. Finally, functional enrichment analyses are performed to identify clusters of synaptic and lysosome-related genes and proteins targeted by the protective drugs. POPLS-DA is compared to other single- and multi-omics approaches. We found that HSPA5, a member of the heat shock protein 70 family, was one of the most targeted genes by the validated drugs, in particular by AT1-blockers. HSPA5 and AT1-blockers have been previously linked to α-synuclein pathology and Parkinson's disease, showing the relevance of our findings. Our computational workflow identified new directions for therapeutic targets for synucleinopathies. POPLS-DA provided a larger interpretable gene set than other single- and multi-omic approaches. An implementation based on R and markdown is freely available online.

MeSH terms

  • Computational Biology* / methods
  • Drug Evaluation, Preclinical
  • Humans
  • Multiomics
  • Proteomics / methods
  • Synucleinopathies*

Grants and funding

All authors were funded by ERA-Net E-Rare JTC 2018 (MSA-omics) [40- 44000-98-2006/90030376507]. JD was supported by European Union’s Horizon 2020 (IMforFUTURE) [721815] and the EU funded Cost action DYNALIFE [CA21169]. GH was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy within the framework of the Munich Cluster for Systems Neurology (EXC 2145 SyNergy) [390857198] and within the Hannover Cluster RESIST (EXC 2155) [39087428], DFG [HO2402/18-1] MSAomics, the Bavarian Ministry for Education, Culture, Science and Art (ForIPS) [8810001412], Niedersächsisches Ministerium für Wissenschaft und Kunst [ZN3440.TP]: REBIRTH Forschungszentrum für translationale regenerative Medizin, VolkswagenStiftung (Niedersächsisches Vorab), Petermax-Müller Foundation (Etiology and Therapy of Synucleinopathies and Tauopathies). MH and GH received funding from ParkinsonFonds Deutschland (Hypothesis-free compound screening in a new human neuronal model of wild type alpha-synuclein-induced cell death). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.