A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer

Comput Struct Biotechnol J. 2017 Oct 10:15:463-470. doi: 10.1016/j.csbj.2017.09.003. eCollection 2017.

Abstract

Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1, as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways.

Keywords: AUC, Area Under Curve; Causative mutation; DEG, Differentially expressed gene; DGM, Differential gene module; Gene module; KEGG, Kyoto Encyclopedia of Genes and Genomes; Pathways; Protein-protein interaction; RCC, Renal cell cancer; ROC, Receiver Operating Characteristic; SVM, Support vector machine; TCGA, The Cancer Genome Atlas; ccRCC; ccRCC, Clear cell renal cell carcinoma; eQTL; eQTL, Expression quantitative trait loci.