MT-HESS: an efficient Bayesian approach for simultaneous association detection in OMICS datasets, with application to eQTL mapping in multiple tissues

Bioinformatics. 2016 Feb 15;32(4):523-32. doi: 10.1093/bioinformatics/btv568. Epub 2015 Oct 26.

Abstract

Motivation: Analysing the joint association between a large set of responses and predictors is a fundamental statistical task in integrative genomics, exemplified by numerous expression Quantitative Trait Loci (eQTL) studies. Of particular interest are the so-called ': hotspots ': , important genetic variants that regulate the expression of many genes. Recently, attention has focussed on whether eQTLs are common to several tissues, cell-types or, more generally, conditions or whether they are specific to a particular condition.

Results: We have implemented MT-HESS, a Bayesian hierarchical model that analyses the association between a large set of predictors, e.g. SNPs, and many responses, e.g. gene expression, in multiple tissues, cells or conditions. Our Bayesian sparse regression algorithm goes beyond ': one-at-a-time ': association tests between SNPs and responses and uses a fully multivariate model search across all linear combinations of SNPs, coupled with a model of the correlation between condition/tissue-specific responses. In addition, we use a hierarchical structure to leverage shared information across different genes, thus improving the detection of hotspots. We show the increase of power resulting from our new approach in an extensive simulation study. Our analysis of two case studies highlights new hotspots that would remain undetected by standard approaches and shows how greater prediction power can be achieved when several tissues are jointly considered.

Availability and implementation: C[Formula: see text] source code and documentation including compilation instructions are available under GNU licence at http://www.mrc-bsu.cam.ac.uk/software/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Bayes Theorem*
  • Diabetes Mellitus, Type 1 / genetics
  • Gene Expression Regulation*
  • Gene Regulatory Networks*
  • Genomics / methods
  • Humans
  • Inflammation / genetics*
  • Inflammatory Bowel Diseases / genetics*
  • Models, Theoretical
  • Organ Specificity
  • Polymorphism, Single Nucleotide / genetics
  • Programming Languages
  • Quantitative Trait Loci / genetics*
  • Rats
  • Software*
  • Tissue Distribution