Unexpected links reflect the noise in networks

Anatoly Yambartsev; Michael A Perlin; Yevgeniy Kovchegov; Natalia Shulzhenko; Karina L Mine; Xiaoxi Dong; Andrey Morgun

doi:10.1186/s13062-016-0155-0

Biol Direct. 2016 Oct 13;11(1):52. doi: 10.1186/s13062-016-0155-0.

Authors

Anatoly Yambartsev¹, Michael A Perlin², Yevgeniy Kovchegov³, Natalia Shulzhenko⁴, Karina L Mine⁵, Xiaoxi Dong², Andrey Morgun⁶

Affiliations

¹ Department of Statistics, Institute of Mathematics and Statistics, University of Sao Paulo, Sao Paulo, SP, Brazil.
² College of Pharmacy, Oregon State University, Corvallis, OR, USA.
³ Department of Mathematics, College of Science, Oregon State University, Corvallis, OR, USA.
⁴ College of Veterinary Medicine, Oregon State University, Corvallis, OR, USA.
⁵ Instituto de Imunogenética - Associação Fundo de Incentivo à Pesquisa (IGEN-AFIP), São Paulo, SP, Brazil.
⁶ College of Pharmacy, Oregon State University, Corvallis, OR, USA. anemorgun@hotmail.com.

Abstract

Background: Gene covariation networks are commonly used to study biological processes. The inference of gene covariation networks from observational data can be challenging, especially considering the large number of players involved and the small number of biological replicates available for analysis.

Results: We propose a new statistical method for estimating the number of erroneous edges in reconstructed networks that strongly enhances commonly used inference approaches. The method is based on a special relationship between the sign of correlation (positive/negative) and the directionality (up/down) of gene regulation, and it allows approximately half of all erroneous edges to be identified and removed. Using the mathematical model of Bayesian networks and positive correlation inequalities, we establish a mathematical foundation for our method. Analyzing existing biological datasets, we find a strong correlation between the results of our method and the false discovery rate (FDR). Furthermore, simulation analysis demonstrates that our method provides a more accurate estimate of network error than FDR.
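The sign/direction relationship described in the Results can be illustrated with a minimal Python sketch. It is not the authors' implementation. It rests on assumptions the abstract does not spell out: that two genes regulated in the same direction are expected to correlate positively, that genes regulated in opposite directions are expected to correlate negatively, and that an edge violating this rule counts as "unexpected". The function names, the edge and direction data layout, and the doubling used for the error estimate (derived from the abstract's "approximately half of all erroneous edges") are all hypothetical choices made for illustration.

```python
def is_unexpected(direction_a, direction_b, correlation):
    """Return True if the correlation sign disagrees with the regulation directions.

    Assumed rule (not stated in the abstract): same direction -> positive
    correlation expected; opposite directions -> negative correlation expected.
    A correlation of exactly zero is treated as non-positive here.
    """
    expects_positive = direction_a == direction_b
    return (correlation > 0) != expects_positive


def split_edges(edges, direction):
    """Separate edges into expected and unexpected ones.

    edges: iterable of (gene_a, gene_b, correlation) tuples.
    direction: dict mapping gene name to +1 (up-regulated) or -1 (down-regulated).
    """
    expected, unexpected = [], []
    for gene_a, gene_b, corr in edges:
        if is_unexpected(direction[gene_a], direction[gene_b], corr):
            unexpected.append((gene_a, gene_b, corr))
        else:
            expected.append((gene_a, gene_b, corr))
    return expected, unexpected


def estimate_error_fraction(edges, direction):
    """Rough estimate of the fraction of erroneous edges.

    Assumption: erroneous edges are equally likely to look expected or
    unexpected, so the total error is about twice the unexpected fraction.
    The result is capped at 1.0.
    """
    edges = list(edges)
    if not edges:
        return 0.0
    _, unexpected = split_edges(edges, direction)
    return min(1.0, 2.0 * len(unexpected) / len(edges))


# Toy data with hypothetical gene names.
direction = {"g1": +1, "g2": +1, "g3": -1, "g4": -1}
edges = [
    ("g1", "g2", 0.8),   # both up, positive correlation: expected
    ("g1", "g3", -0.7),  # up vs down, negative correlation: expected
    ("g3", "g4", 0.6),   # both down, positive correlation: expected
    ("g2", "g4", 0.5),   # up vs down, positive correlation: unexpected
]
expected, unexpected = split_edges(edges, direction)
error_fraction = estimate_error_fraction(edges, direction)
```

In this toy network one of four edges is unexpected, so dropping it leaves three edges and the sketch's error estimate for the original network is 0.5. Under the stated assumptions, pruning the unexpected edges removes only the erroneous edges that happen to have the wrong sign, which is why roughly half of the errors would remain.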

Conclusions: Our study provides a new, robust approach for improving the reconstruction of covariation networks.

Reviewers: This article was reviewed by Eugene Koonin, Sergei Maslov, Daniel Yasumasa Takahashi.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Bayes Theorem
Computational Biology / methods*
Gene Expression Regulation*
Gene Regulatory Networks*