Significance testing for small microarray experiments

Charles Kooperberg; Aaron Aragaki; Andrew D Strand; James M Olson

doi:10.1002/sim.2109

Significance testing for small microarray experiments

Stat Med. 2005 Aug 15;24(15):2281-98. doi: 10.1002/sim.2109.

Authors

Charles Kooperberg¹, Aaron Aragaki, Andrew D Strand, James M Olson

Affiliation

¹ Fred Hutchinson Cancer Research Center, Division of Public Health Sciences, Seattle, WA 98109, USA. clk@fhcrc.org

PMID: 15889452
DOI: 10.1002/sim.2109

Abstract

Which significance test is carried out when the number of repeats is small in microarray experiments can dramatically influence the results. When in two sample comparisons both conditions have fewer than, say, five repeats traditional test statistics require extreme results, before a gene is considered statistically significant differentially expressed after a multiple comparisons correction. In the literature many approaches to circumvent this problem have been proposed. Some of these proposals use (empirical) Bayes arguments to moderate the variance estimates for individual genes. Other proposals try to stabilize these variance estimate by combining groups of genes or similar experiments. In this paper we compare several of these approaches, both on data sets where both experimental conditions are the same, and thus few statistically significant differentially expressed genes should be identified, and on experiments where both conditions do differ. This allows us to identify which approaches are most powerful without identifying many false positives. We conclude that after balancing the numbers of false positives and true positives an empirical Bayes approach and an approach which combines experiments perform best. Standard t-tests are inferior and offer almost no power when the sample size is small.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Animals
Bayes Theorem*
Computer Simulation
DNA Fingerprinting / methods
Data Interpretation, Statistical*
False Positive Reactions
Humans
Huntington Disease / genetics
Mice
Oligonucleotide Array Sequence Analysis / methods*

Abstract

Publication types

MeSH terms

Grants and funding