The misleading effects of composite taxa in supermatrices

Mol Phylogenet Evol. 2003 Jun;27(3):522-7. doi: 10.1016/s1055-7903(03)00020-4.

Abstract

With the amount of available sequence data rapidly increasing, supermatrices are at the forefront of systematic studies. As an alternative to supertrees, supermatrices use a total evidence approach in which different genes and other lines of data are merged into a single data matrix, which is then analyzed in an attempt to obtain the phylogeny that best explains the data. However, questions arise when combining data sets in which one or more taxa lack sequences for some of the individual genes. Two possible solutions are either to leave all taxa separate and code unavailable sequences as missing, or to combine taxa at a level for which monophyly is assumed a priori. By reanalyzing previously published work, we show that combining taxa may yield misleading results, i.e., hypotheses of relationships that are not supported by the underlying data.
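The two coding strategies described above can be illustrated with a minimal sketch. It is not the authors' procedure or data: the gene names, taxon names, sequences, the `clade_X` grouping, and both helper functions are hypothetical, and the rule of taking the first available member sequence for a composite taxon is only one assumed way of merging.

```python
# Hypothetical toy alignments: gene name -> {taxon: aligned sequence}.
# All names and sequences are invented for illustration only.
genes = {
    "geneA": {"sp1": "ACGT", "sp2": "ACGA", "sp3": "ACTT"},
    "geneB": {"sp1": "GGC", "sp3": "GAC", "sp4": "GGA"},
}


def build_supermatrix(genes, missing="?"):
    """Concatenate per-gene alignments, padding absent genes with `missing`."""
    taxa = sorted({t for aln in genes.values() for t in aln})
    matrix = {t: "" for t in taxa}
    for name in sorted(genes):
        aln = genes[name]
        if not aln:
            continue  # no taxon has this gene, so there is nothing to append
        length = len(next(iter(aln.values())))
        for t in taxa:
            # A taxon without this gene gets a block of missing-data symbols.
            matrix[t] += aln.get(t, missing * length)
    return matrix


def build_composite_matrix(genes, groups, missing="?"):
    """Merge taxa into composite terminals before concatenation.

    `groups` maps a composite name to its member taxa. For each gene the
    first member with a sequence is used (an assumed rule), which presumes
    that each group is monophyletic.
    """
    merged = {}
    for name, aln in genes.items():
        merged[name] = {}
        for composite, members in groups.items():
            for m in members:
                if m in aln:
                    merged[name][composite] = aln[m]
                    break
    return build_supermatrix(merged, missing)


# Strategy 1: keep every taxon separate and code unavailable genes as missing.
separate = build_supermatrix(genes)

# Strategy 2: combine sp2 and sp4 into one composite terminal.
groups = {"sp1": ["sp1"], "clade_X": ["sp2", "sp4"], "sp3": ["sp3"]}
composite = build_composite_matrix(genes, groups)

for label, matrix in (("separate", separate), ("composite", composite)):
    print(label)
    for taxon, seq in matrix.items():
        print(f"  {taxon:8s} {seq}")
```

In the composite matrix the `clade_X` row has no missing cells, but it joins sequences from two different taxa; if the assumed group is not in fact monophyletic, that row describes an organism that does not exist, which is the kind of artifact the abstract warns about.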

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Data Interpretation, Statistical
  • Mammals / classification
  • Phylogeny*
  • Sequence Analysis, DNA