Bioinformatics Approach to Assess the Biogeographical Patterns of Soil Communities: The Utility for Soil Provenance

J Forensic Sci. 2018 Jul;63(4):1033-1042. doi: 10.1111/1556-4029.13741. Epub 2018 Jan 22.

Abstract

Soil DNA profiling has potential as a forensic tool to establish a link between soil collected at a crime scene and soil recovered from a suspect. However, a quantitative measure is needed to investigate the spatial/temporal variability across multiple scales prior to their application in forensic science. In this study, soil DNA profiles across Miami-Dade, FL, were generated using length heterogeneity PCR to target four taxa. The objectives of this study were to (i) assess the biogeographical patterns of soils to determine whether soil biota is spatially correlated with geographic location and (ii) evaluate five machine learning algorithms for their predictive ability to recognize biotic patterns which could accurately classify soils at different spatial scales regardless of seasonal collection. Results demonstrate that soil communities have unique patterns and are spatially autocorrelated. Bioinformatic algorithms could accurately classify soils across all scales with Random Forest significantly outperforming all other algorithms regardless of spatial level.

Keywords: Random Forest; forensic science; machine learning algorithms; soil DNA profiling; soil provenance; spatial scale.

MeSH terms

  • Algorithms
  • Archaea / genetics
  • Computational Biology*
  • DNA Fingerprinting
  • DNA, Bacterial / genetics
  • DNA, Fungal / genetics
  • DNA, Intergenic
  • DNA, Plant / genetics
  • DNA, Ribosomal Spacer
  • Ecosystem
  • Florida
  • Forensic Sciences
  • Machine Learning
  • Polymerase Chain Reaction
  • Polymorphism, Restriction Fragment Length
  • RNA, Ribosomal, 16S
  • Soil / chemistry*
  • Spatial Analysis*

Substances

  • DNA, Bacterial
  • DNA, Fungal
  • DNA, Intergenic
  • DNA, Plant
  • DNA, Ribosomal Spacer
  • RNA, Ribosomal, 16S
  • Soil