The genus burkholderia: analysis of 56 genomic sequences

Genome Dyn. 2009:6:140-157. doi: 10.1159/000235768. Epub 2009 Aug 19.

Abstract

The genus Burkholderia consists of a number of very diverse species, both in terms of lifestyle (which varies from category B pathogens to apathogenic soil bacteria and plant colonizers) and their genetic contents. We have used 56 publicly available genomes to explore the genomic diversity within this genus, including genome sequences that are not completely finished, but are available from the NCBI database. Defining the pan- and core genomes of species results in insights in the conserved and variable fraction of genomes, and can verify (or question) historic, taxonomic groupings. We find only several hundred genes that are conserved across all Burkholderia genomes, whilst there are more than 40,000 gene families in the Burkholderia pan-genome. A BLAST matrix visualizes the fraction of conserved genes in pairwise comparisons. A BLAST atlas shows which genes are actually conserved in a number of genomes, located and visualized with reference to a chosen genome. Genomic islands are common in many Burkholderia genomes, and most of these can be readily visualized by DNA structural properties of the chromosome. Trees that are based on relatedness of gene family content yield different results depending on what genes are analyzed. Some of the differences can be explained by errors in incomplete genome sequences, but, as our data illustrate, the outcome of phylogenetic trees depends on the type of genes that are analyzed.

MeSH terms

  • Burkholderia
  • Genome, Bacterial*
  • Genomic Islands
  • Genomics
  • Molecular Sequence Data
  • Phylogeny*
  • Sequence Analysis, DNA