Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice

Nat Genet. 2018 Feb;50(2):278-284. doi: 10.1038/s41588-018-0041-z. Epub 2018 Jan 15.

Abstract

The rich genetic diversity in Oryza sativa and Oryza rufipogon serves as the main sources in rice breeding. Large-scale resequencing has been undertaken to discover allelic variants in rice, but much of the information for genetic variation is often lost by direct mapping of short sequence reads onto the O. sativa japonica Nipponbare reference genome. Here we constructed a pan-genome dataset of the O. sativa-O. rufipogon species complex through deep sequencing and de novo assembly of 66 divergent accessions. Intergenomic comparisons identified 23 million sequence variants in the rice genome. This catalog of sequence variations includes many known quantitative trait nucleotides and will be helpful in pinpointing new causal variants that underlie complex traits. In particular, we systemically investigated the whole set of coding genes using this pan-genome data, which revealed extensive presence and absence of variation among rice accessions. This pan-genome resource will further promote evolutionary and functional studies in rice.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Crops, Agricultural / genetics*
  • Domestication
  • Genetic Variation*
  • Genome, Plant*
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Oryza / classification
  • Oryza / genetics*
  • Plant Breeding
  • Sequence Analysis, DNA