A chromosome-scale assembly of Brassica carinata (BBCC) accession HC20 containing resistance to multiple pathogens and an early generation assessment of introgressions into B. juncea (AABB)

Plant J. 2024 May 9. doi: 10.1111/tpj.16794. Online ahead of print.

Abstract

Brassica carinata (BBCC) commonly referred to as Ethiopian mustard is a natural allotetraploid containing the genomes of Brassica nigra (BB) and Brassica oleracea (CC). It is an oilseed crop endemic to the northeastern regions of Africa. Although it is under limited cultivation, B. carinata is valuable as it is resistant/highly tolerant to most of the pathogens affecting widely cultivated Brassica species of the U's triangle. We report a chromosome-scale genome assembly of B. carinata accession HC20 using long-read Oxford Nanopore sequencing and Bionano optical maps. The assembly has a scaffold N50 of ~39.8 Mb and covers ~1.11 Gb of the genome. We compared the long-read genome assemblies of the U's triangle species and found extensive gene collinearity between the diploids and allopolyploids with no evidence of major gene losses. Therefore, B. juncea (AABB), B. napus (AACC), and B. carinata can be regarded as strict allopolyploids. We cataloged the nucleotide-binding and leucine-rich repeat immune receptor (NLR) repertoire of B. carinata and, identified 465 NLRs, and compared these with the NLRs in the other Brassica species. We investigated the extent and nature of early-generation genomic interactions between the constituent genomes of B. carinata and B. juncea in interspecific crosses between the two species. Besides the expected recombination between the constituent B genomes, extensive homoeologous exchanges were observed between the A and C genomes. Interspecific crosses, therefore, can be used for transferring disease resistance from B. carinata to B. juncea and broadening the genetic base of the two allotetraploid species.

Keywords: Brassica carinata; Brassica juncea; NLRs; comparative genomics; crop improvement; disease resistance; genetic introgressions; genome assembly; long‐read sequencing.