Chromosome level genome assembly of the Etruscan shrew Suncus etruscus

Sci Data. 2024 Feb 7;11(1):176. doi: 10.1038/s41597-024-03011-x.

Abstract

Suncus etruscus is one of the world's smallest mammals, with an average body mass of about 2 grams. The Etruscan shrew's small body is accompanied by a very high energy demand and numerous metabolic adaptations. Here we report a chromosome-level genome assembly using PacBio long read sequencing, 10X Genomics linked short reads, optical mapping, and Hi-C linked reads. The assembly is partially phased, with a 2.472 Gbp primary pseudohaplotype and a 1.515 Gbp alternate pseudohaplotype. We manually curated the primary assembly and identified 22 chromosomes, including X and Y sex chromosomes. The NCBI genome annotation pipeline identified 39,091 genes, 19,819 of them protein-coding. We also identified segmental duplications, inferred GO term annotations, and computed orthologs of human and mouse genes. This reference-quality genome will be an important resource for research on mammalian development, metabolism, and body size control.

Publication types

  • Dataset

MeSH terms

  • Animals
  • Chromosomes* / genetics
  • Genome
  • Genomics
  • Mice
  • Molecular Sequence Annotation
  • Shrews* / genetics