Surfeit locus gene homologs are widely distributed in invertebrate genomes

Mol Cell Biol. 1996 Oct;16(10):5591-6. doi: 10.1128/MCB.16.10.5591.

Abstract

The mouse Surfeit locus contains six sequence-unrelated genes (Surf-1 to -6) arranged in the tightest gene cluster so far described for mammals. The organization and juxtaposition of five of the Surfeit genes (Surf-1 to -5) are conserved between mammals and birds, and this may reflect a functional or regulatory requirement for the gene clustering. We have undertaken an evolutionary study to determine whether the Surfeit genes are conserved and clustered in invertebrate genomes. Drosophila melanogaster and Caenorhabditis elegans homologs of the mouse Surf-4 gene, which encodes an integral membrane protein associated with the endoplasmic reticulum, have been isolated. The amino acid sequences of the Drosophila and C. elegans homologs are highly conserved in comparison with the mouse Surf-4 protein. In particular, a dilysine motif implicated in endoplasmic reticulum localization of the mouse protein is conserved in the invertebrate homologs. We show that the Drosophila Surf-4 gene, which is transcribed from a TATA-less promoter, is not closely associated with other Drosophila Surfeit gene homologs but rather is located upstream from sequences encoding a homolog of a yeast seryl-tRNA synthetase protein. There are at least two closely linked Surf-3/rpL7a genes or highly polymorphic alleles of a single Surf-3/rpL7a gene in the C. elegans genome. The chromosomal locations of the C. elegans Surf-1, Surf-3/rpL7a, and Surf-4 genes have been determined. In D. melanogaster the Surf-3/rpL7a, Surf-4, and Surf-5 gene homologs and in C. elegans the Surf-1, Surf-3/rpL7a, Surf-4, and Surf-5 gene homologs are located on completely different chromosomes, suggesting that any requirement for the tight clustering of the genes in the Surfeit locus is restricted to vertebrate lineages.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans Proteins*
  • Conserved Sequence
  • Drosophila Proteins*
  • Drosophila melanogaster / genetics*
  • Genes, Insect
  • Genetic Linkage
  • Genome
  • Membrane Proteins / biosynthesis
  • Membrane Proteins / chemistry
  • Membrane Proteins / genetics*
  • Mice
  • Molecular Sequence Data
  • Oligonucleotide Probes
  • Polymerase Chain Reaction
  • Promoter Regions, Genetic
  • Ribosomal Proteins / chemistry
  • Ribosomal Proteins / genetics
  • Saccharomyces cerevisiae / genetics
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Serine-tRNA Ligase / chemistry
  • Transcription, Genetic

Substances

  • Caenorhabditis elegans Proteins
  • Drosophila Proteins
  • Membrane Proteins
  • Oligonucleotide Probes
  • Ribosomal Proteins
  • RpL7A protein, Drosophila
  • Rpl7a protein, mouse
  • Surf4 protein, Drosophila
  • sft-4 protein, C elegans
  • Serine-tRNA Ligase

Associated data

  • GENBANK/Y14823
  • GENBANK/Y14949