Serine proteases and their homologs in the Drosophila melanogaster genome: an initial analysis of sequence conservation and phylogenetic relationships

Gene. 2003 Jan 30:304:117-31. doi: 10.1016/s0378-1119(02)01187-3.

Abstract

Serine proteases (SPs) and serine protease homologs (SPHs) constitute the second largest family of genes in the Drosophila melanogaster genome. Eighty-four SPs comprise less than 300 amino acid residues, and a significant portion of them are probably digestive enzymes. Some larger SPs may contain one or more regions important for protein-protein interactions, including clip domains, low-density lipoprotein receptor class A repeats, and scavenger receptor cysteine-rich domains. We identified 37 clusters of SP or SPH genes, which probably evolved from relatively recent gene duplication and sequence divergence. A majority of the SPs may be trypsin-like and activated by cleavage after a specific arginine or lysine residue. Among the 147 SPs and 57 SPHs studied, 24 SPs and 13 SPHs contain at least one regulatory clip domain. A multiple sequence alignment of the clip domains provided further information on structural conservation of these regulatory modules. Detailed sequence comparison led to an improved classification system for SPs containing clip domains. These analyses have established a framework of information about evolutionary relationships among the Drosophila SPs and SPHs, which may facilitate research on these proteins as well as homologous molecules from other invertebrate species.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Catalytic Domain / genetics
  • Chromosome Mapping
  • Conserved Sequence / genetics
  • Databases, Protein
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster / enzymology
  • Drosophila melanogaster / genetics*
  • Expressed Sequence Tags
  • Genome*
  • Molecular Sequence Data
  • Phylogeny*
  • Sequence Alignment
  • Sequence Homology, Amino Acid
  • Serine Endopeptidases / genetics*

Substances

  • Drosophila Proteins
  • Serine Endopeptidases