Identification and characterization of simple sequence repeats in the genomes of Shigella species

Gene. 2003 Dec 11:322:85-92. doi: 10.1016/j.gene.2003.09.017.

Abstract

A variety of simple sequence repeats (SSRs) have been identified in the genome of Shigella flexneri serotype 2a (strain Sf301), an enteric pathogen that causes bacillary dysentery in humans. The distribution of SSRs, with unit lengths ranging from 1 to 9 nucleotides, was biased across different regions of the genome. Tri-, tetra- and hexanucleotide SSRs prevailed in the coding regions, while mono- and dinucleotide SSRs were more common in the noncoding regions. Many intergenic SSRs are less than 30 bp away from the downstream open reading frames (ORFs), suggesting a potential role in transcriptional regulation. To study polymorphism of SSRs, we compared 17 coding-region SSRs from strain Sf301 with the corresponding sequences from 23 other strains of four Shigella species. Five chromosomal loci were found to be polymorphic, of which those from S. flexneri strains were the most variable. Particularly interesting is the C5-1 locus in the coding sequence of the hcaD gene, which encodes a subunit of ferredoxin reductase. Depending on how many copies of the unit sequence (CGCAG) are inserted, the Shigella hcaD genes can encode truncated products due to premature stop codons or frameshifts, or products with extended core alpha helices that lead to radical alterations in the predicted tertiary structure. Hence, SSRs may serve as genotyping markers for epidemiological investigations, and may offer insights into the evolutionary adaptation of these pathogens.
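
The two ideas above, scanning for SSRs with unit lengths of 1 to 9 nucleotides and the effect of a 5-nt unit such as CGCAG on the reading frame, can be illustrated with a minimal Python sketch. This is not the authors' method: the function names `find_ssrs` and `preserves_frame`, the regular-expression approach, and the `min_copies=3` threshold are all assumptions made for illustration, since the abstract does not state how repeats were detected or what copy-number cutoff was used.

```python
import re

def find_ssrs(seq, min_unit=1, max_unit=9, min_copies=3):
    """Return sorted (start, unit, copies) tuples for perfect tandem repeats.

    min_copies is a hypothetical threshold; the abstract does not give one.
    """
    seq = seq.upper()
    hits = []
    for k in range(min_unit, max_unit + 1):
        # A unit of length k followed by at least (min_copies - 1) more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_copies - 1))
        for m in pattern.finditer(seq):
            unit = m.group(1)
            # Skip units that are themselves repeats of a shorter unit
            # (e.g. "AA" is already reported as a run of "A").
            if any(k % d == 0 and unit == unit[:d] * (k // d)
                   for d in range(1, k)):
                continue
            hits.append((m.start(), unit, len(m.group(0)) // k))
    return sorted(hits)

def preserves_frame(unit, copies_changed):
    """True if gaining or losing this many unit copies keeps the codon frame."""
    return (len(unit) * copies_changed) % 3 == 0

# Toy sequence (not real hcaD sequence): four CGCAG copies with short flanks.
toy = "AT" + "CGCAG" * 4 + "TA"
print(find_ssrs(toy))
print(preserves_frame("CGCAG", 1))  # 5 nt inserted: frame is shifted
print(preserves_frame("CGCAG", 3))  # 15 nt inserted: frame is kept
```

Because the unit is 5 nt long, only changes in multiples of three copies keep the downstream codons in frame, which is consistent with the abstract's contrast between frameshifted, truncated products and products with an extended helix. Note that a regex scan reports the leftmost phase of a repeat, so the reported unit can be a rotation of the conventional one (for example GCGCA rather than CGCAG) depending on the flanking bases.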

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics
  • Base Sequence
  • DNA, Bacterial / chemistry
  • DNA, Bacterial / genetics
  • Ferredoxin-NADP Reductase / chemistry
  • Ferredoxin-NADP Reductase / genetics
  • Genetic Variation
  • Genome, Bacterial*
  • Microsatellite Repeats / genetics*
  • Models, Molecular
  • Molecular Sequence Data
  • Mutagenesis, Insertional
  • Polymorphism, Genetic
  • Protein Structure, Secondary
  • Protein Subunits / chemistry
  • Protein Subunits / genetics
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Serotyping
  • Shigella / classification
  • Shigella / genetics*
  • Shigella boydii / genetics
  • Shigella dysenteriae / genetics
  • Shigella flexneri / genetics
  • Shigella sonnei / genetics
  • Species Specificity

Substances

  • Bacterial Proteins
  • DNA, Bacterial
  • Protein Subunits
  • Ferredoxin-NADP Reductase

Associated data

  • GENBANK/AY282807
  • GENBANK/AY282808
  • GENBANK/AY282809
  • GENBANK/AY282810
  • GENBANK/AY282811
  • GENBANK/AY282812
  • GENBANK/AY282813
  • GENBANK/AY282814
  • GENBANK/AY282815
  • GENBANK/AY282816
  • GENBANK/AY282817
  • GENBANK/AY282818
  • GENBANK/AY282819
  • GENBANK/AY282820
  • GENBANK/AY282821
  • GENBANK/AY282822
  • GENBANK/AY282823
  • GENBANK/AY282824
  • GENBANK/AY282825
  • GENBANK/AY282826
  • GENBANK/AY282827
  • GENBANK/AY282828
  • GENBANK/AY282829
  • GENBANK/AY282830
  • GENBANK/AY282831
  • GENBANK/AY282832
  • GENBANK/AY282833
  • GENBANK/AY282834
  • GENBANK/AY282835
  • GENBANK/AY282836
  • GENBANK/AY282837
  • GENBANK/AY282838
  • GENBANK/AY282839
  • GENBANK/AY282840
  • GENBANK/AY282841
  • GENBANK/AY282842
  • GENBANK/AY282843
  • GENBANK/AY282844
  • GENBANK/AY282845
  • GENBANK/AY282846
  • GENBANK/AY282847
  • GENBANK/AY282848
  • GENBANK/AY282849
  • GENBANK/AY282850
  • GENBANK/AY282851
  • GENBANK/AY282852
  • GENBANK/AY282853
  • GENBANK/AY282854
  • GENBANK/AY282855
  • GENBANK/AY282856
  • GENBANK/AY282857
  • GENBANK/AY282858
  • GENBANK/AY282859
  • GENBANK/AY282860
  • GENBANK/AY282861
  • GENBANK/AY282862
  • GENBANK/AY282863
  • GENBANK/AY282864
  • GENBANK/AY282865
  • GENBANK/AY282866
  • GENBANK/AY282867
  • GENBANK/AY282868
  • GENBANK/AY282869
  • GENBANK/AY282870
  • GENBANK/AY282871
  • GENBANK/AY282872
  • GENBANK/AY282873
  • GENBANK/AY282874
  • GENBANK/AY282875
  • GENBANK/AY282876
  • GENBANK/AY282877
  • GENBANK/AY282878
  • GENBANK/AY282879
  • GENBANK/AY282880
  • GENBANK/AY282881
  • GENBANK/AY282882
  • GENBANK/AY282883
  • GENBANK/AY282884
  • GENBANK/AY282885
  • GENBANK/AY282886
  • GENBANK/AY282887
  • GENBANK/AY282888
  • GENBANK/AY282889
  • GENBANK/AY282890
  • GENBANK/AY282891
  • GENBANK/AY282892
  • GENBANK/AY282893
  • GENBANK/AY282894
  • GENBANK/AY282895
  • GENBANK/AY282896
  • GENBANK/AY282897
  • GENBANK/AY282898
  • GENBANK/AY282899
  • GENBANK/AY282900
  • GENBANK/AY282901
  • GENBANK/AY282902
  • GENBANK/AY282903
  • GENBANK/AY282904
  • GENBANK/AY282905
  • GENBANK/AY282906
  • GENBANK/AY282907
  • GENBANK/AY282908
  • GENBANK/AY282909
  • GENBANK/AY282910
  • GENBANK/AY282911
  • GENBANK/AY282912
  • GENBANK/AY282913
  • GENBANK/AY282914
  • GENBANK/AY282915
  • GENBANK/AY282916
  • GENBANK/AY282917
  • GENBANK/AY282918
  • GENBANK/AY282919
  • GENBANK/AY282920
  • GENBANK/AY282921