Constructing high complexity synthetic libraries of long ORFs using in vitro selection

J Mol Biol. 2000 Mar 24;297(2):309-19. doi: 10.1006/jmbi.2000.3571.

Abstract

We present a method that can significantly increase the complexity of protein libraries used for in vitro or in vivo protein selection experiments. Protein libraries are often encoded by chemically synthesized DNA, in which part of the open reading frame is randomized. There are, however, major obstacles associated with the chemical synthesis of long open reading frames, especially those containing random segments. Insertions and deletions that occur during chemical synthesis cause frameshifts, and stop codons in the random region will cause premature termination. These problems can together greatly reduce the number of full-length synthetic genes in the library. We describe a strategy in which smaller segments of the synthetic open reading frame are selected in vitro using mRNA display for the absence of frameshifts and stop codons. These smaller segments are then ligated together to form combinatorial libraries of long uninterrupted open reading frames. This process can increase the number of full-length open reading frames in libraries by up to two orders of magnitude, resulting in protein libraries with complexities of greater than 10(13). We have used this methodology to generate three types of displayed protein library: a completely random sequence library, a library of concatemerized oligopeptide cassettes with a propensity for forming amphipathic alpha-helical or beta-strand structures, and a library based on one of the most common enzymatic scaffolds, the alpha/beta (TIM) barrel.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Cloning, Molecular / methods*
  • Codon / genetics
  • DNA Ligases / metabolism
  • Frameshift Mutation / genetics
  • Indole-3-Glycerol-Phosphate Synthase / chemistry
  • Indole-3-Glycerol-Phosphate Synthase / genetics
  • Molecular Sequence Data
  • Mutagenesis, Insertional / genetics
  • Oligodeoxyribonucleotides / chemical synthesis
  • Oligodeoxyribonucleotides / genetics
  • Oligopeptides / chemistry*
  • Oligopeptides / genetics*
  • Oligopeptides / isolation & purification
  • Oligopeptides / metabolism
  • Open Reading Frames / genetics*
  • Peptide Library*
  • Protein Biosynthesis / genetics
  • Protein Folding
  • Protein Structure, Secondary
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Random Allocation
  • Recombinant Fusion Proteins / chemistry
  • Recombinant Fusion Proteins / genetics
  • Recombinant Fusion Proteins / isolation & purification
  • Recombinant Fusion Proteins / metabolism
  • Reverse Transcriptase Polymerase Chain Reaction
  • Selection, Genetic
  • Sulfolobus
  • Templates, Genetic

Substances

  • Codon
  • Oligodeoxyribonucleotides
  • Oligopeptides
  • Peptide Library
  • RNA, Messenger
  • Recombinant Fusion Proteins
  • Indole-3-Glycerol-Phosphate Synthase
  • DNA Ligases