We have cloned and determined the nucleotide (nt) sequence of a 6.5-kb genomic DNA fragment containing the rat MyoD gene (encoding a muscle regulatory factor, MyoD). Mouse fibroblasts transfected with this DNA display a high degree of conversion to a muscle phenotype, suggesting that this genomic clone contains sufficient sequence information to allow the production of the rat MyoD protein in these cells. The 6.5-kb genomic fragment contains the complete coding region of MyoD, distributed over three exons, plus 2.3 kb of 5'-noncoding sequence and 1.4 kb of 3'-noncoding sequence. Based on RNase protection assays, the major transcription start point of MyoD is located 210 nt 5' to a methionine start codon and 26 nt 3' to a TAAATA motif which bears similarity to a consensus recognition sequence (TATA) utilized by eukaryotic RNA polymerase II transcription complexes. The high degree of identity between the amino acid sequence of rat MyoD and the MyoD proteins isolated from other vertebrates indicates that this muscle regulatory protein has been evolutionarily conserved.