Pairing between the normal (keto) forms of the bases.
Spontaneous mutations arise from a variety of sources, including errors in DNA replication, spontaneous lesions, and transposable genetic elements. The first two are considered in this section; the third is examined in Chapter 20.
An error in DNA replication can occur when an illegitimate nucleotide pair (say, A–C) forms in DNA synthesis, leading to a base substitution.
Pairing between the normal (keto) forms of the bases.
Mismatched bases. (a) Mispairs resulting from rare tautomeric forms of the pyrimidines; (b) mispairs resulting from rare tautomeric forms of the purines.
Mispairs can also result when one of the bases becomes ionized. This type of mispair may occur more frequently than mispairs due to imino and enol forms of bases.


Mutation by tautomeric shifts in the bases of DNA. (a) In the example diagrammed, a guanine undergoes a tautomeric shift to its rare enol form (G*) at the time of replication. (b) In its enol form, it pairs with thymine. (c and d) In the next replication, the guanine shifts back to its more stable keto form. The thymine incorporated opposite the enol form of guanine, seen in part b, directs the incorporation of adenine in the subsequent replication, shown in parts c and d. The net result is a GC → AT mutation. If a guanine undergoes a tautomeric shift from the common keto form to the rare enol form at the time of incorporation (as a nucleoside triphosphate, rather than in the template strand diagrammed here), it will be incorporated opposite thymine in the template strand and cause an AT → GC mutation. (From E. J. Gardner and D. P. Snustad, Principles of Genetics, 5th ed. Copyright © 1984 by John Wiley & Sons, New York.)




Replication errors can also lead to frameshift mutations. Recall from Chapter 10 that such mutations result in greatly altered proteins.
A simplified version of the Streisinger model for frameshift formation. (a–c) In DNA synthesis, the newly synthesized strand slips, looping out one or several bases. This loop is stabilized by the pairing afforded by the repetitive-sequence unit (the A bases in this case). An addition of one base pair, A–T, will result at the next round of replication in this example. (d–f) If, instead of the newly synthesized strand, the template strand slips, then a deletion results. Here the repeating unit is a CT dinucleotide. After slippage, a deletion of two base pairs (C–G and T–A) would result at the next round of replication.
The distribution of 140 spontaneous mutations in lacI. Each occurrence of a point mutation is indicated by a box. Red boxes designate fast-reverting mutations. Deletions (gold) are represented below. The I map is given in terms of the amino acid number in the corresponding I-encoded lac repressor. Allele numbers refer to mutations that have been analyzed at the DNA sequence level. The mutations S114 and S58 (circles) result from the insertion of transposable elements (see Chapter 20). S28 (red circle) is a duplication of 88 base pairs. (From P. J. Farabaugh, U. Schmeissner, M. Hofer, and J. H. Miller, Journal of Molecular Biology 126, 1978, 847.)

The major hot spot, represented here by the mutations FS5, FS25, FS45, and FS65, results from the addition of one extra set of the four bases CTGG to one strand of the DNA. This hot spot reverts at a high rate, losing the extra set of four bases. The minor hot spot, represented here by the mutations FS2 and FS84, results from the loss of one set of the four bases CTGG. This mutant does not readily regain the lost set of four base pairs.


Deletions in lacI. Deletions occurring in S74 and S112 are shown at the top of the figure. As indicated by the gold bars, one of the sequence repeats (aqua) and all the intervening DNA are deleted, leaving one copy of the repeated sequence. All mutations were analyzed by direct DNA sequence determination. (From P. J. Farabaugh, U. Schmeissner, M. Hofer, and J. H. Miller, Journal of Molecular Biology 126, 1978, 847.)
In addition to replication errors, spontaneous lesions, naturally occurring damage to the DNA, can generate mutations. Two of the most frequent spontaneous lesions result from depurination and deamination.
The loss of a purine residue (guanine) from a single strand of DNA. The sugar-phosphate backbone is left intact.
Deamination of (a) cytosine and (b) 5-methylcytosine.
5-Methylcytosine hot spots in E. coli. Nonsense mutations occurring at 15 different sites in lacI were scored. All result from the GC → AT transition. The asterisks (*) mark the positions of 5-methylcytosines. Open bars depict sites at which the GC → AT change could be detected but at which no mutations occurred in this particular collection. It can be seen that 5-methylcytosine residues are hot spots for the GC → AT transition. Of 50 independently occurring mutations, 44 were at the 4 methylated cytosine sites and only 6 were at the 11 unmethylated cytosines. (From C. Coulondre et al., Nature 274, 1978, 775.).
A consequence of the frequent mutation of 5-methylcytosine to thymine is the underrepresentation of CpG dinucleotides in higher cells, because this sequence is methylated to give 5-methyl-CpG, which is gradually converted into TpG.
DNA damage products formed after attack by oxygen radicals. dR = deoxyribose.
Spontaneous mutations can be generated by different processes. Replication errors and spontaneous lesions generate most of the base-substitution and frameshift mutations. Replication errors may also cause some deletions that occur in the absence of mutagenic treatment.
DNA sequence analysis has revealed the mutations responsible for a number of human hereditary diseases. The previously discussed studies of bacterial mutations allow us to suggest mechanisms that cause these human disorders.
Sequences of wild-type (WT) mitochondrial DNA and deleted DNA (KS) from a patient with Kearns-Sayre syndrome. The 13-base boxed sequence is identical in both WT and KS and serves as a breakpoint for the DNA deletion. A single base (boldface type) is altered in KS, aside from the deleted segment.
Expansion of the CGG triplet in the FMR-1 gene seen in the fragile X syndrome. Normal persons have from 6 to 54 copies of the CGG repeat, whereas members of susceptible families display an increase (premutation) in the number of repeats: normally transmitting males (NTMs) and their daughters are phenotypically normal but display from 50 to 200 copies of the CGG triplet; the number of repeats expands to some 200 to 1300 in those showing full symptoms of the disease.
The inheritance of fragile X syndrome is unusual in that 20 percent of the males with a fragile X chromosome are phenotypically normal but transmit the affected chromosome to their daughters, who also appear normal. These males are said to be normally transmitting males (NTMs). However, the sons of the daughters of the NTMs frequently display symptoms. The fragile X syndrome results from mutations in a (CGG)n repeat in the coding sequence of the FMR-1 gene. Patients with the disease show specific methylation, induced by the mutation, at a nearby CpG cluster, resulting in reduced FMR-1 expression.
Why do symptoms develop in some persons with a fragile X chromosome and not in others? The answer seems to lie in the number of CGG repeats in the FMR-1 gene. Humans normally show a considerable variation in the number of CGG repeats in the FMR-1 gene, ranging from 6 to 54, with 29 repeats in the most frequent allele. [The variation in CGG repeats produces a corresponding variation in the number of arginine residues (CGG is an arginine codon) in the FMR-1-encoded protein.] Both NTMs and their daughters have a much larger number of repeats, ranging from 50 to 200. These increased repeats have been termed premutations. All premutation alleles are unstable. The males and females with symptoms of the disease, as well as many carrier females, have additional insertions of DNA, suggesting repeat numbers of 200 to 1300. The frequency of expansion has been shown to increase with the size of the DNA insertion (and thus, presumably, with the number of repeats). Apparently, the number of repeats in the premutation alleles found in NTMs and their daughters is above a certain threshold and thus is much more likely to expand to a full mutation than is the case for normal persons.
A second inherited disease, X-linked spinal and bulbar muscular atrophy (known as Kennedy disease), also results from the amplification of a three-base-pair repeat, in this case a repeat of the CAG triplet. Kennedy disease, which is characterized by progressive muscle weakness and atrophy, results from mutations in the gene that encodes the androgen receptor. Normal persons have an average of 21 CAG repeats in this gene, whereas affected patients have repeats ranging from 40 to 52.
Myotonic dystrophy, the most common form of adult muscular dystrophy, is yet another example of sequence expansion causing a human disease. Susceptible families display an increase in severity of the disease in successive generations; this increase is caused by the progressive amplification of a CTG triplet at the 3′ end of a transcript. Normal people possess, on average, five copies of the CTG repeat; mildly affected people have approximately 50 copies, and severely affected people have more than 1000 repeats of the CTG triplet. Additional examples of triplet expansion are still appearing—for instance, Huntington disease.