Copyright © 1999, The National Academy of Sciences

Evolution

Genomic evolution during a 10,000-generation experiment with bacteria

*Abteilung Mikrobiologie, Biozentrum, CH-4056 Basel, Switzerland; ‡Génomique Bactérienne et Evolution, Université Joseph Fourier, Centre National de la Recherche Scientifique EP2029, Commissariat à l’Energie Atomique-LRC12, F-38041 Grenoble, France; and §Center for Microbial Ecology, Michigan State University, East Lansing, MI 48824

†D.P. and D.S. contributed equally to this paper.

¶To whom reprint requests should be addressed at: Génomique Bactérienne et Evolution, Université Joseph Fourier, 38041 Grenoble, France. e-mail: michel.blot/at/ujf-grenoble.fr.

Edited by John R. Roth, University of Utah, Salt Lake City, UT, and approved February 3, 1999

Received July 21, 1998.

Abstract

Molecular methods are used widely to measure genetic diversity within populations and determine relationships among species. However, it is difficult to observe genomic evolution in action because these dynamics are too slow in most organisms. To overcome this limitation, we sampled genomes from populations of Escherichia coli evolving in the laboratory for 10,000 generations. We analyzed the genomes for restriction fragment length polymorphisms (RFLP) using seven insertion sequences (IS) as probes; most polymorphisms detected by this approach reflect rearrangements (including transpositions) rather than point mutations. The evolving genomes became increasingly different from their ancestor over time. Moreover, tremendous diversity accumulated within each population, such that almost every individual had a different genetic fingerprint after 10,000 generations. As has been often suggested, but not previously shown by experiment, the rates of phenotypic and genomic change were discordant, both across replicate populations and over time within a population.
Certain pivotal mutations were shared by all descendants in a population, and these are candidates for beneficial mutations, which are rare and difficult to find. More generally, these data show that the genome is highly dynamic even over a time scale that is, from an evolutionary perspective, very brief.

Keywords: experimental evolution, molecular evolution, morphological evolution, insertion sequence elements, Escherichia coli

Our collaborative work builds on two previous studies. One examined genomic variation among cells recovered from populations of Escherichia coli that had been stored as a “stab” culture for ≈30 years without renewal of the medium and, hence, with little opportunity for growth (1, 2). A high level of diversity was found using restriction fragment length polymorphism (RFLP) analysis with eight insertion sequence (IS) elements as molecular probes. Clones differed from their putative ancestor by ~12 changes, on average. It was unclear, however, whether the prolonged starvation had an important role in promoting or maintaining this variability and whether the derived bacteria were any better adapted to the storage regime than was their ancestor.

The other study examined the dynamics of phenotypic evolution in populations of E. coli that were propagated by daily serial transfer for 1,500 days, yielding 10,000 generations of binary fission (3, 4). The fitness of the bacteria improved by ~50%, on average, relative to the ancestor, and other phenotypic properties, such as cell size, also underwent large changes. The rate of phenotypic evolution was very fast during the initial 2,000 generations, but much slower during the subsequent 8,000 generations. However, certain other issues were not addressed, including the extent of genomic change and whether the rate of molecular evolution decelerated in parallel with phenotypic evolution.
At one extreme, the derived genotypes may differ from their ancestor by only a small number of point mutations, which would require very extensive DNA sequencing to discover. At the other extreme, the derived lines may have undergone many chromosomal rearrangements (transpositions, inversions, deletions, etc.), in which case genomic evolution should be detected easily by RFLP analysis using IS elements as probes. The present paper combines the methods of the first study with the populations from the second study to address these issues.

MATERIALS AND METHODS

Bacterial Strains. Twelve populations of E. coli B were founded from a common ancestor and serially propagated for 10,000 generations (1,500 days) in a glucose-limited minimal medium (3, 4). Samples from each population were obtained at 500-generation intervals and stored at −80°C. The ancestral strain has no functional viruses or plasmids, and it therefore is strictly asexual (clonal). This study examines the common ancestor and 7–20 clones randomly chosen after 500, 1,000, 1,500, 2,000, 5,000, 8,000, and 10,000 generations from two populations, designated Ara−1 and Ara+1. Each of these two populations retained point-mutation rates similar to that of their ancestor, unlike some of the other populations (5). We also examined 9–10 clones randomly sampled from each of the 10 other populations after 10,000 generations only. Three of these populations had spontaneously acquired mutator phenotypes by becoming defective in their methyl-directed mismatch repair pathways (5).

DNA Preparation and Hybridization. Molecular methods were described previously (1, 2). Briefly, clones were grown in LB medium and their genomic DNA was harvested by using standard methods. DNA was digested with EcoRV, and the resulting fragments (10³−10⁴) were separated by electrophoresis.
The DNA fragments then were transferred to a nylon membrane, and Southern blot hybridizations were performed using internal pieces of the IS elements as probes; these internal pieces all lacked EcoRV restriction sites. Every clone was scored for the presence or absence of each fragment that hybridized with a particular IS probe. Ambiguous fragments of similar size usually were resolved by running the relevant clones in parallel and, otherwise, were scored conservatively.

Phylogenetic Methods. Phylogenies were constructed by using a parsimony method in which the roots of the trees were forced to be the actual common ancestor (6). The purpose of the phylogenies is to illustrate the divergence of the clones from their ancestor and from one another, rather than to make probabilistic statements about the validity of specific groupings. Hence, we did not weight losses and gains of IS elements differently, nor did we attempt to adjust for the fact that certain genetic events may cause the simultaneous loss of one band and gain of another. All genetic distances are calculated as the total number of differences between any pair of clones (including the ancestor).

RESULTS

Phylogenetic Relationships Among Clones Based on RFLP. The ancestral genome contains one or more copies of seven IS elements: IS1, IS2, IS3, IS4, IS30, IS150, and IS186 (IS5 is absent from the strain used in this study). Numerous clones were chosen randomly from frozen samples of two evolving populations, designated Ara−1 and Ara+1, obtained at several time points. Each clone’s genomic DNA was hybridized successively with probes for each IS element. The RFLP is a genetic “fingerprint” that consists of the presence or absence of each fragment that hybridizes with an IS probe. These data were used to compute genetic distances among clones and reconstruct clonal phylogenies by parsimony methods (6). It is important to bear in mind that the bacteria in this experiment were strictly asexual (3).
Therefore, a mutation may have reached high frequency either because it conferred a selective advantage or because it “hitchhiked” with another mutation that was beneficial (7–9). However, any mutation was very unlikely to have reached high frequency solely by genetic drift; the expected number of generations for a new neutral mutation to drift to fixation is of the same magnitude as the population size (10), which was >10⁶ even at the bottleneck during transfer (3, 4).

Another possibility is that some mutations might have become common by recurrent insertions into “hot spots” for such events. If that were the case, then one would expect extensive convergence between the Ara−1 and Ara+1 populations. Using the 10,000-generation samples, we calculated an index of divergence as the observed genetic distance between the populations divided by the distance expected if all their mutations were unique. We obtain an index of 0.88, which indicates that only 12% of their evolutionary changes are convergent. (The actual convergence may be even less, because apparent convergence can arise from imprecision in distinguishing between fragments of similar size. Further molecular analysis will be needed to resolve these cases.)

[Fig. 1]
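The index of divergence described above can be illustrated with a short Python sketch. It is not the authors' procedure. It assumes that each fingerprint is a set of band labels, that the distance between two populations is the mean pairwise distance between their clones, and that the distance expected with no shared mutations is the sum of each population's mean distance to the ancestor. The band labels, toy data, and function names are hypothetical.

```python
from itertools import product


def distance(fp_a, fp_b):
    """Genetic distance: number of bands present in exactly one fingerprint."""
    return len(set(fp_a) ^ set(fp_b))


def mean_distance(clones_a, clones_b):
    """Mean pairwise distance between two collections of fingerprints."""
    pairs = list(product(clones_a, clones_b))
    return sum(distance(a, b) for a, b in pairs) / len(pairs)


def divergence_index(ancestor, pop_a, pop_b):
    """Observed between-population distance divided by the distance
    expected if every change in each population were unique (assumed
    here to be the sum of the two mean distances to the ancestor)."""
    observed = mean_distance(pop_a, pop_b)
    expected = mean_distance(pop_a, [ancestor]) + mean_distance(pop_b, [ancestor])
    return observed / expected


# Toy data (hypothetical band labels, one clone per population).
ancestor = {"b1", "b2", "b3", "b4"}
pop_a = [{"b1", "b2", "b3", "b5", "b6"}]        # lost b4, gained b5 and b6
pop_b = [{"b1", "b2", "b3", "b4", "b6", "b7"}]  # gained b6 and b7

print(divergence_index(ancestor, pop_a, pop_b))
```

In this toy case the two populations share one change (the gain of band b6), so the index falls below 1; with no shared changes it equals 1.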
Second, despite the continued persistence of the ancestral fingerprint, many genomic rearrangements also were observed during this initial phase. Moreover, some of these were quite successful, at least transiently. For example, in Ara+1 (Fig. 1 […]

Third, as a consequence of selection and competition among beneficial mutations, the phylogeny appears not as a fat bush, comprising several roughly equal branches, but rather as a slender tree in which all of the side branches eventually end, leaving one main trunk (13). Along this trunk lie a succession of pivotal genotypes that, like mitochondrial “Eve” in human evolution (14, 15), are ancestral to all individuals that subsequently were sampled. Indeed, several of these pivotal genotypes (these bacterial “Eves”) were actually present in our samples, for example, clones 1500.07 and 2000.09 in Ara+1 (Fig. 1 […]

Discrepancies Between Rates of Genomic and Phenotypic Evolution.

[Fig. 2]
It is therefore also interesting to compare and contrast the temporal dynamics of genomic and phenotypic evolution within each of the populations. To that end, we calculated rates of evolutionary change, expressed per generation, for three traits over two periods of the experimental evolution. Table 1 shows that performance and morphological traits (fitness and cell size, respectively) changed much more rapidly during the first 2,000 generations than the subsequent 8,000 generations, in both focal populations. The same pattern for both of these traits was seen in all 12 replicate populations (4). By contrast, no deceleration was seen in the rate of genomic evolution in either focal population (Table 1). Given that the rates of phenotypic evolution decelerated in all 12 of the replicate populations, whereas the rate of IS-associated genomic evolution did not decelerate in either focal population, a Fisher’s exact test indicates that this difference is significant (two-tailed P = 0.0110).
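The Fisher's exact test reported above can be checked with a few lines of Python. This sketch assumes the 2 × 2 table is laid out as 12 populations with decelerating phenotypic evolution and 0 without, against 0 focal populations with decelerating genomic evolution and 2 without; the function name is hypothetical, and the two-tailed P value is computed by summing the probabilities of all tables no more likely than the observed one.

```python
from math import comb


def fisher_exact_two_tailed(a, b, c, d):
    """Two-tailed Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins whose probability does not exceed that of the observed table.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Probability that the top-left cell equals x, given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Rows: phenotypic evolution (12 populations), genomic evolution (2 focal populations).
# Columns: rate decelerated, rate did not decelerate.
p_value = fisher_exact_two_tailed(12, 0, 0, 2)
print(f"{p_value:.4f}")
```

With these margins only three tables are possible, and the observed table is the least probable of them, so the two-tailed P value is 1/91, in agreement with the reported 0.0110.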
The continued persistence of the ancestral IS fingerprint for 2,000 generations (Fig. 1) […]

In summary, there are conspicuous and significant discrepancies between the rates of genomic and phenotypic evolution, both across the replicate populations and over time within each population.

Variation Among Clones Within Each Population.

[Fig. 3]
Yet, the increase in diversity was not monotonic. The variation in Ara−1 declined by half from generation 1,000 to 1,500, and it fell even more sharply in Ara+1 between 5,000 and 8,000 generations. To test the statistical significance of these declines, we first calculated each individual clone’s average pairwise difference from all of the individuals within the sample, so that the degrees of freedom correspond to the number of independent observations. We then compared these difference scores between consecutive samples. Both declines were highly significant based on two-tailed Mann–Whitney U tests (P = 0.0015 and P < 0.0001, respectively); both remain significant (P < 0.01) even after performing a Bonferroni correction (19) to adjust for the fact that each time series includes seven points that would allow six such comparisons between consecutive samples. These temporary reversals presumably reflect the variation-purging effect of the substitution of beneficial mutations in asexual populations (7, 12, 16, 17). In the absence of selective sweeps, genetic diversity in a population founded from a single clone should increase monotonically to a quasi-equilibrium that reflects the joint balance between mutation, selection against deleterious mutations, and random genetic drift (20–23).

Dynamical Behaviors of the Various IS Elements. Table 2 shows that the seven IS elements had very different dynamical behaviors. Moreover, the same element sometimes behaved quite differently in the two focal populations. The three IS elements with single copies (IS2, IS4, and IS30) were completely stable, retaining the same copy number and physical location in every derived clone as in the ancestor. Interestingly, based on the distribution of IS elements among natural isolates of E. coli, Sawyer et al. (24) suggested that IS2, IS4, and IS30 may have mechanisms that repress transposition. Such mechanisms could contribute to their observed stability in this study.
IS3 was only slightly less stable, with occasional clones from each population yielding slightly different fingerprints (including, in Ara+1, slight variation in copy number in certain generations).
By contrast, IS1, IS150, and IS186 underwent many changes in each focal population. The average copy number after 10,000 generations for IS1 was close to the ancestral value of 20 in Ara−1, but it declined to ~17.5 in Ara+1. IS186 experienced a small increase in copy number in both populations, from 5 in the ancestor to ~6.5 at generation 10,000. IS150 behaved similarly to IS186 in Ara−1, showing a slight increase in copy number from 5 to a final average of ~6.5. By contrast, IS150 was much more active in Ara+1, where its average copy number more than tripled to ~16.5 after 10,000 generations. This difference in the dynamics of IS150 between the two focal populations had nothing to do with the Ara marker. A survey of its copy number in clones sampled at 10,000 generations from all 12 replicate populations (six Ara− and six Ara+) reveals substantial variability, with the average copy number ranging from ~4.8 to ~16.5, but there is no statistical association with the Ara marker state (t = 0.547, two-tailed P = 0.5965). The reason for the greater IS150 activity in Ara+1 is presently unknown. It might indicate increased transposition rate because of changes in either the IS itself or the bacterial chromosome, or it might reflect reduced selection on copy number for this IS in certain genetic backgrounds (25). In any case, this “burst” of IS150 activity in Ara+1 accounts for ~60% of the difference between the two focal populations in their genetic distances to the ancestor after 10,000 generations (Fig. 2 […]

Effect of Mutator Status on Genomic Divergence from the Ancestor. Both focal populations retained point-mutation rates similar to their ancestor. However, 3 of the 12 replicate populations acquired mutator phenotypes because of the fixation of mutations in genes in the methyl-directed mismatch repair pathway; these defects caused ~100-fold increases in the rate of point mutation (5).
In two of the three populations that became mutators, the change occurred in the first 3,000 generations of the experimental evolution (5), thus providing ample time for any effect of the mutator phenotype on genomic divergence to be manifest.

[Fig. 4]
DISCUSSION

Our results demonstrate that these experimental populations of E. coli underwent rapid molecular evolution, leading to extensive changes in their genome structure, during ~4 years of adaptation to an environment in which they received nutrients every day. Divergence from the ancestor increased over time, as did genetic diversity within each population. The amounts of evolutionary divergence and genetic diversity were roughly similar to those seen after ~30 years of storage without any nutrient inputs (1, 2). Therefore, we conclude that constant, long-term starvation was not necessary to either substantially restructure the genome or to maintain a high level of genetic diversity, contrary to previous suggestions (1, 2, 26–28).

By contrast, selective sweeps of beneficial mutations evidently had important effects on the dynamics of genome evolution. Adaptive evolution was much faster in the initial phase of this experiment (Table 1), whereas the genetic diversity within populations reached its highest levels only after adaptive evolution had slowed significantly (Fig. 3).

At the same time, linkage complicates the interpretation of the adaptive significance of any particular mutation. Were the IS-associated mutations that we detected merely passive “markers” or were they active “motors” in the adaptive evolution of these populations (25, 29)? The phylogenies shown in Fig. 1 […]

The derived populations became increasingly different from their common ancestor over time, both phenotypically and genetically; in that trivial sense, phenotypic and genomic evolution were concordant. However, we also showed that rates of phenotypic and genomic change were discordant in two important respects. First, populations that underwent similar fitness gains differed almost 3-fold in their rates of genomic evolution (Fig. 2) […]

The only other experiment that sought to examine directly the concordance between phenotypic and genomic change was the recent study by Bull et al. (35). They propagated several replicate lines of the bacteriophage ΦX174 for several weeks under novel growth conditions. They then measured the changes in viral growth rate, and they also sequenced the entire 5.4-kb genome of the ancestral and derived genotypes. The rate of fitness gain in the evolving virus populations decelerated significantly over time. They also saw a significant deceleration in the rate of nucleotide substitution over time. Thus, Bull et al. (35) did not observe the same qualitative discrepancy between rates of phenotypic and genomic change that we saw with E. coli (Table 1). However, it appears from their viral data that the deceleration in the rate of fitness improvement was much more pronounced than was the deceleration in the rate of nucleotide substitution, which may indicate that the quantitative trend is similar in the two studies. Of course, there are important differences between these studies, including the organisms, duration of the experimental evolution, and methods to detect genomic changes.

The direct sequencing of entire E. coli genomes in a study such as ours is unfeasible. However, Michael Travisano (University of Houston, personal communication) sequenced more than 1,000 bp in clones sampled from each of the 12 populations after 2,000 generations. He found no mutations whatsoever from the ancestral sequence among the 15,552 total bp sequenced. The regions he sequenced include promoters for ptsHI, crr, fruR, and cya; they were chosen based on physiological evidence that regulatory changes in the phosphotransferase system might be responsible for some of the genetic adaptation to the glucose-limited selective environment (36).
We have shown that IS elements are efficient tools for monitoring genomic changes in these evolving populations, including both divergence from the ancestral state (Figs. 1 […]

Our future work will be directed toward identifying the molecular basis of the observed genomic changes, especially those pivotal mutations that are shared by all descendants within a population. We then will perform genetic manipulations to construct strains that are isogenic except for a pivotal mutation. By measuring the relative fitness of these constructs, we can determine which pivotal mutations simply hitchhiked with some (still unknown) beneficial mutation and which ones encode beneficial phenotypes that were selected during 10,000 generations of experimental evolution.

Acknowledgments

We thank M. Travisano for sharing unpublished data; L. J. Forney, P. J. Gerrish, D. E. Rozen, and H. B. Shaffer for valuable discussion; and three anonymous reviewers for helpful comments. This study was supported by the Swiss Priority Program on Biodiversity to W.A. and M.B., by the Commissariat à l’Energie Atomique and an Action Thématique et Incitative sur Programme et Equipe from Centre National de la Recherche Scientifique to M.B., and by a National Science Foundation grant and a fellowship from the MacArthur Foundation to R.E.L.
Footnotes

This paper was submitted directly (Track II) to the Proceedings Office.

References

1. Naas T, Blot M, Fitch W M, Arber W. Genetics. 1994;136:721–730.
2. Naas T, Blot M, Fitch W M, Arber W. Mol Biol Evol. 1995;12:198–207.
3. Lenski R E, Rose M R, Simpson S C, Tadler S C. Am Nat. 1991;138:1315–1341.
4. Lenski R E, Travisano M. Proc Natl Acad Sci USA. 1994;91:6808–6814.
5. Sniegowski P D, Gerrish P J, Lenski R E. Nature (London). 1997;387:703–705.
6. Swofford D L. PAUP: Phylogenetic Analysis Using Parsimony, Version 3.1.1. Champaign, IL: Illinois Natural History Survey; 1993.
7. Maynard Smith J, Haigh J. Genet Res. 1974;23:23–35.
8. Kaplan N L, Hudson R R, Langley C H. Genetics. 1989;123:887–899.
9. Begun D J, Aquadro C F. Nature (London). 1992;356:519–520.
10. Kimura M, Ohta T. Genetics. 1969;61:763–771.
11. Elena S F, Cooper V S, Lenski R E. Science. 1996;272:1802–1804.
12. Gerrish P J, Lenski R E. Genetica. 1998;102:127–144.
13. Fitch W M, Bush R M, Bender C A, Cox N J. Proc Natl Acad Sci USA. 1997;94:7712–7718.
14. Cann R L, Stoneking M, Wilson A C. Nature (London). 1987;325:31–36.
15. Ayala F J. Science. 1995;270:1930–1936.
16. Atwood K C, Schneider L K, Ryan F J. Proc Natl Acad Sci USA. 1951;37:146–155.
17. Dykhuizen D E. In: Encyclopedia of Microbiology. Lederberg J, editor. Vol. 3. San Diego: Academic; 1992. pp. 351–355.
18. Gerrish P J. Ph.D. dissertation. East Lansing: Michigan State University; 1998.
19. Rice W R. Evolution. 1989;43:223–225.
20. Kimura M, Crow J F. Genetics. 1964;49:725–738.
21. Ewens W J. Theor Pop Biol. 1972;3:87–112.
22. Ohta T. Theor Pop Biol. 1976;10:254–275.
23. Charlesworth B. Genet Res. 1990;55:199–221.
24. Sawyer S A, Dykhuizen D E, DuBose R F, Green L, Mutangadura-Mhlanga T, Wolczyk D F, Hartl D L. Genetics. 1987;115:51–63.
25. Kidwell M, Lisch D. Proc Natl Acad Sci USA. 1997;94:7704–7711.
26. Shapiro J A, Higgins N P. J Bacteriol. 1989;171:5975–5986.
27. Hall B G. Genetics. 1990;126:5–16.
28. Rainey P B, Moxon E R, Thompson I P. Adv Microb Ecol. 1993;13:263–300.
29. Blot M. Genetica. 1994;93:5–12.
30. Wilson A C, Carlson S S, White T J. Annu Rev Biochem. 1977;46:573–639.
31. Shaffer H B, Clark J M, Kraus F. Syst Zool. 1991;40:284–303.
32. Avise J C. Molecular Markers, Natural History and Evolution. New York: Chapman & Hall; 1994.
33. Wallis M. J Mol Evol. 1996;43:93–100.
34. Omland K E. Evolution. 1997;51:1381–1393.
35. Bull J J, Badgett M R, Wichman H A, Huelsenbeck J P, Hillis D M, Gulati A, Ho C, Molineux I J. Genetics. 1997;147:1497–1507.
36. Travisano M, Lenski R E. Genetics. 1996;143:15–26.