![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2007 by The National Academy of Sciences of the USA Ecology Genome dynamics in a natural archaeal population Jillian F. Banfield, Departments of *Environmental Science, Policy, and Management and ; ‖Earth and Planetary Science, University of California, Berkeley, CA 94720; and ; ¶Department of Energy Joint Genome Institute, Walnut Creek, CA 94598; **To whom correspondence should be addressed. E-mail: jill/at/eps.berkeley.edu Edited by James M. Tiedje, Michigan State University, East Lansing, MI, and approved November 30, 2006 Author contributions: E.E.A. and J.F.B. designed research; E.E.A., G.W.T., J.C.D., and P.M.R. performed research; E.E.A., G.W.T., R.J.W., and J.F.B. analyzed data; and E.E.A. and J.F.B. wrote the paper. †Present address: Scripps Institution of Oceanography, University of California at San Diego, La Jolla, CA 92093. ‡Present address: Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139. §Present address: Department of Microbiology, University of Illinois, Urbana, IL 61801. Received June 9, 2006. This article has been cited by other articles in PMC.Abstract Evolutionary processes that give rise to, and limit, diversification within strain populations can be deduced from the form and distribution of genomic heterogeneity. The extent of genomic change that distinguishes the acidophilic archaeon Ferroplasma acidarmanus fer1 from an environmental population of the same species from the same site, fer1(env), was determined by comparing the 1.94-megabase (Mb) genome sequence of the isolate with that reconstructed from 8 Mb of environmental sequence data. The fer1(env) composite sequence sampled ≈92% of the isolate genome. Environmental sequence data were also analyzed to reveal genomic heterogeneity within the coexisting, coevolving fer1(env) population. Analyses revealed that transposase movement and the insertion and loss of blocks of novel genes of probable phage origin occur rapidly enough to give rise to heterogeneity in gene content within the local population. Because the environmental DNA was derived from many closely related individuals, it was possible to quantify gene sequence variability within the population. All but a few gene variants show evidence of strong purifying selection. Based on the small number of distinct sequence types and their distribution, we infer that the population is undergoing frequent genetic recombination, resulting in a mosaic genome pool that is shaped by selection. The larger genetic potential of the population relative to individuals within it and the combinatorial process that results in many closely related genome types may provide the basis for adaptation to environmental fluctuations. Keywords: Archaea, Ferroplasma, population genomics, recombination The sequencing of microbial isolate genomes has enabled comparison of gene complements from a broad selection of bacterial and archaeal species. Comparative genomic explorations lend insight into the metabolic and ecological diversity of microbial taxa and provide us with the raw materials for inferring evolutionary history among lineages. However, in most cases only a single representative of a particular lineage has been analyzed genomically. Where multiple strain genomes are available, the organisms are typically pathogens, and the strains are derived from distinct localities (1–5). Lacking are comparative population genomic studies that include analysis of an isolate versus an environmental population of the same species obtained from the same site. Such data would enable evaluation of how representative of the population an individual is and would reveal the form and distribution of genetic variability that can give rise to the emergence of new species. Genetic techniques such as multilocus sequence typing (MLST) (6, 7) of multiple individuals derived from a microbial population can provide a glimpse into the extent of genetic diversity within a natural population and insight into important processes that shape population structure. However, MLST requires isolation of the organisms under investigation and typically surveys only a small (6 to 10) number of loci. Consequently, MLST approaches are limited in the extent to which they can be applied to a wide variety of organisms, and extrapolation of such data to the whole-genome scale may be limited. Similarly, the utility of DNA microarray technologies to survey genetic diversity among closely related strains or environmental populations is inherently limited owing to the fact that only genes shared between reference and test strain(s) can be surveyed (8). Missing from such analyses are genes unique to the interrogated strains and the inability to extract syntenic information. An alternative route to assess genetic diversity and reveal microevolutionary processes within natural microbial populations involves the cultivation-independent analysis of sequences from many individuals obtained from environmentally derived shotgun sequence data (9, 10). At the population level, comparison of reconstructed genome fragments and the associated raw sequence data can reveal genomic regions that are invariant as well as those regions that possess heterogeneity. In the present study, we reconstructed the genome sequence of an isolate, Ferroplasma acidarmanus fer1, and compared it with ≈103 megabases (Mb) of environmental sequence data obtained directly from a natural biofilm community from the same site. Both the isolate (collected in July 1997) (11) and the environmental biofilm (≈1-cm3 sample collected in March 2002) (9) were recovered from the five-way region of the Richmond Mine at Iron Mountain in Northern California. F. acidarmanus fer1 is an extremely acidophilic ferrous iron [Fe(II)]-oxidizing Euryarchaeote (order Thermoplasmatales) implicated in acid mine drainage (AMD) generation (11). Isolation of fer1 and description of its habitat have been reported (11). The fer1 strain grows optimally at 42°C, pH 1.2, and is capable of growth at pH 0. It is closely related to Ferroplasma acidiphilum (12, 13), based on comparison of 16S rRNA gene sequences (99.8% nucleotide similarity). Detailed phenotypic characterization of Ferroplasma species, including F. acidarmanus fer1, has been reported (12). To date, fer1 represents the only archaeal isolate recovered from the Richmond Mine. Here, we investigate the structure and genomic diversity of an environmental population of F. acidarmanus, herein referred to as fer1(env), making use of the fer1 isolate genome sequence as a reference. Although both fer1 and the fer1(env) population were obtained from the same site, it cannot be claimed that the fer1 isolate is a member of the fer1(env) population because it was collected >4 years earlier. However, because both fer1 and the fer1(env) population had a very recent common ancestor, we can use the fer1 isolate genome in a comparative approach to quantify genome-wide heterogeneity and deduce the relative rates of the various modes of genome evolution. The approach taken here, in which an isolate genome sequence is used as a reference to which a sequence from a closely related environmental population is compared, provides an innovative route to address population-level ecological and evolutionary questions. Results and Discussion Isolate Versus Population Composite Genome Analysis. The fer1 isolate genome sequence was assembled from data obtained from small and large insert libraries. General features of the 1.94-Mb genome, classification of the 1,963 ORFs by functional category, and a metabolic reconstruction are reported in supporting information (SI) Table 3 and SI Figs. 6 and 7. Composite scaffolds and contigs from the fer1(env) population were assembled independently of the fer1 isolate sequence. Assembled fragments were ordered and aligned by using the fer1 genome as a reference sequence to reconstruct ≈92% (1,792 genes) of the fer1 isolate genome (Fig. 1
Fig. 1
Movement of insertion sequence (IS) elements also occurs fast enough to give rise to strain-level differences (Fig. 1 Genomic heterogeneity between fer1 and the composite fer1(env) sequence was quantified by determining the number of differences that occur across comparable genomic regions (Table 1). The average nucleotide-level divergence between fer1 and fer1(env) is ≈3%. However, ≈50% of proteins encoded by the composite fer1(env) sequence are identical to the orthologous isolate sequences. The average amino acid-level similarity among orthologs is 99.16%. Overall, blocks of strain-specific genes localized to integrated elements represent the largest input of novel genetic potential within the population. The gain or loss of single or a few genes (mostly hypothetical) is the second most significant source of strain heterogeneity. In contrast, gene duplications (paralogous sequences) are very infrequent events and gene order is almost completely preserved.
Environmental Genotypic Heterogeneity. Comparison among composite genome scaffolds revealed that coexisting members of the fer1(env) population exhibit differences in gene content. When a subset of individuals contain genes not present in all population members, the same genomic region will be assembled into two or more discrete fragments (10). The genes most commonly responsible for this phenomenon are transposases. In addition to documenting their presence/absence in composite genome sequences, the insertion of transposases into a subset of population members is evidenced by the localized “stretching” of mate-pairs (paired end reads) across genomic regions. Integrated elements also contribute to genetic heterogeneity between fer1(env) members. Fig. 2 Analyses presented above rely upon comparison between the fer1 isolate and composite fer1(env) sequences. However, greater resolution into the fer1(env) population structure can be achieved because each environmentally derived sequencing read likely originated from a different individual within the natural population (9). We evaluated sequence variability in the population on a gene-by-gene basis by aligning all reads constituting the fer1(env) population with the nucleotide sequence of fer1 (average coverage at each locus is ≈4.5×). Overall, 56% of the genes/gene fragments encoded on fer1(env) reads (out of a total of ≈14,000 reads) have the fer1 sequence type (defined as having ≥99.9% nucleotide-level similarity to the fer1 ortholog). At least one read has the fer1 sequence type at ≈80% of loci (see gene content dendogram, SI Fig. 9). We categorized all gene types in the fer1(env) population as either homogeneous (one sequence type only, 30.4%) or heterogeneous (multiple variants at one locus, 69.6%) (Figs. 3
To quantify and evaluate the distribution of gene variants within the fer1(env) population, we evaluated several >100-kb regions in detail, verifying that mate-pairs carrying environmental variant sequences anchor them into the same genomic context (see 109-gene region in Fig. 5
Given the ≈4.5× sequencing depth across the fer1(env) population, we cannot determine whether regions are homogeneous because variants were absent or simply present in low abundance. However, our results do show the dominance of sequence types present. In the future, PCR-based analyses together with population genomic sampling will provide the sequence depth necessary to more fully resolve the abundance distribution of gene types in the population. Mate-pair information (Fig. 5 Extensive recombination has been documented in Ferroplasma type II (9). Subsequently, MLST-based analyses provided evidence for genetic recombination in the halophilic archaeon Halorubrum sp (25) and the thermoacidophilic archaeon Sulfolobus islandicus (7). Results documenting recombination in F. acidarmanus suggest that mosaic genome structure may be a common feature of natural archaeal populations. The restriction subunit of the type I R/M system described above exemplifies the form of sequence heterogeneity that can exist in the fer1(env) population. There are three distinguishable sequence types for the restriction subunit, exhibiting at least five recombination events across the 2,985-bp gene. The fer1(env) population also contains a type I specificity subunit gene with several allelic variants that show evidence of recombination among them. Recombination among individuals with distinct sequence types may provide the capacity to generate a large number of novel, hybrid restriction enzymes. Overall, R/M system dynamics may factor significantly in natural population structure and evolution (26–28) as heterogeneity in R/M activity (diversity in molecular recognition or the complete presence/absence of such systems) may lead to “recombination isolation.” Ultimately, the absence of recombination as a cohesive mechanism hindering genetic divergence among population members may result in an altered evolutionary trajectory (e.g., speciation or extinction). Population Structure. An important question is the role of selection in shaping genetic variation in natural populations. It has been postulated that genetic homogenization can be achieved when an adaptive genotype providing a selective advantage sweeps through a population (29, 30). Cohesion can also be maintained by high levels of genetic exchange between population members (31). As suggested here for the fer1(env) population and recently for local populations of Ferroplasma type II (9) and S. islandicus (7), recombination prevents periodic selection from purging total diversity. Because gene variants are not linked to a single genome type, selection removes only the subset of mosaic genome types with less adaptive gene variants. The removal of sequence variation by incomplete selective sweeps may explain the homogeneous genomic regions within the fer1(env) population. It has been suggested that genetic diversity can be maintained within a clade of very close relatives by one of two mechanisms: (i) the clade contains only a single population, and selection for adaptive genotypes is too weak to purge diversity (23, 32), or (ii) the clade contains multiple ecologically distinct populations (ecotypes) (33). In this latter model, localized chromosomal regions of sequence identity are explained by rare “globally adaptive” mutations, which are beneficial in the context of any ecotype to which they are introduced (29). When these mutations are transferred by recombination between ecotypes, selection may result in homogenization of the recombined region across ecotypes, while leaving the rest of the genome heterogeneous. We cannot rule out the possibility that some sequence diversity in the fer1(env) population is due to the partitioning of specific mosaic genome types into different niches within the biofilm. In support of the multiple-ecotype model, it is clear that recombinant genotypes have been amplified throughout the clade and that heterogeneous loci fall into discrete clusters. However, as our data indicates that nearly a third of the genes have been involved in recent global selection-like events, it is more likely that the fer1(env) organisms share a single niche, are subject to very similar selective pressures, and thus represent a single population. Furthermore, high rates of recombination effectively blurs the distinction between ecotypes (29). Ultimately, definitive discrimination between either model will require additional genomic monitoring of these populations over space and time. Evidence for Selection. The model for the mosaic population structure requires selection to explain the presence of genes with only one sequence type. To seek evidence for selection we modeled synonymous vs. nonsynonymous nucleotide polymorphism patterns in the fer1(env) population, both between population variants and against the respective fer1 ortholog. Pairwise dN/dS ratios (ratios of nonsynonymous to synonymous nucleotide substitutions) among partial and complete orthologous sequences were determined (34). Approximately seventy-nine percent of the 3,745 pairwise comparisons among variable sequence segments exhibited a dN/dS ratio <0.15, indicating that the majority of the genome is under strong purifying selection. No functional bias in distribution of dN/dS values could be detected (data not shown). Only six cases were identified with dN/dS values >1.0, suggesting that they are under positive selection. Four of these genes encode small hypothetical proteins, one is a conserved hypothetical protein, and one is a predicted siroheme synthase [siroheme, an iron-containing porphinoid, serves as a cofactor for sulfite and nitrite reductases (35)]. In many cases, elevated dN/dS values were associated with regions of genes, rather than entire genes. More sensitive methods for identifying positive selection (e.g., on a codon-by-codon basis) will require reconstruction of complete environmental variant gene sequences and the inclusion of sequences from more distantly related species. Whereas it is expected that many genes would display low dN/dS ratios (e.g., housekeeping genes), these results suggest that stringent functional constraints [including codon usage bias, the selection for preferred synonymous codons (36)] are characteristics of adaptation to this environment. These results are consistent with analyses revealing strong functional constraints in thermophilic species as revealed by global dN/dS analyses among closely related organisms (37). To seek evidence for the basis of selection, we examined the physical and functional distribution of homogeneous and heterogeneous genes within the fer1(env) data set. Clusters of genes with only one sequence type may be moved in blocks, possibly by recombination events (Figs. 4 Population Structure in the Context of Environmental and Geologic History. To understand population dynamics it would be helpful to able to correlate environmental perturbations with their genomic and evolutionary consequences. Geologic and historic records can provide information about the sequence of geochemical changes at the site. Using magnetism recorded in hematite gossans (products of pyrite oxidation), Alpers et al.†† determined that ore bodies at Iron Mountain were first exposed by erosion ≈700,000–1,000,000 years ago. When mining began ≈150 years ago, the physical fracturing of the system resulted in increased porosity and permeability within the mountain thus accelerating pyrite oxidation. As solution chemistry and temperature are strongly controlled by the dissolution of pyrite, this process resulted in a decrease in solution pH to values <1.0 and increased temperatures to ≈40°C in the contemporary underground mine. Seasonal rainfall-driven fluctuations in temperature and ionic strength occur within the mine and have been shown to be factors in species selection (38). These factors may also be important in strain genotype selection. Our comparative analyses have revealed genomic heterogeneity that may be understood in the context of these environmental changes. First, we determined that the fer1(env) population consists of mosaic genomes in which a small number of sequence types predominate. One possible scenario is that subsurface populations that were previously evolving in isolation from one another were brought together and began to recombine when different regions of the mountain were physically interconnected by mining. Recombination-driven homogenization of distinct but closely related populations at their geographic boundaries may be a common process in many ecosystems. Perhaps a limited number of mosaic genome type rose in abundance relative to a diverse mosaic pool in a relatively recent clonal expansion (39), possibly induced by environmental changes caused by the onset of mining. Under this scenario, genes with only a single sequence type may have derived from any of the ancestral strains. As recombination rates are inferred to be very fast relative to nucleotide polymorphism fixation rates (otherwise we would see many closely related variant types), rapid recombinatorial mixing of variants into the mosaic genome pool may maintain population cohesion. However, mechanisms such as phage and IS dynamics contribute to strain-level variation over short time scales via the rapid introduction and distribution of novel genes. Ultimately, accumulated variation may diminish the frequency of genetic recombination, promoting divergence and possibly speciation (31). A second finding is that almost all genes are under strong stabilizing selection. Environmental sequence types that are nearly identical (≥99.9% similar) to the isolate type are found at most loci despite >4 years difference in sample collection time. The exact preservation of a large amount of DNA sequence over 4 years may provide one constraint on rates of nucleotide polymorphism fixation. Such information is needed to quantify rates of genome evolution. At present, it is only possible to speculate about the relationship between strain divergence and site history. One possibility is that all of the observed nucleotide-level heterogeneity arose in situ over 700,000–1 million years. If this possibility could be confirmed, the relative rates for the accumulation of the different forms of genome change (Table 1) could be converted to absolute rates. Support for this correlation may be obtained by future genomic monitoring of these populations over space and time and through deductions at other sites for which dates of onset of colonization can be determined. This approach may provide a route to the long-term objective of mapping genome change onto environmental change so as to discover how these processes are interconnected. Materials and Methods F. acidarmanus fer1 Genome Sequencing and Assembly. F. acidarmanus fer1 was isolated from enrichment cultures inoculated with mine water and sediment samples collected in July 1997 from the five-way site within the Richmond Mine (11). Details of the fer1 genome assembly and annotation are presented in SI Materials and Methods. fer1(env) Genome Assembly and Comparative Analyses. We compared the isolate fer1 genome with genomic information obtained by shotgun sequencing small insert libraries constructed from DNA extracted directly from an environmental biofilm. The biofilm was sampled in 2002 from the five-way site of the Richmond Mine at the same location from which the fer1 isolate was collected. Tyson et al. (9) used 76 Mb of environmental sequence data to reconstruct large genome scaffolds from five species, one of which was closely related to F. acidarmanus. The partial composite genome reported for this fer1(env) population was previously referred to simply as Ferroplasma type I (9). In this work, an additional 27 Mb of community small insert library sequence not reported in ref. 9 was coassembled with the initial 76 Mb of data. The total community genome data set used in the present study consisted of 141,312 small-insert library reads (average trimmed length 737 bp) for a total of ≈103 Mb of environmental sequence data. Assembly was performed by using PHRAP (minmatch 50; penalty −15; minscore 40) (40) and ATLAS (k-mer size 32 bp, MaxMismatch 10%, band size 7; minmatch 12, minscore 20, penalty −2, forcelevel 10) (41). Newly generated genomic contigs were binned by alignment with existing contigs and scaffolds described in Tyson et al. (9). Scaffolds and contigs were initially assigned to fer1(env) based solely on assembly and subsequent analyses including % G+C, read depth, and profiling of nucleotide frequencies using TETRA (42). Subsequently, community genome contigs and scaffolds assigned to fer1(env) were verified by comparison with the F. acidarmanus fer1 genome by using BLAST (43) and MUMmer 3.0 software package's NUCmer (44). Approximately 14,000 environmentally derived sequencing reads constituted the genomically sampled fer1(env) population. In total, 172 scaffolds belonging to fer1(env) were used in the comparative analyses totaling >1.79 Mb of assembled sequence. Genome circular diagrams were generated from .embl format genome files exported from ARTEMIS (45) by using “circular diagram.pl” (Pathogen Group, The Wellcome Trust Sanger Institute). Synteny dot plots were generated by alignment of the fer1 genome sequence with all fer1(env) composite contigs and scaffolds by using NUCmer and MUMMERPLOT (44). Orthologous sequences between fer1 and the composite fer1(env) sequences were identified by using the reciprocal smallest distance method described by Wall et al. (46). To determine genome-wide sequence variation in the fer1(env) population, all fer1(env) environmental sequencing reads were aligned to fer1 genes by using BLASTN, and sequence identity was analyzed on a gene-by-gene basis. Multiple alignment outputted blast results were also generated, and mate-pair consistency for all reads localized to a given fer1 gene were verified on a gene-by-gene basis to verify that environmental reads aligned to fer1 genes in the correct genomic context. Analysis of Selection in the fer1(env) Population. To screen for evidence of selection within the fer1(env) population and against the orthologous fer1 sequences, the set of predicted fer1 orfs was queried against all reads from the community genome sequence by using BLASTN with a strict E value of e−75. To verify correct assignment of environmental reads to the fer1(env) population, reads comprising independently assembled fer1(env) contigs were extracted and checked for consistency against all blast matches. All alignments were preformed in CLUSTALW. dN/dS ratios for pairwise comparisons between fer1 orfs and orthologous fer1(env) reads were determined by using the method of Nei and Gojobori (34) in SNAP (47). Correct assignment of environmental sequences to the fer1(env) genome and the accuracy of alignments were confirmed for all pairwise comparisons resulting in dN/dS values >0.9 by reconstructing variant alleles and estimating dN/dS values in DnaSP (48). Supporting Information
Acknowledgments We thank Ted W. Arman (owner, Iron Mountain Mines), Richard Sugarek, Don Dodds, and Rudy Carver for mine access and on-site assistance; Val Burland and other members of the Blattner laboratory at the University of Wisconsin, Madison for contributions to the initial isolate genome sequencing effort; Fred Cohan for critical reading of the manuscript; and three anonymous reviewers. Research was supported by the Department of Energy (DOE) Microbial Genome Program (J.F.B.), the National Science Foundation (NSF) Biocomplexity Program (J.F.B.), the National Aeronautics and Space Administration Astrobiology Institute (J.F.B.), and a NSF Postdoctoral Research Fellowship in Microbial Biology (to E.E.A.). Sequencing was performed under the auspices of the DOE's Office of Science, Biological and Environmental Research Program. Abbreviations Footnotes The authors declare no conflict of interest. This article is a PNAS direct submission. Data deposition: Accession numbers for the community genomic data are AADL01000000 and AADL00000000. The Ferroplasma acidarmanus fer1 genome data associated with this study have been deposited in the GenBank database (accession no. AABC05000000). This article contains supporting information online at www.pnas.org/cgi/content/full/0604851104/DC1. ††Alpers, C. N., Nordstrom, D. K., Verosub, K. L., Helm, C. M. (1999) Geological Society of America Cordilleran Section Centennial Meeting, Abstracts with Programs, Vol. 31, No. 6, p. A-33 (abstr). Contributor Information Jillian F. Banfield, Departments of *Environmental Science, Policy, and Management and ; ‖Earth and Planetary Science, University of California, Berkeley, CA 94720; and ; ¶Department of Energy Joint Genome Institute, Walnut Creek, CA 94598; References 1. Bhattacharyya A, Stilwagen S, Ivanova N, D'Souza M, Bernal A, Lykidis A, Kapatral V, Anderson I, Larsen N, Los T, et al. Proc Natl Acad Sci USA. 2002;99:12403–12408. [PubMed] 2. Nelson KE, Fouts DE, Mongodin EF, Ravel J, DeBoy RT, Kolonay JF, Rasko DA, Angiuoli SV, Gill SR, Paulsen IT, et al. Nucleic Acids Res. 2004;32:2386–2395. [PubMed] 3. Eppinger M, Baar C, Raddatz G, Huson DH, Schuster SC. Nat Rev Microbiol. 2004;2:872–885. [PubMed] 4. Holden MT, Titball RW, Peacock SJ, Cerdeno-Tarraga AM, Atkins T, Crossman LC, Pitt T, Churcher C, Mungall K, Bentley SD, et al. Proc Natl Acad Sci USA. 2004;101:9786–9791. [PubMed] 5. Deng W, Burland V, Plunkett G, III, Boutin A, Mayhew GF, Liss P, Perna NT, Rose DJ, Mau B, Zhou S, et al. J Bacteriol. 2002;184:4601–4611. [PubMed] 6. Hanage WP, Fraser C, Spratt BG. BMC Biol. 2005;3:6. [PubMed] 7. Whitaker RJ, Grogan DW, Taylor JW. Mol Biol Evol. 2005;22:2354–2361. [PubMed] 8. Ochman H, Santos SR. Infect Genet Evol. 2005;5:103–108. [PubMed] 9. Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, Richardson PM, Solovyev VV, Rubin EM, Rokhsar DS, Banfield JF. Nature. 2004;428:37–43. [PubMed] 10. Allen EE, Banfield JF. Nat Rev Microbiol. 2005;3:489–498. [PubMed] 11. Edwards KJ, Bond PL, Gihring TM, Banfield JF. Science. 2000;287:1796–1799. [PubMed] 12. Dopson M, Baker-Austin C, Hind A, Bowman JP, Bond PL. Appl Environ Microbiol. 2004;70:2079–2088. [PubMed] 13. Golyshina OV, Pivovarova TA, Karavaiko GI, Kondrateva TF, Moore ER, Abraham WR, Lunsdorf H, Timmis KN, Yakimov MM, Golyshin PN. Int J Syst Evol Microbiol. 2000;50:997–1006. [PubMed] 14. She Q, Brugger K, Chen L. Res Microbiol. 2002;153:325–332. [PubMed] 15. Schneider D, Lenski RE. Res Microbiol. 2004;155:319–327. [PubMed] 16. Ciampi MS, Schmid MB, Roth JR. Proc Natl Acad Sci USA. 1982;79:5016–5020. [PubMed] 17. Prentki P, Teter B, Chandler M, Galas DJ. J Mol Biol. 1986;191:383–393. [PubMed] 18. Nevers P, Saedler H. Nature. 1977;268:109–115. [PubMed] 19. Schneider D, Duperchy E, Coursange E, Lenski RE, Blot M. Genetics. 2000;156:477–488. [PubMed] 20. Martusewitsch E, Sensen CW, Schleper C. J Bacteriol. 2000;182:2574–2581. [PubMed] 21. Wick LM, Weihong Q, Lacher DW, Whittam TS. J Bacteriol. 2005;187:1783–1791. [PubMed] 22. Ohnishi M, Kurokawa K, Hayashi T. Trends Microbiol. 2001;9:481–485. [PubMed] 23. Thompson JR, Pacocha S, Pharino C, Klepac-Ceraj V, Hunt DE, Benoit J, Sarma-Rupavtarm R, Distel DL, Polz MF. Science. 2005;307:1311–1313. [PubMed] 24. Coleman ML, Sullivan MB, Martiny AC, Steglich C, Barry K, DeLong EF, Chisholm SW. Science. 2006;311:1768–1770. [PubMed] 25. Papke RT, Koenig JE, Rodriguez-Valera F, Doolittle WF. Science. 2004;306:1928–1929. [PubMed] 26. Milkman R. Genetics. 1997;146:745–750. [PubMed] 27. Milkman R, Raleigh EA, McKane M, Cryderman D, Bilodeau P, McWeeny K. Genetics. 1999;153:539–554. [PubMed] 28. Jeltsch A. Gene. 2003;317:13–16. [PubMed] 29. Majewski J, Cohan FM. Genetics. 1999;152:1459–1474. [PubMed] 30. Cohan FM. Genetica. 2002;116:359–370. [PubMed] 31. Lawrence JG. Theor Popul Biol. 2002;61:449–460. [PubMed] 32. Acinas SG, Klepac-Ceraj V, Hunt DE, Pharino C, Ceraj I, Distel DL, Polz MF. Nature. 2004;430:551–554. [PubMed] 33. Cohan FM. Annu Rev Microbiol. 2002;56:457–487. [PubMed] 34. Nei M, Gojobori T. Mol Biol Evol. 1986;3:418–426. [PubMed] 35. Murphy MJ, Siegel LM, Tove SR, Kamin H. Proc Natl Acad Sci USA. 1974;71:612–616. [PubMed] 36. Lynn DJ, Singer GAC, Hickey DA. Nucleic Acids Res. 2002;30:4272–4277. [PubMed] 37. Friedman R, Drake JW, Hughes AL. Genetics. 2004;167:1507–1512. [PubMed] 38. Edwards KJ, Gihring TM, Banfield JF. Appl Environ Microbiol. 1999;65:3627–3632. [PubMed] 39. Smith NH, Dale J, Inwald J, Palmer S, Gordon SV, Hewinson RG, Smith JM. Proc Natl Acad Sci USA. 2003;100:15271–15275. [PubMed] 40. Ewing B, Green P. Genome Res. 1998;8:186–194. [PubMed] 41. Havlak P, Chen R, Durbin KJ, Egan A, Ren Y, Song XZ, Weinstock GM, Gibbs RA. Genome Res. 2004;14:721–732. [PubMed] 42. Teeling H, Waldman J, Lombardot T, Bauer M, Glöckner FO. BMC Bioinformatics. 2004;5:163. [PubMed] 43. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. J Mol Biol. 1990;215:403–410. [PubMed] 44. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL. Genome Biol. 2004;5:R12. [PubMed] 45. Rutherford K, Parkhill J, Crook J, Horsnell T, Rice P, Rajandream M-A, Barrell B. Bioinformatics. 2000;16:944–945. [PubMed] 46. Wall DP, Fraser HB, Hirsch AE. Bioinformatics. 2003;19:1710–1711. [PubMed] 47. Korber B. In: Computational Analysis of HIV Molecular Signatures. Rodrigo AG, Learn GH, editors. Dordrecht, The Netherlands: Kluwer; 2000. pp. 55–72. 48. Rozas J, Sánchez-DelBarrio JC, Messeguer X, Rozas R. Bioinformatics. 2003;19:2496–2497. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Proc Natl Acad Sci U S A. 2002 Sep 17; 99(19):12403-8.
[Proc Natl Acad Sci U S A. 2002]Nucleic Acids Res. 2004; 32(8):2386-95.
[Nucleic Acids Res. 2004]Nat Rev Microbiol. 2004 Nov; 2(11):872-85.
[Nat Rev Microbiol. 2004]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9786-91.
[Proc Natl Acad Sci U S A. 2004]J Bacteriol. 2002 Aug; 184(16):4601-11.
[J Bacteriol. 2002]BMC Biol. 2005 Mar 7; 3():6.
[BMC Biol. 2005]Mol Biol Evol. 2005 Dec; 22(12):2354-61.
[Mol Biol Evol. 2005]Infect Genet Evol. 2005 Mar; 5(2):103-8.
[Infect Genet Evol. 2005]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Nat Rev Microbiol. 2005 Jun; 3(6):489-98.
[Nat Rev Microbiol. 2005]Science. 2000 Mar 10; 287(5459):1796-9.
[Science. 2000]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Appl Environ Microbiol. 2004 Apr; 70(4):2079-88.
[Appl Environ Microbiol. 2004]Int J Syst Evol Microbiol. 2000 May; 50 Pt 3():997-1006.
[Int J Syst Evol Microbiol. 2000]Res Microbiol. 2002 Jul-Aug; 153(6):325-32.
[Res Microbiol. 2002]Res Microbiol. 2004 Jun; 155(5):319-27.
[Res Microbiol. 2004]Proc Natl Acad Sci U S A. 1982 Aug; 79(16):5016-20.
[Proc Natl Acad Sci U S A. 1982]J Mol Biol. 1986 Oct 5; 191(3):383-93.
[J Mol Biol. 1986]Nature. 1977 Jul 14; 268(5616):109-15.
[Nature. 1977]Genetics. 2000 Oct; 156(2):477-88.
[Genetics. 2000]Nat Rev Microbiol. 2005 Jun; 3(6):489-98.
[Nat Rev Microbiol. 2005]J Bacteriol. 2002 Aug; 184(16):4601-11.
[J Bacteriol. 2002]Res Microbiol. 2002 Jul-Aug; 153(6):325-32.
[Res Microbiol. 2002]J Bacteriol. 2005 Mar; 187(5):1783-91.
[J Bacteriol. 2005]Trends Microbiol. 2001 Oct; 9(10):481-5.
[Trends Microbiol. 2001]Science. 2005 Feb 25; 307(5713):1311-3.
[Science. 2005]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Science. 2004 Dec 10; 306(5703):1928-9.
[Science. 2004]Mol Biol Evol. 2005 Dec; 22(12):2354-61.
[Mol Biol Evol. 2005]Genetics. 1997 Jul; 146(3):745-50.
[Genetics. 1997]Genetics. 1999 Oct; 153(2):539-54.
[Genetics. 1999]Gene. 2003 Oct 23; 317(1-2):13-6.
[Gene. 2003]Genetics. 1999 Aug; 152(4):1459-74.
[Genetics. 1999]Genetica. 2002 Nov; 116(2-3):359-70.
[Genetica. 2002]Theor Popul Biol. 2002 Jun; 61(4):449-60.
[Theor Popul Biol. 2002]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Mol Biol Evol. 2005 Dec; 22(12):2354-61.
[Mol Biol Evol. 2005]Science. 2005 Feb 25; 307(5713):1311-3.
[Science. 2005]Nature. 2004 Jul 29; 430(6999):551-4.
[Nature. 2004]Annu Rev Microbiol. 2002; 56():457-87.
[Annu Rev Microbiol. 2002]Genetics. 1999 Aug; 152(4):1459-74.
[Genetics. 1999]Genetics. 1999 Aug; 152(4):1459-74.
[Genetics. 1999]Mol Biol Evol. 1986 Sep; 3(5):418-26.
[Mol Biol Evol. 1986]Proc Natl Acad Sci U S A. 1974 Mar; 71(3):612-6.
[Proc Natl Acad Sci U S A. 1974]Nucleic Acids Res. 2002 Oct 1; 30(19):4272-7.
[Nucleic Acids Res. 2002]Genetics. 2004 Jul; 167(3):1507-12.
[Genetics. 2004]Appl Environ Microbiol. 1999 Aug; 65(8):3627-32.
[Appl Environ Microbiol. 1999]Proc Natl Acad Sci U S A. 2003 Dec 9; 100(25):15271-5.
[Proc Natl Acad Sci U S A. 2003]Theor Popul Biol. 2002 Jun; 61(4):449-60.
[Theor Popul Biol. 2002]Science. 2000 Mar 10; 287(5459):1796-9.
[Science. 2000]Nature. 2004 Mar 4; 428(6978):37-43.
[Nature. 2004]Genome Res. 1998 Mar; 8(3):186-94.
[Genome Res. 1998]Genome Res. 2004 Apr; 14(4):721-32.
[Genome Res. 2004]BMC Bioinformatics. 2004 Oct 26; 5():163.
[BMC Bioinformatics. 2004]J Mol Biol. 1990 Oct 5; 215(3):403-10.
[J Mol Biol. 1990]Genome Biol. 2004; 5(2):R12.
[Genome Biol. 2004]Bioinformatics. 2000 Oct; 16(10):944-5.
[Bioinformatics. 2000]Bioinformatics. 2003 Sep 1; 19(13):1710-1.
[Bioinformatics. 2003]Mol Biol Evol. 1986 Sep; 3(5):418-26.
[Mol Biol Evol. 1986]Bioinformatics. 2003 Dec 12; 19(18):2496-7.
[Bioinformatics. 2003]