![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2009, American Society for Microbiology Effects of Experimental Choices and Analysis Noise on Surveys of the “Rare Biosphere” † Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina,1 Environmental Genomics Core Facility, University of South Carolina at Columbia, Columbia, South Carolina2 *Corresponding author. Mailing address: Bioinformatics Resource Center, University of North Carolina—Charlotte, Cameron 212, 9201 University City Boulevard, Charlotte, NC 28223. Phone: (704) 687-8214. Fax: (704) 687-6610. E-mail: anthony.fodor/at/gmail.com Received August 19, 2008; Accepted February 25, 2009. Abstract When planning a survey of 16S rRNA genes from a complex environment, investigators face many choices including which primers to use and how to taxonomically classify sequences. In this study, we explored how these choices affected a survey of microbial diversity in a sample taken from the aerobic basin of the activated sludge of a North Carolina wastewater treatment plant. We performed pyrosequencing reactions on PCR products generated from primers targeting the V1-V2, V6, and V6-V7 variable regions of the 16S rRNA gene. We compared these sequences to 16S rRNA gene sequences found in a whole-genome shotgun pyrosequencing run performed on the same sample. We found that sequences generated from primers targeting the V1-V2 variable region had the best match to the whole-genome shotgun reaction across a range of taxonomic classifications from phylum to family. Pronounced differences between primer sets, however, occurred in the “rare biosphere” involving taxa that we observed in fewer than 11 sequences. We also examined the results of analysis strategies comparing a classification scheme using a nearest-neighbor approach to directly classifying sequences with a naïve Bayesian algorithm. Again, we observed pronounced differences between these analysis schemes in infrequently observed taxa. We conclude that if a study is meant to probe the rare biosphere, both the experimental conditions and analysis choices will have a profound impact on the observed results. For nearly 3 decades, investigations of the distribution of microbes in complex environments have focused on the use of rRNA genes (1, 2, 4, 11, 16, 18, 19, 22, 24). Because the full-length 16S rRNA sequence can be obtained with paired-end reads via traditional Sanger sequencing, until recently most studies of the 16S rRNA gene captured most or nearly most of the 16S sequence length. New pyrosequencing technologies, however, have recently been introduced that greatly reduce the per base cost of sequencing but with shorter read lengths than traditional Sanger sequencing (17). This new approach has proven powerful, yielding a previously unobtainable view of rare taxa (7, 12-14, 25). The shorter reads produced by pyrosequencing require the choice of a particular region of the 16S rRNA gene to target for pyrosequencing as well as the choice of an algorithm to classify the taxonomy of the shorter reads. In their initial surveys of microbial diversity with pyrosequencing (12, 14, 25), Sogin and colleagues targeted the V6 variable region, in part because it is was small enough to be captured with the 100-bp reads of the pyrosequencing technology available at the time. Recently, the read length of 454 pyrosequencing machines has been increased to an average of ~250 bp. This allows for more flexibility in primer design and opens up the possibility of targeting regions of the 16S rRNA gene other than V6. In recent work, Huse et al. took advantage of this new capability to compare the classifications made for the human gut microbiome with the V6 and longer V3 regions (13). Plotting the taxonomic abundance of these two sequence sets against each other yielded an excellent correlation (r2 = 0.99), suggesting that the choice of which variable region to target makes little difference. In this report, we introduce a data set examining the performance of sets of primers targeting the V1-V2, V6, and V6-V7 regions. By using a sample for which we have also generated a whole-genome shotgun sequencing run with 250 bp reads, we were able to compare the observed 16S rRNA genes in samples with and without an initial PCR step targeting the 16S rRNA gene. Our results demonstrate that experimental choices such as which region of the 16S rRNA gene to sequence and which algorithm to use to classify taxa are much more likely to affect observations of the “rare biosphere” than more commonly observed taxa. MATERIALS AND METHODS Sample collection. The Mallard Creek Water Reclamation Facility is located in Charlotte, North Carolina. The plant has an average daily inflow of 7.5 million gallons, and the wastewater is mostly domestic, with additional input from the University of North Carolina-Charlotte, University City Carolinas Medical Center hospital, and several industrial users (see reference 21 for additional details). On the morning of 20 March 2007, we collected a 50-ml sample from the aeration basin using a plastic dipper. The sample was decanted to remove as much foam as possible before the liquid was transferred to a sterile tube. DNA was extracted from the sample using a Mo Bio UltraClean Water DNA Kit. The sample tube was inverted several times to maximize homogeneity, and a 10-ml aliquot was removed and pipetted onto the provided filter (0.22 μm pore size). Filtrate was discarded, and the membrane was used for bacterial DNA extraction using the manufacturer's protocol. The final DNA extract was analyzed for purity and concentration using a NanoDrop ND-1000 spectrophotometer. Approximately 100 μl of extracted DNA was concentrated in a vacuum centrifuge and resuspended in about 12 μl of molecular-grade biology water. The final sample concentration was determined by a NanoDrop spectrophotometer. A whole-genome shotgun sequencing run was performed on this sample, and a detailed description of these data can be found in our previous paper (21). The DNA sample was stored for just over 1 year at −20°C before the experiments described in this report were performed. PCR conditions. A 454 pyrosequencing reaction requires the presence of two oligonucleotide sequences, the 454A and 454B primers, for the emulsion PCR reaction that amplifies DNA prior to pyrosequencing. In a whole-genome shotgun sequencing experiment, these primer sequences are blunt-end ligated to sheared sequence prior to emulsion PCR. If a pyrosequencing sequencing run instead uses PCR to focus on a particular gene, these primer sequences can be incorporated into the primers used in the initial PCR. Table 1 shows the full primer sequences used in our initial PCR targeting the 16S rRNA gene. Primers were ordered from IDT and were not purified by high-performance liquid chromatography.
On 7 April 2008, four 50-μl PCR mixtures were set up using Pfu Ultra (Stratagene) high-fidelity DNA polymerase by following the manufacturer's directions. The PCR conditions were modified from Sekiguchi et al. (22) through trial and error to find cycling conditions that appeared to work well with all three primer sets based on analysis of PCRs with agarose gels. For the sequencing reactions described in this paper, our cycling conditions were 94°C for 5 min and 20 rounds of 94°C for 1 min, 60°C for 1 min (with this temperature dropping 1°C every second cycle), and 72°C for 1 min. The annealing temperature was then set to 55°C for 10 rounds (i.e., 94°C 1 min, 55°C 1 min, and 72°C 1 min). Finally, the samples were exposed to 72°C for 7 min and then cooled to 4°C. Our PCR buffer contained 5 μl of 10× Pfu Ultra buffer, 1 μl of deoxynucleoside triphosphate mix at 10 mM for each deoxynucleoside triphosphate, 1 μl of template DNA from the wastewater treatment plant at 58.7 ng/μl, 1 μl of Pfu Ultra polymerase at 2.5 U/μl, 40 μl of nuclease-free water, and 1 μl each of the forward and reverse primers at 10 μM. Samples were run on an agarose gel and gel purified with the Promega Wizard SV gel and PCR clean-up system by following the manufacturer's instructions. Shannon sequence entropy. The aligned version of the Ribosomal Database Project ([RDP] version 9.59) was downloaded from http://rdp.cme.msu.edu/. A reference sequence was chosen (Escherichia coli J01695), and a new alignment was created in which all the columns of this alignment were numbered by the reference sequence. That is, if within the alignment the reference sequence E. coli J01695 had a gap, that column was removed from every sequence in the alignment. We then calculated the Shannon sequence entropy (Fig. (Fig.1)1
Filtering of primer sequences. We followed the recommendations of Huse et al. (14) and removed all sequences (Table 2) that had any N residues anywhere in the sequence, did not start with the expected 5′ primer sequence, or for which the sequence read lengths (including nucleotides derived from the primer sequence) were below 150 bp (70 for primer 967F-1046R). For sequences that survived our filters, we removed regions containing the upstream and downstream primer sequences by simply removing the first and last 20 bp (since sequencing error tends to accumulate toward the end of pyrosequencing reads, we wished to treat sequences from all primer sets uniformly regardless of read length). Unfiltered sequence sets are available as File S1 in the supplemental material.
Whole-genome sequences and the RDP classification algorithm. Using BLASTN with an e-score cutoff of 0.01 (and with the filter parameter −F “m D”), we ran our 378,601 454-FLX whole-genome shotgun sequences from our 20 March 2007 sample of the Mallard Creek Wastewater Treatment Plant (24) against version 9.60 of the RDP database (22). This search yielded 739 hits of which 467 could be assigned by stand-alone version 2.0 of the RDP classification algorithm to either Bacteria (464) or Archaea (3) at the top of the phylogenetic tree with a confidence score of >80 (Fig. (Fig.22
Compared to our PCR runs, we have a relatively small number of whole-genome shotgun sequences that can be assigned to taxa above the RDP threshold of >80% (Fig. (Fig.2).2 Implementation of JGast. A “nearest neighbor” algorithm maps a query sequence to a previously described full-length sequence in a database and then assigns the classification of the full-length sequence to the query sequence. There are several nearest-neighbor algorithms available as web servers (5, 6), but they have limitations in the number of sequences that can be uploaded, making them poorly suited to large pyrosequencing datasets. The code for the Global Alignment for Sequence Taxonomy (GAST) process (13) has been made publicly available (http://vamps.mbl.edu/resources/software.php) but requires creation of a database containing variable regions extracted from a full-length 16S rRNA alignment. As an alternative, we implemented a nearest-neighbor algorithm in Java (see File S2 in the supplemental material) that works directly on unaligned reference 16S rRNA sequences. Our implementation, which we call JGast, begins with classification of the 302,066 sequences in the Greengenes database (the file current_prokMSA_unaligned.fasta dated 16 December 2008 was downloaded from http://greengenes.lbl.gov/Download/Sequence_Data/Fasta_data_files/on 7, January 2009) by the stand-alone RDP classification algorithm, version 2 (downloaded from http://sourceforge.net/projects/rdp-classifier/). In addition, a BLAST database was created from the current_prokMSA_unaligned.fasta sequences using formatdb (with the default parameters except for “−p f”). For each query sequence, we performed a BLASTN search against this database. We used NCBI BLAST, version 2.2.18 for Linux, with the following parameters: −p blastn, v = 250, and −e 0.000001. For each query sequence, we took the top 250 sequences with the best BLAST match (i.e., lowest e-score) to the query sequence. For each of the 250 sequences found by BLAST, we performed a pairwise global alignment with the query sequence using the Needleman-Wunsch algorithm with a match score of 2, a mismatch score of −1, and an affine gap penalty of −2 for opening a gap and a score of 0 for extending the gap. Following Huse et al., we defined the distance of our pairwise Needleman-Wunsch-aligned sequences as the number of insertions, deletions, and mismatches, divided by the ungapped length of the query sequence. In determining the region of the target sequence to use for the global alignment, we took enough of the target sequence to ensure full coverage of the query sequence. For example, consider a BLAST alignment of 100 nucleotides between a query sequence of 175 nucleotides to a target sequence of 400 nucleotides that matched from nucleotides 200 to 300 of the target sequence. There are 75 nucleotides in the query sequence that were not present in the BLAST alignment. We wish to allow for the possibility that the missing 75 nucleotides could contribute to a global alignment on either the 5′ or 3′ side of the target sequence. We would therefore use the Needleman-Wunsch algorithm to align the full query sequence to a substring of 125 to 375 from the target sequence (plus an additional 10 nucleotides on either side to allow for gaps). Since trailing or leading gaps are treated as a single insertion event, the presence of the extra sequence does not substantially increase the calculated distance. Of the 250 sequences aligned to the query sequence in this way, we followed Huse et al. and considered the taxonomy of the target sequence(s) with the best percent identity (i.e., lowest distance) from the Needleman-Wunsch alignment as reference sequences. We compared the classification of these reference sequences and generated a consensus taxonomy in which two-thirds of the reference sequences agreed on the taxonomy, with at least an 80% RDP confidence score on the full-length reference sequence. If two-thirds of the sequences did not agree on a classification with an 80% RDP confidence score at a given taxonomic level (e.g., genus), we moved to the next higher taxonomic level on the tree (e.g., family). Although JGast shares many similarities with GAST, there are some differences that could yield different results. GAST starts with RDP classifications of the SILVA database. We instead used RDP classifications of the Greengenes database in part because the Greengenes database has a longer average sequence length (1,416 bp for Greengenes versus 1,005 for SILVA). GAST uses a blast database consisting of an alignment of just the target variable regions and uses a multiple sequence alignment program (MUSCLE) (8, 9) to align the query sequence with the 100 best BLAST hits. Distance scores are calculated on this global alignment. JGast, by contrast, calculates distance scores from a pairwise Needleman-Wunsch alignment of the query sequence to each of the 250 best BLAST hits found in a BLAST database consisting of full-length sequences of the Greengenes database. Finally, GAST considers the taxonomy of all sequences in the SILVA database that contain an exact match to the reference sequence substring containing the variable region. By contrast, JGast considers only the taxonomy of sequences among the 250 best BLAST hits that have an exact match to the substring of the target sequence used for the global alignment. We do not anticipate these different choices, employed for expediency or performance or to take advantage of an existing Java code base, would make large differences in the results that we obtain. Indeed, our results strongly suggest that different classifications that arise due to different choices made during construction of algorithms will mostly be limited to the rare biosphere. In Fig. Fig.11
Linear regressions. All linear regressions were performed on log-transformed data with 1 added to each observed taxon before the log transformation. Taxa that were not present in either condition were not included in the regression. Nucleotide sequence accession number. Sequences in this study are available at the NCBI short-read archive under accession number SRA001164 and as File S1 in the supplemental material. RESULTS Primer sets were chosen based on previous literature and an analysis of the RDP. Figure Figure11 Once the primers were chosen, we used as a template a DNA sample taken on 20 March 2007 from the aerobic basin of the Mallard Creek Wastewater Treatment Plant for which we had also performed a 454-FLX whole-genome shotgun sequencing reaction (21). Using the three sets of primers in Table 1, we performed 30 rounds of PCR on this sample (see Materials and Methods), gel purified the resulting amplicon, and submitted the resulting DNA for 454-FLX pyrosequencing at the Environmental Genomics Core Facility in Columbia, South Carolina. Each of the three reactions was run on 1/16 of an LR70 plate. The number of sequences that was generated for each primer pair is shown in Table 2. Differences between whole-genome results and results from the 16S rRNA gene are pronounced for infrequently observed taxa. Although it is the basis of much of the biotechnology revolution, conventional PCR is known to have limits as a quantitative technology. Primer bias, saturation of amplicon at higher cycle numbers, and a stochasticity that is an inherent part of the PCR process can all limit the degree to which conventional PCR can be reliably used as a quantitative tool. A comparison of our whole-genome-derived 16S rRNA sequences with sequences derived from PCR targeting the 16S rRNA gene on the same DNA sample allows us to evaluate the degree of bias introduced by different PCR primers during the initial PCR step. Figure Figure22 Differences between different analysis schemes also most significantly affect infrequently observed taxa. In Fig. Fig.22
DISCUSSION Our comparison of 16S rRNA sequences derived from whole-genome sequence sets with PCR-generated 16S sequences yielded reasonable overall positive correlations (Fig. (Fig.2)2 In addition to primer bias and differences in the number of 16S sequences as causes of the poor correspondence between whole-genome and PCR results for low-abundance taxa, we suggest a third, less immediately obvious, cause: low-abundance taxa are harder to classify, and hence results for these taxa are more sensitive to choices made during data analysis, what we here call “analysis noise” (Fig. (Fig.44 In a recent paper, Huse et al. performed a comparison of different primer sets (full-length, V3, and V6). Their regressions of the abundance of taxa produced by these different primer sets produced r-squared values down to genus of ~0.99 (13). These are substantially higher than the r-squared values we observed when we compared our whole-genome run to our runs targeting V1-V2, V6, and V7 (Fig. (Fig.22 We note that there is not an immediately obvious superior choice between the two analysis schemes we examined. The RDP classification scheme offers simplicity, very fast run times, and a straightforward bootstrap mechanism for establishing significance. Algorithms like JGast (and GAST [13] and the Greengenes classification scheme [5]) offer potentially greater sensitivity but are also more susceptible to small changes in the database. For example, a single misclassified sequence inserted into the reference database can significantly change classifications for a large number of query sequences if it serves as the nearest neighbor. The RDP classification algorithm is less sensitive to small changes in the training set. We imagine that in the future as the read length of new sequencing technologies approaches the length of the 16S rRNA genes (10), nearest-neighbor approaches, which replace a short query sequence with a full-length database sequence, may be replaced by direct classification of full-length query sequences. In the paper that introduced the RDP classification algorithm (26), an in silico analysis suggested that subsequences covering the V1-V2 region out-performed other variable regions in reproducing full-length classifications. Our results are consistent with this finding, with the V1-V2 region slightly out-performing V6 and V6-V7 in matching our whole-genome results (Fig. (Fig.22 In their analysis of primer bias, Huse et al. conclude that “any effects of primer bias are limited to rare taxa.” This analysis is largely in agreement with our results in which different primer sets produce somewhat similar results for abundant taxa while producing vast differences in the results generated for the rare biosphere. A principle argument for favoring pyrosequencing over traditional Sanger sequencing is that the much greater depth of pyrosequencing allows for investigations of the rare biosphere (25). Our results suggest that even more diversity in the rare biosphere will be uncovered when pyrosequencing experiments are repeated with different primer sets. Our results also suggest that longer sequence reads, which may be possible in sequencing technology to be introduced soon (10), will reduce analysis noise by eliminating the step in which a short query sequence is mapped to a full-length reference sequence. Such improvements in technology will be an important component of explorations of the rare biosphere while having much less impact on surveys of more abundant taxa. [Supplemental material]
Acknowledgments We thank Mary Ann Moran for very helpful comments on a previous version of the manuscript. Footnotes Published ahead of print on 6 March 2009.†Supplemental material for this article may be found at http://aem.asm.org/. REFERENCES 1. Amann, R., W. Ludwig, and K. Schleifer. 1995. Phylogenetic identification and in situ detection of individual microbial cells without cultivation. Microbiol. Rev. 59:143-169. [PubMed] 2. Bond, P. L., P. Hugenholtz, J. Keller, and L. L. Blackall. 1995. Bacterial community structures of phosphate-removing and non-phosphate-removing activated sludges from sequencing batch reactors. Appl. Environ. Microbiol. 61:1910-1916. [PubMed] 3. Chakravorty, S., D. Helb, M. Burday, N. Connell, and D. Alland. 2007. A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria. J. Microbiol. Methods 69:330-339. [PubMed] 4. Crump, B. C., G. W. Kling, M. Bahr, and J. E. Hobbie. 2003. Bacterioplankton community shifts in an Arctic lake correlate with seasonal changes in organic matter source. Appl. Environ. Microbiol. 69:2253-2268. [PubMed] 5. DeSantis, T. Z., P. Hugenholtz, N. Larsen, M. Rojas, E. L. Brodie, K. Keller, T. Huber, D. Dalevi, P. Hu, and G. L. Andersen. 2006. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 72:5069-5072. [PubMed] 6. DeSantis, T. Z., Jr., P. Hugenholtz, K. Keller, E. L. Brodie, N. Larsen, Y. M. Piceno, R. Phan, and G. L. Andersen. 2006. NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes. Nucleic Acids Res. 34:W394-W399. [PubMed] 7. Dethlefsen, L., S. Huse, M. L. Sogin, and D. A. Relman. 2008. The pervasive effects of an antibiotic on the human gut microbiota, as revealed by deep 16S rRNA sequencing. PLoS Biol. 6:e280. [PubMed] 8. Edgar, R. C. 2004. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5:113. [PubMed] 9. Edgar, R. C. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32:1792-1797. [PubMed] 10. Eid, J., A. Fehr, J. Gray, K. Luong, J. Lyle, G. Otto, P. Peluso, D. Rank, P. Baybayan, B. Bettman, A. Bibillo, K. Bjornson, B. Chaudhuri, F. Christians, R. Cicero, S. Clark, R. Dalal, A. Dewinter, J. Dixon, M. Foquet, A. Gaertner, P. Hardenbol, C. Heiner, K. Hester, D. Holden, G. Kearns, X. Kong, R. Kuse, Y. Lacroix, S. Lin, P. Lundquist, C. Ma, P. Marks, M. Maxham, D. Murphy, I. Park, T. Pham, M. Phillips, J. Roy, R. Sebra, G. Shen, J. Sorenson, A. Tomaney, K. Travers, M. Trulson, J. Vieceli, J. Wegener, D. Wu, A. Yang, D. Zaccarin, P. Zhao, F. Zhong, J. Korlach, and S. Turner. 2009. Real-time DNA sequencing from single polymerase molecules. Science 323:133-138. [PubMed] 11. Ginige, M. P., P. Hugenholtz, H. Daims, M. Wagner, J. Keller, and L. L. Blackall. 2004. Use of stable-isotope probing, full-cycle rRNA analysis, and fluorescence in situ hybridization-microautoradiography to study a methanol-fed denitrifying microbial community. Appl. Environ. Microbiol. 70:588-596. [PubMed] 12. Huber, J. A., D. B. Mark Welch, H. G. Morrison, S. M. Huse, P. R. Neal, D. A. Butterfield, and M. L. Sogin. 2007. Microbial population structures in the deep marine biosphere. Science 318:97-100. [PubMed] 13. Huse, S. M., L. Dethlefsen, J. A. Huber, D. M. Welch, D. A. Relman, and M. L. Sogin. 2008. Exploring microbial diversity and taxonomy using SSU rRNA hypervariable tag sequencing. PLoS Genet. 4:e1000255. [PubMed] 14. Huse, S. M., J. A. Huber, H. G. Morrison, M. L. Sogin, and D. M. Welch. 2007. Accuracy and quality of massively parallel DNA pyrosequencing. Genome Biol. 8:R143. [PubMed] 15. Li, F., M. A. Hullar, and J. W. Lampe. 2007. Optimization of terminal restriction fragment polymorphism (TRFLP) analysis of human gut microbiota. J. Microbiol. Methods 68:303-311. [PubMed] 16. Lyautey, E., B. Lacoste, L. Ten-Hage, J. L. Rols, and F. Garabetian. 2005. Analysis of bacterial diversity in river biofilms using 16S rDNA PCR-DGGE: methodological settings and fingerprints interpretation. Water Res. 39:380-388. [PubMed] 17. Margulies, M., M. Egholm, W. E. Altman, S. Attiya, J. S. Bader, L. A. Bemben, J. Berka, M. S. Braverman, Y. J. Chen, Z. Chen, S. B. Dewell, L. Du, J. M. Fierro, X. V. Gomes, B. C. Godwin, W. He, S. Helgesen, C. H. Ho, G. P. Irzyk, S. C. Jando, M. L. Alenquer, T. P. Jarvie, K. B. Jirage, J. B. Kim, J. R. Knight, J. R. Lanza, J. H. Leamon, S. M. Lefkowitz, M. Lei, J. Li, K. L. Lohman, H. Lu, V. B. Makhijani, K. E. McDade, M. P. McKenna, E. W. Myers, E. Nickerson, J. R. Nobile, R. Plant, B. P. Puc, M. T. Ronan, G. T. Roth, G. J. Sarkis, J. F. Simons, J. W. Simpson, M. Srinivasan, K. R. Tartaro, A. Tomasz, K. A. Vogt, G. A. Volkmer, S. H. Wang, Y. Wang, M. P. Weiner, P. Yu, R. F. Begley, and J. M. Rothberg. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376-380. [PubMed] 18. Pace, N., D. Stahl, D. Lane, and G. Olsen. 1985. Analyzing natural microbial populations by rRNA sequences. ASM News 51:4-12. 19. Rappe, M. S., and S. J. Giovannoni. 2003. The uncultured microbial majority. Annu. Rev. Microbiol. 57:369-394. [PubMed] 20. Rozen, S., and H. Skaletsky. 2000. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132:365-386. [PubMed] 21. Sanapareddy, N., T. J. Hamp, L. C. Gonzalez, H. A. Hilger, A. A. Fodor, and S. M. Clinton. 2009. Molecular diversity of a North Carolina wastewater treatment plant as revealed by pyrosequencing. Appl. Environ. Microbiol. 75:1688-1696. [PubMed] 22. Sekiguchi, H., M. Watanabe, T. Nakahara, B. Xu, and H. Uchiyama. 2002. Succession of bacterial community structure along the Changjiang River determined by denaturing gradient gel electrophoresis and clone library analysis. Appl. Environ. Microbiol. 68:5142-5150. [PubMed] 23. Shenkin, P. S., B. Erman, and L. D. Mastrandrea. 1991. Information-theoretical entropy as a measure of sequence variability. Proteins 11:297-313. [PubMed] 24. Snaidr, J., R. Amann, I. Huber, W. Ludwig, and K. H. Schleifer. 1997. Phylogenetic analysis and in situ identification of bacteria in activated sludge. Appl. Environ. Microbiol. 63:2884-2896. [PubMed] 25. Sogin, M. L., H. G. Morrison, J. A. Huber, D. M. Welch, S. M. Huse, P. R. Neal, J. M. Arrieta, and G. J. Herndl. 2006. Microbial diversity in the deep sea and the underexplored “rare biosphere.” Proc. Natl. Acad. Sci. USA 103:12115-12120. [PubMed] 26. Wang, Q., G. M. Garrity, J. M. Tiedje, and J. R. Cole. 2007. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl. Environ. Microbiol. 73:5261-5267. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Microbiol Rev. 1995 Mar; 59(1):143-69.
[Microbiol Rev. 1995]Appl Environ Microbiol. 1995 May; 61(5):1910-6.
[Appl Environ Microbiol. 1995]Appl Environ Microbiol. 2003 Apr; 69(4):2253-68.
[Appl Environ Microbiol. 2003]Appl Environ Microbiol. 2004 Jan; 70(1):588-96.
[Appl Environ Microbiol. 2004]Water Res. 2005 Jan-Feb; 39(2-3):380-8.
[Water Res. 2005]Science. 2007 Oct 5; 318(5847):97-100.
[Science. 2007]Genome Biol. 2007; 8(7):R143.
[Genome Biol. 2007]Proc Natl Acad Sci U S A. 2006 Aug 8; 103(32):12115-20.
[Proc Natl Acad Sci U S A. 2006]PLoS Genet. 2008 Nov; 4(11):e1000255.
[PLoS Genet. 2008]Appl Environ Microbiol. 2009 Mar; 75(6):1688-96.
[Appl Environ Microbiol. 2009]Appl Environ Microbiol. 2002 Oct; 68(10):5142-50.
[Appl Environ Microbiol. 2002]Proteins. 1991; 11(4):297-313.
[Proteins. 1991]Genome Biol. 2007; 8(7):R143.
[Genome Biol. 2007]Appl Environ Microbiol. 1997 Jul; 63(7):2884-96.
[Appl Environ Microbiol. 1997]Appl Environ Microbiol. 2002 Oct; 68(10):5142-50.
[Appl Environ Microbiol. 2002]J Microbiol Methods. 2007 May; 69(2):330-9.
[J Microbiol Methods. 2007]Appl Environ Microbiol. 2006 Jul; 72(7):5069-72.
[Appl Environ Microbiol. 2006]Nucleic Acids Res. 2006 Jul 1; 34(Web Server issue):W394-9.
[Nucleic Acids Res. 2006]PLoS Genet. 2008 Nov; 4(11):e1000255.
[PLoS Genet. 2008]BMC Bioinformatics. 2004 Aug 19; 5():113.
[BMC Bioinformatics. 2004]Nucleic Acids Res. 2004; 32(5):1792-7.
[Nucleic Acids Res. 2004]Methods Mol Biol. 2000; 132():365-86.
[Methods Mol Biol. 2000]Proc Natl Acad Sci U S A. 2006 Aug 8; 103(32):12115-20.
[Proc Natl Acad Sci U S A. 2006]Appl Environ Microbiol. 2009 Mar; 75(6):1688-96.
[Appl Environ Microbiol. 2009]Appl Environ Microbiol. 2007 Aug; 73(16):5261-7.
[Appl Environ Microbiol. 2007]Appl Environ Microbiol. 2006 Jul; 72(7):5069-72.
[Appl Environ Microbiol. 2006]PLoS Genet. 2008 Nov; 4(11):e1000255.
[PLoS Genet. 2008]PLoS Biol. 2008 Nov 18; 6(11):e280.
[PLoS Biol. 2008]Proc Natl Acad Sci U S A. 2006 Aug 8; 103(32):12115-20.
[Proc Natl Acad Sci U S A. 2006]PLoS Genet. 2008 Nov; 4(11):e1000255.
[PLoS Genet. 2008]PLoS Genet. 2008 Nov; 4(11):e1000255.
[PLoS Genet. 2008]Appl Environ Microbiol. 2006 Jul; 72(7):5069-72.
[Appl Environ Microbiol. 2006]Science. 2009 Jan 2; 323(5910):133-8.
[Science. 2009]Appl Environ Microbiol. 2007 Aug; 73(16):5261-7.
[Appl Environ Microbiol. 2007]Proc Natl Acad Sci U S A. 2006 Aug 8; 103(32):12115-20.
[Proc Natl Acad Sci U S A. 2006]Science. 2009 Jan 2; 323(5910):133-8.
[Science. 2009]Proteins. 1991; 11(4):297-313.
[Proteins. 1991]J Microbiol Methods. 2007 May; 69(2):330-9.
[J Microbiol Methods. 2007]