![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2009, American Society for Microbiology The Genome of Thermosipho africanus TCF52B: Lateral Genetic Connections to the Firmicutes and Archaea † Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada,1 UMR CNRS 7138 Systématique, Adaptation et Evolution, Universite Pierre et Marie Curie, Paris, France,2 Centre for Geobiology, University of Bergen, Bergen, Norway,3 Department of Process Engineering and Applied Science, Dalhousie, University, Halifax, Nova Scotia B3J 1Z1, Canada,4 Department of Biology, University of Bergen, Bergen, Norway5 *Corresponding author. Present address: Centre for Ecological and Evolutionary Synthesis, Department of Biology, University of Oslo, P.O. Box 1066 Blindern, N-0316 Oslo, Norway, and Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada, T6G 2E9. Phone: (780) 492-3671. Fax: (780) 492-9234. E-mail: nesbo/at/ualberta.ca Received October 15, 2008; Accepted December 29, 2008. Abstract Lateral gene transfers (LGT) (also called horizontal gene transfers) have been a major force shaping the Thermosipho africanus TCF52B genome, whose sequence we describe here. Firmicutes emerge as the principal LGT partner. Twenty-six percent of phylogenetic trees suggest LGT with this group, while 13% of the open reading frames indicate LGT with Archaea. Thermosipho africanus TCF52B was isolated from produced fluids of a high-temperature oil reservoir in the North Sea using fish waste as the only substrate (4). Phylogenetic analyses based on the 16S rRNA gene sequence and DNA-DNA hybridization placed it as a strain of Thermosipho africanus, which was first isolated from a shallow marine hydrothermal system in Djibouti, Africa (8, 21). The complete genome sequence of this strain was determined by the conventional whole-genome shotgun strategy. Genomic libraries containing 1- to 4-kb and 40-kb fragments were constructed, and sequence chromatograms were produced using a MegaBACE 1000 capillary DNA sequencer (GE Healthcare). Nucleotide skews were computed as described previously (11). Automated open reading frame (ORF) identification and annotation were performed using the annotation software Manatee made available by TIGR (23). Pseudogenes were identified by doing BLAST searches of neighboring ORFs with the same or similar annotations and by using the program Psi-phi (9, 10), and clustered regularly interspaced short palindromic repeat loci (CRISPRs) were identified using the web site http://crispr.u-psud.fr/crispr/CRISPRHomePage.php with the default parameters (6). Maximum-likelihood (ML) trees (WAG [Γ+Ι model, four categories]) were constructed from protein-coding ORFs using PHYML and the PhyloGenie package (5). Recently, several Thermotogales genomes have become available in GenBank. As these genomes had not been published yet, we did not include them in any “genome-scale” analyses (i.e., the phylogenetic analyses). We did, however, include them in the BLAST analyses of mobile Thermosipho africanus genes. The genome of Thermosipho africanus strain TCF52B is a single circular chromosome consisting of 2,016,657 bp with an average G+C content of 30.8%. Strand asymmetries, such as GC skew and tetramer skews, are pronounced and show two clear singularity points, located at roughly 8 kb and 1033 kb from the +1 site (see Fig. S1 in the supplemental material). Since these two points are diametrically opposed on the circular chromosome, dividing it into two halves with opposite compositional skews, they make good candidates for the putative origin and termination of replication. The 1,033-kb region is likely to harbor the origin, since GC skew becomes positive past this location, as in most bacterial genomes with a known origin. The genome contains 2,000 potential coding sequences, of which 1913 are putative protein-coding ORFs, 30 are putatively assigned as pseudogenes, and 57 encode RNA. A comparison to the genome of Thermotoga maritima is given in Table 1. The Thermosipho africanus genome is about 156 kb larger than the Thermotoga maritima genome and carries 36 more ORFs. The genome contains duplicated regions comprising paralogous gene copies, CRISPRs, and mobile genetic elements, which collectively provide considerable indirect evidence for genomic instability and acquisition of exogenous genetic information.
CRISPR structures comprise direct genomic repeats of 24 to 47 bp length separated by variable-length spacers (1, 13, 22) and are thought to function as a prokaryotic “immune system.” Due to their patchy distribution in prokaryotes, CRISPRs are often assumed to undergo frequent lateral transfer. Thermosipho africanus displays 12 CRIPSRs spread over its chromosome (Fig. (Fig.1),1
Seventy-eight ORFs were annotated as encoding transposases or integrases, and at least 61 of these are likely to be active genes (Fig. (Fig.1).1 We attempted to calculate ML phylogenetic trees from each of the 1,913 ORFs and obtained trees from 1,578 (82%), using the PhyloGenie package. The distribution of the “immediate sisters” (nearest neighbors) of Thermosipho africanus in the trees is shown in Fig. Fig.2.2
The phylogenetic analysis revealed that 58 ORFs (3.7%) had Archaea as immediate sister in the tree. This is considerably lower than the 24% first reported for the Thermotoga maritima genome (16). A lower value was to be expected, for two reasons. First, growth of the bacterial gene and genome data has outpaced that for Archaea, so that bacterial best hits to patchily distributed genes with ambiguous phylogenetic signals have become differentially more likely. Second, the Thermotoga maritima genome will itself be sister for all or most Thermosipho africanus genes that were transferred prior to their divergence and are still present in both. We therefore visually inspected each of the trees in order to also obtain information on LGT that predate the split between Thermosipho and Thermotoga (see Fig. S2 in the supplemental material). This also allowed us to detect transfers where the genes involved have later been duplicated in the Thermosipho africanus genome (so that the sister in the tree was another Thermosipho africanus gene.) This analysis suggested that a total of 202 ORFs (~13%) have been involved in LGT with Archaea (including both ancient and recent events). Among these, 125 (~62%) also involve Thermotoga maritima, while 77 (~38%) have no close homolog in Thermotoga maritima. This latter number is of course an overestimate of the number of potential recent transfers, as many of the transferred genes might have been lost by Thermotoga maritima MSB8, but these numbers do suggest that LGT between the Thermotogales and the Archaea is a still an ongoing process. Thermophilic Archaea such as members of the genera Archaeoglobus (2) and Thermococcus (3, 14) are among the few other organisms considered to be native to oil reservoirs, the habitat from which this strain was isolated (4). Moreover, a recent reanalysis of the Thermotoga maritima genome reported 11.3% archaeal genes in this genome, consistent with our findings (20). A large proportion of the ORFs have a close phylogenetic relationship with Firmicutes, with 8% of the ORFs having Firmicutes as sister in the tree (Fig. (Fig.2).2 A high level of LGT between Thermotogales and Firmicutes might in any case be expected, since some members of the Firmicutes, e.g., the Thermoanaerobales, frequently cohabit with Thermotogales in natural environments. For instance, Thermotogales and the Firmicutes genera Thermoanaerobacter and Desulfotomaculum are the only bacteria thought to be indigenous to oil reservoirs (4, 12, 18). Moreover, most of the mobile elements found scattered in the Thermosipho africanus genome seem to have recently originated from Firmicutes, further supporting the importance of LGT between these lineages. Nucleotide sequence accession number. The genome sequence of Thermosipho africanus strain TCF52B has been submitted to GenBank under accession number CP001185. [Supplemental material]
Acknowledgments This work was supported by funds from the Canadian Institutes for Health Research (MOP 4467) and Genome Atlantic (ACOA) to W.F.D. and by funds from the Norwegian Research Council (grant no. 145854/110 to N.K.B.). C.L.N. is supported by a Young Scientist grant from the Norwegian Research Council (180444/V40). Sequencing and assembly were performed at The Atlantic Genome Centre (Halifax, Canada). We thank TIGR (now JCVI) for providing the TIGR Annotation Service, which provided us with automatic annotation data and the manual annotation tool Manatee. We also thank Peter Cordes and Sebastien Halary for help with the data analysis and Angie Lewis for help with sequencing and assembly. Footnotes Published ahead of print on 5 January 2009.†Supplemental material for this article may be found at http://jb.asm.org/. REFERENCES 1. Barrangou, R., C. Fremaux, H. Deveau, M. Richards, P. Boyaval, S. Moineau, D. A. Romero, and P. Horvath. 2007. CRISPR provides acquired resistance against viruses in prokaryotes. Science 3151709-1712. [PubMed] 2. Beeder, J., R. K. Nilsen, J. T. Rosnes, T. Torsvik, and T. Lien. 1994. Archaeoglobus-Fulgidus isolated from hot North Sea oil field waters. Appl. Environ. Microbiol. 601227-1231. [PubMed] 3. Bonch-Osmolovskaya, E. A., M. L. Miroshnichenko, A. V. Lebedinsky, N. A. Chernyh, T. N. Nazina, V. S. Ivoilov, S. S. Belyaev, E. S. Boulygina, Y. P. Lysov, A. N. Perov, A. D. Mirzabekov, H. Hippe, E. Stackebrandt, S. L'Haridon, and C. Jeanthon. 2003. Radioisotopic, culture-based, and oligonucleotide microchip analyses of thermophilic microbial communities in a continental high-temperature petroleum reservoir. Appl. Environ. Microbiol. 696143-6151. [PubMed] 4. Dahle, H., F. Garshol, M. Madsen, and N. K. Birkeland. 2008. Microbial community structure analysis of produced water from a high-temperature North Sea oil-field. Antonie van Leeuwenhoek 9337-49. [PubMed] 5. Frickey, T., and A. N. Lupas. 2004. PhyloGenie: automated phylome generation and analysis. Nucleic Acids Res. 325231-5238. [PubMed] 6. Grissa, I., G. Vergnaud, and C. Pourcel. 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35W52-W57. [PubMed] 7. Haft, D. H., J. Selengut, E. F. Mongodin, and K. E. Nelson. 2005. A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput. Biol. 1e60. [PubMed] 8. Huber, R., C. R. Woese, T. A. Langworthy, H. Fricke, and K. O. Stetter. 1989. Thermosipho africanus gen. nov., represents a new genus of thermophilic eubacteria within the“Thermotogales.” Syst. Appl. Microbiol. 1232-37. 9. Lerat, E., and H. Ochman. 2004. Psi-Phi: exploring the outer limits of bacterial pseudogenes. Genome Res. 142273-2278. [PubMed] 10. Lerat, E., and H. Ochman. 2005. Recognizing the pseudogenes in bacterial genomes. Nucleic Acids Res. 333125-3132. [PubMed] 11. Lopez, P., P. Forterre, H. le Guyader, and H. Philippe. 2000. Origin of replication of Thermotoga maritima. Trends Genet. 1659-60. [PubMed] 12. Magot, M., B. Ollivier, and B. Patel. 2000. Microbiology of oil reservoirs. Antonie van Leeuwenhoek 77103-116. [PubMed] 13. Makarova, K. S., N. V. Grishin, S. A. Shabalina, Y. I. Wolf, and E. V. Koonin. 2006. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol. Direct. 17. [PubMed] 14. Miroshnichenko, M. L., H. Hippe, E. Stackebrandt, N. A. Kostrikina, N. A. Chernyh, C. Jeanthon, T. N. Nazina, S. S. Belyaev, and E. A. Bonch-Osmolovskaya. 2001. Isolation and characterization of Thermococcus sibiricus sp. nov. from a Western Siberia high-temperature oil reservoir. Extremophiles 585-91. [PubMed] 15. Mongodin, E. F., I. R. Hance, R. T. Deboy, S. R. Gill, S. Daugherty, R. Huber, C. M. Fraser, K. Stetter, and K. E. Nelson. 2005. Gene transfer and genome plasticity in Thermotoga maritima, a model hyperthermophilic species. J. Bacteriol. 1874935-4944. [PubMed] 16. Nelson, K. E., R. A. Clayton, S. R. Gill, M. L. Gwinn, R. J. Dodson, D. H. Haft, E. K. Hickey, J. D. Peterson, W. C. Nelson, K. A. Ketchum, L. McDonald, T. R. Utterback, J. A. Malek, K. D. Linher, M. M. Garrett, A. M. Stewart, M. D. Cotton, M. S. Pratt, C. A. Phillips, D. Richardson, J. Heidelberg, G. G. Sutton, R. D. Fleischmann, J. A. Eisen, O. White, S. L. Salzberg, H. O. Smith, J. C. Venter, and C. M. Fraser. 1999. Evidence for lateral gene transfer between Archaea and bacteria from genome sequence of Thermotoga maritima. Nature 399323-329. [PubMed] 17. Nesbø, C. L., M. Dlutek, O. Zhaxybayeva, and W. F. Doolittle. 2006. Evidence for existence of “mesotogas,” members of the order Thermotogales adapted to low-temperature environments. Appl. Environ. Microbiol. 725061-5068. [PubMed] 18. Ollivier, B., and J. Cayol. 2005. Fermentive, iron-reducing, and nitrogen-reducing microorhganisms, p. 71-88. In B. Ollivier and M. Magot (ed.), Petroleum microbiology. ASM Press, Washington, DC. 19. Omelchenko, M. V., Y. I. Wolf, E. K. Gaidamakova, V. Y. Matrosova, A. Vasilenko, M. Zhai, M. J. Daly, E. V. Koonin, and K. S. Makarova. 2005. Comparative genomics of Thermus thermophilus and Deinococcus radiodurans: divergent routes of adaptation to thermophily and radiation resistance. BMC Evol. Biol. 557. [PubMed] 20. Podell, S., and T. Gaasterland. 2007. DarkHorse: a method for genome-wide prediction of horizontal gene transfer. Genome Biol. 8R16. [PubMed] 21. Ravot, G., B. Ollivier, B. K. Patel, M. Magot, and J. L. Garcia. 1996. Emended description of Thermosipho africanus as a carbohydrate-fermenting species using thiosulfate as an electron acceptor. Int. J. Syst. Bacteriol. 46321-323. 22. Sorek, R., V. Kunin, and P. Hugenholtz. 2008. CRISPR—a widespread system that provides acquired resistance against phages in bacteria and archaea. Nat. Rev. Microbiol. 6181-186. [PubMed] 23. White, O. 2004. Bacterial genome annotation at TIGR. In C. M. Fraser, T. D. Read, and K. E. Nelson (ed.), Microbial genomes. Humana Press, Totowa, NJ. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Antonie Van Leeuwenhoek. 2008 Jan-Feb; 93(1-2):37-49.
[Antonie Van Leeuwenhoek. 2008]Trends Genet. 2000 Feb; 16(2):59-60.
[Trends Genet. 2000]Genome Res. 2004 Nov; 14(11):2273-8.
[Genome Res. 2004]Nucleic Acids Res. 2005; 33(10):3125-32.
[Nucleic Acids Res. 2005]Nucleic Acids Res. 2007 Jul; 35(Web Server issue):W52-7.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2004; 32(17):5231-8.
[Nucleic Acids Res. 2004]Science. 2007 Mar 23; 315(5819):1709-12.
[Science. 2007]Biol Direct. 2006 Mar 16; 1():7.
[Biol Direct. 2006]Nat Rev Microbiol. 2008 Mar; 6(3):181-6.
[Nat Rev Microbiol. 2008]J Bacteriol. 2005 Jul; 187(14):4935-44.
[J Bacteriol. 2005]PLoS Comput Biol. 2005 Nov; 1(6):e60.
[PLoS Comput Biol. 2005]Nature. 1999 May 27; 399(6734):323-9.
[Nature. 1999]Appl Environ Microbiol. 1994 Apr; 60(4):1227-1231.
[Appl Environ Microbiol. 1994]Appl Environ Microbiol. 2003 Oct; 69(10):6143-51.
[Appl Environ Microbiol. 2003]Extremophiles. 2001 Apr; 5(2):85-91.
[Extremophiles. 2001]Antonie Van Leeuwenhoek. 2008 Jan-Feb; 93(1-2):37-49.
[Antonie Van Leeuwenhoek. 2008]Genome Biol. 2007; 8(2):R16.
[Genome Biol. 2007]Appl Environ Microbiol. 2006 Jul; 72(7):5061-8.
[Appl Environ Microbiol. 2006]BMC Evol Biol. 2005 Oct 20; 5():57.
[BMC Evol Biol. 2005]Genome Biol. 2007; 8(2):R16.
[Genome Biol. 2007]Antonie Van Leeuwenhoek. 2008 Jan-Feb; 93(1-2):37-49.
[Antonie Van Leeuwenhoek. 2008]Antonie Van Leeuwenhoek. 2000 Feb; 77(2):103-16.
[Antonie Van Leeuwenhoek. 2000]