Copyright © 2009 The Authors. Journal compilation © 2009 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd

Genome dynamics in major bacterial pathogens

1Centre for Molecular Biology and Neuroscience, Institute of Microbiology, University of Oslo, Oslo University Hospital (Rikshospitalet), Oslo, Norway
2Centre for Molecular Biology and Neuroscience, Institute of Microbiology, University of Oslo, Oslo, Norway
3Department of Informatics, University of Oslo, Oslo, Norway

Josep Casadesús, Editor

Correspondence: Tone Tønjum, Centre for Molecular Biology and Neuroscience, Institute of Microbiology, University of Oslo, Oslo University Hospital (Rikshospitalet), Sognsvannsveien 20, NO-0027 Oslo, Norway. Tel.: +47 23 07 40 65; fax: +47 23 07 40 61; e-mail: tone.tonjum/at/rr-research.no

Received January 30, 2009; Revised February 24, 2009; Accepted February 25, 2009.

Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.

Abstract

Pathogenic bacteria continuously encounter multiple forms of stress in their hostile environments, which leads to DNA damage. With the new insight into biology offered by genome sequences, the elucidation of the gene content encoding proteins provides clues toward understanding the microbial lifestyle related to habitat and niche. Campylobacter jejuni, Haemophilus influenzae, Helicobacter pylori, Mycobacterium tuberculosis, the pathogenic Neisseria, Streptococcus pneumoniae, Streptococcus pyogenes and Staphylococcus aureus are major human pathogens causing detrimental morbidity and mortality at a global scale. An algorithm for the clustering of orthologs was established in order to identify whether orthologs of selected genes were present or absent in the genomes of the pathogenic bacteria under study.
Based on the known genes for the various functions and their orthologs in selected pathogenic bacteria, an overview of the presence of the different types of genes was created. In this context, we focus on selected processes enabling genome dynamics in these particular pathogens, namely DNA repair, recombination and horizontal gene transfer. An understanding of the precise molecular functions of the enzymes participating in DNA metabolism and their importance in the maintenance of bacterial genome integrity has also, in recent years, indicated a future role for these enzymes as targets for therapeutic intervention.

Keywords: genome sequences, gene profile, DNA repair, recombination, competence, transformation

Introduction

Whole genome sequences of bacterial pathogens are continuously being completed, allowing a comparative genomic analysis of the adaptation of different species to their natural habitats. We selected the genomes of nine pathogens and two model organisms for analysis of their gene complements related to genome maintenance and horizontal gene transfer (HGT). In this context, the aim was to focus on major pathogens with relatively small genomes exhibiting vivid genome dynamics, related to competence for transformation, and also to include Gram-positive and Gram-negative representatives and model organisms, without covering a wall-to-wall panel for all infectious diseases. Among the major microbial pathogens dominating the human infectious disease scenario at the global level, Neisseria meningitidis, Haemophilus influenzae and Streptococcus pneumoniae are the causative agents of meningitis and airway-related infections. Neisseria gonorrhoeae is the causative agent of gonorrhoea, Helicobacter pylori is the cause of gastric and duodenal ulcers and precancerous gastric lesions, and Campylobacter jejuni is a main source of diarrhea.
Streptococcus pyogenes, ‘the flesh-eating bug’ or group A streptococcus, is a major tissue destructor and the cause of a number of disease types including tonsillitis, serious skin infections with tissue damage, erysipelas, scarlatina, rheumatic fever and puerperal fever. Staphylococcus aureus is a typical abscess-forming agent including the methicillin-resistant S. aureus (MRSA), which is an emerging and feared multiresistant opportunist. Mycobacterium tuberculosis is the cause of tuberculosis, infecting one-third of the world's population, making it the most widespread pathogen known. Thus, C. jejuni, H. influenzae, H. pylori, M. tuberculosis, pathogenic Neisseria, S. pneumoniae, S. pyogenes and S. aureus all contribute to frequent infectious disease cases of mild to grave severity, as well as a large number of deaths each year. Most of these pathogens are opportunistic, mucosal surface or skin organisms, while M. tuberculosis is an intracellular parasite. Representing members of the phyla Proteobacteria, Actinobacteria and Firmicutes (Fig. 1
Here, we summarize comparative genomic characteristics of this subset of human pathogens and studies that have contributed to our understanding of how they adapt to different environments, combat antibiotics and acquire increased virulence. We address selected parts of the total gene content of these major pathogens in order to elucidate how these reflect their major traits and enable them to persist in their respective environments. As such, gene complements shed new light on the basis for genome dynamics in microbial pathogens. In the context of genome maintenance and HGT, major emphasis will be placed on DNA repair, type IV secretion and transformation processes.

Methods

Identification of orthologs by use of the DNA Repair Gene Orthologs system

Initially, genes known to be involved in different types of DNA repair, replication and recombination as well as genes responsible for secretion systems (mainly type II and IV secretion), pilus biogenesis and DNA uptake were identified in representative bacterial species on the basis of a combination of manual selection, databases (COG), lists in review papers (Cascales & Christie, 2004; Chen et al., 2005) and other sources. A system for the identification of gene orthologs was then used in order to see whether orthologs of the selected genes were present or absent in the genomes of the selected pathogenic bacteria under study, listed in Table 1. Based on the known genes of the various functions and their orthologs in the selected pathogenic bacteria under study, an overview of the presence of the different types of genes was created. In brief, the DNA Repair Gene Orthologs system (T. Rognes, O. Aussedat & B. Eliassen, unpublished data) identifies orthologous genes based on sequence similarity. Initially, all protein sequences in RefSeq were compared using an all-vs.-all BLAST search, and all significant matches were identified (E < 1e-7).
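The filtering and clustering steps described in this subsection can be illustrated with a minimal sketch. It is not the unpublished DNA Repair Gene Orthologs implementation: the `Hit` record, the function name `cluster_orthologs`, the gene and organism labels, and the scores are all hypothetical, and the organism rule is implemented under one reading of the text (a merge is rejected if the merged cluster would hold two genes from the same organism, unless every gene in it comes from that single organism).

```python
from collections import namedtuple

# Hypothetical record for one pairwise BLAST match (not a real BLAST output format).
Hit = namedtuple("Hit", "gene_a gene_b score evalue")


def cluster_orthologs(hits, organism_of, max_evalue=1e-7):
    """Greedy single-linkage clustering of significant hits, best score first."""
    cluster_of = {}  # gene -> id of the cluster it currently belongs to
    members = {}     # cluster id -> set of genes in that cluster

    def ensure(gene):
        # Every gene starts out as its own singleton cluster.
        if gene not in cluster_of:
            cluster_of[gene] = gene
            members[gene] = {gene}
        return cluster_of[gene]

    # Keep only significant matches (E below the threshold), ignoring self-hits.
    significant = [h for h in hits if h.evalue < max_evalue and h.gene_a != h.gene_b]

    # Process pairs from the highest alignment score to the lowest.
    for hit in sorted(significant, key=lambda h: h.score, reverse=True):
        ca, cb = ensure(hit.gene_a), ensure(hit.gene_b)
        if ca == cb:
            continue  # already linked through a higher-scoring pair
        merged = members[ca] | members[cb]
        organisms = [organism_of[g] for g in merged]
        no_repeats = len(set(organisms)) == len(organisms)
        single_organism = len(set(organisms)) == 1  # inparalog-only cluster
        if no_repeats or single_organism:
            members[ca] = merged
            for gene in members.pop(cb):
                cluster_of[gene] = ca

    return sorted(sorted(m) for m in members.values())


# Toy data: all identifiers and numbers below are invented for illustration.
organism_of = {"A1": "orgA", "A2": "orgA", "B1": "orgB",
               "C1": "orgC", "D1": "orgD", "D2": "orgD"}
hits = [
    Hit("A1", "B1", 900.0, 1e-200),
    Hit("B1", "C1", 850.0, 1e-190),
    Hit("A1", "C1", 800.0, 1e-180),
    Hit("A1", "A2", 300.0, 1e-40),  # second orgA gene: merge is rejected
    Hit("D1", "D2", 250.0, 1e-30),  # both from orgD: allowed as inparalogs
    Hit("C1", "D1", 40.0, 1e-3),    # fails the E-value threshold
]
clusters = cluster_orthologs(hits, organism_of)
print(clusters)
```

In this toy run, a gene counts as "present" in an organism when one of that organism's genes falls in the same cluster as the query gene; genes without any significant hit simply do not appear in the output.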
Genes were linked using single-linkage clustering based on the protein sequence alignment scores, starting with the highest-scoring pairs of sequences and progressing to gradually lower-scoring pairs. Genes belonging to the same organism were not allowed to be clustered, unless all genes in the cluster belonged to that same organism (inparalogs). Access to the clustering information was provided through a web interface, where organisms and groups of genes could be selected. In addition to DNA Repair Gene Orthologs, the KEGG pathway database (http://www.genome.jp/kegg/pathway.html) (Kanehisa et al., 2008) was used for identifying orthologs. Positive hits, and lack thereof, were checked against homology identifications from http://www.microbesonline.org/ (Alm et al., 2005) and BLAST searches.

DNA repair and recombination

DNA repair

DNA repair is essential to all organisms (Fig. 2
Nearly 60 years ago, the studies of Escherichia coli DNA repair systems were initiated (Friedberg, 2008), and this organism is now the best-characterized bacterium of all. However, there is a growing body of evidence showing that not all bacteria function as E. coli, and in order to gain a wider genome-based perspective beyond the E. coli paradigm, a thorough analysis of different groups of bacteria is warranted. Campylobacter jejuni, H. influenzae, H. pylori, M. tuberculosis, pathogenic Neisseria, S. pneumoniae, S. pyogenes and S. aureus contribute to the majority of morbidity and mortality caused by bacteria worldwide. At the same time, E. coli and Bacillus subtilis serve as Gram-negative and Gram-positive model organisms, respectively. When comparing the DNA repair, recombination and replication (3R) enzymes in these bacteria with those present in E. coli (Table 2), a general theme seems to be the occurrence of a reduced number of genes in each class of DNA repair as compared with E. coli: in base excision repair, which normally removes subtle base damages (Seeberg et al., 1995), nei, alkA, nfi, nfo, tag and xthA are often not present in these pathogens. In the postreplication mismatch repair (MMR) pathway, base–base mismatches and insertion/deletion loops (IDLs) are recognized and excised (Schofield & Hsieh, 2003). The key enzymes of this pathway, MutS and MutL, are absent in some of the pathogens, as is MutH. A more detailed description of this pathway and some interesting features are given below. In direct repair, in which DNA lesions are chemically reversed (Mishina et al., 2006), the pathogens often lack the relevant genes, especially those encoding enzymes that handle alkylation damage (ada and alkB). On the one hand, the apparent lack of function might allow adapted genome dynamics. On the other hand, one needs to bear in mind that there might exist genes encoding products that perform identical functions, but that lack sequence homology.
In this context, new protein-encoding and RNA genes remain to be discovered. Also, a number of (error-prone) DNA polymerases and the SOS response regulator, LexA, are often not present in these bacteria. Likewise, when considering helicases, which are important proteins involved in various aspects of 3R activities, E. coli is the organism studied most and is generously equipped (Table 3). The only 3R pathways that appear to be ubiquitous in most organisms are nucleotide excision repair, recombinational repair and replication (Table 3). Nucleotide excision repair removes many types of bulky lesions from DNA, often of exogenous origin, while recombinational repair is crucial for the repair of DNA strand breaks that occur during recombination (de Laat et al., 1999). Replication is fundamental for the perpetuation of the genome, and this process is tightly coupled to most of the DNA repair pathways (Friedberg et al., 2006). Our observations (Table 3) corroborate the results of Eisen & Hanawalt (1999), who performed a phylogenomic study of DNA repair genes, proteins and processes in, among others, 11 bacterial species. An immediate question arising from these findings is how representative the number of DNA repair genes found in one strain is for each species, considering the variable levels of conservation and diversity in clonal and polyphyletic species. In a larger context, more central questions arising from the discussion above are as follows: how does the DNA repair enzyme repertoire affect the lifestyle of the bacteria? For instance, what does it mean for Neisseria sp. not to host an SOS response? Or for M. tuberculosis not to encode conventional mismatch repair? How do DNA repair enzymes from different pathways interact? How do DNA repair enzymes interact with enzymes from other cellular systems?
And most importantly, how are colonization, transmission and virulence of the pathogenic bacteria affected by the presence or the absence of specific DNA repair enzymes?
Although substantial wet-lab analysis regarding DNA repair enzymes in these pathogens is not available, some studies have been conducted. The readers are referred to recent summaries (Davidsen & Tønjum, 2006; Wang et al., 2006; Davidsen et al., 2007a) and the present review (dos Vultos et al., 2009) on DNA repair in N. meningitidis, H. influenzae, S. pneumoniae, H. pylori and M. tuberculosis. For instance, in N. meningitidis mutants inactivated in genes representing all the main DNA repair pathways, Davidsen et al. (2007b) have demonstrated that the highest spontaneous mutation frequency among the N. meningitidis single mutants is found in MutY-deficient strains, as opposed to mutS mutants in E. coli, indicating a possible role for meningococcal MutY in antibiotic resistance development. In general, distinct differences between N. meningitidis and established DNA repair characteristics in E. coli have been found. Interestingly, an increasing number of studies are focusing on the in vivo survival of DNA repair mutants in animal models. In M. tuberculosis, it has been shown that a nucleotide excision repair mutant and a DNA polymerase E2 mutant are attenuated in mice (Boshoff et al., 2003; Darwin & Nathan, 2005). In H. pylori, the base excision glycosylases MutY and Nth, as well as recombinational repair, are required for effective colonization of the stomach of mice (O'Rourke et al., 2003; Eutsey et al., 2007; Amundsen et al., 2008; Wang & Maier, 2008). These findings suggest that the host induces DNA lesions in the genomes of the infectious agents, and therefore effective DNA repair is crucial for the pathogen to be able to colonize its host (Fig. 2

Focus on MMR

MMR functions

The DNA MMR pathway is conserved from prokaryotes/bacteria to eukaryotes including humans. Defects in MMR increase mutation rates and cause genome instability, which in turn may expand the fitness landscape of bacterial pathogens.
In humans, impaired MMR may cause a range of cancers, and the documented association to hereditary nonpolyposis colorectal cancer (or Lynch syndrome) has been studied intensively (Lynch & Lynch, 1985). MMR is a postreplicative process and provides an efficient way of repairing both base mismatches and IDLs that are generated during DNA synthesis. In essence, MMR allows degradation of error-containing DNA and resynthesis of unimpaired DNA. High-fidelity DNA replication is central to genome maintenance, and evidence for a close spatio-temporal association between replication, recombination and MMR is growing (Simmons et al., 2008). MMR has been extensively investigated in E. coli and much of our current knowledge is obtained from studies of this model organism. In short, the process of MMR can be summarized as follows: MutS binds the mismatch and recruits MutL, which orchestrates several interactions including the activation of MutH, an endonuclease that nicks the unmethylated strand of newly synthesized DNA at GATC sites. The nicked stretch of nascent DNA is degraded by exonucleases with the aid of the DNA helicase UvrD before DNA polymerase III accurately resynthesizes DNA, and the remaining nick is sealed by DNA ligase. The process also depends on single-strand-binding proteins and the initial methylation of GATC sites by Dam methylase. The strand containing the mismatch is degraded in the 5′-to-3′ or 3′-to-5′ direction, depending on the location of the mismatch relative to the nick. The minimal human MMR has also been reconstituted in vitro by the following elements: MutSα, MutLα, ExoI, proliferating cell nuclear antigen (PCNA), replication factor C (which loads PCNA onto DNA), the single-strand-binding factor replication protein A, polδ and DNA ligase I (Constantin et al., 2005; Zhang et al., 2005). It is now clear, however, that many bacteria differ from E. coli in their basic MMR machinery.

The conundrum of MutH vs. MutL

The absence of MutH in many bacteria and all eukaryotes is particularly striking, because it suggests that strand discrimination and the initiation of excision may have a basis different from that of the methyl-directed process in E. coli and in certain other Gram-negative microorganisms. How can MMR discriminate between the template strand and the newly synthesized strand if it is not methyl-directed? It has been proposed that MMR can be directed to the newly synthesized strand by interacting with strand termini during replication. As such, MMR would constitute a part of the replisome and could direct repair activity from the termini between Okazaki fragments on the lagging strand or from the 3′-terminus on the leading strand, linking the two processes tightly (Jiricny, 2006). Indeed, the interaction between MutS from B. subtilis, which does not belong to the methyl-directed MMR group, and the β-clamp (PCNA in eukaryotes) was recently described in detail and supported the view that the MMR complex acts at, or in association with, the replication fork (Simmons et al., 2008). A breakthrough came from the laboratory of Paul Modrich when they found that human MutLα itself is an endonuclease that is able to produce a nick in nascent DNA (Kadyrov et al., 2006). A model where MutL can introduce random nicks on both sides of the mismatch has helped explain how a bidirectional DNA repair process was able to operate with a single exonuclease (ExoI in the human model) that degrades only in the 5′-3′ direction. Based on the finding that endonucleolytic hydrolysis of DNA requires one or two divalent metal cations, a metal-binding site with the motif [DQHA(X)2E(X)4E] was identified, which is conserved in archaeal, eukaryotic (PMS2 and MLH3) and eubacterial MutL homologs. Convincingly, this motif was absent in all MutL homologs from those Gram-negative organisms known to have MutH and the methyl-directed MMR pathway (Kadyrov et al., 2006).
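As a rough illustration of how such a motif can be screened for in protein sequences, the pattern DQHA(X)2E(X)4E translates directly into a regular expression, where X is any residue. The sketch below is an assumption-laden toy, not the pfam-alignment analysis used in this review: the function name `find_motif` and the three short sequences are invented, and the optional relaxed pattern simply allows M in place of Q at the second position, the substitution noted for Neisseria sp. in the text.

```python
import re

# DQHA(X)2E(X)4E written as a regular expression; "." stands for any residue X.
CANONICAL = re.compile(r"DQHA.{2}E.{4}E")
# Relaxed variant that also accepts M at the second position (Q/M substitution).
RELAXED = re.compile(r"D[QM]HA.{2}E.{4}E")


def find_motif(protein_seq, allow_q_to_m=False):
    """Return (1-based start position, matched segment) for each motif hit."""
    pattern = RELAXED if allow_q_to_m else CANONICAL
    return [(m.start() + 1, m.group()) for m in pattern.finditer(protein_seq.upper())]


# Invented toy sequences, not real MutL proteins.
with_motif = "MKTDQHAAHERILYEKLL"
with_q_to_m = "GGDMHAAVERLLFEKK"
without_motif = "MSTNPKLLAAG"

print(find_motif(with_motif))
print(find_motif(with_q_to_m))
print(find_motif(with_q_to_m, allow_q_to_m=True))
print(find_motif(without_motif))
```

A plain pattern scan like this only reports exact matches in unaligned sequences; judging conservation across hundreds of homologs, as done for the MutL family, requires an alignment-based comparison.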
We compiled a list of organisms containing a MutH homolog (Supporting Information, Table S1) and found that the distribution of MutH-dependent MMR is very limited. With a few exceptions, MutH is primarily found in bacterial species sorting under the class of Gammaproteobacteria. In our selection of pathogens, only H. influenzae belonging to the Pasteurellaceae (class Gammaproteobacteria) contains the mutH gene whereas the Neisseria (class Betaproteobacteria) and the staphylococci and streptococci (both class Bacilli) contain the MutH-less MMR (Table 2 and Fig. 1 An alignment based on the MutL pfam entries from 822 organisms was used to investigate sequence conservation of the metal-binding motif. We note that the motif in Neisseria sp. has a Q/M substitution in the second position, [D(Q/M)HA(X)2E(X)4E]. As Fig. S1 shows, the Q/M substitution is one of the most common substitutions. Whether this substitution from a polar to a nonpolar amino acid is functionally important for MutL activity and function remains to be investigated. It has, however, been established that MutL knock-outs in N. meningitidis produce a mutator phenotype as expected for MMR malfunction (Richardson et al., 2002).

MutS sequence diversity

Genome comparisons have also revealed differences in the distribution of the mutS gene and also great diversity within the mutS-group. Phylogenomic analysis has revealed that the mutS lineage split early in evolution and gave several distinct lines, where, importantly, only one belongs to the MMR pathway (Eisen, 1998; Lin et al., 2007).

When MMR is absent

One of the striking characteristics of the DNA repair system of M. tuberculosis, and partially of H. pylori, is the absence of recognized MMR homologs, which might suggest that these bacteria do not perform MMR activity. The consequent reduced fidelity in genome maintenance might add to the adaptive ability of M. tuberculosis, which otherwise seems to exist in genetic isolation.
On the other hand, MMR activity could exist without sequence homology to recognized MMR components, and the search for components exerting MMR activity in M. tuberculosis should still be pursued.

Distribution of helicases in pathogenic bacteria

Helicases are ubiquitous enzymes vital to all living organisms. They are motor proteins that move directionally along the nucleic acid phosphodiester backbone separating two annealed nucleic acid strands using energy from NTP hydrolysis. Helicases are involved in various aspects of cellular processes including replication, repair, recombination, transcription and RNA processing (Schmid & Linder, 1992; Matson et al., 1994). The vital role(s) that these enzymes play has been underscored by a number of genetic discoveries. Mutations in three out of the five human recQ homologs have been identified as causes of Werner (WRN), Bloom (BLM) or Rothmund–Thomson syndrome (RECQ4), respectively (Ellis et al., 1995; Yu et al., 1996; Kitao et al., 1999). Mutations interfering with the proper function of XPB and XPD helicases in humans have been linked to disorders such as Xeroderma Pigmentosum (XP), Cockayne syndrome (CS) and trichothiodystrophy (TTD) (Hoeijmakers, 1994; Vermeulen et al., 1994; de Boer & Hoeijmakers, 2000). First discovered in E. coli as a ‘DNA-unwinding enzyme’ more than 32 years ago, the number of helicases identified and characterized has since then increased tremendously (Abdel-Monem & Hoffmann-Berling, 1976). Most organisms host multiple helicases; for example the E. coli genome encodes at least 12 helicases (Matson et al., 1994). When examining the pathogens under study (Table 3), some helicases that are essential to cellular functions, such as DnaB and UvrD, are distributed across all the organisms. In addition, RecG, RuvA and RuvB helicases that participate in recombinational repair, as well as Mfd involved in nucleotide excision repair, are found in all the pathogens.
On the other hand, some helicases, such as RecQ, Ercc3, DinG and Lhr, are not universally distributed in our selected organisms (Table 3). The recQ gene homolog is present in H. influenzae, the Neisseria and S. aureus while it is missing in C. jejuni, H. pylori, M. tuberculosis, S. pneumoniae and S. pyogenes. The E. coli RecQ DNA helicase has served as a paradigm for the RecQ family and has been proposed to have multiple functions in the initiation of recombination, resolution of recombination intermediates and suppression of illegitimate recombination, and to be required for proper induction of the ‘SOS’ response to stalled replication forks (Hishida et al., 2004; Chow & Courcelle, 2007). The helicase and RNase D C-terminal (HRDC) domain is characteristic of many members of the RecQ helicase clade (Bernstein & Keck, 2005; Wu et al., 2005; Killoran & Keck, 2006a). Interestingly, RecQ, which contains a single HRDC domain in most organisms, has three HRDC domains in N. meningitidis and N. gonorrhoeae, where it plays a critical role in determining pilin antigenic variation and also participates in DNA repair (Mehr & Seifert, 1998; Killoran & Keck, 2006b, 2008; Stohl & Seifert, 2006). This might indicate that the multiplicity of HRDC domains can represent one specialized way to exert specificity in RecQ activities (Killoran et al., 2009). Even though RecQ is absent in M. tuberculosis, the HRDC domain is identified in its UvrD2 helicase, which is one out of the two UvrD-like paralogs found in mycobacteria (Morozov et al., 1997; Sinha et al., 2008). However, the HRDC domain appeared not to be essential for enzymatic activity of UvrD2, suggesting that it might be involved in DNA binding (Sinha et al., 2008). It was proposed that the HRDC domain might target RecQ-family proteins to specific DNA structures (Bernstein & Keck, 2005).
Another feature noted in the distribution of the helicases in the pathogenic bacteria under study is the presence of an XPB/ERCC3 homolog, which is found only in M. tuberculosis (Table 3) (Poterszman et al., 1997). XPB/ERCC3/RAD25 in eukaryotes is an integral subunit of the transcription factor TFIIH, which is involved in transcription initiation and nucleotide excision repair (Weeda et al., 1990; Schaeffer et al., 1993). Even though well studied in humans, the role of ERCC3 helicase in bacteria is not yet known. However, the occurrence of the ercc3 gene in prokaryotes seems to be limited to mycobacteria and Kineococcus radiotolerans (Biswas et al., 2009), which might have acquired the gene through infrequent HGT that might occur from eukaryotes to certain bacterial species (Poterszman et al., 1997; Aravind et al., 1999). The DinG helicase in E. coli is a damage-inducible, SOS-regulated, structure-specific enzyme, related to the human helicases XPD and BACH1, Rad3 from Saccharomyces cerevisiae and Rad15 from Schizosaccharomyces pombe (see Voloshin & Camerini-Otero, 2007 and references therein). Similar to XPB, XPD is also a part of the multisubunit complex TFIIH that plays a dual role in transcription initiation and nucleotide excision repair (de Boer & Hoeijmakers, 2000). Another less-distributed helicase, the long helicase-related protein (Lhr), which is the longest protein identified in E. coli, is also found in M. tuberculosis (Table 3) (Reuven et al., 1995). However, its exact function is not yet known. The fact that the M. tuberculosis genome encodes eukaryotic DNA repair proteins such as ERCC3 and Mpg might reflect past HGT events and enable this bacterium to survive in the hostile environment inside human macrophages.
HGT

Natural transformation in selected pathogens

Transformation, type IV pili and type II and IV secretions

The strong selective advantage of HGT has in several instances driven the evolution of complex machineries in favor of transformation. These divergent competence vehicles can ultimately cause the acquisition of novel traits such as antibiotic resistance, while they still allow homologous recombination that in turn could facilitate DNA repair and the fixation of beneficial alleles. Because different bacteria have solved their sex drives in various ways, the study of their strategies provides an exemplary case of convergent evolution. An interesting feature of all these systems is that they are based on structures already present in the cell that have become modified to facilitate and control genetic flux. Neisseria meningitidis, H. influenzae and the two streptococcal species S. pneumoniae and S. pyogenes all host systems composed of partners involved in the assembly of type IV pili and proteins with homology to type II secretion systems (Table 4) (Woodbury et al., 2006). Type IV pili are important virulence factors in many pathogens, are required for transformation and are also associated with many other functions including cell adhesion, twitching motility and biofilm formation (Tønjum & Koomey, 1997; Mattick, 2002). In order to identify the entire complement of proteins driving the transformation machinery, the complete set of neisserial DNA-binding proteins should be defined (Lång et al., 2009) (Fig. 3
Repeat sequences promoting transformation

HGT is associated with the risk of allowing entry of alien and potentially harmful DNA from other organisms such as viruses. Even similar and only slightly diverged DNA from other species may be disadvantageous in a new host with separate sets of adaptations and fine-tuned processes. Various strategies have therefore been used in many bacteria to control the entry and persistence of DNA, including ecological isolation such as competence induction by quorum sensing, restriction modification systems and stringent homologous recombination. The Neisseria sp. and members of the Pasteurellaceae, such as H. influenzae, discriminate between homologous and alien DNA by recognizing a short specific sequence in DNA from their own genus. These sequences are known as the DNA uptake sequence (DUS) and the uptake signal sequence (USS), respectively. DUS/USS are found in exceptionally high numbers throughout the genomes of these species, ensuring that almost any piece of the chromosome of a certain length will contain such a signal and hence be recognized, taken up and finally constitute a substrate for homologous recombination (Goodman & Scocca, 1991; Ambur et al., 2007). In contrast, the pneumococci and streptococci use quorum sensing of a competence-stimulating peptide and fratricide to ensure that most of the DNA available in their surroundings is homologous (Johnsborg et al., 2007). Again, the positive effects of allowing HGT have driven the evolution of different strategies that ultimately produce the same result, namely homologous recombination between closely related alleles. The identification of the positive effects that have influenced the evolution of transformation has not been straightforward, and many selective pressures have been proposed, most of which are not mutually exclusive. Firstly, it has been proposed that transformation evolved from its ability to provide nutrients for the recipient organism, also known as the sex-for-food hypothesis.
The rationale behind this notion is that in many organisms, transformation is induced upon starvation and would provide a good system to ensure uptake of high-energy compounds that could promote survival (Redfield et al., 1997; Palchevskiy & Finkel, 2006). Other hypotheses are based on incoming DNA providing a benefit after having been recombined into the chromosome and include, for example, sex for repair and in response to oxidative stress (Nedelcu & Michod, 2003; Nedelcu et al., 2004; Michod et al., 2008) and innovation (Ochman et al., 2000; Narra & Ochman, 2006; Jeon et al., 2008). An in-depth discussion and evaluation of these hypotheses are beyond the scope of this review and the debate on the evolution of transformation and bacterial sex in general is still ongoing (for excellent reviews, please see Johnsborg et al., 2007; Michod et al., 2008). Our own studies of DUS, itself a sign of transformation, have shown that the complex DUS-mediated control of transformation is not likely to have evolved for its ability to import completely novel sequences. The genomic distribution of these sequences showed that DUS were over-represented in the conserved common core genome, under-represented in regions under diversification, and absent in both recently acquired genes and recently lost core genes (Treangen et al., 2008). Previously, we have found that DUS and USS in Neisseria sp. and H. influenzae, respectively, are biased toward 3R genes, suggesting that a functional relationship between genome maintenance and transformation exists (Davidsen et al., 2004). In addition, DUS occurrences correlate with the size of conversion fragments. We have therefore proposed that transformation has evolved from its ability to incorporate homologous sequences, either for the regeneration of damaged DNA or from benefits associated with the reassortment of alleles, or both (Treangen et al., 2008).
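The kind of over- or under-representation discussed here rests on counting uptake-signal occurrences per unit length in different genome regions. The following is a minimal sketch of such a count, not the analysis of Treangen et al. (2008). It assumes the 10-mer neisserial DUS 5′-GCCGTCTGAA-3′, which is taken from the wider literature rather than from this text; the function names and the toy sequence are hypothetical.

```python
# Assumed signal: the 10-mer neisserial DUS (from the literature, not this text).
DUS = "GCCGTCTGAA"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def count_signal(region, signal=DUS):
    """Count occurrences of the signal on either strand (overlaps allowed)."""
    region = region.upper()
    signal = signal.upper()
    targets = {signal, reverse_complement(signal)}
    k = len(signal)
    return sum(region[i:i + k] in targets for i in range(len(region) - k + 1))


def density_per_kb(region, signal=DUS):
    """Signal occurrences per 1000 bp of the given region."""
    return 1000.0 * count_signal(region, signal) / len(region)


# Invented toy region: one forward-strand and one reverse-strand copy.
toy_region = "AAA" + DUS + "TTT" + reverse_complement(DUS) + "CC"
print(count_signal(toy_region), round(density_per_kb(toy_region), 2))
```

Comparing `density_per_kb` between annotated core and variable regions of a real genome would give a first impression of the bias; a proper test would also need a null expectation based on sequence composition.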
We hypothesize that the core genomes in species that have no means of biasing their DNA uptake, such as S. pneumoniae, H. pylori and C. jejuni, still could have experienced more recombination during the evolutionary time than the more variable parts of their respective genomes. Although speculative, such frequent nonrandom allelic replacements could be generated by biases at the level of recombination by, for example, repeats. We have no indication that nondiscriminatory transformation differs from the signature-discriminating DUS/USS system of Neisseria/Pasteurellaceae in its regenerative properties or that these species experience fundamental differences in DNA damage or the need to reassort alleles. On the contrary, we suspect that these diverse transformation systems have been shaped by the same evolutionary forces. Thus, the study of the genomics of DUS/USS represents an example of convergent evolution (Davidsen et al., 2004), which may also be helpful in generating new testable hypotheses regarding transformation and its history in other organisms. Comparative genomics in a multispecies approach could elaborate on this hypothesis and increase our understanding of the selective advantages of transformation. The hypothesis that transformation evolved due to its ability to provide substrate for recombination has also been strengthened by the observation that these two processes are physically linked in space and time (Kidane & Graumann, 2005).

Conjugation-related genes: type IV secretion in bacterial DNA transfer/sex

The versatile type IV secretion (T4S) systems

T4S systems are involved in the transport of macromolecules such as proteins and DNA across the outer envelope of bacteria. These systems are primarily known from Gram-negative bacteria, where they transport components over both the cytoplasmic and the outer membranes.
Interestingly, the conjugation systems that transfer plasmid DNA are one form of the T4S system and are also described for Gram-positive bacteria (Grohmann et al., 2003). While proteins are secreted by the T4S systems and can also be transferred through the plasma membrane of a target cell, DNA can also be secreted and even imported by the T4S system (Ding et al., 2003; Economou et al., 2006). Bacterial conjugation systems represent a prominent subfamily of the T4S systems, while the VirB/D4 system in A. tumefaciens is the prototype example of a T4S system. A subclassification of the T4S systems was established based on ancestral lineage, building on two main groups (Christie & Vogel, 2000), followed by a systematic grouping of all known T4S systems based on their function in conjugation, DNA uptake and release, and effector translocation (Cascales & Christie, 2004). Further analysis of the evolution of the T4S systems based on a protein homology network divided them into four groups (Medini et al., 2006). Core T4S proteins are part of all T4S systems and can be complemented with independently recruited subunits or proteins to gain a system-specific function (Ding et al., 2003; Medini et al., 2006). It was also suggested that the involvement of particular T4S systems in HGT between species led to a functional divergence of these systems (Frank et al., 2005). The secretion of DNA has probably evolved from protein secretion systems (Cascales & Christie, 2003, 2004). The DNA-binding relaxases recognize and translocate DNA, which is suggested to lead to an only coincidental ‘hitch-hiking’ of DNA together with the protein secreted (Cascales & Christie, 2004; Lybarger & Sandkvist, 2004; Chen et al., 2005). Among the pathogenic bacteria assessed in this review, C. jejuni, H. pylori, H. influenzae and N. meningitidis as well as N. gonorrhoeae possess one or more T4S systems (Table 5) (Hofreuter et al., 1998, 2001; Bacon et al., 2000, 2002; Cascales & Christie, 2004; Snyder et al., 2005). The C. jejuni strain 81-176 pTet plasmid is a true conjugative plasmid. It coexists with the smaller pVir plasmid, which also encodes a T4S system and influences C. jejuni virulence (Bacon et al., 2000; Batchelor et al., 2004). Other conjugative plasmids in C. jejuni have, in contrast to pVir, no influence on the invasiveness of this bacterium, but often encode antibiotic resistance (Schmidt-Ott et al., 2005; Dasti et al., 2007). Conjugative plasmids are also found in other Campylobacter species (Boosinger et al., 1990; Fouts et al., 2005).
Another pathogenic member of the order Campylobacterales is H. pylori, which has three T4S systems: the Com-system, the Cag- or HP-system and the Tfs3-system (Chen et al., 2005; Zhong et al., 2007). While the Cag- or HP-system is used for exotoxin effector translocation and the function of the Tfs3-system is unknown, the Com-system has evolved for DNA transport (Hofreuter et al., 2001). The conjugation-like Com system is special in that it is used for the uptake of DNA, thus translocating DNA in a direction opposite to that for secretion. This might also be the case for the C. jejuni Vir system. These ‘competence’ systems have probably evolved to increase the possibilities for genetic variation or renewal, leading to enhanced cellular fitness, survival and invasion of the eukaryotic host (Ding et al., 2003). In addition, the transfer of chromosomally encoded properties by a conjugation-like mechanism may contribute to horizontal DNA transfer between different members of the Campylobacterales group (Oyarzabal et al., 2007). Little is known about the T4S systems in H. influenzae. There are two Tra-like plasmid-encoded systems (Smoot et al., 2002; McGillivary et al., 2005).
T4S systems on genomic islands (GIs)
Recently, a GI that contains a T4S system evolutionarily distant from the plasmid-based systems, and that is a vector for antibiotic resistance, was discovered (Juhas et al., 2007a, b). This system belongs to a new type of T4S system found in a wide range of bacteria. These were named GI-like T4S systems and allow GIs encoding many different properties to mobilize and spread (Juhas et al., 2008).
The gonococcal genetic island (GGI) was first identified in N. gonorrhoeae. The T4S system encoded by the GGI is related to the conjugational F plasmid system of E. coli and is used by the bacteria for secretion of chromosomal DNA (Ding et al., 2003; Hamilton et al., 2005). Later, complete and partial forms of the GGI were also found in N. meningitidis (Snyder et al., 2005). Approximately 80% of gonococcal strains and some N. meningitidis strains carry the GGI, probably inserted by the site-specific recombinase XerCD into the dif site (Dillard & Hamilton, 2002; Hamilton et al., 2005). The sequence of the GGI is characterized by a low G+C and low DUS content, suggesting that it is not of neisserial origin, but the amelioration of some regions to a typical neisserial composition indicates that GGIs have already existed in neisserial genomes for a long time. As chromosomal DNA is secreted by the T4S system encoded by the GGI, no direct contact between the donor and recipient of DNA is needed. This may be so because Neisseria species are naturally competent throughout their life cycle and preferentially take up DUS-containing DNA (Lie, 1965; Sparling, 1966; Mathis & Scocca, 1982; Goodman & Scocca, 1991). The GGI contains only one DUS per 10 kb, which is only about 10% of the average DUS density found in the whole genome, but it also contains several incomplete DUS with a single mutation, suggesting that DUS may be in the process of becoming established in the GGI. The imperfections of one dif site were shown to be responsible for the stable maintenance of the GGI in the neisserial genome, because reversion to a perfect site led to significant loss of the GGI (Dillard & Dominguez, 2008). The two parts of the GGI missing in N. meningitidis serogroup H and Z strains are flanked by DUS (Snyder et al., 2005). These sites may be the sites of recombination that led to an excision of the sequence blocks (Treangen et al., 2008). Because the sequences surrounding the GGI are still present, they may serve as a target for reintroduction of DNA segments by recombination with GGI DNA taken up by the bacterium through the DUS-specific uptake/recombination system. The effects of the GGI are still mostly obscure. 
It was shown that neither the T4S system components nor the GGI-encoded lytic transglycosylases AtlA and LtgX are required for peptidoglycan fragment release in culture, but that the presence of the GGI can bypass the TonB-dependent iron acquisition of intracellular gonococci (Hagen et al., 2006; Cloud-Hansen et al., 2008). On the other hand, the high number of N. gonorrhoeae strains that host a GGI, which can support efficient conjugation, may explain why plasmids, and the consequent antibiotic resistance under selective pressure, are more prevalent in gonococcal than in meningococcal strains.
Conclusions
Acquisition and loss of genetic material are essential forces in bacterial microevolution, also challenging functions involved in DNA repair, recombination and HGT. These functions have been repeatedly linked with adaptation of lineages to new lifestyles, and in particular to pathogenicity. Comparative genomics has the potential to elucidate this genetic flux, but there are many methodological challenges involved in inferring gene content and evolutionary events from collections of genome sequences. Here, we have described a method for detecting the presence or the absence of genes in whole genome sequences to elucidate the impact of gene content on microbial lifestyle. Our approach is purely sequence based and relies on gene identification. We have demonstrated its use on datasets from the genomes of C. jejuni, H. influenzae, H. pylori, M. tuberculosis, pathogenic Neisseria, S. pneumoniae, S. pyogenes and S. aureus. In all these examples, we found interesting variations in the presence and absence/gain and loss of genetic material, which correlate with their niches and fitness for survival. Competence for transformation, according to the gene content detected in many genomes, might be underestimated in many microbial species, including pathogens. 
At the same time, the different strategies by which Gram-negative and Gram-positive organisms achieve competence for transformation, with the same net outcome, namely preferential uptake of their own DNA, represent an exciting diversity in biology. In this context, repeats such as DUS and USS tend to accumulate in the core genome (Treangen et al., 2008), emphasizing their importance. Recently, the linkage between transformation and recombination, including the close proximity of the recombination process to the cytoplasmic side of the inner membrane (Kidane & Graumann, 2005), has been elucidated. Taken together, this study of the presence and the absence of genes related to DNA metabolism and HGT sheds light on how the gene profile affects the lifestyle of microbial pathogens in their respective niches.
Acknowledgments
We are grateful to Ophélie Aussedat for her work with the DNA Repair Genes Orthologs database. This work was supported by grants from the Research Council of Norway.
Additional Supporting Information may be found in the online version of this article:
Fig. S1. Sequence conservation of the MutL metal-binding motif DQH/MA(X)2E(X)4E based on a MutL alignment of entries from 822 organisms.
Table S1. Bacteria containing a MutH homolog.
Please note: Wiley-Blackwell is not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.
References