![]() | ![]() |
Formats:
|
||||||||||||||||||||
Lineage specific recombination and positive selection in coding and intragenic regions contributed to evolution of the main Listeria monocytogenes virulence gene cluster 1Department of Food Science, Cornell University, Ithaca, NY 2Department of Animal Science, Colorado State University, Ft. Collins, CO 3National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba *Corresponding author: Cornell University, Department of Food Science, 412B Stocking Hall, Ithaca, NY 14853, Phone: 607-254-2838, Fax 607-254-4868, Email: mw16/at/cornell.edu The publisher's final edited version of this article is available at Infect Genet Evol.Abstract The major virulence cluster of Listeria monocytogenes harbors six virulence genes that encode proteins critical for the intracellular life cycle of this human and animal pathogen. In this study, we determined the sequence (8,709 nt) of the virulence gene cluster (including the six main virulence genes) in 40 L. monocytogenes isolates from different source populations (human clinical cases, animal clinical cases, foods, and natural environments). An alignment of the full length cluster as well as individual gene alignments and alignments of intragenic regions were used for phylogenetic, recombination, and positive selection analyses. Initial phylogenetic analyses showed that the sequences represented two main clusters, consistent with previously defined L. monocytogenes phylogenetic lineages. The 40 sequences represented 25 distinct allelic types and the overall alignment included 592 polymorphic sites. Overall, our data show that (i) virulence genes in the main L. monocytogenes virulence gene cluster include highly conserved genes (i.e., hly, prfA) as well as diverse genes that appear to have evolved by positiveselection (mpl, actA, plcA), (ii) recombination has played an important role in the evolution of the virulence gene cluster, but is limited to lineage II isolates, and (iii) the promoter region driving the transcription of virulence genes transcribed early in intracellular infection (i.e., hly, plcA) has evolved by positive selection. The genes and intragenic regions in the L. monocytogenes virulence gene cluster thus have evolved independently, despite their close physical linkage, likely reflecting distinct selective pressures associated with expression and function of the proteins encoded in this region. 1. Introduction Listeria monocytogenes is a Gram-positive facultative intracellular pathogen that can cause gastroenteritis as well as invasive disease in humans and animals (Vazquez-Boland et al., 2001). Human listeriosis is typically foodborne and L. monocytogenes can be found in many different environments, including food processing plants, farms, and urban and natural environments (Lappi et al., 2004; Sauders et al., 2006). A number of virulence genes important in the intracellular life cycle of L. monocytogenes have been identified. Six key L. monocytogenes virulence genes (i.e., prfA, plcA, hly, mpl, actA, and plcB) that are critical for the intracellular life cycle, including vacuolar escape and cell-to-cell spread, are located in a virulence gene island, which is often referred to as the “prfA virulence gene cluster (pVGC)“ (Ward et al., 2004) or the “Listeria pathogenicity island 1 (LIPI-1)” (Vazquez-Boland et al., 2001). hly encodes listeriolysin O (LLO), which facilitates L. monocytogenes escape from the host cell vacuole. plcA and plcB encode the two phospholipases, (PI-PLC and PC-PLC), which also contribute to disruption of the host cell vacuole membrane (Gaillard et al., 1987; Smith et al., 1995), while mpl encodes a zinc metalloproteinase needed to activate PC-PLC. actA encodes the ActA protein, which is responsible for host actin accumulation and motility. prfA, which can be transcribed from at least three different promoters (Camilli et al., 1993; Freitag et al., 1993), encodes the transcriptional factor PrfA; PrfA positively regulates transcription of the other genes in the virulence cluster as well as of some other virulence genes (e.g., inlA, inlB) (Chakraborty et al., 1992; Domann et al., 1992; Leimeister-Wachter et al., 1990; Scortti et al., 2007). PrfA-dependent transcription of virulence genes appears to be hierarchical, such that virulence genes needed early in the intracellular life cycle (e.g. hly, plcA) are more rapidly activated than genes required later (e.g., actA) (Sheehan et al., 1995). While hly and plcA are transcribed in opposite directions using a shared PrfA binding site that overlaps with the −35 site for both promoters, hly can also be transcribed from a second putatively PrfA-independent promoter (Mengaud et al.,1989). A PrfA-dependent promoter upstream of mpl generate a monocistronic mpl transcript as well as a polycistronic mpl-actA-plcB transcript, while another PrfA-dependent promoter upstream of actA can generate an actA as well as an actA-plcB transcript (reviewed in Kreft and Vazquez-Boland, 2001). In addition to the coding sequences, the prfA virulence gene cluster thus includes intergenic regions with complex regulatory elements that are needed for virulence of L. monocytogenes. L. monocytogenes is a diverse organism with a structured population that includes at least three phylogenetic lineages (Rasmussen et al., 1995). While two lineages (I and II) are common, lineage III appears to be uncommon (Jeffers et al., 2001; Nightingale et al., 2004; Sauders et al., 2006). Although isolates from all lineages seem to be capable of causing disease, epidemiological and phenotypic differences among these lineages have been observed. Lineage II strains, although the most prevalent in foods, are underrepresented among human sporadic listeriosis cases and are seldom associated with human listeriosis outbreaks (De Cesare et al.,2007; Gray et al., 2004; Jeffers et al., 2001; McLauchlin et al., 2004; Norton et al., 2001). Conversely, lineage I isolates, and in particular lineage I serotype 4b strains, have been associated with most human outbreaks and the majority of human listeriosis cases in most countries despite their lower prevalence in foods (McLauchlin et al., 2004). These epidemiological findings suggested that all or some lineage I strains could be more virulent than lineage II strains, a hypothesis that has been supported by some in vitro assays, including tissue culture assays that found that lineage I isolates, on average, seem to form more, and larger plaques in mouse L cells (Gray et al., 2004; Norton et al., 2001; Wiedmann et al., 1997). Moreover, several lineage II isolates were shown to carry nonsense or frameshift mutations in virulence genes (particularly in inlA), which may lead to virulence attenuation (Rousseaux et al.,2004; Nightingale et al., 2005a; Orsi et al., 2007; Roche et al., 2005; Velge et al., 2007). The goal of this study was to use evolutionary analyses (including positive selection and recombination analyses) of both coding and intergenic regions in the prfA cluster to improve our understanding of the evolution and adaptation of this virulence gene cluster in L. monocytogenes, using isolates representing different lineages and obtained from different source population (i.e., human and animal clinical cases, foods, and natural environments). While one study (Ward et al., 2004) has previously used DNA sequence data for the prfA cluster for a phylogenetic analyses, comprehensive analyses on the contributions of recombination and positive selection to the evolution of this virulence gene cluster have been missing so far. In particular, positive selection analyses have only been reported for a fragment of one virulence gene (i.e., actA) in the prfA cluster (Nightingale et al., 2005b) and no positive selection analyses have been reported for the non-coding and regulatory regions in the prfA cluster. 2. Methods 2.1. Listeria monocytogenes isolates The 40 L. monocytogenes isolates used here were selected from a larger set of 132 isolates as previously described (Orsi et al., 2007; Tsai et al., 2006); DNA sequences for selected internalin genes had previously been reported and analyzed for these isolates (Orsi et al., 2007; Tsai et al., 2006). Briefly, the original set of 132 isolates represented isolates from four source populations, including human clinical cases (n = 60), animal clinical cases (n = 30), foods (n = 30), and pristine environment (n = 12); these isolates had been obtained from two previous studies, which provide details on these isolates (Nightingale et al., 2005b; Sauders et al., 2006). For each of the four source populations, ten isolates were randomly selected (using a random number table) to yield the set of 40 isolates used here (Supplemental Table 1; all supplemental are available at http://www.foodscience.cornell.edu/cals/foodsci/research/labs/wiedmann/links/orsi_2008.cfm). Serotypes were previously reported for 30 isolates (Nightingale et al., 2005b) and were determined for 10 isolates using the methods previously described (Nadon et al., 2001); serotype data were used to determine clustering of serotypes within a lineage. 2.2. DNA amplification and sequencing The L. monocytogenes virulence gene cluster (see Figure 1
2.3. Descriptive analyses Clustal W implemented in Megalign (Lasergene) was used to create alignments of each gene and of the whole virulence cluster. DNASP version 4.0 (Rozas and Rozas, 1999) was used to assess the number of allelic types, number of polymorphic sites (S), number of nonsynonymous (NNS) and synonymous substitutions (NSS), and nucleotide diversity (π) in each alignment. 2.4. Phylogenetic analysis The model of DNA substitution that best fit each data was identified using MODELTEST (Posada and Crandall, 1998) and was used to create phylogenetic trees using the maximum likelihood method implemented in PAUP (http://paup.csit.fsu.edu/). The phylogeny for the full virulence gene cluster was rooted using the L. ivanovii subsp. londoniensis NRRL 33021 sequence (GenBank accession number AY510073) previously published (Ward et al., 2004). Support for each branch was assessed by the bootstrap procedure in PAUP (100 replicates). 2.5. Positive selection analyses of coding regions Alignments and trees generated for each of the six coding regions within the virulence gene cluster were used to assess whether a given gene had evolved by positive selection; only the sequences for the 40 L. monocytogenes isolates were used for these analyses. Trees were generated using the maximum likelihood method as described above. Positive selection in coding regions can be assessed by comparing the rate of nonsynonymous substitutions (dN) to the rate of synonymous substitutions (dS) in that region. By assuming that the rate of synonymous mutation is constant and not targeted by natural selection, a dN/dS > 1 indicates that nonsynonymous mutations were more likely to be fixed in the population than synonymous mutations suggesting positive selection of nonsynonymous mutations. While some studies use an estimation of a single average dN/dS for a whole gene to identify genes under positive selection, the approaches implemented “Phylogenetic Analysis using Maximum Likelihood” (PAML) provide an advancement over these methods by allowing for detection of positive selection at specific sites and in specific lineages (Lefebure and Stanhope, 2007; Yang et al., 2000; Zhang et al., 2005). In PAML, three null models, in which positive selection is not allowed, and three alternative models, in which positive selection is allowed, can be compared using a Likelihood Ratio Test (LRT) to assess whether the alternative model is significantly better than the null model. If an alternative model is accepted over the null model, one can conclude that a given gene evolved by positive selection (Yang et al., 2000). Specifically, the null models M0, M1a, and M7 are compared to the alternative models M3, M2a, and M8, respectively. M0 assumes only one class of ω along the sequence whereas M3 assumes three classes of ω; the comparison between these two models can be used to determine whether selection differed along a given sequence (suggesting, for example, that regions in a gene differ in their selection). M1a assumes two classes of codon sites each with a different value of ω (i.e., ω0 < 1 and ω1 = 1), while M2a assumes one additional class of sites (i.e., ω2) with an ω > 1. Similarly, M7 assumes ten classes of sites with ω following a beta distribution that approximate a continuous distribution constrained to values between 0 and 1, while M8 assumes an extra class of sites with ω1 > 1. Hence, comparisons between M1a and M2a as well as comparisons between M7 and M8 assess whether a gene has evolved by positive selection; acceptance of an alternative model (i.e., M2a or M8) suggests that a given gene evolved by positive selection. However, the test between M7 and M8 is less conservative than the test between M1a and M2a as models M7 and M8 do not assume a class of sites with ω = 1. A Bayes Empirical Bayes (BEB) approach implemented in PAML was used to further identify specific codon sites with a significant posterior probability of having evolved by positive selection. In addition to the tests described above, branch-site models as described by Zhang et al. (2005) were used to assess whether a given gene evolved by positive selection within specific branches in a tree; we specifically tested branches within L. monocytogenes lineage I or II for positive selection in each gene. Similar to the tests described above, the branch-site test relies on the comparison of a null model with an alternative model. While both the null model and the alternative model have four site classes, only the alternative model allows positive selection (i.e., ω classes with ω > 1) in the branch(es) of interest (Zhang et al., 2005). Therefore, if the alternative model is accepted over the null model, one can conclude that a given gene evolved under positive selection in the branch(es) of interest. A BEB approach was also used to identify specific codon sites evolving by positive selection in the branches of interest. All analyses for assessing evolution by positive selection were carried out using PAML version 3.15. 2.6. Positive selection analyses of the non-coding regions Six separate alignments (each including a given gene and its upstream and downstream non-coding regions) were used to construct phylogenetic trees; these alignments and phylogenetic trees were then used to identify non-coding fragments that have evolved by positive selection using the program EvoNC (Wong and Nielsen, 2004). Like PAML, EvoNC assumes that synonymous mutations are neutral and happen in a constant rate within the coding fragment. Hence, calculating the ratio of the estimated nucleotide substitution rate in the non-coding region to the estimated synonymous substitution rate in the coding region ratio (this ratio is designated as ζ) provides a numerical approach to assess whether a non-coding region has evolved by positive selection. Three models implemented in EvoNC can be used in two different tests to determine whether a non-coding region is under positive selection; the neutral model assumes two classes of nucleotide sites with ζ0 < 1 and ζ1 = 1; the two-category model assumes two classes with ζ0 < 1 and ζ1 ≥ 1; and the three-category model assumes three classes of nucleotides with: ζ0 < 1, ζ1 = 1, and ζ2 > 1. The test comparing the neutral model and the three-category model is called Test 1; Test 1 is more conservative than Test 2, which compares the neutral model and the two-category model (Wong and Nielsen, 2004). 2.7. Recombination analyses The Sawyer’s test implemented in GENECONV (http://www.math.wustl.edu/~sawyer) was initially used to identify recombinant fragments within the virulence cluster, using the default settings of the program. As previously described (Nightingale et al., 2005b), GENECONV fragments that shared the same 5’ or 3’ ends were considered to represent the same recombination event. Clonal Frame v1.1 (Didelot and Falush, 2007) was used to identify isolates that have recently imported external DNA fragments in the virulence cluster. Two independent runs of the program were used to estimate a 95% consensus tree. Branches supported by the tree were analyzed for recombination. Only recombination events with probability higher than 90% were considered. The program was run with 100,000 burn-in iteration and 100,000 sampling iteration. 3. Results 3.1. Descriptive analysis The alignment of the virulence gene cluster for the 40 L. monocytogenes isolates was 8,709 bp long with an average G + C content of 36.3 %. A total of 7,984 sites were monomorphic; 133 sites in the alignment represented indels (insertions or deletions). Overall, 592 polymorphic sites (S) were identified (excluding the sites with indels), including 117 singletons (i.e., a polymorphic sites with substitution in only one isolate) and 475 informative sites (451 and 24 sites with two and three nucleotide variants, respectively). The 40 sequences represented 25 unique virulence gene cluster alleles. The overall nucleotide diversity (π) was estimated as 0.0246; π was significantly higher for lineage II isolates (21 isolates; πII = 0.0064) than lineage I isolates (18 isolates, πI = 0.0030) (Table 1). Lineage I and II isolates shared nine substitutions whereas 267 were fixed in either lineage I or II.
Descriptive analyses of the six genes in the virulence gene cluster showed considerable variation in nucleotide diversity and other parameters among these genes (Table 1). actA was the most diverse gene (π = 0.0383) with a total of 184 polymorphic sites and 197 substitutions (excluding polymorphic sites in the indel regions). Thirteen polymorphic sites had three different nucleotides. Twelve isolates (11 in lineage I and 1 in lineage II) showed a single large deletion of 105 nucleotide in actA (representing the majority of the indel sites observed in the virulence gene cluster), representing a previously reported deletion (Smith et al., 1996; Wiedmann et al., 1997; Moriishi et al., 1998; Jacquet et al., 2002) of one of the four Proline-Rich Repeats (PRR) in ActA. On average, lineage I isolates showed significantly lower (P < 0.0001) nucleotide diversity in actA (πI = 0.0052) as compared to lineage II isolates (πII = 0.0149) (Table 1). prfA was the most conserved gene (π= 0.0143) with only 25 substitutions, including only two nonsynonymous substitutions; one of these substitutions (K197N) is present in one lineage II isolate (FSL S4–304) and the lineage III isolate (FSL F2–695). While prfA and mpl showed higher nucleotide diversity (π) in lineage I isolates, all other genes in the virulence gene cluster showed higher nucleotide diversity in lineage II isolates (Table 1). 3.2. Phylogenetic analysis The maximum likelihood tree constructed using the complete virulence gene cluster sequence showed two main clusters (Fig. 2
3.3. Positive selection analysis of the coding regions Three of the six coding regions in the virulence gene cluster (i.e., mpl, plcA, and actA) showed evidence for positive selection. For all three genes M0 was rejected in favor of the alternative model M3 (Table 2), suggesting that selection is not homogeneous across these genes. For mpl and plcA, the branch-specific analyses showed evidence for positive selection among branches within lineage I and II, respectively (Table 2). For mpl, 1.5% of sites were classified into the site category for positively selected sites (ω = 7.28) in the branch specific model for lineage I; one mpl aa site (aa 103) showed a posterior probability > 95% of being under positive selection (Table 2). For plcA, 3.8% of sites were classified into the site category for positively selected sites (the ω for this site category was estimated to be 2.82) in the branch-specific model for lineage II; no plcA sites were identified as having a posterior probability > 95% of being under positive selection.
For actA, both overall tests (i.e., M1a vs M2a and M7 vs. M8, Table 2) showed evidence for positive selection (Table 2). In addition, the branch-specific analyses for actA showed evidence for positive selection among lineage II branches (Table 2). M8 estimated that 9.6% of the sites evolved by positive selection (ω = 2.40) and identified six aa sites (aa 112, 123, 291, 449, 461 and 522) with posterior probabilities > 95% of being under positive selection. The more conservative M2a model estimated that 7.7% of actA aa sites evolved by positive selection (ω = 2.55) and identified one aa site (aa 461) with posterior probability > 95% of being under positive selection. In the branch specific analysis, 2.5% of the sites were estimated to have evolved by positive selection (ω = 3.59), but no sites presented a posterior probability > 95% of being under positive selection. ActA aa site 498, which had previously been reported, using a different isolate set, to be under positive selection (Roberts and Wiedmann, 2006) was classified into the positively selected sites, although with a posterior probability < 95 % (i.e., 63 %). Among lineage II isolates, the nucleotide diversity among nonsynonymous (πNNS) and synonymous (πNSS) sites were higher for the genes identified as being under positive selection in this lineage, i.e., plcA and actA (Table 1), suggesting that plcA and actA evolved faster than the other virulence genes in this lineage. Moreover, actA showed the highest πNNS among all isolates (0.0226) while prfA and hly showed the lowest πNNS among all isolates (Table 1). 3.4. Positive selection and polymorphism analysis for non-coding regions Analysis in non-coding regions was performed separately using each gene including the upstream and downstream non-coding regions, except that no sequence downstream of prfA (i.e., on the 5’ end of the virulence cluster) and only 47 nucleotides downstream of plcB (i.e., on the 3’ end of the virulence gene cluster) were available for analyses. Both, the analysis with plcA and the analysis with hly (representing two adjacent genes), found significant evidence for positive selection in the non-coding areas; five out of the six sites identified as being under positive selection (with a posterior probability > 95%; Table 3) fall in the intergenic region between plcA and hly, which was included in both analyses (as this region is located 5’ of both of these genes, which are adjacent to each other and divergently transcribed). One site downstream of hly was also identified as being under positive selection (Table 3), but this site was only identified in the analysis with hly, but not in the analysis with mpl (where this site would be included in the non-coding region upstream of mpl). Our analyses thus provide strong evidence for positive selection specifically in the intergenic region between plcA and hly; this region harbors the promoter sequences for both hly and plcA, which share a PrfA box and are transcribed in opposite directions. None of the sites under positive selection falls within the PrfA box or a −10 promoter region. However, the nucleotide variation at the positively selected site 92 (Table 3, Fig. 3
In addition to the positively selected sites in the hly-plcA intergenic region, we also identified a number of polymorphic sites (which were not specifically identified as being under positive selection), which may affect promoter elements. Specifically, we identified three polymorphic sites within the −35 region (GGGACA) of the putatively PrfA-independent hly P1 promoter (Mengaud et al., 1989) (Fig. 3 3.5. Recombination analysis Initial GENECONV analyses identified 91 unique fragments, which could be grouped into 26 independent events (i.e., events with distinct 5’ and 3’ breakpoints). actA showed the most recombination events, while prfA showed the least recombination events (Supplemental Table 3). We subsequently used ClonalFrame to further assess recombination events and to identify the recipient strains in the recombination events. ClonalFrame detected no recombination events where lineage I isolates served as recipients, while it detected 25 events where lineage II isolates served as recipients in a recombination event (Fig. 4
4. Discussion We determined and analyzed the DNA sequences for six virulence genes and intergenic regions in the prfA cluster in 40 L. monocytogenes isolates representing diverse source populations. Initial phylogenetic analyses confirmed that the 40 sequences group into two main clusters representing the two common L. monocytogenes lineages that have first been described by Piffaretti et al. (1989) and subsequently been confirmed by phylogenetic analyses based on various virulence and housekeeping genes (Nightingale et al., 2005b; Orsi et al., 2007; Tsai et al., 2006) as well as a previous phylogenetic study of the prfA virulence gene cluster (Ward et al., 2004); these two main clusters have also been defined by all other subtyping studies that included cluster analyses, including PFGE and ribotyping studies (as reviewed by Wiedmann, 2002). Subsequent positive selection and recombination analyses showed that (i) virulence genes in the main L. monocytogenes virulence gene cluster includes highly conserved genes (i.e., hly, prfA) as well as diverse genes that appear to have evolved by positive selection (mpl, actA, plcB), (ii) recombination has played an important role in the evolution of the virulence gene cluster, but is limited to lineage II isolates, and (iii) the promoter region driving the transcription of virulence genes transcribed early in intracellular infection (i.e., hly, plcA) appears to have evolved by positive selection and recombination. 4.1. Virulence genes in the main L. monocytogenes virulence gene cluster include highly conserved genes (i.e., hly, prfA) as well as diverse genes that appear to have evolved by positive selection (mpl, actA, plcB) The genetic diversity for the six genes in the virulence cluster ranged from 0.0143 (prfA) to 0.0383 (actA) with an average diversity for the whole virulence gene cluster of 0.0246. Using the same 40 isolates, similar diversity was previously observed for another virulence gene, inlA (0.02134) (Orsi et al., 2007), while other internalin genes found in most or all of these isolates (i.e., inlB, inlC, inlC2, inlD, inlE) (Tsai et al., 2006) showed a wider range of diversity (0.0167 to 0.0702) with some highly diverse genes (e.g., inlE; π = 0.0702). Nucleotide diversity within actA was not limited to single nucleotide polymorphisms, but also included a previously reported (Wiedmann et al., 1997; Moriishi et al., 1998; Jacquet et al., 2002) deletion of a fragment encoding a 35 aa Proline-Rich Repeats (PRR) in a number of lineage I isolates. This deletion yields an ActA protein with three rather than four PRRs; while Smith et al. (1996) previously reported a linear relationship between the number of PRRs and the rate of bacterial movement in the host cell cytoplasm, the effects of PRR variation on virulence (Smith et al., 1996) and in natural strains remain unclear (Roberts and Wiedmann, 2006). The three most diverse genes in the virulence gene island (i.e., plcA, actA and mpl) also showed evidence for positive selection in their coding regions, although with different lineage-specific patterns of positive selection. While positive selection in plcA and mpl were restricted to lineage II and lineage I isolates, respectively, positive selection in actA was identified in the overall analysis, suggesting that this gene evolved by positive selection in different branches. Interestingly, ActA region C (aa 50–126), which is necessary for continuous actin filament elongation (Lasa et al., 1997), was very diverse with ten polymorphic sites, including two sites (aa 112 and 123) that were identified by M8 as being under positive selection and where three amino acid variants were observed in each site. Another site identified as being under positive selection (aa 291) falls within the first PRR domain. On the other hand, the site with highest probability of being under positive selection (aa 461) as well as other two sites (aa 449 and 522) fall in regions with no described function. Overall, these findings suggest that actA may be adapting to certain functions in addition to cell-to-cell spread [e.g., a role in host cell invasion (Suarez et al., 2001)] that require specific aa sequences in regions that have not yet been assigned specific biological functions. This hypothesis is consistent with experimental data, which showed that an isogenic mutation in a positively selected site in ActA did not affect cell-to-cell spread phenotype (Roberts and Wiedmann, 2006). Although mpl showed evidence for positive selection and considerable nucleotide and aa diversity, the Mpl aa 84–92 region, which has been identified as a CTL epitope in mice (Busch et al., 1997) was conserved in all isolates analyzed, possibly because this region may represent the active domain of Mpl. Similarly, the major murine CTL epitope region in LLO (aa 91 and 99) (Pamer et al., 1991; Vijh and Pamer, 1997) was conserved among all 40 isolates, possibly because conservation of this region appears to be important for virulence (Lety et al., 2006). These findings illustrate conservation of certain sequence features, including some that may be expected to be under selection for diversification. Overall, the analyses reported provide important new data on specific aa sites under positive selection in mpl, actA, and plcA; identification of these sites provides an opportunity for mutational analyses of specific polymorphic sites in order to probe the contributions of diversification in these virulence genes to niche adaptation in L. monocytogenes. 4.2. Recombination has played an important role in the evolution of the virulence gene cluster, but is limited to lineage II isolates Interestingly, our data suggest that recombination events within the virulence gene cluster commonly involve lineage II strains as recipients, but rarely involve lineage I isolates as recipients (recombination in lineage III isolates was not characterized as only a single lineage III isolate was included in our isolate set). These findings are in agreement with previous studies that have shown considerable recombination in other virulence genes, such as inlA, inlC, inlC2,inlD, inlE, inlF and inlG among lineage II isolates (Orsi et al., 2007; Tsai et al., 2006), with recombination generally more frequently in lineage II than lineage I isolates. Similar observations (i.e., more common recombination among lineage II strains than among lineage I strains) have also been reported for L. monocytogenes housekeeping genes (Meinersmann et al., 2004). Recombination thus appears to be critical for the diversification of lineage II isolates. While lineage II has generally been reported as showing considerably higher nucleotide diversity than lineage I isolates, our data suggest that this higher diversity in lineage II strains may be largely due to recombination. Specifically, this hypothesis is supported by the observation that the two virulence genes with no evidence for recombination (i.e., prfA and mpl) show higher nucleotide diversity among lineage I strains than among lineage II strains. Similarly, sigB, the only gene which showed no evidence for recombination among 120 L. monocytogenes isolates in previous study, showed identical nucleotide diversity among lineage I and II isolates (Nightingale et al., 2005b). Additional evolutionary analyses in L. monocytogenes genes with no evidence for recombination will be necessary to further characterize the evolutionary history of lineages I and II and to test the current hypothesis that only lineage I has experienced a recent bottleneck (Meinersmann et al., 2004). Our data reported here specifically suggest as a possible alternative hypothesis that lineage II simply diversified more rapidly, e.g., through more frequent recombination than lineage I. Possible reasons for more common recombination in lineage II may involve molecular (e.g. different restriction systems in each lineage), epidemiological (e.g. abundance of different isolates in environments where horizontal gene transfer might be facilitated) or evolutionary (e.g. different selective pressures to maintain recombinant fragments in the population) factors. 4.3. The promoter region driving the transcription of virulence genes transcribed early in intracellular infection (i.e., hly, plcA) appears to have evolved by positive selection and recombination While analyses for positive selection in coding regions are increasingly recognized as a powerful tool for identifying functionally important genes or residues within a gene, few studies have used emerging evolutionary analyses tools (Wong and Nielsen, 2004) to identify intergenic regions under positive selection, including the specific nucleotide sites that may be under positive selection. Interestingly, the positive selection analyses on non-coding regions in the L. monocytogenes virulence gene cluster provided initial evidence that the intergenic region between hly and plcA includes nucleotide sites that are under positive selection; Clonalframe also provided evidence for recombination events in this region. While the PrfA binding site in this region (as well as the other PrfA binding sites upstream of prfA, mpl, and actA) was perfectly conserved, a nucleotide change in at least one of the positively selected sites in hly and plcA intergenic region (nucleotide 92, Figure 3 Acknowledgements This work was partially supported by USDA Special Research Grant 2005-34459-15625 (to MW) and National Institutes of Health (NIH) Grant No. R01GM63259 (to MW). Footnotes Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Clin Microbiol Rev. 2001 Jul; 14(3):584-640.
[Clin Microbiol Rev. 2001]J Food Prot. 2004 Nov; 67(11):2500-14.
[J Food Prot. 2004]J Food Prot. 2006 Jan; 69(1):93-105.
[J Food Prot. 2006]J Bacteriol. 2004 Aug; 186(15):4994-5002.
[J Bacteriol. 2004]Infect Immun. 1987 Nov; 55(11):2822-9.
[Infect Immun. 1987]Appl Environ Microbiol. 2004 Aug; 70(8):4458-67.
[Appl Environ Microbiol. 2004]J Food Prot. 2006 Jan; 69(1):93-105.
[J Food Prot. 2006]Appl Environ Microbiol. 2004 Oct; 70(10):5833-41.
[Appl Environ Microbiol. 2004]Int J Food Microbiol. 2004 Apr 1; 92(1):15-33.
[Int J Food Microbiol. 2004]Appl Environ Microbiol. 2001 Feb; 67(2):646-53.
[Appl Environ Microbiol. 2001]J Bacteriol. 2004 Aug; 186(15):4994-5002.
[J Bacteriol. 2004]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]Infect Genet Evol. 2006 Sep; 6(5):378-89.
[Infect Genet Evol. 2006]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]J Food Prot. 2006 Jan; 69(1):93-105.
[J Food Prot. 2006]J Clin Microbiol. 2001 Jul; 39(7):2704-7.
[J Clin Microbiol. 2001]Bioinformatics. 1999 Feb; 15(2):174-5.
[Bioinformatics. 1999]Bioinformatics. 1998; 14(9):817-8.
[Bioinformatics. 1998]J Bacteriol. 2004 Aug; 186(15):4994-5002.
[J Bacteriol. 2004]Genome Biol. 2007; 8(5):R71.
[Genome Biol. 2007]Genetics. 2000 May; 155(1):431-49.
[Genetics. 2000]Mol Biol Evol. 2005 Dec; 22(12):2472-9.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Dec; 22(12):2472-9.
[Mol Biol Evol. 2005]Genetics. 2004 Jun; 167(2):949-58.
[Genetics. 2004]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]Genetics. 2007 Mar; 175(3):1251-66.
[Genetics. 2007]J Cell Biol. 1996 Nov; 135(3):647-60.
[J Cell Biol. 1996]Infect Immun. 1997 Jul; 65(7):2707-16.
[Infect Immun. 1997]Microbiol Immunol. 1998; 42(2):129-32.
[Microbiol Immunol. 1998]Appl Environ Microbiol. 2002 Feb; 68(2):616-22.
[Appl Environ Microbiol. 2002]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]J Food Prot. 2006 Jan; 69(1):93-105.
[J Food Prot. 2006]Infect Genet Evol. 2006 Sep; 6(5):378-89.
[Infect Genet Evol. 2006]J Clin Microbiol. 2001 Jul; 39(7):2704-7.
[J Clin Microbiol. 2001]J Clin Microbiol. 2003 Feb; 41(2):632-9.
[J Clin Microbiol. 2003]FEMS Microbiol Lett. 2006 Jan; 254(2):300-7.
[FEMS Microbiol Lett. 2006]Nucleic Acids Res. 2004; 32(8):2386-95.
[Nucleic Acids Res. 2004]Appl Environ Microbiol. 2008 Jan; 74(1):158-71.
[Appl Environ Microbiol. 2008]J Bacteriol. 2003 Oct; 185(19):5722-34.
[J Bacteriol. 2003]Infect Immun. 1989 Dec; 57(12):3695-701.
[Infect Immun. 1989]Proc Natl Acad Sci U S A. 1989 May; 86(10):3818-22.
[Proc Natl Acad Sci U S A. 1989]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]Infect Genet Evol. 2006 Sep; 6(5):378-89.
[Infect Genet Evol. 2006]J Bacteriol. 2004 Aug; 186(15):4994-5002.
[J Bacteriol. 2004]J AOAC Int. 2002 Mar-Apr; 85(2):524-31.
[J AOAC Int. 2002]Infect Genet Evol. 2006 Sep; 6(5):378-89.
[Infect Genet Evol. 2006]Infect Immun. 1997 Jul; 65(7):2707-16.
[Infect Immun. 1997]Microbiol Immunol. 1998; 42(2):129-32.
[Microbiol Immunol. 1998]Appl Environ Microbiol. 2002 Feb; 68(2):616-22.
[Appl Environ Microbiol. 2002]J Cell Biol. 1996 Nov; 135(3):647-60.
[J Cell Biol. 1996]EMBO J. 1997 Apr 1; 16(7):1531-40.
[EMBO J. 1997]Cell Microbiol. 2001 Dec; 3(12):853-64.
[Cell Microbiol. 2001]FEMS Microbiol Lett. 2006 Jan; 254(2):300-7.
[FEMS Microbiol Lett. 2006]Infect Immun. 1997 Dec; 65(12):5326-9.
[Infect Immun. 1997]Nature. 1991 Oct 31; 353(6347):852-5.
[Nature. 1991]J Immunol. 1997 Apr 1; 158(7):3366-71.
[J Immunol. 1997]Infect Genet Evol. 2006 Sep; 6(5):378-89.
[Infect Genet Evol. 2006]Appl Environ Microbiol. 2004 Apr; 70(4):2193-203.
[Appl Environ Microbiol. 2004]J Bacteriol. 2005 Aug; 187(16):5537-51.
[J Bacteriol. 2005]Genetics. 2004 Jun; 167(2):949-58.
[Genetics. 2004]Appl Environ Microbiol. 2006 Aug; 72(8):5197-203.
[Appl Environ Microbiol. 2006]Appl Environ Microbiol. 2007 Oct; 73(19):6078-88.
[Appl Environ Microbiol. 2007]