![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2006 Delmotte et al; licensee BioMed Central Ltd. Tempo and mode of early gene loss in endosymbiotic bacteria from insects 1UMR Santé Végétale (INRA-ENITAB), INRA BP81, 33883 Villenave d'Ornon Cedex, France 2UMR Biologie des Organismes et des Populations appliquée à la Protection des Plantes [BIO3P], INRA BP 35327, 35653 Le Rheu Cedex, France 3Max Planck Institute for Molecular Genetics, Ihnestrasse 63–73, 14196 Berlin, Germany 4Instituto Cavanilles de Biodiversidad y Biologia Evolutiva, Universidad de Valencia, A.C. 22085, 46071 Valencia, Spain Corresponding author.F Delmotte: francois.delmotte/at/bordeaux.inra.fr; C Rispe: claude.rispe/at/rennes.inra.fr; J Schaber: schaber/at/molgen.mpg.de; FJ Silva: francisco.silva/at/uv.es; A Moya: andres.moya/at/uv.es Received April 11, 2006; Accepted July 18, 2006. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background Understanding evolutionary processes that drive genome reduction requires determining the tempo (rate) and the mode (size and types of deletions) of gene losses. In this study, we analysed five endosymbiotic genome sequences of the gamma-proteobacteria (three different Buchnera aphidicola strains, Wigglesworthia glossinidia, Blochmannia floridanus) to test if gene loss could be driven by the selective importance of genes. We used a parsimony method to reconstruct a minimal ancestral genome of insect endosymbionts and quantified gene loss along the branches of the phylogenetic tree. To evaluate the selective or functional importance of genes, we used a parameter that measures the level of adaptive codon bias in E. coli (i.e. codon adaptive index, or CAI), and also estimates of evolutionary rates (Ka) between pairs of orthologs either in free-living bacteria or in pairs of symbionts. Results Our results demonstrate that genes lost in the early stages of symbiosis were on average less selectively constrained than genes conserved in any of the extant symbiotic strains studied. These results also extend to more recent events of gene losses (i.e. among Buchnera strains) that still tend to concentrate on genes with low adaptive bias in E. coli and high evolutionary rates both in free-living and in symbiotic lineages. In addition, we analyzed the physical organization of gene losses for early steps of symbiosis acquisition under the hypothesis of a common origin of different symbioses. In contrast with previous findings we show that gene losses mostly occurred through loss of rather small blocks and mostly in syntenic regions between at least one of the symbionts and present-day E. coli. Conclusion At both ancient and recent stages of symbiosis evolution, gene loss was at least partially influenced by selection, highly conserved genes being retained more readily than lowly conserved genes: although losses might result from drift due to the bottlenecking of endosymbiontic populations, we demonstrated that purifying selection also acted by retaining genes of greater selective importance. Background The smallest genomes belong to bacterial obligate pathogens or intracellular symbionts of eukaryotes (e.g., Mollicutes, Rickettsiae, Spirochetes, Chlamydiae, and insect-associated gamma-proteobacteria). Phylogenetic analyses indicate that their small genome size is a derived state accompanying the transition to their specialized intracellular lifestyle. In other words, all these microbes certainly evolved from ancestors with larger genomes. One of the smallest genome described to date belongs to B. aphidicola, gamma-proteobacteria that maintain mutualistic endosymbiotic associations with aphids [1]. Gamma-proteobacteria include other obligate partners of insects and command special attention for several reasons: first, genome reduction in this group has been extreme, yielding a ten fold range of genome sizes; second, free-living relatives of these symbionts include the model organism Escherichia coli (E. coli) for which extensive genetic information is available; third, gene order is extremely well conserved between symbionts of the same genus [2,3] and rather well conserved over genome fragments between symbionts and free-living relatives [4]. This allows the reconstruction of gene loss events at different evolutionary steps of symbiosis. The first genome sequence of B. aphidicola BAp [5] initiated comparative studies shedding light on the process of genome reduction in this endosymbiont [6,7] but the recent sequencing of two additional B. aphidicola strains [2,8] and of three endosymbionts closely related to Buchnera, i.e. Blochmannia floridanus and Blochmannia pennsylvanicus (the endosymbionts of carpenter ants, respectively Camponotus floridanus and Camponotus pennsylvanicus; [3,9]), and Wigglesworthia glossinidia (the endosymbiont of the tsétsé fly, Glossina brevipalpis; [10]) considerably increase the scope for comparative analysis to reveal evolutionary patterns of gene loss. Moreover, it provides the opportunity for assessing the generality of the process of genome size reduction in three kinds of symbiotic lineages that have different gene content shaped by specific nutritional needs of their insect hosts. The identification of long term evolutionary forces that drive genome shrinkage in endosymbionts is much debated. The trend toward large scale gene loss could reflect the inefficiency of natural selection at maintaining genes in the genomes of these cytoplasmically inherited bacteria. Indeed, the vertical partitioning of symbiotic lineages among hosts and their reduced population sizes strongly favour drift, a mechanism proposed to be responsible for the accumulation of mildly deleterious mutations in symbiotic genomes [6,11]. The hypothesis of an increased fixation of deleterious mutations is supported by several lines of evidence: the general acceleration of evolutionary rates and massive AT enrichment [[12,13]; but [14] for a selectionist interpretation], the increased proportion of non-synonymous mutations [12], the loss of adaptive codon bias [15] and the very low level of intraspecific polymorphism observed in bacterial populations [16,17]. Because the host environment provides metabolites, many bacterial loci would become redundant thereby accumulating slightly deleterious mutations by a process known as Muller's ratchet, which eventually could lead to the functional inactivation of non essential genes. In fact, the DNA of a pseudogene may be completely deleted from the B. aphidicola genome in 40 to 60 My [18]. Even some apparently beneficial genes (DNA repair, transcriptional regulation and replication initiation mechanisms) have been lost confirming that drift may play an important role in genome reduction [5,9,19]. If the ultimate forces leading to genome shrinkage in endosymbionts are still much debated, the proximal mechanisms responsible for DNA removal seem to be better understood [20,21]. In particular, it has been suggested that DNA removal occurs because of mutational bias favouring deletions over insertions in bacterial genomes [22]. This process which is apparently universal in prokaryotes could account for the scarcity of non coding DNA in most bacterial lineages. The loss of genes involved in DNA repair which are otherwise broadly conserved in bacteria might explain why deletion biases are not "corrected" and eventually lead to gene degradation and loss. Strikingly, such losses are a common characteristic of Buchnera, Wigglesworthia and Blochmannia, but also of Sitophilus orizae primary endosymbiont (a younger symbiont with a larger genome, where two of such genes are inactivated [23]) and even of free-living marine bacteria recently engaged in genome reduction [24]. Chromosomal rearrangement which can lead to large deletion events as also contributes to DNA removal. Finally, obligate bacterial mutualists have lost the ability to incorporate foreign DNA which grants them a "one-way ticket to genome shrinkage". Understanding the evolutionary processes that drive genome reduction requires determining the tempo (rate) and the mode (size and types of deletions) of gene losses. Detailed information is available on recent events of gene loss from comparing closely related symbionts: for example extreme genome stasis (conservation of gene order and gene repertoires) has been shown for three B. aphidicola genomes [2,8] with an estimated tempo of gene loss in the range of 2–3 Myr per gene. Similar patterns emerge for Blochmannia [3], where a comparison between two species revealed a limited number of gene losses scattered along the genome with different rates for the two species (ca 0.6 Myr per gene for B. floridanus and ca 6 Myr per gene for B. penssylvanicus). These findings of gradual gene loss with some degree of specific variation is compatible with a phenomenon of drift, as seen above, mitigated by selective processes, since gene loss affects differential gene functions for different symbionts (e.g. B. aphidicola strains tend to conserve many genes involved in amino acid synthesis, while Blochmannia species tend to conserve a disproportionate number of genes involved in synthesis of cofactors). In contrast, less is known about the rhythm and types of gene loss that occurred in the early steps of the acquisition of symbiosis. This is particularly critical, because gene losses that occurred in the ancestor of all B. aphidicola strains or Blochmannia species respectively were massive (estimated to >1000 genes lost [25]). Two recent comparative studies which reconstructed the hypothetical ancestral genome of B. aphidicola BAp and its free-living relatives have reached diverging conclusions on the organisation of gene losses. Moran and Mira [6] found that early gene loss involved deletions of large sets of contiguous genes including loci with unrelated functions, possibly due to recombination at repeated sequences [2,26]. Such massive and non-oriented process would be the sign of strong genetic drift and weakened selection in the early stages of endosymbiosis [6,11,27,28]. In contrast, Silva et al. [25] who focused on blocks of conserved synteny, found that genome shrinkage arose through multiple events of gene disintegration dispersed over the whole genome, which would be better explained by selection acting on individual genes even in these early steps. Even if larger deletions could not be ruled out, these authors insisted on the importance of selection for explaining the genome shrinkage observed in endosymbionts. Interestingly, recent genomic and experimental data support both scenarios: on the one hand, the presence of hundreds of pseudogenes scattered around the genomes of some pathogens recently sequenced [29-32] support the scenario whereby genome reduction occurs by slow erosion of individual genes. On the other hand, Nilsson et al. [33] have nicely showed that, even on a short evolutionary time scale, the disappearance of large stretches of DNA can be frequent in bacteria establishing in a constant environment. This experimental result suggests that large-scale deletions may occurred during the initial stages of genome reduction [33]. Characterizing the final set of genes of reduced genomes should help resolve this debate. Genome shrinkage is largely a lineage-specific process [34]. Indeed, because evolution is a highly contingent process, it is difficult to predict the fate of each single lineage. Moreover, successive losses are not independent events because the loss of a gene from a genome may influence the types of losses tolerated in the future. Finally, identical functions can be achieved by non-orthologous genes in different lineages. For example, the comparison of genomes of closely related insect endosymbionts shows that they share only 50% of their protein-coding genes. Although important functions such as cell division processes, information storage and processing show a high conservation of their gene repertoires, remarkable differences exist for those genes that encode proteins involved in the cell envelope, flagellum biosynthesis and the metabolism of amino acids, nucleotides, or coenzymes [9,34]. However, this does not necessarily mean that gene loss is a completely random process. In eukaryotes, Krylov et al. [35] recently investigated the relation between the propensity of a gene to be lost and its functional importance. Interestingly, these authors found significant relationships between the "propensity for gene loss" (PGL) and sequence substitution rate, gene dispensability, the number of protein interactions and the expression level of the gene. Thus, at least in eukaryotes, the likelihood of being lost seems to be an inverse relation of the biological importance of a gene. To date, this hypothesis has not been tested in prokaryotes, possibly because it is difficult to disentangle the respective contribution of gene loss and horizontal gene transfers in the evolution of bacterial genome size [36]. However, we propose that the well characterized group of insect endosymbionts provides a good model system for investigating the relative importance of drift and purifying selection in the process of genome shrinkage. In this study, we used five available endosymbiotic genome sequences of the gamma-proteobacteria – B. aphidicola BAp, B. aphidicola BSg, B. aphidicola BBp, Wigglesworthia glossinidia (Wgl) and Blochmannia floridanus (Bfl) – to test whether gene loss was related to the selective importance of genes. We used a parsimony method to reconstruct a minimal ancestral genome of insect endosymbionts and quantified gene loss along the branches of the phylogenetic tree. This allowed us to assess different loss events at different evolutionary scales and to estimate the likelihood that a gene be lost during the evolution of symbiosis. To evaluate the selective or functional importance of genes, we used a parameter that measures the level of adaptive codon bias in E. coli (i.e. codon adaptive index, or CAI), and also estimates of evolutionary rates (Ka) between either pairs of orthologs in free-living bacteria or in pairs of symbionts. These parameters are expected to be correlated with the selective pressure on the genes. We also studied the distribution of deletion sizes (in number of loci) in order to determine whether they occurred by small steps or through large deletions of multiple loci at different positions on the phylogenetic tree rooted at the last common ancestor of free living bacteria and modern endosymbionts. Results and discussion Single or multiple origin(s) of endosymbiosis? To place each gene loss event in an evolutionary context, we needed a reliable species tree onto which gene losses could be mapped. Our most parsimonious phylogenetic reconstruction yielded a monophyletic group containing the five symbiont lineages (Figure (Figure1).1
It is worth noting that the same result was recently found in at least three other studies, applying different methods on a different set of orthologous protein-coding genes [9,37,38]. In particular, Lerat et al. [38] insisted on the complete lack of conflict between 205 orthologous genes they chose resulting in a fully resolved phylogeny where B. aphidicola BAp and Wiggleworthia grouped together. In this context, we believe that the monophyletic topology of these five endosymbionts is a good "working hypothesis" although our phylogenomic approach precludes classical bootstrapping testing of the robustness of the tree. However, Herbeck et al. [39] using a similar approach with more taxa but only two genes, the conserved groEL and 16S rRNA, proposed a different scenario. They found various phylogenies that group Blochmannia and Wigglesworthia but separately from Buchnera, which leaded them to conclude that Blochmannia and Wigglesworthia represent an origin of primary endosymbiosis that is independent from that of Buchnera. Another recent study also cast in doubt the monophyletic character of the three endosymbiotic lineages [4]. These authors found a strong discordance between a phylogeny based on concatenated conserved amino acid sequences and reconstructions based on gene order. The former supported a single origin, while the latter placed Blochmannia as closer relative to E. coli than Yersinia and any other symbiotic lineage. This comes from the fact that synteny is high between Blochmannia and E. coli [3]. This result could still be compatible with a single origin for endosymbionts, provided that different levels of gene rearrangements occur among different lineages. Although we cannot definitively conclude on the single/multiple origin of AT-rich endosymbionts, we have retained the monophyly of endosymbionts as a working hypothesis since it has been reached several times by different phylogenomic approaches, including our. Gene loss and CAI of E. coli At all evolutionary scales low CAI genes were more readily lost than high CAI genes (Figure (Figure2).2
The proportion of genes lost in each CAI class within the symbiont clade did not exceed 60% (Figure (Figure2c),2c
Gene loss and substitution rates The pairwise estimates of non-synonymous substitution rates between two free-living species pairs (Eco-Stm, Eco-Ype) were averaged for three different categories of genes: (A) Genes lost during the transition to symbiosis, i.e. between LCA1 and LCA2; (B) Genes present in the common ancestor of all symbionts but lost in some symbiotic lineages; (C) Genes retained in all endosymbionts (Figures (Figures3,3
We also examined the non-synonymous substitution rates estimated for four endosymbiotic species pairs (BAp-BSg, BAp-BBp, BSg-BBp, Bfl-Wgl): for all comparisons tested, we found a similar evolutionary signal within the endosymbiont clade. Genes lost at least once within endosymbionts (B) were significantly less conserved than genes never lost (C), suggesting once again that those genes that were lost were of less essential function (Figure (Figure55
In consequence, we found significant positive correlations between PGL and the non-synonymous substitution rate for the five pairwise comparisons conducted (Table 1). Genes lost over the course of symbiosis evolution were faster evolving than average (in symbiont and free living bacteria). Interestingly, CAI was negatively correlated with non-synonymous substitution rates both in symbiotic bacteria (four different comparisons tested) and free-living bacteria (two comparisons tested, Table 2). It is noteworthy that greater conservation of high expression genes at non synonymous site has been recently described in endosymbionts [40]. These results and ours confirms, also for prokaryotes, previous findings that highly expressed genes appear to evolve slowly [35,41].
Why some genes are retained and some are lost? We have tried to determine the extent to which gene loss is predictable, by evaluating the correlations between some evolutionary parameters and the propensity of a gene to be lost. We showed that losses are concentrated on the genes that are evolutionary less constrained and of probably less selective importance (genes characterized by low adaptive codon bias in E. coli and high evolutionary rates in free-living and symbiotic species). The bias between propensities of loss for different categories of genes (genes of different CAI in E. coli and of different non-synonymous substitution rates) is particularly strong at the initial steps of the acquisition of symbiosis. We indeed demonstrated that at all stages of symbiosis, genome reduction was specifically targeted to genes of lesser selective importance, or less essential for the survival of the host/symbiont couple. It also is significant that symbionts retain some informative signal about the propensity of gene loss across subsequent diversification in their hosts: indeed, gene losses in later stages concern primarily sequences characterized by relatively high evolutionary rates in other symbiotic lineages. Late gene losses also primarily affect genes characterized by low CAI in E. coli. All this suggests that gene loss, even though governed by drift, was also partially influenced by selection both at ancient and recent stages of symbiosis evolution. There is no a priori reason why non-synonymous substitution rate should be correlated with the propensity of a gene to be lost. Indeed, these two variables are measures of evolutionary conservation that capture substantially different aspects of evolution and one can imagine that functional constraints could be relaxed on a protein achieving an important function in the cell. Here, we have showed that PGL, sequence evolution rate and CAI are interdependent as already found in eukaryotes [35]. Thus, we believe that the large prokaryotic genomes already available combined with data on gene dispensability, expression, and protein interactivity, open the possibility to new comparative studies testing the prevailing forces driving genome reduction. A majority of deletions involving few genes in syntenic fragments We reconstructed loss events between the last common ancestor of free living and symbiotic bacteria (LCA1) and the last common ancestor of B. aphidicola strains (LCA3) or between LCA1 and the supposed common ancestor of symbiotic bacteria (LCA2). Figure Figure66
We also compared the size distribution (in number of loci) of losses for the LCA1–LCA3 and LCA1–LCA2 steps for both syntenic and non-syntenic losses (Figure (Figure7).7
To test if deletions were random or clustered, we analysed the distribution of syntenic losses between LCA1 and LCA2 (Figure (Figure8).8
Validity of the results under alternative phylogenetic scenarios We stress that even under the hypothesis of independent origins of endosymbionts, our conclusions on the links between gene dispensability and selective parameters (CAI, Ka) would be little affected. Indeed, a strong effect of CAI and Ka on the propensity of gene loss was observed even without considering the existence of LCA2 because it was observed for the losses occurring between LCA1 and each of the extant endosymbiotic lineages. In contrast the reconstruction of deletion sizes is more dependent on the validity of our common ancestry of the five symbionts studied. Considering the reconstructed ancestor of the three B. aphidicola strains that are clearly monophyletic affected only marginally earlier studies based on single genome [6]. Conclusion The particular originality of this work compared to that of precedent studies [6,7] lies in its integration of the information on the selective pressures on genes together with more genome data. The larger genome data set allowed us to detect and characterize more ancient events of gene loss by including the reconstructed common ancestors of symbiotic lineages. We have shown that genes lost in the early stages of symbiosis are on average less selectively constrained than genes conserved in any of the extant symbiotic strains studied. This is shown by significant differences among the two types of genes of two parameters that can measure selective importance: non-synonymous evolutionary rates (in symbionts or in free-living enterics) and codon bias in E. coli. In addition, our reconstruction of deeper nodes allowed also a better description of deletion events at the different steps, in particular of their size distribution. Under the hypothesis of a common origin of different symbioses, gene losses would have been mostly occurring through rather small blocks, and in syntenic regions between at least one of the symbionts and present-day E. coli. Our study did not include two genomes from insect-associated endosymbionts that have just been completed, Blochmannia pennsylvanicus [3] and B. aphidicola BCc (host Cinara cedri, available soon). Studying these genomes will help to better reconstruct recent gene loss events, which occurred after the divergence of Blochmannia and B. aphidcola strains respectively. However, given their small genomes that include very few genes not already present in other complete sequences, their inclusion could not significantly change our conclusions on early patterns of gene loss. The knowledge of more endosymbiotic genomes, particular of genomes of larger sizes (e.g. Sitophilus oryzae primary endosymbiont [23] will be of paramount interest for fully resolving several puzzles that remain to date. It will indeed provide a more robust phylogenetic scenario of symbiosis acquisition (in single or multiple events) and a finer knowledge on the rate and patterns of gene losses, which will allow disentangling mutational and selective pressures that modulate genome reduction. Methods Genomic sequences Complete genomes were retrieved from EMBL database: Escherichia coli K12 (NC_000913), Salmonella typhimurium LT2 (NC_003197), Vibrio cholerae O1 biovar eltor str. N16961 (NC_002505, NC_002506), Yersinia pestis (NC_003143), Buchnera aphidicola of Baizongia pistaciae (NC_004545), Buchnera aphidicola of Schizaphis graminum (NC_004061), Buchnera aphidicola of Acyrthosiphon pisum (NC_002528), Wigglesworthia brevipalpis (NC_004344), Candidatus Blochmannia floridanus (NC_005061). The symbiotic strains will be referred as B. aphidicola BAp, B. aphidicola BSg, B. aphidicola BBp, Wigglesworthia (Wgl) and Blochmannia (Bfl). Phylogenetic tree Phylogenetic reconstruction of trees including endosymbiotic DNA sequences which are strongly AT-biased and evolve at relatively high rates is problematic when using classical models of DNA sequence evolution. To avoid this pitfall, we used the tree building method implemented in NHML3 which is based on a heterogeneous model accounting for unequal transition/transversion rates, unequal evolutionary rates among sequence sites and unequal base compositions of sequences [43]. Maximum likelihood inference based on this model was applied to a trimmed alignment of 61 concatenated conserved protein-coding genes (19143 nucleotides) involved in translation. Trimming was done with GBLOCKS [44] in order to limit the data set to unambiguous well conserved parts of the alignments. Only the first two positions of codons, which are relatively less AT enriched [15] were retained for phylogenetic analysis. This yielded a most likely phylogenetic tree that grouped together the five endosymbiotic lineages (Figure (Figure1)1 Reconstruction of endosymbiotic ancestors Because it is difficult to assign orthology for structural RNA, we excluded from the analysis all non coding genes. To determine the set of orthologous coding genes between E. coli K12 and each of the nine bacteria, we performed reciprocal blasts with a cut off value of 10-4, retaining only those genes that were best hits in both comparisons. Applying a parsimony principle, the common ancestor of all endosymbionts was reconstructed as the sum of the orthologous coding genes present in at least one endosymbiotic lineage, E. coli and V. cholera. This led to removal of the few genes that were present in endosymbionts but have no orthologous equivalent in either E. coli or V. cholera. Pseudogenes of endosymbiotic genomes were included in the analysis and considered as lost genes. Finally, this approach allowed the determination of a minimal free-living ancestor of endosymbionts and of their free-living relatives (LCA1) comprising 1983 conserved coding genes that have a low probability of being acquired by lateral gene transfer (LGT). Indeed, LGT generally involves uptake, from distantly related bacteria or phages [45], of genes that are absent from related bacteria. Since our method for reconstructing LCA1 was very conservative, we can speculate that the real last free-living bacteria that gave rise to endosymbionts contained more genes. Intermediary common ancestor to symbiotic lineages were defined as subsets of the initial 1983 CDS of LCA1: LCA2 corresponded to the sum of genes present in all endosymbiotic lineages, generating our common ancestry to all these symbionts, LCA3 to the sum of the genes present in all B. aphidicola lineages, and LCA4 to the sum of genes present in Blochmannia and Wigglesworthia lineages (Figure (Figure11 Quantifying gene loss Gene loss was quantified using different approaches including both quantitative and qualitative variables. A quantitative marker of gene loss: for each gene present in LCA1, a parameter called the propensity of a given gene to be lost (PGL) was calculated [35]. This measure required the identification of gene loss along the branches of the phylogeny which was based on the reconstruction of last common ancestors of endosymbionts described above. The propensity of a gene to be lost was calculated as the ratio of the sum of the lengths of branches lacking a given gene to the sum of the lengths of all branches of the tree. This generated a quantitative parameter ranging from 0 (never lost) to 1 (lost from all the branches) that could be used to perform correlations with CAI and substitution rates (Tables 1 and 2). Discrete categories of genes (Figure (Figure3):3 CAI and substitution rate estimates Our main objective was to examine correlations between patterns of gene loss (at different depths of the tree) and the functional importance of genes, in order to measure if losses, particularly in the initial stages of symbiosis could have been limited by rather precise constraints [7]. To evaluate the level of functional importance of genes, we used two different parameters i) the level of adaptive codon bias (CAI) and ii) the rate of sequence evolution. The former parameter is correlated with the level of gene expression in E. coli [46], and more essential genes probably have higher levels of expression [47]. Unfortunately CAI data are not available for genes of endosymbionts which show hardly any trace of adaptive bias [12,15]. We therefore used the CAI of E. coli orthologs, calculated through the CODONW package [48]. In addition, we estimated synonymous (Ks) and non-synonymous (Ka) substitutions rates by performing pairwise comparisons of coding sequences. Estimates were calculated using Li's method [49] implemented in the diverge function from the GCG 10.2 package. Since Ks were likely saturated for many of the pairwise comparisons, we restricted our analysis to non-synonymous substitution rates (Ka). Pairwise estimates of non-synonymous substitution rates were conducted for two free-living species pairs (Eco-Stm, Eco-Ype) and four endosymbiotic species pairs (BAp-BSg, BAp-BBp, BSg-BBp, Bfl-Wgl; see Schaber et al. [40] for detailed results on substitution rates). Finally, we looked at the relation between gene loss at different depths of the tree and the degree of functional constraint of genes (estimated by CAI or evolutionary rates) (Figures (Figures3,3 Reconstruction of gene deletions Following Moran and Mira [6], we assumed that the LCA1 possessed the E. coli gene order. This is likely true for a majority of genes, given evidence discussed by these authors that most gene rearrangements occurred in the symbiont, and is also supported by our estimations that about 75% of the genes from chromosome I in V. cholerae are in syntenic fragments with E. coli. Synteny was defined between fragments of the LCA1 and individual symbiotic species if consecutive genes in the LCA1 corresponded to consecutive genes in a given symbiont. To reconstruct synteny between LCA1 and the ancestor of B. aphidicola strains (LCA3), we used the fact that gene order is conserved in that group and simply "filled the gaps" of B. aphidicola BAp with genes absent from this species but present in either of the two other B. aphidicola strains. It was not possible to establish the gene order in LCA2 given the major rearrangements between Blochmannia, Wigglesworthia, and Buchnera (LCA3). However, we assumed that if synteny was established between a fragment in LCA1 and genes from any of these three symbionts, this fragment must have been syntenic with LCA1 in the common ancestor of these symbionts (LCA2). Authors' contributions FD, CR and JS conceived of the study and carried out the statistical analysis. FJS participated in the design of the study. FD drafted the manuscript. AM initiated the study and participated in its coordination. All authors read and approved the final manuscript. Acknowledgements FD and JS were respectively supported by the Marie Curie Host Fellowships of the European Union Nr. MCFI-1999-01047 and MCFI-1999-01055. This work was supported by the contract HPMD-CT-2000-00056 from European Union and the grants BMC2003-00305 from the Ministerio de Educación y Ciencia (Spain) and Grupos03/04 from Conselleria d'Empresa, Universitat i Ciència (Generalitat Valenciana, Spain) to AM. We are grateful to Jacqui Shykoff for her critical review of the manuscript and her help in revising the english. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Proc Natl Acad Sci U S A. 2002 Apr 2; 99(7):4454-8.
[Proc Natl Acad Sci U S A. 2002]Science. 2002 Jun 28; 296(5577):2376-9.
[Science. 2002]Genome Res. 2005 Aug; 15(8):1023-33.
[Genome Res. 2005]Mol Biol Evol. 2005 Jun; 22(6):1456-67.
[Mol Biol Evol. 2005]Nature. 2000 Sep 7; 407(6800):81-6.
[Nature. 2000]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Science. 2001 May 11; 292(5519):1096-9.
[Science. 2001]Proc Natl Acad Sci U S A. 1996 Apr 2; 93(7):2873-8.
[Proc Natl Acad Sci U S A. 1996]Mol Biol Evol. 1999 Nov; 16(11):1586-98.
[Mol Biol Evol. 1999]Trends Genet. 2002 Jun; 18(6):291-4.
[Trends Genet. 2002]Curr Opin Genet Dev. 1999 Dec; 9(6):664-71.
[Curr Opin Genet Dev. 1999]Genome Biol. 2001; 2(2):COMMENT2002.
[Genome Biol. 2001]Trends Genet. 2001 Oct; 17(10):589-96.
[Trends Genet. 2001]Mol Biol Evol. 2003 Aug; 20(8):1188-94.
[Mol Biol Evol. 2003]Genome Biol. 2005; 6(2):R14.
[Genome Biol. 2005]Science. 2002 Jun 28; 296(5577):2376-9.
[Science. 2002]Proc Natl Acad Sci U S A. 2003 Jan 21; 100(2):581-6.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2005 Aug; 15(8):1023-33.
[Genome Res. 2005]Trends Genet. 2003 Apr; 19(4):176-80.
[Trends Genet. 2003]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Science. 2002 Jun 28; 296(5577):2376-9.
[Science. 2002]Trends Microbiol. 1998 Jul; 6(7):263-8.
[Trends Microbiol. 1998]Science. 2001 May 11; 292(5519):1096-9.
[Science. 2001]Trends Microbiol. 2004 Jan; 12(1):37-43.
[Trends Microbiol. 2004]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9388-93.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2003 Oct; 13(10):2229-35.
[Genome Res. 2003]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9722-7.
[Proc Natl Acad Sci U S A. 2004]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9388-93.
[Proc Natl Acad Sci U S A. 2003]Mol Biol Evol. 2004 Jun; 21(6):1110-22.
[Mol Biol Evol. 2004]Mol Biol Evol. 2005 Mar; 22(3):520-32.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Jun; 22(6):1456-67.
[Mol Biol Evol. 2005]Genome Res. 2005 Aug; 15(8):1023-33.
[Genome Res. 2005]Gene. 2005 Jun 6; 352():109-17.
[Gene. 2005]Genome Res. 2003 Oct; 13(10):2229-35.
[Genome Res. 2003]Genetics. 2001 Jun; 158(2):927-31.
[Genetics. 2001]Genome Res. 2003 Oct; 13(10):2229-35.
[Genome Res. 2003]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Trends Genet. 2001 Nov; 17(11):615-8.
[Trends Genet. 2001]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Science. 2001 May 11; 292(5519):1096-9.
[Science. 2001]Nat Rev Genet. 2002 Nov; 3(11):850-61.
[Nat Rev Genet. 2002]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Mol Biol Evol. 2006 Feb; 23(2):310-6.
[Mol Biol Evol. 2006]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]Trends Genet. 2001 Nov; 17(11):615-8.
[Trends Genet. 2001]Genome Res. 2005 Aug; 15(8):1023-33.
[Genome Res. 2005]Mol Biol Evol. 2003 Aug; 20(8):1188-94.
[Mol Biol Evol. 2003]Mol Biol Evol. 1998 Jul; 15(7):871-9.
[Mol Biol Evol. 1998]Mol Biol Evol. 2000 Apr; 17(4):540-52.
[Mol Biol Evol. 2000]Genome Res. 2004 Jan; 14(1):44-53.
[Genome Res. 2004]Science. 2003 Aug 8; 301(5634):829-32.
[Science. 2003]Genome Res. 2003 Oct; 13(10):2229-35.
[Genome Res. 2003]Trends Genet. 2001 Nov; 17(11):615-8.
[Trends Genet. 2001]Nucleic Acids Res. 1987 Feb 11; 15(3):1281-95.
[Nucleic Acids Res. 1987]J Bacteriol. 2000 Sep; 182(18):5238-50.
[J Bacteriol. 2000]Proc Natl Acad Sci U S A. 1996 Apr 2; 93(7):2873-8.
[Proc Natl Acad Sci U S A. 1996]Genome Res. 2004 Jan; 14(1):44-53.
[Genome Res. 2004]J Mol Evol. 1993 Jan; 36(1):96-9.
[J Mol Evol. 1993]Gene. 2005 Jun 6; 352():109-17.
[Gene. 2005]Genome Biol. 2001; 2(12):RESEARCH0054.
[Genome Biol. 2001]