![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||
Copyright © 2007 McNeal et al; licensee BioMed Central Ltd. Complete plastid genome sequences suggest strong selection for retention of photosynthetic genes in the parasitic plant genus Cuscuta 1Department of Plant Biology, University of Georgia, Athens, GA 30602, USA 2Department of Biology, Huck Institutes of the Life Sciences, and Institute of Molecular Evolutionary Genetics, The Pennsylvania State University, University Park, Pennsylvania 16802-5301, USA 3DOE Joint Genome Institute and Lawrence Berkeley National Laboratory, Walnut Creek, California 94598, USA 4Genome Project Solutions, Hercules, CA 94547, USA Corresponding author.Joel R McNeal: jmcneal/at/plantbio.uga.edu; Jennifer V Kuehl: JVKuehl/at/lbl.gov; Jeffrey L Boore: jlboore/at/calmail.berkeley.edu; Claude W de Pamphilis: cwd3/at/psu.edu Received June 6, 2007; Accepted October 24, 2007. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background Plastid genome content and protein sequence are highly conserved across land plants and their closest algal relatives. Parasitic plants, which obtain some or all of their nutrition through an attachment to a host plant, are often a striking exception. Heterotrophy can lead to relaxed constraint on some plastid genes or even total gene loss. We sequenced plastid genomes of two species in the parasitic genus Cuscuta along with a non-parasitic relative, Ipomoea purpurea, to investigate changes in the plastid genome that may result from transition to the parasitic lifestyle. Results Aside from loss of all ndh genes, Cuscuta exaltata retains photosynthetic and photorespiratory genes that evolve under strong selective constraint. Cuscuta obtusiflora has incurred substantially more change to its plastid genome, including loss of all genes for the plastid-encoded RNA polymerase. Despite extensive change in gene content and greatly increased rate of overall nucleotide substitution, C. obtusiflora also retains all photosynthetic and photorespiratory genes with only one minor exception. Conclusion Although Epifagus virginiana, the only other parasitic plant with its plastid genome sequenced to date, has lost a largely overlapping set of transfer-RNA and ribosomal genes as Cuscuta, it has lost all genes related to photosynthesis and maintains a set of genes which are among the most divergent in Cuscuta. Analyses demonstrate photosynthetic genes are under the highest constraint of any genes within the plastid genomes of Cuscuta, indicating a function involving RuBisCo and electron transport through photosystems is still the primary reason for retention of the plastid genome in these species. Background Parasitic plants offer excellent opportunities to study changes in genome evolution that accompany the switch from an autotrophic to a heterotrophic lifestyle, a transition that has occurred many times over the course of evolution. Within angiosperms, the ability to obtain nutrition through direct attachment to a host plant has evolved at least a dozen times [1] with many additional instances of plants obtaining most or all of their nutrition through specific mycotrophic fungal interactions [2,3]. While approximately 90% of genes involved in photosynthesis have been transferred to the nuclear genome over the course of chloroplast evolution since divergence from free-living cyanobacterial relatives [4], these nuclear genes are often difficult to study in non-model organisms. Widespread gene and genome duplication often makes inference of orthology among nuclear genes difficult, and rate acceleration in ribosomal loci of some parasitic plants suggests that the sequences of nuclear genes may be too divergent to amplify through standard PCR [5]. By contrast, genes remaining on the plastid chromosome evolve more slowly than nuclear genes and exist as single, readily identifiable orthologs in each plastome, although the plastid chromosome itself is in high copy number per cell [6]. Many species of parasitic plants retain the ability to photosynthesize, and aside from a supplemental connection to the roots of a host, otherwise resemble fully autotrophic plants in habit [7]. Others, however, display increased dependency on their hosts, often to the extent of becoming fully heterotrophic and nonphotosynthetic. Such plants are often deemed "holoparasites", and one such species, Epifagus virginiana (Beechdrops, Orobanchaceae) is the only parasitic plant whose full plastid genome has been sequenced to date [8]. Its plastid genome is reduced to less than half the size of that in normal angiosperms due to ubiquitous gene loss, including all photosynthetic and photorespiratory genes, some ribosomal protein genes, many tRNA genes, and genes for plastid-encoded polymerase [8,9]. Despite such drastic changes, plastid transcription and intron splicing still occur [9,10], presumably for the purpose of producing the four remaining proteins not related to transcription or translation. Smaller scale studies show similar or less genome reduction in related species [11-14]. For some holoparasitic lineages, existence of a functional plastid genome remains to be proven, although preliminary evidence suggests extremely divergent plastid genomes may occur in the families Balanophoraceae, Cytinaceae, Hydonoraceae, and Cynomoriaceae [15,16]. A large number of studies on plastid function have been performed involving members of the parasitic genus Cuscuta, derived from within the otherwise autotrophic Morning Glory Family (Convolvulaceae, order Solanales, class Asteridae). Plastid ultrastructure and gene content are quite variable between different taxa [17], and over 150 species exist in this widespread and recognizable genus [18]. Unlike Epifagus and other root-parasitic Orobanchaceae, Cuscuta is a twining vine with no roots at maturity. Instead, it sends its shoot like feeding organs, haustoria, directly into the stems of its hosts to invade the vasculature and obtain all necessary water and other nutrients. Leaves are reduced to vestigial scales. Despite an obligate reliance upon their hosts, many Cuscuta species show some green color, at least in their inflorescences and, particularly, in maturing ovules. Machado and Zetsche demonstrated the presence of RuBisCo, chlorophyll, and low levels of carbon fixation in Cuscuta reflexa, a member of subgenus Monogyna [19]. Additionally, although all NADH dehydrogenase (ndh) genes were either undetectable or nonfunctional [20], other genes related to photosynthesis appeared to be present in functional form [21]. In this species, green plastids of normal function are localized to a ring of cells between the stem pith and cortex that are isolated from atmospheric gas exchange, indicating photosynthesis may occur in this species using recycled respiratory CO2 [22] despite an altered xanthophyll cycle in its light-harvesting complex [23]. Dot blots using poly-A selected RNA from C. reflexa as a probe also showed positive hybridization to some of the 101 tobacco genes in tobacco, although whether these results actually represent nuclear transcribed copies of the plastid genes, polyadenylated plastid transcripts, or leakage of non-polyadenylated plastid transcripts through cDNA production steps is unclear [24]. A different situation exists in Cuscuta pentagona (subgenus Grammica), which lacks the ring of photosynthetic cells observed in the stems of C. reflexa, but possesses what appear to be photosynthetically capable plastids with immunodetectable RuBisCo, photosystem, and light-harvesting proteins in proper locations within the plastids in green tissues of seedlings and adult plants [25]. Other species within subgenus Grammica show a range of rbcL transcript levels, from low to none [17], and sampled members of this subgenus lack promoters for plastid-encoded polymerase upstream of the rrn16 and rbcL genes, although transcription of rbcL still occurs from nuclear-encoded polymerase promoter sites in both cases [26]. Conflicting evidence exists for Cuscuta europaea (subgenus Cuscuta), which has been described as lacking chlorophyll and detectable rbcL protein [19], yet still possesses green color and more typical plastid sequences, including rbcL, than members of subgenus Grammica [27]. Other minor changes have been detected in the plastid genome of Cuscuta sandwichiana, such as deletion of introns within ycf3, constriction of the inverted repeat to exclude rpl2, rpl23, and trnI, loss of trnV-UAC, and reduction in size of ycf2; slight changes to the end of atpB, size reduction of the trnL intron, and deletion of the rpl2 intron are shared with other, non-parasitic Convolvulaceae, and occurred before the evolution of parasitism in Cuscuta [28]. In this study, we test if significant changes to the plastid genome have occurred prior to the evolution of parasitism, if previously published observations of plastid genome evolution in Cuscuta apply to other members of the genus, if differences in chlorophyll content and distribution between Cuscuta species parallel differences in plastid genome content, whether plastid genes retained in Cuscuta are still evolving under strong purifying selection, and whether plastid gene retention and selective constraint suggest a photosynthetic function for plastids in this parasitic genus. To do so, we sequenced the full plastid genomes of two species of Cuscuta and a close photosynthetic relative, Ipomoea purpurea (Common Morning Glory). Ipomoea is a member of the Convolvuloideae clade, which has been shown as the most likely sister group to Cuscuta in a number of studies [27,29-31]. Cuscuta exaltata, a member of subgenus Monogyna with visible chlorophyll distributed throughout the stems and inflorescences, and Cuscuta obtusiflora, a member of subgenus Grammica that usually only exhibits green pigmentation in inflorescences, fruits, starved seedlings and stressed stem tips, were chosen to represent Cuscuta. We examined overall rates of substitution and changes in selective constraint by comparing rates of synonymous and nonsynonymous substitution for all plastid genes and across functionally defined classes of genes to determine if photosynthetic genes remain the most highly conserved in the plastid genome and whether relaxation of functional constraint precedes gene losses both before and after the evolution of parasitism in this lineage. We also tested whether patterns of transfer RNA loss, changes in intergenic regions, and rates of substitution parallel those seen in the completely nonphotosynthetic Epifagus virginiana. Finally, we use the cumulative evidence of photosynthetic localization, specific gene loss, and strong functional constraint of specific genes to suggest a photosynthetic function of the plastid genome unrelated to the Calvin Cycle in Cuscuta and perhaps other parasitic plants as well. Results and Discussion Plastid Genome Size and Inverted Repeat Structure The three plastid genomes presented here all have a pair of large, inverted, identical repeat sequence (IR) separated from each other by a large single copy and small single copy region (LSC and SSC) on either end, as is the case for practically all plant plastid genomes [32]. However, considerable length variation exists between these three plastid genome sequences, with the smallest genome, Cuscuta obtusiflora, barely half the size of that in Ipomoea purpurea (85,280 base pairs versus 162,046 bp). Cuscuta exaltata is intermediate in size at 125,373 bp (See figs. figs.1,1
Plastid Gene Content Gene content in Ipomoea is decidedly similar to that in Nicotiana and Atropa. These three taxa, along with both Cuscuta species, lack an intact infA [34], indicating this gene loss probably occurred prior to the divergence of Solanaceae from Convolvulaceae, both in the order Solanales. This is not surprising, as infA has been lost from the plastid many times in angiosperm evolution [35]. A second gene, ycf15, is lost across Convolvulaceae taxa sequenced in this study but is present in Solanaceae and outgroups [34,36,37]. However, the function of this gene is not known, and the effect of its loss in Convolvulaceae is difficult to interpret. A third gene, rpl23, is clearly a pseudogene in Cusucta exaltata and is lost completely in C. obtusiflora, but it is not clear whether it is functional in Ipomoea. Although a full length open reading frame exists in Ipomoea for rpl23, it contains two frameshift mutations and an extension of the 3' end. The gene also does not appear to be evolving under purifying selective constraint as in Nicotiana (see fig. fig.4),4
One gene that was surprisingly found in all three Convolvulaceae plastid genomes is ycf1, a large gene of unknown function previously reported as missing in Cuscuta and three other Convolvulaceae [38]. That study used Southern Blot hybridizations to screen for gene presence; ycf1 is still present as the second largest open reading frame in the plastid genome, but is extremely variable in size between the two Cuscuta species and is greatly elongated in Ipomoea, possesses numerous large indels, and is difficult to align with other species at the protein level in some regions These factors likely explain the negative hybridizations previously observed. Although it is one of the least conserved genes in both Cuscutas and Ipomoea, it is still apparently evolving under selective constraint as a functional gene. As is the case for ycf15, interpreting consequences of the extreme divergence of this gene in Convolvulaceae awaits full knowledge of its function. Gene loss is much more prominent in the two Cuscuta species than Ipomoea. All genes lost in C. exaltata are also lost in C. obtusiflora, and are most parsimoniously assumed to be lost in the common ancestor of both species. Most notable of these losses are the ndh genes, all of which are fully lost from the plastid or are pseudogenes in Cuscuta. This confirms the PCR and blot data collected for Cuscuta reflexa that suggested all ndh genes were missing, highly altered, or translocated in that species [21] as well as negative PCR and sequence results from other species [28]. All ndh genes are also lost from the plastid in Epifagus [9], indicatingevolution of parasitism may facilitate loss of these genes or movement to the nuclear genome. Although ndh genes are retained in most photosynthetic plants, they are also lost from the chloroplast genome of Pinus [39], indicating their presence in the plastid genome is not necessary for photosynthesis even in fully autotrophic plants. Both Cuscuta species also lack a functional rps16 gene in the plastid, although C. exaltata contains a pseudogene with portions of both exons and the group II intron present between them. A final gene loss from both Cuscuta plastomes that is also reported in C. reflexa is the loss of trnK-UUU [40]. As is the case for Epifagus, C. exaltata retains the open reading frame, matK, contained within the intron of that tRNA. A deletion within the trnV -CAU intron also reported in C. reflexa [21], and similar to that seen in Orobanche minor, may hypothetically disrupt its splicing [13], but because both exons remain intact in these species, we hesitate to call it a pseudogene in C. exaltata without experimental evidence. Aside from these gene losses, plastid genome content of C. exaltata is identical to that in Ipomoea and includes a full set of genes presumably necessary for photosynthesis. Plastid Genome Rearrangements Structurally, the plastid genome of C. exaltata has undergone a number of changes relative to Ipomoea and Nicotiana. The LSC end of the IR is constricted in both Cuscuta species, but it has apparently re-extended to include a few nucleotides of trnH-GUG (4 nucleotides in C. exaltata, 6 in C. obtusiflora). As in Ipomoea, the first full gene in the LSC end of the IR in C. obtusiflora is trnI-CAU. However, the IR constriction is much more dramatic in C. exaltata, with rpl2, trnI, and over half of ycf2 falling outside the IR (fig. (fig.2).2 Plastid Genome Changes in C. obtusiflora The plastid genome of Cuscuta obtusiflora surprisingly lacks any structural rearrangements relative to Nicotiana and Ipomoea. Unlike C. exaltata, C. obtusiflora lacks extensive pseudogene sequence and may have purged such unused DNA from its plastome before sequence motifs conducive to inversion events had time to develop. Gene loss, on the other hand, is much more rampant within C. obtusiflora (Table 1). In addition to the genes previously discussed for C. exaltata, C. obtusiflora has lost a third ribosomal protein gene, rpl32, and five additional tRNAs. Also lost are all subunits of the plastid-encoded RNA polymerase (rpo), and the intron maturase matK, the loss of which parallels loss of all group IIA introns from the genome as well, as previously reported [41]. Blot data and negative PCR results have suggested loss of plastid rpo genes from other species within subgenus Grammica as well [26,28], although the rrn gene cluster and rbcL gene appear to still be transcribed from nuclear-encoded polymerase in at least some species [42]. Despite such extensive gene loss from the plastome, C. obtusiflora retains all plastid genes directly involved in photosynthesis within the chloroplast, including all atp genes, all pet genes, rbcL, and all psa and psb genes, with the exception of psaI. This gene is one of the smallest in the plastome (36 codons or less), although it is highly conserved across land plants. Losses of trnV and two introns within ycf3 reported for another member of subgenus Grammica, Cuscuta sandwichiana [28], are also present in Cuscuta obtusiflora.
Selective Constraint in Plastid Genes With these three new full plastid genome sequences, we tested whether substantial changes in selective pressure of genes, particularly those lost in Cuscuta, occurred prior to evolution of parasitism in this lineage. After calculating corrected distances of nonsynonymous nucleotide substitution per nonsynonymous site (dN) and synonymous substitution per synonymous site (dS) for all genes in Ipomoea and Nicotiana relative to a common outgroup, Panax, an interesting trend toward relaxed selection in the genome of the fully autotrophic Ipomoea was revealed (fig. (fig.4).4
Pairwise relative rates tests also show significant overall rate differences between Ipomoea and Cuscuta exaltata as well as between the two Cuscuta species for all types of genes (Table 2). We next wanted to test whether ratios of overall selection between classes of genes remaining in Cuscuta are similar to autotrophic taxa. Figure Figure55
Although plastid genes in Cuscuta are still evolving under strong negative selection, the data show that they are somewhat relaxed compared to their fully autotrophic relatives. Figure Figure66
Plastid Genome Differences in Cuscuta versus Epifagus virginiana Although Epifagus virginiana has undergone a similar downsizing of its plastid genome, it and Cuscuta are quite different in a number of ways, most obviously in that Cuscuta retains a seemingly functional set of photosynthetic genes while Epifagus has lost all such genes. With the loss of rpo genes in both taxa, we investigated whether both taxa show similar patterns of deletion in intergenic regions, which should contain plastid promoters, transcription-factor binding sites, and other motifs no longer necessary in a nuclear-transcribed plastome. Overall, Epifagus has 22 fewer protein coding genes and 7 fewer tRNA genes than C. obtusiflora. While the plastid genome size of Epifagus (70,028 bp) is over 15 kilobases smaller than that of C. obtusiflora, this is actually less than would be expected given such a dramatic difference in overall gene content. In 63 non-coding, intergenic regions between homologous functional genes in both Cuscuta species, Ipomoea, and Nicotiana, C. obtusiflora (11714) has undergone a 49% overall decrease in length relative to Nicotiana (22,996 bp), perhaps largely due to a deletion of plastid polymerase and transcription factor binding sites. C. exaltata has decreased 16% over the same area, and Ipomoea only 1%. Over the 16 intergenic regions shared by Epifagus, C. obtusiflora has decreased by 33% relative to Nicotiana, while Epifagus has only decreased by slightly over 3% (values in Table 4A). Likewise, in 3 regions for which conserved functional genes flanking regions containing homologously defunct genes could be compared between Epifagus and C. obtusiflora, Epifagus exhibits a 32% total decrease in size relative to the full length sequences containing functional genes in Nicotiana, while C. obtusiflora is 85% shorter (Table 4B). The IR of Epifagus is almost the same length as that of a normal angiosperm, while its SSC and LSC regions are the sites of practically all of its gene loss. Cuscuta obtusiflora has extensive deletion in those areas too, but also exhibits a significant contraction of the IR, largely through pseudogene loss relative to Epifagus. While Cuscuta obtusiflora almost completely lacks pseudogene sequences, Epifagus retains a fair number of them. Coupled with various intron losses, the plastid genome of C. obtusiflora is much more streamlined than that of Epifagus.
We also wanted to test whether genes remaining in the plastid of the fully nonphotosynthetic Epifagus are under less constraint than those of the putatively photosynthetic Cuscuta species. Surprisingly, among the alignable genes they share, C. obtusiflora is usually more divergent at the protein level from a common outgroup, Panax (fig. (fig.7).7
C. obtusiflora retains the four protein-coding genes in Epifagus not related to transcription or translation and presumably the reason for retaining a plastid genome in that species: accD, clpP, ycf1, and ycf2 [8]. accD and clpP both are less constrained in Cuscuta than in Epifagus, and in clpP, dramatically so, with all three Convolvulaceae taxa exhibiting higher dN/dS for both genes. The effect this has on the amino acid divergence is also very apparent (fig. (fig.7).7 Comparisons with other Nonphotosynthetic Lineages Despite some differences in patterns of evolution, many parallels exist between plastid genome evolution in Cuscuta and that of the related but independently derived parasitic lineage Orobanchaceae, including Epifagus. Both lineages show overall increased rates of nucleotide substitution, relaxed selective constraint, and lack any appreciable shift in synonymous codon usage in spite of loss of multiple tRNAs [46](Table 5). Substantial gene loss is observed in both lineages; in addition to sharing loss of all ndh and rpo genes with C. obtusiflora, Epifagus has lost a largely overlapping set of tRNAs from its plastid genome. All tRNAs lost in C. obtusiflora are also lost in Epifagus with the exception of trnR-ACG, and even that has been suggested to be a pseudogene [13]. The three ribosomal proteins lost in C. obtusiflora are also a subset of the six lost in Epifagus. Although Epifagus lacks all photosynthetic genes, other Orobanchaceae retain genes normally required for photosynthesis in seemingly functional form. Lathraea clandestina has what appears to be a functional rbcL (RuBisCo, large subunit) gene, and rpo genes are also amplifiable by PCR, despite the fact that the plant apparently lacks chlorophyll and spends its entire life cycle underground except when flowering [12]. Similarly, some members of the genus Orobanche and other holoparasites within the family retain rbcL genes that appear to be evolving under functional constraint [47-49]. Pholisma, a genus in the holoparasitic family Lennoaceae, is yet another example of an independently nonphotosynthetic lineage retaining rbcL [50]. Without full plastid genome sequence from these plants, it is difficult to know whether they too may still possess a necessary complement of plastid genes for residual photosynthesis, although unlike the Cuscuta species in this study, they lack obvious chlorophyll at any life stage and are not above ground to encounter light for most of their life cycle.
Other non-angiosperm plastid-containing parasite lineages also show patterns of plastid gene loss and loss of selective constraint. Apicomplexan apicoplast genomes, which are thought to be derived from a plastid ancestor, are more similar to Epifagus than Cuscuta in that they contain only a few genes not involved in transcription or translation, none of which are related to photosynthesis. However, none of the remaining non-transcriptional/translational genes are shared with Epifagus, and RNA polymerase genes are retained in apicoplast genomes [51]. The euglenoid parasite Astasia longa also has a highly reduced plastid genome but like Cuscuta retains rbcL [52]. Nonphotosynthetic algae are variable in plastid genome content; the only genes not involved in transcription or translation that are normally found in angiosperms in the plastid genome of Helicosporidium are ycf1 and accD, two of the four such genes retained in parallel by Epifagus[53]. Prototheca also retains ATP synthase genes, part of the photosynthetic apparatus also retained by Cuscuta but lost in Epifagus[54]. Although some parallels exist, each independently derived parasite lineage appears to follow a unique pathway in plastid genome reduction. Lipid Biosynthesis as a Hypothesis for Photosynthetic Gene Retention in Cuscuta While it has been hypothesized that the plastid genome must be retained at least minimally in nonphotosynthetic organisms for the transcription of trnE, an essential product for tetrapyrrole synthesis in the plastid[55], retention of a conserved photosynthetic apparatus in Cuscuta suggests another important role for the plastid genome in these parasites. Because no atmospheric gas exchange occurs with chlorophyllous cells in C. reflexa, recycling of respiratory carbon dioxide has been presented as a hypothesis for retention of photosynthesis in that species, and although their source carbohydrates all apparently originate from the host, a net decline in carbon dioxide release is indeed detected in the presence of light [22]. However, recycling carbon dioxide back to carbohydrate through the Calvin cycle is not the only potential reason for retaining photosynthesis. Another possible explanation for conservation of photosynthetic genes in Cuscuta and retention of rbcL in other holoparasities may lie in a recently described alternative function of RuBisCo involving lipid biosynthesis, where it acts independently of its formerly known role in Calvin cycle production of carbohydrates. In this alternative pathway, 20% more acetyl-CoA is available for fatty acid biosynthesis, and 40% less carbon is lost as carbon dioxide in green seeds of Brassica napus. This pathway is still largely reliant on ATP and NADPH generated during the light reactions of photosynthesis, although less than 15% of that necessary for the Calvin cycle is needed for this function of RuBisCo to play a dominant role in lipid synthesis [56]. No atmospheric carbon dioxide would be necessary for this process, and it could also explain the observation of less respiratory carbon dioxide loss during light exposure [22], when necessary ATP and NADPH for the reaction would be produced. Chlorophyll is most concentrated in developing ovules and seeds of Cuscuta obtusiflora and close relatives in subgenus Grammica like Cuscuta pentagona, which has been shown to lack the circular ring of chlorophyllous cells between the pith and cortex [57]. Because Cuscuta species must survive long enough after germination to search for and attach to a host, utilizing this alternative function of photosynthesis for efficient lipid allocation to seeds and subsequent efficient carbon use in the free-living seedlings may explain the need for an intact photosynthetic apparatus in this parasitic lineage. Use of RuBisCo for lipid biosynthesis may also explain retention of rbcL in other holoparasite angiosperm plastids and in the parasitic euglenoid Astasia. One of the few genes for which the plastid genome is transcribed in Epifagus and Helicosporidium, accD, is a subunit of acetyl co-A carboxylase, indicating that lipid biosynthesis remains an important function of plastids in these species as well. Lipid biosynthesis has already been experimentally demonstrated as a major function of apicoplasts and plastids in a number of nonphotosynthetic non-angiosperm species[58,59]. Future physiological study of photosynthetic tissues in Cuscuta as well as other parasitic plants, which may have largely if not entirely lost the primary photosynthetic function of their plastid genomes, should lead to greater understanding of possible alternate roles of the plastid genome in parasitic and autotrophic plants alike. Conclusion By sequencing the full plastid genomes of two parasitic angiosperms in the genus Cuscuta and a non-parasitic close relative, Ipomoea, we have been able to gain a better understanding of the directional downsizing of the plastid genome in heterotrophs as dependence on photosynthesis for carbohydrate production decreases. These genomes are greatly complemented by comparison to that already existing for the parasitic angiosperm Epifagus virginiana; by studying the similarities and the differences between these two parasitic lineages, we have unveiled a clearer picture of which changes to the plastid genome might be expected to occur in all transitions to heterotrophy, which might be peculiar to those plants lacking chlorophyll, and which may be lineage specific to Cuscuta. We find ndh genes to be the only genes functionally lost during the transition to parasitism in Cuscuta, while more extensive loss of both coding and noncoding material has occurred more recently. Despite substantial reduction of the plastid genome in Cuscuta, the most highly conserved genes continue to be those directly involved in photosynthesis, whereas those genes shared with the fully nonphotosynthetic Epifagus are amongst the least conserved. We postulate that the extremely high levels of constraint on remaining photosynthetic genes may indicate an important function for these gene products in lipid biosynthesis rather than carbohydrate production through the Calvin Cycle. Methods Plastid Genome Sequencing, Assembly, and Annotation Seeds of all three species were germinated and grown in the Pennsylvania State University Biology Greenhouse. An heirloom cultivar of Ipomoea purpurea, "Grandpa Ott's", was used to decrease likelihood of heteroplasmy within the sample. One gram of young leaf tissue was used for DNA isolation in Ipomoea. One gram of tissue from a collection of very green seedlings originating from a selfed parental plant was used for Cuscuta exaltata, and one gram of stem tip tissue was used for Cuscuta obtusiflora. Partial fosmid libraries were constructed from the extracted DNA using the CopyControl Fosmid Library Production Kit (Epicentre, Madison, WI). Libraries were screened for clones containing plastid DNA according to McNeal et al[60]. A subset of clones covering the entire plastid genome was selected for each species, and clones were shotgun sequenced and the reads assembled according to previously described methods [61]. Genome annotations were completed using DOGMA [62] in combination with manual sequence alignments of previously annotated genes from available related species. Sequences were deposited in Genbank with accession numbers EU118126 (Ipomoea), EU189132 (C. exaltata), and EU189133 (C. obtusiflora). Molecular Evolutionary Analyses Phylogenies for each gene were constructed in PAUP*4.0b10 [63] under various Maximum Parsimony, Neighbor-Joining, and Maximum Likelihood criteria, including the following related taxa with full plastid genome sequences publicly available: Nicotiana tabacum (Genbank accession NC 001879) and Atropa belladona (NC 004561)(Solanaceae, Solanales, Asteridae), Panax ginseng (NC 006290)(Araliaceae, Apiales, Asteridae), and Spinacia oleracea (NC 002202)(Caryophyllidae) as an outgroup. All sequenced genes appeared orthologous, with only minor, method-dependent aberrations from the expected phylogeny which were probably due to extreme rate heterogeneity between taxa [64]. Maximum Likelihood analyses that were performed under the General Time-Reversible model with gamma distribution of among-site variation (GTR+gamma) and model parameters estimated from the data were most accurate at obtaining the expected phylogeny from the data and were used for subsequent phylogenetic reconstruction of combined-gene datasets. Pairwise estimates of synonymous substitutions per synonymous site (dS) and nonsynonymous nucleotide substitutions per nonsynonymous site (dN) and standard errors were computed under the Kumar method using MEGA 2.1 [65] for each gene and for classes of genes that together encode subunits of larger proteins. ATP synthase genes, 6 genes, 4344 aligned characters (atp); cytochrome b6/f complex subunits, 6 genes, 2622 aligned characters (pet); photosystem I and II protein subunits, 19 genes, 11730 aligned characters (psa and psb = ps); large and small ribosomal protein subunits, 17 genes, 7686 aligned characters (rpl and rps = rp); plastid-encoded RNA polymerase 4 genes, 11958 aligned characters (rpo); and NADH-dehydrogenase, 11 genes, 10,653 aligned characters (ndh) were predefined classes of genes examined. Pairwise dS, dN, and amino acid p-distance were also calculated between Epifagus virginiana and Panax, the closest available outgroup to both it and Cuscuta. Maximum Likelihood estimates of synonymous substitution rates and the ratio of nonysnonymous to synonymous rates (R) for each branch of the combined-gene dataset phylogenies were calculated under the MG94 × HKY 3 × 4 codon model in the HYPHY .99beta package [66]. HYPHY was also used to conduct likelihood ratio tests (LRTs) between trees with universally constrained synonymous rates and R versus trees with each respective parameter free from constraint on one branch. Branches leading to Convolvulaceae (Ipomoea + both Cuscuta species), Cuscuta, and each individual Cuscuta species were tested for significant p-values in this manner. Additionally, pairwise relative rates tests were conducted for each gene class using various combinations of taxa with all parameters constrained as the null hypothesis and all parameters unconstrained as the alternate hypothesis. Pairwise relative ratio tests were conducted in HYPHY between the combined datasets with either synonymous or nonsynonymous distances constrained as the null hypothesis to determine whether there was significant heterogeneity in either across gene classes. Parameters were estimated independently for each branch. Finally, GCUA [67] was used to determine relative synonymous codon usage across all coding sequences for each genome to identify any changes in codon bias that may have accompanied tRNA loss or relaxed selection for photosynthesis. Note in Proof Two additional Cuscuta plastid genome sequences were published (Funk et al. BMC Plant Biology 2007, 7:45) during the late stages of review of our manuscript. Although the analyses of the data in our manuscripts demonstrate little overlap, the plastid genome structure and content of Cuscuta reflexa in that manuscript closely parallels our data from Cuscuta exaltata. Likewise, the sequence of Cuscuta gronovii demonstrates similar structure and gene content as Cuscuta obtusiflora. Competing interests The author(s) declares that there are no competing interests. Authors' contributions JRM cultivated the plants, produced and screened the partial genomic fosmid libraries, annotated and analyzed the plastid genomes, and drafted the manuscript. JVK subcloned, sequenced, and assembled reads for the selected fosmid clones. JLB and CWD helped conceive, design, and coordinate the study and made large contributions to manuscript editing. All authors read and approved the final manuscript. Acknowledgements We would like to thank Mauricio Bonifacino and Daniel Austin for assistance in obtaining plant material for the study, Tony Omeis for greenhouse assistance, Tim Chumley for technical support, Robert Jansen for general support and encouragement, and David Geiser, Stephen Schaeffer, Andrew Stephenson, and Kai Müeller for critical review of the manuscript. We also thank the Pennsylvania State University Department of Biology for support, including a Henry W. Popp fellowship awarded to JRM. This research was supported by National Science Foundation Grants DEB-0206659 and DEB-0120709 and partly performed under the auspices of the US Department of Energy's Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under Contract No. DE-AC02-05CH11231. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||
Mol Ecol. 2001 Sep; 10(9):2285-95.
[Mol Ecol. 2001]Nature. 2002 Sep 26; 419(6905):389-92.
[Nature. 2002]Plant Physiol. 1998 Sep; 118(1):9-17.
[Plant Physiol. 1998]J Mol Evol. 1994 Jul; 39(1):62-70.
[J Mol Evol. 1994]Proc Natl Acad Sci U S A. 1987 Dec; 84(24):9054-8.
[Proc Natl Acad Sci U S A. 1987]Proc Natl Acad Sci U S A. 1992 Nov 15; 89(22):10648-52.
[Proc Natl Acad Sci U S A. 1992]Nature. 1990 Nov 22; 348(6299):337-9.
[Nature. 1990]Plant Mol Biol. 1995 Nov; 29(4):721-33.
[Plant Mol Biol. 1995]Plant Mol Biol. 1995 Dec; 29(5):1071-9.
[Plant Mol Biol. 1995]Curr Genet. 1992 Apr; 21(4-5):417-22.
[Curr Genet. 1992]Planta. 2000 Apr; 210(5):701-7.
[Planta. 2000]Plant Mol Biol. 1994 Jan; 24(1):217-22.
[Plant Mol Biol. 1994]Mol Gen Genet. 1992 Mar; 232(1):154-61.
[Mol Gen Genet. 1992]Proc Natl Acad Sci U S A. 1999 Feb 2; 96(3):1135-9.
[Proc Natl Acad Sci U S A. 1999]J Exp Bot. 2005 Sep; 56(419):2477-86.
[J Exp Bot. 2005]Planta. 2000 Apr; 210(5):701-7.
[Planta. 2000]Planta. 2003 Mar; 216(5):815-23.
[Planta. 2003]J Mol Evol. 2005 Sep; 61(3):292-305.
[J Mol Evol. 2005]Annu Rev Genet. 1985; 19():325-54.
[Annu Rev Genet. 1985]J Exp Bot. 2005 Sep; 56(419):2477-86.
[J Exp Bot. 2005]Mol Biol Evol. 2002 Sep; 19(9):1602-12.
[Mol Biol Evol. 2002]Plant Cell. 2001 Mar; 13(3):645-58.
[Plant Cell. 2001]DNA Res. 2004 Aug 31; 11(4):247-61.
[DNA Res. 2004]Plant Mol Biol. 2001 Feb; 45(3):307-15.
[Plant Mol Biol. 2001]Mol Gen Genet. 1992 Mar; 232(1):154-61.
[Mol Gen Genet. 1992]J Mol Evol. 2005 Sep; 61(3):292-305.
[J Mol Evol. 2005]Nature. 1990 Nov 22; 348(6299):337-9.
[Nature. 1990]Proc Natl Acad Sci U S A. 1994 Oct 11; 91(21):9794-8.
[Proc Natl Acad Sci U S A. 1994]Curr Genet. 1993 Jul-Aug; 24(1-2):171-6.
[Curr Genet. 1993]Curr Genet. 1993 Jul-Aug; 24(1-2):171-6.
[Curr Genet. 1993]Proc Natl Acad Sci U S A. 1992 Nov 15; 89(22):10648-52.
[Proc Natl Acad Sci U S A. 1992]Planta. 2003 Mar; 216(5):815-23.
[Planta. 2003]J Mol Evol. 2005 Sep; 61(3):292-305.
[J Mol Evol. 2005]Planta. 2004 Jul; 219(3):541-6.
[Planta. 2004]Nature. 1990 Nov 22; 348(6299):337-9.
[Nature. 1990]Plant Mol Biol. 1995 Nov; 29(4):721-33.
[Plant Mol Biol. 1995]Proc Natl Acad Sci U S A. 1992 Nov 15; 89(22):10648-52.
[Proc Natl Acad Sci U S A. 1992]Nature. 2003 Sep 4; 425(6953):86-9.
[Nature. 2003]EMBO J. 1991 Nov; 10(11):3281-8.
[EMBO J. 1991]Genetics. 1998 Sep; 150(1):425-33.
[Genetics. 1998]Mol Biol Evol. 2002 Aug; 19(8):1292-302.
[Mol Biol Evol. 2002]Plant Mol Biol. 1997 Apr; 33(6):965-77.
[Plant Mol Biol. 1997]Mol Phylogenet Evol. 2002 Aug; 24(2):274-301.
[Mol Phylogenet Evol. 2002]J Mol Biol. 1996 Aug 16; 261(2):155-72.
[J Mol Biol. 1996]Protist. 2000 Dec; 151(4):347-51.
[Protist. 2000]Mol Genet Genomics. 2002 Jun; 267(4):492-7.
[Mol Genet Genomics. 2002]Trends Plant Sci. 2006 Feb; 11(2):101-8.
[Trends Plant Sci. 2006]Nature. 2004 Dec 9; 432(7018):779-82.
[Nature. 2004]Eukaryot Cell. 2005 Feb; 4(2):253-61.
[Eukaryot Cell. 2005]Biotechniques. 2006 Jul; 41(1):69-73.
[Biotechniques. 2006]Methods Enzymol. 2005; 395():348-84.
[Methods Enzymol. 2005]Bioinformatics. 2004 Nov 22; 20(17):3252-5.
[Bioinformatics. 2004]J Gastrointest Surg. 2002 Nov-Dec; 6(6):845-52; discussion 853-4.
[J Gastrointest Surg. 2002]Bioinformatics. 1998; 14(4):372-3.
[Bioinformatics. 1998]