![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||
Copyright © 2007, American Society for Microbiology Function, Structure, and Evolution of the RubisCO-Like Proteins and Their RubisCO Homologs† Department of Microbiology and Plant Molecular Biology/Biotechnology Program, The Ohio State University, 484 West 12th Avenue, Columbus, Ohio 43210-1292,1 Graduate College of Marine and Earth Studies, Delaware Biotechnology Institute, University of Delaware, 127 DBI, 15 Innovation Way, Newark, Delaware 19711,2 Howard Hughes Medical Institute, UCLA-DOE Institute for Genomics and Proteomics, Department of Chemistry and Biochemistry, University of California, Los Angeles, Box 951570, Los Angeles, California 90095-1570,3 Department of Plant Cellular and Molecular Biology, The Ohio State University, 582 Aronoff Laboratory, 318 W. 12th Avenue, Columbus, Ohio 43210-12924 *Corresponding author. Mailing address: Department of Microbiology, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210-1292. Phone: (614) 292-4297. Fax: (614) 292-6337. E-mail: Tabita.1/at/osu.edu This article has been cited by other articles in PMC.Abstract Summary: About 30 years have now passed since it was discovered that microbes synthesize RubisCO molecules that differ from the typical plant paradigm. RubisCOs of forms I, II, and III catalyze CO2 fixation reactions, albeit for potentially different physiological purposes, while the RubisCO-like protein (RLP) (form IV RubisCO) has evolved, thus far at least, to catalyze reactions that are important for sulfur metabolism. RubisCO is the major global CO2 fixation catalyst, and RLP is a somewhat related protein, exemplified by the fact that some of the latter proteins, along with RubisCO, catalyze similar enolization reactions as a part of their respective catalytic mechanisms. RLP in some organisms catalyzes a key reaction of a methionine salvage pathway, while in green sulfur bacteria, RLP plays a role in oxidative thiosulfate metabolism. In many organisms, the function of RLP is unknown. Indeed, there now appear to be at least six different clades of RLP molecules found in nature. Consideration of the many RubisCO (forms I, II, and III) and RLP (form IV) sequences in the database has subsequently led to a coherent picture of how these proteins may have evolved, with a form III RubisCO arising from the Methanomicrobia as the most likely ultimate source of all RubisCO and RLP lineages. In addition, structure-function analyses of RLP and RubisCO have provided information as to how the active sites of these proteins have evolved for their specific functions. INTRODUCTION Virtually all the organic carbon found on earth is derived from oxidized inorganic sources such as gaseous carbon dioxide and carbon monoxide as well as soluble and insoluble bicarbonate and carbonate deposits. These various forms of inorganic carbon are in chemical equilibrium on earth, and the relative concentration of each species in specific environments is dependent on localized parameters such as temperature, pH, and pressure. Moreover, it is widely believed that levels of anthropogenic CO2 are steadily increasing in the earth's atmosphere, and predictions are that these levels will increase steadily, with consequent effects related to the potential warming of the earth. In order to surmount the rather considerable energy required to chemically convert oxidized inorganic carbon to reduced organic carbon on a global scale, living organisms and specific biological macromolecules eventually evolved to catalyze this process. Fortunately, terrestrial and marine plants and specialized microbes developed the ability to remove and assimilate considerable amounts of CO2 from the atmosphere and, in the process, formed the necessary organic carbon skeletons required to sustain the biosphere. More specifically, different enzymatic schemes evolved to catalyze inorganic carbon reduction such that there are currently four known metabolic pathways by which organisms can grow using CO2 as their sole source of carbon (28, 79). These include the Calvin-Benson-Bassham (CBB) reductive pentose phosphate pathway, the reductive tricarboxylic acid cycle, the Wood-Ljungdahl acetyl coenzyme A pathway, and the hydroxypropionate pathway. From a biogeochemical standpoint, the CBB reductive pentose phosphate pathway is by far the major means by which CO2 is reduced to form organic carbon. In this scheme, the sugar bisphosphate ribulose-1,5-bisphosphate (RuBP) serves as the acceptor molecule for CO2, with the enzyme RuBP carboxylase/oxygenase (RubisCO) catalyzing the actual primary CO2 fixation reaction. RubisCO is found in most autotrophic organisms, ranging from diverse prokaryotes, including photosynthetic and chemolithoautotrophic bacteria and archaea, to eukaryotic algae and higher plants. RubisCO is also clearly the most abundant protein found on earth (21), as it can comprise up to 50% of the total soluble protein found in leaf tissue or within specific microbes (67, 68). Such exaggerated abundance is most likely due to the poor catalytic efficiency of RubisCO, with a turnover number (~5 s−1) that is among the lowest for any biological catalyst (13, 68). DIFFERENT MOLECULAR FORMS FOR THE SAME (AND DIFFERENT) FUNCTIONS Classically, RubisCO is comprised of both large (catalytic) and small subunits to form a massive hexadecameric protein structure with an Mr of about 550,000, i.e., eight copies of both large (~ 55,000 Mr) and small (~ 15,000 Mr) polypeptides in an (L2)4(S4)2 structure (4, 35). This type of enzyme, now called form I, is the predominant RubisCO form found in nature, and it is present in terrestrial and marine plants, eukaryotic algae, cyanobacteria, and most phototrophic and chemolithoautotrophic proteobacteria (68). The name form I was originally used to distinguish this type of RubisCO from another structurally simpler form of the enzyme that was shown to be a dimer of only large subunits, which was discovered originally in the nonsulfur phototrophic bacterium Rhodospirillum rubrum (69, 70). Interestingly, another nonsulfur purple phototrophic bacterium, Rhodobacter sphaeroides, also appeared to contain this second structural form of RubisCO (albeit in higher aggregates of large subunits) and was originally isolated as a second peak of activity after ion-exchange fractionation of extracts from induced R. sphaeroides. Form I RubisCO was isolated from the same crude extracts, i.e., in the first activity peak that eluted from the column (29). Thus, the enzyme from the second activity peak (peak II), which contained the novel structural form analogous to R. rubrum RubisCO, was eventually called the form II enzyme to distinguish it from the first peak of activity or the form I enzyme. Form II RubisCO proteins were shown to catalyze the same reaction as form I RubisCO, and both enzymes catalyze an oxygen fixation reaction whereby the enediol of RuBP is attacked by molecular oxygen. The form II enzyme, comprised only of multimers of large-type subunits [(L2)x], shows only about 30% amino acid sequence identity to form I large subunits. In addition, form II enzymes all appear to be less efficient in partitioning the two gaseous substrates of RubisCO, CO2 and O2. Most importantly, the form II enzyme takes on a distinct physiological role, as it is used primarily to enable the CBB pathway to balance the redox potential of the cell under select growth conditions (19, 68, 74). To this day, the relative differences and similarities in primary structure serve as a convenient means to classify all the different forms of RubisCO found in nature. By the mid-1990s, it was recognized that the form I enzyme could be further classified, according to amino acid sequence homologies, as either “green” (cyanobacterial, algal, and plant) and “red” (phototrophic bacterial and nongreen eukaryotic algal) (16, 19, 67, 68, 74). As more RubisCO gene sequences became available, the green enzymes were further subdivided into forms IA and IB, and the red enzymes were subdivided into forms IC and ID (67, 68) (Fig. (Fig.1).1
The RubisCO-Like Protein (Form IV), a Homolog of RubisCO It was first noted in 1999 (68), via just-completed genomic sequencing projects, that the green sulfur phototrophic bacterium Chlorobium tepidum and the heterotroph Bacillus subtilis contained putative RubisCO genes that were clearly not of the form I and form II types. These RubisCO genes were initially thought to be in the newly discovered form III archaeal class since those sequences that were not of form I and form II all seemed to be quite different from each other as well as from the established form I and form II sequences (68). Interestingly, C. tepidum, though autotrophic, does not assimilate CO2 via the CBB pathway to obtain organic carbon, and B. subtilis does not use CO2 as a carbon source at all. Subsequent analyses showed that the putative RubisCO genes from these organisms were distinct from bona fide form III RubisCOs from archaea, as the C. tepidum and B. subtilis sequences both contain dissimilar residues at positions analogous to the mechanistically significant residues that are important for catalysis in RubisCO counterparts, and the purified recombinant C. tepidum protein was unable to catalyze RuBP-dependent carboxylation (31). Moreover, disruption of the gene in C. tepidum resulted in sulfur deposition into the surrounding media as well as distinct effects on autotrophic growth. Based on these studies and the fact that this protein resembles bona fide RubisCOs (about 35% identity at the amino acid level), the RubisCO homolog from C. tepidum was termed the RubisCO-like protein (RLP) and categorized as form IV RubisCO (31). Further studies confirmed the role of C. tepidum RLP in sulfur metabolism (thiosulfate oxidation), and its disruption led to a general stress response (30). As for B. subtilis, genetic studies (45, 63), followed by biochemical analyses (7), showed that its RLP (or YkrW/MtnW) participates in a methionine salvage pathway and catalyzes the enolization of the RuBP analog 2,3-diketo-5-methylthiopentyl-1-P. Based on phylogenetic analyses of currently available RLP sequences (see below), there appear to be six different clades of RLP or form IV RubisCO (Fig. (Fig.1);1 The RubisCO Superfamily at Present For years, RubisCO has been one of the most deeply sequenced protein families. However, up until about 10 years ago, except for a few microbial genes, most of the known RubisCO gene sequences were obtained from different plants, all of which were shown to be closely related. These earlier sequencing projects were all directed towards the obvious interest in this protein as a target for crop improvement. However, the recent profusion of microbial genome sequencing projects from truly diverse organisms plus the burgeoning documentation of metagenomic RubisCO sequences from environmental samples (2, 22, 23, 34, 35, 47, 55, 65, 75, 78) have opened up new vistas and suggest that meaningful evolutionary analysis of this protein may now be undertaken and proceed towards an informed conclusion. This analysis is feasible despite the rather obvious misannotation of various RubisCO sequences in the NCBI protein database as methionine sulfoxide reductase A (see GenBank accession NP_248230 for one example from Methanocaldococcus jannaschii). Phylogenetic analyses of RubisCO and RLP sequences indicate that there are at least three distinct lineages of bona fide RubisCO and six distinct clades of RLP molecules (Fig. (Fig.1).1
The six remaining clades in the RubisCO form IV (RLP) lineage have been termed IV-Photo (found in phototrophic bacteria), IV-NonPhoto (found in nonphototrophic bacteria), IV-AMC (acid mine consortia), IV-YkrW, IV-DeepYkr, and IV-GOS (global ocean sequencing sequencing program) based on characteristics of the source organisms, prior designation of the gene product, and/or relationship to other sequences (Fig. (Fig.1).1 Sequence Conservation in the RubisCO Superfamily Overall sequence conservation between lineages in the RubisCO large-subunit superfamily is detectable with an average of 31% amino acid sequence identity across the 193 nonredundant, full-length sequences analyzed (see Table S1 in the supplemental material). This sequence set included all available full-length RLP and metagenomic RubisCO amino acid sequences present in public databases as of April 2007, including those that recently became available from the global ocean-sampling (GOS) expedition (80). The minimum sequence identity observed between any two sequences in this data set was 6.9%, which is between a IV-DeepYkr sequence from the marine chlorophyte Ostreococcus tauri and a metagenomic form I sequence from the GOS sequence collection. Sequence conservation within a given lineage is variable but significantly higher than the average, ranging from an 85% mean in-group identity in the form IB lineage to 49% in the form IV-NonPhoto lineage (Table 1). The two exceptions to this rule are the form IV-DeepYkr lineage, which shares only 37% average in-group sequence identity, and the IV-GOS clade, which shares an average of 43% in-group sequence identity. The low sequence conservation and wide size range observed in both groups suggest that they may contain collections of single representatives of rarely observed RLP lineages. There is also relatively low concerted covariation of active-site residues in the IV-DeepYkr group, further suggesting that it may require further subdivision as more sequences become available. Furthermore, recent findings, based on mass spectrometry-based discrimination of expressed protein products, suggest that recombination of diverse genomes from acidophilic bacteria has occurred in an acid mine environment. Such genomic recombination apparently resulted in the creation of a chimeric RLP with potentially novel functions among the acidophiles of this environment (41). With respect to RLP, it will be interesting to determine how widespread such recombination events are and their physiological consequences.
Only four residues are absolutely conserved among all members of the RubisCO superfamily when they are aligned on the basis of sequence similarity, with no specific consideration of structural motifs. These residues are Gly-122/110/100, Lys-175/166/153, Asp-203/193/191, and Gly-322/316/297 in representative enzymes of form I, form II, and form III from Spinacia oleracea, Rhodospirillum rubrum, and Methanocaldococcus jannaschii, respectively. Relaxing this conservation requirement to 99% of the sequences analyzed results in the identification of 10 additional residues. Three of these highly conserved residues, Asp-198/188/176, Lys-201/191/179, and Asp-203/193/181, lie within the “RubisCO motif.” Lys-201/191/179 is the residue that becomes carbamylated when RubisCO is “activated” by CO2 in the presence of a divalent metal prior to the actual catalytic event (see Fig. Fig.8),8
Evidence for Distinct Functions among RLP Lineages (i) Active-site substitution patterns and implications from functional studies. Distinct histories for each RLP lineage are supported by a common pattern of active-site substitutions observed within a given lineage that is not shared with other lineages (Fig. (Fig.3).3 Given this variation, it seems unlikely that members of one lineage would functionally substitute for a member from another RLP family, although evidence exists otherwise (discussed below). Currently, detailed functional studies have been carried out for only four RLPs, C. tepidum RLP (30, 31), the YkrW/MtnW proteins of Bacillus subtilis and Geobacillus kaustophilus (8, 33, 45, 63), and the YkrW-like RLP from the cyanobacterium Microcystis aeruginosa (11). Thus far, the three-dimensional structures have been solved only for C. tepidum RLP (37), R. palustris RLP2 (this paper), and G. kaustophilus RLP (33, 39). The RLP from C. tepidum and the RLP2 from R. palustris are structurally very similar at the active site but possess four different active-site residues compared to the B. subtilis and G. kaustophilus proteins. Specific catalytic residues appear to be differentially conserved among the two lineages. The major difference is the Glu versus the Lys at Asn-123 (spinach RubisCO numbering), suggesting possible differences in hydrogen-bonding patterns with their respective substrates. In addition, Asn versus Val/Met identities at the Lys-177 position in C. tepidum versus B. subtilis groups of RLPs, respectively, may indicate different needs or participants for proton abstraction at the presumptive active site (see below), whereas Phe versus Pro identities at Arg-295, the residue that interacts with P2 phosphate in spinach RubisCO, likely indicate that each type of RLP reacts with distinct substrates with different hydrophobicities at the P2 site. The B. subtilis YkrW/MtnW protein and, more recently, its M. aeruginosa and G. kaustophilus RLP homologs, have all been shown to function as a 2,3-diketo-5-methylthiopentyl-1-phosphate enolase in the methionine salvage pathway. Thus, a B. subtilis mutant lacking YkrW/MtnW has a relatively constrained phenotype that is manifested only under severe sulfur starvation conditions (45, 63). Based on structural comparisons discussed elsewhere in this review, it appears that 2,3-diketo-5-methylthiopentyl-1-phosphate is not compatible with the active-site pocket in C. tepidum RLP or R. palustris RLP2 (39). Thus, it was not surprising that inactivating these genes resulted in strains with distinct phenotypic properties in different organisms. For example, an insertionally inactivated RLP mutant of C. tepidum (strain Ω::RLP) had a highly pleiotropic phenotype, with defects observed in pigmentation, the ability to metabolize some sulfur compounds, and the aberrant expression of stress response proteins (31). More specifically, strain Ω::RLP is unable to oxidize thiosulfate efficiently, although the ability to oxidize sulfide remains unperturbed (30). Strain Ω::RLP is also deficient in oxidizing elemental sulfur, as it was found to produce significantly more extracellular elemental sulfur than the wild type (31). A null mutation in the gene encoding RLP in C. tepidum also results in the overproduction of two oxidative stress response-related proteins, i.e., a thiol-specific antioxidant (Tsa) protein and superoxide dismutase. The levels of these two proteins are 12- and 3-fold enhanced, respectively, in the Ω::RLP strain compared with the wild type. The accumulation of these proteins correlates with the transcript levels of the corresponding genes (30). The Ω::RLP strain is also significantly more resistant to hydrogen peroxide exposure during growth than is the wild type (30). Further analyses indicate that the C. tepidum genome also encodes two potentially relevant transcriptional regulators, i.e., the ferric ion uptake regulator (Fur) and the peroxide regulator (PerR). Since these regulators are reported to be involved in the regulation of oxidative stress response genes in various bacteria including Escherichia coli, Bacillus subtilis, and Staphylococcus aureus, the possibility that RLP might be involved with the function of these regulators was considered. However, insertional inactivation of both the fur and perR genes of C. tepidum did not affect the accumulation of the Tsa and superoxide dismutase proteins in the Ω::RLP mutant strain (Singh and Tabita, unpublished). How RLP specifically contributes to sulfur oxidation and oxidative stress in chlorobia is still unknown. These areas have received relatively little experimental attention in chlorobia to date, although this is beginning to change with the exploitation of available genomic data (12). Genes encoding RLPs have been found in all Chlorobium genomes sequenced to date (see http://img.jgi.doe.gov/cgi-bin/pub/main.cgi for details) even though these strains vary considerably in the spectra of reduced sulfur compounds used to support growth. Regarding oxidative stress, Chlorobium sp. strain GSB1, recently isolated from a hydrothermal vent sample (9), was found to maintain viability during prolonged exposure to molecular oxygen only in the absence of light and sulfide. Clearly, experiments utilizing oxidative stress elicitors other than molecular oxygen (i.e., organic hydroperoxides, methyl viologen, or diamide) in addition to experiments examining the interplay of light, sulfur compounds, and oxygen are required. Such studies of stress physiology and sulfur oxidation will likely contribute to delineating the function of RLP in C. tepidum. Phylogenetic analyses (Fig. (Fig.1)1
(ii) Local gene conservation as an indicator of different functions. One method for assigning a physiological role for the functions of unassigned gene products is “guilt by association,” or examining the conservation of genes that are colocalized with the gene of interest across multiple genomes. This assumes that functionally related genes will be linearly inherited or laterally transferred as conserved functional modules. This was examined for each RLP lineage and all form III RubisCOs by aligning genomes in the Integrated Microbial Genomes database against each other, centered on the gene encoding RLP (Fig. (Fig.55
When the genomic regions surrounding the genes encoding the C. tepidum and B. subtilis RLPs were compared, distinct patterns of gene conservation were observed. In C. tepidum and other green sulfur bacteria, a tightly conserved core of five genes was found in seven strains, with complete sequence coverage across the area (Fig. (Fig.5B).5B Aside from the form I and II bona fide RubisCOs and the IV-Photo and IV-YkrW lineages discussed above, only three other examples of local gene conservation were found. Form III RubisCOs in Methanosarcina spp. share conserved gene organizations downstream, including a methyl coenzyme (CoM) reductase operon; the polC gene, encoding a DNA polymerase; and others (Fig. (Fig.5A).5A Genomic Context-Based Analyses of Diverse RLPs Suggests Functional Diversity As there are more than 300 complete genome sequences available, we used bioinformatic approaches to assist us in understanding potential functions of RLPs, most of which are uncharacterized proteins. Four genomic context-based methods were used to infer protein functions based on comparisons of hundreds of genome sequences. The phylogenetic profile method infers protein functional linkages between two proteins based on their correlated evolution in multiple genomes (53). The Rosetta Stone method infers the linkages based on the fusion of two protein-encoded genes in another genome (24, 43). The gene neighbor method assigns protein functional linkages based on the close proximity of two genes on the chromosomes in many genomes (14, 50), and the gene cluster method infers the linkages between two genes based on the operon structures in prokaryotic genomes (10, 54). We calculated the functional linkages of 11 RLP sequences out of 44 known sequences using a confidence threshold of 0.5. Based on the functional linkages, the 11 RLP sequences can be divided into two major groups (Fig. (Fig.6).6
In summary, relationships based on sequence similarity (see above) indicate the presence of three different lineages of bona fide RubisCO and a fourth lineage representing the RLPs that can perhaps be divided into six different subgroups. Further genetic and biochemical studies should eventually clarify the functions of each of the different RLP groups and shed further light on the evolution of RLP and RubisCO. Ultimately, the final test of functional conservation across lineages will be the heterologous expression of RLPs from different lineages in mutant strains lacking the cognate RLP for that particular organism. Early reports indicate that a form II RubisCO gene could complement a B. subtilis mutant lacking YkrW (8). In addition, a cyanobacterial (M. aeruginosa) RLP gene has also been shown to functionally complement the B. subtilis mutant (11). Detailed functional and structural relationships among bona fide RubisCO and RLP are extensively discussed below; clearly, bioinformatic analyses suggest discrete functions for at least some of the phylogenetically diverse RLPs discussed here. PROBING THE EVOLUTIONARY ORIGINS OF RubisCO: EVIDENCE FOR ARCHAEAL CENTRAL METABOLISM AS THE ULTIMATE SOURCE OF ALL EXTANT RubisCO AND RLP SEQUENCES The reconstruction of phylogenetic associations can be used to infer evolutionary relationships among related sequences. Evolutionary questions regarding the development of the bona fide RubisCOs and their relationships to RLPs are clearly of interest. The relationships between these lineages were examined with four different phylogenetic reconstruction methods (neighbor joining [NJ], minimum evolution [ME], unpaired group mean average [UPGMA], and maximum parsimony [MP]) (38) after examining amino acid distance data across all sequences via the program ProtTest to suggest an appropriate rate distribution gamma parameter (1) (Fig. (Fig.22 In every phylogenetic reconstruction examined, the bona fide RubisCOs (forms I to III) form a coherent clade, suggesting that they share a common line of descent. With minimum evolution and neighbor joining, forms I and II are late-descending nodes in a clade where the deepest branches are form III RubisCO and two additional RubisCO sequences from Methanosaeta thermophila and Methanospirillum hungatei. These two archaeal sequences consistently clade with one another and separate from other archaeal RubisCO sequences in form III. In addition, the sequence of the RubisCO from Methanococcoides burtonii, a methanogenic archaeon isolated from Antarctic marine sediments (60), consistently branches at the base of the form II clade in every method employed. These sequences are quite divergent, averaging only 28% (M. thermophila and M. hungatei) and 24% (M. burtonii) identity with all other RubisCO/RLP sequences. This consistent distribution of archaeal sequences at the base of clades containing all known bona fide RubisCO sequences suggests that this clade may have originated in the Archaea and subsequently been distributed to bacteria, eukaryotic algae, and higher plants. Overall bootstrap support is high for nodes in both methods with mean values of 75% and 83% for NJ and ME, respectively. The lowest bootstrap values were observed for internal nodes of the RLP cluster, while all terminal nodes are strongly supported. The two other methods employed to reconstruct RubisCO/RLP relationships, UPGMA and MP, display different relationships among forms I to III. UPGMA maintains the same two lineages of form III observed by MP and NJ methods but places them as a sister group to the M. thermophila and M. hungatei sequences. This archaeal cluster is a sister clade to all form I sequences by the UPGMA method. With MP, the form III sequences are rearranged into two different lineages (III* and III** in Fig. Fig.2D)2D Obviously, the tree topologies are highly dependent on the phylogenetic inference method employed. Both UPGMA and MP are known to be the most reliable for estimating trees in data sets where evolutionary rates are nearly constant across lineages (15), while NJ with rate correction was found to operate reliably when faced with variable rates across lineages. ProtTest analysis of the RubisCO/RLP sequence set indicated a moderate amount of rate variability that could confound UPGMA and MP analyses. Thus, the phylogenetic relationships inferred by NJ and ME, which indicate an archaeal origin for RubisCO/RLP, appear to be the most robust. RubisCO and RLP appear to be more prevalent in the euryarchaea, which, along with the crenarchaeota, are the two major branches of descent in the archaea as delineated by 16S rRNA gene sequence comparisons (18). There are only two crenarchaeal form III sequences known, those from Hyperthermus butylicus DSM 5456 (YP_001012710) and Thermofilum pendens Hrk-5 (YP_920628). Within the archaeal RubisCOs, there appears to be more flexibility in the range of residues accepted at active-site positions (Fig. (Fig.3),3 When the phylogenetic distribution of RubisCO/RLP lineages (Table 1) was examined, a single transfer of RLP from a methanogenic euryarchaeon into an ancestor of the Firmicutes, Proteobacteria, and Chlorobia, with subsequent lateral transfer to chloroflexi, followed by gene losses, could account for the distribution of most of the RLP lineages. Likewise, lateral transfer of a form III RubisCO from a euryarchaeon to a common ancestor of Cyanobacteria and Proteobacteria and eukaryote RubisCOs being acquired via subsequent endosymbiotic events could account for the distribution of bona fide RubisCO lineages observed. From these considerations, the likely evolutionary development of the large subunit of RubisCO and RLP follows the model depicted in Fig. Fig.9.9
The most recent phylogenies of the archaea based on concatenated protein trees for informational processes place the Thermococcales (Pyrococcus spp. and Thermococcus spp.) as the deepest-branching euryarchaeal group. Within the form III sequences, the Thermococcales sequences form a coherent clade with sequences from the Haloarchaea and A. fulgidus (III-1) that is separate from sequences in the methanogenic euryarchaea (III-2) (Fig. (Fig.9).9 The scenario outlined above and in Fig. Fig.99 The archaeal origin model of RubisCO/RLP evolution proposed here is substantially different from that reported previously by Ashida et al. (7) and Carre-Mlouka et al. (11), who speculated that bona fide RubisCOs arose in the YkrW lineage. However, those authors relied on much smaller sets of sequences and more limited numbers of phylogenetic reconstructions to reach their conclusions. Clearly, models of RubisCO evolution will themselves need to evolve, as new sequences are continuously being reported, especially in metagenomic sequencing projects. However, at this point, no new distinct RubisCO forms have been uncovered; thus, the basic conclusions reached in Fig. Fig.99 Non-RubisCO/RLP Structural Homologs In attempts to reconstruct the evolution of the RubisCO/RLP superfamily, identification of a non-RubisCO/RLP-related sequence that could serve as a root or source of the superfamily has thus far been problematic. Even using score filtering to eliminate highly similar sequences from traditional similarity searches like BLAST, PHI-BLAST, and PSI-BLAST, the number of RubisCO/RLP sequences currently in databases makes for a complicated morass of sequence data that must be navigated to identify potential non-RubisCO/RLP sequences related to any query. By contrast, structural homology searches may be a useful method for identifying distantly related proteins since the degree of sequence similarity of conserved protein structural elements may be too low to be detected by typical algorithms. Moreover, the search space is also much less crowded, as relatively few structures have been determined compared to the numbers of gene sequences available. Structurally, all RubisCO and RLP structures solved to date are members of the triose phosphate isomerase (TIM)/mutase fold, characterized by an eight-membered β/α-barrel motif, the TIM barrel (46, 76). The TIM barrel fold is composed of 32 superfamilies in the latest release of the Structural Classification of Proteins (SCOP) database (version 1.71 [http://scop.mrc-lmb.cam.ac.uk/scop/]), the largest number of structural superfamilies of any fold within the SCOP class of alpha and beta proteins (α/β). The functional flexibility of the TIM barrel scaffold has been well documented (reviewed by Anantharaman et al. [3]). The evolution of TIM barrel proteins has been previously examined, and the RubisCO superfamily was found to cluster with other TIM barrel superfamilies containing a sugar phosphate binding motif (76). However, a separate PSI-BLAST analysis failed to link the RubisCO structure with other TIM families (46). To identify structural homologs of RubisCO and RLP, a total of five RubisCO/RLP structures representing each major lineage (Protein Data Bank [PDB] accession numbers 1RBL [form I], 5RUB [form II], 1GEH [form III], 1YKW [form IV/RLP], and 2OEJ) were used to search the PDB structure collection using the DALI fold comparison search tool (http://www.ebi.ac.uk/dali/index.html). The DALI server was chosen based on its favorable evaluation relative to other fold comparison servers (48). Structural homologs were considered only if the average DALI Z score was >10 and the structure was identified by each of the five queries (Table 3). The number of homologs returned was driven primarily by PDB accession number 5RUB, the form II RubisCO structure from Rhodospirillum rubrum, which retrieved the fewest homologs.
As expected, additional RubisCO structures were identified as the closest structural homologs of the queries, with an average Z score of 45.3 for the structures under accession numbers 1GK8 (form I RubisCO from Chlamydomonas reinhardtii) (72), 1TEL (an independently solved structure of C. tepidum RLP) (33), and 2RUS (activated complex of the R. rubrum form II enzyme) (42). Beyond other RubisCO and RLP structures, the detected structural homologs were all superfamilies within the TIM barrel fold. The three closest homologs were from the ribulose phosphate binding barrel superfamily, PDB accession numbers 1KV8 (3-keto-l-gulonate-6-phosphate decarboxylase from E. coli) (77), 2TYS (tryptophan synthase from Salmonella enterica serovar Typhimurium) (56), and 1RPX (d-ribulose-5-phosphate-3-epimerase from Solanum tuberosum chloroplasts) (36). Two of these enzymes play key roles in central metabolic pathways of amino acid biosynthesis or sugar phosphate interconversions of the pentose phosphate pathway, as do most other members of this SCOP superfamily (see http://scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.d.b.c.html for details) (6). As expected for central metabolic enzymes, homologs of these sequences are encoded by nearly all archaeal genomes (see http://img.jgi.doe.gov/cgi-bin/pub/main.cgi for details). As outlined below, the form III RubisCO that we propose to be the evolutionary source of all other RubisCO and RLP sequences appears to have evolved to reclaim potentially dead-end five-carbon sugar bisphosphates and salvage them to central metabolic pathways. This structural comparison suggests that a second phosphate binding site may have been the key step in the evolution of RubisCO and that the source protein for RubisCO could have been recruited from a common, central metabolic pathway. Newer statistical methods of long-range phylogenetic reconstruction (i.e., hidden Markov models) may provide support for the structural comparison arguments posed above and identify specific candidates as the ultimate sources for the RubisCO superfamily. Physiological Role for Archaeal (Form III) RubisCO Previous studies have shown that genes that encode catalytically competent recombinant RubisCO (form III) are present in some archaea (25, 73), and the protein appears to be functional in some organisms (27). However, the lack of any demonstrable phosphoribulokinase (PRK) activity (or a gene that encodes this protein) from these same organisms that contain RubisCO has been a major curiosity, as such organisms seemingly would not possess a means to synthesize the unique keto sugar (RuBP) that is the substrate for RubisCO. This conundrum was recently addressed, with two possibilities considered: (i) form III archaeal RubisCO preferentially uses an alternative substrate and does not require RuBP for catalysis, or (ii) alternative means to synthesize RuBP that are unique to archaea exist. The first possibility, if true, would also suggest that RuBP-dependent RubisCO activity might have evolved from a protein that possessed some alternative activity, a theory espoused by those that believe that RLP is an evolutionary precursor to RubisCO (7, 11). However, exhaustive studies have thus far found no alternative to RuBP as a substrate or CO2 acceptor for archaeal (or any other) RubisCO (26), suggesting that it is unlikely that form III RubisCO is being used for anything other than producing 3-phosphoglyceric acid (PGA) from RuBP and CO2. In addition, the mesophilic archaeal RubisCO gene complemented a RubisCO deletion mutant of Rhodobacter capsulatus to autotrophic growth, showing that RuBP is also a substrate for this enzyme in vivo (27). With evidence pointing to RuBP as the exclusive substrate for archaeal RubisCO, how, then, does the organism synthesize RuBP? Again, negative data suggest that there is no demonstrable PRK activity in extracts from the organisms tested (26); moreover, analyses of the great majority of available genomes indicate no recognizable gene to encode PRK. Recently determined genomic sequences of Methanospirillum hungatei, Methanoculleus marisnigri, and Methanosaeta thermophila (http://img.jgi.doe.gov/cgi-bin/pub/main.cgi) represent the only archaeal organisms where potential PRK genes may exist. As for the vast majority of archaea, which possess no discernible PRK gene, a satisfying positive finding was the demonstration of a novel means to synthesize RuBP. Direct enzymatic assays using alternative substrates with extracts of Methanocaldococcus jannaschii provided evidence for a previously uncharacterized pathway for RuBP synthesis from 5-phosphoribose-d-1-pyrophosphate (PRPP) in M. jannaschii and other methanogenic archaea (26). Thus, these experiments, using PRPP as the sole substrate, resolved the need for a kinase dedicated to RuBP generation because PRPP already contains the relevant phosphates at both the C1 and C5 positions. Based on studies with other systems, it was hypothesized that either there is a selective enzymatic dephosphorylation step at the C1 position or nonenzymatic dephosphorylation occurs at the pyrophosphate at both moderate and high temperatures in the presence of magnesium at neutral pH (26). In either instance, the product would be ribose-1,5-bisphophosphate, a compound known to be synthesized in many other biological systems including macrophages and red blood cells under conditions of hypoxia (discussed in reference 24). Further indications of a novel and specific enzymatic reaction(s) was the stoichiometric conversion of PRPP to RuBP using extracts of M. jannaschii, such that that one molecule of PRPP was converted to two molecules of PGA. These results provided experimental verification for the proposed pathway (26). Inhibition of the PRPP-to-PGA conversion in vitro by both the RubisCO transition state analog CABP and antibodies to M. jannaschii RubisCO convincingly reinforced the idea that RubisCO catalysis is essential to convert PRPP to PGA. The proposed unique enzymatic step of this pathway is the conversion of ribose-1,5-bisphosphate, or ribose-1,2 cyclic phosphate-5-P (ribose-1,2cP-5-P), to RuBP. This work thus identified a novel means to synthesize the CO2 acceptor and substrate for RubisCO in the absence of a detectable kinase such as PRK. More recently, studies with Thermococcus kodakarensis confirmed and greatly extended those studies and again pointed to ribose-1,5-bisphosphate as the direct precursor to RuBP (59). Moreover, Sato et al. identified the enzymes and the requisite structural genes, including RubisCO, that are involved in a pathway of AMP metabolism (59). In that scheme, AMP, which could be produced from PRPP, is acted upon by an AMP phosphorylase to produce ribose-1,5-bisphosphate, followed by a ribose-1,5-bisphosphate isomerase, to yield RuBP. Both studies proposed that this route to RuBP might point to unique evolutionary links between purine-pyrimidine recycling pathways and the CBB cycle, with RubisCO catalysis and PRPP/AMP metabolism providing the needed anaplerotic levels of PGA (26, 59). Apparently, the genes of the AMP-to-RuBP pathway are conserved in virtually all archaea that contain form III RubisCO (59), suggesting that this might be a universal means by which archaea employ RubisCO in metabolism. RubisCO AND RLP STRUCTURES: SIMILAR YET DIFFERENT ENOUGH The structure of the RLP from C. tepidum was recently solved to a resolution of 2.0 Å and shown to be a homodimer of large subunits, similar to various form II and form III RubisCOs (39). Indeed, the overall secondary structures of individual monomeric units of bona fide RubisCOs from all sources and form IV (RLP) are quite similar. Each subunit of C. tepidum RLP is composed of a smaller N-terminal domain and a larger C-terminal domain. The N-terminal domain, residues 1 to 45, consists of a four-stranded β-sheet with helices on one side of the sheet. The C-terminal domain, residues 146 to 435, consists of an eight-stranded α/β-barrel with two additional small α-helices forming a cap at the C terminus (39). Like form I, form II, and form III RubisCOs, the presumptive active site of RLP is located in the subunit interface between the C-terminal domain of one subunit and the N-terminal domain of another subunit (Fig. (Fig.10).10
Loop 6, which is in the C-terminal α/β-barrel domain of RubisCO, plays an important role in catalysis (13). Among multiple form I RubisCO structures, loop 6 has been observed to partition between the “open” and “closed” conformations (20, 61, 62). In C. tepidum RLP, loop 6 is ordered and adopts a closed conformation similar to that found in the structure of activated RubisCO (PDB accession number 8RUC), although no substrate is bound at the active site. Loop 6 folds over and closes the active site. The backbone of a key residue on loop 6, Arg-327, superimposes well with that of Lys-334 in form I RubisCO (Fig. (Fig.10).10 In this review, we also report the second crystal structure of an RLP from the IV-Photo clade, RLP2 from R. palustris (PDB accession number 2QYG) (Table 4). As described above, the structure of R. palustris RLP2 is very similar to the structure of C. tepidum RLP, with a Cα atom rmsd of 0.8 Å (Fig. (Fig.11).11
More recently, structure-function studies of a YkrW-type RLP from Geobacillus kaustophilus (previously Bacillus stearothermophilus) established the structural basis for the “enolase” function of YkrW. Evidence points to the involvement of Lys-98 in proton abstraction, with this residue likely serving as the general base during catalysis, much as Lys-201 (or its equivalent in different forms) serves as the general base during RubisCO catalysis (Fig. (Fig.8)8 The N-terminal 18 residues in the Photo-type RLPs are missing in the YkrW type. In addition, there are two main differences between the structures of the Photo-type RLP and YkrW-type RLP in the N-terminal domain. Loop CD in G. kaustophilus RLP becomes a helix and slightly swings away from loop 6 but forms a tighter interaction interface with loop CD (which should be helix CD in this case) from the other monomer. The second main difference is in the region of residues 47 to 58, which was previously missing from the C. tepidum RLP structure. This region is less flexible in the structure of R. palustris RLP2, forming a loop, and partly becomes a helix in the structure of G. kaustophilus RLP (Fig. (Fig.11).11 Potential of Structural Comparisons To Enhance Functional Studies Combinations of several techniques such as structural analysis, sequence alignments, site-directed mutagenesis, and chemical modifications have been used to deduce the roles of several active-site residues in RubisCO (reviewed in references 5 and 66). Although we lack knowledge regarding the functions of different RLPs, the overall structural similarity and yet the subtle differences that they share with bona fide RubisCO enzymes (discussed above), present us with a strong rationale for carrying out genetic engineering studies to facilitate an understanding of the physiological roles of RLPs. Since structurally divergent RubisCO enzymes all employ a common reaction mechanism, it is not unreasonable to expect that subtle changes in the RLP structure could perhaps introduce catalytic competency for RuBP carboxylation/oxygenation. The most obvious targets for genetically engineering such changes are those RubisCO active-site residues that have been altered in the RLPs. Although nonidentical, only a few of these residue identities are nonconservative relative to the nature of the side chains of their RubisCO counterparts. Taking the C. tepidum RLP as an example, the nonidentical active-site residues are Q49, E119, N174, F288, I320, R327, G357, S359, and R383. The corresponding residues are T65, N123, K177, R295, H327, K334, S379, G381, and G404, respectively, in the form I (spinach) RubisCO (Fig. (Fig.10).10 Comparison of Secondary Structural Elements Unique to RLP and RubisCO: Possible Implications for RLP Structure-Function Relationships As noted above, sequence and structural alignments of the three bona fide forms of RubisCO with the form IV RLPs (Fig. (Fig.1212
The second structural region that demarcates RLPs from the three forms of RubisCO is a β-hairpin structure that appears to be juxtaposed by the N-terminal domain on one side and the C-terminal domain on the other side in all three forms of bona fide authentic RubisCO enzymes (Fig. (Fig.15).15
CONCLUSIONS AND OUTLOOK About 30 years have passed since it was discovered that microbes synthesize RubisCO molecules that differ from the typical plant paradigm. Clearly, three separate bona fide forms of RubisCO (forms I, II, and III) have now been described, each of which catalyzes the carboxylation or oxygenation of RuBP, albeit for potentially different physiological purposes. Moreover, a fourth class, the RLPs, or form IV proteins, is clearly structurally related to bona fide RubisCO, yet the RLPs do not function as RubisCO enzymes, but thus far, they all seem to catalyze reactions involved in sulfur metabolism. However, RubisCO and some RLPs do possess functional similarities in that both proteins catalyze reactions using analogous substrates in both cases via an initial enolization-type reaction. The great preponderance of RLP sequences now available has further shown that there are, at present, six different clades of RLPs, some of which appear to possess different physiological roles. Indeed, RubisCO and RLP molecules have now been described for each of the three recognized types of living organisms, and the huge number of sequences now available has allowed a coherent picture of the likely evolutional events that took place to account for the different classes of RubisCOs and RLPs to emerge. Our analyses are compatible with an archaeal origin of both RubisCO and RLP, with form III proteins from the Methanomicrobia being the likely precursors for all modern RubisCO and RLP lineages. Certainly, as additional information becomes available, we and others will build upon and/or challenge this hypothesis. However, at this time, no other evolutional scheme is compatible with the data. Finally, structural and functional studies of RubisCO and RLP will continue to provide information as to how the active sites of these proteins have become adapted for their specific functions. Acknowledgments Work in our laboratories was supported by NIH grant GM24497 and DOE grant DE-FG02-91ER20033 (F.R.T.), by NSF Career Award MCB-0447649 (T.E.H.), and by DOE-BER (H.L. and S.C.). We thank Simona Romagnoli for the rlp2 clone and Yim Wu for her assistance in protein purification and crystallization. Footnotes †Supplemental material for this article may be found at http://mmbr.asm.org. REFERENCES 1. Abascal, F., R. Zardoya, and D. Posada. 2005. Prottest: selection of best-fit models of protein evolution. Bioinformatics 21:2104-2105. [PubMed] 2. Alfreider, A., C. Vogt, D. Hoffmann, and W. Babel. 2003. Diversity of ribulose-1,5-bisphosphate carboxylase/oxygenase large-subunit genes from groundwater and aquifer microorganisms. Microb. Ecol. 45:317-328. [PubMed] 3. Anantharaman, V., L. Aravind, and E. V. Koonin. 2003. Emergence of diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins. Curr. Opin. Chem. Biol. 7:12-20. [PubMed] 4. Andersson, I. 1996. Large structures at high resolution: the 1.6 Å crystal structure of spinach ribulose-1,5-bisphosphate carboxylase/oxygenase complexed with 2-carboxyarabinitol bisphosphate. J. Mol. Biol. 259:160-174. [PubMed] 5. Andersson, I., and T. C. Taylor. 2003. Structural framework for catalysis and regulation in ribulose-1,5-bisphosphate carboxylase/oxygenase. Arch. Biochem. Biophys. 414:130-140. [PubMed] 6. Andreeva, A., D. Howorth, S. E. Brenner, T. J. Hubbard, C. Chothia, and A. G. Murzin. 2004. SCOP database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res. 32:D226-229. [PubMed] 7. Ashida, H., A. Danchin, and A. Yokota. 2005. Was photosynthetic Rubisco recruited by acquisitive evolution from Rubisco-like proteins involved in sulfur metabolism? Res. Microbiol. 156:611-618. [PubMed] 8. Ashida, H., Y. Saito, C. Kojima, K. Kobayashi, N. Ogasawara, and A. Yokota. 2003. A functional link between Rubisco-like protein of Bacillus and photosynthetic Rubisco. Science 302:286-290. [PubMed] 9. Beatty, J. T., J. Overmann, M. T. Lince, A. K. Manske, A. S. Lang, R. E. Blankenship, C. L. Van Dover, T. A. Martinson, and F. G. Plumley. 2005. An obligately photosynthetic bacterial anaerobe from a deep-sea hydrothermal vent. Proc. Natl. Acad. Sci. USA 102:9306-9310. [PubMed] 10. Bowers, P. M., M. Pellegrini, M. J. Thompson, J. Fierro, T. O. Yeates, and D. Eisenberg. 2004. Prolinks: a database of protein functional linkages derived from coevolution. Genome Biol. 5:R35. [PubMed] 11. Carre-Mlouka, A., A. Mejean, P. Quillardet, H. Ashida, Y. Saito, A. Yokota, I. Callebaut, A. Sekowska, E. Dittmann, C. Bouchier, and N. T. de Marsac. 2006. A new Rubisco-like protein coexists with a photosynthetic Rubisco in the planktonic cyanobacteria Microcystis. J. Biol. Chem. 281:24462-24471. [PubMed] 12. Chan, L. K., R. Morgan-Kiss, and T. E. Hanson. Sulfur oxidation in Chlorobium tepidum (syn. Chlorobaculum tepidum): genetic and proteomic analyses. In C. Dahl and C. G. Friedrich (ed.), Proceedings of the International Symposium on Microbial Sulfur Metabolism, in press. Springer, New York, NY. 13. Cleland, W. W., J. T. Andrews, S. Gutteridge, F. C. Hartman, and G. H. Lorimer. 1998. Mechanism of RubisCO: the carbamate as general base. Chem. Rev. 98:549-561. [PubMed] 14. Dandekar, T., B. Snel, M. Huynen, and P. Bork. 1998. Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem. Sci. 23:324-328. [PubMed] 15. DeBry, R. W. 1992. The consistency of several phylogeny-inference methods under varying evolutionary rates. Mol. Biol. Evol. 9:537-551. [PubMed] 16. Delwiche, C. F., and J. D. Palmer. 1996. Rampant horizontal transfer and duplication of Rubisco genes in eubacteria and plastids. Mol. Biol. Evol. 13:873-882. [PubMed] 17. Derelle, E., C. Ferraz, S. Rombauts, P. Rouze, A. Z. Worden, S. Robbens, F. Partensky, S. Degroeve, S. Echeynie, R. Cooke, Y. Saeys, J. Wuyts, K. Jabbari, C. Bowler, O. Panaud, B. Piegu, S. G. Ball, J. P. Ral, F. Y. Bouget, G. Piganeau, B. De Baets, A. Picard, M. Delseny, J. Demaille, Y. Van de Peer, and H. Moreau. 2006. Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features. Proc. Natl. Acad. Sci. USA 103:11647-11652. [PubMed] 18. Doolittle, W. F. 1999. Phylogenetic classification and the universal tree. Science 284:2124-2129. [PubMed] 19. Dubbs, J. M., and F. R. Tabita. 2004. Regulators of nonsulfur purple phototrophic bacteria and the interactive control of CO2 assimilation, nitrogen fixation, hydrogen metabolism and energy generation. FEMS Microbiol. Rev. 28:353-376. [PubMed] 20. Duff, A. P., T. J. Andrews, and P. M. Curmi. 2000. The transition between the open and closed states of Rubisco is triggered by the inter-phosphate distance of the bound bisphosphate. J. Mol. Biol. 298:903-916. [PubMed] 21. Ellis, R. J. 1979. Most abundant protein in the world. Trends Biochem. Sci. 4:241-244. 22. Elsaied, H., and T. Naganuma. 2001. Phylogenetic diversity of ribulose-1,5-bisphosphate carboxylase/oxygenase large-subunit genes from deep-sea microorganisms. Appl. Environ. Microbiol. 67:1751-1765. [PubMed] 23. Elsaied, H. E., H. Kimura, and T. Naganuma. 2007. Composition of archaeal, bacterial, and eukaryal RuBisCO genotypes in three Western Pacific arc hydrothermal vent systems. Extremophiles 11:191-202. [PubMed] 24. Enright, A. J., I. Iliopoulos, N. C. Kyrpides, and C. A. Ouzounis. 1999. Protein interaction maps for complete genomes based on gene fusion events. Nature 402:86-90. [PubMed] 25. Ezaki, S., N. Maeda, T. Kishimoto, H. Atomi, and T. Imanaka. 1999. Presence of a structurally novel type ribulose-bisphosphate carboxylase/oxygenase in the hyperthermophilic archaeon, Pyrococcus kodakaraensis KOD1. J. Biol. Chem. 274:5078-5082. [PubMed] 26. Finn, M. W., and F. R. Tabita. 2004. Modified pathway to synthesize ribulose 1,5-bisphosphate in methanogenic archaea. J. Bacteriol. 186:6360-6366. [PubMed] 27. Finn, M. W., and F. R. Tabita. 2003. Synthesis of catalytically active form III ribulose 1,5-bisphosphate carboxylase/oxygenase in archaea. J. Bacteriol. 185:3049-3059. [PubMed] 28. Fuchs, G., S. Lange, E. Rude, S. Schaefer, R. Schauder, R. Scholtz, and E. Stupperich. 1987. Autotrophic CO2 fixation in chemotrophic anaerobic bacteria, p. 39-43. In H. W. Van Verseveld and J. A. Duine (ed.), Microbial growth on C1 compounds. Martinus Nijhoff, Dordrecht, The Netherlands. 29. Gibson, J. L., and F. R. Tabita. 1977. Different molecular forms of D-ribulose-1,5-bisphosphate carboxylase from Rhodopseudomonas sphaeroides. J. Biol. Chem. 252:943-949. [PubMed] 30. Hanson, T. E., and F. R. Tabita. 2003. Insights into the stress response and sulfur metabolism revealed by proteome analysis of a Chlorobium tepidum mutant lacking the Rubisco-like protein. Photosynth. Res. 78:231-248. [PubMed] 31. Hanson, T. E., and F. R. Tabita. 2001. A ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO)-like protein from Chlorobium tepidum that is involved with sulfur metabolism and the response to oxidative stress. Proc. Natl. Acad. Sci. USA 98:4397-4402. [PubMed] 32. Hartman, F. C., and M. R. Harpel. 1994. Structure, function, regulation, and assembly of D-ribulose-1,5-bisphosphate carboxylase/oxygenase. Annu. Rev. Biochem. 63:197-234. [PubMed] 33. Imker, H. J., A. A. Fedorov, E. V. Fedorov, S. C. Almo, and J. A. Gerlt. 2007. Mechanistic diversity in the RuBisCO superfamily: the “enolase” in the methionine salvage pathway in Geobacillus kaustophilus. Biochemistry 46:4077-4089. [PubMed] 34. John, D. E., B. Wawrik, F. Tabita, and J. H. Paul. 2006. Gene diversity and organization in rbcL-containing genome fragments from uncultivated Synechococcus in the Gulf of Mexico. Mar. Ecol. Prog. Ser. 316:23-33. 35. Knight, S., I. Andersson, and C. I. Branden. 1990. Crystallographic analysis of ribulose 1,5-bisphosphate carboxylase from spinach at 2.4 Å resolution. Subunit interactions and active site. J. Mol. Biol. 215:113-160. [PubMed] 36. Kopp, J., S. Kopriva, K. H. Suss, and G. E. Schulz. 1999. Structure and mechanism of the amphibolic enzyme D-ribulose-5-phosphate 3-epimerase from potato chloroplasts. J. Mol. Biol. 287:761-771. [PubMed] 37. Kreel, N. E., and F. R. Tabita. 2007. Substitutions at methionine 295 of Archaeoglobus fulgidus ribulose-1,5-bisphosphate carboxylase/oxygenase affect oxygen binding and CO2/O2 specificity. J. Biol. Chem. 282:1341-1351. [PubMed] 38. Kumar, S., K. Tamura, and M. Nei. 2004. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinform. 5:150-163. [PubMed] 39. Li, H., M. R. Sawaya, F. R. Tabita, and D. Eisenberg. 2005. Crystal structure of a RuBisCO-like protein from the green sulfur bacterium Chlorobium tepidum. Structure 13:779-789. [PubMed] 40. Li, L. A., and F. R. Tabita. 1994. Transcription control of ribulose bisphosphate carboxylase/oxygenase activase and adjacent genes in Anabaena species. J. Bacteriol. 176:6697-6706. [PubMed] 41. Lo, I., V. J. Denef, N. C. Verberkmoes, M. B. Shah, D. Goltsman, G. DiBartolo, G. W. Tyson, E. E. Allen, R. J. Ram, J. C. Detter, P. Richardson, M. P. Thelen, R. L. Hettich, and J. F. Banfield. 2007. Strain-resolved community proteomics reveals recombining genomes of acidophilic bacteria. Nature 446:537-541. [PubMed] 42. Lundqvist, T., and G. Schneider. 1991. Crystal structure of the ternary complex of ribulose-1,5-bisphosphate carboxylase, Mg(II), and activator CO2 at 2.3-Å resolution. Biochemistry 30:904-908. [PubMed] 43. Marcotte, E. M., M. Pellegrini, H. L. Ng, D. W. Rice, T. O. Yeates, and D. Eisenberg. 1999. Detecting protein function and protein-protein interactions from genome sequences. Science 285:751-753. [PubMed] 44. Morse, D., P. Salois, P. Markovic, and J. W. Hastings. 1995. A nuclear-encoded form II RuBisCO in dinoflagellates. Science 268:1622-1624. [PubMed] 45. Murphy, B. A., F. J. Grundy, and T. M. Henkin. 2002. Prediction of gene function in methylthioadenosine recycling from regulatory signals. J. Bacteriol. 184:2314-2318. [PubMed] 46. Nagano, N., C. A. Orengo, and J. M. Thornton. 2002. One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. J. Mol. Biol. 321:741-765. [PubMed] 47. Nanba, K., G. M. King, and K. Dunfield. 2004. Analysis of facultative lithotroph distribution and diversity on volcanic deposits by use of the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase. Appl. Environ. Microbiol. 70:2245-2253. [PubMed] 48. Novotny, M., D. Madsen, and G. J. Kleywegt. 2004. Evaluation of protein fold comparison servers. Proteins 54:260-270. [PubMed] 49. Ott, C. M., B. D. Smith, A. R. Portis, Jr., and R. J. Spreitzer. 2000. Activase region on chloroplast ribulose-1,5-bisphosphate carboxylase/oxygenase. Nonconservative substitution in the large subunit alters species specificity of protein interaction. J. Biol. Chem. 275:26241-26244. [PubMed] 50. Overbeek, R., M. Fonstein, M. D'Souza, G. D. Pusch, and N. Maltsev. 1999. The use of gene clusters to infer functional coupling. Proc. Natl. Acad. Sci. USA 96:2896-2901. [PubMed] 51. Palmer, J. D. 1995. Rubisco rules fall; gene transfer triumphs. Bioessays 17:1005-1008. [PubMed] 52. Palmer, J. D. 1996. Rubisco surprises in dinoflagellates. Plant Cell 8:343-345. [PubMed] 53. Pellegrini, M., E. M. Marcotte, M. J. Thompson, D. Eisenberg, and T. O. Yeates. 1999. Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc. Natl. Acad. Sci. USA 96:4285-4288. [PubMed] 54. Pellegrini, M., M. Thompson, J. Fierro, and P. Bowers. 2001. Computational method to assign microbial genes to pathways. J. Cell. Biochem. Suppl. 37:106-109. [PubMed] 55. Pichard, S. L., L. Campbell, and J. H. Paul. 1997. Diversity of the ribulose bisphosphate carboxylase/oxygenase form I gene (rbcL) in natural phytoplankton communities. Appl. Environ. Microbiol. 63:3600-3606. [PubMed] 56. Rhee, S., K. D. Parris, C. C. Hyde, S. A. Ahmed, E. W. Miles, and D. R. Davies. 1997. Crystal structures of a mutant (betaK87T) tryptophan synthase alpha2beta2 complex with ligands bound to the active sites of the alpha- and beta-subunits reveal ligand-induced conformational changes. Biochemistry 36:7664-7680. [PubMed] 57. Robbens, S., E. Derelle, C. Ferraz, J. Wuyts, H. Moreau, and Y. Van de Peer. 2007. The complete chloroplast and mitochondrial DNA sequence of Ostreococcus tauri: organelle genomes of the smallest eukaryote are examples of compaction. Mol. Biol. Evol. 24:956-968. [PubMed] 58. Rowan, R., S. M. Whitney, A. Fowler, and D. Yellowlees. 1996. Rubisco in marine symbiotic dinoflagellates: form II enzymes in eukaryotic oxygenic phototrophs encoded by a nuclear multigene family. Plant Cell 8:539-553. [PubMed] 59. Sato, T., H. Atomi, and T. Imanaka. 2007. Archaeal type III RuBisCOs function in a pathway for AMP metabolism. Science 315:1003-1006. [PubMed] 60. Saunders, N. F., T. Thomas, P. M. Curmi, J. S. Mattick, E. Kuczek, R. Slade, J. Davis, P. D. Franzmann, D. Boone, K. Rusterholtz, R. Feldman, C. Gates, S. Bench, K. Sowers, K. Kadner, A. Aerts, P. Dehal, C. Detter, T. Glavina, S. Lucas, P. Richardson, F. Larimer, L. Hauser, M. Land, and R. Cavicchioli. 2003. Mechanisms of thermal adaptation revealed from the genomes of the Antarctic archaea Methanogenium frigidum and Methanococcoides burtonii. Genome Res. 13:1580-1588. [PubMed] 61. Schneider, G., Y. Lindqvist, and C. I. Branden. 1992. Rubisco: structure and mechanism. Annu. Rev. Biophys. Biomol. Struct. 21:119-143. [PubMed] 62. Schreuder, H. A., S. Knight, P. M. Curmi, I. Andersson, D. Cascio, C. I. Branden, and D. Eisenberg. 1993. Formation of the active site of ribulose-1,5-bisphosphate carboxylase/oxygenase by a disorder-order transition from the unactivated to the activated form. Proc. Natl. Acad. Sci. USA 90:9968-9972. [PubMed] 63. Sekowska, A., and A. Danchin. 2002. The methionine salvage pathway in Bacillus subtilis. BMC Microbiol. 2:8. [PubMed] 64. Soper, T. S., R. J. Mural, F. W. Larimer, E. H. Lee, R. Machanoff, and F. C. Hartman. 1988. Essentiality of Lys-329 of ribulose-1,5-bisphosphate carboxylase/oxygenase from Rhodospirillum rubrum as demonstrated by site-directed mutagenesis. Protein Eng. 2:39-44. [PubMed] 65. Spiridonova, E. M., I. A. Berg, T. V. Kolganova, R. N. Ivanovskii, B. B. Kuznetsov, and T. P. Turova. 2004. An oligonucleotide primer system for amplification of the ribulose-1,5-bisphosphate carboxylase/oxygenase genes of bacteria of various taxonomic groups. Mikrobiologiia 73:377-387. (In Russian.) [PubMed] 66. Spreitzer, R. J., and M. E. Salvucci. 2002. Rubisco: structure, regulatory interactions, and possibilities for a better enzyme. Annu. Rev. Plant Biol. 53:449-475. [PubMed] 67. Tabita, F. R. 1995. The biochemistry and regulation of carbon metabolism and CO2 fixation in purple bacteria, p. 885-914. In R. E. Blankenship, M. T. Madigan, and C. E. Bauer (ed.), Anoxygenic photosynthetic bacteria. Kluwer Academic Publishers, Dordrecht, The Netherlands. 68. Tabita, F. R. 1999. Microbial ribulose 1,5-bisphosphate carboxylase/oxygenase: a different perspective. Photosynth. Res. 60:1-28. 69. Tabita, F. R., and B. A. McFadden. 1974. d-Ribulose 1,5-diphosphate carboxylase from Rhodospirillum rubrum. I. Levels, purification, and effects of metallic ions. J. Biol. Chem. 249:3453-3458. [PubMed] 70. Tabita, F. R., and B. A. McFadden. 1974. d-Ribulose 1,5-diphosphate carboxylase from Rhodospirillum rubrum. II. Quaternary structure, composition, catalytic, and immunological properties. J. Biol. Chem. 249:3459-3464. [PubMed] 71. Reference deleted. 72. Taylor, T. C., A. Backlund, K. Bjorhall, R. J. Spreitzer, and I. Andersson. 2001. First crystal structure of Rubisco from a green alga, Chlamydomonas reinhardtii. J. Biol. Chem. 276:48159-48164. [PubMed] 73. Watson, G. M., J. P. Yu, and F. R. Tabita. 1999. Unusual ribulose 1,5-bisphosphate carboxylase/oxygenase of anoxic archaea. J. Bacteriol. 181:1569-1575. [PubMed] 74. Watson, G. M. F., and F. R. Tabita. 1997. Microbial ribulose 1,5-bisphosphate carboxylase/oxygenase: a molecule for phylogenetic and enzymological investigation. FEMS Microbiol. Lett. 146:13-22. [PubMed] 75. Wawrik, B., J. H. Paul, and F. R. Tabita. 2002. Real-time PCR quantification of rbcL (ribulose-1,5-bisphosphate carboxylase/oxygenase) mRNA in diatoms and pelagophytes. Appl. Environ. Microbiol. 68:3771-3779. [PubMed] 76. Wierenga, R. K. 2001. The TIM-barrel fold: a versatile framework for efficient enzymes. FEBS Lett. 492:193-198. [PubMed] 77. Wise, E., W. S. Yew, P. C. Babbitt, J. A. Gerlt, and I. Rayment. 2002. Homologous (beta/alpha)-barrel enzymes that catalyze unrelated reactions: orotidine 5′-monophosphate decarboxylase and 3-keto-L-gulonate 6-phosphate decarboxylase. Biochemistry 41:3861-3869. [PubMed] 78. Xu, H. H., and F. R. Tabita. 1996. Ribulose-1,5-bisphosphate carboxylase/oxygenase gene expression and diversity of Lake Erie planktonic microorganisms. Appl. Environ. Microbiol. 62:1913-1921. [PubMed] 79. Yoon, K. S., T. E. Hanson, J. L. Gibson, and F. R. Tabita. 2000. Autotrophic CO2 metabolism, p. 349-358. In J. Lederburg (ed.), Encyclopedia of microbiology, 2nd ed. Academic Press Inc., San Diego, CA. 80. Yooseph, S., G. Sutton, D. B. Rusch, A. L. Halpern, S. J. Williamson, K. Remington, J. A. Eisen, K. B. Heidelberg, G. Manning, W. Li, L. Jaroszewski, P. Cieplak, C. S. Miller, H. Li, S. T. Mashiyama, M. P. Joachimiak, C. van Belle, J. M. Chandonia, D. A. Soergel, Y. Zhai, K. Natarajan, S. Lee, B. J. Raphael, V. Bafna, R. Friedman, S. E. Brenner, A. Godzik, D. Eisenberg, J. E. Dixon, S. S. Taylor, R. L. Strausberg, M. Frazier, and J. C. Venter. 2007. The Sorcerer II global ocean sampling expedition: expanding the universe of protein families. PLoS Biol. 5:e16. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||
Chem Rev. 1998 Apr 2; 98(2):549-562.
[Chem Rev. 1998]J Mol Biol. 1996 May 31; 259(1):160-74.
[J Mol Biol. 1996]J Mol Biol. 1990 Sep 5; 215(1):113-60.
[J Mol Biol. 1990]J Biol Chem. 1974 Jun 10; 249(11):3453-8.
[J Biol Chem. 1974]J Biol Chem. 1974 Jun 10; 249(11):3459-64.
[J Biol Chem. 1974]J Biol Chem. 1977 Feb 10; 252(3):943-9.
[J Biol Chem. 1977]Mol Biol Evol. 1996 Jul; 13(6):873-82.
[Mol Biol Evol. 1996]FEMS Microbiol Rev. 2004 Jun; 28(3):353-76.
[FEMS Microbiol Rev. 2004]FEMS Microbiol Lett. 1997 Jan 1; 146(1):13-22.
[FEMS Microbiol Lett. 1997]J Biol Chem. 1999 Feb 19; 274(8):5078-82.
[J Biol Chem. 1999]J Bacteriol. 1999 Mar; 181(5):1569-75.
[J Bacteriol. 1999]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Photosynth Res. 2003; 78(3):231-48.
[Photosynth Res. 2003]J Bacteriol. 2002 Apr; 184(8):2314-8.
[J Bacteriol. 2002]BMC Microbiol. 2002 Apr 25; 2():8.
[BMC Microbiol. 2002]Res Microbiol. 2005 Jun-Jul; 156(5-6):611-8.
[Res Microbiol. 2005]Microb Ecol. 2003 May; 45(4):317-28.
[Microb Ecol. 2003]Appl Environ Microbiol. 2001 Apr; 67(4):1751-65.
[Appl Environ Microbiol. 2001]Extremophiles. 2007 Jan; 11(1):191-202.
[Extremophiles. 2007]J Mol Biol. 1990 Sep 5; 215(1):113-60.
[J Mol Biol. 1990]Appl Environ Microbiol. 2004 Apr; 70(4):2245-53.
[Appl Environ Microbiol. 2004]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]J Bacteriol. 2003 May; 185(10):3049-59.
[J Bacteriol. 2003]J Bacteriol. 1999 Mar; 181(5):1569-75.
[J Bacteriol. 1999]Chem Rev. 1998 Apr 2; 98(2):549-562.
[Chem Rev. 1998]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]Structure. 2005 May; 13(5):779-89.
[Structure. 2005]PLoS Biol. 2007 Mar; 5(3):e16.
[PLoS Biol. 2007]Nature. 2007 Mar 29; 446(7135):537-41.
[Nature. 2007]Photosynth Res. 2003; 78(3):231-48.
[Photosynth Res. 2003]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]J Bacteriol. 2002 Apr; 184(8):2314-8.
[J Bacteriol. 2002]J Bacteriol. 2002 Apr; 184(8):2314-8.
[J Bacteriol. 2002]BMC Microbiol. 2002 Apr 25; 2():8.
[BMC Microbiol. 2002]Structure. 2005 May; 13(5):779-89.
[Structure. 2005]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Photosynth Res. 2003; 78(3):231-48.
[Photosynth Res. 2003]Photosynth Res. 2003; 78(3):231-48.
[Photosynth Res. 2003]Proc Natl Acad Sci U S A. 2005 Jun 28; 102(26):9306-10.
[Proc Natl Acad Sci U S A. 2005]Photosynth Res. 2003; 78(3):231-48.
[Photosynth Res. 2003]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Res Microbiol. 2005 Jun-Jul; 156(5-6):611-8.
[Res Microbiol. 2005]Proc Natl Acad Sci U S A. 1999 Apr 13; 96(8):4285-8.
[Proc Natl Acad Sci U S A. 1999]Nature. 1999 Nov 4; 402(6757):86-90.
[Nature. 1999]Science. 1999 Jul 30; 285(5428):751-3.
[Science. 1999]Trends Biochem Sci. 1998 Sep; 23(9):324-8.
[Trends Biochem Sci. 1998]Proc Natl Acad Sci U S A. 1999 Mar 16; 96(6):2896-901.
[Proc Natl Acad Sci U S A. 1999]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]J Biol Chem. 2006 Aug 25; 281(34):24462-71.
[J Biol Chem. 2006]Brief Bioinform. 2004 Jun; 5(2):150-63.
[Brief Bioinform. 2004]Bioinformatics. 2005 May 1; 21(9):2104-5.
[Bioinformatics. 2005]Genome Res. 2003 Jul; 13(7):1580-8.
[Genome Res. 2003]Mol Biol Evol. 1992 May; 9(3):537-51.
[Mol Biol Evol. 1992]Science. 1999 Jun 25; 284(5423):2124-9.
[Science. 1999]Proc Natl Acad Sci U S A. 2006 Aug 1; 103(31):11647-52.
[Proc Natl Acad Sci U S A. 2006]Mol Biol Evol. 2007 Apr; 24(4):956-68.
[Mol Biol Evol. 2007]Science. 1995 Jun 16; 268(5217):1622-4.
[Science. 1995]Bioessays. 1995 Dec; 17(12):1005-8.
[Bioessays. 1995]Plant Cell. 1996 Mar; 8(3):343-5.
[Plant Cell. 1996]Res Microbiol. 2005 Jun-Jul; 156(5-6):611-8.
[Res Microbiol. 2005]J Biol Chem. 2006 Aug 25; 281(34):24462-71.
[J Biol Chem. 2006]J Mol Biol. 2002 Aug 30; 321(5):741-65.
[J Mol Biol. 2002]FEBS Lett. 2001 Mar 16; 492(3):193-8.
[FEBS Lett. 2001]Curr Opin Chem Biol. 2003 Feb; 7(1):12-20.
[Curr Opin Chem Biol. 2003]Proteins. 2004 Feb 1; 54(2):260-70.
[Proteins. 2004]J Biol Chem. 2001 Dec 21; 276(51):48159-64.
[J Biol Chem. 2001]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]Biochemistry. 1991 Jan 29; 30(4):904-8.
[Biochemistry. 1991]Biochemistry. 2002 Mar 26; 41(12):3861-9.
[Biochemistry. 2002]Biochemistry. 1997 Jun 24; 36(25):7664-80.
[Biochemistry. 1997]J Biol Chem. 1999 Feb 19; 274(8):5078-82.
[J Biol Chem. 1999]J Bacteriol. 1999 Mar; 181(5):1569-75.
[J Bacteriol. 1999]J Bacteriol. 2003 May; 185(10):3049-59.
[J Bacteriol. 2003]Res Microbiol. 2005 Jun-Jul; 156(5-6):611-8.
[Res Microbiol. 2005]J Biol Chem. 2006 Aug 25; 281(34):24462-71.
[J Biol Chem. 2006]J Bacteriol. 2004 Oct; 186(19):6360-6.
[J Bacteriol. 2004]Nature. 1999 Nov 4; 402(6757):86-90.
[Nature. 1999]Science. 2007 Feb 16; 315(5814):1003-6.
[Science. 2007]Structure. 2005 May; 13(5):779-89.
[Structure. 2005]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Arch Biochem Biophys. 2003 Jun 15; 414(2):130-40.
[Arch Biochem Biophys. 2003]Chem Rev. 1998 Apr 2; 98(2):549-562.
[Chem Rev. 1998]J Mol Biol. 2000 May 19; 298(5):903-16.
[J Mol Biol. 2000]Annu Rev Biophys Biomol Struct. 1992; 21():119-43.
[Annu Rev Biophys Biomol Struct. 1992]Proc Natl Acad Sci U S A. 1993 Nov 1; 90(21):9968-72.
[Proc Natl Acad Sci U S A. 1993]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]Res Microbiol. 2005 Jun-Jul; 156(5-6):611-8.
[Res Microbiol. 2005]J Biol Chem. 2006 Aug 25; 281(34):24462-71.
[J Biol Chem. 2006]Structure. 2005 May; 13(5):779-89.
[Structure. 2005]Arch Biochem Biophys. 2003 Jun 15; 414(2):130-40.
[Arch Biochem Biophys. 2003]Annu Rev Plant Biol. 2002; 53():449-75.
[Annu Rev Plant Biol. 2002]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4397-402.
[Proc Natl Acad Sci U S A. 2001]Annu Rev Biochem. 1994; 63():197-234.
[Annu Rev Biochem. 1994]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]J Biol Chem. 2000 Aug 25; 275(34):26241-4.
[J Biol Chem. 2000]J Bacteriol. 1994 Nov; 176(21):6697-706.
[J Bacteriol. 1994]Structure. 2005 May; 13(5):779-89.
[Structure. 2005]Brief Bioinform. 2004 Jun; 5(2):150-63.
[Brief Bioinform. 2004]Bioinformatics. 2005 May 1; 21(9):2104-5.
[Bioinformatics. 2005]Bioinformatics. 2005 May 1; 21(9):2104-5.
[Bioinformatics. 2005]Chem Rev. 1998 Apr 2; 98(2):549-562.
[Chem Rev. 1998]Science. 2003 Oct 10; 302(5643):286-90.
[Science. 2003]Biochemistry. 2007 Apr 3; 46(13):4077-89.
[Biochemistry. 2007]