![]() | ![]() |
Formats:
|
||||||||||||||
Copyright © 2008 Rife et al; licensee BioMed Central Ltd. Sequence and structural evolution of the KsgA/Dim1 methyltransferase family 1Department of Medicinal Chemistry and Institute for Structural Biology and Drug Discovery, Virginia Commonwealth University, Richmond, Virginia 23219, USA 2Department of Biology, University of Rochester, Rochester, New York 14627, USA Corresponding author.Heather C O'Farrell: hcofarrell/at/vcu.edu; Zhili Xu: zxu6/at/mail.rochester.edu; Gloria M Culver: gculver/at/mail.rochester.edu; Jason P Rife: jason.rife/at/vcu.edu Received July 31, 2008; Accepted October 29, 2008. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background One of the 60 or so genes conserved in all domains of life is the ksgA/dim1 orthologous group. Enzymes from this family perform the same post-transcriptional nucleotide modification in ribosome biogenesis, irrespective of organism. Despite this common function, divergence has enabled some family members to adopt new and sometimes radically different functions. For example, in S. cerevisiae Dim1 performs two distinct functions in ribosome biogenesis, while human mtTFB is not only an rRNA methyltransferase in the mitochondria but also a mitochondrial transcription factor. Thus, these proteins offer an unprecedented opportunity to study evolutionary aspects of structure/function relationships, especially with respect to our recently published work on the binding mode of a KsgA family member to its 30S subunit substrate. Here we compare and contrast KsgA orthologs from bacteria, eukaryotes, and mitochondria as well as the paralogous ErmC enzyme. Results By using structure and sequence comparisons in concert with a unified ribosome binding model, we have identified regions of the orthologs that are likely related to gains of function beyond the common methyltransferase function. There are core regions common to the entire enzyme class that are associated with ribosome binding, an event required in rRNA methylation activity, and regions that are conserved in subgroups that are presumably related to non-methyltransferase functions. Conclusion The ancient protein KsgA/Dim1 has adapted to cellular roles beyond that of merely an rRNA methyltransferase. These results provide a structural foundation for analysis of multiple aspects of ribosome biogenesis and mitochondrial transcription. Findings Ribosome biogenesis is a fundamental process in all cells, requiring the consumption of large quantities of cellular resources under the control of an extraordinary level of regulation. In comparing prokaryotic and eukaryotic ribosome biogenesis pathways, the conservation of the KsgA/Dim1 family is unique. The presence and function of this enzyme has been maintained in every evolutionary lineage, including eukaryotic organelles. KsgA catalyzes the conversion of two adjacent adenosines in the small subunit rRNA (A1518 and A1519 of 16S rRNA, E. coli numbering) to N6,N6-dimethyladenosines [1]. The KsgA family carries out this core methyltransferase function in all domains of life, including organelles, but has also added new roles as cellular organization became more complex. In eukaryotes, the KsgA ortholog Dim1 is essential for proper processing of the pre-18S small subunit rRNA [2], and Dim1 knockout is lethal. Pfc1, which is the KsgA ortholog found in chloroplasts of Arabidopsis thaliana, is important for chloroplast formation under chilling conditions [3]. A distinct eukaryotic ortholog, mtTFB, is transported into the mitochondria where, in addition to methylating the small subunit rRNA, it has adopted the ribosomally unrelated function of serving as a mitochondrial transcription factor [4,5]. In some mitochondria, there are two separate mtTFB proteins, mtTFB1 and mtTFB2, which are proposed to have arisen from a gene duplication event [5]. There is evidence that mtTFB1 has retained stronger methyltransferase activity, while mtTFB2 is more active as a transcription factor [5,6]. The fungi have only a single mtTFB, suggesting either loss of one of the paralogs in this lineage, or that the duplication occurred later in evolution. In at least one case, S. cerevisiae, the single mtTFB protein (sc-mtTFB) serves as a transcription factor but has lost its methyltransferase activity entirely [7]. sc-mtTFB lacks significant sequence homology to any of the KsgA/Dim1 enzymes; yeast mtTFBs are generally poorly conserved and difficult to identify via sequence homology [8]. Another important offshoot of the KsgA lineage is the Erm family of methyltransferases, which confer antibiotic resistance by methylating A2058 of the 23S rRNA [9]. The present-day Erm family almost certainly resulted from one or more gene duplications of a KsgA gene and subsequent evolution to permit recognition of a distinct target base [10]. KsgA's remarkable degree of conservation, coupled with the adaptation of new cellular functions, give us a unique opportunity to look at structural/function evolution of a single protein lineage. Previous studies have established the structural similarity between some rRNA adenosine dimethyltransferases [11,12]. Based on recent work in our groups we can now make predictions about how these proteins interact with their respective ribosomal targets. We also examine structural variations that may be important to the varied functions of KsgA orthologs and paralogs. Results E. coli KsgA [12], human Dim1 (hDim1; A. Dong, H. Wu, H. Zeng, P. Loppnau, M. Sundstrom, C. Arrowsmith, A. Edwards, A. Bochkarev and A. Plotnikov, unpublished data), Plasmodium falciparum Dim1 [13], and sc-mtTFB [11] have been characterized structurally. Figure Figure11
Divergent orthologues of bacterial KsgA, including Dim1 from the eukaryote S. cerevisiae and the archaeon Methanocaldococcus jannaschii as well as h-mtTFB1 and h-mtTFB2, can all complement for KsgA function in E. coli [5,14,15]. Therefore, it is expected that these functional orthologues all interact with substrate ribosome particles in a similar manner. We recently reported a mode of binding for KsgA onto 30S subunits [16]. As was noted in this work, the binding site for KsgA on 30S involves protein interactions with 16S rRNA in helix 44 and in the 790 loop, but not in helix 45, the site of the methylated adenosines. When 16S rRNA conservation is assessed across all three domains and the ribosome-containing organelles, regions that are important for translation show the highest evolutionary conservation [17]. Included in the set of highly conserved nucleotides are residues at the base of helix 44 and the 790 loop which interact with KsgA [16]. The high level of rRNA conservation at the KsgA binding site has likely constrained evolution of protein residues that interact with the rRNA, allowing evolutionarily divergent orthologues to recognize bacterial 30S subunits as a substrate. In order to assess KsgA conservation at the amino acid level, we aligned thirty-two KsgA, Dim1, mtTFB1 and Pfc1 proteins [see Additional file 1]. Since the mtTFB2 and mtTFB branches have become stronger transcription factors at the expense of the methyltransferase activity [5,6], we omitted these branches from this analysis. We used the program Expresso to construct the alignment and assess the conservation at each amino acid position [18], and then mapped the results onto the KsgA structure. RNA conservation was taken from the minimal ribosome modeled by Mears et al [17]. Figure Figure22
The KsgA enzymes share the Rossman-like structural fold common to many SAM-dependent methyltransferases [12,19]. This fold consists of a seven-stranded beta sheet surrounded by a variable number of alpha helices, with well-defined binding pockets for SAM and the target nucleotide. In this way, the methyltransferase function has a clear structural basis. However, the structural basis for the other functions of the KsgA/Dim1 enzymes is unknown. Extensive sequence alignments of KsgA, Dim1 and mtTFB enzymes identify elements unique to each group. The addition of specific inserts relative to KsgA likely has permitted the eukaryotic Dim1 and mtTFB enzymes to gain functions while maintaining their roles as methyltransferases. Eukaryotic Dim1 has a large insert in the C-terminal domain compared to KsgA (Figure (Figure3a).3a
A similar analysis can be performed by comparing KsgA and sc-mtTFB. mtTFB proteins have three inserts relative to KsgA proteins [20]; all of these inserts lie in the N-terminal domain (Figures (Figures3c3c sc-mtTFB, and possibly other fungal TFB proteins, have lost their methyltransferase activity during their evolution as transcription factors [7]. Figure Figure44
Structural comparison of KsgA and ErmC' shows that the two proteins are very similar, with the most notable divergence being in the C-terminal domains (Figure (Figure5a).5a
Conclusion To our knowledge, the KsgA proteins are unique in both their level of conservation and their functional flexibility. KsgA was one of the first rRNA methyltransferases to be identified and has received episodic scrutiny over the last thirty years. Despite a collection of important observations, a thorough understanding of this protein family's role in ribosome biogenesis has been elusive. Certainly, the KsgA/Dim1 family has significance beyond the methyltransferase activity that was first described. Sequence and structural comparisons of these proteins, in concert with data about the binding of KsgA to the 30S subunit, implicate certain regions that might be important to the various functions of KsgA orthologs. Further experiments will help define individual characteristics of this important family of proteins. Competing interests The authors declare that they have no competing interests. Authors' contributions All authors discussed results and contributed to the manuscript. HCO compiled sequences and prepared alignments. JPR docked proteins onto the 30S subunit structure. Additional file 1 Sequence alignment of KsgA orthologs. Archaeal, eukaryotic, and mitochondrial KsgA orthologues were identified by performing a genomic BLAST search using the E. coli protein sequence (accession number P06992) as the query sequence. Organisms were chosen to represent a broad evolutionary diversity of species. The structure-based sequence alignment was perfomed using the program Expresso [18]. Structures used for the alignment were 1QYR[12], 1ZQ9 (A. Dong, H. Wu, H. Zeng, P. Loppnau, M. Sundstrom, C. Arrowsmith, A. Edwards, A. Bochkarev and A. Plotnikov, unpublished data), and 2H1R[13]. Organisms represented are as follows. Eukaryotes: Arabidopsis thaliana (at), Dictyostelium discoideum (dd), Leishmania brazilensis (lb), Giardia lamblia (gl), Plasmodium vivax (pv), Homo sapiens (hs), Saccharomyces cerevisiae (sc), Drosophila melanogaster (dm), and Caenorhabditis elegans (ce). Archaea: Methanopyrus kandleri (mk), Methanosaeta thermophila (mth), Haloquadratum walsbyi (hw), Methanoculleus marisnigri (mma), Methanocaldococcus jannaschii (mj), Pyrococcus horikoshii (ph), Methanosphaera stadtmanae (ms), Picrophilus torridus (pt), Archaeoglobus fulgidus (af), Aeropyrum pernix (ap), Sulfolobus solfataricus (ss), Pyrobaculum aerophilum (pa), and Cenarchaeum symbiosum (cs). Bacteria: Synechococcus elongatus (se), Bacillus subtilis (bs), Mycobacterium tuberculosis (mtu), Thermus thermophilus (tt), Bacteroides fragilis (bf), Chlamydia trachomatis (ct), Borrelia burgdorferi (bb), and Escherichia coli (ec). Accession numbers for each sequence are found in Additional file 4. Click here for file(280K, pdf) Additional file 2 Sequence alignment of mtTFB, mtTFB1, and mtTFB2 proteins. The structure-based sequence alignment was perfomed using the program Expresso [18]. The structure 1I4W[11] was used for the alignment. Organisms represented are Caenorhabditis elegans (ce), Homo sapiens (hs), Drosophila melanogaster (dm), Anopheles gambiae (ag), Apis mellifera (am), Xenopus laevis (xl), Takifugu rubripes (tr), Ciona intestinalis (ci), Rattus norvegicus (rn), Pan troglodytes (pt), Mus musculus (mmu), Bos taurus (bt), Tetraodon nigroviridis (tn), Saccharomyces cerevisiae (sc), Schizosaccharomyces pombe (sp), Kluyveromyces lactis (kl), Eremothecium gossypii (eg), Candida albicans (ca), Dictyostelium discoideum (dd), Trypanosoma brucei (tb), and Leishmania major (lm). Accession numbers for each sequence are found in Additional file 4. Click here for file(355K, pdf) Additional file 3 Sequence alignment of Erm enzymes. Erm enzymes were identified using the Nomenclature Center for MLS Genes, maintained by Dr. Marilyn C. Roberst [28]. One member of each class was chosen, with two exceptions. ErmI was not used because a corresponding sequence could not be found. Erm32 was not used because this enzyme methylates G748 rather than A2058 [29]. The structure based sequence alignment was performed with Expresso [18]. Structures used for the alignment were 1QAM[23] and 1YUB[24]. Organisms represented are Staphylococcus aureus, Enterococcus faecalis, Bacillus subtilis, Bacillus licheniformis, Saccharopolyspora erythraea, Bacteroides fragilis, Lysinibacillus sphaericus, Streptomyces thermotolerans, Streptomyces fradiae, Streptomyces coelicolor, Clostridium perfringens, Aeromicrobium erythreum, Lactobacillus reuteri, Streptomyces lincolnensis, Streptomyces viridochromogenes, Micromonospora griseorubida, Corynebacterium jeikeium, Streptomyces ambofaciens, Streptomyces venezuelae, Staphylococcus sciuri, Bacillus clausii, Bacteroides coprosuis, Micrococcus luteus, Mycobacterium tuberculosis, Mycobacterium smegmatis, Mycobacterium fortuitum, Mycobacterium mageritense, and Mycobacterium abscessus, Accession numbers are found in Additional file 4. Click here for file(230K, pdf) Additional file 4 Sequences used in protein alignments. Sequences were compiled from NCBI and Ensembl; organisms and accession numbers are indicated. Click here for file(88K, pdf) Acknowledgements This work was supported by the U.S. National Institutes of Health grants GM066900 to JPR and GM62432 to GMC. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
Nat New Biol. 1972 Jan 5; 235(53):6-9.
[Nat New Biol. 1972]Genes Dev. 1995 Oct 15; 9(20):2470-81.
[Genes Dev. 1995]Plant Cell. 1998 May; 10(5):699-711.
[Plant Cell. 1998]Mol Gen Genet. 1989 Oct; 219(1-2):125-8.
[Mol Gen Genet. 1989]Mol Cell. 2006 Dec 28; 24(6):813-25.
[Mol Cell. 2006]Nucleic Acids Res. 2007; 35(12):4042-54.
[Nucleic Acids Res. 2007]J Mol Biol. 1975 Sep 25; 97(3):337-50.
[J Mol Biol. 1975]Gene Expr. 1996; 6(4):219-30.
[Gene Expr. 1996]Antimicrob Agents Chemother. 1995 Mar; 39(3):577-85.
[Antimicrob Agents Chemother. 1995]Mol Gen Genet. 1988 Dec; 215(1):152-5.
[Mol Gen Genet. 1988]Protein Sci. 2001 Oct; 10(10):1980-8.
[Protein Sci. 2001]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]Mol Biochem Parasitol. 2007 Jan; 151(1):100-10.
[Mol Biochem Parasitol. 2007]Protein Sci. 2001 Oct; 10(10):1980-8.
[Protein Sci. 2001]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]Protein Sci. 2001 Oct; 10(10):1980-8.
[Protein Sci. 2001]Mol Cell. 2006 Dec 28; 24(6):813-25.
[Mol Cell. 2006]J Mol Biol. 1994 Aug 19; 241(3):492-7.
[J Mol Biol. 1994]RNA. 2006 May; 12(5):725-33.
[RNA. 2006]Nat Struct Mol Biol. 2008 May; 15(5):534-6.
[Nat Struct Mol Biol. 2008]J Mol Biol. 2002 Aug 9; 321(2):215-34.
[J Mol Biol. 2002]Nat Struct Mol Biol. 2008 May; 15(5):534-6.
[Nat Struct Mol Biol. 2008]Mol Cell. 2006 Dec 28; 24(6):813-25.
[Mol Cell. 2006]Nucleic Acids Res. 2007; 35(12):4042-54.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2006 Jul 1; 34(Web Server issue):W604-8.
[Nucleic Acids Res. 2006]Nat Struct Mol Biol. 2008 May; 15(5):534-6.
[Nat Struct Mol Biol. 2008]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]Mol Biol Evol. 2006 Jun; 23(6):1169-79.
[Mol Biol Evol. 2006]Genes Dev. 1997 Nov 1; 11(21):2897-909.
[Genes Dev. 1997]J Mol Biol. 1975 Sep 25; 97(3):337-50.
[J Mol Biol. 1975]Biochemistry. 1998 May 19; 37(20):7103-12.
[Biochemistry. 1998]Nat Struct Biol. 1997 Jun; 4(6):483-9.
[Nat Struct Biol. 1997]Protein Sci. 2001 Oct; 10(10):1980-8.
[Protein Sci. 2001]J Mol Biol. 1999 Jun 4; 289(2):277-91.
[J Mol Biol. 1999]Nat Struct Biol. 2001 Feb; 8(2):121-5.
[Nat Struct Biol. 2001]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]J Mol Biol. 1999 Jun 4; 289(2):277-91.
[J Mol Biol. 1999]Antimicrob Agents Chemother. 1995 Mar; 39(3):577-85.
[Antimicrob Agents Chemother. 1995]Nucleic Acids Res. 2003 Aug 15; 31(16):4941-9.
[Nucleic Acids Res. 2003]Nat Struct Mol Biol. 2008 May; 15(5):534-6.
[Nat Struct Mol Biol. 2008]J Mol Biol. 1999 Jun 4; 289(2):277-91.
[J Mol Biol. 1999]Nucleic Acids Res. 2006 Jul 1; 34(Web Server issue):W604-8.
[Nucleic Acids Res. 2006]J Mol Biol. 2004 May 28; 339(2):337-53.
[J Mol Biol. 2004]Mol Biochem Parasitol. 2007 Jan; 151(1):100-10.
[Mol Biochem Parasitol. 2007]Nucleic Acids Res. 2006 Jul 1; 34(Web Server issue):W604-8.
[Nucleic Acids Res. 2006]Protein Sci. 2001 Oct; 10(10):1980-8.
[Protein Sci. 2001]Mol Microbiol. 2000 Aug; 37(4):811-20.
[Mol Microbiol. 2000]Nucleic Acids Res. 2006 Jul 1; 34(Web Server issue):W604-8.
[Nucleic Acids Res. 2006]J Mol Biol. 1999 Jun 4; 289(2):277-91.
[J Mol Biol. 1999]Nat Struct Biol. 1997 Jun; 4(6):483-9.
[Nat Struct Biol. 1997]