![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2009 Voß et al; licensee BioMed Central Ltd. Biocomputational prediction of non-coding RNAs in model cyanobacteria 1University of Freiburg, Faculty of Biology, Genetics and Experimental Bioinformatics, Schänzlestr. 1, D-79104 Freiburg, Germany 2Freiburg Initiative in Systems Biology, Schänzlestr. 1, D-79104 Freiburg, Germany Corresponding author.Björn Voß: bjoern.voss/at/biologie.uni-freiburg.de; Jens Georg: jens.georg/at/biologie.uni-freiburg.de; Verena Schön: schoen.verena/at/web.de; Susanne Ude: Susanne.Ude/at/gmx.de; Wolfgang R Hess: wolfgang.hess/at/biologie.uni-freiburg.de Received August 29, 2008; Accepted March 23, 2009. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background In bacteria, non-coding RNAs (ncRNA) are crucial regulators of gene expression, controlling various stress responses, virulence, and motility. Previous work revealed a relatively high number of ncRNAs in some marine cyanobacteria. However, for efficient genetic and biochemical analysis it would be desirable to identify a set of ncRNA candidate genes in model cyanobacteria that are easy to manipulate and for which extended mutant, transcriptomic and proteomic data sets are available. Results Here we have used comparative genome analysis for the biocomputational prediction of ncRNA genes and other sequence/structure-conserved elements in intergenic regions of the three unicellular model cyanobacteria Synechocystis PCC6803, Synechococcus elongatus PCC6301 and Thermosynechococcus elongatus BP1 plus the toxic Microcystis aeruginosa NIES843. The unfiltered numbers of predicted elements in these strains is 383, 168, 168, and 809, respectively, combined into 443 sequence clusters, whereas the numbers of individual elements with high support are 94, 56, 64, and 406, respectively. Removing also transposon-associated repeats, finally 78, 53, 42 and 168 sequences, respectively, are left belonging to 109 different clusters in the data set. Experimental analysis of selected ncRNA candidates in Synechocystis PCC6803 validated new ncRNAs originating from the fabF-hoxH and apcC-prmA intergenic spacers and three highly expressed ncRNAs belonging to the Yfr2 family of ncRNAs. Yfr2a promoter-luxAB fusions confirmed a very strong activity of this promoter and indicated a stimulation of expression if the cultures were exposed to elevated light intensities. Conclusion Comparison to entries in Rfam and experimental testing of selected ncRNA candidates in Synechocystis PCC6803 indicate a high reliability of the current prediction, despite some contamination by the high number of repetitive sequences in some of these species. In particular, we identified in the four species altogether 8 new ncRNA homologs belonging to the Yfr2 family of ncRNAs. Modelling of RNA secondary structures indicated two conserved single-stranded sequence motifs that might be involved in RNA-protein interactions or in the recognition of target RNAs. Since our analysis has been restricted to find ncRNA candidates with a reasonable high degree of conservation among these four cyanobacteria, there might be many more, requiring direct experimental approaches for their identification. Background In bacteria, non-coding RNAs (ncRNAs) are a heterogeneous group of sequence-specific regulators of gene expression, normally lacking a protein-coding function. They are typically 50–250 nucleotides in length [1], and regulate mRNA translation or decay but sometimes also directly modulate certain protein functions. Most stress responses in the organism best-studied in this respect, E. coli, include at least one small regulatory RNA as part of the regulon [2]. However, their functions also include the control of plasmid and viral replication [3], bacterial virulence [4], quorum sensing [5], or the acquired resistance against bacteriophages [6]. In many cases, these ncRNAs function through sequence-specific base pairing; hence they frequently have a (partial) base complementarity to their target RNA molecules. The vast majority of known ncRNAs is encoded at genomic locations far away from their target genes. However, some ncRNAs are transcribed from the reverse complementary strand of the respective target and hence these are fully or partially overlapping with their target RNAs, constituting the class of antisense RNAs. Except for the more common types of ncRNA (ribosomal RNA, tRNA, tmRNA, 6S RNA, RNAse P RNA and ffs RNA), genes encoding ncRNAs are not annotated during standard genome analysis. The efforts to accomplish their identification in bacteria can broadly be divided into (i) sequencing the population of small RNAs or (ii) prediction by bioinformatics tools (mostly) followed by experimental verification (see [7] for review). As a result of such systematic searches, more than 80 ncRNAs are now known in E. coli, most of which had been overlooked by traditional genome analysis. Cyanobacteria currently raise considerable interest as they perform oxygenic photosynthesis, fix atmospheric CO2 and nitrogen, frequently produce large quantities of bioactive secondary metabolites and due to their potential for the production of biofuels. As long as there is sufficient light available for photosynthesis, cyanobacteria populate widely diverse environments such as freshwater, the oceans, rock surfaces, desert soil or the polar regions. Their adaptation to vastly different environmental conditions suggests the existence of sophisticated regulatory mechanisms. Therefore, various types of regulatory RNA can be expected that interplay with the different signal transduction pathways and stress responses. Indeed, computational-experimental screens based on comparative genome analysis identified seven different ncRNAs in the marine cyanobacteria Prochlorococcus and Synechococcus [8] which were called Yfr1-7 for cYanobacterial Functional RNA. In a follow-up study making use of high density microarrays and exploiting the genome information from meanwhile 12 different Prochlorococcus genome sequences, additional 14 ncRNAs and 24 antisense RNAs were found [9]. Unicellular marine cyanobacteria of these genera provide an excellent dataset for computational predictions that require comparative genome information since currently 22 different genome sequences from very closely related isolates are available [10,11]. However, a major bottleneck in the work with these marine cyanobacteria is that despite some recent progress [12], protocols for genetic manipulation are very slow or not available at all. Therefore, the finding that two of these ncRNAs are phylogenetically widely distributed enabled direct genetic work on their functional relevance: Yfr1 is distributed throughout the cyanobacterial radiation [13] and might play a role in the adaptation to redox stress or the regulation of carbon uptake [14], whereas Yfr7 was identified as the homolog of the 6S RNA [15] which is found in all eubacteria [16]. However, for efficient genetic and biochemical analysis of cyanobacterial ncRNAs it would be very desirable to identify a set of ncRNA candidate genes in model cyanobacteria that are easy to manipulate and for which extended mutant, transcriptomic and proteomic data sets are available. In addition to Yfr1 which exists in all four unicellular cyanobacteria targeted here [13], the only currently known ncRNAs in model cyanobacteria are an antisense RNA covering the ferric uptake regulator gene furA in Anabaena PCC 7120 over its full length [17], and the antisense RNA IsrR, regulating the gene for the light-absorbing protein IsiA under conditions of iron limitation and redox stress in the unicellular Synechocystis PCC 6803 [18]. In recent years, comparative genomics-based prediction of ncRNA genes has become a standard method to search for such genes within bacterial genomes [8,19-23]. Thus, the availability of genome sequences from closely enough related species is a critical factor as is the conservation of ncRNAs. In case of unicellular cyanobacterial model organisms, the lack of genome sequences from close relatives has been hampering such studies. With the recent release of the Microcystis aeruginosa NIES-843 genome [24], however, a cyanobacterium relatively close to Synechocystis has been sequenced. Here we set out to identify possible ncRNA genes and other RNA elements (5' leader sequences and riboswitches) in the three unicellular model cyanobacteria Synechocystis PCC6803, Synechococcus elongatus PCC6301 and Thermosynechococcus elongatus BP1 plus the toxic Microcystis aeruginosa NIES-843 (from now on: Synechocystis, Synechococcus, Thermosynechococcus and Microcystis) by biocomputational comparative genome analysis with a focus on Synechocystis. Results and Discussion Computational screening for novel ncRNAs To screen for novel RNA elements, all intergenic regions >50 nt were extracted from the four genomes and analyzed as outlined in Fig. Fig.1,1
The analysis was basically focused on sequence and structure similarities. Detailed information on all clusters predicted by our method including the positions of all sequences is available online [25]. This information, which we show exemplarily in the inset in Fig. Fig.1,1 High-scoring putative RNA elements Filtering with P > = 0.5 or Z < = -2.0 reduced the initial number of 1528 individual sequences in the 443 predicted clusters to 113 sequence clusters with 620 individual elements, 94, 56, 64 and 406 in Synechocystis, Synechococcus, Thermosynechococcus and Microcystis, respectively. A summary of the highest scoring clusters is given in Table 1[29-31]. The Venn diagram in Fig. Fig.22
We previously showed the existence of Yfr1 in three out of four tested marine cyanobacteria belonging to the genera Prochlorococcus and Synechococcus [8] and later demonstrated its existence throughout the cyanobacterial radiation, including the four unicellular cyanobacteria targeted here [13]. It was, therefore, no surprise to find Yfr1 among the top-scoring elements (Z-score and probability of -4.340 and 1.0) in cluster 139 (Table 1). Although RNA elements in cyanobacteria are only scarcely covered by Rfam, the existing entries provided another positive control set: the thiamine riboswitch was correctly identified in three strains (cluster 149; Table 1) and also two RNA elements of unknown function were correctly found for Synechocystis and Microcystis but not for the other two cyanobacteria (cluster 216 and 107 in Table 1). However, we noted the functional role assumed for one of these conserved RNA structures, the ykkC/yxkD element (cluster 216), to switch efflux pumps and detoxification systems in response to harmful environmental molecules, may not apply to the cyanobacterial homologs since they are neither in Synechocystis nor in Microcystis located upstream of a putative transporter gene. Synteny among high-scoring RNA elements The genomic location of a predicted ncRNA gene or RNA element in the same sequence neighbourhood in some or all of the studied cyanobacteria can also be a powerful tool for finding related ncRNAs. Among the 25 high-scoring sequence clusters in Table 1, 9 (36%) showed at least partial synteny. The high scoring element in cluster 80 illustrates this fact. The primary annotation gives no hint about the possible relatedness of the flanking genes. The flanking gene sufR annotated in Microcystis encodes an iron-sulfur cluster biosynthesis transcriptional regulator and similarity searches revealed that sll0088, syc2358d and sufR actually are orthologs of each other (Fig. (Fig.3A).3A
Experimental verification For exemplary experimental verification of predicted ncRNA genes we chose two very different examples, one well-supported candidate with three members from cluster 159 (probability 0.933 and Z-score -2.00; Table 1) and one from cluster 294 (probability 1.0 and Z-score -2.64; Table 1). Northern hybridization of total RNA from Synechocystis using strand-specific RNA probes confirmed the existence of both ncRNAs (Fig. (Fig.4).4
In a more general sense these results demonstrate that, just judging from the prediction, both candidate ncRNA genes might have been expected to be 3'UTRs due to their close location to an mRNA 3'end. We did not investigate their origin from a specific promoter further as we did for the Yfr2a ncRNA (see below), but the results shown in Fig. Fig.4,4 A family of ncRNAs that is widely conserved among cyanobacteria The vast majority of the ~100 bacterial ncRNAs experimentally verified thus far have been identified in Escherichia coli [2] and a few other model proteobacteria and Pseudomonas species. Therefore it is not surprising that, with the exception of the four highly conserved ncRNAs 6S RNA, tmRNA, ffs and RnpB, ortholog genes for ncRNAs are known only among very closely related species such as between Salmonella sp. and Yersinia sp. [37]. Here, with cluster 219 eight sequences were identified with high sequence and predicted secondary structure similarity to a family of ncRNAs initially found in marine Prochlorococcus [8]. There are four such ncRNAs in Prochlorococcus MED4 which in the original publication had been named Yfr2, Yfr3, Yfr4 and Yfr5 [8]. From the eight new members to this family in cluster 219 three belong to Synechocystis, one to Thermosynechococcus and two each are predicted in Synechococcus and in Microcystis. Since none of them has a more pronounced similarity to any of the original Yfr2-Yfr5 ncRNAs from Prochlorococcus MED4, we decided to call them all "Yfr2" according to the first member in this group and then just to add a suffix. Therefore, the three predicted candidates belonging to this ncRNA family in Synechocystis are Yfr2a, Yfr2b and Yfr2c. All three are expressed in Synechocystis (Fig. (Fig.5).5
Sequence alignments and secondary structure predictions of the 8 Yfr2-5-type ncRNAs suggest a centrally located single-stranded loop element together with a short unpaired region at the 5' end that are highly conserved (Fig. (Fig.6).6
Expression analysis of Yfr2a Starting with the mapped initiation site of Yfr2a we chose the region located immediately upstream of it in a promoter fusion experiment with luxAB genes to prove that it actually does contain a functional promoter. Moreover, if the expression of an ncRNA is regulated under certain environmental conditions this sometimes gives a hint into which processes this ncRNA might be involved in. As controls, we chose the same DNA fragment in reverse orientation and amplified and cloned the psbA2 (slr1311) promoter, again in both orientations. The 300 nt upstream of Yfr2a provided indeed very strong expression to the reporter genes – under all tested conditions the measured fluorescence values were comparable to those obtained from the psbA2 promoter-driven luciferase gene expression, whereas the reverse orientation of the same fragments provided very low activity only (Fig (Fig7A).7A
Six Clusters containing repetitive sequences One problem when dealing with genome sequences of some cyanobacteria is the high number of repetitive sequences, mobile genetic elements and transposon-related sequences. Indeed, the output from our prediction pipeline was contaminated by imperfect inverted repeat sequences flanking different families of IS elements, mainly in Microcystis, and to a lesser extent in Synechocystis. If subtracted from the data set, the total number of predicted RNA elements (numbers in brackets correspond to high-scoring elements) in Synechocystis, Synechococcus, Thermosynechococcus and Microcystis drops to 339 (78), 160 (53), 144 (42) and 426 (168). Such repetitive sequences were collected in six sequence clusters, namely #12, #25, #38, #68, #270 and #383 with 78, 87, 153, 83, 42 and 16 individual sequences, respectively. All 459 sequences from these clusters have been collected in a separate file and are accessible from our website [41]. Conclusion Comparative genomics-based prediction of ncRNA genes and candidate ncRNA genes is more and more becoming a standard tool to search for such genes within bacterial genomes [8,19-23]. Here we provide the first list of ncRNA and other RNA element candidates for model unicellular cyanobacteria. Surprisingly, we identified with Yfr2a-Yfr2c a family of ncRNAs which is widely conserved among cyanobacteria and which become accumulated to high concentrations. Our experimental verification together with existing positive controls suggests a high number of positives in this candidate set. However, there are also putative 5' operon leaders, Rho-independent 3' transcriptional terminators and possibly yet unidentified riboswitches in this data set. Moreover, the output is contaminated to some extent by transposase-related sequences. Nevertheless, by analogy to other bacteria, including the most streamlined marine cyanobacterium Prochlorococcus MED4 [9], this number of ncRNAs and other RNA elements is probably a grave underestimation. Therefore this analysis should be considered as a first step to become complemented by more exhaustive experimental screens, for instance by using tiling arrays or deep sequencing in the near future. Methods Cultures and manipulation of cyanobacteria The Synechocystis Moscow strain was used in this study (obtained from A. Wilde, University of Giessen, Germany, originally from S. Shestakov, Moscow State University, Russia) and propagated on BG11 [42] 1% (w/v) agar (Bacto agar, Difco) plates. Liquid cultures of Synechocystis were grown at 30°C in BG11 medium under continuous illumination with white light of 50 μmol of photons•m-2•s-1 and a continuous stream of air. As a promoter test vector we used the pILA plasmid [43] into which ~300 bp long promoter fragments were cloned as transcriptional fusions with the luxAB genes. After transformation, this plasmid integrates into the slr0168 gene within the chromosome of Synechocystis by homologous recombination. Transformation and analysis of correct integration and segregation was carried out as described elsewhere [44]. Extraction and analysis of RNA Exponentially growing Synechocystis cultures (OD750 0,6 – 0,8) were collected by filtration (Pall Supor 800 Filter). Filters with cells were dissolved in 1 ml Trizol per 40 ml culture, immediately frozen in liquid nitrogen and incubated for 15 min at 65°C in a water bath. Further RNA isolation followed the manufacturer's protocol. Small RNA Northern blots were prepared from the separation of 10 to 25 μg of total RNA on 10% urea-polyacrylamide gels as described by Steglich et al. [9]. Polyacrylamide gels were stained with ethidium bromide (0.3 μg/l) in 1× TBE buffer, rinsed with 1× TBE and analyzed with an E-BOX video gel documentation system (Peqlab). Transcript sizes were determined by correlation to Fermentas' RiboRuler low range RNA marker. Blots for RNAs with higher molecular weight were prepared from the separation of 5 μg of total RNA on 1,5% denaturing agarose gels. Transcriptional start sites were determined by 5'-RACE as described [9]. Sequence data Genome sequences were obtained from the finished microbial genomes website at Genbank [45] with the following accession numbers: Synechococcus elongatus PCC 6301, NC_006576; Synechocystis sp. PCC 6803, NC_000911; Thermosynechococcus elongatus BP-1, NC_004113; Microcystis aeruginosa NIES843, NC_010296. Prediction of RNA elements in a comparative approach We performed a comparative prediction of ncRNA elements within intergenic regions (IGRs). Therefore, all IGRs longer than 50 nt were extracted and compared among the different genomes using Blast. Intragenomic analyses with the settings given in Fig. Fig.11 Matching predictions to Rfam and TransTermHP All individual sequences were matched against Rfam [27] using the batch search feature provided by Rfam. Mapping of predicted sequences to information about Rho-independent terminators provided by TransTermHP [28] was done using Vmatch [46]. Therefore, TransTermHP predictions for Microcystis were computed using TransTermHP 2.06 with default parameters, while for Synechocystis, Synechococcus and Thermosynechococcus existing predictions were downloaded from the TransTermHP website. All predictions were converted to FASTA-format and searched for at least 30 nt long hits with 100% identity to candidate sequences. Comparative sequence/structure analysis Multiple sequence alignments were generated using ClustalW [47] with default parameters for DNA. Comparative structure prediction was done with RNAlishapes [48], a tool which predicts a consensus structure for a set of aligned sequences by taking covariance and free energy into account. The resulting consensus structure was analysed together with the multiple sequence alignment using RALEE [49]. The latter served also for manual optimisation of the alignment and the consensus structure, respectively, and for the production of colour annotated alignments. Colour plots of Consensus structures were generated using RNAalifold [50]. Oligonucleotides Oligonucleotide primers for the generation of hybridization probes (T7 promoter sequence in boldface letters): Yfr2c_for: 5'-TAATACGACTCACTATAGGccgccagcgccattgcttc-3' Yfr2c_rev: 5'-cttaggacaggtgtgaggaaattag-3' Yfr2b_for: 5'-TAATACGACTCACTATAGGcggggagcatagaccagcttg-3' Yfr2b_rev: 5'-ggaagttattatctagaggtgtgtgag-3' Yfr2a_for: 5'-TAATACGACTCACTATAGGaggcaaaaaaataaggaagtccgcaag-3' Yfr2a_rev: 5'-cggctatcccgcccttagg-3' Syr1_for: 5'-TAATACGACTCACTATAGGccgagggcatatctaggagaac-3' Syr1_rev: 5'-ggctatggaaacccgacagaattc-3' Syr2_for: 5'-TAATACGACTCACTATAGGcaaacaaaaaaagaggccattgctgacc-3' Syr2_rev: 5'-gactagttgttgctaatttagcaatgttg-3' Oligonucleotides for promoter fusion experiments (KpnI restriction sites 5'-GGTACC-3' introduced for cloning are labelled in boldface letters): Yfr2a (fw): 5'-GGTACCCTAGATGACACCGGCACG-3' Yfr2a (rev): 5'-GGTACCCTCCTCACACACAAATAAATGTTAG-3' psbA2 (fw): 5'-CCTTGGTACCAAGAGTAATGGCGTGC-3' psbA2 (rev): 5'-GATTGGTACCGGAACTGACTAAACTTAGTC-3' Oligonucleotides for specific mapping of 5'ends through 5'RACE: Yfr2c_5'RT: 5'-CCTAAAAATTGCCATAAAAAAACAC-3' Yfr2c_5'Race: 5'-TCTTTCCTTGTTTCGACTCCAG-3' sll1477_5'RT: 5'-GCGGCCAGAGGTTTCC-3' sll1477_5'Race: 5'-CAGCGTAGCTAGGGAAATCACCACCAG-3' Yfr2b_5'RT: 5'-AAAAGGCAAGAAAAAGCCCC-3' Yfr2b_5'Race: 5'-TTTCCTTGTTTCGACTCCGGGG-3' slr0199_5'RT: 5'-TGACCCAGATACCCTAAAAG-3' slr0199_5'Race: 5'-CTTTTGATAATCTTGGCGGCC-3' Yfr2a_5'RT: 5'-GGAGTCTTGCCATGTTTCG-3' Yfr2a_5'Race: 5'-CCTCCGGCTGCTTCCTT-3' Hybridization conditions Northern hybridization was performed at 62°C in hybridization buffer (50% deionized formamide, 7% SDS, 250 mM NaCl, 120 mM Na(PO4), pH 7.2) as described by Steglich et al. [9]. Single stranded probes were generated from PCR-amplified templates incorporating the T7 promoter in one of the oligonucleotide primers, using the MAXIscript Kit (Ambion, USA) and 100 ng PCR-generated DNA template. Abbreviations IGR: intergenic region; ncRNA: non-coding RNA. Authors' contributions BV designed and carried out bioinformatic analyses and participated in drafting the manuscript, JG performed all RNA analyses in the laboratory, SU and VS constructed the promoter test constructs and performed the reporter gene assays. WRH designed research and wrote the manuscript. All authors read and approved the final manuscript. Acknowledgements This work was supported by the Deutsche Forschungsgemeinschaft Focus program "Sensory and regulatory RNAs in Prokaryotes" SPP1258 (project HE 2544/4-1), the graduate school "Signal systems in plant model organisms" (to JG) and by the Freiburg Initiative in Systems Biology. We thank Martin Hagemann for the gift of promoter test vectors. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Curr Opin Microbiol. 2006 Dec; 9(6):605-11.
[Curr Opin Microbiol. 2006]Trends Genet. 2005 Jul; 21(7):399-404.
[Trends Genet. 2005]Annu Rev Microbiol. 1994; 48():713-42.
[Annu Rev Microbiol. 1994]Mol Microbiol. 2007 Jan; 63(1):193-217.
[Mol Microbiol. 2007]Cell. 2004 Jul 9; 118(1):69-82.
[Cell. 2004]Biol Chem. 2005 Dec; 386(12):1219-38.
[Biol Chem. 2005]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]PLoS Genet. 2008 Aug 29; 4(8):e1000173.
[PLoS Genet. 2008]Genome Biol. 2008; 9(5):R90.
[Genome Biol. 2008]PLoS Genet. 2007 Dec; 3(12):e231.
[PLoS Genet. 2007]Appl Environ Microbiol. 2006 Dec; 72(12):7607-13.
[Appl Environ Microbiol. 2006]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]Mol Microbiol. 2007 Dec; 66(5):1080-91.
[Mol Microbiol. 2007]BMC Genomics. 2007 Dec 19; 8():467.
[BMC Genomics. 2007]DNA Res. 2007 Dec 31; 14(6):247-56.
[DNA Res. 2007]Proc Natl Acad Sci U S A. 2005 Feb 15; 102(7):2454-9.
[Proc Natl Acad Sci U S A. 2005]Proc Natl Acad Sci U S A. 2005 Feb 15; 102(7):2454-9.
[Proc Natl Acad Sci U S A. 2005]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D121-4.
[Nucleic Acids Res. 2005]Genome Biol. 2007; 8(2):R22.
[Genome Biol. 2007]Proc Natl Acad Sci U S A. 2004 Apr 27; 101(17):6421-6.
[Proc Natl Acad Sci U S A. 2004]Prog Nucleic Acid Res Mol Biol. 1994; 47():331-70.
[Prog Nucleic Acid Res Mol Biol. 1994]Proc Natl Acad Sci U S A. 2005 Feb 15; 102(7):2454-9.
[Proc Natl Acad Sci U S A. 2005]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]BMC Genomics. 2007 Oct 17; 8():375.
[BMC Genomics. 2007]FEBS Lett. 2003 Aug 14; 549(1-3):57-62.
[FEBS Lett. 2003]Biochem Biophys Res Commun. 1997 Oct 9; 239(1):291-7.
[Biochem Biophys Res Commun. 1997]J Bacteriol. 1985 Mar; 161(3):1162-70.
[J Bacteriol. 1985]Nucleic Acids Res. 2004; 32(20):6057-68.
[Nucleic Acids Res. 2004]Proc Natl Acad Sci U S A. 2006 May 2; 103(18):7054-8.
[Proc Natl Acad Sci U S A. 2006]Trends Genet. 2005 Jul; 21(7):399-404.
[Trends Genet. 2005]Nucleic Acids Res. 2003 Apr 1; 31(7):1813-20.
[Nucleic Acids Res. 2003]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]Structure. 2000 Mar 15; 8(3):R47-54.
[Structure. 2000]Curr Opin Microbiol. 2007 Apr; 10(2):156-63.
[Curr Opin Microbiol. 2007]J Biol Chem. 2004 Jun 11; 279(24):25066-74.
[J Biol Chem. 2004]Genome Biol. 2005; 6(9):R73.
[Genome Biol. 2005]Mol Microbiol. 2007 Dec; 66(5):1080-91.
[Mol Microbiol. 2007]BMC Genomics. 2007 Dec 19; 8():467.
[BMC Genomics. 2007]PLoS Genet. 2008 Aug 29; 4(8):e1000173.
[PLoS Genet. 2008]J Microbiol Methods. 2000 Aug; 41(3):185-94.
[J Microbiol Methods. 2000]Microbiology. 2008 Oct; 154(Pt 10):3134-43.
[Microbiology. 2008]PLoS Genet. 2008 Aug 29; 4(8):e1000173.
[PLoS Genet. 2008]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D121-4.
[Nucleic Acids Res. 2005]Genome Biol. 2007; 8(2):R22.
[Genome Biol. 2007]Nucleic Acids Res. 1994 Nov 11; 22(22):4673-80.
[Nucleic Acids Res. 1994]Nucleic Acids Res. 2006; 34(19):5471-81.
[Nucleic Acids Res. 2006]Bioinformatics. 2005 Jan 15; 21(2):257-9.
[Bioinformatics. 2005]J Mol Biol. 2002 Jun 21; 319(5):1059-66.
[J Mol Biol. 2002]PLoS Genet. 2008 Aug 29; 4(8):e1000173.
[PLoS Genet. 2008]