Copyright © 2005 Podar et al; licensee BioMed Central Ltd.

Evolution of a microbial nitrilase gene family: a comparative and environmental genomics study

Diversa Corporation, 4955 Directors Place, San Diego, CA 92131 USA

Corresponding author. Mircea Podar: mpodar/at/diversa.com; Jonathan R Eads: jeads/at/diversa.com; Toby H Richardson: trichardson/at/diversa.com

Received May 5, 2005; Accepted August 6, 2005.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Completed genomes and environmental genomic sequences are bringing a significant contribution to understanding the evolution of gene families, microbial metabolism and community eco-physiology. Here, we used comparative genomics and phylogenetic analyses in conjunction with enzymatic data to probe the evolution and functions of a microbial nitrilase gene family. Nitrilases are relatively rare in bacterial genomes, their biological function being unclear.

Results

We examined the genetic neighborhood of the different subfamily genes and discovered conserved gene clusters or operons associated with specific nitrilase clades. The inferred evolutionary transitions that separate nitrilases which belong to different gene clusters correlated with changes in their enzymatic properties. We present evidence that Darwinian adaptation acted during one of those transitions and identified sites in the enzyme that may have been under positive selection.

Conclusion

Changes in the observed biochemical properties of the nitrilases associated with the different gene clusters are consistent with a hypothesis that those enzymes have been recruited to a novel metabolic pathway following gene duplication and neofunctionalization.
These results demonstrate the benefits of combining environmental genomic sampling and completed genomes data with evolutionary and biochemical analyses in the study of gene families. They also open new directions for studying the functions of nitrilases and the genes they are associated with.

Background

Having colonized virtually every environment, bacteria and archaea have evolved enzymatic solutions for a wide range of metabolic biochemical transformations [1,2]. Studying enzymes derived from organisms inhabiting these environments is important for understanding how microbes adapt to, react to and transform the environment. The overwhelming majority of microbial species, however, remain uncultivated [3]. A variety of functional and sequence-based approaches have been developed for discovering and characterizing genes, operons and even entire genomes directly from the environment, collectively referred to as metagenomics or environmental genomics [4]. The use of environmental genomics has already led to important discoveries such as genes responsible for novel biological functions [5], microbial community metabolic traits [6-8] and dramatic increases in the diversity of various enzyme families [9,10]. Subsequent biochemical and evolutionary analyses can strengthen the biological and ecological inferences even before organisms that carry that genetic information are isolated in culture [11-13]. From a practical perspective, microbial environmental genomics has been a successful approach for the discovery of enzymes for a broad spectrum of biotechnological applications [14-17].

To gain insight into the evolution of function in a gene family that has been extensively sampled by environmental genomic screening and characterized biochemically, we focused on bacterial nitrilases. These enzymes are members of the carbon-nitrogen hydrolase superfamily, which catalyzes the hydrolysis of a wide range of non-peptide carbon-nitrogen bonds [18-20].
The nitrilase family hydrolyzes nitriles to their corresponding carboxylic acids, releasing ammonia. This reaction is likely involved in detoxification of xenobiotics and nitriles produced as defense chemicals by other microorganisms and plants, as well as in secondary metabolite biosynthetic pathways. Nitrilases appear to be rare in bacteria (out of over 150 sequenced bacterial genomes only 10 contain nitrilase genes). Recently, over 130 nitrilases were identified by functional screening of hundreds of environmental DNA libraries, for use in industrial biocatalysis applications [9]. Those enzymes were characterized biochemically and classified into six subfamilies, four of them with no representatives in known bacterial species. It was found that a number of enzymatic properties (substrate specificity and enantioselectivity) were specific to subfamilies and, in some cases, correlated with the biogeography and ecology of the environmental samples.

The role of gene duplication, natural selection and functional diversification in the evolution of the nitrilase gene family is unknown. The correlation of distinct enzymatic properties with the different gene subfamilies suggests that nitrilases have diverged functionally to accommodate distinct biological roles in microbial communities that occupy various ecological niches. Functional divergence is the result of changes in selection pressure and is often accompanied by associations with novel gene clusters or operons that encode enzymes with coupled metabolic activities. To begin addressing some of these aspects, we analyzed the genetic neighborhoods of all available nitrilase genes, identified conserved patterns of gene clustering relative to biochemical data and phylogeny, and propose a hypothesis on nitrilase evolution involving gene duplications and Darwinian selection.
Results and discussion

The nitrilases from cultivated bacteria belong to clade-specific gene clusters

Bacterial nitrilases (137 environmental sequences and 10 sequences from cultivated species) have been recently classified into six major clades [9] that we refer to as subfamilies. We analyzed more recently released genome sequences and found an additional nine novel nitrilases. Phylogenetic analysis of a sequence dataset consisting of all nitrilase genes from cultivated bacteria shows that 18 sequences belong to subfamilies one and two (Figure 1).
In bacteria, genes are often organized in clusters (e.g. operons, regulons) that reflect involvement in a common metabolic process or association in a supramolecular complex [21-23]. To determine if nitrilase function could be inferred from the nature of the surrounding genes, we analyzed those genes in the available genomic data. We found that all of the known seven subfamily 1 nitrilase genes (six genomic and one on a plasmid) belong to a conserved and previously undescribed cluster of seven genes, Nit1C (Figure 1).
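The idea of looking for genes that recur next to the nitrilase in every genome can be illustrated with a minimal Python sketch. It is not the procedure used in the study: the genome names, the family labels and the `conserved_families` helper are all hypothetical, and real neighborhoods would first need homology-based annotation of each flanking ORF.

```python
from collections import Counter


def conserved_families(neighborhoods, min_fraction=1.0):
    """Return gene families found in at least `min_fraction` of neighborhoods.

    `neighborhoods` maps a genome name to the list of family labels
    annotated for the genes flanking (and including) the nitrilase.
    """
    if not neighborhoods:
        return set()
    counts = Counter()
    for genes in neighborhoods.values():
        # Count each family once per genome, regardless of copy number.
        counts.update(set(genes))
    needed = min_fraction * len(neighborhoods)
    return {family for family, n in counts.items() if n >= needed}


# Hypothetical annotations; gene order and strand are ignored in this sketch.
neighborhoods = {
    "genome_A": ["ORF1", "nitrilase", "radical_SAM", "GNAT", "AIRS",
                 "ORF6", "oxidoreductase", "transposase"],
    "genome_B": ["oxidoreductase", "ORF1", "nitrilase", "radical_SAM",
                 "GNAT", "AIRS", "ORF6"],
    "genome_C": ["ABC_transporter", "ORF1", "nitrilase", "radical_SAM",
                 "GNAT", "AIRS", "ORF6", "oxidoreductase"],
}

core = conserved_families(neighborhoods)
print(sorted(core))
```

Lowering `min_fraction` would tolerate genomes with incomplete flanking sequence, at the cost of admitting families that are not strictly conserved.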
In the case of subfamily 2, gene neighborhood information was available for only four of the twelve genes from cultivated bacteria. In Bacillus sp. and Pseudomonas syringae, the nitrilase gene is apparently co-transcribed with a downstream phenylacetaldoxime dehydratase gene and preceded by an araC transcription factor transcribed from the other strand. The other nitrilase genes (from Burkholderia, Bradyrhizobium and Ralstonia) are part of unrelated clusters (Figure 1).

In addition to the nitrilases from completed genomes of cultivated bacteria, we used BLASTP to search for such enzymes in two large environmental sequence datasets: the acid-mine drainage microbial mats [7] and the Sargasso Sea [10]. No nitrilases were found in the acid-mine dataset. In the Sargasso Sea dataset we identified 17 nitrilases that were full-length or long enough to be phylogenetically informative. Three of the genes appear to be eukaryotic while eight bacterial genes are close relatives of nitrilases from Synechococcus or Burkholderia. The remaining six genes do not appear to have close relatives among known nitrilases and belong to subfamilies 2, 4 and 5 [see Additional file 1]. Finding so few nitrilase genes in such a large dataset suggests that for uncovering the sequence space of a gene family, functional screening of a large number of samples from very different environments is more efficient than deep sequence coverage of one or a few environments.

Nitrilases associated with different types of gene clusters have distinct enzymatic properties

For the nitrilase genes identified from environmental DNA, the identity of the host organism is unknown. However, because those libraries were constructed using fragments of genomic DNA several times larger than the average nitrilase gene length (~1 kb), we also analyzed the gene neighborhood of the environmental nitrilases.
Because of the highly conserved nature of the Nit1C cluster and its occurrence in distant taxa of bacteria, we first focused on mapping its distribution among the environmental nitrilase clones. We found that the Nit1C cluster is strictly confined to a group of subfamily 1 nitrilases that includes the seven genes identified in completed genomes and 14 of the environmental ones. Four of the subfamily 1 nitrilases from the Sargasso Sea dataset had small flanking sequences and we identified the presence of the Nit1C-type genes (ORFs 1 or 3), similar to those of their close relatives from Synechococcus and Burkholderia. However, because of their incomplete length, those sequences were not included in further analyses. The nitrilase genes that belong to the Nit1C cluster are indicated on a maximum likelihood phylogenetic tree calculated using the subfamily 1 genes as well as several outgroup sequences from subfamilies 2 and 3 (Figure 3A).
The sister group of subfamily 1 nitrilases, subfamily 3, consists of only three environmental genes. We had sufficient flanking sequence to determine the nature of the neighboring genes for only one of them (3A1), which is flanked by two hypothetical ORFs with no identifiable homologs. Therefore, the Nit1C cluster appears to have originated with and is restricted to a subset of subfamily 1 nitrilases. The more distantly related nitrilases from subfamilies 4, 5 and 6 have no apparent associations with a conserved gene cluster (data not shown).

In our previous study [9] we uncovered a number of correlations between the biochemical properties of the environmental microbial nitrilases and their phylogenetic classification. Distinct gains or losses of activity or switches in enantioselectivity coincided with the evolutionary events that led to the formation of the main subfamilies. One of the most interesting findings was a reversal in enantioselectivity (R to S) that occurred in subfamily 1, against the model substrate hydroxyglutaronitrile. To correlate the differences in types of gene clusters with the nitrilase biochemical properties, we graphed the available hydroxyglutaronitrile activity data on the side of the phylogenetic tree (Figure 3C).

Analysis of the subfamily 1 nitrilase gene clusters

Having determined that subfamily 1 nitrilases belong to two distinct subgroups based on their associated gene clusters and enzymatic properties, we analyzed the nitrilase neighboring genes for clues to their individual metabolic roles. The first gene in the Nit1C cluster, ORF1, encodes proteins that are highly conserved in length (160–163 amino acids) and sequence (>60% identity between any two genes). However, no other homologs were found using standard searching techniques of current databases.
Using HMM structural homology modeling (Superfamily 1.63 server) [25], we tentatively assigned the hypothetical protein 1 to the YchN1-like superfamily and fold, whose biochemical activity is unknown. Next in the cluster is the nitrilase gene. The third gene encodes a member of the radical SAM superfamily (Pfam 04055), enzymes that catalyze a wide variety of radical-based reactions through reductive cleavage of S-adenosylmethionine at an iron-sulfur center [26]. The Nit1C SAM genes form a strongly supported clade (~50% average sequence identity), most closely related to bacterial and archaeal genes annotated as biotin synthase-related enzymes (COG2516) [see Additional file 2]. ORF4 in the Nit1C cluster also forms a clade of closely related sequences and belongs to the GCN5-related N-acetyltransferase (GNAT) superfamily (Pfam 00583) [27]. These enzymes are involved in antibiotic detoxification as well as in histone acetylation in eukaryotes. The closest homologs to the Nit1C GNAT genes are a number of other acetylases from bacteria like Rhodobacter and Enterococcus [see Additional file 2]. The fifth gene in the cluster encodes members of the large 5'-phosphoribosyl-5-aminoimidazole synthase-related proteins superfamily (AIRS, Pfam 00586). Enzymes in this superfamily are involved in de novo purine biosynthesis, selenophosphate synthesis, or maturation of NiFe hydrogenase. These genes form a unique clade, most closely related to a group of archaeal genes encoding phosphoribosylformylglycinamide synthases [see Additional file 2]. The last invariant position in the cluster, ORF6, encodes a protein of approximately 100 amino acids. While the sequence identity between the individual genes surpasses 70%, we could not find any other relatives to these genes by any sequence analysis approach. The seventh ORF of Nit1C is located at either end of the cluster, on either coding strand.
This gene is a member of the pyridine nucleotide-disulphide oxidoreductases (Pfam 00070, COG2072), which include flavin-containing monooxygenases and flavoproteins involved in K+ transport. The closest relatives to the Nit1C genes are putative monooxygenases found in several species of Pseudomonas [see Additional file 2]. All Nit1C genes form clusters of closely related sequences within their respective superfamilies, suggesting a common function, possibly in a pathway for detoxification of plant or microbial defense compounds.

Members of the nitrilase clade that split after the transition event are exclusively of environmental origin, with no sequence representatives in characterized bacterial species. Approximately two thirds of the nitrilases in this group are associated with genes encoding a MarR transcriptional regulator, epimerases and epoxide hydrolases. MarR genes (Pfam 01047) are transcriptional repressors controlling the expression of the Mar operon, involved in multiple antibiotic resistance [28]. The nitrilase-associated MarR genes form a specific clade, most closely related to genes from Xanthomonas and Desulfitobacterium (30–40% identity) [see Additional file 3] and are always upstream of the nitrilase gene. The location of the epimerase and epoxide hydrolase varies somewhat, the epimerase ORF usually lying between the nitrilase and the epoxide hydrolase ORFs. Epimerases are a large class of enzymes that catalyze reversible stereochemical inversions of hydroxyl substituents in carbohydrates, participating in numerous metabolic pathways [29,30]. The nitrilase-associated epimerases form a unique clade in which the relationship between the genes parallels that of their associated nitrilases. Their closest relatives are epimerases from species of Streptomyces (~35% identity) [see Additional file 3]. Epoxide hydrolases belong to the large superfamily of alpha-beta fold hydrolases and hydrate chemically reactive epoxides to more stable dihydrodiols.
This reaction is of major importance in detoxification of a large number of endogenous epoxide metabolites and xenobiotic compounds in all organisms [31]. The association of all these genes with nitrilases could indicate the requirement for coupled reactions under the transcriptional control of MarR, perhaps involved in detoxifying sugar-based cyanogenic compounds in soils rich in decaying plant material.

Positive selection as a possible driving force for nitrilase functional diversification

The observed changes in associated gene clusters and in enzymatic properties suggest that the hypothetical gene duplication in subfamily 1 was followed by nitrilase recruitment to novel metabolic functions, possibly under selective constraints. A powerful approach to studying changes in the selective pressure in protein encoding genes involves calculation of the nonsynonymous/synonymous substitution rate ratio (ω = dN/dS) (reviewed in [32,33]). A ratio below one indicates negative (purifying) selection, restricting amino acid changes that could interfere with a well-established protein function, while ω = 1 suggests that the gene evolves neutrally. On the other hand, a ratio significantly higher than one may indicate a selective advantage for fixation of amino acid changes. This can be considered evidence of positive selection associated with functional divergence after events such as gene duplications or changes in the environment (e.g. [34,35]).

Using a relative rate test [36], we first investigated the rate variation between the branches flanking the transition event (1A23/1A25 and 1A21). A likelihood ratio test based on a three-taxon tree (consisting of 1A25 and 1A21 as test sequences and 1A29 as outgroup) compared the null hypothesis (equal rates for both branches following the transition event) with an alternative model with unconstrained rates.
The null model was rejected (P = 2 × 10⁻⁶, df = 1), supporting a 5.6 times faster overall rate for the 1A21 lineage than for 1A25, which has maintained the Nit1C association. A rate increase is predicted when gene duplication is followed by functional divergence and could occur because of positive Darwinian selection or an increase in fixation of neutral mutations as a result of relaxation of functional constraints [37-40].

To test if positive selection acted along the nitrilase lineages flanking the cluster transition event, we used a maximum likelihood (ML) approach based on codon substitution models [34]. These models take into account sequence features such as transition-transversion rate biases and codon usage variation, and allow testing hypotheses at specific branches in a phylogeny by employing heterogeneous ω values among sites and lineages. Positive selection can also be investigated using a parsimony-based method, and there is some controversy as to which of the two methods is more reliable [41-43]. The tree used for ω estimation was obtained based on the nitrilase DNA sequences, focusing on the genes around the transition event (Figure 4A). The first set of likelihood models that we used, site-specific [44], assume variations in the selective pressure across sites but no variations among individual genes. Using these models we determined that purifying selection has a dominant role across subfamily 1 nitrilases (ω = 0.04) (Table 1). This is reflected in the large number of conserved amino acids: 86 invariant (~25% of sites) and 149 conserved at the 90% level in this data set. No significant positive selection signal was identified using this category of models. However, since these models average the substitution ratios of individual sites over all lineages, they are known to lack sensitivity in detecting positive selection that acts only along a few lineages (e.g. [44,45]).
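The likelihood ratio tests used here compare two nested models through twice the difference in their log-likelihoods, referred to a chi-square distribution. The sketch below, in Python, covers only the one-degree-of-freedom case reported for the relative rate test; the function name and the log-likelihood values are hypothetical and are not taken from the study.

```python
import math


def lrt_pvalue_df1(lnl_null, lnl_alt):
    """Likelihood ratio test for nested models differing by one parameter.

    Returns the test statistic 2 * (lnL_alt - lnL_null) and its P value
    under a chi-square distribution with 1 degree of freedom.
    """
    stat = 2.0 * (lnl_alt - lnl_null)
    if stat < 0:
        raise ValueError("the alternative model cannot fit worse than the null")
    # With df = 1 the chi-square survival function reduces to erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Hypothetical log-likelihoods for the constrained (equal rates) and
# unconstrained models; chosen only to give a statistic near 22.6.
stat, p_value = lrt_pvalue_df1(-2510.40, -2499.10)
print(f"2*delta lnL = {stat:.2f}, P = {p_value:.1e}")
```

A statistic of about 22.6 corresponds to a P value on the order of 10⁻⁶ at one degree of freedom, the magnitude reported above. Tests with more degrees of freedom need a general chi-square survival function, which this sketch does not provide.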
To investigate if adaptive evolution acted along branches around the transition event, we also used a more recently developed set of maximum likelihood models, which allow the ω ratio to vary among both sites and lineages [46]. These models are more sensitive in detecting positively selected sites along a pre-specified lineage of interest ("foreground" branch) as compared to the rest of the genes ("background" branches). These models were applied to the two lineages that followed the transition event (branches 1 and 2 in Figure 4A).
High resolution structures are not yet available for nitrilases. However, the structures of two homologs, the C. elegans NitFhit protein and the Agrobacterium radiobacter N-carbamoyl-D-amino acid amidohydrolase (D-NCAase), have been solved [47,48]. Both proteins form tetramers with two dimer subunits and reveal a novel four-layer α-β-β-α fold. It is believed that all members of the nitrilase superfamily share this fold and the catalytic triad Glu-Lys-Cys in the active site. A three-dimensional model of 1A21 (the first nitrilase outside the Nit1C group) was derived based on the D-NCAase structure coordinates, and used to map the location of the residues under positive selection at the cluster transition event (CTE). Three of those, T41, Q157 and Y184, were found to be buried within the protein, close to the catalytic triad (E44, K126, C160) (Figure 4B).

Conclusion

In this study, we combined genomic and biochemical analysis of a microbial enzyme family to understand evolutionary events that have shaped the genome organization and metabolism of organisms inhabiting various environments. It has long been known that bacterial genes often cluster based on linked functions. The gene location sometimes correlates with the order of the individual reactions in an enzymatic cascade or facilitates regulatory mechanisms of gene expression. Various models have been proposed to explain the formation and the evolutionary and physiological significance of operons and other gene clusters [23]. Comparative genomic studies have shown that recognition of clusters can assist in functional annotation of novel genes but that clusters often break apart with increasing taxonomic distance [49-53]. The Nit1C cluster that we described is remarkable in that it is highly conserved across several bacterial phyla and is present in organisms that inhabit extremely diverse environments.
While limited rearrangements have occurred in Nit1C, the preservation of all seven genes suggests there is selective pressure for maintenance of the entire gene cluster regardless of the genomic dynamics in that neighborhood. The internal rearrangements of Nit1C correlate with high level taxa (cyanobacteria, beta and gamma proteobacteria). There is no experimental evidence for an involvement of any of the Nit1C genes in a known metabolic transformation. Two of the cluster genes have no close homologs or predictable biochemical activities, while the remaining genes, even though they have a predictable type of biochemical activity, belong to classes of enzymes that are involved in a wide range of transformations. Predicting function for remote homologs in the absence of experimental data is still a major difficulty in genomics [54,55]. Having a defined cluster of genes such as Nit1C, likely to be functionally connected, sets the ground for future experimental genetic and biochemical investigation in search of its biological function.

Phylogenetically, the nitrilases from the Nit1C cluster appear strictly confined to a basal subset of subfamily 1 genes. More recent diversification of the genes in this subfamily has been accompanied by a change in the type of associated gene clusters and is paralleled by changes in biochemical properties of the nitrilases. While subfamily 1 nitrilases are, overall, under strong purifying selection pressure, we detected a significant positive selection signal for the lineage following the transition event and identified several residues under such selection. This supports a hypothesis that a group of nitrilases diverged functionally from the Nit1C-type enzymes and became associated with other metabolic enzymes, possibly as part of a novel pathway, and that advantageous mutations were fixed at specific sites under positive selection.
Future studies of bacterial nitrilases and biochemical and genetic characterization of mutations at these residues are needed to better understand the determinants of substrate specificity and the functional differences between the nitrilase subfamilies.

Environmental microbial genomics has demonstrated its utility in studying large scale ecological processes [5,6,11], discovering valuable biocatalysts [15] and reassembling the genomic and metabolic blueprint of natural microbial communities through shotgun sequencing [7,8,10]. Vast amounts of sequence data could potentially be used to answer a wide range of questions, although there are open questions regarding experimental design, data analysis and breadth of biological significance [4,56,57]. A broad environmental sampling from worldwide geographical locations coupled with experimental biochemical validation and comparative genomic analysis allowed us to test metabolic and evolutionary hypotheses difficult to approach by using sequence data from only a few environments.

Methods

DNA sequences

The nitrilase sequences discovered from environmental DNA libraries are available from GenBank (AY487426-AY487562). Nitrilase sequences from sequenced bacterial genomes and their corresponding flanking genes were also obtained from GenBank, their names and accession numbers being indicated in the corresponding figures. For Verrucomicrobium spinosum DSM 4136, preliminary sequence data were obtained from The Institute for Genomic Research website [58] and for Burkholderia fungorum and Rubrivivax gelatinosus from the DOE Joint Genome Institute website [59].

Enzymatic activity

The biochemical characterization data used in this study for the environmental nitrilases tested on the non-physiological substrate hydroxyglutaronitrile have been published [9].
Sequence analysis and annotation

For the analysis of the ORFs flanking the nitrilase genes in known bacterial genomes we used the sequence coordinates available in the corresponding GenBank files. For the environmental DNA clones containing nitrilase genes we identified and annotated the other open reading frames (ORFs) contiguous with the nitrilase in the genomic insert using standard approaches. The inserts varied in size from 1 to 7 kb and in most cases contained enough information to identify one or more ORFs in addition to the nitrilase gene. Annotation was derived based on available experimental or predicted function or biochemical activity using information associated with those genes in the GenBank, PFAM, COG and KEGG databases.

Phylogenetic reconstructions

Amino acid sequences were aligned in BioEdit [60] followed by manual refinement. Sequence alignments are provided [see Additional files 4, 5]. Phylogenetic trees were constructed in PROML (PHYLIP 3.6) [61] using maximum likelihood, the JTT amino acid substitution matrix, five global rearrangements with randomized sequence input order and among-site rate variation modeled with an eight rate category discrete approximation to a gamma distribution. The model parameters were estimated using TREE-PUZZLE 5.1 [62]. Branch support was obtained by bootstrapping (100 replicates).

Analysis for positive selection

A DNA sequence alignment for the nitrilase genes was obtained based on the protein alignment and used for phylogenetic reconstructions in PAUP* 4.0 [63] using maximum likelihood; it is provided [see Additional file 6]. The model of sequence evolution (GTR+I+G) was selected using Modeltest v.3.06 [64]. To test specific branches for possible rate changes we used Hy-Phy [36]. The topologies for the DNA tree and the protein tree were identical. The tree topology was used in the program codeml (PAML [65]) to estimate dN/dS ratios based on maximum likelihood codon substitution models.
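Deriving a DNA alignment from a protein alignment, as described above for the positive selection analysis, amounts to replacing each aligned residue with its codon and each gap with three gap characters. The following Python sketch illustrates that step under stated assumptions: the `thread_codons` function and the toy sequences are hypothetical, the tool actually used by the authors is not specified in the text, and the sketch does not check that each codon really translates to the residue it is paired with.

```python
def thread_codons(aligned_protein, cds, gap="-"):
    """Map an unaligned coding sequence onto a gapped protein alignment row.

    Each residue consumes the next three nucleotides of `cds`; each gap in
    the protein row becomes a three-character gap in the codon row.
    """
    residues = sum(1 for aa in aligned_protein if aa != gap)
    if len(cds) < 3 * residues:
        raise ValueError("coding sequence is too short for the aligned protein")
    codons = []
    pos = 0
    for aa in aligned_protein:
        if aa == gap:
            codons.append(gap * 3)
        else:
            codons.append(cds[pos:pos + 3])
            pos += 3
    # Nucleotides beyond the last residue (e.g. a stop codon) are dropped.
    return "".join(codons)


# Hypothetical toy data: two aligned protein rows and their coding sequences.
protein_alignment = {"seqA": "MK-L", "seqB": "MKAL"}
coding = {"seqA": "ATGAAACTG", "seqB": "ATGAAAGCTCTG"}

codon_alignment = {
    name: thread_codons(row, coding[name])
    for name, row in protein_alignment.items()
}
for name, row in codon_alignment.items():
    print(name, row)
```

Keeping gaps in multiples of three preserves the reading frame, which codon substitution models require of their input alignment.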
Two categories of models were used, site-specific [44] as well as branch-site models [46]. Statistical comparisons between the results from different nested models were done using likelihood ratio tests [66].

Molecular modeling

A three-dimensional model for a clade 1 nitrilase (1A21) was obtained based on the structure of the homologous protein N-carbamoyl-D-amino acid amidohydrolase [48], using the Jackal software [67]. Analysis of the model and mapping of amino acid residues involved in catalysis or subject to positive selection was done in PyMol [68].

Authors' contributions

MP participated in the design of the study, performed phylogenetic, comparative genomic and statistical analyses and drafted the manuscript. JE performed sequence analysis and functional annotation. TR participated in the design of the study, performed comparative genomic and gene function analyses. All authors contributed to the writing and approved the final manuscript.

Additional file 1

Protein neighbor-joining tree for nitrilase genes from cultivated bacteria and from environmental samples. The environmental sequences are represented by GenBank accession numbers and gene names for those derived from Robertson et al, 2004. The Sargasso Sea sequences are shaded. (PDF, 234K)

Additional file 2

Maximum likelihood phylogenetic trees for genes that belong to the Nit1C clusters identified in known bacterial species, in the context of their respective protein families. Numbers represent bootstrap support (for major clades only). The Nit1C ORF sequences are shaded. (PDF, 150K)

Additional file 3

Maximum likelihood phylogenetic trees for two genes associated with nitrilases after the subfamily 1 cluster transition event, in the context of their respective larger protein families. The nitrilase associated genes are shaded. Numbers represent bootstrap support (for major clades only).
(PDF, 139K)

Additional file 4

Alignment of nitrilase amino acid sequences from cultivated bacteria (used to generate the tree in Figure 1). (TXT, 12K)

Additional file 5

Alignment of nitrilase amino acid sequences used to generate the tree in Figure 3. (TXT, 19K)

Additional file 6

Alignment of DNA sequences of nitrilase genes used to test for positive selection and to generate the tree in Figure 4. (TXT, 20K)

Acknowledgements

We thank Jay Short and Michiel Noordewier for their support and guidance; the Diversa Research and Development team, especially Dan Robertson, Jenny Chaplin and Grace Desantis, for leading the nitrilase discovery and characterization projects; David Lomelin and Cosmin Deciu for bioinformatics analysis support; and Mark Wall for the three-dimensional model of the nitrilase. Special thanks also to Melvin Simon and Phil Hugenholtz for stimulating discussions and suggestions.