![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright Sun et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Evolutionary Patterns in the Sequence and Structure of Transfer RNA: A Window into Early Translation and the Genetic Code Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America Cathal Seoighe, Editor University of Cape Town, South Africa * E-mail: gca/at/uiuc.edu Conceived and designed the experiments: FJS GCA. Performed the experiments: FJS GCA. Analyzed the data: FJS GCA. Wrote the paper: FJS GCA. Received February 11, 2008; Accepted July 2, 2008. Abstract Transfer RNA (tRNA) molecules play vital roles during protein synthesis. Their acceptor arms are aminoacylated with specific amino acid residues while their anticodons delimit codon specificity. The history of these two functions has been generally linked in evolutionary studies of the genetic code. However, these functions could have been differentially recruited as evolutionary signatures were left embedded in tRNA molecules. Here we built phylogenies derived from the sequence and structure of tRNA, we forced taxa into monophyletic groups using constraint analyses, tested competing evolutionary hypotheses, and generated timelines of amino acid charging and codon discovery. Charging of Sec, Tyr, Ser and Leu appeared ancient, while specificities related to Asn, Met, and Arg were derived. The timelines also uncovered an early role of the second and then first codon bases, identified codons for Ala and Pro as the most ancient, and revealed important evolutionary take-overs related to the loss of the long variable arm in tRNA. The lack of correlation between ancestries of amino acid charging and encoding indicated that the separate discoveries of these functions reflected independent histories of recruitment. These histories were probably curbed by co-options and important take-overs during early diversification of the living world. Introduction Modern day proteins are synthesized in ribosomes, complex molecular machines made of proteins and RNA. The relatively small L-shaped tRNA adaptors are central to protein biosynthesis and establish numerous interactions with important macromolecules in addition to ribosomal RNA [1]. Specific amino acids are charged to the acceptor arms through the activity of cognate aminoacyl-tRNA synthetases (aaRSs), while the ‘anticodon’ arms contain triplets of bases that recognize complementary ‘codon’ sequences in messenger RNA. These interactions shape the genetic code, delimit the identity, degeneracy, and function of tRNA, and are therefore fundamental to our understanding of how the biosynthetic machinery and the genetic code were set up into place in an emergent world of proteins and organisms. It seems commonly accepted that early in the history of life only few amino acids were encoded and that most of the possible codons were fairly soon brought into use [2]. However, the composition of the initial group of amino acids that was used by ancient translation systems has been controversial. Numerous groups of amino acids have been suggested as candidates ([3] and reference therein) and many hypotheses have been proposed to explain the underlying genetic code. For example, the well-known co-evolution theory postulates that the expansion of amino acids is achieved by biosynthetic transformation of precursor amino acids into product amino acids [4]. According to this hypothesis, the earliest encoded proteins were made up of pre-biotically synthesized amino acids, specifically Gly, Ala, Ser, Asp, and Glu. Three phases of amino acid entry into proteins were later proposed, in which amino acids originated first from pre-biotic synthesis (Gly, Ala, Ser, Asp, Glu, Val, Leu, Ile, and possibly Pro and Thr), later from protein-mediated biosynthetic pathways (Arg, His, Met, Trp, Asn, Gln, Lys, and possibly Phe, Tyr, and Cys), and then from post-translational modification without direct genetic encoding [5]. While the co-evolution theory is popularly supported [6]–[8], it has been criticized and remains controversial [9], [10]. For example, a group of four amino acids (Gly, Ala, Asp, and Glu) were proposed to be the first to enter the biosphere [11]. This group was later redefined by replacing Asp and Glu with Arg and Pro and postulating that families of related amino acids evolved from the initial amino acids, as the genetic code expanded [12], [13]. Yarus [14] suggested that Arg was the first amino acid, based on the unique nature of its RNA binding site. In a synthesis effort, Trifonov [3] used 60 criteria to propose a chronology of appearance of amino acids and their respective codons, each of which provided a temporal order. The order of amino acid appearance followed the sequence Gly, Ala, Asp, Val, Pro, Ser, Glu, (Leu and Thr), Arg, (Ile, Gln, and Asn), His, Lys, Cys, Phe, Tyr, Met, and Trp, with the earliest 10 amino acids (from Gly to Arg) being synthesized in the imitation experiments of Miller [15], [16]. However, this boundary may be unrealistic because Miller [16] also indicated that with the possible exception of only three amino acids (Arg, Lys, and His), all other amino acids could be derived from pre-biotic synthesis. Amino acid usage rates have also been used to infer evolution of amino acids and the genetic code, using either asymmetries in substitution matrices among closely related organisms [17], [18] or ancestral sequence reconstructions of ancient protein lineages [19], [20]. Brooks et al. [19] showed that nine amino acids (Ala, Asn, Asp, Gly, His, Ile, Ser, Thr, and Val) had a decreased frequency in proteins and could have been introduced early into the genetic code, once organismal diversification was in place. However, Jordan et al. [18] recently revealed a quite different group of early amino acids with declining frequencies in proteins (Ala, Glu, Gly, and Pro). These probabilistic methods and their implicit assumptions were recently questioned [21], and a more stringent approach of only counting fully conserved positions in ribosomal proteins was used to propose that Gln, Gly, Leu, and possibly Pro, Asp, and Asn were encoded earlier, while Cys, Phe, Glu, Ile, Val, Trp, Tyr, and possibly Lys, Glu, and Ser were late additions to the genetic code. It is quite clear that our understanding of how tRNA function has evolved is far from complete. In this study we use information embedded in the sequence and structure of tRNA molecules to study the history of amino acid charging and encoding. We first build an intrinsically rooted global tree of tRNA molecules using a well-established cladistic method [22], [23] that embeds RNA structure directly into phylogenetic analysis [24]. The approach was previously used to study evolutionary patterns in ribosomal RNA, spacer RNA, short interspersed element RNA, and many other functional RNA molecules [22], [23], [25]–[27][Sun and Caetano-Anollés, unpublished], and in particular, the origin and evolution of the major structural and functional components of tRNA [28]. Since tRNA embeds a history of recruitment in which structures gain or co-opt new identities and functions or take over established ones in processes that restrict the acquisition of phenotypic traits or functions in lineages, we sorted out these confounding processes by forcing monophyletic groupings of taxa (sets that share a common ancestor) during tree building to test alterative hypotheses or establish evolutionary timelines of structural and functional diversification [27]. This phylogenetic method (known as constraint analysis) is powerful and was recently used to gain insight into the origin of cellular superkingdoms and viruses [27]. Here, the method opens an unanticipated window into early translation and the genetic code. Results We generated rooted universal trees of tRNA from the sequence and structure of 571 tRNAs representing part 2 of the bayreuth trna database. This data set contains information on modified bases, molecules from organisms in the three superkingdoms of life and viruses, and all isoacceptor variants and amino acid specificities [27]. The optimal most parsimonious trees had lengths of 10,083 steps and were intrinsically rooted (Figures 1
Type II tRNA molecules with long variable arms, including tRNASec and most tRNASer, tRNATyr, and tRNALeu isoacceptors, appeared at the base of the rooted trees as a paraphyletic group (Figure 1
In order to uncover evolutionary patterns and test alternative hypotheses we forced groups of tRNAs related by functions (amino acid charging specificity and codon identity) into monophyly using constraint analyses. We then examined the length of the most parsimonious trees that were obtained and the number of additional steps (S) that were needed to force the constraint. This exercise was generally done either with or without forcing types I and II tRNA molecules into separate groups, but overall results were congruent. The values of S for constraints related to amino acid charging specificity ranged from 113 steps for tRNASec to 255 for tRNAArg or from 130 for tRNATyr to 266 for tRNAArg, with or without forcing types I and II tRNA molecules into groups, respectively (Table 1). These values delimited the following consensus chronology of amino acid charging, starting with the most ancient charging functions and ending with the most recent: (SecII, TyrII), (SerII, LeuII, LeuI, AlaI, CysI, ProI), HisI, SerI, (TyrI, PheI, IleI, TrpI), GlyI, (ValI, GluI), (ThrI, LysI, IniI, AspI), GlnI, AsnI, MetI, and ArgI (subscripts indicate tRNA types and parentheses indicate groups of functions that cannot be dissected in the timeline). Lower S values corresponded to ancient tRNAs in the timeline and this trend was derived from the rooted trees (and embedded assumptions of polarization; see Materials and Methods). These tendencies were for example confirmed when ancestries of isoacceptor groups derived from cumulative frequency distribution plots (expressed as average or minimum nd values) were plotted against S, normalized to a 0–1 scale (Figure 2B
Constraining tRNA molecules based on amino acids synthesized in Miller's experiments [16] was more parsimonious (S = 339) than ancient tRNA groups circumscribed by Trifonov [3], Brooks et al. [19], Jordan et al. [18], or Fournier and Gogarten [21] (S = 357–566; Table 2). Remarkably, only 135–253 steps were needed to force type II tRNA molecules containing the long variable arms into monophyly.
When constraining tRNAs according to codon identity (Table 3), we found that forcing tRNAs sharing the second bases in codons (S = 267–345) was more parsimonious than forcing tRNAs sharing the first (S = 247–466) or third bases (S = 393–896) in codons into monophyly (Figure 3A = 144–186) than tRNAs sharing G (S = 186–270), U (S = 188–309), or A (S = 211–268), in that order (Figure 3B
A plot of amino acid charging ancestries (Saac for amino acid charging constraints) versus codon ancestries (Scod for codon constraints) showed poor correlation (p>0.05) between timelines of amino acid charging and codon discovery (Figure 4
Discussion Deep evolutionary patterns embedded in tRNA phylogenies In order to uncover evolutionary patterns related to amino acid charging and the genetic code, we first generated rooted phylogenetic trees using information in the sequence and structure of tRNA (Figures 1 As shown previously [27], [28][Sun and Caetano-Anollés, unpublished], type II tRNA molecules with long variable arms coding for Sec, Ser, Tyr, and Leu appeared at the base of the rooted trees and were ancient. However, we were unable to reveal any other patterns of significance in the trees. In particular, there were no clear monophyletic groupings related to molecules with similar amino acid charging functions or codon identities (e.g., Figure 2A In order to unravel the intricate history of tRNA, we explored competing (alternative) or non-competing phylogenetic hypotheses by reconstructing sub-optimal trees containing constrained monophyletic groupings of taxa [27]. Competing hypotheses were quantitatively contrasted based on the number of additional steps (S) relative to the optimal tree, and those that were more parsimonious were not rejected. We used this approach to test for example competing hypotheses of amino acid chronology. In contrast, non-competing hypotheses were ranked by the values of S and were used to define timelines of amino acid specificities and codon discovery. Hypotheses with smaller values of S (more parsimonious) were considered less affected by the confounding effects of recruitment in lineages and represented processes that were more ancient. In other words, lineages defined by these hypotheses merged (coalesced) in backwards time more easily to fit the constraint. We have validated this fundamental assumption of ‘polarization’ by mapping the correlation between S and number of nodes from a hypothetical tRNA ancestor in the trees (Figure 2B The analysis is supported by two fundamental assumptions. First, we assume tRNA structures recruited new identities and functions as the genetic coded expanded, and that different structures were co-opted in different lineages and different functional contexts. Recruitment is pervasive in evolution of macromolecules and has been demonstrated in cellular metabolism, where protein enzymes are often recruited from one pathway to another to perform new enzymatic tasks [31]–[33]. At RNA level, tRNA structural diversification appears to have predated organismal diversification [27], [34][Sun and Caetano-Anollés, unpublished] and the functions and identities affiliated to present-day tRNA structures probably evolved in lineages and were swapped by horizontal gene transfer events in evolution. Second, we assume old tRNA structures developed or recruited new functions (co-options) more often than new tRNA structures acquired old functions (take-overs), an assumption that is supported by global studies of enzyme recruitment in metabolism [Kim et al., unpublished]. The trees show several instances of take-overs, indicating modern type I structures lacking the variable arms took over ancient amino acid charging functions associated with type II structures (Figure 1 The analysis also depends on the validity of our evolutionary models and associated assumptions of character polarization. Phylogenetic reconstruction produces trees that are rooted according to specific models of character transformation, i.e. models that define how individual phylogenetic characters transform from one character state to another along the branches of the trees. In contrast with standard phylogenetic methods, our models include a central hypothesis or axiom that invokes an evolutionary search of conformational order in molecules which defines the general direction of the evolutionary path [22], [23], [25]–[28]. Trees are therefore rooted without the need and associated uncertainties of local external hypotheses of relationship (e.g., the use of ‘outgroup’ taxa). We note however that the validity of the models that we use is well-supported by statistical mechanic, thermodynamic, and phylogenetic considerations. Character argumentation is described in detail in Materials and Methods. Any phylogenetic analysis rests on how strongly the data support the topology of the tree, and our tRNA phylogenies are no exception. Tree reconstruction showed the existence of well-resolved tRNA relationships but revealed low consistency indices (CI) and BS values (Figure 1 Using the same strategy we apply here, we recently established an evolutionary timeline of organismal diversification [27]. The study showed that the lineage of Archaea segregated from an ancient community of ancestral organisms early in evolution. We also demonstrated that organismal diversification predates the discovery of modern amino acid charging. A separate line of evidence also supports this conclusion [34]. Here, we focus on timelines of amino acid charging and codon discovery. Timelines of amino acid charging specificity We constrained each and every group of tRNA molecules coding for specific amino acids and ranked them according to the values of S (Table 1). This ranking defined a timeline for the amino acid charging function (see Results) that separated ancient type II molecules coding for Sec, Tyr, Ser, and Leu from the rest of tRNAs, and placed type I molecules coding for Asn, Met, and Arg as the most derived group (Figure 2B The early origin of the Sec charging function Our timeline clearly supports the ancestral nature of the Sec charging function. Sec, one of the two non-canonical amino acid residues (the other is Pyl), is introduced into proteins during translation under the direction of UGA, a typically stop codon which also codes for Cys [41] and Trp [42]–[45]. Uniquely, Sec is synthesized co-translationally on tRNASec [46], [47] without a cognate aminoacyl-tRNA synthetase and the tRNASec (designated as tRNA[Ser]Sec) is initially aminoacylated with Ser [48]–[53]. Seryl-tRNA synthetase (SerRS) forms Ser-tRNASec which is conversed into selenocysteyl-tRNASec in all three domains of life, Bacteria [51], Archaea [54], [55], and Eukarya [56]. In Bacteria, the formation of Sec from Ser is achieved in a single step by Sec synthase. In both Eukarya and Archaea, an additional phosphorylation step is required, catalyzed by O-phosphoseryl-tRNASec kinase (PSTK) and converting the resulting O-phosphoeryl-tRNASec (Sep-tRNASec) to Sec-tRNASec by Sep-tRNA:Sec-tRNA synthase (SepSecS) [57], [58]. Phylogenetic analyses have shown that PSTK co-evolved precisely with SepSecS and that the archaeal and eukaryotic PSTKs originated before the evolutionary divergence of the superkingdoms Archaea and Eukarya [59]. The origin of Sec has remained uncertain and controversial [60]. Two strikingly opposing hypotheses have been proposed to explain its evolutionary ancestry. On one hand, it was suggested that UGA was originally a sense codon for Sec, one of the earliest amino acids to be charged, and later evolved into a new coding function, such as termination or Trp codons in the case of mycoplasma or mitochondria [51], [61]. The use of Sec could have been counter selected by the introduction of oxygen into the earth's atmosphere. This excluded the use of this highly oxidizable amino acid except in anaerobic or well-protected chemical environments. This scenario was supported by the discovery of proteins with high contents of Sec in a symbiotic δ-proteobacterium of a gutless worm [62]. However, it was also suggested that anaerobic environments could actually support the use of Sec [62], [63]. On the other hand, it was argued that Sec evolved in the later stages of the development of the genetic code [64]. The Sec moiety is part of the active center [49], [65] in most enzymes that contain Sec [61]. Three hallmarks characterize the Sec utilization system: (i) Sec is always encoded by UGA, (ii) the incorporation of Sec always requires a stem-loop specificity sequence—the SECIS element, and (iii) there is always a dedicated translation elongation factor plus an RNA-binding component. These hallmarks support the concept of a common ancestor. Phylogenetic analysis demonstrated that bacterial, archaeal and eukaryotic selenocysteine incorporation machineries already existed at the time of the last universal common ancestor [57]. This also strongly supports the hypothesis that all life began with the opportunity to utilize Sec, and that Sec utilization has been lost by many groups of organisms during evolution, most likely due to the limited supply of selenium [66]. This is consistent with the observation that organisms have only a limited number of selenoproteins and that so many organisms lack selenoproteins altogether. Together with our observations, these results strongly support the ancestral nature of Sec and the co-translational insertion of Sec in the genetic code prior to the separation of the three superkindoms of life. It also agrees with the early evolutionary history of SepSecS, the enzyme that catalyzes the formation of Sec-tRNASec, that shows tRNA-dependent Sec formation is a primordial process [67]. Timelines of codon discovery and the evolutionary significance of the second codon position The standard genetic code maps a set of 64 (43) base triplets (codons) to 20 standard amino acid molecules, plus Sec [61] and Pyl [68] for organismal subsets, and 3 translation stop signals [69]. The code has a non-random design, in which similar amino acids are generally delimited by codons that differ in the first and second positions [2]. Therefore, it is highly redundant. When we constrained tRNA molecules sharing the first, second, or third bases in codons to form monophyletic groups, molecules sharing the second bases had the lowest S values (Figure 3A It has been argued that similar codons correspond to similar amino acids because the earliest forms of translation were imprecise, and the distant ancestors of tRNAs were only able to encode classes of similar codons (an extreme form of wobble) and classes of similar amino acids [74]–[76]. These classes of similar amino acids could have shared the same chemical or biological properties. As far as similar amino acids are concerned, Woese et al. [75] found that U in the second position codes for amino acids with hydrophobic side chains and that amino acids coded for by C in the second position seem to have consistently similar polar requirement. This observation was further supported by a multivariate study of the relationship between the genetic code and the physical-chemical properties of amino acids [77]. A relationship existed between the physical-chemical properties of the amino acids and which of the A, U, or C nucleotide was used in the second codon position. However, the amino acids coded for by G in the second codon position did not participate in this relationship. Haig and Hurst [78] calculated the average effect of changing a codon by a single base for all possible single-base changes in the genetic code and for changes in the first, second, or third codon positions separately. They concluded that amino acids whose codons differed by a single base in the first and third codon positions were very similar with respect to polar requirement and hydropathy, and that the major differences between amino acids were specified by the second codon position, i. e., codons with U in the second position were hydrophobic, whereas most codons with A in the second position were hydrophilic. The arguments by Woese et al. [75] that amino acids coded by C in the second codon position seem to have similar polar requirement indicate that these similar amino acids were among the first group of amino acids recognized by ancestor tRNAs. The results of our constraint analysis agree with this conclusion and indicate that codons with C in the second position may be the earliest codons to define the modern genetic code. A striking feature of the timelines of codon discovery of our study was that the most ancient codons belonged to type I tRNA molecules with the most ancient charging functions (Ala and Pro) and that the most ancient charging functions of type II tRNA (Sec, Tyr, Ser, and Leu) had codons that were much more derived. Even when we excluded from constraints type I tRNA take-over molecules coding for Tyr, Ser, and Leu that we identified in our trees (Figure 1 While tRNA molecular identity appears to have been established in evolution prior to cognate aaRSs [82], [83], we see groups of functions that are clustered in our ancestry plot (Figure 4 Conclusions Since it was deciphered [69], the evolution of genetic code has been the subject of much study. However, the expansion of amino acids building blocks through evolution has been generally linked to the evolution of the genetic code. We provide here clear indication that the evolution of these two tRNA functions was unlinked. We focus on how function (amino acid charging and codon assignment identity) evolved in the reconstructed trees derived from sequence and structure of tRNA molecules by using novel phylogenetic methods. Our results revealed the effects of recruitment processes and how these have impacted the history of this molecule. The use of constraint analyses uncovered disjoint evolutionary patterns associated with evolution of amino acid specificity and codon identity, indicating that co-options and take-overs embedded perhaps in horizontal gene transfer affected differentially the amino acid charging and codon identity functions. The proposed timelines of amino acid charging showed for example that type II tRNA molecules were ancient and sustained important take-overs related to codon identity. However, the timelines of codon history showed the importance of the second and then the first codon position in evolution and revealed several appealing patterns, including a role of strength of hydrogen bonds in the birth of the genetic code. Our results appear for the most part consistent with recent statistical analyses of tRNA sequences that support a strand symmetric ancient world in which tRNA had both a genetic and functional role [85]. Phylogenies reconstructed from the structure of several functional RNA molecules at different taxonomical levels (from the subspecies/species levels to the universal tree) generally matched phylogenies reconstructed from sequence (e.g., [22], [25], [26], [86]–[88]). While this supports the validity of the method, it also reveals congruent phylogenetic signals in the sequence and structure of the molecules examined. A number of recent studies have used the sequences of specific tRNA isoacceptors to build trees delimiting the three superkingdoms of life (e.g., tRNALys [82], tRNACys [83]; tRNAAsn and tRNAGln [89]). However, tRNA phylogenies that incorporate structural information, as those presented here, generally failed to group tRNAs belonging to individual superkingdoms into monophyletic groups, with the exception of some isoacceptor-specific trees [28]. Since diversification of tRNA structure appears to predate organismal diversification [27], [34], we reason structures carry deep phylogenetic signal while sequences embed more recent molecular history. This explains lack of congruence between phylogenies reconstructed from slow evolving, ancient structures and phylogenies reconstructed from sequences, which change at faster pace. This scenario is supported by the existence of vast networks in sequence space defining common structures that expand when structures evolve for reduced conformational plasticity and increase molecular order [90]. We here show that deep phylogenetic signal in tRNA structure can be nevertheless mined efficiently with the tools of phylogenetic constraint. Materials and Methods Data part 2 (compilation of trna sequence) of the bayreuth trna database (http://www.uni-bayreuth.de/departments/biochemie/trna; September 2004 edition) contains a total of 571 tRNA sequences for which there is information about base modifications. These tRNAs have cloverleaf secondary structures that were derived by comparative analysis using an alignment that is most compatible with tRNA phylogenies and known 3-dimensional models of structure [91], [92]. We took the entire data set and scored a total of 42 structural characters describing geometrical features of tRNA molecules, establishing character homology by the relative position of substructures in the cloverleaf [27], [28]. We coded the length (the total number of bases or base pairs) and number of the substructures as character states and defined them in alphanumerical format with numbers from 0 to 9 and letters from A to F. We gave the minimum state (0) to missing substructures. Modified bases were treated as deviations from the cloverleaf model and were not allowed to establish canonical Watson-Crick pairs. We scored each helical stem region as two complementary sequences (5′ and 3′ sides). We partitioned the dataset into subsets categorized by molecules belonging to superkingdoms (Archaea, Bacteria, and Eukarya) or viruses/bacteriophages, charging functions, or codon identity. In this study, we invoked a “total evidence” approach [93], [94] (also called “simultaneous analysis” [95]) in phylogenetic analysis to combine both sequence and structure data of the complete (571 tRNAs) and partitioned matrices. The goal was to provide stronger support for the phylogenetic groupings recovered from analyses of structural data. A total of 99 characters were scored from aligned tRNA sequences. Character coding We treated observable features describing the structure of molecules as phylogenetic multi-state characters. These characters exhibit character states, variants of each structural feature that is homologous. Our characters transform from one character state to another along linearly ordered and reversible pathways in which a particular path of possible evolution is specified. In particular, we treat geometrical features in structure as linearly ordered characters because RNA structures evolve in discrete manner by adding and deleting nucleotide units. This generates gradual extension or contraction of geometrical features, disfavoring the possible but costly insertion or deletion events. We defined the direction of the evolutionary path by polarizing our character transformation series, i.e. by identifying the ancestral (plesiomorphic) and derived (apomorphic) states in the sequence. In order to polarize structural characters we assume the existence of a generalized evolutionary trend in RNA structure that maximizes molecular order. This results in reversible character transformation sequences that are directional and show asymmetry between character gains and losses. We defined the maximum and minimum character states as the ancestral states for structures that stabilize (stems, modified bases, and G:U base pairs) and destabilize tRNAs (bulges, hairpin loops, and other unpaired regions), respectively. Character argumentation The use of ordered and polarized multistate phylogenetic characters that describe the geometry and statistical properties of the structure of RNA molecules has been discussed in detail elsewhere [22], [23], [25]–[28]. Character argumentation is however important because conclusions about molecular origins depend on the axiomatic component of our models that establishes which are the ancestral states. The polarization hypothesis towards order invokes a general tendency of molecules to be more stable, less plastic (more unique), and more modular [sensu Ancel and Fontana [90]), and this tendency is falsifiable. So far, a considerable body of theoretical and experimental evidence has supported these polarization trends:
Phylogenetic analysis We used maximum parsimony (MP) to search for the most parsimonious trees, i.e., solutions that require the least amount of change. We analyzed all data matrices using equally weighted MP as the optimality criterion in PAUP* v. 4.0 [115]. Our selection of MP over maximum likelihood (ML) approaches is particularly suitable. For example, in our analyses we decrease the likelihood of revisiting the same character state on the underlying tree by using multi-state characters and provide conditions for characters to evolve with equal probability but varying rates, making ML precisely MP [116], [117]. MP trees were reconstructed using heuristic search strategies. Specifically, 1,000 heuristic searches were initiated using random addition starting taxa, with tree bisection reconnection (TBR) branch swapping and the MulTrees option selected. One shortest tree was saved from each search. We included the hypothetical ancestors in the searches for the most parsimonious trees using the Ancstates command. For all the phylogenetic trees, we calculated the bootstrap support (BS) values [118] from 105 replicate analyses using “fast” stepwise addition of taxa in PAUP*. We also calculated the g1 statistic of skewed tree length distribution from 104 random parsimony trees to assess the amount of nonrandom structure in the data [119]. Constraint analysis Constraint analysis generally restricts the search of optimal trees to pre-specified tree topologies delimiting specific monophyletic groups. Here we used constraint analyses to explore alternative or compare non-mutually exclusive hypotheses of tRNA groupings. The number of additional steps (S) required to force particular taxa into a monophyletic group was obtained by using the enforce topological constraint option of PAUP*. The values of S circumscribe an evolutionary distance that can be used to quantitatively contrast alternative phylogenetic hypotheses or to compare hypotheses that are not mutually exclusive. We used the latter approach to construct evolutionary timelines. This method was used previously to establish the evolutionary timeline of organismal diversification [27]. In the present study, we conducted constraint analyses on the basis of amino acid specificity (including the ancestry of groups of amino acids circumscribed by various authors), the first, second, third, or the first two bases of the codons (i.e., the third, second, first, or the last two bases of the anticodon). Table S1 (0.06 MB PDF) Click here for additional data file.(60K, pdf) Figure S1 The global phylogenetic tree of tRNA molecules with labeled terminal taxa. This tree is shown in four parts due to its size. For every tRNA, species name is followed by the anticodon (symbols of modified bases are adopted from the BAYREUTH tRNA DATABASE), amino acid specificity, and if any, a number to indicate the presence of multiple accessions. tRNAs derived from viruses are indicated with V. Numbers above the branches are bootstrap values. tRNAs with long variable arms are highlighted in pink, while those specifying for Tyr, Leu, and Ser with short variable arms are highlighted in red. Symbols used to describe modified bases in anticodon sequences: ., unknown nucleotide; H, unknown modified adenosine; [, 2-methylthio-N6-threonylcarbamoyladenosine; I, inosine; <, unknown modified cytidine; B, 2′-O-methylcytidine; M, N4-acetylcytidine; }, lysidine; >, 5-formylcytidin; °, 2-O-methyl-5-formylcytidin; ;, unknown modified guanosine; K, 1-methylguanosine; #, 2′-O-methylguanosine; 7, 7-methylguanosine; Q, queuosine; 8, mannosyl-queuosine; 9, galactosyl-queuosine; N, unknown modified uridine; {, 5-methylaminomethyluridine; 2, 2-thiouridine; J, 2′-O-methyluridine; &, 5-carbamoylmethyluridine; 1, 5-methoxycarbonylmethyluridine; S, 5-methylaminomethyl-2-thiouridine; 3, 5-methoxycarbonylmethyl-2-thiouridine; V, uridine 5-oxyacetic acid; 5, 5-methoxyuridine; !, 5-carboxymethylaminomethyluridine; $, 5-carboxymethylaminomethyl-2-thiouridine; ), 5-carboxymethylaminomethyl-2′-O-methyluridine; P, pseudouridine; ], 1-methylpseudouridine. (1.16 MB PDF) Click here for additional data file.(1.1M, pdf) Figure S2 The global phylogenetic tree of tRNA molecules with labeled terminal taxa described as a phylogram. This tree is shown in four parts due to its size. tRNAs are labeled as described in Figure S1. (1.06 MB PDF) Click here for additional data file.(1.0M, pdf) Figure S3 Phylogenetic trees of tRNAs derived from maximum parsimony analyses of 17 partitioned data matrices. A. Bacillus subtilis. B. Bos Taurus. C. Drosophila melanogaster, D. Escherichia coli. E. Halobacterium cutirubrum. F. Haloferax volcanii. G. Homo sapiens. H. Lupinus spp. I. Mus musculus. J. Mycoplasma capricolum. K. Neurospora crassa. L. Nicotiana spp. M. Phage. N. Phaseolus vulgaris. O. Saccharomyces cerevisiae. P. Rattus norvegicus. Q. Spinacia oleracea. Terminal leaves are labeled as anticodons (symbols of modified bases are defined in Figure S1) followed by amino acid specificities and if any, a number to indicate the presence of multiple accessions. Numbers above the branches are bootstrap values. Type II tRNA molecules are highlighted in red. Detailed descriptions of the trees are given in Table S1. (0.46 MB PDF) Click here for additional data file.(446K, pdf) Acknowledgments We thank Hee Shin Kim, Ajith Harish, Minglei Wang, and Jay E. Mittenthal for helpful discussions, and Minglei Wang for scripts to calculate node distances. Any opinions, findings, and conclusions and recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding agencies. Footnotes Competing Interests: The authors have declared that no competing interests exist. Funding: This work was supported by National Science Foundation grant MCB-0343126 and the University of Illinois Critical Research Initiative. References 1. Söll D, RajBhandary UL. tRNA: structure, biosynthesis, and function. Washington, DC: ASM Press; 1995. pp. 225–250. 2. Crick FHC. The origin of the genetic code. J Mol Biol. 1968;38:367–379. [PubMed] 3. Trifonov EN. The triplet code from first principles. J Biomol Struc Dynamics. 2004;22:1–11. 4. Wong JT. A coevolution theory of the genetic code. Proc Natl Acad Sci U S A. 1975;72:1909–1912. [PubMed] 5. Wong JT. Coevolution of the genetic code and amino acids biosynthesis. Trends Biochem Sci. 1981;6:33–36. 6. Di Giulio M. The phylogeny of tRNAs seems to confirm the predictions of the coevolution theory of the origin of the genetic code. Orig Life Evol Biosph. 1995;25:549–564. [PubMed] 7. Di Giulio M. The origin of the tRNA molecule: implication for the origin of protein synthesis. J Theor Biol. 2004;226:89–93. [PubMed] 8. Chaley MB, Korotkov EV, Phoenix DA. Relationships among isoacceptor tRNAs seems to support the coevolution theory of the origin of the genetic code. J Mol Evol. 1999;48:168–177. [PubMed] 9. Amirnovin R. An analysis of the metabolic theory of the origin of the genetic code. J Mol Evol. 1997;44:473–476. [PubMed] 10. Ronneberg TA, Landweber LF, Freeland SJ. Testing a biosynthetic theory of the genetic code: Fact or artifact? Proc Natl Acad Sci U S A. 2000;97:13690–13695. [PubMed] 11. Hartman H. Speculations on the origin and evolution of metabolism. J Mol Evol. 1975;4:359–370. [PubMed] 12. Hartman H. Speculations on the evolution of the genetic code II. Orig Life. 1978;9:133–136. [PubMed] 13. Hartman H. Speculations of the origin of the genetic code. J Mol Evol. 1995;40:541–544. [PubMed] 14. Yarus M. Specificity of Arginine binding by the Tetrahymena intron. Biochemistry. 1989;28:980–988. [PubMed] 15. Miller SL. A production of amino acids under possible primitive earth conditions. Science. 1953;117:528–529. [PubMed] 16. Miller SL. Which organic compounds could have occurred on the prebiotic earth? Cold Spring Harbor Symp Quant Biol. 1987;52:17–27. [PubMed] 17. Zuckerkandl E, Derancourt J, Vogel H. Mutational trends and random processes in the evolution of informational macromolecules. J Mol Biol. 1971;59:473–490. [PubMed] 18. Jordan IK, Kondrashov FA, Adzhubei IA, Wolf YI, Koonin EV, et al. A universal trend of amino acid gain and loss in protein evolution. Nature. 2005;433:633–638. [PubMed] 19. Brooks DJ, Fresco JR, Lesk AM, Singh M. Evolution of amino acid frequencies in proteins over deep time: inferred order of introduction of amino acids into the genetic code. Mol Biol Evol. 2002;19:1645–1655. [PubMed] 20. Brooks DJ, Fresco JR, Singh M. A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor. Bioinformatics. 2004;20:2251–2257. [PubMed] 21. Fournier GP, Gogarten JP. Signature of a primitive genetic code in ancient protein lineages. J Mol Evol. 2007;65:425–436. [PubMed] 22. Caetano-Anollés G. Evolved RNA secondary structure and the rooting of the universal tree of life. J Mol Evol. 2002;54:333–345. [PubMed] 23. Caetano-Anollés G. Tracing the evolution of RNA structure in ribosomes. Nucleic Acids Res. 2002;30:2575–2587. [PubMed] 24. Pollock D. The Zuckerkandl Prize: structure and evolution. J Mol Evol. 2003;56:375–376. 25. Caetano-Anollés G. Grass evolution inferred from chromosomal rearrangements and geometrical and statistical features in RNA structure. J Mol Evol. 2005;60:635–652. [PubMed] 26. Sun F-J, Fleurdépine S, Bousquet-Antonelli C, Caetano-Anollés G, Deragon J-M. Common evolutionary trends for SINE RNA structures. Trends Genet. 2007;23:26–33. [PubMed] 27. Sun F-J, Caetano-Anollés G. Evolutionary patterns in the sequence and structure of transfer RNA: early origins of Archaea and viruses. PLoS Comput Biol. 2008;4:e1000018. [PubMed] 28. Sun F-J, Caetano-Anollés G. The origin and evolution of tRNA inferred from phylogenetic analysis of structure. J Mol Evol. 2008;66:21–35. [PubMed] 29. Ashby WR. An Introduction to cybernetics. London: Chapman and Hall; 1956. p. 295. 30. Doyle JA. Seed ferns and the origin of angiosperms. J Torrey Bot Soc. 2006;133:169–209. 31. Teichmann SA, Rison SCG, Thornton JM, Riley M, Gough J, Chothia C. Small-molecule metabolism: an enzyme mosaic. Trends Biotech. 2001;19:482–486. 32. Kim HS, Mittenthal JE, Caetano-Anollés G. MANET: Tracing evolution of protein architecture in metabolic networks. BMC Bioinfomatics. 2006;7:351. 33. Wang M, Yafremava LS, Caetano-Anollés D, Mittenthal JE, Caetano-Anollés G. Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world. Genome Res. 2007;17:1572–1585. [PubMed] 34. Widmann J, Di Giulio M, Yarus M, Knight R. tRNA creation by hairpin duplication. J Mol Evol. 2005;61:524–530. [PubMed] 35. Sanderson ML, Donoghue MJ. Patterns of variation in levels of homoplasy. Evolution. 1989;43:1781–1795. 36. Hauser D, Boyajian G. Proportional change and patterns of homoplasy: Sanderson and Donoghue revisited. Cladistics. 1997;13:97–100. 37. Soltis PS, Soltis DE. Applying the bootstrap in phylogeny reconstruction. Stat Sci. 2003;18:256–267. 38. Bremer B, Jansen RK, Oxelman B, Backlund M, Lantz H, Kim K-J. More characters or more taxa for a robust phylogeny–case study from the coffee family (Rubiaceae). Syst Biol. 1999;48:413–435. [PubMed] 39. Sanderson MJ, Wojciechowski MF. Improved bootstrap confidence limits in large-scale phylogenies, with an example from Neo-Astragalus (Leguminosae). Syst Biol. 2000;49:671–685. [PubMed] 40. McClain WH. tRNA identity. FASEB J. 1993;7:72–78. [PubMed] 41. Meyer F, Schmidt HJ, Plumper E, Hasilik A, Mersmann G, et al. UGA is translated as cysteine in pheromone 3 of Euplotes octocarinatus. Proc Natl Acad Sci U S A. 1991;88:3758–3761. [PubMed] 42. Lovett PS, Ambulos NP, Jr, Mulbry W, Noguchi N, Rogers EJ. UGA can be decoded as tryptophan at low efficiency in Bacillus subtilis. J Bacteriol. 1991;173:1810–1812. [PubMed] 43. Osawa S, Jukes TH, Watanabe K, Muto A. Recent evidence for evolution of the genetic code. Microbiol Rev. 1992;56:229–264. [PubMed] 44. Watanabe K, Osawa S. tRNA sequences and variations in the genetic code. In: Söll D, RajBhandary UL, editors. tRNA: structure, biosynthesis, and function. Washington, DC: ASM Press; 1995. pp. 225–250. 45. Weiner AM, Weber K. A single UGA codon functions as a natural termination signal in the coliphage Qb coat protein cistron. J Mol Biol. 1973;80:837–855. [PubMed] 46. Zinoni F, Birkmann A, Leinfelder W, Böck A. Cotranslational insertion of selenocysteine into formate dehydrogenase from Escherichia coli directed by a UGA codon. Proc Natl Acad Sci U S A. 1987;84:3156–3160. [PubMed] 47. Commans S, Böck A. Selenocysteine inserting tRNAs: an overview. FEMS Microbiol Rev. 1999;23:335–351. [PubMed] 48. Hatfield DL, Gladyshev VN, Park J, Park SI, Chittum HS, et al. Biosynthesis of selenocysteine and its incorporation into protein as the 21st amino acid. Comp Nat Prod Chem. 1999;4:353–380. 49. Stadtman TC. Selenocysteine. Annu Rev Biochem. 1996;65:83–100. [PubMed] 50. Low SC, Berry MJ. Knowing when not to stop: selenocysteine incorporation in eukaryotes. TIBS. 1996;21:203–208. [PubMed] 51. Leinfelder W, Zehelein E, Mandrand-Berthelot MA, Böck A. Gene for a novel tRNA species that accepts L-serine and cotranslationally inserts selenocysteine. Nature. 1988;331:723–725. [PubMed] 52. Böck A, Forchhammer K, Heider J, Leinfelder W, Sawers G, et al. Selenocysteine: the 21st amino acid. Mol Microbiol. 1991;5:515–520. [PubMed] 53. Hatfield DL, Diamond AM. UGA: a split personality in the universal genetic code. Trends Genet. 1993;9:69–70. [PubMed] 54. Bilokapic S, Korencic D, Söll D, Weygand-Durasevic I. The unusual methanogenic seryl-tRNA synthetase recognizes tRNASer species from all three kingdoms of life. Eur J Biochem. 2004;271:694–702. [PubMed] 55. Kaiser JT, Gromadski K, Rother M, Engelhardt H, Rodnina MV, et al. Structural and functional investigation of a putative archaeal selenocysteine synthase. Biochemistry. 2005;44:13315–13327. [PubMed] 56. Ohama T, Yang DCH, Hatfield DL. Selenocysteine tRNA and serine tRNA are aminoacylated by the same synthetase, but may manifest different identities with respect to the long extra arm. Arch Biochem Biophys. 1994;315:293–301. [PubMed] 57. Yuan J, Palioura S, Salazar JC, Su D, O'Donoghue P, et al. RNA-dependent conversion of phosphoserine forms selenocysteine in eukaryotes and archaea. Proc Natl Acad Sci U S A. 2006;103:18923–18927. [PubMed] 58. Xu XM, Carlson BA, Mix H, Zhang Y, Saira K, et al. Biosynthesis of selenocysteine on its tRNA in eukaryotes. PLoS Biol. 2007;5:e4. [PubMed] 59. Sherrer RL, O'Donoghue P, Söll D. Characterization and evolutionary history of an archaeal kinase involved in selenocysteinyl-tRNA formation. Nucleic Acids Res. 2008;4:1247–1259. [PubMed] 60. Ambrogelly A, Palioura S, Söll D. Natural expansion of the genetic code. Nat Chem Biol. 2007;3:29–35. [PubMed] 61. Böck A, Forchhammer K, Heider J, Baron C. Selenoprotein synthesis: an expansion of the genetic code. Trends Biochem Sci. 1991;16:463–467. [PubMed] 62. Zhang Y, Gladyshev VN. High content of proteins containing 21st and 22nd amino acids, selenocysteine and pyrrolysine, in a symbiotic deltaproteobacterium of gutless worm Olavius algarvensis. Nucleic Acids Res. 2007;35:4952–4963. [PubMed] 63. Zhang Y, Romero H, Salinas G, Gladyshev VN. Dynamic evolution of selenocysteine utilization in bacteria: a balance between selenoprotein loss and evolution of selenocysteine from redox active cysteine residues. Genome Biol. 2006;7:R94. [PubMed] 64. Gladyshev VN, Kryukov GV. Evolution of selenocysteine-containing proteins: Significance of identification and functional characterization of selenoproteins. BioFactors. 2001;14:87–92. [PubMed] 65. Stadtman TC. Selenium biochemistry. Annu Rev Biochem. 1990;59:111–128. [PubMed] 66. Copeland PR. Making sense of nonsense: the evolution of selenocysteine usage in proteins. Genome Biol. 2005;6:221. [PubMed] 67. Araiso Y, Palioura S, Ishitani R, Sherrer RL, O'Donoghue P, et al. Structural insights into RNA-dependent eukaryal and archaeal selenocysteine formation. Nucleic Acids Res. 2008;4:1187–1199. [PubMed] 68. Srinivasan G, James CM, Krzycki JA. Pyrrolysine encoded by UAG in Archaea: charging of a UAG-decoding specialized tRNA. Science. 2002;296:1459–1462. [PubMed] 69. Nirenberg M, Caskey T, Marshall R, Brimacombe R, Kellogg D, et al. The RNA code and protein synthesis. Cold Spring Harb Symp Quant Biol. 1966;31:11–24. [PubMed] 70. Rodin S, Ohno S, Rodin A. Transfer RNAs with complementary anticodons: could they reflect early evolution of discriminative genetic code adaptors? Proc Natl Acad Sci U S A. 1993;90:4723–4727. [PubMed] 71. Rodin S, Rodin A, Ohno S. The presence of codon-anticodon pairs in the acceptor stem of tRNAs. Proc Natl Acad Sci U S A. 1996;93:4537–4542. [PubMed] 72. Rodin SN, Rodin AS. Origin of the genetic code: first aminoacyl-tRNA systheses could replace isofunctional ribozymes when only the second base of codons was established. DNA Cell Biol. 2006;25:365–375. [PubMed] 73. Trifonov EN, Bettecken T. Sequence fossils, triplet expansion, and reconstruction of earliest codons. Gene. 1997;205:1–6. [PubMed] 74. Woese CR. On the evolution of the genetic code. Proc Natl Acad Sci U S A. 1965;54:1546–1552. [PubMed] 75. Woese CR, Dugre DH, Dugre SA, Kondo M, Saxinger WC. On the fundamental nature and evolution of the genetic code. Cold Spring Harbor Symp Quant Biol. 1966;31:723–736. [PubMed] 76. Woese CR. Evolution of the genetic code. Naturwissenschaften. 1973;60:447–459. [PubMed] 77. Sjöström M, Wold S. A multivariate study of the relationship between the genetic code and the physical-chemical properties of amino acids. J Mol Evol. 1985;22:272–277. [PubMed] 78. Haig D, Hurst LD. A quantitative measure of error minimization in the genetic code. J Mol Evol. 1991;33:412–417. [PubMed] 79. Szathmáry E. The origin of the genetic code: amino acids as cofactors in an RNA world. Trends Genet. 1999;15:223–229. [PubMed] 80. Schimmel P, Giegé R, Moras D, Yokoyama S. An operational RNA code for amino acids and possible relationship to genetic code. Proc Natl Acad Sci U S A. 1993;90:8763–8768. [PubMed] 81. Woese CR, Olsen GJ, Ibba M, Söll D. Aminoacyl-tRNA synthetases, the genetic code, and the evolutionary process. Microbiol Mol Biol Rev. 2000;64:202–236. [PubMed] 82. Ribas de Pouplana L, Turner RJ, Steer BA, Schimmel P. Genetic code origins: tRNAs older than their synthetases? Proc Natl Acad Sci U S A. 1998;95:11295–11300. [PubMed] 83. Hohn MJ, Park H-S, O'Donoghue P, Schnitzbauer M, Söll D. Emergence of the universal genetic code imprinted in an RNA record. Proc Natl Acad Sci U S A. 2006;103:18095–18100. [PubMed] 84. O'Donoghue P, Luthey-Schultem Z. On the evolution of structure in aminoacyl-tRNA synthetases. Microbiol Mol Biol Rev. 2003;67:550–573. [PubMed] 85. Rodin SN, Rodin AS. On the origin of the genetic code: signatures of its primordial complementarity in tRNAs and aminoacyl-tRNA synthetases. Heredity. 2008;100:341–355. [PubMed] 86. Billoud B, Guerrucci MA, Masselot M, Deutsch JS. Cirripede phylogeny using a novel approach: molecular morphometrics. Mol Biol Evol. 2000;17:1435–1445. [PubMed] 87. Collins LJ, Moulton V, Penny D. Use of RNA secondary structure for studying the evolution of RNase P and RNase MRP. J Mol Evol. 2000;51:194–2004. [PubMed] 88. Swain TD, Taylor DJ. Structural rRNA characters support monophyly of raptorial limbs and paraphyly of limb specialization in water fleas. Proc R Soc London B. 2003;270:887–896. 89. Sheppard K, Söll D. On the evolution of the tRNA-dependent amidotransferases, GatCAB and GatDE. J Mol Biol. 2008;377:831–844. [PubMed] 90. Ancel LW, Fontana W. Plasticity, evolvability, and modularity in RNA. J Exp Zool (Mol Dev Evol). 2000;288:242–283. 91. Steinberg S, Misch A, Sprinzl M. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 1993;21:3011–3015. [PubMed] 92. Sprinzl M, Vassilenko KS. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 2005;33:D139–D140. [PubMed] 93. Kluge AG. A concern for evidence and a phylogenetic hypothesis of relationships among Epicrates (Boidae, Serpentes). Syst Zool. 1989;38:7–25. 94. Kluge AG, Wolf AJ. Cladistics: What's in a word? Cladistics. 1993;9:183–199. 95. Nixon KC, Carpenter JM. On simultaneous analysis. Cladistics. 1996;12:221–241. 96. Le S-Y, Maizel JV. A method for assessing the statistical significance of RNA folding. J Theor Biol. 1989;138:495–510. [PubMed] 97. Stegger G, Hofman H, Fortsch J, Gross HJ, Randles JW, et al. Conformational transitions in viroids and virusoids: comparison of results from energy minimization algorithm and from experimental data. J Biomol Struct Dynam. 1984;2:543–571. 98. Higgs PG. RNA secondary structure: a comparison of real and random sequences. J Phys I France. 1993;3:43–59. 99. Higgs PG. Thermodynamic properties of transfer RNA: a computational study. J Chem Soc Faraday Trans. 1995;91:2531–2540. 100. Schultes EA, Hraber PT, LaBean TH. Estimating the contributions of selection and self-organization in RNA secondary structure. J Mol Evol. 1999;49:76–83. [PubMed] 101. Steffens W, Digby D. mRNA have greater negative folding free energies than shuffled or codon choice randomized sequences. Nucleic Acids Res. 1999;27:1578–1584. [PubMed] 102. Gultyaev PA, van Batenburg FHD, Pleij CWA. Selective pressures on RNA hairpins in vivo and in vitro. J Mol Evol. 2002;54:1–8. [PubMed] 103. Forsdyke DR. Calculation of folding energies of single-stranded nucleic acid sequences: Conceptual issues. J Theor Biol. 2007;248:745–753. [PubMed] 104. Schultes EA, Spasic A, Mohanty U, Bartel DP. Compact and ordered collapse of randomly generated RNA sequences. Nat Struct Mol Biol. 2005;12:1130–1136. [PubMed] 105. Hecht MH, Das A, Go A, Bradley LH, Wei YN. De novo proteins from designed combinatorial libraries. Protein Sci. 2004;13:1711–1723. [PubMed] 106. Schultes EA, LaBean TH, Hraber PT. A parameterization of RNA sequence space. Complexity. 1999;4:61–67. 107. Gladyshev GP, Ershov YA. Principles of the thermodynamics of biological systems. J Theor Biol. 1982;94:301–343. [PubMed] 108. Schrödinger E. What is life? Cambridge: Cambridge University Press; 1944. p. 91. 109. Schneider ED, Kay JJ. Life as a manifestation of the second law of thermodynamics. Math Comp Modeling. 1994;19:25–48. 110. Schneider ED, Kay JJ. Complexity and thermodynamics: towards a new ecology. Futures. 1994;26:626–647. 111. Wagner A. Robustness and evolvability: a paradox resolved. Proc R Soc Lond B. 2008;275:91–100. 112. Higgs PG. RNA secondary structure: physical and computational aspects. Quart Rev Biophys. 2000;33:199–253. 113. Fontana W. Modelling ‘evo-devo’ with RNA. BioEssays. 2002;24:1164–1177. [PubMed] 114. Schultes EA, Bartel DP. One sequence, two ribozymes: implications for the emergence of new ribozyme folds. Science. 2000;289:448–452. [PubMed] 115. Swofford DL. PAUP*: Phylogenetic Analysis Using Parsimony (*and other methods), version 4.0b10. 2002. Sinauer Associates, Sunderland, MA. 116. Steel M, Penny D. Parsimony, likelihood, and the role of models in molecular phylogenetics. Mol Biol Evol. 2000;17:839–850. [PubMed] 117. Steel M, Penny D. Two further links between MP and ML under the poisson model. Appl Math Lett. 2004;17:785–790. 118. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–791. 119. Hillis DM, Huelsenbeck JP. Signal, noise, and reliability in molecular phylogenetic analyses. J Hered. 1992;83:189–195. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
J Mol Biol. 1968 Dec; 38(3):367-79.
[J Mol Biol. 1968]Proc Natl Acad Sci U S A. 1975 May; 72(5):1909-12.
[Proc Natl Acad Sci U S A. 1975]Orig Life Evol Biosph. 1995 Dec; 25(6):549-64.
[Orig Life Evol Biosph. 1995]J Mol Evol. 1999 Feb; 48(2):168-77.
[J Mol Evol. 1999]J Mol Evol. 1997 May; 44(5):473-6.
[J Mol Evol. 1997]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]Cold Spring Harb Symp Quant Biol. 1987; 52():17-27.
[Cold Spring Harb Symp Quant Biol. 1987]Nature. 2005 Feb 10; 433(7026):633-8.
[Nature. 2005]Mol Biol Evol. 2002 Oct; 19(10):1645-55.
[Mol Biol Evol. 2002]J Mol Evol. 2007 Oct; 65(4):425-36.
[J Mol Evol. 2007]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]Cold Spring Harb Symp Quant Biol. 1987; 52():17-27.
[Cold Spring Harb Symp Quant Biol. 1987]Mol Biol Evol. 2002 Oct; 19(10):1645-55.
[Mol Biol Evol. 2002]Nature. 2005 Feb 10; 433(7026):633-8.
[Nature. 2005]J Mol Evol. 2007 Oct; 65(4):425-36.
[J Mol Evol. 2007]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]J Mol Evol. 2005 Oct; 61(4):524-30.
[J Mol Evol. 2005]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]Syst Biol. 1999 Sep; 48(3):413-35.
[Syst Biol. 1999]Syst Biol. 2000 Dec; 49(4):671-85.
[Syst Biol. 2000]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]J Mol Evol. 2005 Oct; 61(4):524-30.
[J Mol Evol. 2005]Cold Spring Harb Symp Quant Biol. 1987; 52():17-27.
[Cold Spring Harb Symp Quant Biol. 1987]Biochemistry. 1989 Feb 7; 28(3):980-8.
[Biochemistry. 1989]Nature. 2005 Feb 10; 433(7026):633-8.
[Nature. 2005]Mol Biol Evol. 2002 Oct; 19(10):1645-55.
[Mol Biol Evol. 2002]J Mol Evol. 2007 Oct; 65(4):425-36.
[J Mol Evol. 2007]Proc Natl Acad Sci U S A. 1991 May 1; 88(9):3758-61.
[Proc Natl Acad Sci U S A. 1991]J Bacteriol. 1991 Mar; 173(5):1810-2.
[J Bacteriol. 1991]J Mol Biol. 1973 Nov 15; 80(4):837-55.
[J Mol Biol. 1973]Proc Natl Acad Sci U S A. 1987 May; 84(10):3156-60.
[Proc Natl Acad Sci U S A. 1987]FEMS Microbiol Rev. 1999 Jun; 23(3):335-51.
[FEMS Microbiol Rev. 1999]Nat Chem Biol. 2007 Jan; 3(1):29-35.
[Nat Chem Biol. 2007]Nature. 1988 Feb 25; 331(6158):723-5.
[Nature. 1988]Trends Biochem Sci. 1991 Dec; 16(12):463-7.
[Trends Biochem Sci. 1991]Nucleic Acids Res. 2007; 35(15):4952-63.
[Nucleic Acids Res. 2007]Genome Biol. 2006; 7(10):R94.
[Genome Biol. 2006]Annu Rev Biochem. 1996; 65():83-100.
[Annu Rev Biochem. 1996]Annu Rev Biochem. 1990; 59():111-27.
[Annu Rev Biochem. 1990]Trends Biochem Sci. 1991 Dec; 16(12):463-7.
[Trends Biochem Sci. 1991]Proc Natl Acad Sci U S A. 2006 Dec 12; 103(50):18923-7.
[Proc Natl Acad Sci U S A. 2006]Genome Biol. 2005; 6(6):221.
[Genome Biol. 2005]Trends Biochem Sci. 1991 Dec; 16(12):463-7.
[Trends Biochem Sci. 1991]Science. 2002 May 24; 296(5572):1459-62.
[Science. 2002]Cold Spring Harb Symp Quant Biol. 1966; 31():11-24.
[Cold Spring Harb Symp Quant Biol. 1966]J Mol Biol. 1968 Dec; 38(3):367-79.
[J Mol Biol. 1968]Proc Natl Acad Sci U S A. 1993 May 15; 90(10):4723-7.
[Proc Natl Acad Sci U S A. 1993]Proc Natl Acad Sci U S A. 1965 Dec; 54(6):1546-52.
[Proc Natl Acad Sci U S A. 1965]Naturwissenschaften. 1973 Oct; 60(10):447-59.
[Naturwissenschaften. 1973]Cold Spring Harb Symp Quant Biol. 1966; 31():723-36.
[Cold Spring Harb Symp Quant Biol. 1966]J Mol Evol. 1985; 22(3):272-7.
[J Mol Evol. 1985]J Mol Evol. 1991 Nov; 33(5):412-7.
[J Mol Evol. 1991]Cold Spring Harb Symp Quant Biol. 1966; 31():723-36.
[Cold Spring Harb Symp Quant Biol. 1966]Trends Genet. 1999 Jun; 15(6):223-9.
[Trends Genet. 1999]Proc Natl Acad Sci U S A. 1993 May 15; 90(10):4723-7.
[Proc Natl Acad Sci U S A. 1993]Gene. 1997 Dec 31; 205(1-2):1-6.
[Gene. 1997]Proc Natl Acad Sci U S A. 1993 Oct 1; 90(19):8763-8.
[Proc Natl Acad Sci U S A. 1993]Microbiol Mol Biol Rev. 2000 Mar; 64(1):202-36.
[Microbiol Mol Biol Rev. 2000]Proc Natl Acad Sci U S A. 1998 Sep 15; 95(19):11295-300.
[Proc Natl Acad Sci U S A. 1998]Proc Natl Acad Sci U S A. 2006 Nov 28; 103(48):18095-100.
[Proc Natl Acad Sci U S A. 2006]Microbiol Mol Biol Rev. 2003 Dec; 67(4):550-73.
[Microbiol Mol Biol Rev. 2003]Cold Spring Harb Symp Quant Biol. 1966; 31():11-24.
[Cold Spring Harb Symp Quant Biol. 1966]Heredity. 2008 Apr; 100(4):341-55.
[Heredity. 2008]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]Trends Genet. 2007 Jan; 23(1):26-33.
[Trends Genet. 2007]Mol Biol Evol. 2000 Oct; 17(10):1435-45.
[Mol Biol Evol. 2000]Proc Natl Acad Sci U S A. 1998 Sep 15; 95(19):11295-300.
[Proc Natl Acad Sci U S A. 1998]Nucleic Acids Res. 1993 Jul 1; 21(13):3011-5.
[Nucleic Acids Res. 1993]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D139-40.
[Nucleic Acids Res. 2005]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]J Theor Biol. 1989 Jun 22; 138(4):495-510.
[J Theor Biol. 1989]J Mol Evol. 2002 Jan; 54(1):1-8.
[J Mol Evol. 2002]J Theor Biol. 2007 Oct 21; 248(4):745-53.
[J Theor Biol. 2007]Nat Struct Mol Biol. 2005 Dec; 12(12):1130-6.
[Nat Struct Mol Biol. 2005]J Theor Biol. 1982 Jan 21; 94(2):301-43.
[J Theor Biol. 1982]Science. 2000 Jul 21; 289(5478):448-52.
[Science. 2000]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]Trends Genet. 2007 Jan; 23(1):26-33.
[Trends Genet. 2007]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]Mol Biol Evol. 2000 Jun; 17(6):839-50.
[Mol Biol Evol. 2000]J Hered. 1992 May-Jun; 83(3):189-95.
[J Hered. 1992]PLoS Comput Biol. 2008 Mar 7; 4(3):e1000018.
[PLoS Comput Biol. 2008]