![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright Sun, Caetano-Anollés. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Evolutionary Patterns in the Sequence and Structure of Transfer RNA: Early Origins of Archaea and Viruses Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America Philip E. Bourne, Editor University of California San Diego, United States of America * E-mail: gca/at/uiuc.edu Conceived and designed the experiments: F-JS GC-A. Performed the experiments: F-JS GC-A. Analyzed the data: F-JS GC-A. Wrote the paper: F-JS GC-A. Received November 19, 2007; Accepted February 1, 2008. This article has been cited by other articles in PMC.Abstract Transfer RNAs (tRNAs) are ancient molecules that are central to translation. Since they probably carry evolutionary signatures that were left behind when the living world diversified, we reconstructed phylogenies directly from the sequence and structure of tRNA using well-established phylogenetic methods. The trees placed tRNAs with long variable arms charging Sec, Tyr, Ser, and Leu consistently at the base of the rooted phylogenies, but failed to reveal groupings that would indicate clear evolutionary links to organismal origin or molecular functions. In order to uncover evolutionary patterns in the trees, we forced tRNAs into monophyletic groups using constraint analyses to generate timelines of organismal diversification and test competing evolutionary hypotheses. Remarkably, organismal timelines showed Archaea was the most ancestral superkingdom, followed by viruses, then superkingdoms Eukarya and Bacteria, in that order, supporting conclusions from recent phylogenomic studies of protein architecture. Strikingly, constraint analyses showed that the origin of viruses was not only ancient, but was linked to Archaea. Our findings have important implications. They support the notion that the archaeal lineage was very ancient, resulted in the first organismal divide, and predated diversification of tRNA function and specificity. Results are also consistent with the concept that viruses contributed to the development of the DNA replication machinery during the early diversification of the living world. Author Summary The origins of the three major cellular lineages of life—Archaea, Bacteria, and Eukarya—and of viruses have been shrouded in mystery. In this study, we focus on transfer RNA, an ancient nucleic acid molecule that takes center stage in the process of protein biosynthesis and can be found everywhere in life. In a process that reconstructs history from molecular sequence and structure and at the same time forces molecules belonging to lineages into groups, we tested alternative hypotheses of origin and established when major organismal lineages appeared in evolution. Remarkably, timelines showed that Archaea was the most ancient lineage on earth and that viruses originated early in the archaeal lineage. Our findings unroot the universal tree of life, and, for the first time, provide evidence for an evolutionary origin of viruses. Introduction Transfer RNA (tRNA) molecules are central to the entire translation process. They interact with the ribosomal RNA (rRNA) subunits as they are being ratcheted through the center of the ribosome [1],[2]. Their acceptor arms charge specific amino acids through the activity of cognate aminoacyl-tRNA synthetases, while triplets of bases on their ‘anticodon’ arms recognize complementary ‘codon’ sequences in messenger RNA. These and many other molecular interactions define the identities and functions of these tRNA adaptors and establish a genetic code that translates nucleic acid into protein information in the cell. The structural make-up of tRNA is therefore fundamental to our understanding of how the biosynthetic machinery was set up into place in an emerging protein and organismal world. tRNAs are clearly ancient molecules [3] and they have been used profusely to study the evolution of ancient life [4]–[8]. The identity and function of tRNAs are fundamentally delimited by the structure of the molecules, and structure is more conserved than sequence. In fact, we recently showed that tRNA structure carries deep phylogenetic signal and can be used directly to reconstruct evolutionary history [9]. However, understanding phylogenetic trees is challenging because tRNA evolution embeds a history of recruitment in which structures gain or co-opt new identities and functions or takeover established ones. The hierarchical branching patterns of the universal tree of life portray the natural history of the living world. The current accepted universal tree proposes a tripartite world ruled by three superkingdoms, Archaea, Bacteria, and Eukarya [10]. This view stems fundamentally from the study of the small subunit of rRNA, a molecule that is also ancient and central to translation. The rise of evolutionary genomics with an analysis of entire repertoires of nucleic acid and protein molecules supports this tripartite scheme [11],[12]. However, the root of the universal tree remains controversial and so is the nature of the universal ancestor of all life that this root defines [13],[14]. We recently embarked on a systematic and global study of evolution of domain structure and organization in proteins [15],[16] (Wang and Caetano-Anollés, submitted). Structures were assigned to protein sequences in hundreds of completely sequenced genomes and a structural census of protein domains used to generate phylogenomic trees of protein architectures. The evolutionary genomic analysis defined a universal ancestor that was eukaryotic-like and had a relatively complex proteome [16]. It also established that the archaeal lineage was the most ancient and originated from reductive evolutionary tendencies in the use of protein architectures. In order to explore if similar phylogenetic signatures were present in tRNA, we apply a well-established cladistic method [17],[18] that embeds structure directly into phylogenetic analysis [19]. The method involves identifying features characteristic of the secondary structure of RNA molecules, coding these features as linearly ordered multi-state characters, and using them to build phylogenetic trees with optimal tree search methods. The strategy has been used to reconstruct a tripartite tree of life from rRNA structure [17], trace evolution of rRNA in ribosomes [18], study the evolution of closely related phytopathogenic fungi [17] or distantly related members of the grass family [20], and explore the structural origin and evolution of retrotransposons in eukaryotes [21]. We also used the approach to study the evolution of the major structural and functional components of tRNA, establishing that tRNA molecules originated in the acceptor arm and providing further support to the ‘genomic tag’ hypothesis [9]. Here we reconstruct global phylogenetic trees using information embedded in both the sequence and structure of tRNA molecules. As we have shown previously (Sun and Caetano-Anollés, submitted), the intrinsically rooted trees revealed that tRNA with long variable arms (known as class II or type II tRNA) coding for amino acids Sec, Ser, Tyr, and Leu were ancient. However, trees failed to show clear patterns related to tRNA function, an observation that underscores the importance of recruitment and phylogenetic constraint (factors that restrict the acquisition of phenotypic traits or functions in lineages) in tRNA evolution. In order to sort out these confounding processes we built trees while forcing monophyletic groupings of taxa (sets that share a common ancestor) to test alterative hypotheses or establish evolutionary timelines of structural, functional, or organismal diversification. This strategy (known as constraint analysis in phylogenetics) provided an unanticipated window into early evolution of life. Results Phylogenetic analyses of the combined dataset of sequence and structure of 571 tRNAs produced most parsimonious trees that were 10,083 steps in length and were intrinsically rooted (Figure 1
In order to uncover deep phylogenetic signals and test alternative evolutionary hypotheses we forced groups of tRNAs that shared a same organismal origin (molecules from each superkingdom of life or viruses) into monophyly using constraint analyses. We then recorded the length of the most parsimonious trees that were obtained and the number of additional steps (S) that were needed to force the constraint. This exercise was generally done with or without forcing class I and II tRNA molecules into separate groups, but overall results were congruent. Constraints related to the diversification of the organismal world (Table 1) consistently showed Archaea as the ancestral group (i.e., forcing archaeal tRNAs into monophyly was always associated with low S), followed by viruses, Eukarya, and Bacteria (with S increasing in that order) (Figure 2
We also explored the origins of viruses by constraining molecules from each individual superkingdom or viruses into monophyletic groups, together [e.g., (AV)] or separately [e.g., ((A)(V))] (Table 2). The most parsimonious scenario always linked the origins of viruses to the archaeal lineage, with S values matching those defining the organismal timeline (Figure 2
Finally, we constrained trees according to isoacceptor group and then according to organismal group, or vice versa, with or without constraining tRNA categories (Table 3). A scenario in which organismal (superkingdom) diversification predated tRNA functional divergence was always more parsimonious (S = 2,338–2,481) than one where functional divergence predated organismal diversification (S = 2,415–2,534).
Since constraint analyses could be biased by unequal rates of evolution, we calculated average number of character change per branch in consensus trees generated from partitioned data matrices (Table 4). An analysis of variance (ANOVA) showed values were not significantly different in the three superkindoms of life and viruses (p>0.05). Similarly, we did not find differences when random trees were compared (not shown).
Discussion Deep evolutionary patterns embedded in tRNA phylogenies In order to uncover evolutionary patterns related to organismal diversification, we first generated rooted phylogenetic trees using information embedded in the structure and sequence of tRNA (Figure 1 Two fundamental assumptions support the analysis. First, we assume tRNA structures acquired new identities and functions as the genetic code expanded, and that different structures were co-opted for the task in different lineages and different functional contexts. This assumption seems reasonable. Recruitment processes are common in evolution of macromolecules. In cellular metabolism, for example, enzymes are often recruited into different pathways to perform new enzymatic functions [16],[30],[31]. Moreover, structural diversification of tRNA appeared to have predated organismal diversification [32] (Sun and Caetano-Anollés, submitted) and the functions and identities attached to present-day tRNA structures probably developed in lineages and were shuffled by horizontal gene transfer. Second, we assume old tRNA structures developed or recruited new functions (co-options) more often than new tRNA structures acquired old functions (takeovers). This assumption is also reasonable and appears to be supported by our studies of enzyme recruitment in metabolism (Kim et al., ms. in preparation). Our trees show several instances of takeovers, in which modern class I structures lacking the long variable arms took over ancient amino acid charging functions associated with class II structures (Figure 1 We also assume phylogenies are free from systematic errors and the confounding effects of mutational saturation, long branch attraction artifacts, and unequal rates of evolution along branches of the trees [11]. However, most branching events in these phylogenies happened a relatively long time ago and phylogenetic analyses of ancient molecules carry all the problems of deep reconstruction [33]. While the impact of some of these effects diminishes when using multi-state characters in tRNA structure [34],[35], different rates of change could alter the coalescense of lineages and our results. For example, increased rates of change known to occur in rapidly evolving viral molecules could increase expected S values, making the viral lineage artificially younger. Nevertheless, an analysis of rates of change in consensus and random trees derived from partitioned data matrices showed that evolutionary rates of tRNAs in the three superkingdoms of life or viruses were not significantly different in our analysis (Table 4). The fact that evolutionary rates in the four lineages were similar decreases the impact of unequal rates of evolution and underscores the conserved nature of tRNA structure when compared to sequence. Similarly, problems of statistical consistency related to long branch attraction could bias the reconstruction of the tRNA tree. These artifacts, which are rather common in sequence analysis, result from unequal rates of variation in branches and the interplay of short and long branches in a tree [36]. They are however not so much related to branch length (which in our analyses do not vary considerably; Table 4) but to changes of a same character state occurring preferentially in long branches, forcing the tree-building method to join them artificially. However, the probability of these covarying homoplasies is known to decrease with increases in character states, as with the multi-state structural characters of this study [34], and when branches are separated by increased taxon sampling [37]–[39]. Consequently, large trees as the tRNA trees we have reconstructed from sequence and structure in this study should be considerably less prone to consistency problems [38],[39] than the four-taxon statements related to sequences originally used to define them [36], especially if they involve multiple character states depicting structure. Timelines of organismal diversification and the birth of the tripartite world We constrained tRNA groups according to organismal origin using different schemes and tested possible competing and non-competing hypotheses describing timelines of organismal diversification and possible topologies of the universal tree of life (Figure 2 The timeline of organismal diversification provides evidence that the lineage of Archaea segregated from an ancient community of ancestral organisms and established the first organismal divide. The scenario of organismal diversification described above is congruent with our recent phylogenomic analyses of protein structure [16] and domain organization (Wang and Caetano-Anollés, submitted) in hundreds of completely sequenced genomes. The result is also congruent with recent studies that have used tRNA paralog (alloacceptor) clustering as a measure of ancestry of tRNA genotypes [40] and multiple lines of evidence [41],[42] to suggest a Methanopyrus-proximal root of life. Although it is popularly accepted that the universal tree of life based on molecular phylogenies is rooted in the prokaryotes and that Archaea and Eukarya are sister groups, these recent results together with those presented in this paper offer compelling arguments in favor of an early appearance of the Archaea. Our evolutionary timeline is also remarkable in that it identifies three epochs in the evolution of the organismal world that were analogous to those proposed earlier [16]: (1) an architectural diversification epoch in which tRNA molecules diversified their structural repertoires (light green areas in Figure 2 The evolutionary patterns observed in timelines appeared consistently in the absence or presence of class I or class II tRNA structural constraints (Figure 2 The early evolutionary appearance of viruses The organismal timeline inferred from tRNA sequence and structure showed Archaea was the most ancient superkingdom but established that viruses were also ancient. Viruses are relatively simple living entities and in many cases maintain a regular structure. They have long been considered fragments of cellular genomes and not living organisms and were generally excluded from consideration in evolutionary scenarios of the tripartite world, despite being important components of the biosphere. The importance of viruses and their potential roles in early cellular evolution were recently reevaluated [43]. A comparative analysis of structure and function, including virion assembly principles, suggested both RNA and DNA viruses may have been more ancient than previously thought, possibly even more ancient than the common ancestor of life [43]. However, they probably had a polyphyletic origin because structurally and functionally related viruses infect hosts in different lineages and even in different superkingdoms of the universal tree [44],[45]. It is therefore possible that viruses form lineages and share a common ancestor, and that these lineages extend from the root to all branches in the tree of life. For example, the overall similarity of viral structures, such as coat protein folds enclosing nucleoprotein filaments, suggests a common mechanism for their appearance. The construction of phylogenies addressing the questions of origin and evolution of viruses in the context of the three superkingdoms are now possible with the increasing number of sequenced genomes of viral origin. In fact, comparative genomic analyses suggested viruses could be the source of new proteins for cells [46]. Many DNA informational proteins encoded today in cellular genomes probably originated in the viral world and were later transferred into the three cellular superkingdoms. Forterre recently proposed that DNA itself appeared in ancestral viral lineages [47],[48]. He later on extended this proposal by suggesting that the DNA replication machineries of each superkingdom originated from three different ancestral viral lineages [49]. In his latest proposal, each cellular superkingdom originated independently from the fusion of an RNA-based cell and a large DNA virus [50]. In order to establish if the origin of the viruses was linked to one or more of the three superkingdoms of life we constrained viral and individual superkingdom tRNAs into competing monophyletic relationships (Table 3). Remarkably, most parsimonious constraints indicated viruses that associate with Eukarya and Bacteria had an origin in the archaeal lineage (Figure 2 The origin of viruses is generally complex and may involve more than one mechanism [56]. Although several major classes of viruses are monophyletic, a common viral ancestry has not been evident [57]. Sequence analysis of viral genomes with various lengths (ranging from a few to hundreds of kilobases and containing several to hundreds of genes) and types (ranging from double-stranded DNA to single-stranded RNA) failed to reveal a common origin, suggesting instead polyphyletic (multiple) origins. However, a focus on sequence alone could be misleading. The viruses as a group contain more structural genomic diversity than cellular organisms such as plants, animals, or bacteria put together, and their sequences are fast evolving [58]. This could erase deep evolutionary history and confound analysis. Moreover, viruses also share many common features (e.g., genes coding for key proteins involved in viral replication and morphogenesis, parasitic nature of the replication mechanisms) not shared by any kind of cellular organisms [57], and these could be used to claim monophyly. This is especially true if the proposed ancient viral world existed [57]. This world harbored viral genes that retained their identity throughout the entire history of life. By this definition, the primordial pool of primitive genetic elements would be the ancestors of modern cellular and viral genes. This means that most, if not all, modern viruses were derived from elements that belonged to the primordial genetic pool, perhaps representing primitive form of self replicating DNA and precursor of life [59]. We end by noting that due to the small number of viral sequences sampled in our study, the conclusions drawn here should be taken with caution. However, a separate undergoing study analyzing a comprehensive dataset of tRNA sequences and structures but lacking information on base modifications support the evolutionary patterns presented in this study (Ospina, Sun, and Caetano-Anollés, unpublished). Materials and Methods Data Part 2 (compilation of tRNA sequences) of the Bayreuth tRNA Database (http://www.staff.uni-bayreuth.de/btc914/search/index.html; September 2004 edition; Table S1) contains a total of 571 tRNA sequences at RNA level with cloverleaf secondary structures. The structures were derived by comparative analysis using an alignment that is most compatible with tRNA phylogenies and known 3-dimensional models of structure [60],[61]. The composition of part 2 was not pruned in our analyses and represents the most complete tRNA dataset currently available that contains information about base modifications. A total of 42 structural characters describing geometrical features of tRNA molecules (Table S2) were scored, establishing character homology by the relative position of substructures in the cloverleaf [9] (Sun and Caetano-Anollés, submitted). The length (the total number of bases or base pairs) and number of the substructures were coded as character states and were defined in alphanumerical format with numbers from 0 to 9 and letters from A to F. The minimum state (0) was given to missing substructures. We followed the Bayreuth database to treat the modified bases as deviations from the cloverleaf model. They were not allowed to establish canonical Watson-Crick pairs. Each helical stem region was scored as two complementary sequences (5′ and 3′ sides). The dataset was then partitioned into four subsets categorized by molecules belonging to each of the three superkingdoms or viruses/bacteriophages. In this study, a “total evidence” approach [62],[63] (also called “simultaneous analysis” [64]) was invoked in phylogenetic analysis to combine both sequence and structure data of the complete (571 tRNAs) and partitioned matrices. The goal of this analysis was to provide stronger support for the phylogenetic groupings recovered from analyses of structural data. Phylogenetic analysis We treated structural features in molecules as phylogenetic multi-state characters with character states transforming according to linearly ordered and reversible pathways. Character state transformations were polarized by assuming an evolutionary tendency towards molecular order. Characters were analyzed using maximum parsimony (MP), a popular phylogenetic optimization method that searches for solutions that require the least amount of change. It is appropriate to treat geometrical features as linearly ordered characters because RNA structures change in discrete manner by addition or removal of nucleotide units. This causes gradual extension or contraction of geometrical features. Although insertion and deletion are also possible, they are more costly. The validity of character argumentation has been discussed in detail elsewhere [9],[17],[18],[20]. A considerable body of evidence supports our polarization hypothesis depicting generalized trends applied to the structure of molecules: (i) the study of extant and randomized sequences shows that evolution enhances conformational order and diminishes conflicting molecular interactions over those intrinsically acquired by self-organization [20], [65]–[70], (ii) a molecular tendency towards order and stability has been experimentally verified using thermodynamic principles generalized to account for non-equilibrium conditions [71]; (iii) a large body of theoretical evidence supports the structural repertoire of evolving sequences from energetic and kinetic perspectives [72]–[74], with some important predictions confirmed experimentally [75], (iv) phylogenies generated using geometrical and statistical structural characters are congruent [9],[20],[21], and (v) the reconstructions of rooted trees generated from sequence, structure, and genomic rearrangements at different taxonomical levels are congruent [17], [18], [20], [21], [76]–[78]. Phylogenetic trees were polarized by distinguishing ancestral states as those thermodynamically more stable. This results in reversible character transformation sequences that are directional and show asymmetry between gains and losses. Maximum and minimum character states were defined as the ancestral states for structures that stabilize (stems, modified bases, and G:U base pairs) and destabilize tRNAs (bulges, hairpin loops, and other unpaired regions), respectively. All data matrices were analyzed using equally weighted MP as the optimality criterion in PAUP* v. 4.0 [79]. Because MP may outperform maximum likelihood (ML) approaches [34],[35], the use of MP is particularly appropriate for our analysis. ML is precisely MP when character changes occur with equal probability but rates vary freely between characters in each branch and when using large multi-step character state spaces (decreasing the likelihood of revisiting a same character state on the underlying tree). This makes MP statistically consistent. Reconstructions of MP trees were sought using heuristic search strategies; 1,000 heuristic searches were initiated using random addition starting taxa, with tree bisection reconnection (TBR) branch swapping and the MulTrees option selected. One shortest tree was saved from each search. Hypothetical ancestors were included in the searches for the most parsimonious trees using the Ancstates command. BS values [80] were calculated from 105 replicate analyses using “fast” stepwise addition of taxa in PAUP*. The g1 statistic of skewed tree length distribution calculated from 104 random parsimony trees was used to assess the amount of nonrandom structure in the data [81]. Constraint analysis Constraint analysis restricts the search of optimal trees to pre-specified tree topologies defining specific monophyletic groups, and was used here to test alternative or compare non-mutually exclusive hypotheses. The number of additional steps (S) required to force (constrain) particular taxa into a monophyletic group was examined using the “enforce topological constraint” option of PAUP*. The additional steps define an evolutionary distance that can be use to test alternative phylogenetic hypotheses or to compare hypotheses that are not mutually exclusive. The latter approach was used to construct evolutionary timelines, in which lower S values corresponded to ancient tRNAs, a trend that was derived from the rooted trees (and embedded assumptions of polarization). Constraint analyses were conducted based on amino acid specificity or grouping of molecules by organismal superkingdoms or viruses. Table S1 Taxonomic distributions of the 571 tRNA molecules examined in the phylogenetic study. (0.04 MB DOC) Click here for additional data file.(1.1K, doc) Table S2 Structural characters and their statistics (range and mean ± standard deviation) used in the phylogenetic analyses of 571 tRNA molecules. (0.10 MB DOC) Click here for additional data file.(1.1K, doc) Acknowledgments We thank Hee Shin Kim, Ajith Harish, Minglei Wang, and Jay E. Mittenthal for helpful discussions. Any opinions, findings, and conclusions and recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the funding agencies. Footnotes The authors have declared that no competing interests exist. This work was supported by National Science Foundation grant MCB-0343126 and the University of Illinois Critical Research Initiative. References 1. Yusupov MM, Yusupov GZ, Baucom A, Lieberman K, Earnest TN, Cate JHD, Noller HF. Crystal structure of the ribosome at 5.5 Å resolution. Science. 2001;292:883–895. [PubMed] 2. Selmer M, Dunham CM, Murphy FV, IV, Weixlbaumer A, Petry S, Kelley AC, Weir JR, Ramakrishnan V. Structure of the 70S ribosome complexed with mRNA and tRNA. Science. 2006;313:1935–1942. [PubMed] 3. Dick TP, Schamel WWA. Molecular evolution of transfer RNA from two precursor hairpins: implications for the origin of protein synthesis. J Mol Evol. 1995;41:1–9. [PubMed] 4. Wong JT. A coevolution theory of the genetic code. Proc Natl Acad Sci U S A. 1975;72:1909–1912. [PubMed] 5. Eigen M, Winkler-Oswatitsch R. Transfer-RNA, an early gene? Naturwissenschaften. 1981;68:282–292. [PubMed] 6. Eigen M, Winkler-Oswatitsch R. Transfer-RNA: the early adaptor. Naturwissenschaften. 1981;68:217–228. [PubMed] 7. Fitch WM, Upper K. The phylogeny of tRNA sequences provides evidence for ambiguity reduction in the origin of the genetic code. Cold Spring Harbor Symp Quant Biol. 1987;52:759–767. [PubMed] 8. Di Giulio M. The RNA world, the genetic code and the tRNA molecule. Trends Genet. 2000;16:17–18. [PubMed] 9. Sun F-J, Caetano-Anollés G. The origin and evolution of tRNA inferred from phylogenetic analysis of structure. J Mol Evol. 2008;66:21–35. [PubMed] 10. Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for domains Archaea, Bacteria and Eucarya. Proc Natl Acad Sci U S A. 1990;95:6854–6859. [PubMed] 11. Delsuc F, Brinkmann H, Philippe H. Phylogenomics and the reconstruction of the tree of life. Nat Rev Genet. 2005;6:361–375. [PubMed] 12. Doolittle RF. Evolutionary aspects of whole-genome biology. Curr Opin Struct Biol. 2005;15:248–253. [PubMed] 13. Woese CR. The universal ancestor. Proc Natl Acad Sci U S A. 1998;95:6854–6859. [PubMed] 14. Penny D, Poole A. The nature of the last universal common ancestor. Curr Opin Genet Dev. 1999;9:672–677. [PubMed] 15. Caetano-Anollés G, Caetano-Anollés D. An evolutionarily structured universe of protein architecture. Genome Res. 2003;13:1563–1571. [PubMed] 16. Wang M, Yafremava LS, Caetano-Anollés D, Mittenthal JE, Caetano-Anollés G. Reductive evolution of architectural repertoires in proteomes and the birth of the tripartite world. Genome Res. 2007;17:1572–1585. [PubMed] 17. Caetano-Anollés G. Evolved RNA secondary structure and the rooting of the universal tree of life. J Mol Evol. 2002;54:333–345. [PubMed] 18. Caetano-Anollés G. Tracing the evolution of RNA structure in ribosomes. Nucleic Acids Res. 2002;30:2575–2587. [PubMed] 19. Pollock D. The Zuckerkandl Prize: structure and evolution. J Mol Evol. 2003;56:375–376. 20. Caetano-Anollés G. Grass evolution inferred from chromosomal rearrangements and geometrical and statistical features in RNA structure. J Mol Evol. 2005;60:635–652. [PubMed] 21. Sun F-J, Fleurdepine S, Bousquet-Antonelli C, Caetano-Anollés G, Deragon J-M. Common evolutionary trends for tRNA-derived SINE RNA structures. Trends Genet. 2007;23:26–33. [PubMed] 22. Miller SL. Which organic compounds could have occurred on the prebiotic earth? Cold Spring Harbor Symp Quant Biol. 1987;52:17–27. [PubMed] 23. Trifonov EN. The triplet code from first principles. J Biomol Struc Dynamics. 2004;22:1–11. 24. Brooks DJ, Fresco JR, Lesk AM, Singh M. Evolution of amino acid frequencies in proteins over deep time: inferred order of introduction of amino acids into the genetic code. Mol Biol Evol. 2002;19:1645–1655. [PubMed] 25. Brooks DJ, Fresco JR, Singh M. A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor. Bioinformatics. 2004;20:2251–2257. [PubMed] 26. Jordan IK, Kondrashov FA, Adzhubei IA, Wolf YI, Koonin EV, Kondrashov AS, Sunyaev S. A universal trend of amino acid gain and loss in protein evolution. Nature. 2005;433:633–638. [PubMed] 27. Fournier GP, Gogarten JP. Signature of a primitive genetic code in ancient protein lineages. J Mol Evol. 2007;65:425–36. [PubMed] 28. Ashby WR. London: Chapman and Hall; 1956. An introduction to cybernetics. p. 295. 29. Doyle JA. Seed ferns and the origin of angiosperms. J Torrey Bot Soc. 2006;133:169–209. 30. Teichmann SA, Rison SCG, Thornton JM, Riley M, Gough J, Chothia C. Small-molecule metabolism: an enzyme mosaic. Trends Biotech. 2001;19:482–486. 31. Kim HS, Mittenthal JE, Caetano-Anollés G. MANET: Tracing evolution of protein architecture in metabolic networks. BMC Bioinfomatics. 2006;7:351. 32. Widmann J, Di Giulio M, Yarus M, Knight R. tRNA creation by hairpin duplication. J Mol Evol. 2005;61:524–530. [PubMed] 33. Mischler BD. The logic of the data matrix in phylogenetic analysis. In: Albert VA, editor. Parsimony, Phylogeny and Genomics. New York: Oxford University Press; 2005. pp. 57–70. 34. Steel M, Penny D. Parsimony, likelihood, and the role of models in molecular phylogenetics. Mol Biol Evol. 2000;17:839–850. [PubMed] 35. Steel M, Penny D. Two further links between MP and ML under the Poisson model. Appl Math Lett. 2004;17:785–790. 36. Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool. 1978;27:401–410. 37. Hendy MD, Penny D. A framework for the quantitative study of evolutionary trees. Syst Zool. 1989;38:297–309. 38. Hillis DM. Inferring complex phylogenies. Nature. 1996;282:130–131. [PubMed] 39. Graybeal A. Is it better to add taxa or characters to a difficult phylogenetic problem? Syst Biol. 1998;47:9–17. [PubMed] 40. Xue H, Tong K-L, Marck C, Grosjean H, Wong T-F. Transfer RNA paralogs: evidence for genetic code-amino acid biosynthesis coevolution and an archaeal root of life. Gene. 2003;310:59–66. [PubMed] 41. Xue H, Ng S-K, Tong K-L, Wong T-F. Congruence of evidence for a Methanopyrus-proximal root of life based on transfer RNA and aminoacyl-tRNA synthetase genes. Gene. 2005;360:120–130. [PubMed] 42. Wong T-F, Chen J, Mat W-K, Ng S-K, Xue H. Polyphasic evidence delineating the root of life and roots of biological domains. Gene. 2007;403:39–52. [PubMed] 43. Bamford DH. Do viruses form lineages across different domains of life? Res Microbiol. 2003;154:231–236. [PubMed] 44. Mindich L, Bamford DH. Lipid-containing bacteriophages, In: Calendar R, editor. The Bacteriophages. New York: Plenum; 1988. pp. 475–520. 45. Bamford DH, Caldentey J, Bamford JKH. Bacteriophage PRD1: A broad host range dsDNA Tectivirus with an internal membrane. Adv Virus Res. 1995;45:281–319. [PubMed] 46. Daubin V, Ochman H. Start-up entities in the evolution of new genes. Curr Opin Genet Dev. 2004;14:616–619. [PubMed] 47. Forterre P. The origin of DNA genomes and DNA replication proteins. Curr Opin Microbiol. 2002;5:525–532. [PubMed] 48. Forterre P, Filé J, Myllykallio H. Origin and evolution of DNA and DNA replication machineries. In: Ribas de Pouplana L, editor. The genetic code and the origin of life. New York: Springer; 2004. pp. 145–168. 49. Forterre P. The two ages of the RNA world, and the transition to the DNA world: a story of viruses and cells. Biochimie. 2005;87:793–803. [PubMed] 50. Forterre p. Three RNA cells for ribosomal lineages and three DNA viruses to replicate their genomes: A hypothesis for the origin of cellular domain. Proc Natl Acad Sci U S A. 2006;103:3669–3674. [PubMed] 51. Prangishvili D, Forterre P, Garrett RA. Viruses of the Archaea: a unifying view. Nature Rev Microbiol. 2006;4:837–848. [PubMed] 52. Luria SE, Darnell JE., Jr New York: John Wiley and Sons; 1967. General Virology. p. 512. 53. Bandea CI. A new theory on the origin and the nature of viruses. J Theor Biol. 1983;105:591–602. [PubMed] 54. Hendrix RW, Lawrence JG, Hatfull GF, Casjens S. The origins and ongoing evolution of viruses. Trends Microbiol. 2000;8:504–508. [PubMed] 55. Forterre P. The great virus comeback-from an evolutionary perspective. Res Microbiol. 2003;154:223–225. [PubMed] 56. Holmes EC, Drummond AJ. The evolutionary genetics of viral emergence. Curr Top Microbiol Immunol. 2007;315:51–66. [PubMed] 57. Koonin EV, Senkevich TG, Dolja VV. The ancient virus world and evolution of cells. Biol Direct. 2006;1:29. [PubMed] 58. Flinth SJ, Enquist LW, Racaniello VR, Skalka AM. Washington, DC: ASM Press; 2004. Principles of virology: molecular biology, pathogenesis, and control of animal viruses. 2nd edition. 59. Koonin EV. The Biological Big Bang model for the major transitions in evolution. Biol Direct. 2007;2:21. [PubMed] 60. Steinberg S, Misch A, Sprinzl M. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 1993;21:3011–3015. [PubMed] 61. Sprinzl M, Vassilenko KS. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 2005;33:D139–D140. [PubMed] 62. Kluge AG. A concern for evidence and a phylogenetic hypothesis of relationships among Epicrates (Boidae, Serpentes). Syst Zool. 1989;38:7–25. 63. Kluge AG, Wolf AJ. Cladistics: What's in a word? Cladistics. 1993;9:183–199. 64. Nixon KC, Carpenter JM. On simultaneous analysis. Cladistics. 1996;12:221–241. 65. Stegger G, Hofman H, Fortsch J, Gross HJ, Randles JW, Sanger HL, Riesner D. Conformational transitions in viroids and virusoids: comparison of results from energy minimization algorithm and from experimental data. J Biomol Struct Dynam. 1984;2:543–571. 66. Higgs PG. RNA secondary structure: a comparison of real and random sequences. J Phys I France. 1993;3:43–59. 67. Higgs PG. Thermodynamic properties of transfer RNA: a computational study. J Chem Soc Faraday Trans. 1995;91:2531–2540. 68. Schultes EA, Hraber PT, LaBean TH. Estimating the contributions of selection and self-organization in RNA secondary structure. J Mol Evol. 1999;49:76–83. [PubMed] 69. Steffens W, Digby D. mRNA have greater negative folding free energies than shuffled or codon choice randomized sequences. Nucleic Acids Res. 1999;27:1578–1584. [PubMed] 70. Gultyaev PA, van Batenburg FHD, Pleij CWA. Selective pressures on RNA hairpins in vivo and in vitro. J Mol Evol. 2002;54:1–8. [PubMed] 71. Gladyshev GP, Ershov YA. Principles of the thermodynamics of biological systems. J Theor Biol. 1982;94:301–343. [PubMed] 72. Ancel LW, Fontana W. Plasticity, evolvability, and modularity in RNA. J Exp Zool (Mol Dev Evol). 2000;288:242–283. 73. Higgs PG. RNA secondary structure: physical and computational aspects. Quarterly Rev Biophys. 2000;33:199–253. 74. Fontana W. Modelling ‘evo-devo’ with RNA. BioEssays. 2002;24:1164–1177. [PubMed] 75. Schultes EA, Bartel DP. One sequence, two ribozymes: implications for the emergence of new ribozyme folds. Science. 2000;289:448–452. [PubMed] 76. Billoud B, Guerrucci MA, Masselot M, Deutsch JS. Cirripede phylogeny using a novel approach: molecular morphometrics. Mol Biol Evol. 2000;17:1435–1445. [PubMed] 77. Collins LJ, Moulton V, Penny D. Use of RNA secondary structure for studying the evolution of RNase P and RNase MRP. J Mol Evol. 2000;51:194–2004. [PubMed] 78. Swain TD, Taylor DJ. Structural rRNA characters support monophyly of raptorial limbs and paraphyly of limb specialization in water fleas. Proc R Soc London B. 2003;270:887–896. 79. Swofford DL. Sunderland, MA: Sinauer Associates; 2002. PAUP*: Phylogenetic Analysis Using Parsimony (*and other methods), version 4.0b10. 80. Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–791. 81. Hillis DM, Huelsenbeck JP. Signal, noise, and reliability in molecular phylogenetic analyses. J Hered. 1992;83:189–195. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Science. 2001 May 4; 292(5518):883-96.
[Science. 2001]Science. 2006 Sep 29; 313(5795):1935-42.
[Science. 2006]J Mol Evol. 1995 Jul; 41(1):1-9.
[J Mol Evol. 1995]Proc Natl Acad Sci U S A. 1975 May; 72(5):1909-12.
[Proc Natl Acad Sci U S A. 1975]Trends Genet. 2000 Jan; 16(1):17-9.
[Trends Genet. 2000]Proc Natl Acad Sci U S A. 1998 Jun 9; 95(12):6854-9.
[Proc Natl Acad Sci U S A. 1998]Nat Rev Genet. 2005 May; 6(5):361-75.
[Nat Rev Genet. 2005]Curr Opin Struct Biol. 2005 Jun; 15(3):248-53.
[Curr Opin Struct Biol. 2005]Proc Natl Acad Sci U S A. 1998 Jun 9; 95(12):6854-9.
[Proc Natl Acad Sci U S A. 1998]Curr Opin Genet Dev. 1999 Dec; 9(6):672-7.
[Curr Opin Genet Dev. 1999]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]Trends Genet. 2007 Jan; 23(1):26-33.
[Trends Genet. 2007]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]Cold Spring Harb Symp Quant Biol. 1987; 52():17-27.
[Cold Spring Harb Symp Quant Biol. 1987]J Mol Evol. 2007 Oct; 65(4):425-36.
[J Mol Evol. 2007]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]J Mol Evol. 2005 Oct; 61(4):524-30.
[J Mol Evol. 2005]Nat Rev Genet. 2005 May; 6(5):361-75.
[Nat Rev Genet. 2005]Mol Biol Evol. 2000 Jun; 17(6):839-50.
[Mol Biol Evol. 2000]Syst Biol. 1998 Mar; 47(1):9-17.
[Syst Biol. 1998]Nature. 1996 Sep 12; 383(6596):130-1.
[Nature. 1996]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]Gene. 2003 May 22; 310():59-66.
[Gene. 2003]Gene. 2005 Nov 7; 360(2):120-30.
[Gene. 2005]Gene. 2007 Nov 15; 403(1-2):39-52.
[Gene. 2007]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]J Mol Evol. 2005 Oct; 61(4):524-30.
[J Mol Evol. 2005]Res Microbiol. 2003 May; 154(4):231-6.
[Res Microbiol. 2003]Adv Virus Res. 1995; 45():281-319.
[Adv Virus Res. 1995]Curr Opin Genet Dev. 2004 Dec; 14(6):616-9.
[Curr Opin Genet Dev. 2004]Curr Opin Microbiol. 2002 Oct; 5(5):525-32.
[Curr Opin Microbiol. 2002]Biochimie. 2005 Sep-Oct; 87(9-10):793-803.
[Biochimie. 2005]Nat Rev Microbiol. 2006 Nov; 4(11):837-48.
[Nat Rev Microbiol. 2006]Genome Res. 2007 Nov; 17(11):1572-85.
[Genome Res. 2007]Res Microbiol. 2003 May; 154(4):223-5.
[Res Microbiol. 2003]Curr Top Microbiol Immunol. 2007; 315():51-66.
[Curr Top Microbiol Immunol. 2007]Biol Direct. 2006 Sep 19; 1():29.
[Biol Direct. 2006]Biol Direct. 2007 Aug 20; 2():21.
[Biol Direct. 2007]Nucleic Acids Res. 1993 Jul 1; 21(13):3011-5.
[Nucleic Acids Res. 1993]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D139-40.
[Nucleic Acids Res. 2005]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2008 Jan; 66(1):21-35.
[J Mol Evol. 2008]J Mol Evol. 2002 Mar; 54(3):333-45.
[J Mol Evol. 2002]Nucleic Acids Res. 2002 Jun 1; 30(11):2575-87.
[Nucleic Acids Res. 2002]J Mol Evol. 2005 May; 60(5):635-52.
[J Mol Evol. 2005]J Mol Evol. 2002 Jan; 54(1):1-8.
[J Mol Evol. 2002]Mol Biol Evol. 2000 Jun; 17(6):839-50.
[Mol Biol Evol. 2000]J Hered. 1992 May-Jun; 83(3):189-95.
[J Hered. 1992]