![]() | ![]() |
Formats:
|
||||||||||||
Copyright © 2009 by Cold Spring Harbor Laboratory Press Mosaic retroposon insertion patterns in placental mammals 1 Institute of Experimental Pathology, Center for Molecular Biology of Inflammation, University of Münster, 48149 Münster, Germany; 2 Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA 3These authors contributed equally to this work. 4Present address: LWL-Museum für Naturkunde, 48161 Münster, Germany. 5Corresponding authors.E-mail jueschm/at/uni-muenster.de; fax 49-251-8352134.E-mail churakov/at/uni-muenster.de; fax 49-251-8352134. Received December 19, 2008; Accepted March 3, 2009. Abstract One and a half centuries after Charles Darwin and Alfred Russel Wallace outlined our current understanding of evolution, a new scientific era is dawning that enables direct observations of genetic variation. However, pure sequence-based molecular attempts to resolve the basal origin of placental mammals have so far resulted only in apparently conflicting hypotheses. By contrast, in the mammalian genomes where they were highly active, the insertion of retroelements and their comparative insertion patterns constitute a neutral, virtually homoplasy-free archive of evolutionary histories. The “presence” of a retroelement at an orthologous genomic position in two species indicates their common ancestry in contrast to its “absence” in more distant species. To resolve the placental origin controversy we extracted ~2 million potentially phylogenetically informative, retroposon-containing loci from representatives of the major placental mammalian lineages and found highly significant evidence challenging all current single hypotheses of their basal origin. The Exafroplacentalia hypothesis (Afrotheria as the sister group to all remaining placentals) is significantly supported by five retroposon insertions, the Epitheria hypothesis (Xenarthra as the sister group to all remaining placentals) by nine insertion patterns, and the Atlantogenata hypothesis (a monophyletic clade comprising Xenarthra and Afrotheria as the sister group to Boreotheria comprising all remaining placentals) by eight insertion patterns. These findings provide significant support for a “soft” polytomy of the major mammalian clades. Ancestral successive hybridization events and/or incomplete lineage sorting associated with short speciation intervals are viable explanations for the mosaic retroposon insertion patterns of recent placental mammals and for the futile search for a clear root dichotomy. Genomic variation is based on genetic convertibility and is amplified by gene flow and sexual recombination. While only a small fraction of genetic variation is directly exposed to the natural selection driving the evolution of species, the majority of changes are neutral (Kimura 1968). In the molecular Darwinian and post-genomic age when such variations are now directly observable in prodigious numbers, one still needs only compare the volumes of controversy in scientific reports to dispel the notion that all the mysteries of evolutionary history are now an open book. The basal node of the placental mammals is a prime example of such a controversy. After continuous morphological inconsistencies and a lack of resolution due to narrow temporal speciation events at the deeper divergences of the placental mammalian tree, the hope was that large-scale DNA sequencing and an increasing number of sampled species might resolve the higher-level mammalian relationships by reducing systematic biases. But the extraction and analysis of molecular traces of evolutionary history has been no less sensitive to misinterpretation than was the understanding of dusty specimens 150 years ago. The reliability of sequence data in genome-scale phylogenetic approaches is subject to unequal evolutionary rates among lineages, regional- or lineage-specific compositional biases, shifts in site-specific evolutionary rates over time and sequence regions (e.g., Nishihara et al. 2007), variable accuracies of multisequence alignments and analytical tools, and last but not least, the general liability of such data to homoplasy. In a seminal work, Murphy et al. (2001a) investigated ~10 kb of sequence information to substantiate the previously proposed (Waddell et al. 1999) major mammalian groups Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. Moderate support was found for Afrotheria as the basal split of placentals (Exafroplacentalia; Fig. 1A
By contrast, the presence/absence patterns of inserted retroelements constitute a virtually homoplasy-free marker system with theoretically infinite character states (Steel and Penny 2000). Specific genomic insertions of such elements in the ancestor of two species reliably document their common ancestry. The very few known examples of discordance in retroelement presence/absence data can be explained by deletion via illegitimate recombination between perfect direct repeats flanking each insertion (van de Lagemaat et al. 2005), exact parallel insertions (Cantrell et al. 2001), or lineage sorting, a phenomenon related to incomplete allele fixation and frequent intervals of speciation events (Shedlock et al. 2004; Ray et al. 2006). Analyses of a smaller number of retroelements as phylogenetic markers in mammals validated the four superordinal mammalian clades (Nishihara et al. 2005; Kriegs et al. 2006; Möller-Krull et al. 2007). However, in support of Shoshani and McKenna (1998), but in contradiction to most other molecular investigations, we found the first evidence, two L1MB5 retroelements, that placed Xenarthra (Epitheria hypothesis) (Fig. 1B With the intention of resolving the conflicting topologies of placental lineages, we aimed to examine a much larger sample of retroposed elements and in more species. For this study we took a slightly different tack and aligned whole genomes of three representative placental species, including additional species for informative loci to represent a maximum divergence within each of the groups, and scanned them for diagnostic retroelement insertions. In an analysis of ~2 million potential phylogenetically informative loci, we found multiple retroposon insertion patterns significantly supporting all three placental speciation hypotheses, indicating a complex ancestral speciation scenario including successive early divergences in close temporal proximity and hybridization and/or lineage sorting events as viable sources for an effective “soft” polytomy. This very rare example of multiple retroposon incongruence goes a long way toward explaining the decades-long stream of apparently conflicting evidence for one or the other placental evolutionary history based on multiple marker systems including both morphological and molecular sequence evidence. Results We focused our computational screening for phylogenetically informative markers on the insertion patterns of L1MB elements that were active during the critical period of early placental evolution ~100 Mya (Kriegs et al. 2006; Murphy et al. 2007), and independently tested the three viable scenarios of placental origin: the Exafroplacentalia, Epitheria, and Atlantogenata hypotheses (Fig. 1
In strong contrast to most of the mammalian branches that we and others investigated previously with this method (e.g., Nishihara et al. 2005, 2006; Huchon et al. 2007; Kriegs et al. 2007; Möller-Krull et al. 2007; Xing et al. 2007; Warren et al. 2008), we found a rare example of virtually equal support for three competing hypotheses (Fig. 2B Of all the phylogenetically informative mammalian markers investigated to date (Kriegs et al. 2006; Nishihara et al. 2006; Murphy et al. 2007), just the deepest split of the major placental clades has exhibited such extreme contradictions. The only hypothesis to best explain these apparently conflicting results is that there is no clear dichotomy at the base of placentals but rather a trifurcation. Discussion Comparative analyses of large-scale genome data afford a new era of phylogenetic inference combining the fields of evolution and genomics into one of phylogenomics. A completely objective history of the mammalian orders is one of the main visions of this new field of phylogenomics. Most of the interordinal mammalian branches have already been confirmed with convincing support from genome data, but problematic areas of the placental tree have also been recognized (Hallström and Janke 2008). As an example, narrow ancestral splitting events such as the early divergence of placentals have been contradictorily resolved by genome-wide analyses of protein-coding data (Hallström et al. 2007; Nikolaev et al. 2007). But, because nucleotide sequences are highly exposed to homoplasy, using them as phylogenetic markers cannot resolve these contradictions. Our current data, with significant, nearly equivalent support for all three hypotheses in one study, could be explained by three possible scenarios: deletion via illegitimate recombination between perfect direct repeats flanking each insertion; exact parallel insertions; or lineage sorting, a phenomenon related to incomplete allele fixation and frequent intervals of speciation events (Shedlock et al. 2004; Ray et al. 2006). Precise deletion was observed in 0.5% of LINE1-mobilized SINE insertions in primates (van de Lagemaat et al. 2005). Statistically speaking, this could affect at most six of our 1227 candidate loci, and the chance of all six being in the 22 markers used for the final analysis is infinitesimally small. Moreover, the process of precise deletion is restricted to a very small period of time where both element flanking direct repeats must be identical to allow recombination. Van de Lagemaat et al. (2005) also did not consider the effects of lineage sorting or mosaic evolution in their calculation, hence the expected number of real precise deletions should be much lower than 0.5%. From a previous investigation (Ray et al. 2006), it is known that precise parallel insertion occurs in 0.05% of observed primate LINE1-mobilized SINE insertions. Considering our total of 1227 candidate loci, this is less than one. Furthermore, all previous investigations did not consider random truncations that are an essential, additional factor for establishing LINE element orthology. Therefore, as the homoplasy of retroposon insertion patterns is extremely rare in mammals and was only described recently for one apparent conflicting marker challenging the newly described Pegasoferae clade (Nishihara et al. 2006), this leaves us with the last explanation, lineage sorting. We believe that the most parsimonious interpretation of the current data is that the ancestral placental populations were characterized by severe ancestral subdivisions and rejoinings, leading to a complex mosaic of phylogenetic relationships in recent species (Fig. 4
Two different scenarios or a mixture of both might be responsible for the mosaic pattern that we found in our screening for informative retroposon insertions and that other studies found in phylogenetic investigations of subsets of genomic sequence data for these species. The earliest eutherian mammals (extant placentals plus closely related extinct mammals) originated about 125 Mya in the Early Cretaceous and based on appearance can probably be represented by the recently discovered and reconstructed skeletal remains of Eomaia scansoria (Ji et al. 2002). In the first scenario, the Eomaia-like ancestral population probably diverged into three distinct lineages of preAfrotheria, preXenarthra, and preBoreotheria. Limited gene flow and hybridization that occurred among these populations left behind mosaic signs of intermittent relationships that we identify about 125 million years later as a patchwork of Exafroplacentalia (I), Epitheria (II), and Atlantogenata (III) roots (Fig. 4A A mosaic of evolutionary signals, blueprinted in the human genome, was also recently described by Ebersberger et al. (2007). They reported that 23% of the genomic sequences (equal representations of genes and intergenic regions; 23,210 alignments, each ≥300 nucleotides [nt]) that they analyzed represented an incongruent genealogy in which chimpanzee was not the closest relative to human. The majority of sequence-based investigations aimed at resolving the placental root, by examining numerous genes, sequence partition, and various analytic tools, also found almost equal support for one or the other of the various hypotheses (e.g., Madsen et al. 2001; Delsuc et al. 2002; Waddell and Shelley 2003), once more emphasizing the notion that early placental divergence constituted a patchwork evolution of the initial lineages. That such conflicting results are so very rare requires some discussion of the actual markers that we recovered. The 22 phylogenetically informative markers providing significant support for a trifurcation at the basal node of placental mammals that were retrieved in the current screen included both of the previously described Atlantogenata markers (Murphy et al. 2007) but only one of the two previously described Epitheria markers (Kriegs et al. 2006). Although the undetected one was not present in the three-way multiple alignment format (MAF) alignments, we do consider it to be a reliable marker (Supplemental Fig. S2). A distinct advantage of the current expanded screening compared with previous searches is a more unbiased evaluation of equal amounts of data for all three hypothetical scenarios of placental origin based on whole-genome alignments. We based our previous strategies (Kriegs et al. 2006; Strategy II) on a random search of human intron data and screens of the limited amounts of trace sequences of elephant and armadillo available at the time, while the specifically designed, whole-genome, three-way alignments currently employed offer a much more comprehensive data source resulting in detection of a high number of potentially informative markers. However, the fact that we recovered only three of the four previously described retroposon markers for early placental splits indicates that the current limitations of an exhaustive and maximized unbiased search is the naturally imperfect and variable quality of available specific whole-genome alignments. One critical point in using retroposon insertion data, especially in deep phylogenetic splits, is the continuous divergence of the elements and their flanking regions. Nevertheless, we recently demonstrated that retroposon data could provide reliable markers in deep mammalian phylogeny by supporting the more than 140-million-year-old therian split (Warren et al. 2008). As stated previously, 20 of the retroposons examined here had clearly recognizable information concerning their direct repeats and the other two, though recognizable, were not quite as clear. Direct repeats generated by target site duplications in the process of retroposition are valuable landmarks of an orthologous insertion but are unlikely to remain unchanged throughout deep phylogenetic splits. Hence, there is an additional and more reliable orthology criterion available, particularly for the LINE-related retroposons such as L1MB elements. These elements insert almost exclusively as randomly truncated forms. Thus, as independent insertions of identically truncated LINE elements of the same subtype at the same genomic locus of two species is highly unlikely, determining the exact point of truncation of the observed orthologous elements will clearly trace them back to a common origin. A genome-wide screening for the 20,700 LINE1 (L1MB) truncations demonstrated a significant random distribution of truncations with no bias for specific truncation points (P < 0.002, Supplemental Fig. S3), and all investigated diagnostic LINE1 elements differ in their lengths (Supplemental Fig. S3). Thus, retroposon truncation points serve as an additional, powerful criterion for orthologous insertions. The randomness of the insertions of LINE1-related elements is well demonstrated by Ray et al. (2006) with only a very few cases of observed conflicts. A genome-wide investigation of target site preferences indicates a slight preference for TT/AAAAA sites (Supplemental Fig. S4). This is well known as a kinkable DNA site (Jurka et al. 1998). However, the full target site duplication is much longer (8–30 nt) and rather individual. Nevertheless, the multiplicity of criteria, including (1) identical genomic insertion points, (2) element orientation, (3) identical LINE1 subtypes, (4) matching length and sequence of direct repeats, (5) concurrent truncation points, and (6) the complete consistency in diverse members of representative orders, make it improbable to have screened for artifacts. Investigations of very young retroposon insertion events present another problem, lineage-specific exact or nearly exact deletions. For young retroposon insertions flanked by perfect identical direct repeats, precise removal might be possible via recombination involving the direct repeats (van de Lagemaat et al. 2005). Because direct repeats diversify quickly after insertion, this possibility should not be critical for deep phylogenetic splits (after altering of perfect direct repeats), but it is not completely impossible for such random perfect or nearly perfect deletions of retroelements to have accumulated, especially in deep divergences where exact comparisons of diverged direct repeats are difficult. However, the tendency of LINE1-related elements to integrate within A/T-rich sequence regions could promote such deletions. Comparing potential retroposon absence regions with a clear absence in an outgroup species is one way to diminish such potential misinterpretations. Unfortunately, for placentals the next outgroup are marsupials that diverged substantially. Because of the evolutionary distance between marsupials and placentals, even if possible, alignments between these species are, particularly for intergenic or intronic regions, only approximate and mostly not a critical proof of true absence. On the other hand, the outgroup comparison for informative L1MB elements is not essential, because L1MB elements are placental-specific and therefore necessarily absent in any outgroup species. Thus, the suggestion of Murphy et al. (2007) that one of our informative L1MB5 markers for Epitheria (Kriegs et al. 2006) is traceable in opossum may be based on a misalignment and misannotation of the corresponding BLAT browser location (Supplemental Fig. S2). Conclusions Comparing the genome-wide insertion patterns of retroelements is a powerful strategy to gain a virtually homoplasy-free picture of the evolutionary histories of species. Virtually homoplasy-free implies that this marker system is, as any other marker system, not insensitive to the effects of ancestral hybridization and incomplete lineage sorting combined with short splitting times. Because of the clear character polarity (presence of an element as the derived state) and the low probability of orthologous exact parallel insertions or deletions, the retroposon presence/absence marker system is exceptionally effective for detecting the ancestral exchange of genetic material among lineages and the lineage sorting effects that are difficult to pinpoint by the statistical evaluation of heterogeneous sequence data (Hallström and Janke 2008). Such effects are more difficult to recognize in deep phylogenetic splits and require maximized, unbiased, multidirectional screening of whole-genome phylogenetic signals. They must also be conducted with the notion in mind that in such rare cases, the minimum of three clear markers proposed by Waddell et al. (2001) is not absolutely sufficient as a significant clade support. The present example of apparently incongruent markers inherent in the early branching of placentals offers a seminal example of conflicting retroposon presence/absence patterns that, at the same time, shed new light on the decades-old, controversial scenarios of sequence-based phylogenies. As demonstrated, retroposon screening offers a powerful test of critical signals in narrow splits to detect nodes with more than two immediate descending branches. The basal rodent splits, the Euarchonta relationships, and the affiliations among laurasiatherians are only a few additional contentious examples that may possibly find elucidation by comparing the insertion histories of genome-wide, multidirectionally extracted, phylogenetically informative retroelements. Methods Genome alignments To perform an exhaustive, genome-wide screening for phylogenetically informative retroposon insertions in placentals and to test the three viable phylogenetic hypotheses of the basal origin of placentals, we derived two different three-way (species) whole-genome sequence alignments in MAF format (UCSC): (1) armadillo–elephant–human (3050 gigabases) and (2) elephant–human–armadillo (2730 gigabases), the decisive difference between the two alignments being the full retroposon representation of the leading species that was used as the profile for the remaining species. The three-way alignments were prepared using pairwise (armadillo–elephant and armadillo–human for the first three-way alignment and elephant–human and armadillo–elephant for the second three-way alignment) BLASTZ alignments (Chiaromonte et al. 2002; Kent et al. 2003; Schwartz et al. 2003) and the standard matrix used in the UCSC browser. The alignments were fed into axtChain with the parameters −chainMinScore = 3000 − chainLinearGap = medium, which organizes all alignments by chromosome (or scaffold) and creates a kd-tree from the gapless subsections (blocks) of the alignments. A dynamic program was then run over the kd-trees to find the maximally scoring chains of these blocks. Finally, MULTIZ (Blanchette et al. 2004) was used to generate the two three-way alignments referencing elephant and armadillo. We used the first alignment to search for elements supporting the Atlantogenata [(armadillo + elephant) − human] and Exafroplacentalia [(armadillo + human) − elephant] hypotheses, and the second one to test the Epitheria [(elephant + human) − armadillo] hypothesis. Presence/absence loci The MAF alignments were organized in blocks of conservation including chromosomal coordinates (Fig. 2A Computational screening for potential informative retroposons With the local RepeatMasker (www.repeatmasker.org) we screened the triple block sequences for mammalian-specific L1MB LINEs. We selected triple blocks in which at least 50% of block 2 was composed of an L1MB element (human + elephant 284 loci; human + armadillo 256 loci; elephant + armadillo 677 loci). In addition, to detect slightly misaligned but informative loci, all conserved blocks were independently screened for retroelements, and the corresponding regions were compared among all three species for potential absences (10 cases). Visual screening for informative retroposons Links to the Genome Browser (http://genome.ucsc.edu/cgi-bin/hgBlat) were generated to filter out loci with less than 70% sequence similarity between multiple species and retroposons that overlapped with blocks 1 and/or 3, leaving us with a total of 22 phylogenetically informative loci. Experimental amplification to detect presence/absence of retroposons in sloth and tenrec As sufficient sequence data were not yet available for all the phylogenetically informative loci in sloth and tenrec, we PCR-amplified some of those missing loci to provide a more complete presence/absence pattern. PCR reactions were performed using Phusion DNA Polymerase (New England BioLabs) for 30 sec at 98°C followed by 35 cycles of 10 sec at 98°C, 30 sec at primer-specific temperatures (Supplemental Table S3), and 30 sec at 72°C. Amplified PCR fragments were sequenced directly or purified on agarose gels, ligated into the pDrive Cloning Vector (Qiagen), and electroporated into TOP10 cells (Invitrogen). Sequencing was performed using the AmpliTaq FS Big Dye Terminator Kit (PE Biosystems). A list of the PCR primers used is shown in Supplemental Table S3. Comparative analyses For each locus identified in the MAF alignments, we compiled available sequence information of other selected mammalian species from trace or genomic data sequences from mouse, guinea pig, rabbit, dog, horse, cow, tenrec, hyrax, sloth, and as far as available, opossum. All inserted L1MB elements were aligned against the consensus Repbase sequences (www.girinst.org/repbase) (Kapitonov and Jurka 2008). Each insertion site was carefully checked to validate the orthology of the insertions based on the following criteria: (1) identical genomic insertion points; (2) element orientation; (3) identical LINE1 subtypes; (4) matching length and sequence of direct repeats; and (5) identical truncation points. We tried as much as was possible to compile a representative and consistent species sampling for each marker (Supplemental Table S1; Supplemental Material 1). This was not always possible because the genomic information from some species remains fragmented. For consistent comparisons for most markers we selected two Supraprimates from human, mouse, guinea pig, and rabbit; two Laurasiatheria from dog, horse, and cow; two Afrotheria from elephant, tenrec, and hyrax; two Xenarthra (armadillo and sloth); and if alignable, the opossum outgroup sequence. Acknowledgments We thank Marsha Bundman for editorial assistance and Ronald Blakey for providing the two terrestrial globes in Figure 4 Footnotes [Supplemental material is available online at www.genome.org.] Article is online at http://www.genome.org/cgi/doi/10.1101/gr.090647.108. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
Nature. 1968 Feb 17; 217(5129):624-6.
[Nature. 1968]Genome Biol. 2007; 8(9):R199.
[Genome Biol. 2007]Syst Biol. 1999 Mar; 48(1):119-37.
[Syst Biol. 1999]Nature. 2001 Feb 1; 409(6820):610-4.
[Nature. 2001]PLoS Genet. 2007 Jan 5; 3(1):e2.
[PLoS Genet. 2007]Nature. 2007 Jun 14; 447(7146):799-816.
[Nature. 2007]Mol Biol Evol. 2007 Sep; 24(9):2059-68.
[Mol Biol Evol. 2007]Mol Biol Evol. 2000 Jun; 17(6):839-50.
[Mol Biol Evol. 2000]Genome Res. 2005 Sep; 15(9):1243-9.
[Genome Res. 2005]Genetics. 2001 Jun; 158(2):769-77.
[Genetics. 2001]Trends Ecol Evol. 2004 Oct; 19(10):545-53.
[Trends Ecol Evol. 2004]Syst Biol. 2006 Dec; 55(6):928-35.
[Syst Biol. 2006]Mol Biol Evol. 2005 Sep; 22(9):1823-33.
[Mol Biol Evol. 2005]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]Mol Biol Evol. 2007 Nov; 24(11):2573-82.
[Mol Biol Evol. 2007]Mol Phylogenet Evol. 1998 Jun; 9(3):572-84.
[Mol Phylogenet Evol. 1998]Genome Res. 2007 Apr; 17(4):413-21.
[Genome Res. 2007]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]Genome Res. 2007 Apr; 17(4):413-21.
[Genome Res. 2007]Mol Biol Evol. 2005 Sep; 22(9):1823-33.
[Mol Biol Evol. 2005]Proc Natl Acad Sci U S A. 2006 Jun 27; 103(26):9929-34.
[Proc Natl Acad Sci U S A. 2006]Proc Natl Acad Sci U S A. 2007 May 1; 104(18):7495-9.
[Proc Natl Acad Sci U S A. 2007]Trends Genet. 2007 Apr; 23(4):158-61.
[Trends Genet. 2007]Mol Biol Evol. 2007 Nov; 24(11):2573-82.
[Mol Biol Evol. 2007]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]Proc Natl Acad Sci U S A. 2006 Jun 27; 103(26):9929-34.
[Proc Natl Acad Sci U S A. 2006]Genome Res. 2007 Apr; 17(4):413-21.
[Genome Res. 2007]BMC Evol Biol. 2008 May 27; 8():162.
[BMC Evol Biol. 2008]Mol Biol Evol. 2007 Sep; 24(9):2059-68.
[Mol Biol Evol. 2007]PLoS Genet. 2007 Jan 5; 3(1):e2.
[PLoS Genet. 2007]Trends Ecol Evol. 2004 Oct; 19(10):545-53.
[Trends Ecol Evol. 2004]Syst Biol. 2006 Dec; 55(6):928-35.
[Syst Biol. 2006]Genome Res. 2005 Sep; 15(9):1243-9.
[Genome Res. 2005]Proc Natl Acad Sci U S A. 2006 Jun 27; 103(26):9929-34.
[Proc Natl Acad Sci U S A. 2006]Proc Natl Acad Sci U S A. 2007 Sep 4; 104(36):14395-400.
[Proc Natl Acad Sci U S A. 2007]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]Nature. 2002 Apr 25; 416(6883):816-22.
[Nature. 2002]Mol Biol Evol. 2007 Oct; 24(10):2266-76.
[Mol Biol Evol. 2007]Nature. 2001 Feb 1; 409(6820):610-4.
[Nature. 2001]Mol Biol Evol. 2002 Oct; 19(10):1656-71.
[Mol Biol Evol. 2002]Mol Phylogenet Evol. 2003 Aug; 28(2):197-224.
[Mol Phylogenet Evol. 2003]Genome Res. 2007 Apr; 17(4):413-21.
[Genome Res. 2007]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]Nature. 2008 May 8; 453(7192):175-83.
[Nature. 2008]Syst Biol. 2006 Dec; 55(6):928-35.
[Syst Biol. 2006]J Biomol Struct Dyn. 1998 Feb; 15(4):717-21.
[J Biomol Struct Dyn. 1998]Genome Res. 2005 Sep; 15(9):1243-9.
[Genome Res. 2005]Genome Res. 2007 Apr; 17(4):413-21.
[Genome Res. 2007]PLoS Biol. 2006 Apr; 4(4):e91.
[PLoS Biol. 2006]BMC Evol Biol. 2008 May 27; 8():162.
[BMC Evol Biol. 2008]Genome Inform. 2001; 12():141-54.
[Genome Inform. 2001]Proc Natl Acad Sci U S A. 2003 Sep 30; 100(20):11484-9.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2003 Jan; 13(1):103-7.
[Genome Res. 2003]Genome Res. 2004 Apr; 14(4):708-15.
[Genome Res. 2004]Nat Rev Genet. 2008 May; 9(5):411-2; author reply 414.
[Nat Rev Genet. 2008]