![]() | ![]() |
Formats:
|
||||||||
Copyright © 2005 The Royal Society Pancrustacean phylogeny: hexapods are terrestrial crustaceans and maxillopods are not monophyletic 1Center for Biosystems Research, University of Maryland Biotechnology Institute 2Department of Entomology, University of Maryland, College Park, MD 20742, USA * Author for correspondence (Email: regier/at/umd.edu). Received May 17, 2004; Accepted August 26, 2004. This article has been cited by other articles in PMC.Abstract Recent molecular analyses indicate that crustaceans and hexapods form a clade (Pancrustacea or Tetraconata), but relationships among its constituent lineages, including monophyly of crustaceans, are controversial. Our phylogenetic analysis of three protein-coding nuclear genes from 62 arthropods and lobopods (Onychophora and Tardigrada) demonstrates that Hexapoda is most closely related to the crustaceans Branchiopoda (fairy shrimp, water fleas, etc.) and Cephalocarida+Remipedia, thereby making hexapods terrestrial crustaceans and the traditionally defined Crustacea paraphyletic. Additional findings are that Malacostraca (crabs, isopods, etc.) unites with Cirripedia (barnacles, etc.) and they, in turn, with Copepoda, making the traditional crustacean class Maxillopoda paraphyletic. Ostracoda (seed shrimp)—either all or a subgroup—is associated with Branchiura (fish lice) and likely to be basal to all other pancrustaceans. A Bayesian statistical (non-clock) estimate of divergence times suggests a Precambrian origin for Pancrustacea (600 Myr ago or more), which precedes the first unambiguous arthropod fossils by over 60 Myr. Keywords: arthropod phylogeny, Cambrian explosion, Crustacea, Hexapoda, molecular systematics, Pancrustacea 1. Introduction Establishing phylogenetic relationships among the major arthropod groups, especially the hyper-speciose Hexapoda and the morphologically diverse Crustacea, would be a major advance toward resolving the tree of life. Recent molecular analyses indicate that hexapods and crustaceans form a clade (Pancrustacea or Tetraconata) (Friedrich & Tautz 1995; Boore et al. 1998; Giribet et al. 2001; Regier & Shultz 2001; Mallatt et al. 2004), but relationships among its constituent lineages are controversial (Spears & Abele 1998; Giribet et al. 2001; Martin & Davis 2001; Regier & Shultz 2001; Lavrov et al. 2004; Mallatt et al. 2004). Resolving these lineages would provide an improved phylogenetic context for documenting the many complex morphological transformations that have occurred during arthropod evolution. Clarifying the role of homoplasy (i.e. parallelisms due to constraints and convergences due to natural selection) would be another benefit of a robust phylogeny. Unfortunately, identifying sufficient characters to robustly resolve closely spaced Palaeozoic (or earlier) divergences has been a challenge. Sequence data from multiple, appropriately evolving, protein-coding nuclear genes have been successfully used to resolve other lineages (see, for example, Murphy et al. 2001) and also hold promise for arthropods. We address relationships within Pancrustacea by analysing sequence data from three such genes and strongly resolve relationships of several major groups. 2. Material and methods 2.1 Taxon sampling and data generation Sixty-two species of Arthropoda, Tardigrada and Onychophora were sampled (see table S1 in electronic Appendix A). Specific RNA sequences were amplified by reverse transcription followed by polymerase chain reaction (PCR); gel-isolated PCR fragments were then reamplified using nested PCR, re-gel-isolated and sequenced; sequences were assembled and datasets for phylogenetic analyses constructed (see references in Regier & Shultz (2001)). Sequence data were derived from three genes:elongation factor-1α (1131 nucleotides), the largest subunit of RNA polymerase II (2025 nucleotides) and elongation factor-2 (2178 nucleotides). GenBank numbers (see table S1 in electronic Appendix A) and the aligned nucleotide dataset (see dataset in the electronic Appendices or go to www.umbi.umd.edu/users/jcrlab/Arthropoda3gn2004.doc) are available. 2.2 Phylogenetic analyses Nucleotides with third codon positions removed from the 3-gene concatenated sequence were analysed by maximum parsimony under equal weights and by maximum likelihood (Swofford 2002). The latter incorporated a general time reversible model with among-site-rate-variation modelled by a gamma distribution plus a separate parameter for invariable sites. Concatenated amino acids (conceptually translated in MacClade; Maddison & Maddison 2002) were analysed by maximum parsimony under equal weights (Swofford 2002), a Bayesian statistical approach (Huelsenbeck & Ronquist 2001) using the Jones, Taylor and Thornton model (Jones et al. 1992), and a modified-likelihood approach (Adachi & Hasegawa 1994), in which the favoured protml tree was selected from the 91673 most-parsimonious trees (tree lengths =6853–6861). Non-parametric bootstrap analyses (Felsenstein 1985) were performed for all approaches except protml. To calculate bootstrap values for the Bayesian analysis, we wrote a computer program in C (called BP_link, freely downloadable from http://www.umbi.umd.edu/users/jcrlab) that semi-automates this process by linking already available software packages. This allowed a direct comparison of bootstrap percentages and posterior probabilities. 2.3 Divergence time estimates Divergence time estimates at 12 nodes were performed using a Markov chain Monte Carlo procedure for Bayesian analysis of amino acid sequences (Thorne & Kishino 2002) from 17 diverse arthropods, one onychophoran and one tardigrade. Evolutionary rates at adjoining nodes were assumed to be autocorrelated rather than following a strict molecular clock, and individual genes were assigned separate autocorrelation parameters. Fossil-based boundary conditions were also incorporated (Benton 1993). More details can be found in the legend to figure S1 (see electronic Appendices). 3. Results and discussion 3.1 Assessing node support In the current study, 40 new crustacean sequences were obtained from elongation factor-1 α, elongation factor-2 and RNA polymerase II (total, ca. 5334 nucleotides (nt) or ca. 1778 inferred amino acids (aa)) and combined with published sequences, including those from the outgroups Onychophora and Tardigrada (Regier et al. 2004b). A total of 62 taxa (table S1) were analysed using maximum parsimony (MP), maximum likelihood (ML), Bayesian (B) statistical methods, and protein-based parsimony + likelihood (protml). Rapidly evolving third-codon positions were not included in analyses of nucleotides, and a χ2-test could not reject base compositional homogeneity for the remaining nucleotides (ρ = 0.671 assess the degree of clade support we evaluated: (i) sensitivity to five method-character combinations (i.e. MLnt, MPnt, MPaa, Baa and protml); (ii) posterior probabilities from Baa analysis; and (iii) non-parametric bootstrap support from MLnt, MPnt, MPaa and Baa. To facilitate discussion, we will call ‘well supported’ those nodes that are recovered by all five method-character combinations, that have bootstrap support of greater than 85% in at least one combination, and that have a posterior probability (in the Baa analysis) of 100%. The criteria for ‘supported’ nodes are the same except that their highest bootstrap support is 70–85%. Individual gene analyses supported no clades that were not also supported by combined data analyses (unpublished observations). The protml tree of highest likelihood is shown in figure 1
3.2 Monophyly of Pancrustacea, Myriapoda and Chelicerata, but uncertain interrelationships Our results reaffirm that extant arthropods are arranged in three well-supported monophyletic lineages—Pancrustacea, Myriapoda and Chelicerata. Relationships within these groups are not affected by removal of the panarthropod outgroups (unpublished observations). Until recently, myriapod monophyly was challenged primarily by the possibility that hexapods originated from within the group, a view now largely, but not completely (Kraus 2001), abandoned in favour of the Pancrustacea concept. Monophyly of Chelicerata was questioned by one study (Giribet et al. 2001) that placed Pycnogonida (sea spiders) as the sister to all other arthropods, but other studies (Regier & Shultz 2001; Vilpoux & Waloszek 2003), including this one, resolve pycnogonids as well-supported, basally divergent chelicerates. Relationships among the three major arthropod lineages are uncertain in the present analyses, with some favouring Mandibulata (= Pancrustacea + Myriapoda) and others favouring Paradoxopoda (= Chelicerata + Myriapoda). Uncertainty is present in others studies as well (Mandibulata: Edgecombe et al. 2000; Giribet et al. 2001; Paradoxopoda: Friedrich & Tautz 1995; Hwang et al. 2001; Mallatt et al. 2004). 3.3 Major splits within Pancrustacea As in other recent analyses, we recover Pancrustacea as a well-supported monophyletic group, but our results are unique in also identifying three pancrustacean lineages (clades 21, 24, 33 in figure 1 Clade 33, which is supported, includes four classes within two well-supported clades (Hexapoda (Regier et al. 2004a) and Branchiopoda) and one supported clade (Cephalocarida + Remipedia; table 1), although relationships among the lineages remain ambiguous. It is noteworthy that extant members occupy either non-marine environments—Hexapoda on land and in freshwater and Branchiopoda generally in freshwater (Schram 1986)—or marine environments so unusual that the taxa remained undiscovered until the latter half of the twentieth century—Remipedia in anchialine caves (Yager 1981; Schram 1986) and Cephalocarida in benthic flocculent suspensions (Sanders 1955; Schram 1986). It is possible that early members of the clade had a proclivity for near-shore or marginal marine habitats or were competitively excluded by other ‘crustaceans’, perhaps together with myriapods and most chelicerates. These findings reinforce the value of developmental genetic studies that use Artemia brine shrimp (Branchiopoda) as a model for understanding morphological evolution in hexapods (e.g. Averof & Cohen 1997). The advent of the Pancrustacea hypothesis inspired studies of the evolution of hexapod morphology from a primitive ‘crustacean’ condition. However, interpretations of comparative analyses are problematic when relevant phylogenetic relationships are unknown, in this case, the sister group to Hexapoda. Many morphological characters deemed consistent with the molecule-based Pancrustacea concept were based on malacostracans (Dohle 2001), implying that these crustaceans would be most informative of hexapod evolution. However, our results indicate that hexapods are more closely related to a subset of non-malacostracan lineages, which includes the Branchiopoda. The three non-hexapod classes in clade 33—Branchiopoda, Cephalocarida, Remipedia—have each been regarded as the ‘most primitive’ and most basal crustaceans by a subset of workers (see Martin & Davis 2001), a debate driven, in part, by carcinology’s somewhat anachronistic search for the ‘Ur-crustacean’. By contrast, our analyses show that these lineages are phylogenetically derived, whether they retain primitive morphologies or not. One parsimony-based study using 18S rDNA also united Cephalocarida and Remipedia and placed them in a non-basal position within Crustacea, but the authors seemed to regard this as an artefact of long-branch attraction (Spears & Abele 1998). However, our result supports a literal interpretation of their findings, as ours is based on different genes and generated by probabilistic-model-based likelihood and Bayesian methods that are less prone to long-branch anomalies than parsimony. Our recovery of clades 24 and 33 and their subclades differs from that found with 18S+28S ribosomal nucleotides (Mallatt et al. 2004). However, in that case a maximum-likelihood reanalysis of the data (in which only approximately one-third as many pancrustceans were sampled) reveals that bootstrap values are 65% or less for all inter-class relationships (unpublished observations), so we consider the current results based on protein-coding nuclear genes to be more compelling and not in strong conflict with the ribosomal analysis. Our results also differ from those published in a recent study on mitochondrial gene order (Lavrov et al. 2004). Here, too, there is reason to be circumspect. In particular, the single character (a tRNA rearrangment) that defines a clade consisting of Cirripedia, Cephalocarida, Branchiura and Pentastomida is actually missing in Cirripedia. Instead, the cirripede is included because it shares a separate (and also homoplasious) tRNA rearrangement with the cephalocarid, which in turn is missing the diagnostic transposition defining Pancrustacea. Although the authors’ parsimony-based argument is valid, the fact that this conclusion is based on only two homoplasious characters should raise concern. Indeed, the ease of interpretation and certainty with which mitochondrial gene order was originally hoped to provide phylogenetic information are now being challenged, particularly in arthropods (see Negrisolo et al. 2004). As for other characters, those based on gene order are undoubtedly appropriate only at certain taxonomic levels (Hickerson & Cunningham 2000). Clade 21 includes Ostracoda and Branchiura (the latter being most closely related to the unrepresented Pentastomida (tongue worms); Abele et al. 1989; Lavrov et al. 2004) as a group (Oligostraca; Zrzavy et al. 1998) that here forms the sister group to all other pancrustaceans. Although not supported (recovered by only three out of five approaches), it is consistent with analyses of nuclear ribosomal nucleotides (Spears & Abele 1998; Mallatt et al. 2004) and morphology (Zrzavy et al. 1998). Its members have oligomeric bodies and share aspects of ovariole structure. The early divergence of Oligostraca is compatible with Cambrian-age fossils of ostracod-like arthropods (Schram 1986) and purported pentastomids (Walossek & Müller 1994). Furthermore, our two molecular alternatives (MPaa, MLnt; see figure 1 3.4 Absolute divergence times Using the above phylogenetic results, clade divergence time estimations were performed for a subset of 19 panarthropod taxa (figure S1). Abbreviated results (i.e. averaged time estimates without ranges or standard deviations) are mapped onto figure 1 Acknowledgments This work was supported by the National Science Foundation (USA) and the Maryland Agricultural Experiment Station (J.W.S.). The authors thank Valerie Cappola, Greg Edgecombe, Henrik Enghoff, Ann Gauzens, Sarah Gerken, Gonzalo Giribet, Robert Hessler, Diane Nelson, Carles Ribera, Mike Roman, Trisha Spears, Nobuo Tsurusaki, Grace Wyngaard and Jill Yager for specimens, Jeff Thorne for software and Diane Shi for technical assistance. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||
Nature. 1995 Jul 13; 376(6536):165-7.
[Nature. 1995]Nature. 1998 Apr 16; 392(6677):667-8.
[Nature. 1998]Nature. 2001 Sep 13; 413(6852):157-61.
[Nature. 2001]Mol Phylogenet Evol. 2001 Jul; 20(1):136-48.
[Mol Phylogenet Evol. 2001]Mol Phylogenet Evol. 2004 Apr; 31(1):178-91.
[Mol Phylogenet Evol. 2004]Mol Phylogenet Evol. 2001 Jul; 20(1):136-48.
[Mol Phylogenet Evol. 2001]Bioinformatics. 2001 Aug; 17(8):754-5.
[Bioinformatics. 2001]Syst Biol. 2002 Oct; 51(5):689-702.
[Syst Biol. 2002]Syst Biol. 2003 Oct; 52(5):665-73.
[Syst Biol. 2003]Nature. 2001 Sep 13; 413(6852):157-61.
[Nature. 2001]Mol Phylogenet Evol. 2001 Jul; 20(1):136-48.
[Mol Phylogenet Evol. 2001]Nature. 1995 Jul 13; 376(6536):165-7.
[Nature. 1995]Nature. 2001 Sep 13; 413(6852):154-7.
[Nature. 2001]Mol Phylogenet Evol. 2004 Apr; 31(1):178-91.
[Mol Phylogenet Evol. 2004]Proc Natl Acad Sci U S A. 1955 Jan 15; 41(1):61-6.
[Proc Natl Acad Sci U S A. 1955]Nature. 1997 Feb 13; 385(6617):627-30.
[Nature. 1997]Mol Phylogenet Evol. 2004 Apr; 31(1):178-91.
[Mol Phylogenet Evol. 2004]J Mol Evol. 2004 Apr; 58(4):413-23.
[J Mol Evol. 2004]Mol Biol Evol. 2000 Apr; 17(4):639-44.
[Mol Biol Evol. 2000]Mol Phylogenet Evol. 2004 Apr; 31(1):178-91.
[Mol Phylogenet Evol. 2004]Proc Natl Acad Sci U S A. 2002 Feb 5; 99(3):1426-30.
[Proc Natl Acad Sci U S A. 2002]BMC Biol. 2004 Jan 19; 2():1.
[BMC Biol. 2004]