Science. 2004 Nov 12;306(5699):1172-4.

Prospects for building the tree of life from large sequence databases.

Author information

Section of Evolution and Ecology, University of California, One Shields Avenue, Davis, CA 95616, USA. acdriskell@ucdavis.edu

Abstract

We assess the phylogenetic potential of approximately 300,000 protein sequences sampled from Swiss-Prot and GenBank. Although only a small subset of these data was potentially phylogenetically informative, this subset retained a substantial fraction of the original taxonomic diversity. Sampling biases in the databases necessitate building phylogenetic data sets that have large numbers of missing entries. However, an analysis of two "supermatrices" suggests that even data sets with as much as 92% missing data can provide insights into broad sections of the tree of life.

PMID: 15539599
DOI: 10.1126/science.1102036
[Indexed for MEDLINE]
Free full text