Format

Send to

Choose Destination
See comment in PubMed Commons below
Bioinformatics. 2012 Jan 1;28(1):48-55. doi: 10.1093/bioinformatics/btr592. Epub 2011 Oct 28.

Determining the evolutionary history of gene families.

Author information

1
Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester M13 9PL, UK.

Abstract

MOTIVATION:

Recent large-scale studies of individuals within a population have demonstrated that there is widespread variation in copy number in many gene families. In addition, there is increasing evidence that the variation in gene copy number can give rise to substantial phenotypic effects. In some cases, these variations have been shown to be adaptive. These observations show that a full understanding of the evolution of biological function requires an understanding of gene gain and gene loss. Accurate, robust evolutionary models of gain and loss events are, therefore, required.

RESULTS:

We have developed weighted parsimony and maximum likelihood methods for inferring gain and loss events. To test these methods, we have used Markov models of gain and loss to simulate data with known properties. We examine three models: a simple birth-death model, a single rate model and a birth-death innovation model with parameters estimated from Drosophila genome data. We find that for all simulations maximum likelihood-based methods are very accurate for reconstructing the number of duplication events on the phylogenetic tree, and that maximum likelihood and weighted parsimony have similar accuracy for reconstructing the ancestral state. Our implementations are robust to different model parameters and provide accurate inferences of ancestral states and the number of gain and loss events. For ancestral reconstruction, we recommend weighted parsimony because it has similar accuracy to maximum likelihood, but is much faster. For inferring the number of individual gene loss or gain events, maximum likelihood is noticeably more accurate, albeit at greater computational cost.

AVAILABILITY:

www.bioinf.manchester.ac.uk/dupliphy

CONTACT:

simon.lovell@manchester.ac.uk; simon.whelan@manchester.ac.uk

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID:
22039210
DOI:
10.1093/bioinformatics/btr592
[Indexed for MEDLINE]
PubMed Commons home

PubMed Commons

0 comments

    Supplemental Content

    Full text links

    Icon for Silverchair Information Systems
    Loading ...
    Support Center