Format

Send to

Choose Destination
Mol Biol Evol. 2015 May;32(5):1342-53. doi: 10.1093/molbev/msv022. Epub 2015 Feb 19.

Less is more: an adaptive branch-site random effects model for efficient detection of episodic diversifying selection.

Author information

1
Graduate Program in Bioinformatics and Systems Biology, University of California San Diego.
2
Department of Medicine, University of California San Diego.
3
Department of Medicine, University of California San Diego Department of Mathematical Sciences, Stellenbosch University, Stellenbosch, South Africa.
4
Department of Medicine, University of California San Diego spond@ucsd.edu.

Abstract

Over the past two decades, comparative sequence analysis using codon-substitution models has been honed into a powerful and popular approach for detecting signatures of natural selection from molecular data. A substantial body of work has focused on developing a class of "branch-site" models which permit selective pressures on sequences, quantified by the ω ratio, to vary among both codon sites and individual branches in the phylogeny. We develop and present a method in this class, adaptive branch-site random effects likelihood (aBSREL), whose key innovation is variable parametric complexity chosen with an information theoretic criterion. By applying models of different complexity to different branches in the phylogeny, aBSREL delivers statistical performance matching or exceeding best-in-class existing approaches, while running an order of magnitude faster. Based on simulated data analysis, we offer guidelines for what extent and strength of diversifying positive selection can be detected reliably and suggest that there is a natural limit on the optimal parametric complexity for "branch-site" models. An aBSREL analysis of 8,893 Euteleostomes gene alignments demonstrates that over 80% of branches in typical gene phylogenies can be adequately modeled with a single ω ratio model, that is, current models are unnecessarily complicated. However, there are a relatively small number of key branches, whose identities are derived from the data using a model selection procedure, for which it is essential to accurately model evolutionary complexity.

KEYWORDS:

branch-site model; episodic selection; evolutionary model; model complexity; random effects model; variable selection

PMID:
25697341
PMCID:
PMC4408413
DOI:
10.1093/molbev/msv022
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center