Format

Send to

Choose Destination
Syst Biol. 2016 Mar;65(2):334-44. doi: 10.1093/sysbio/syv082. Epub 2015 Nov 1.

SimPhy: Phylogenomic Simulation of Gene, Locus, and Species Trees.

Author information

1
Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain dmallo@uvigo.es.
2
Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain.

Abstract

We present a fast and flexible software package--SimPhy--for the simulation of multiple gene families evolving under incomplete lineage sorting, gene duplication and loss, horizontal gene transfer--all three potentially leading to species tree/gene tree discordance--and gene conversion. SimPhy implements a hierarchical phylogenetic model in which the evolution of species, locus, and gene trees is governed by global and local parameters (e.g., genome-wide, species-specific, locus-specific), that can be fixed or be sampled from a priori statistical distributions. SimPhy also incorporates comprehensive models of substitution rate variation among lineages (uncorrelated relaxed clocks) and the capability of simulating partitioned nucleotide, codon, and protein multilocus sequence alignments under a plethora of substitution models using the program INDELible. We validate SimPhy's output using theoretical expectations and other programs, and show that it scales extremely well with complex models and/or large trees, being an order of magnitude faster than the most similar program (DLCoal-Sim). In addition, we demonstrate how SimPhy can be useful to understand interactions among different evolutionary processes, conducting a simulation study to characterize the systematic overestimation of the duplication time when using standard reconciliation methods. SimPhy is available at https://github.com/adamallo/SimPhy, where users can find the source code, precompiled executables, a detailed manual and example cases.

KEYWORDS:

Gene conversion; gene duplication and loss; gene family evolution; horizontal gene transfer; incomplete lineage sorting; locus tree; simulation; species tree

PMID:
26526427
PMCID:
PMC4748750
DOI:
10.1093/sysbio/syv082
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center