Format

Send to

Choose Destination
Genetics. 2014 Jul;197(3):925-37. doi: 10.1534/genetics.114.161299. Epub 2014 May 1.

A Bayesian approach to inferring the phylogenetic structure of communities from metagenomic data.

Author information

1
Department of Mathematics, Bowdoin College, Brunswick, Maine 04011 jobrien@bowdoin.edu daniel_falush@eva.mpg.de.
2
School of Public Health, Imperial College London, London W2 1PG, United Kingdom.
3
Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom.
4
Department of Statistics, University of Oxford, Oxford OX1 3TG, United Kingdom.
5
Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany jobrien@bowdoin.edu daniel_falush@eva.mpg.de.

Abstract

Metagenomics provides a powerful new tool set for investigating evolutionary interactions with the environment. However, an absence of model-based statistical methods means that researchers are often not able to make full use of this complex information. We present a Bayesian method for inferring the phylogenetic relationship among related organisms found within metagenomic samples. Our approach exploits variation in the frequency of taxa among samples to simultaneously infer each lineage haplotype, the phylogenetic tree connecting them, and their frequency within each sample. Applications of the algorithm to simulated data show that our method can recover a substantial fraction of the phylogenetic structure even in the presence of high rates of migration among sample sites. We provide examples of the method applied to data from green sulfur bacteria recovered from an Antarctic lake, plastids from mixed Plasmodium falciparum infections, and virulent Neisseria meningitidis samples.

KEYWORDS:

Bayesian phylogenetics; metagenomics; microevolution

PMID:
24793089
PMCID:
PMC4096371
DOI:
10.1534/genetics.114.161299
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for HighWire Icon for PubMed Central
Loading ...
Support Center