Mol Ecol Resour. 2016 Mar;16(2):511-23. doi: 10.1111/1755-0998.12463. Epub 2015 Sep 26.

Potential and pitfalls of eukaryotic metagenome skimming: a test case for lichens.

Author information

1. Institute of Cell Biology and Neuroscience, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany.
2. Senckenberg Biodiversity and Climate Research Centre (BiK-F), Senckenberg Anlage 25, D-60325, Frankfurt, Germany.
3. Institute of Ecology, Evolution and Diversity, Goethe University Frankfurt, Max-von-Laue Str. 13, D-60438, Frankfurt, Germany.

Abstract

Whole-genome shotgun sequencing of multispecies communities using only a single library layout is commonly used to assess taxonomic and functional diversity of microbial assemblages. Here, we investigate to what extent such metagenome skimming approaches are applicable for in-depth genomic characterizations of eukaryotic communities, for example lichens. We address how best to assemble a particular eukaryotic metagenome skimming data set, what pitfalls can occur, and what genome quality can be expected from these data. To facilitate project-specific benchmarking, we introduce the concept of twin sets: simulated data resembling the outcome of a particular metagenome sequencing study. We show that the quality of genome reconstructions depends essentially on assembler choice. Individual tools, including the metagenome assemblers Omega and MetaVelvet, are surprisingly sensitive to low and uneven coverage. In combination with the routine practice of choosing assembly parameters to optimize the assembly N50 size, these tools can exclude an entire genome from the assembly. In contrast, MIRA, an all-purpose overlap assembler, and SPAdes, a multisized de Bruijn graph assembler, facilitate a comprehensive view of the individual genomes across a wide range of coverage ratios. Testing assemblers on real-world metagenome skimming data from the lichen Lasallia pustulata demonstrates the applicability of twin sets for guiding method selection. Furthermore, it reveals that the assembly outcome for the photobiont Trebouxia sp. falls short of the a priori expectation given the simulations. Although the underlying reasons remain unclear, this highlights that further studies on this organism require special attention during sequence data generation and downstream analysis.
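
The interaction between N50 optimization and low-coverage genomes mentioned in the abstract can be illustrated with a small sketch. The contig lengths below are invented toy numbers, not data from the study, and the function name `n50` is our own. The sketch only shows why an assembly that drops a fragmented, low-coverage genome can score a higher N50 than one that retains it.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly: the contig length L such that
    contigs of length >= L together cover at least half of the total
    assembled bases."""
    if not contig_lengths:
        raise ValueError("need at least one contig")
    lengths = sorted(contig_lengths, reverse=True)
    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy contig lengths (arbitrary units), chosen for illustration:
# one genome sequenced at high coverage assembles into a few long contigs,
# the other, at low coverage, into many short ones.
high_coverage_contigs = [900, 800, 700]
low_coverage_contigs = [100] * 12

both_genomes = high_coverage_contigs + low_coverage_contigs

print("N50 with both genomes retained:", n50(both_genomes))
print("N50 with low-coverage genome dropped:", n50(high_coverage_contigs))
```

With these toy values the N50 rises when the fragmented genome is left out, so choosing parameters by N50 alone would favor the less complete assembly. Under this assumption, a per-genome completeness measure would be a useful companion to N50 when benchmarking.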

KEYWORDS:

bioinformatics; genome assembly; genomics; lichen; metagenomics; symbiosis

PMID: 26345272
DOI: 10.1111/1755-0998.12463
[Indexed for MEDLINE]
