Send to

Choose Destination
PLoS One. 2014 May 16;9(5):e97876. doi: 10.1371/journal.pone.0097876. eCollection 2014.

Common contaminants in next-generation sequencing that hinder discovery of low-abundance microbes.

Author information

Shipshaw Labs, Montreal, Quebec, Canada.
Yale Comprehensive Cancer Center, Yale School of Medicine, New Haven, Connecticut, United States of America.
Department of Therapeutic Radiology, Yale School of Medicine, New Haven, Connecticut, United States of America.


Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising new approach to identifying microbes in clinical specimens, which, unlike other techniques, is not limited to known sequences. Unlike most sequencing applications, it is highly sensitive to laboratory contaminants as these will appear to originate from the clinical specimens. To assess the extent and diversity of sequence contaminants, we aligned 57 "1000 Genomes Project" sequencing runs from six centers against the four largest NCBI BLAST databases, detecting reads of diverse contaminant species in all runs and identifying the most common of these contaminant genera (Bradyrhizobium) in assembled genomes from the NCBI Genome database. Many of these microorganisms have been reported as contaminants of ultrapure water systems. Studies aiming to identify novel microbes in clinical specimens will greatly benefit from not only preventive measures such as extensive UV irradiation of water and cross-validation using independent techniques, but also a concerted effort to sequence the complete genomes of common contaminants so that they may be subtracted computationally.

[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Public Library of Science Icon for PubMed Central
Loading ...
Support Center