Format

Send to

Choose Destination
See comment in PubMed Commons below
PLoS One. 2013 Jun 17;8(6):e66643. doi: 10.1371/journal.pone.0066643. Print 2013.

A filtering method to generate high quality short reads using illumina paired-end technology.

Author information

1
Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, Massachusetts, USA.

Erratum in

  • PLoS One. 2013;8(6). doi:10.1371/annotation/afa5c40d-c604-46ae-84c4-82cb92193a5e.

Abstract

Consensus between independent reads improves the accuracy of genome and transcriptome analyses, however lack of consensus between very similar sequences in metagenomic studies can and often does represent natural variation of biological significance. The common use of machine-assigned quality scores on next generation platforms does not necessarily correlate with accuracy. Here, we describe using the overlap of paired-end, short sequence reads to identify error-prone reads in marker gene analyses and their contribution to spurious OTUs following clustering analysis using QIIME. Our approach can also reduce error in shotgun sequencing data generated from libraries with small, tightly constrained insert sizes. The open-source implementation of this algorithm in Python programming language with user instructions can be obtained from https://github.com/meren/illumina-utils.

PMID:
23799126
PMCID:
PMC3684618
DOI:
10.1371/journal.pone.0066643
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Public Library of Science Icon for PubMed Central
    Loading ...
    Support Center