Bioinformatics. 2009 Nov 1;25(21):2839-40. doi: 10.1093/bioinformatics/btp527. Epub 2009 Sep 7.

TagDust--a program to eliminate artifacts from next generation sequencing data.

Author information

  • Omics Science Center, Riken Yokohama Institute, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan. timolassmann@gmail.com

Abstract

MOTIVATION:

Next-generation parallel sequencing technologies produce large quantities of short sequence reads. Due to experimental procedures, various types of artifacts are commonly sequenced alongside the targeted RNA or DNA sequences. Identifying such artifacts is important both during the development of novel sequencing assays and for the downstream analysis of the sequenced libraries.

RESULTS:

Here we present TagDust, a program that identifies artifactual sequences in large sequencing runs. Given a user-defined cutoff for the false discovery rate, TagDust identifies all reads explainable by combinations of, and partial matches to, known sequences used during library preparation. We demonstrate the quality of our method on sequencing runs performed on Illumina's Genome Analyzer platform.
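The two ingredients named above, scoring how much of a read is explainable by library-preparation sequences and applying a false discovery rate cutoff, can be pictured with a minimal sketch. This is not TagDust's implementation: the k-mer matching scheme, the function names, and the use of the Benjamini-Hochberg procedure are all assumptions made for illustration, and the per-read p-values are assumed to be supplied by some background model not shown here.

```python
from typing import Iterable, List, Set


def build_kmer_index(library: Iterable[str], k: int) -> Set[str]:
    """Collect every k-mer found in the library-preparation sequences.

    Hypothetical helper: k-mer matching is an assumed stand-in for
    "partial matches to known sequences".
    """
    index: Set[str] = set()
    for seq in library:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            index.add(seq[i:i + k])
    return index


def covered_fraction(read: str, index: Set[str], k: int) -> float:
    """Fraction of read bases lying inside at least one library k-mer."""
    read = read.upper()
    if len(read) < k:
        return 0.0
    covered = [False] * len(read)
    for i in range(len(read) - k + 1):
        if read[i:i + k] in index:
            # Mark all k bases of the matching window as explained.
            for j in range(i, i + k):
                covered[j] = True
    return sum(covered) / len(read)


def benjamini_hochberg(p_values: List[float], fdr: float) -> List[bool]:
    """Flag the p-values rejected at the given FDR (assumed procedure)."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank whose p-value is below its step-up threshold.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= fdr * rank / m:
            cutoff_rank = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected
```

In this sketch, a read with a high covered fraction would receive a small p-value under the assumed background model, and reads flagged by `benjamini_hochberg` would be treated as artifacts.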

AVAILABILITY:

Executables and documentation are available from http://genome.gsc.riken.jp/osc/english/software/.

CONTACT:

timolassmann@gmail.com.

PMID: 19737799 [PubMed - indexed for MEDLINE]
PMCID: PMC2781754 (free PMC article)