Format

Send to

Choose Destination
Genomics. 2014 May-Jun;103(5-6):323-8. doi: 10.1016/j.ygeno.2014.03.006. Epub 2014 Apr 3.

Multi-perspective quality control of Illumina exome sequencing data using QC3.

Author information

1
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: yan.guo@vanderbilt.edu.
2
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: shilin.zhao@vanderbilt.edu.
3
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: Quanhu.sheng@vanderbilt.edu.
4
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: Fei.ye@vanderbilt.edu.
5
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: riverlee2008@gmail.com.
6
Department of Biochemistry, Vanderbilt University, Nashville, TN 37027, USA. Electronic address: brian.d.lehmann@vanderbilt.edu.
7
Department of Biochemistry, Vanderbilt University, Nashville, TN 37027, USA. Electronic address: j.pietenpol@vanderbilt.edu.
8
Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, TN, USA. Electronic address: david.samuels@chgr.mc.vanderbilt.edu.
9
Vanderbilt Ingram Cancer Center, Center for Quantitative Sciences, Nashville, TN, USA. Electronic address: yu.shyr@Vanderbilt.Edu.

Abstract

Advances in next-generation sequencing (NGS) technologies have greatly improved our ability to detect genomic variants for biomedical research. The advance in NGS technologies has also created significant challenges in bioinformatics. One of the major challenges is the quality control of sequencing data. There has been heavy focus on performing raw data quality control. In order to correctly interpret the quality of the DNA sequencing data, however, proper quality control should be conducted at all stages of DNA sequencing data analysis: raw data, alignment, and variant detection. We designed QC3, a quality control tool aimed at those three major stages of DNA sequencing. QC3 monitors quality control metrics at each stage of NGS data and provides unique and independent evaluations of the data quality from different perspectives. QC3 offers unique features such as detection of batch effect and cross contamination. QC3 and its source code are freely downloadable at https://github.com/slzhao/QC3.

KEYWORDS:

Alignment; Exome sequencing; Quality control; Raw data; Variant call

PMID:
24703969
PMCID:
PMC5755963
DOI:
10.1016/j.ygeno.2014.03.006
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Elsevier Science Icon for PubMed Central
Loading ...
Support Center