BMC Syst Biol. 2016 Aug 1;10 Suppl 2:47. doi: 10.1186/s12918-016-0300-5.

SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations.

Author information

1. School of Computational Science & Engineering, Georgia Institute of Technology, Atlanta, 30332, Georgia, USA. yliu@cc.gatech.edu.
2. Translational Oncology, Johannes Gutenberg University Medical Center gGmbH Mainz, Mainz, 55131, Germany.
3. School of Computational Science & Engineering, Georgia Institute of Technology, Atlanta, 30332, Georgia, USA.
4. Institute of Computer Science, Johannes Gutenberg University Mainz, Mainz, 55128, Germany.

Abstract

BACKGROUND:

Various approaches to calling single-nucleotide variants (SNVs) or insertion-or-deletion (indel) mutations have been developed based on next-generation sequencing (NGS). However, most of them are dedicated to a particular type of mutation, e.g. germline SNVs in normal cells, somatic SNVs in cancer/tumor cells, or indels only. In the literature, efficient and integrated callers for both germline and somatic SNVs/indels have not yet been extensively investigated.

RESULTS:

We present SNVSniffer, an efficient and integrated caller that identifies both germline and somatic SNVs/indels from NGS data. In this algorithm, we propose the use of Bayesian probabilistic models to identify SNVs and investigate a multiple ungapped alignment approach to call indels. For germline variant calling, we model the allele counts at each site as following a multinomial conditional distribution. For somatic variant calling, we rely on matched tumor-normal pairs from the same individual and introduce a hybrid subtraction and joint sample analysis approach, modeling the tumor-normal allele counts at each site as following a joint multinomial conditional distribution. A comprehensive performance evaluation has been conducted using a diverse set of variant calling benchmarks. For germline variant calling, SNVSniffer demonstrates highly competitive accuracy with superior speed in comparison with the state-of-the-art FaSD, GATK and SAMtools. For somatic variant calling, our algorithm runs fast and achieves accuracy comparable to, or even better than, that of the leading VarScan2, SomaticSniper, JointSNVMix2 and MuTect.
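
To illustrate the general idea of treating per-site allele counts as multinomial, the following is a minimal Python sketch of Bayesian diploid genotype calling at a single site. It is not SNVSniffer's actual model: the symmetric per-base error rate `error`, the uniform genotype prior, and the function names `allele_probs`, `log_multinomial` and `genotype_posteriors` are all assumptions introduced here for illustration only.

```python
import math
from itertools import combinations_with_replacement

BASES = "ACGT"

def allele_probs(genotype, error=0.01):
    """Probability of observing each base in a read, given a diploid genotype.

    Assumption (not from the paper): each read samples one of the two alleles
    with probability 0.5, and a sequencing error turns the true base into any
    of the other three bases with equal probability error / 3.
    """
    probs = {}
    for base in BASES:
        p = 0.0
        for allele in genotype:
            p += 0.5 * ((1.0 - error) if base == allele else error / 3.0)
        probs[base] = p
    return probs

def log_multinomial(counts, probs):
    """Log of the multinomial probability mass of the observed allele counts."""
    n = sum(counts.values())
    log_coef = math.lgamma(n + 1) - sum(math.lgamma(c + 1) for c in counts.values())
    return log_coef + sum(c * math.log(probs[b]) for b, c in counts.items())

def genotype_posteriors(counts, error=0.01, prior=None):
    """Posterior probability of each of the 10 unordered diploid genotypes."""
    genotypes = ["".join(g) for g in combinations_with_replacement(BASES, 2)]
    if prior is None:
        # Assumption: uniform prior; a real caller would favor the reference genotype.
        prior = {g: 1.0 / len(genotypes) for g in genotypes}
    log_post = {
        g: math.log(prior[g]) + log_multinomial(counts, allele_probs(g, error))
        for g in genotypes
    }
    # Normalize in log space to avoid underflow at high coverage.
    peak = max(log_post.values())
    total = sum(math.exp(v - peak) for v in log_post.values())
    return {g: math.exp(v - peak) / total for g, v in log_post.items()}

if __name__ == "__main__":
    site_counts = {"A": 10, "C": 9, "G": 0, "T": 1}  # hypothetical pileup counts
    post = genotype_posteriors(site_counts)
    best = max(post, key=post.get)
    print(best, round(post[best], 4))
```

A site would be reported as a germline variant when the most probable genotype differs from the homozygous reference genotype. A somatic caller along the lines described above would extend this to a joint likelihood over the tumor and normal counts, which this sketch does not attempt.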

CONCLUSIONS:

SNVSniffer demonstrates the feasibility of developing integrated solutions for fast and efficient identification of germline and somatic variants. Nonetheless, accurate discovery of genetic variations is critical yet challenging, and still requires substantially more research effort. SNVSniffer and synthetic samples are publicly available at http://snvsniffer.sourceforge.net .

KEYWORDS:

Bayesian model; Indel calling; SNP calling; Somatic SNV calling

PMID:
27489955
PMCID:
PMC4977481
DOI:
10.1186/s12918-016-0300-5
[Indexed for MEDLINE]
Free PMC Article
