Format

Send to

Choose Destination
BMC Genomics. 2018 Feb 21;19(1):158. doi: 10.1186/s12864-018-4544-x.

Analytical parameters and validation of homopolymer detection in a pyrosequencing-based next generation sequencing system.

Author information

1
Department of Laboratory Medicine, University of Debrecen, Nagyerdei krt. 98, Debrecen, H-4032, Hungary.
2
Division of Clinical Genetics, University of Debrecen, Nagyerdei krt. 98, Debrecen, H-4032, Hungary.
3
Department of Biology and Medical Genetics, Second Faculty of Medicine and University Hospital Motol, Charles University, Prague, Czech Republic.
4
Genomic Medicine and Bioinformatic Core Facility, University of Debrecen, Debrecen, Hungary.
5
Department of Laboratory Medicine, University of Debrecen, Nagyerdei krt. 98, Debrecen, H-4032, Hungary. balogh@med.unideb.hu.
6
Division of Clinical Genetics, University of Debrecen, Nagyerdei krt. 98, Debrecen, H-4032, Hungary. balogh@med.unideb.hu.

Abstract

BACKGROUND:

Current technologies in next-generation sequencing are offering high throughput reads at low costs, but still suffer from various sequencing errors. Although pyro- and ion semiconductor sequencing both have the advantage of delivering long and high quality reads, problems might occur when sequencing homopolymer-containing regions, since the repeating identical bases are going to incorporate during the same synthesis cycle, which leads to uncertainty in base calling. The aim of this study was to evaluate the analytical performance of a pyrosequencing-based next-generation sequencing system in detecting homopolymer sequences using homopolymer-preintegrated plasmid constructs and human DNA samples originating from patients with cystic fibrosis.

RESULTS:

In the plasmid system average correct genotyping was 95.8% in 4-mers, 87.4% in 5-mers and 72.1% in 6-mers. Despite the experienced low genotyping accuracy in 5- and 6-mers, it was possible to generate amplicons with more than a 90% adequate detection rate in every homopolymer tract. When homopolymers in the CFTR gene were sequenced average accuracy was 89.3%, but varied in a wide range (52.2 - 99.1%). In all but one case, an optimal amplicon-sequencing primer combination could be identified. In that single case (7A tract in exon 14 (c.2046_2052)), none of the tested primer sets produced the required analytical performance.

CONCLUSIONS:

Our results show that pyrosequencing is the most reliable in case of 4-mers and as homopolymer length gradually increases, accuracy deteriorates. With careful primer selection, the NGS system was able to correctly genotype all but one of the homopolymers in the CFTR gene. In conclusion, we configured a plasmid test system that can be used to assess genotyping accuracy of NGS devices and developed an accurate NGS assay for the molecular diagnosis of CF using self-designed primers for amplification and sequencing.

KEYWORDS:

Cystic fibrosis; Homopolymer detection; Pyrosequencing

PMID:
29466940
PMCID:
PMC5822529
DOI:
10.1186/s12864-018-4544-x
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center