Strategies for complete mitochondrial genome sequencing on Ion Torrent PGM™ platform in forensic sciences

Forensic Sci Int Genet. 2016 May:22:11-21. doi: 10.1016/j.fsigen.2016.01.004. Epub 2016 Jan 15.

Abstract

Next generation sequencing (NGS) is a time saving and cost-efficient method to detect the complete mitochondrial genome (mtGenome) compared to Sanger sequencing. In this study we focused on developing strategies for mtGenome sequencing on the Ion Torrent PGM™ platform and NGS data analysis. With our experience, 4, 15 and 30 samples could be loaded onto Ion 314™, Ion 316™ and Ion 318™ chips respectively at a pooling concentration of 26pM, achieving to sufficient average coverage of ≥1500 × and well strand balance of 1.05. Data processing software is essential to NGS mega data analysis. The in-house Perl scripts were developed for primary data analysis to screen out uncertain positions and samples from variant call format (VCF) reports and for pedigree study to perform pairwise comparisons. The Integrative Genomic Viewer (IGV) and the NextGENe software were introduced to secondary data analysis. The mthap and EMMA were employed for haplogroup assignment. The dataset was reviewed and approved by the EMPOP as the final version, which showed 2.66% error rate generated from the Torrent Variant Caller (TVC). Across the mtGenome, 4022 variants were found at 725 nucleotide positions, where ratio of transitions to transversions was estimated at 20.89:1 and 22.18% of variants was concentrated at hypervariable segments I and II (HVS-I and HVS-II). Totally, 107 complete mtGenome haplotypes were observed from 107 Northern Chinese Han and assigned to 88 haplogroups. The random match probability (RMP) of complete mtGenome was calculated as 0.009345794, decreasing 26.19% by comparison to that of HVS-I only, and the haplotype diversity (HD) was evaluated as 1, increasing 0.33% by comparison to that of HVS-I only. Principal component analysis (PCA) showed that our population was clustered to East and Southeast Asians. The strategies in this study are suitable for complete mtGenome sequencing on Ion Torrent PGM™ platform and Northern Chinese Han (EMP00670) is the first complete mtGenome dataset contributed to the EMPOP from East Asian.

Keywords: Forensic sciences; Ion Torrent PGM™; Mitochondrial genome (mtGenome); Next generation sequencing (NGS); Northern Chinese Han.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asian People / genetics
  • Base Sequence
  • DNA, Mitochondrial / analysis
  • DNA, Mitochondrial / genetics*
  • Forensic Sciences / instrumentation
  • Forensic Sciences / methods*
  • Genome, Mitochondrial*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Mitochondria / chemistry
  • Mitochondria / genetics*
  • Sequence Analysis, DNA / methods*

Substances

  • DNA, Mitochondrial