Format

Send to

Choose Destination
BMC Genomics. 2015 Dec 7;16:1034. doi: 10.1186/s12864-015-2241-6.

A method for positive forensic identification of samples from extremely low-coverage sequence data.

Author information

1
Department of Biomolecular Engineering, University of California, Santa Cruz, 1156 High St, Santa Cruz, CA, 95064, USA. svohr@ucsc.edu.
2
Facultad de Ciencias, Universidad Nacional Autónoma de México, Av. Universidad 3000 Circuito Exterior S/N, Delegación Coyoacán, Ciudad Universitaria, C.P. 04510, Ciudad de México, D.F., México. fernandobuenabad@ciencias.unam.mx.
3
Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, 1156 High St, Santa Cruz, CA, 95064, USA. bashapir@ucsc.edu.
4
Department of Biomolecular Engineering, University of California, Santa Cruz, 1156 High St, Santa Cruz, CA, 95064, USA. ed@soe.ucsc.edu.

Abstract

BACKGROUND:

Determining whether two DNA samples originate from the same individual is difficult when the amount of retrievable DNA is limited. This is often the case for ancient, historic, and forensic samples. The most widely used approaches rely on amplification of a defined panel of multi-allelic markers and comparison to similar data from other samples. When the amount retrievable DNA is low these approaches fail.

RESULTS:

We describe a new method for assessing whether shotgun DNA sequence data from two samples are consistent with originating from the same or different individuals. Our approach makes use of the large catalogs of single nucleotide polymorphism (SNP) markers to maximize the chances of observing potentially discriminating alleles. We further reduce the amount of data required by taking advantage of patterns of linkage disequilibrium modeled by a reference panel of haplotypes to indirectly compare observations at pairs of linked SNPs. Using both coalescent simulations and real sequencing data from modern and ancient sources, we show that this approach is robust with respect to the reference panel and has power to detect positive identity from DNA libraries with less than 1 % random and non-overlapping genome coverage in each sample.

CONCLUSION:

We present a powerful new approach that can determine whether DNA from two samples originated from the same individual even when only minute quantities of DNA are recoverable from each.

PMID:
26643904
PMCID:
PMC4672566
DOI:
10.1186/s12864-015-2241-6
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center