Format

Send to

Choose Destination
J Comput Biol. 2015 May;22(5):451-62. doi: 10.1089/cmb.2014.0151. Epub 2014 Dec 19.

A spatial haplotype copying model with applications to genotype imputation.

Author information

1
1 Department of Computer Science, University of California , Los Angeles, California.

Abstract

Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.

KEYWORDS:

1000 Genomes; expectation maximization (EM) algorithm; genotype imputation; linkage disequilibrium; polymorphism; single nucleotide; spatial genetics; stochastic gradient descent

PMID:
25526526
PMCID:
PMC4425418
DOI:
10.1089/cmb.2014.0151
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center