Format

Send to

Choose Destination
Hum Genet. 2018 Apr;137(4):281-292. doi: 10.1007/s00439-018-1881-4. Epub 2018 Apr 10.

Genotype imputation performance of three reference panels using African ancestry individuals.

Author information

1
Johns Hopkins University, School of Medicine, Baltimore, MD, USA.
2
Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA.
3
National School of Public Health, Universidad de Antioquia, Medellín, Colombia.
4
School of Medicine, Universidad Pontificia Bolivariana, Medellín, Colombia.
5
Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA.
6
Johns Hopkins University, Bloomberg School of Public Health, Baltimore, MD, USA.
7
Johns Hopkins University, Bloomberg School of Public Health, Baltimore, MD, USA. pduggal@jhu.edu.

Abstract

Genotype imputation estimates unobserved genotypes from genome-wide makers, to increase genome coverage and power for genome-wide association studies. Imputation has been successful for European ancestry populations in which very large reference panels are available. Smaller subsets of African descent populations are available in 1000 Genomes (1000G), the Consortium on Asthma among African ancestry Populations in the Americas (CAAPA) and the Haplotype Reference Consortium (HRC). We compared the performance of these reference panels when imputing variation in 3747 African Americans (AA) from two cohorts (HCV and COPDGene) genotyped using Illumina Omni microarrays. The haplotypes of 2504 (1000G), 883 (CAAPA) and 32,470 individuals (HRC) were used as reference. We compared the number of variants, imputation quality, imputation accuracy and coverage between panels. In both cohorts, 1000G imputed 1.5-1.6× more variants than CAAPA and 1.2× more than HRC. Similar findings were observed for variants with imputation R2 > 0.5 and for rare, low-frequency, and common variants. When merging imputed variants of the three panels, the total number was 62-63 M with 20 M overlapping variants imputed by all three panels, and a range of 5-15 M variants imputed exclusively with one of them. For overlapping variants, imputation quality was highest for HRC, followed by 1000G, then CAAPA, and improved as the minor allele frequency increased. 1000G, HRC and CAAPA provided high performance and accuracy for imputation of African American individuals, increasing the number of variants available for subsequent analyses. These panels are complementary and would benefit from the development of an integrated African reference panel.

PMID:
29637265
PMCID:
PMC6209094
[Available on 2019-04-10]
DOI:
10.1007/s00439-018-1881-4

Supplemental Content

Full text links

Icon for Springer
Loading ...
Support Center