Format

Send to

Choose Destination
Nat Genet. 2017 Apr;49(4):588-593. doi: 10.1038/ng.3801. Epub 2017 Feb 27.

Diversity in non-repetitive human sequences not found in the reference genome.

Author information

1
deCODE Genetics/Amgen, Inc., Reykjavik, Iceland.
2
Berlin Institute of Health, Berlin, Germany.
3
School of Engineering and Natural Sciences, University of Iceland, Reykjavik, Iceland.
4
Faculty of Medicine, School of Health Sciences, University of Iceland, Reykjavik, Iceland.
5
Department of Internal Medicine, Landspitali-National University Hospital, Reykjavik, Iceland.
6
Department of Clinical Biochemistry, Landspitali-National University Hospital, Reykjavik, Iceland.
7
Department of Anthropology, University of Iceland, Reykjavik, Iceland.
8
School of Science and Engineering, Reykjavik University, Reykjavik, Iceland.

Abstract

Genomes usually contain some non-repetitive sequences that are missing from the reference genome and occur only in a population subset. Such non-repetitive, non-reference (NRNR) sequences have remained largely unexplored in terms of their characterization and downstream analyses. Here we describe 3,791 breakpoint-resolved NRNR sequence variants called using PopIns from whole-genome sequence data of 15,219 Icelanders. We found that over 95% of the 244 NRNR sequences that are 200 bp or longer are present in chimpanzees, indicating that they are ancestral. Furthermore, 149 variant loci are in linkage disequilibrium (r2 > 0.8) with a genome-wide association study (GWAS) catalog marker, suggesting disease relevance. Additionally, we report an association (P = 3.8 × 10-8, odds ratio (OR) = 0.92) with myocardial infarction (23,360 cases, 300,771 controls) for a 766-bp NRNR sequence variant. Our results underline the importance of including variation of all complexity levels when searching for variants that associate with disease.

PMID:
28250455
DOI:
10.1038/ng.3801
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Nature Publishing Group
Loading ...
Support Center