A. Distribution of locus-specific read-depth measurements in 808 OR loci. Altogether 121,200 data points are depicted (808 loci times 150 samples). Points relate the GC-adjusted read-depth to the expected read-depth, which is estimated based on the k-mer mappability of a locus and the genomic sequencing coverage of a sample. CopySeq copy-number genotypes are indicated by colors (bottom to top): ‘0’, red; ‘1’, orange; ‘2’, grey; ‘3’, blue; ‘4’, purple; ‘5’, green; ‘6’, brown; ‘7’, yellow; ‘8’, light blue; ‘9’, black). B–D. Dissecting a complex CNV region with CopySeq. The displayed region (chr11:4,921,968–4,930,581) harbors a multi-allelic CNV involving both a deletion and a duplication. The deletion results in an OR51A2—OR51A4 fusion-gene [4]. Read-depths are shown on the left and the inferred locus-structure on the right. CopySeq was carried out in conjunction with breakpoint-junction analysis [26], generating the following copy-number genotypes. NA19138: ‘2’ for OR51A4, ‘2’ for OR51A2, ‘0’ for the fusion-gene (B); NA12716: ‘0’, ‘0’, ‘2’ (C); NA19172: ‘2’, ‘4’, ‘0’ (D). Orange and blue boxes indicate open-reading frames (ORFs), and orange/blue lines denote the respective loci (with 3′ and 5′- regions). Both ORFs are on the reverse strand of the reference genome. The gene fusion occurred near the ORFs' 5′-end within a sequence stretch where both share extensive homology (thus, no reads map to this stretch uniquely). E. Copy-number genotype map of OR loci in 150 individuals. Each bar represents the frequency of a copy-number genotype (y-axis) at a particular OR locus (x-axis). Colors indicate copy-number genotype frequencies (color scheme is on the right).