Strategies for detection of the genome-wide selection signatures in table 1. Consider a small gene region that displays SNP variation at 17 adjacent sites (vertical columns in all panels). (a) Eight individuals in species 1 (human) carry alternative white and green alleles (synonymous variants) and also a codon-altering non-synonymous allele (red and white). A related species (chimpanzee), examined at the same SNP sites, displays a divergence pattern from the index (human species); positive selection of one SNP allele alters the random distribution pattern when examining non-synonymous alleles only (red and white). Graphs on right plot departure of genome-wide average for parameter (measured by the seven selection tests described in table 1). (a) Comparing sequence divergence between species (table 1, I–III). Gene regions with past actions of selection show an altered sequence organization that can be revealed by comparing changes between homologous sequences by three different approaches. (I) Phylogenetic shadowing: comparing divergence of orthologous sequences across the genome. The genome segments with low divergence between species compared with the genome-wide averages can indicate purifying selection or positive selection. (II) Increased function-altering mutation rates: comparing the ratio of non-synonymous (dN: left panel; changes indicated in red) to synonymous changes (dS: right panel; changes in green). This comparison could be accomplished by (i) comparing the dN/dS ratio between the candidate gene of interest and the genome-wide average for other genes and (ii) comparing diversity with divergence ratio for dN versus dS for homologous sequences. (III) Interspecies divergence versus intraspecies polymorphism: comparing intraspecific divergence (e.g. between chimpanzee and human) with interspecific polymorphism (within the human species). Selection decreases variation within an affected species (dark orange), and the scope of this decrease can be assessed by contrasting with divergence between species sequences (light orange) unaffected by the species-specific adaptation. (b) Comparing sequence variation patterns within a species (table 1, IV–VIII). Positive selection results in an elevated frequency of haplotypes carrying the advantageous allele at the expense of the others in the process called ‘selective sweep’ (Maynard Smith & Haigh 1974), followed by the gradual incorporation of derived variation seen as a skewed ‘frequency spectrum’. These signatures can all be revealed by comparing sequences within or between populations of the same species. Five tests (described in table 1) include: (IV) Local reduction in genetic variation: comparison of levels of polymorphism in and around the selected locus to the estimated neutral expectation or to the genome-wide averages (left panel; ancestral alleles are in blue or light blue). (V) Changes in the shape of the frequency distribution: identifying an excess of derived alleles, low-frequency polymorphic sites or singletons. Generations after the selective sweep, new (derived) mutations (yellow) are slowly introduced back into the recently selected region, and most appear at low frequencies expected under mutation/drift equilibrium, resulting in a skewed frequency distribution (spectrum) of polymorphisms (left panel). (VI) Differentiating between populations: identifying regions of unusually high population divergence. Local reduction of genomic variation in a selected population (left panel, middle) results in a local increase in genomic differentiation between sequences (unaffected population is not shown in the figure but can be approximated by the population before selection: left panel, top). Comparisons can be made for levels of differentiation calculated as FST around the selected loci to the neutral expectations, to a set of neutral loci or to the genome-wide averages. (VII) Extended LD segments: comparing the relative length and frequency of selected haplotypes. Positive selection results in an elevated frequency of haplotypes carrying the advantageous allele at the expense of the others. Owing to the generations of recombination, long haplotypes are also rare. However, selection sweep creates haplotypes that are both long and frequent in a population (red and light red: right panel, middle and bottom). These methods are used to identify relatively recent and incomplete sweeps. (VIII) Elevated admixture contribution from one population: identifying sections of the genome with unusually high or low ancestry in a mixed population using MALD. Similar to VII, when two populations meet, one may carry a beneficial allele that can be later detected as a regional increase in ancestry, using a genome-wide map of highly differentiating population markers, and evaluated against the genome-wide expectation. I–VIII: blue line, genome-wide average.