
The Genetic Architecture of Domestication in Animals
Abstract
Domestication has been essential to the progress of human civilization, and the process itself has fascinated biologists for hundreds of years. Domestication has led to a series of remarkable changes in a variety of plants and animals, in what is termed the “domestication phenotype.” In domesticated animals, this general phenotype typically consists of similar changes in tameness, behavior, size/morphology, color, brain composition, and adrenal gland size. This domestication phenotype is seen in a range of different animals. However, the genetic basis of these associated changes is still puzzling. The genes for these different traits tend to be grouped together in clusters in the genome, though it is still not clear whether these clusters represent pleiotropic effects, or are in fact linked clusters. This review focuses on what is currently known about the genetic architecture of domesticated animal species, if genes of large effect (often referred to as major genes) are prevalent in driving the domestication phenotype, and whether pleiotropy can explain the loci underpinning these diverse traits being colocated.
Domestication as a process has been an integral part of human civilization and instrumental in allowing the rapid spread of the human race throughout the globe. Domestication itself is used for a wide variety of analyses and theories, with Darwin himself using it as a model for evolution.1 More recently, it has helped recreate human migration patterns,2,3 and the huge phenotypic variation generated in domesticated species have been used to map causal genes and the mutations underlying a variety of trait types.4 In this way, domestication can aid in completing the genotype–phenotype map in a variety of plants and animals and increase our understanding of how changes in the genome can bring about alterations in both quantitative and discrete traits.
One of the greatest conundrums concerning domestication itself is related to the wide variety of traits that are modified by this selective process. These traits show similar alterations in a wide variety of different domestic species and often the traits themselves seem coincidental or nonbeneficial to the main focus of domestication.5 This combined domesticated phenotype, as it has been termed,6 represents a complex of convergent traits. Domestic animals tend to be smaller than their wild counterparts and have reduced or altered pigmentation. Particular pigmental patterns, ranging from albinism, piebaldness, and the like, have all been selected for in a variety of animals.7,8 More strictly, size has also been selected to be both larger as well as smaller in certain domestic animals, most notably in dogs.7 Furthermore, different morphological proportions (for example, skull size, shape, and leg length in dogs7) have also been selected for. Brain size and composition has also been altered in domestic animals, with an enlarged telencephalon often occurring, and a reduced relative brain size.9 Behavioral and physiological traits are also often modified. In particular, tameness, also referred to as a reduced fear of human beings in the literature, is increased,6,10 aggression is decreased, and activity level and explorative tendencies are altered.11 An earlier onset of sexual maturity,12–15 increased reproduction (number of estruses, egg production, and the like) and altered adrenal development are all also observed. In animals, it has been posited that selection for tameness was the initial primary focus of domestication. However, it remains to be seen whether there was active selection for all the remaining traits, or whether pleiotropy, drift, relaxed selection, or other forces led to some of them being fixed as well.
Given these markedly different traits that have all been selected for during domestication, a major question is how all these are controlled at the genetic level. Given that selection acting upon such a variety of domestic animals can produce such similar phenotypes, this raises the question of whether this is indicative of how selection acts in all such cases? Are there unique genomic properties of domesticated animals that allows for such development, or is this solely the product of strong artificial selection? Do domesticated species represent multiple independent case studies of how such selection can progress? To be able to answer these questions, it is necessary to identify the genes behind these different traits (termed the genetic architecture), and how they interact to give rise to the domestic phenotype. First, do genes of large effect give rise to the observed changes, or are they governed by numerous small-effect loci that are additive in nature (as predicted by Fisher). The next question is to what extent is pleiotropy a factor in causing these traits, ie, does one mutation lead to numerous different phenotypic alterations that could help explain the diverse set of changes in the domesticated phenotype? These two questions are examined below.
Major Genes for Domestication
The first stage to the elucidation of the genetic architecture of a trait is generally to identify genes or regions of large effect – often termed major genes. The genetic architecture in this case refers to the number of genes, their effect size, and their location in the genome that govern the variation present in or between any given population. In this context, major genes account for a large percentage of the variation in a given trait. Although appealing because of their relative ease in identification (in comparison to small-effect loci), relatively few examples of major genes for quantitative traits in outbred populations exist.16 Major genes have been identified in a variety of domestic animals; however, care must be taken in their definition. Many are virtually monogenic in effect, meaning that the trait in question is almost entirely governed by the single gene/mutation in question. For true quantitative traits, the number of major genes identified is far lower, though examples are apparent. The most common example of major genes is for coat color, with the same genes implicated in the regulation of coat color across a variety of different species (including numerous domesticated ones). For example, the genes that have been identified as affecting different coat traits in dogs (most notably MC1R, ASIP, TYRP1, CBD103, and SILV/PMEL17 – see review in Ref. 17) are also found affecting color in a multitude of other species.18,19 Similarly, the KIT gene is associated with white spotting pattern among a variety of color phenotypes in horses,20 dominant white color in pigs,21 and proportion of black color in cattle.22 Such large effects that are exhibited in a cross-species manner is representative of many of these genes for color.17,23 Coat color is a truly quantitative trait, and while major genes account for a large percentage of the genetic variation of the trait, numerous other genes of much smaller effect are also involved. For example, in the case of the proportion of black coloration in cattle described earlier, although the genes KIT and MITF and one other locus explained 24% of the variation present, a large number of other loci were required to explain the remaining variation using a genomic prediction model.22 Therefore, a pertinent question is how many of the major genes identified are really monogenic characteristics, with the gene in question explaining all of the variations in the trait, and how many are part of a wider genetic architecture governing a particular trait. Similarly, the above coat color mutations also highlight how the same gene may be subject to multiple different causal mutations. In the example given earlier, a number of different mutations have affected the same gene in multiple different species (eg, KIT and MC1R). However, multiple domestication events can potentially lead to different mutations in the same gene in the same species. For example, the Wx gene in rice contains multiple mutations affecting the same trait.24 A breakdown of major genes present in domestic animals is presented below and in Table 1.
Chickens
The comb of the chicken has been a source of interest since the early days of genetics, when Bateson and Punnett used the pea comb, rose comb, and walnut comb mutations to illustrate the first example of Mendelian inheritance and epistasis in animals.25 These comb mutations were all identified in domesticated chickens and have subsequently been identified at the genomic level.26,27 Given that these were used as a model of Mendelian inheritance, it is hardly surprising that they are monogenic in effect (although the rose comb and pea comb mutations do interact epistatically to reveal a further comb phenotype – the walnut comb). Other monogenic domestication traits have also been identified in the chicken – principally yellow skin color (controlled by the BCDO2 gene),28 duplex comb,29 and barred coat color genes.30
Dogs
In dogs, strong breed differences and extreme inbreeding have aided in the identification of numerous monogenic genes affecting the domesticated phenotype. Genes for wrinkled skin,31 leg length,32 and body size33 have all been discovered. In the case of the last two genes, these traits are also quantitative, though the genes identified explain virtually all the variations between long- and short-legged breeds of dogs, and large and small breeds. Almost all fur growth type and texture phenotypes have also been shown to be explained by only three genes (FGF5, KRT71, and RSPO2).34
Cattle, sheep, and horses
In cattle, a deletion mutation has been identified in the MSTN gene that leads to extreme muscular development (an increase in muscle mass of 20%),35 with the same gene that causes this increased musculature also exhibiting similar effects in sheep.36 Regarding production traits in cattle, a nonsense mutation in the gene FMO3 has been identified as causing a characteristic fish-odor in milk from cattle.37 Also relating to milk production, the DGAT1 gene accounts for 30% of the variation in fat percentage in milk,38 while the ABCG2 gene affects milk composition.39 Other domestication-related genes have also been identified in the sheep. The gene RXFP2 has been linked with horn type,40 though in this case no functional data or mutation identification is currently available to confirm this effect. Also in sheep, an X-linked mutation in a specific breed causes increased ovulation rate and twin births,41 while an autosomal mutation in BMPR1B causes the Booroola phenotype that also affects ovulation.42 In horses, the gene DMRT3 has recently been shown to be required for the unique gait types present in Icelandic horses.43 In the case of the DMRT3 locus, this is particularly intriguing as it is probably the only major gene identified in domestic animals that can be thought of as modifying behavior.
Pigs
In pigs, a major gene for malignant hyperthermia has been associated with the ryr1 gene44 that also controls meat quality traits, while a nonconservative mutation in the gene PRKAG3 leads to high glycogen content in muscles.45 A gene affecting muscularity has also been identified in pigs (IGF2), which also affects back fat and heart size,46 while the gene PPARD has been identified as affecting ear size.47 In both the latter cases, the traits are fully quantitative, yet the identified genes account for a large percentage of the trait variation – 40% in the case of PPARD and 15%–30% in the case of IGF2. Polymorphisms in the CAST gene have also been associated with meat tenderness in pigs, though more in-depth functional assays have yet to be made beyond this association.48
Quantitative trait loci studies
In the examples shown earlier, the bulk of major genes identified act in a monogenic fashion, in that they explain almost all the variation of the traits they regulate. A few of the genes in question are part of the architecture of a standard quantitative trait, though certainly examples are present (as discussed earlier). In terms of analyzing whether major genes are more prevalent in domesticated species, it is important to bear in mind that the identification of the actual causal genes or mutation is extremely rare. In contrast, more standard Quantitative Trait Loci (QTL) mapping studies can be used to identify the overall genetic architecture of a trait in terms of estimating the number of loci involved and their relative strength of effect.49 Such studies are far more abundant and still give an idea of the underlying architecture. There are, of course, problems with this approach. The use of an F2 or similar type of second-generation intercross population means that the resolution in terms of the size of the QTL regions identified is far too low for candidate gene identification. Each QTL typically covers a span of ~20–30 cM, depending on the size of the cross, which can equate to hundreds or thousands of genes, depending on the genome size and recombination rate of the species used. This low resolution, caused by the limited number of recombinations that have accrued in these intercross populations, can also lead to a misleadingly simplified genetic architecture. This potential problem occurs when a QTL is in fact made up of multiple smaller loci; however because of the low number of recombinations present in the population and hence the poor resolution, these manifest as a larger effect and would be falsely described as a major gene/QTL effect. Similarly, if the loci are antagonistic (ie, have the opposite direction of effect in terms of the additive effect of each sublocus), a QTL may be missed. Nevertheless, the relative ease of performing a QTL analysis means the number of identified loci and the number of successful studies are very high, allowing a more holistic examination of the prevalence of major genes (or rather major-effect QTL). Table 2 lists the current genome build and online QTL resources for the domestic animals listed below.
In the chicken, major-effect loci have been identified for comb mass,50 growth,13,51 and susceptibility to feather pecking.52 In contrast, most other traits measured in domestic chickens compared to their wild counterparts show more standard effect sizes (mean ~5%)49,53 and no large-effect genes have been found in traits ranging from bone allocation,54,55 fear-avoidance behavior,12,56 onset of sexual maturity,15 number of eggs produced,57 and production characteristics (muscle pH, color, and egg taste).58
In pigs, QTL for a wide variety of production-related traits have been identified, including teat number,59 porcine leukocyte count,60 carcass quality and growth traits,61–63 meat quality64 and pH,64 litter size/fecundity,65 and disease resistance.66 There are well over 100 QTL studies involving the pig, with the results summarized in several reviews65–67 and a pig QTL database.68 Some large-effect QTL have been observed, including loci for shoulder weight, loin weight and ham weight,63 and pH,64 with the rest being standard effect sizes.
A multitude of QTL studies in cattle have looked at characteristics such as growth, meat quality, milk composition, behaviour, fecundity and reproduction, mastitis and pigmentation. For a full summary, the bovine QTL website (http://bovinegenome.org/bovineqtl_v2/login.jsp) gives a full list of location, effect, and breed, among other information. Of the QTL, once again the majority are of standard (ie, relatively modest) effect sizes, though milk composition and pigmentation traits contained major loci (with the genes identified, as discussed earlier).
Summary of major genes in domestication
A common pattern occurs in almost all the domestic species – although some major genes are present, this does not appear to be the norm for the bulk of the loci that affect these domestication characters. The fact that such loci are present at all is most likely because of the strong directional selection brought about by domestication, as it would be relatively easy to identify and fix such polymorphisms/mutations as they are phenotypically identified. Further evidence for this directional selection favoring some major gene effects is also provided by the genetic architecture of morphology in the dog. The dog has undergone some of the most extreme directional selection among domesticated animals, leading to the most phenotypically diverse mammalian species on the planet.69 This selection, in combination with the extreme inbreeding within breeds, appears to have led to a remarkably simple genetic architecture, especially for morphological traits. Numerous monogenic mutations have therefore arisen in the dog and aided gene identification, while even more standard morphological studies tend to have >70% of the variation explained by two to six QTL.70 Reasons posited for this simplistic architecture have been based on the fact that many modern breeds were created during the Victorian era, when breeders focused heavily on novelty and the preservation of discrete mutations.70 Such discrete mutations would then be easy to breed into a variety of different genetic backgrounds, which then expanded the breed diversity. This is potentially at odds with the selection occurring in other domesticated breeds, where selection instead is being directed on economically important traits by using the standing genetic variation already present within a wild population, rather than large-effect discrete mutations. This would then imply that the large-effect genes that have been identified across domesticated species are more recent mutations that were far easier to select on, whereas the bulk of the variation present in a domestic species came from the standing variation present in the wild progenitors.
Pleiotropy of Domestication Effects
One of the most enduring questions regarding the domestication process has been the recurrent evolution of a discrete set of traits that make up the domestication phenotype. Darwin himself first noted this when examining domestic mammals in relation to their wild counterparts1 and was drawn on whether this was because of something in the domestication process per se. In this role, domestication could serve as a general model for convergent evolution when species undergo strong directional selection.71 As stated earlier, the domestication syndrome itself tends to comprise changes in tameness, coat color/pigmentation, skull morphology, reproductive alterations, hormone and neurotransmitter concentrations, and brain composition. Given these changes, it is therefore possible that the common domestic environment induces this suite of alterations or that the genes under selection have pleiotropic effects on multiple traits. Most pertinently, domestication has been posited to initially be driven by selection for tameness.10,72 One early potential theory for these domestication phenotypes are through common environmental effects, what Darwin referred to as gentler conditions of living in the domestics causing these changes.1,73 However, the evidence for such environmental effects is very sparse. Heritability estimates of different domestication traits reveal a strongly heritable component on the behavior and morphology traits associated with domestication,74,75 which would not occur if these were environmentally driven. Further, feral populations of wild-living domesticates do not show the complete loss of the phenotypes fixed during domestication, though this is complicated by hybridization with wild species.76 For example, feral chickens that live in Hawaii have been shown to maintain signatures of both color and vocalization in their current state, despite having hybridized with the progenitor Red Junglefowl on the island.77 In relation to pleiotropy, further hypotheses have also been posited. Most recently, a role has been proposed for neural crest cells (NCCs) driving the changes, with this based on the reduced size and function of the adrenal glands.78 A further theory, also based on this alteration in adrenal size, hypothesizes that the domestication phenotype is because of a single genetic regulatory network, with the main genetic changes occurring in upstream regulators, and thereby affecting a large number of diverse downstream systems.79 The common denominator for both these hypotheses is obviously that the changes induced are actually pleiotropic and affect many different traits.
Evidence for pleiotropy in domestication
Perhaps the best evidence for pleiotropy is the correlations that exist between separate components of the domestication syndrome when domestication is recreated through selection lines. The most well known of these selection lines was based on the silver fox and consisted of directional selection solely based on tameness and aggression.72,80 It was found that after only 8–10 generations, color changes appeared (yellowing and piebaldness), adults maintained the juvenile floppy ears, skull changes occurred (leading to short and wide faces), as did curly tails.79,81 These results suggest that selection on a single behavioral trait can lead to a range of morphological changes. In a similar experiment, rats also divergently selected for tameness/aggression exhibited changes in neurotransmitters, hormone levels, and morphology.82,83 These studies imply that the loci for tameness show pleiotropic effects on multiple different domestication syndrome traits, though in these instances close linkage between different genes affecting each trait could also be a potentially plausible explanation.
If pleiotropy is occurring, QTL studies between wild and domestic animals should show an overlap between the multiple different domestication traits. Plant and animal studies using this mapping approach show a marked similarity in their findings. Multiple domestication traits are found clustered fairly closely together throughout the genome. These clusters are fairly loose, typically ≥20–30 cM (1 cM roughly equates to a 1% recombination rate, with the physical distance varying depending on the species in question – in human beings and mice, in a standard design, 1 cM is ~1 Mb; in chickens, 1 cM is ~350 kb). In plants, these clusters have been found in beans,84,85 maize,86,87 sunflowers,88 rice,89 and pearl millet.90 Initial work using the aforementioned rat selection lines found that when divergent lines were crossed together to map the underlying genes, clusters of QTL were also once again revealed. In the case of the rat, these were in two clusters spanning 41 and 29 cM, respectively.82 Using an intercross between wild and domestic chickens, QTL clusters were found that indicated multiple different trait types were overlapping,13 with loci for comb size (a sexually selected ornament in the chicken) overlapping with onset of sexual maturity loci,15 as well as with bone density and egg production traits.57 In the silver foxes differentially selected for behavior (also mentioned previously), a QTL analysis for behavior indicated an overlap between different behavioral QTL, though this study was restricted to behavioral traits, so other characteristics were not mapped simultaneously.91
The problem with these examples is related to the mechanics of QTL analysis as it is generally performed. All these studies use variations on an F2 intercross, meaning that the resolution of such studies tends to be rather low, with the confidence intervals (~30 cM or more) equating to at least this number of megabases in almost all the species examined. Loci can be overlapping but this may be because of linkage (ie, close physical proximity between loci) rather than direct pleiotropy from a single causal loci. For example, in the case of the wild x domestic chicken study, statistical analysis aimed at separating these two states found that most clusters in the F2 population appeared to be due to linkage, with only the central core of QTL in each cluster unable to be distinguished as either closely linked or pleiotropic.13 To ascertain if true pleiotropy is occurring in such clusters, there are a number of techniques that can be used. First, you can attempt to increase the number of recombinations to expand the genetic map, by intercrossing for further generations (thereby increasing the resolution of the detected QTL).92 If these loci are pleiotropic, then the clusters should remain the same size, rather than expanding along with the map. In the case of the chicken intercross, this was expanded to eight generations of intercrossing, with the result being far narrower intervals generated. One such locus affecting comb size was reduced to 600 kb, with only two genes in the interval, and with expression analyses indicating greater effects from one gene, HAO1.50 This gene was also found to show pleiotropic effects on bone and egg production characteristics, once again using a combination of QTL analysis and gene expression correlations between the traits and genes involved. When a more global expression QTL (eQTL) analysis of both comb and bone samples was performed (enabling multiple genes to be tested for effects on expression in both comb and bone tissue), results suggested that both pleiotropy and linkage were present.93
To truly define pleiotropy, however, the actual causal mutation/polymorphism must be identified. In this instance, it is possible to identify genuine pleiotropic effects. Unfortunately, as shown earlier, the identification of such loci is extremely rare. More plant-based examples exist than their animal counterparts, and even here, the polymorphisms that are more readily identified are most commonly of very large effect. This makes them almost functionally identical to major genes rather than the small-effect loci more commonly associated with quantitative traits. Although this review focuses on animal domestication, plant genetics has some surprising and relevant results to elucidate the mechanisms at work. Where polymorphisms have been identified, there appears to be pleiotropy. The Q locus in wheat affects a variety of traits, including free-threshing and floral development characteristics,94,95 and the tb1 locus in maize affects a variety of domestication characters.87,96 However, in the case of tb1, there are also other mutations in linked loci that affect domestication traits.86,87 Other examples demonstrate that, in fact, different mutations can be present in close proximity. The shattering mutation in rice is a single recessive mutation that is tightly linked to loci affecting seed dormancy and the pericarp.97 Animal examples of the identification of the causal mutation for a quantitative trait in a domestic animal includes the IGF2 mutation in pigs, which also affects muscle mass and fat deposition.46 Somewhat strikingly, many of the other mutations identified in animals do not exhibit extensive pleiotropy, and certainly none of the extreme pleiotropy that would be required to explain some of the more diverse aspects of the domestication syndrome. Therefore, it seems that if one considers both plant and animals examples, pleiotropy does exist, and also that domestication mutations often exist in linked groups of multiple mutations/genes (certainly in the case of plant domestication genetic architecture).
It is also of interest to compare the extent of pleiotropy as seen in domestication with that observed in quantitative traits in general. One issue here is that there are several different types of quantitative traits (morphological, behavioral, and life history in general) that may be under very different forms of selection. The extent of pleiotropy present in general was thought to be high because of predictions based on extensions of Fisher’s geometric model. This led to the universal pleiotropy hypothesis that states that pleiotropy arises so frequently as to almost be considered universal.98 Pleiotropy is also the foundation of the cost of complexity hypothesis, which posits that complex organisms are less adaptable or responsive to evolution because of the increased degree of pleiotropy that is present.99,100 However, despite these hypotheses,98 QTL studies in general have found there to be less pleiotropy than expected (see pleiotropic scans in mice101 and stickleback102). Similarly, empirical data from gene knockout and knockdown experiments in a variety of taxa ranging from yeast to mice also indicates that pleiotropy is generally fairly low (see the review by Wagner and Zhang103). It therefore appears that domestication traits do not appear to exhibit more pleiotropy than quantitative traits in general, though with the caveat that additional mutation identification is most likely required to verify this statement.
Phenotypic buffering, eQTL, and pleiotropy
Phenotypic buffering was first proposed by Waddington and was initially used to explain how the phenotype of an animal could be resilient to genetic and environmental perturbations.104 This idea has recently been taken into account for potential pleiotropy in eQTL analyses – if buffering is correct, then when certain key loci are perturbed, this will lead to a raft of other changes being revealed. The underlying mutations would no longer be buffered, hence their effects would then be revealed by the single buffering locus alteration. eQTL analysis itself consists of standard QTL mapping, though in this instance uses global gene expression from a particular tissue as the source of the phenotypes. By combining these with the genotypic information of each individual, it is possible to map genes that vary according to their underlying genotype. This should then be observed by hotspots in the eQTL with multiple trans effects, showing genes from dispersed areas of the genome are all controlled by a single genetic locus.105,106 Although the resolution issues of QTL analysis can once again be an issue here, the phenotype (gene expression) tends to be less noisy than conventional analysis. Where such analysis has been performed, pleiotropic hotspots appear to be rather limited, even when they are found. For example, in the study by Fu et al,106 several phenotypic hotspots were found, as were hotspots for metabolite abundance; yet, these were only reflected in a single eQTL hotspot. Chesler et al found a relatively small number of QTL correlated with large gene expression modules situated in the genome, hypothesizing that this indicated pleiotropy.107 If multitrait pleiotropy for the domestication syndrome does exist, then obviously such eQTL hotspots should also be observed in wild × domestic intercross eQTL analyses. In the case of the rat lines divergently selected for tameness, an eQTL analysis was performed on 150 individuals and only identified one locus that contained more trans eQTL than was expected by chance, with this being only just significant (seven in the region and six expected by chance),108 with this hotspot not overlapping any of the tameness QTL. In the case of the chicken, multiple tissues have been used for this type of eQTL analysis. In bone samples, some trans hotspots occurred, though most did not overlap any of the bone QTL observed.55 Therefore, a similar pattern is once again seen as with major gene pleiotropy.
Domestication by Modules
Given that although pleiotropy does occur, its effects seem somewhat limited and cannot alone explain the large clusters of multiple different trait QTL all grouping in loose clusters throughout the genome. The most parsimonious explanation is therefore that linked genes surround a pleiotropic core in each of these clusters or modules. The population geneticist Grant noticed how in plants even a strong domestication phenotype could be rapidly broken down during reverse selection or through cross-breeding.109 This is similar to what has been shown in animal examples.13,108,110 This system would allow the rapid introgression of desirable traits into a domesticated species, with the remainder of the domesticated phenotype easily recapitulated in this new hybrid. It still remains to be ascertained whether such a system of linkages is specific to domesticated species (and therefore has the potential to help explain why so relatively few species have been domesticated, despite multiple opportunities existing for additional species) or is in fact a more general effect brought about by directional selection.
Modularity is found not only with loci affecting different aspects of the domestic phenotype but can also be seen with gene expression in general and how the genome is organized. For example, although it appears that hundreds or even thousands of genes govern the differences between wild and domestic populations, judging by the gene expression changes seen in microarrays,111–113 these genes tend to be located in discrete clusters or modules. Therefore, genes with similar expression patterns are located in close proximity. A particularly interesting study by Litvin and colleagues114 combined both eQTL analysis and clustering analysis. By first identifying where genes were clustered together in the genome, in terms of sharing similar expression patterns, they then went on to use this to identify genes that were controlled by multiple different loci. In this case, 44 different modules were detected throughout the genome (with each module linking five or more genes), so a relatively small number of modules tie in the majority of gene expression differences between wild and domestic yeast. It also showed that multiple interacting loci influence the expression of these modules as opposed to one single pleiotropic locus for each cluster. A similar pattern has been observed using wild inbred lines of Drosophila,115 with differentially expressed genes grouped together in clusters in the genome.
It is possible to model how linkage blocks may evolve. Using a model of allogamous and autogamous species, D’Ennequin demonstrated that by starting with loci distributed evenly throughout the genome for a variety of traits, continuous selection can lead to clusters of QTL forming in the genome, with between two and four loci linked together in each cluster in their model.116 This mimicked the observed effects seen in a variety of domestic plants and animals. The authors of that study draw parallels between gene clusters for domestication and those for Batesian mimicry in butterflies. In the case of Batesian mimcry, genes affecting various aspects of mimicry phenotypes are all grouped together in supergenes.117 Any recombinants are therefore strongly selected against, as they will be poorer mimics. A mechanism that has been proposed to explain this development is the sieve hypothesis,118 and there is, therefore, a great deal of support for such clustering, both empirically (from assorted QTL studies) and theoretically.
Concluding Remarks
The huge changes brought about by domestication selection provide us with a puzzle as to how the domestication phenotype appears to have been replicated in so many distinctly different species. Questions as to how these changes are brought about at the genetic level and whether they are typical of directional selection in general, or rather represent a specific case, have yet to be fully addressed. Perhaps most intriguing is the thought that certain species may contain the right genomic makeup, allowing them to be more easily domesticated, whereas others will be almost impossible to domestic for the same reason.
Several different theories have been raised to potentially explain how the domesticated phenotype develops genetically, and how multiple diverse characteristics are selected and fixed in the genome. These theories can be broadly divided into the ‘neural crest hypothesis’76, the ‘single genetic network’ hypothesis77 and what I have termed the ‘clustered linkage’ hypothesis (after a theory first posited by Grant109). The neural crest hypothesis posits that mild neural crest deficits during embryonic development can drive the diverse range of phenotypes affected by the domestication syndrome. NCCs are precursor cells for a wide variety of tissue types,119,120 so domestication selection for behavior could be targeting genetic variants that modify NCC numbers or migration, which in turn leads to a variety of pleiotropic effects. The single genetic network hypothesis is built on a theory developed by Belyaev,72 whereby reduced stress levels in the domestic environment leads to hormonal responses that reset gene expression patterns. This theory has then been expanded to a single genetic regulatory network underlying the domesticated phenotype. Mutations or polymorphisms are then selected for upstream regulators of the network, resulting in numerous downstream changes.79 These upstream mutations can be genetic or epigenetic in nature. Finally, the clustered linkage hypothesis has been outlined in detail earlier. Of the three, the first two predict strong and diverse (in terms of the traits affected) pleiotropy, while the third suggests that linkage is the most dominant factor at play. When weighing up the current evidence, it is noteworthy that, although present, pleiotropy does not appear to be so widespread in domesticated species as is predicted by the first two hypotheses. When pleiotropy is found, the traits affected tend to be those that are related functionally, for example, different aspects of bone physiology, etc. They do not appear to explain the more diverse aspects of domestication (eg, behavior and morpho logy), when either the causal gene is known or fine-scale QTL analysis is used. In contrast, clusters of QTL have been identified in the genomes of multiple different domesticated plant and animal species. Similarly, eQTL analysis also finds multiple modules/clusters but less pleiotropy. Therefore, the current evidence appears to be against the neural crest and genetic network hypotheses and in favor of the linkage clusters. As a slight modification to the latter theory, I would suggest that a pleiotropic core surrounded by more loosely linked loci would best explain the observed data and is the most parsimonious explanation. However, until the actual mutations underlying the diverse traits that make up the domestication phenotype are revealed, it will be impossible to verify this extension.
The advances in genomic resources can only aid the field of domestication genetics and the identification of the genetic architecture that underlies it. High-density SNP chips allow Genome Wide Association Studies to be more easily performed on pedigree populations, removing the need for tailored intercrosses. This is especially useful for large animals with longer generation times. Genetic sequencing has reduced greatly in price and a variety of domestication 1000 genomes projects have started with a view to mapping variants within species.121 Similarly, transcriptomic, methylomic, and other forms of sequence mapping (CHIP-seq, siRNA-seq, and the like) raise the possibility to gain a holistic view of the raft of potential changes underpinning domestication.
Acknowledgments
The author would like to thank Dr. R. Henriksen for valuable comments on the manuscript.
Footnotes
ACADEMIC EDITOR: J. T. Efird, Associate Editor
PEER REVIEW: Seven peer reviewers contributed to the peer review report. Reviewers’ reports totaled 2,129 words, excluding any confidential comments to the academic editor.
FUNDING: DW is supported by grants from the Swedish Research Council (VR) and the Swedish Research Council for Environment, Agricultural Sciences and Spatial Planning (FORMAS), as well as the LiU-Neuro network. The author confirms that the funder had no influence over the study design, content of the article, or selection of this journal.
COMPETING INTERESTS: Author discloses no potential conflicts of interest.
Paper subject to independent expert blind peer review. All editorial decisions made by independent academic editor. Upon submission manuscript was subject to anti-plagiarism scanning. Prior to publication all authors have given signed confirmation of agreement to article publication and compliance with all applicable ethical and legal requirements, including the accuracy of author and contributor information, disclosure of competing interests and funding sources, compliance with ethical requirements relating to human and animal study participants, and compliance with any copyright requirements of third parties. This journal is a member of the Committee on Publication Ethics (COPE).
Author Contributions
Wrote the first draft of the manuscript: DW. Developed the structure and arguments for the paper: DW. Made critical revisions: DW. The author reviewed and approved of the final manuscript.