![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2008 The American Society of Human Genetics. Published by Elsevier Ltd. All right reserved.. Runs of Homozygosity in European Populations 1Public Health Sciences, University of Edinburgh Medical School, Edinburgh EH8 9AG, UK 2Unité de Recherche en Génétique Epidémiologique et Structure des Populations Humaines, INSERM U535, BP 1000, 94817 Villejuif, France 3Institute for Anthropological Research, 10000 Zagreb, Croatia 4Andrija Stampar School of Public Health, Faculty of Medicine, University of Zagreb, 10000 Zagreb, Croatia 5Colon Cancer Genetics Group, Institute of Genetics and Molecular Medicine, University of Edinburgh and MRC Human Genetics Unit, Edinburgh EH4 2XU, UK 6Medical Genetics Section, University of Edinburgh, Molecular Medicine Centre, Edinburgh EH4 2XU, UK 7MRC Human Genetics Unit, Western General Hospital, Edinburgh EH4 2XU, UK 8Croatian Centre for Global Health, Faculty of Medicine, University of Split, 21000 Split, Croatia 9Institute for Clinical Medical Research, University Hospital “Sestre Milosrdnice,” HR-10000 Zagreb, Croatia James F. Wilson: jim.wilson/at/hgu.mrc.ac.uk Corresponding author Email: jim.wilson/at/hgu.mrc.ac.ukReceived June 29, 2008; Revised August 12, 2008; Accepted August 13, 2008. This document may be redistributed and reused, subject to certain conditions. This article has been corrected. See Am J Hum Genet. 2008 November 17; 83(5): 658. This article has been cited by other articles in PMC.Abstract Estimating individual genome-wide autozygosity is important both in the identification of recessive disease variants via homozygosity mapping and in the investigation of the effects of genome-wide homozygosity on traits of biomedical importance. Approaches have tended to involve either single-point estimates or rather complex multipoint methods of inferring individual autozygosity, all on the basis of limited marker data. Now, with the availability of high-density genome scans, a multipoint, observational method of estimating individual autozygosity is possible. Using data from a 300,000 SNP panel in 2618 individuals from two isolated and two more-cosmopolitan populations of European origin, we explore the potential of estimating individual autozygosity from data on runs of homozygosity (ROHs). Termed Froh, this is defined as the proportion of the autosomal genome in runs of homozygosity above a specified length. Mean Froh distinguishes clearly between subpopulations classified in terms of grandparental endogamy and population size. With the use of good pedigree data for one of the populations (Orkney), Froh was found to correlate strongly with the inbreeding coefficient estimated from pedigrees (r = 0.86). Using pedigrees to identify individuals with no shared maternal and paternal ancestors in five, and probably at least ten, generations, we show that ROHs measuring up to 4 Mb are common in demonstrably outbred individuals. Given the stochastic variation in ROH number, length, and location and the fact that ROHs are important whether ancient or recent in origin, approaches such as this will provide a more useful description of genomic autozygosity than has hitherto been possible. Introduction In plant and animal genetics, the detrimental effects of parental relatedness on fitness have long been recognized.1 The mechanism of these effects is thought to be increased levels of homozygosity for deleterious recessive alleles, although overdominance might also play a role.2 In human populations in which consanguinity is customary or population size and isolation result in elevated levels of background parental relatedness, evidence has been reported of several effects, including an increased risk of monogenic disorders,3–5an increased risk of complex diseases involving recessive variants with intermediate or large effect sizes,6–9 and genome-wide effects on disease traits such as blood pressure10–17and LDL cholesterol.15 These are consistent with a causal role for many recessive variants with individually small effects scattered throughout the genome. Central to any investigation of the effects of parental relatedness on the health of offspring is the need for a reliable and accurate method of quantifying this phenomenon at an individual level. The first method proposed was the inbreeding coefficient, F, defined as the probability of inheriting two identical-by-descent (IBD) alleles at an autosomal locus or, equivalently, the average proportion of the autosomal genome that is inherited IBD.18 This is estimated with Wright's path method,19 which calculates an individual's probability of inheriting two IBD alleles, given a specified pedigree and given that an allele present in a parent is transmitted to a specified offspring with a probability of 0.5. Before the availability of marker data from high-density genome scans, researchers had no option but to use this approach, despite the fact that, even where pedigrees are known and accurate, it has two major disadvantages.20 First, meiosis is a highly random process. Whereas on average, half of the DNA making up a gamete is maternally derived and half is paternally derived, there is a high degree of stochastic variance about this average.21,22 As a consequence, grandchildren vary in the proportion of DNA they inherit from each of their four grandparents, and although the mean F coefficient of the offspring of first cousins is 0.0625, the standard deviation is 0.0243.20 This variance increases with each meiosis (i.e., each degree of cousinship), so it is perfectly possible for the offspring of third cousins to be more autozygous (homozygous by descent) than the offspring of second cousins. Because the F coefficient (denoted here as Fped to distinguish it from genomic estimates of autozygosity) is derived on the basis of this expectation, it is, therefore, only a very approximate estimate of individual genome-wide autozygosity. Second, Fped estimates the proportion of an individual's genome that is IBD, relative to that of a poorly characterized founder generation. This generation is usually fairly recent, and, moreover, the founders are presumed to be unrelated, when in fact, members of historical populations were often related several times over through multiple lines of descent. As a result, this approach fails to capture the effects of distant parental relationships and, therefore, underestimates autozygosity, particularly in small, isolated populations or in populations with a long tradition of consanguineous marriage.23,24 With the increasing availability of high-density genome-scan data, interest has grown in exploring whether a more reliable and accurate estimate of autozygosity might be derived on the basis of genomic marker data. Much of the impetus for this comes from those searching for specific disease genes via homozygosity mapping, rather than from a general interest in the health effects of parental relatedness. Since the 1980s, many autosomal-recessive genes underlying monogenic human diseases have been identified with homozygosity mapping, which exploits the fact that regions flanking the disease gene will be identical by descent (IBD) in people with the disease whose parents are related to each other.25 Botstein and Risch identified nearly 200 studies, published between 1995 and 2003, that used homozygosity mapping in consanguineous families to identify rare recessive disease genes.26 Homozygosity mapping requires an estimate of the proportion of the genome that is autozygous for each affected individual, on the basis of which a LOD score for linkage to a specified locus is computed. Accurate estimation of autozygosity is crucial: underestimation results in an inflated LOD score and, thus, false evidence for linkage,27,28 and overestimation results in false negatives. Quantification of individual autozygosity is also of interest to those investigating recessive effects in complex-disease genetics. Several studies in consanguineous or small, isolated populations with above average levels of parental relatedness have found evidence for a genome-wide effect of homozygosity on coronary heart disease,29–31 cancer,29,32–34 blood pressure,10–17 and LDL cholesterol.15 These findings are consistent with studies suggesting that the variants associated with increased risk of common complex disease are more likely to be rare than to be common in the population;35,36 are more likely to be distributed abundantly rather than sparsely across the genome,37 and are more likely to be recessive than to be dominant.38 Further empirical development of this idea has, however, been hampered by the inadequacy of available measures of autozygosity. Here, we describe a multipoint, observational approach to estimating autozygosity from genomic data that exploits the fact that autozygous genotypes are not evenly distributed throughout the genome but are distributed in runs or tracts (Figure 1
This paper has three objectives. First, it uses various measures derived from ROHs to compare four European populations: two isolated island populations and two more-cosmopolitan populations. The key study population is the Scottish isolate of Orkney, a remote archipelago off the north coast of Scotland. Three additional populations are used for comparison: a representative Scottish comparison population,40 an isolate population from a Dalmatian island in Croatia,15 and the HapMap CEU (northwest-European-derived population from Utah, USA) founders from the Centre d'Étude du Polymorphisme Humain (CEPH).41 Second, with the use of high-quality pedigree information available for the Orkney population, correlations are reported between Fped and a genome-wide autozygosity measure derived from ROHs (Froh). Finally, this study assesses the utility of Froh as a measure of autozygosity. Subjects and Methods Study Populations The Orkney Complex Disease Study (ORCADES) is an ongoing, family-based, cross-sectional study that seeks to identify genetic factors influencing cardiovascular and other disease risk in the population isolate of the Orkney Isles in northern Scotland. The North Isles of Orkney, the focus of this study, consist of a subgroup of ten inhabited islands with census populations varying from ~30 to ~600 people on each island. Although transport links have steadily improved between the North Isles and the rest of Orkney, the geographical position of these islands, coupled with weather and sea conditions, means that even today they are isolated and that they would have been considerably more so in the past. Although consanguinity is not the cultural norm in Orkney—indeed, there is evidence of consanguinity avoidance during the twentieth century42—two key factors make the North Isles population ideal for this type of study. First, the North Isles have experienced a period of severe population decline over the last 150 years, fueled by high emigration and low fertility. The population fell from an estimated peak of 7700 in the 1860s to 2217 by 2001. Second, endogamous marriage was widespread during the nineteenth century and into the twentieth centuries.43 Therefore, despite consanguinity avoidance, the combined effects of steep population decline and endogamy have led to inflated levels of parental relatedness in the current population. ORCADES received ethical approval from the appropriate research ethics committees in 2004. Data collection was carried out in Orkney between 2005 and 2007. Informed consent and blood samples were provided by 1019 Orcadian volunteers who had at least one grandparent from the North Isles of Orkney. A Scottish comparison population was derived from the controls of the Scottish Colon Cancer Study (SOCCS).40 This consists of 984 subjects, not known to have colon cancer, matched by residential postal area and age to a series of incident cases of colorectal cancer. Subjects were resident throughout Scotland, with dates of birth ranging from 1921 to 1983. The Dalmatian sample consists of 849 Croatian individuals, aged 18–93, sampled from the population of one island.15 Both the SOCCS and the Croatian projects were approved by the relevant ethics committees. The CEU sample consists of 60 unrelated individuals from Utah, USA, of northwest-European ancestry, collected by the CEPH in 1980.41 Genotyping Genotyping procedures for the Scottish,40 Dalmatian,44 and CEU45 samples are described elsewhere. All were genotyped on the Illumina Infinium HumanHap300v2 platform (Illumina, San Diego, CA, USA). After extraction of genomic DNA from whole blood with the use of Nucleon kits (Tepnel, Manchester, UK), 758 Orcadian samples were genotyped, according to the manufacturer's instructions, on the Illumina Infinium HumanHap300v2 platform. Analysis of the raw data was done via BeadStudio software, with the recommended parameters for the Infinium assay, with the use of the genotype-cluster files provided by Illumina. Individuals with less than 95% call rate were removed, as were SNPs with more than 10% missing genotypes. SNPs failing Hardy-Weinberg equilibrium at a threshold of 0.0001 were removed. IBD sharing between all first- and second-degree relative pairs was assessed with the Genome program in PLINK,46 and individuals falling outside expected ranges were removed from the study. Sex checking was performed with PLINK, and individuals with discordant pedigree and genomic data were removed. On completion of data-cleaning and quality-control procedures, 725 individuals and 316,364 autosomal SNPs remained. The male-to-female ratio of study participants is 0.86. The mean year of birth is 1952, varying from 1909 to 1988. A consensus SNP panel was then created, with use of only those markers that satisfied these quality control criteria in all four populations, leaving a final sample of 289,738 autosomal SNPs and 2618 individuals (60 from CEU, 725 from Orkney, 849 from the Dalmatian island, and 984 from Scotland). Fped Estimates The pedigrees of all individuals in the ORCADES sample were traced back for as many generations as possible in all ancestral lineages, with the use of official birth, marriage, death, and census records held by the General Register Office for Scotland in Edinburgh. Fped was calculated for each individual via Wright's path method.19 Limited pedigree information is available for the Dalmatian-isolate data set, and this is too incomplete for an estimate of Fped. It was, however, possible to analyze these data with the use of grandparental-endogamy levels. No pedigree information is available for the Scotland data set; however, we analyzed data according to the rurality of subjects' residential address47 in order to determine whether there is any evidence for an association between remote rurality and autozygosity in Scotland. Runs of Homozygosity ROHs were identified via the Runs of Homozygosity program implemented in PLINK version 1.0.46 This slides a moving window of 5000 kb (minimum 50 SNPs) across the genome to detect long contiguous runs of homozygous genotypes. An occasional genotyping error or missing genotype occurring in an otherwise-unbroken homozygous segment could result in the underestimation of ROHs. To address this, the program allows one heterozygous and five missing calls per window. A threshold was set for the minimum length (kb) needed for a tract to qualify as homozygous. Because strong linkage disequilibrium (LD), typically extending up to about 100 kb, is common throughout the genome,48–51 short tracts of homozygosity are very prevalent. For exclusion of these short and very common ROHs that occur in all individuals in all populations, the minimum length for an ROH was set at 500 kb. All empirical studies have identified a few very long stretches of LD, measuring up to several hundred kb in length,49 which could result in the occurrence of longer ROHs in outbred individuals. Such ROHs will not be excluded by this methodology; however, the purpose here is not to identify only those ROHs that result from parental relatedness but to identify all ROHs and then relate these to pedigree and population data for an assessment of the extent to which these result from parental relatedness and population isolation. We set a threshold for the minimum number of SNPs constituting a ROH in order to ensure that these are true ROHs—i.e., that between the first SNP and the last SNP the entire unobserved stretch of the chromosome is homozygous. With, for example, only three consecutive homozygous genotypes, there would be a very high probability that these three could be homozygous by chance alone and that the intervening, unobserved chromosomal stretches could be heterozygous. We have deliberately not taken LD into account here. By using a minimum-length cutoff of 500 kb, most shorter ROHs resulting from LD will be eliminated; however, some longer stretches will remain. This is intentional: we are interested in identifying and quantifying these common ROHs, whatever their origin. We used allele frequencies for a random sample of chromosomal segments across the entire autosomes to estimate the mean probability of finding 10, 25, and 50 consecutive homozygous SNPs by chance alone in each population. On this basis, the minimum number of contiguous homozygous SNPs constituting a ROH was set at 25 (p < 0.0001 in each of the four populations). Two additional parameters were added for ensuring that estimates of F were not artificially inflated by apparently homozygous tracts in sparsely covered genomic regions: tracts with a mean tract density > 50 kb/SNP were excluded, and the maximum gap between two consecutive homozygous SNPs was set at 100 kb. For exclusion of the possibility that apparent ROHs are in fact regions of hemizygous deletion, an analysis of deletions was carried out in the Orkney data set. An Objective Bayes' Hidden Markov model, as employed in QuantiSNP v. 1.0, was used for identification of heterozygous deletions with a sliding window of 2 Mb over the genome and 25 iterations. All of the samples were corrected for genomic GC content prior to copy-number inference as a means of ensuring that the variation of the observed log2 R ratio is not attributed to the region-specific GC content.52 We included in the downstream analysis all heterozygous deletions with an estimated Bayes' factor ≥ 10 to ensure a low false-negative rate, as reported in Colella et al., 2007.53 A custom Perl script was developed for comparison of the identified heterozygous deletions and ROHs. All deletions overlapping with ROHs were identified. When deletions covered the entire length of the ROH or when less than 0.5 Mb of the tract remained after the deletion was taken account of, the ROH was removed from the analysis. Because the Dalmatian, CEU, and Scotland data sets were uncorrected for deletions, uncorrected Orkney data are shown when there are population comparisons. Analyses using only the Orkney data set use data corrected for deletions. Froh Estimates A genomic measure of individual autozygosity (Froh) was derived, defined as the proportion of the autosomal genome in runs of homozygosity above a specified length threshold: Statistical Analysis For statistical analyses, the Orkney population was split into endogamous Orcadians, defined as those with at least three grandparents born in Orkney, on the same island, typically ~10 km2 in size and with a population of 50–500 (n = 390); mixed Orcadians, defined as those with at least three grandparents born in Orkney but on different islands in the archipelago—i.e., from an area over 500 km2 with a population of ~20,000 (n = 286); and half Orcadians, defined as those with one pair of Orcadian-born and one pair of Scottish-mainland-born grandparents (n = 49). Although pedigree information is not available for an assessment of whether the parents of half-Orcadian subjects are related beyond five generations in the past, it is reasonable to assume that they are likely to be unrelated for at least 10–12 generations. It is known that there was major Scottish immigration to Orkney in the 15th and 16th centuries, before10–12 generations ago. Although Scottish immigration has certainly occurred sporadically since then, rates have been low. An analysis of the area of origin of the Scottish parents of our half-Orcadian subjects shows that they came from all over Scotland: we found no evidence for strong Orcadian connections with any specific Scottish settlement, which might increase the chances of parental relatedness in this group. Furthermore, the surnames of the ancestors of the Orcadian parents of this group were markedly different from those of the ancestors of the non-Orcadian Scottish parents. The Dalmatian population was split into endogamous Dalmatians, defined as those with all four grandparents born in the same village—i.e., from a 1 km2 area, with a population of < 2000 (n = 431); mixed Dalmatian, defined as those with all four grandparents born on the same island but not in the same village—i.e., from a 90 km2 area with a population of 3600 (n = 221); and Croatian, defined as residents of the island with grandparents born elsewhere in Croatia (n = 197). The CEU and Scottish populations were not subdivided. All calculations were performed with SPSS and Excel software. The proportions of each subpopulation with ROHs measuring less than 1, 1.5, and 2 Mb were calculated. All subjects in all subpopulations had ROHs shorter than 1.5 Mb. Subpopulations start to become differentiated from each other for ROHs > 1.5 Mb, with the effects of endogamy on ROHs starting to emerge above this threshold. Unless otherwise specified, all analyses exploring the effects of endogamy and parental relatedness on ROHs therefore define a ROH as measuring ≥ 1.5 Mb. Subpopulation means were calculated for the total length of ROHs per individual. The number of ROHs was plotted against the total length of ROHs, per individual, for each subpopulation. The correlation between Fped and Froh was calculated with the use of a subset of 249 individuals, from the Orkney sample, who satisfied the condition of having at least two grandparents on the same side of the family born in Orkney and no grandparents born outside of Scotland and who were either the offspring of consanguineous parents (parents related as 2nd cousins or closer) or those for whom it was possible to establish pedigrees for at least six generations in all Orcadian ancestral lineages or five generations in non-Orcadian ancestral lineages. Correlations were also calculated between Froh, Fped, and two other measures: multilocus heterozgyosity (MLH), which is defined as the proportion of markers that are heterozygous,54 and the measure of autozygosity implemented in PLINK, termed here Fplink, which estimates autozygosity from genotype frequencies, giving more weight to rare alleles.46 Prevalence and Genomic Location of ROHs in Different Subpopulations Next, we explored the hypothesis that ROHs in outbred individuals tend to cluster in the same genomic locations, whereas those present in the offspring of related parents tend to be more randomly distributed across the autosomes. We compared the location of ROHs in three groups: the half-Orcadian group, consisting of all half Orcadians with at least one ROH measuring ≥ 1.5 Mb (n = 46); an offspring-of-cousins group, which was constructed by consideration of all individuals from the Orkney sample with parents related as 3rd cousins or closer and the selection of those 20 with the greatest total length of ROHs; and a control population derived from our cross-sectional sample from Scotland. Because some individuals in the Scottish sample have long ROHs that could be indicative of parental relatedness, we restricted the control sample to those with no more than eight ROHs, totaling no more than 17 Mb: the maximum values in the half-Orcadian group, the members of which are known to be the offspring of unrelated parents. There were 943 individuals in the control group. ROHs measuring at least 1.5 Mb in all three groups were compared. Control-group ROHs overlapping by at least 0.5 Mb with ROHs in either Orcadian group were counted. The number of control overlaps per ROH (and per Mb of ROH) in the half-Orcadian group was compared with that in the offspring-of-cousins group. We then investigated whether ROHs in half Orcadians occurred in regions of lower-than-average recombination. Based on sex-averaged mean recombination rates per Mb, derived from the deCODE genetic map, we used the UCSC Genome Browser (March 2006)55 to calculate the mean recombination rate of all complete Mb of ROH in our half-Orcadian sample. Results Copy-Number Variation We detected 224 deletions that overlapped with ROHs (median length of deletion 995 kb). Overlapping deletions were detected in 57 individuals (7.6% of sample). After removal of these overlaps from the sample and removal of the entire affected ROH if less than 0.5 Mb remained, ROH statistics were recalculated. There was no significant difference between results before and after correction for deletion for the mean total length of ROHs (correcting for deletions reduced this by less than 0.3% in the sample as a whole) or the mean number of ROHs (reduced by 0.02%). Furthermore, no significant differences were found when data were analyzed by subpopulation and when different length parameters were used for defining ROHs. This provides strong evidence that the ROHs identified are true homozygous tracts and not hemizgyous deletions. Urban versus Rural Analysis of Scottish Sample No difference was found in the mean total length of ROHs between those living in rural areas and those in urban areas of Scotland, regardless of whether the analysis used a dichotomous classification or a more-detailed, eight-category classification, from large urban to remote rural (data not shown). Data were also analyzed for a subset (n = 426) of the sample with information on grandparental country of birth. On average, those with four Scottish-born grandparents (n = 254) had a slightly greater sum of ROHs than did those with at least one grandparent born outside of Scotland, but differences were not significant (data not shown). The Scottish sample was, therefore, not split into subpopulations for further analyses. Effect of Stochastic Variation on Individual Autozygosity On average, the difference in the total length of ROHs between full sibling pairs was 10.3 Mb. However, the distribution is skewed, with half of all individuals having less than 5 Mb difference yet some 7% differing by more than 30 Mb. The greatest difference between sibling pairs was 91 Mb, or 3.4% of the autosomes (paternity was confirmed from patterns of genomic sharing in all cases). Effects of Population Isolation and Endogamy on Length and Number of ROHs The proportions of subpopulations with ROHs of a given length are shown in Figure 2
For ROHs above 1.5 Mb, three distinct groupings, which are clearly related to endogamy and isolation, emerge: a greater proportion of the endogamous Dalmatian and Orcadian samples than of the other samples have long ROHs (28% have ROHs > 10 Mb); only a small proportion of the CEU, Scottish, and half-Orcadian samples have long ROHs (0.5% > 10 Mb), and the proportion of Croatian and mixed Dalmatian and Orcadian samples with long ROHs falls in between (10% > 10 Mb). Forty-nine individuals had no ROHs longer than 1.5 Mb. This number included at least one individual from each subpopulation, although they were predominantly half-Orcadian, Scottish, and CEU samples. The shortest sum of ROHs across all of the samples was found in a Scottish individual, who had ROHs longer than 0.5 Mb covering only 1.5% of the autosomes (39 Mb). This compares with a mean of 3.5% across all of the populations (93 Mb). The number of ROHs longer than 1.5 Mb per individual, plotted against the total length of those ROHs, is shown for each group in Figure 3
The effect of different degrees of parental relatedness on the sum and number of ROHs is shown in Figure 4
Comparison of Fped and Froh A subset of 249 Orcadian individuals with complete and reliable pedigree data were used to compare Fped and Froh. The mean (standard error) Fped of the sample is 0.0038 (0.0005), approximately equivalent to a parental relationship of third cousins. Mean Fped values for Orcadian subpopulations are shown in Table 1. These vary from 0.02, for the offspring of 1st or 2nd cousins, to 0.0002 (equivalent to a parental relationship of 5th cousins) in the mixed Orcadian group. Mean Fped values are compared with mean Froh values for a range of minimum-length thresholds. The mean value of Froh 5 (i.e., with a minimum-length threshold of 5 Mb) is closest to that of Fped, whereas Froh 0.5 (i.e., with a minimum-length threshold of 0.5 Mb) is an order of magnitude higher. This suggests that a shared maternal and paternal ancestor in the preceding six generations results predominantly in ROHs longer than 5 Mb. It is clear from the half-Orcadian group, whose parents do not share a common ancestor for at least six generations and probably at least 10–12 generations, that ROHs measuring less than 3 or 4 Mb are not uncommon in the absence of parental relatedness. On average, these individuals have over 3% (84 Mb) of their autosomes in ROHs over 0.5 Mb long and 0.2% (almost 6 Mb) in ROHs longer than 1.5 Mb.
Correlation between Froh, Fped, Fplink, and MLH We used the total sample to examine correlations between different genetic estimates of autozygosity or homozygosity. Because MLH is in fact a measure of heterozygosity, we have used 1 − MLH in our calculations. Allele frequencies for Fplink were estimated by naive counting in all individuals, as implemented in PLINK. Fplink and 1 − MLH are highly correlated (r = 0.94). Froh 1.5 is more highly correlated with 1 − MLH (r = 0.80) than with Fplink (r = 0.74). We used a subset of the Orcadian sample (n = 249) to estimate correlations with Fped. Froh 1.5 was most highly correlated with Fped (r = 0.86; 95% confidence interval 0.83–0.89). Correlations between Fped and Froh 1.5 were significantly higher than both the correlation between Fplink and Fped (r = 0.77; 0.72–0.82) and that between 1 − MLH and Fped (r = 0.76; 0.71–0.82). Froh 1.5 was slightly, but not significantly, more strongly correlated with Fped than was either Froh 0.5 or Froh 5. Correlations between Fped and Froh 0.5, Froh1.5, and Froh 5 are shown in Figure 5
Mean Froh by Subpopulation Mean Froh and the mean total length of ROHs for each subpopulation are shown for a range of minimum ROH lengths in Figure 6
Comparison of ROHs in the Offspring of Unrelated Parents and the Offspring of Cousins We next investigated whether ROHs found in half Orcadians are more common than those found in the offspring of related parents. We defined “common” as overlapping by at least 0.5 Mb with ROHs found in a subset of the Scottish sample. The number of ROHs measuring ≥ 1.5 Mb was 143 in the half-Orcadian sample, 3159 in the Scottish control sample, and 382 in the offspring-of-cousins sample. Results are summarized in Table 2. On average, each half-Orcadian ROH overlapped with more than twice as many controls as did ROHs in the offspring-of-cousins group. Only 12.6% of half-Orcadian ROHs, but almost a third of ROHs in the offspring-of-cousins group, did not overlap with any controls. We also looked at the mean number of overlaps per Mb of ROH in the two samples in order to correct for the fact that ROHs in the offspring-of-cousins group tend to be longer. There were more than three times as many control overlaps per Mb of ROH in the half-Orcadian group than there were in the offspring-of-cousins group. If we consider only those ROHs measuring > 5 Mb in the offspring-of-cousins sample (i.e., those that are most likely to result from recent shared parental ancestry), the mean number of overlaps per Mb was only1.4 (SD 2.0).
Data on chromosome 1 for ten individuals in the half-Orcadian group (shown in blue) and seven individuals in the offspring-of-cousins group (shown in red) are illustrated by way of example in Figure 7
Next, we identified all ROHs in the half-Orcadian group that overlapped by at least 0.5 Mb with common ROHs identified by Lencz.56 In a sample of 322 non-Hispanic European Americans, Lencz identified 339 ROHs present in at least ten subjects. Of the 143 half-Orcadian ROHs, 57% overlapped with Lencz et al.'s list. Only 7% (ten ROHs) overlapped with neither Lencz et al.'s list nor our control group. Finally, we investigated whether the ROHs in half Orcadians were found in areas of lower-than-average recombination. The mean recombination rate for the regions where half-Orcadian ROHs are located is 0.52 of the mean genome-wide recombination rate. For common ROHs (i.e., half-Orcadian ROHs that overlap with ROHs in the control group), this figure was 0.38 of the genome-wide mean. Discussion Our findings are consistent with a number of recent observational studies using high-density genome-scan data, which have suggested that ROHs longer than 1 Mb are more common in outbred individuals than previously thought.39,56–60 We have quantified this phenomenon by describing the number and length of ROHs in individuals who are known to have no common maternal and paternal ancestor in at least five generations (and probably 10–12 generations). Our analysis of copy-number variation in the Orkney sample is consistent with studies that have shown that observed ROHs are true homozygous tracts and not deletions or other chromosomal abnormalities.39,45,57,60 Heterozygous deletions are not easily differentiated from ROHs, because the employed algorithm uses the B allele frequency as one of its input parameters to infer CNV status. Therefore, homozygosity at consecutive SNPs increases the posterior probability of being called a heterozygous deletion. In other words, this is a very robust estimation of the prevalence of ROHs in the Orkney sample, which to some extent overcorrects for heterozygous deletions. Other studies have suggested that ROHs cluster in regions of the genome where recombination rates are low,57–60 and our data support this. The picture of genome-wide homozygosity now emerging is that short stretches, measuring tens of kb and indicative of ancient LD patterns, are common, covering up to one third of the genome.45 At the other end of the spectrum, very long ROHs, measuring tens of Mb, are the signature of parental relatedness. In between, ROHs might result from recent parental relatedness or might be autozygous segments of much older pedigree that have occurred because of the chance inheritance through both parents of extended haplotypes that are at a high frequency in the general population, possibly because they convey or conveyed some selective advantage.56 The Phase II HapMap study estimates that ROHs measuring in excess of around 100 kb constitute 13%–14% of the genome in Europeans.45 Lencz et al.56 give a similar estimate. The findings of our study are not directly comparable, given that we have not examined ROHs shorter than 500 kb; however, we have shown (Figure 2 We have demonstrated clearly that data on ROHs measuring more than 1.5 Mb accurately reflect differences in population isolation, as measured by grandparental endogamy (Figures 2, 3, and 6 We found that Froh is strongly correlated with Fped, significantly more so than the other two measures investigated. Perfect correlation is not expected, largely because of the deficiencies of Fped. This is particularly the case in isolated populations, where multiple distant parental relationships, undetectable with only a few generations of pedigree information, inflate autozygosity, such that the offspring of distant cousins can be almost as autozygous as the offspring of first cousins.24 The individual with the second highest Froh in the Orkney sample, for example, is the offspring of a couple whose closest relationship is that of 3rd cousins but who are multiply related at least 24 different ways in the last eight generations alone. We illustrate the deficiencies of Fped in Figure 5 A key objective of this research was to explore whether ROHs could be used for derivation of a measure of individual autozygosity. Before the advent of dense genome scans, the approach to estimating autozygosity from genetic-marker data was invariably inferential. We propose a very different, observational approach. Termed Froh, this is defined as the proportion of the autosomal genome in ROHs above a specified length threshold. Our purpose here is not to develop a fully fledged statistical methodology tested against the alternatives—further work is needed to refine the methodology, particularly in relation to the most appropriate length threshold for defining ROHs—but, rather, to outline a broad approach and highlight issues for future consideration. Equally, a detailed evaluation of alternative methods is beyond the scope of this paper; however, we have made some preliminary comparisons with two of the measures, Fplink and multilocal heterozygosity (MLH). Both correlate strongly with Froh. Whereas 1 − MLH is a measure of genome-wide homozygosity54 with no attempt to distinguish loci that are homozygous because of IBD and loci that are homozygous by chance, Fplink46 uses expected genome heterozygosity to control for homozygosity by chance. Carothers et al.20 have proposed another measure of autozygosity, which uses locus-specific heterozygosity to give more weight to polymorphic loci that are homozygous. Unlike our approach, all three methods are single-point approaches and do not exploit the nature of autozygosity that comes in runs or tracts. Another drawback of Fplink and the method proposed by Carothers et al. is that they require estimation of population allele frequencies, a nontrivial problem in many populations.62 Leutenegger et al.22 have also proposed a multipoint approach to autozygosity inference. Their method uses a hidden Markov model that requires that markers are in linkage equilibrium. Hence, it is computationally more complex to deal with extremely dense SNP maps, because LD needs to be taken into account or a subset of SNPs in low LD needs to be selected. Both of these are subject to ongoing research. The method is, on the other hand, very well suited for dense microsatellite maps or mixed microsatellite-SNP maps.28 Froh differs from all these approaches in that it is based on the assumption that ROHs are a signature of autozygosity (Figure 1 Consistent with the findings of other studies,56,59 we have shown that these shorter ROHs are almost invariably common but not universal in the population, occurring in both a Scottish control group (Figure 7 When the research aim is to use homozygosity mapping to identify the variants causing rare recessive diseases, Froh can be modified in order to reflect only the effects of recent parental relatedness. Our analysis of the genomic location of ROHs shows that many of the most common ROHs are equally present in the offspring of both related and unrelated parents (see Table 2 and Figure 7 We have shown here that ROHs measuring 1.5 Mb and longer can be used to distinguish between populations with different histories of isolation. ROHs also distinguish effectively between individuals with different degrees of parental relatedness in their ancestry. This approach is simple, observational, and based on sound theoretical justification. Although our study is based on Illumina data, this method is generally applicable, and we see no reason why it could not be used with data generated on other platforms. With some refinement, Froh has potential as a measure of individual genome-wide autozygosity for comparison to phenotype. The essential challenge in any attempt to estimate individual autozygosity from genomic data is to set a limit distinguishing autozygous from merely homozygous genotypes. Single-point methodologies based on estimation of population allele frequencies implicitly use time as a limit but face the serious drawback of requiring allele-frequency data for a founder or reference population. Our multipoint approach, which exploits the potential of ROHs as a measure of autozygosity, uses ROH length as a limit. Here, we have described how Froh is affected by the length threshold used and by the inclusion of common ROHs. The next challenge is to establish the optimum-length threshold and determine to what extent Froh should be modified with reference to the prevalence of common ROHs. These issues are the subject of ongoing research, involving the simulation of high-density genotype data by gene dropping fully phased Hap300 data down representative pedigrees. Work is also in progress to apply this approach to data sets from highly consanguineous populations and, in particular, to investigate whether the Froh length cutoff used here is universally applicable. Common, shorter ROHs also merit further investigation as a risk factor in common complex disease and will have utility in narrowing down genomic regions in the search for functional genetic variants.56 The availability of denser genome-wide scans with 1 million or more SNPs will facilitate more reliable identification and enumeration of shorter ROHs, and the use of these large data sets in different populations will improve understanding of the frequency of common ROHs and how these differ among populations. Web Resources The URLs for data presented herein are as follows:
Acknowledgments We thank the people of Orkney; Lorraine Anderson and the research nurse team in Orkney; Rosa Bisset, Kay Lindsay, Gail Crosbie, and the administrative team at Public Health Sciences, University of Edinburgh; Ruby McMenemy, Sam Harcus, George Gray, Orkney Library and Archive, and the Orkney Family History Society for help with reconstructing pedigrees; Graeme Grimes, Colin Semple, Sarfraz Mohammed, Dave du Feu, Craig Nicol, and Lesley McGoohan for IT support; and Evi Theodoratu for advice relating to the Scottish data set. ORCADES is supported by the Chief Scientist Office of the Scottish Executive, the MRC Human Genetics Unit, the Royal Society, the Edinburgh Wellcome Trust Clinical Research Facility, and the European Union Framework Program 6. Ruth McQuillan is supported by a University of Edinburgh College of Medicine and Veterinary Medicine Ph.D. studentship. Rehab Abdel-Rahman is supported by a fund from the Supreme Council of Egyptian Universities. Igor Rudan is supported by grant no. 108-1080315-0302, Branka Janicijevic by grant no. 196-1962766-2763, Nina Smolej-Narancic by grant no. 196-1962766-2747, and Pavao Rudan by grant no. 196-1962766-2751 of the Croatian Ministry of Science, Education and Sport. References 1. Keller L., Waller D. Inbreeding effects in wild populations. Trends Ecol. Evol. 2002;17:230–241. 2. Charlesworth B., Charlesworth D. The genetic basis of inbreeding depression. Genet. Res. 1999;74:329–340. [PubMed] 3. Bittles A.H. Consanguineous marriage and childhood health. Dev. Med. Child Neurol. 2003;45:571–576. [PubMed] 4. Khlat M., Khoury M. Inbreeding and diseases: demographic, genetic, and epidemiologic perspectives. Epidemiol. Rev. 1991;13:28–41. [PubMed] 5. Modell B., Darr A. Science and society: genetic counselling and customary consanguineous marriage. Nat. Rev. Genet. 2002;3:225–229. [PubMed] 6. Bonifati V., Rizzu P., van Baren M.J., Schaap O., Breedveld G.J., Krieger E., Dekker M.C., Squitieri F., Ibanez P., Joosse M. Mutations in the DJ-1 gene associated with autosomal recessive early-onset parkinsonism. Science. 2003;299:256–259. [PubMed] 7. van Duijn C.M., Dekker M.C., Bonifati V., Galjaard R.J., Houwing-Duistermaat J.J., Snijders P.J., Testers L., Breedveld G.J., Horstink M., Sandkuijl L.A. Park7, a novel locus for autosomal recessive early-onset parkinsonism, on chromosome 1p36. Am. J. Hum. Genet. 2001;69:629–634. [PubMed] 8. Ewald H., Wikman F.P., Teruel B.M., Buttenschon H.N., Torralba M., Als T.D., El Daoud A., Flint T.J., Jorgensen T.H., Blanco L. A genome-wide search for risk genes using homozygosity mapping and microarrays with 1,494 single-nucleotide polymorphisms in 22 eastern Cuban families with bipolar disorder. Am. J. Med. Genet. B. Neuropsychiatr. Genet. 2005;133:25–30. [PubMed] 9. Mani A., Meraji S.M., Houshyar R., Radhakrishnan J., Mani A., Ahangar M., Rezaie T.M., Taghavinejad M.A., Broumand B., Zhao H. Finding genetic contributions to sporadic disease: a recessive locus at 12q24 commonly contributes to patent ductus arteriosus. Proc. Natl. Acad. Sci. USA. 2002;99:15054–15059. [PubMed] 10. Rudan I., Smolej-Narancic N., Campbell H., Carothers A., Wright A., Janicijevic B., Rudan P. Inbreeding and the genetic complexity of human hypertension. Genetics. 2003;163:1011–1021. [PubMed] 11. Krieger H. Inbreeding effects on metrical traits in Northeastern Brazil. Am. J. Hum. Genet. 1969;21:537–546. [PubMed] 12. Martin A.O., Kurczynski T.W., Steinberg A.G. Familial studies of medical and anthropometric variables in a human isolate. Am. J. Hum. Genet. 1973;25:581–593. [PubMed] 13. Hurwich B.J., Nubani N. Blood pressures in a highly inbred community–Abu Ghosh, Israel. 1. Original survey. Isr. J. Med. Sci. 1978;14:962–969. [PubMed] 14. Halberstein R.A. Blood pressure in the Caribbean. Hum. Biol. 1999;71:659–684. [PubMed] 15. Campbell H., Carothers A.D., Rudan I., Hayward C., Biloglav Z., Barac L., Pericic M., Janicijevic B., Smolej-Narancic N., Polasek O. Effects of genome-wide heterozygosity on a range of biomedically relevant human quantitative traits. Hum. Mol. Genet. 2007;16:233–241. [PubMed] 16. Saleh E.A., Mahfouz A.A., Tayel K.Y., Naguib M.K., Bin-al-Shaikh N.M. Hypertension and its determinants among primary-school children in Kuwait: an epidemiological study. East. Mediterr. Health J. 2000;6:333–337. [PubMed] 17. Badaruddoza Inbreeding effects on metrical phenotypes among north Indian children. Collegium Antropologicum. 2004;28(Suppl(2)):311–319. [PubMed] 18. Hartl D., Clark A.G. Sinauer Associates; Sunderland, MA: 1997. Principles of Population Genetics. 19. Wright S. Coefficients of inbreeding and relationship. Am. Nat. 1922;56:330–338. 20. Carothers A.D., Rudan I., Kolcic I., Polasek O., Hayward C., Wright A.F., Campbell H., Teague P., Hastie N.D., Weber J.L. Estimating human inbreeding coefficients: comparison of genealogical and marker heterozygosity approaches. Ann. Hum. Genet. 2006;70:666–676. [PubMed] 21. Stam P. The distribution of the fraction of the genome identical by descent in finite random mating populations. Genet. Res. 1980;35:131–155. 22. Leutenegger A.L., Prum B., Genin E., Verny C., Lemainque A., Clerget-Darpoux F., Thompson E.A. Estimation of the inbreeding coefficient through use of genomic data. Am. J. Hum. Genet. 2003;73:516–523. [PubMed] 23. Woods C.G., Valente E.M., Bond J., Roberts E. A new method for autozygosity mapping using single nucleotide polymorphisms (SNPs) and EXCLUDEAR. J. Med. Genet. 2004;41:e101. [PubMed] 24. Liu F., Elefante S., van Duijn C.M., Aulchenko Y.S. Ignoring distant genealogic loops leads to false-positives in homozygosity mapping. Ann. Hum. Genet. 2006;70:965–970. [PubMed] 25. Smith C. The detection of linkage in human genetics. J. Royal Stat. Soc. B. 1953;15:153–192. 26. Botstein D., Risch N. Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat. Genet. 2003;33(Suppl):228–237. [PubMed] 27. Miano M.G., Jacobson S.G., Carothers A., Hanson I., Teague P., Lovell J., Cideciyan A.V., Haider N., Stone E.M., Sheffield V.C. Pitfalls in homozygosity mapping. Am. J. Hum. Genet. 2000;67:1348–1351. [PubMed] 28. Leutenegger A.L., Labalme A., Genin E., Toutain A., Steichen E., Clerget-Darpoux F., Edery P. Using genomic inbreeding coefficient estimates for homozygosity mapping of rare recessive traits: application to Taybi-Linder syndrome. Am. J. Hum. Genet. 2006;79:62–66. [PubMed] 29. Shami S.A., Qaisar R., Bittles A.H. Consanguinity and adult morbidity in Pakistan. Lancet. 1991;338:954. [PubMed] 30. Puzyrev V.P., Lemza S.V., Nazarenko L.P., Panphilov V.I. Influence of genetic and demographic factors on etiology and pathogenesis of chronic disease in north Siberian aborigines. Arctic Med. Res. 1992;51:136–142. [PubMed] 31. Ismail J., Jafar T.H., Jafary F.H., White F., Faruqui A.M., Chaturvedi N. Risk factors for non-fatal myocardial infarction in young South Asian adults. Heart. 2004;90:259–263. [PubMed] 32. Simpson J.L., Martin A.O., Elias S., Sarto G.E., Dunn J.K. Cancers of the breast and female genital system: search for recessive genetic factors through analysis of human isolate. Am. J. Obstet. Gynecol. 1981;141:629–636. [PubMed] 33. Lebel R.R., Gallagher W.B. Wisconsin consanguinity studies. II: Familial adenocarcinomatosis. Am. J. Med. Genet. 1989;33:1–6. [PubMed] 34. Rudan I. Inbreeding and cancer incidence in human isolates. Hum. Biol. 1999;71:173–187. [PubMed] 35. Carlson C.S., Eberle M.A., Kruglyak L., Nickerson D.A. Mapping complex disease loci in whole-genome association studies. Nature. 2004;429:446–452. [PubMed] 36. Freimer N., Sabatti C. The use of pedigree, sib-pair and association studies of common diseases for genetic mapping and epidemiology. Nat. Genet. 2004;36:1045–1051. [PubMed] 37. Wright A., Charlesworth B., Rudan I., Carothers A., Campbell H. A polygenic basis for late-onset disease. Trends Genet. 2003;19:97–106. [PubMed] 38. Reich D.E., Lander E.S. On the allelic spectrum of human disease. Trends Genet. 2001;17:502–510. [PubMed] 39. Broman K.W., Weber J.L. Long homozygous chromosomal segments in reference families from the centre d'Etude du polymorphisme humain. Am. J. Hum. Genet. 1999;65:1493–1500. [PubMed] 40. Tenesa A., Farrington S.M., Prendergast J.G., Porteous M.E., Walker M., Haq N., Barnetson R.A., Theodoratou E., Cetnarskyj R., Cartwright N. Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21. Nat. Genet. 2008;40:631–637. [PubMed] 41. The International HapMap Project. Nature. 2003;426:789–796. [PubMed] 42. Brennan E.R., Relethford J.H. Temporal variation in the mating structure of Sanday, Orkney Islands. Ann. Hum. Biol. 1983;10:265–280. [PubMed] 43. Boyce A.J., Holdsworth V.M.L., Brothwell D. Demographic and genetic studies in the Orkney islands. In: Roberts D.F., Sunderland E., editors. Genetic Variation in Britain. Taylor and Francis; London: 1973. 44. Vitart V., Rudan I., Hayward C., Gray N.K., Floyd J., Palmer C.N., Knott S.A., Kolcic I., Polasek O., Graessler J. SLC2A9 is a newly identified urate transporter influencing serum urate concentration, urate excretion and gout. Nat. Genet. 2008;40:437–442. [PubMed] 45. Frazer K.A., Ballinger D.G., Cox D.R., Hinds D.A., Stuve L.L., Gibbs R.A., Belmont J.W., Boudreau A., Hardenbol P., Leal S.M. A second generation human haplotype map of over 3.1 million SNPs. Nature. 2007;449:851–861. [PubMed] 46. Purcell S., Neale B., Todd-Brown K., Thomas L., Ferreira M.A., Bender D., Maller J., Sklar P., de Bakker P.I., Daly M.J. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007;81:559–575. [PubMed] 47. Department, R.A., ed. (2004). Scottish Executive Urban Rural Classification 2003–2004, E.a. 48. Abecasis G.R., Ghosh D., Nichols T.E. Linkage disequilibrium: ancient history drives the new genetics. Hum. Hered. 2005;59:118–124. [PubMed] 49. Wall J.D., Pritchard J.K. Haplotype blocks and linkage disequilibrium in the human genome. Nat. Rev. Genet. 2003;4:587–597. [PubMed] 50. Abecasis G.R., Noguchi E., Heinzmann A., Traherne J.A., Bhattacharyya S., Leaves N.I., Anderson G.G., Zhang Y., Lench N.J., Carey A. Extent and distribution of linkage disequilibrium in three genomic regions. Am. J. Hum. Genet. 2001;68:191–197. [PubMed] 51. Reich D.E., Cargill M., Bolk S., Ireland J., Sabeti P.C., Richter D.J., Lavery T., Kouyoumjian R., Farhadian S.F., Ward R. Linkage disequilibrium in the human genome. Nature. 2001;411:199–204. [PubMed] 52. Marioni J.C., Thorne N.P., Valsesia A., Fitzgerald T., Redon R., Fiegler H., Andrews T.D., Stranger B.E., Lynch A.G., Dermitzakis E.T. Breaking the waves: improved detection of copy number variation from microarray-based comparative genomic hybridization. Genome Biol. 2007;8:R228. [PubMed] 53. Colella S., Yau C., Taylor J.M., Mirza G., Butler H., Clouston P., Bassett A.S., Seller A., Holmes C.C., Ragoussis J. QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data. Nucleic Acids Res. 2007;35:2013–2025. [PubMed] 54. Charpentier M., Setchell J.M., Prugnolle F., Knapp L.A., Wickings E.J., Peignot P., Hossaert-McKey M. Genetic diversity and reproductive success in mandrills (Mandrillus sphinx). Proc. Natl. Acad. Sci. USA. 2005;102:16723–16728. [PubMed] 55. Kent W.J., Sugnet C.W., Furey T.S., Roskin K.M., Pringle T.H., Zahler A.M., Haussler D. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. [PubMed] 56. Lencz T., Lambert C., DeRosse P., Burdick K.E., Morgan T.V., Kane J.M., Kucherlapati R., Malhotra A.K. Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia. Proc. Natl. Acad. Sci. USA. 2007;104:19942–19947. [PubMed] 57. Simon-Sanchez J., Scholz S., Fung H.C., Matarin M., Hernandez D., Gibbs J.R., Britton A., de Vrieze F.W., Peckham E., Gwinn-Hardy K. Genome-wide SNP assay reveals structural genomic variation, extended homozygosity and cell-line induced alterations in normal individuals. Hum. Mol. Genet. 2007;16:1–14. [PubMed] 58. Gibson J., Morton N.E., Collins A. Extended tracts of homozygosity in outbred human populations. Hum. Mol. Genet. 2006;15:789–795. [PubMed] 59. Curtis D., Vine A.E., Knight J. Study of regions of extended homozygosity provides a powerful method to explore haplotype structure of human populations. Ann. Hum. Genet. 2008;72:261–278. [PubMed] 60. Li L.H., Ho S.F., Chen C.H., Wei C.Y., Wong W.C., Li L.Y., Hung S.I., Chung W.H., Pan W.H., Lee M.T. Long contiguous stretches of homozygosity in the human genome. Hum. Mutat. 2006;27:1115–1121. [PubMed] 61. Vitart V., Carothers A.D., Hayward C., Teague P., Hastie N.D., Campbell H., Wright A.F. Increased level of linkage disequilibrium in rural compared with urban communities: a factor to consider in association-study design. Am. J. Hum. Genet. 2005;76:763–772. [PubMed] 62. Hoffman J.I., Boyd I.L., Amos W. Exploring the relationship between parental relatedness and male reproductive success in the Antarctic fur seal Arctocephalus gazella. Evolution Int. J. Org. Evolution. 2004;58:2087–2099. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Genet Res. 1999 Dec; 74(3):329-40.
[Genet Res. 1999]Dev Med Child Neurol. 2003 Aug; 45(8):571-6.
[Dev Med Child Neurol. 2003]Epidemiol Rev. 1991; 13():28-41.
[Epidemiol Rev. 1991]Nat Rev Genet. 2002 Mar; 3(3):225-9.
[Nat Rev Genet. 2002]Science. 2003 Jan 10; 299(5604):256-9.
[Science. 2003]Am J Hum Genet. 2001 Sep; 69(3):629-34.
[Am J Hum Genet. 2001]Ann Hum Genet. 2006 Sep; 70(Pt 5):666-76.
[Ann Hum Genet. 2006]Am J Hum Genet. 2003 Sep; 73(3):516-23.
[Am J Hum Genet. 2003]Ann Hum Genet. 2006 Sep; 70(Pt 5):666-76.
[Ann Hum Genet. 2006]J Med Genet. 2004 Aug; 41(8):e101.
[J Med Genet. 2004]Ann Hum Genet. 2006 Nov; 70(Pt 6):965-70.
[Ann Hum Genet. 2006]Nat Genet. 2003 Mar; 33 Suppl():228-37.
[Nat Genet. 2003]Am J Hum Genet. 2000 Nov; 67(5):1348-51.
[Am J Hum Genet. 2000]Am J Hum Genet. 2006 Jul; 79(1):62-6.
[Am J Hum Genet. 2006]Lancet. 1991 Oct 12; 338(8772):954.
[Lancet. 1991]Arctic Med Res. 1992 Jul; 51(3):136-42.
[Arctic Med Res. 1992]Heart. 2004 Mar; 90(3):259-63.
[Heart. 2004]Lancet. 1991 Oct 12; 338(8772):954.
[Lancet. 1991]Am J Obstet Gynecol. 1981 Nov 15; 141(6):629-36.
[Am J Obstet Gynecol. 1981]Am J Hum Genet. 1999 Dec; 65(6):1493-500.
[Am J Hum Genet. 1999]Nat Genet. 2008 May; 40(5):631-7.
[Nat Genet. 2008]Hum Mol Genet. 2007 Jan 15; 16(2):233-41.
[Hum Mol Genet. 2007]Nature. 2003 Dec 18; 426(6968):789-96.
[Nature. 2003]Ann Hum Biol. 1983 May-Jun; 10(3):265-80.
[Ann Hum Biol. 1983]Nat Genet. 2008 May; 40(5):631-7.
[Nat Genet. 2008]Hum Mol Genet. 2007 Jan 15; 16(2):233-41.
[Hum Mol Genet. 2007]Nature. 2003 Dec 18; 426(6968):789-96.
[Nature. 2003]Nat Genet. 2008 May; 40(5):631-7.
[Nat Genet. 2008]Nat Genet. 2008 Apr; 40(4):437-42.
[Nat Genet. 2008]Nature. 2007 Oct 18; 449(7164):851-61.
[Nature. 2007]Am J Hum Genet. 2007 Sep; 81(3):559-75.
[Am J Hum Genet. 2007]Am J Hum Genet. 2007 Sep; 81(3):559-75.
[Am J Hum Genet. 2007]Hum Hered. 2005; 59(2):118-24.
[Hum Hered. 2005]Nat Rev Genet. 2003 Aug; 4(8):587-97.
[Nat Rev Genet. 2003]Am J Hum Genet. 2001 Jan; 68(1):191-197.
[Am J Hum Genet. 2001]Nature. 2001 May 10; 411(6834):199-204.
[Nature. 2001]Nat Rev Genet. 2003 Aug; 4(8):587-97.
[Nat Rev Genet. 2003]Genome Biol. 2007; 8(10):R228.
[Genome Biol. 2007]Nucleic Acids Res. 2007; 35(6):2013-25.
[Nucleic Acids Res. 2007]Proc Natl Acad Sci U S A. 2005 Nov 15; 102(46):16723-8.
[Proc Natl Acad Sci U S A. 2005]Am J Hum Genet. 2007 Sep; 81(3):559-75.
[Am J Hum Genet. 2007]Genome Res. 2002 Jun; 12(6):996-1006.
[Genome Res. 2002]Proc Natl Acad Sci U S A. 2007 Dec 11; 104(50):19942-7.
[Proc Natl Acad Sci U S A. 2007]Am J Hum Genet. 1999 Dec; 65(6):1493-500.
[Am J Hum Genet. 1999]Proc Natl Acad Sci U S A. 2007 Dec 11; 104(50):19942-7.
[Proc Natl Acad Sci U S A. 2007]Hum Mol Genet. 2007 Jan 1; 16(1):1-14.
[Hum Mol Genet. 2007]Hum Mol Genet. 2006 Mar 1; 15(5):789-95.
[Hum Mol Genet. 2006]Ann Hum Genet. 2008 Mar; 72(Pt 2):261-78.
[Ann Hum Genet. 2008]Am J Hum Genet. 1999 Dec; 65(6):1493-500.
[Am J Hum Genet. 1999]Nature. 2007 Oct 18; 449(7164):851-61.
[Nature. 2007]Hum Mol Genet. 2007 Jan 1; 16(1):1-14.
[Hum Mol Genet. 2007]Hum Mutat. 2006 Nov; 27(11):1115-21.
[Hum Mutat. 2006]Hum Mol Genet. 2007 Jan 1; 16(1):1-14.
[Hum Mol Genet. 2007]Am J Hum Genet. 2005 May; 76(5):763-72.
[Am J Hum Genet. 2005]Nature. 2007 Oct 18; 449(7164):851-61.
[Nature. 2007]Ann Hum Genet. 2006 Nov; 70(Pt 6):965-70.
[Ann Hum Genet. 2006]Proc Natl Acad Sci U S A. 2005 Nov 15; 102(46):16723-8.
[Proc Natl Acad Sci U S A. 2005]Am J Hum Genet. 2007 Sep; 81(3):559-75.
[Am J Hum Genet. 2007]Ann Hum Genet. 2006 Sep; 70(Pt 5):666-76.
[Ann Hum Genet. 2006]Am J Hum Genet. 2003 Sep; 73(3):516-23.
[Am J Hum Genet. 2003]Am J Hum Genet. 2006 Jul; 79(1):62-6.
[Am J Hum Genet. 2006]Proc Natl Acad Sci U S A. 2007 Dec 11; 104(50):19942-7.
[Proc Natl Acad Sci U S A. 2007]Ann Hum Genet. 2008 Mar; 72(Pt 2):261-78.
[Ann Hum Genet. 2008]Proc Natl Acad Sci U S A. 2007 Dec 11; 104(50):19942-7.
[Proc Natl Acad Sci U S A. 2007]Hum Hered. 2005; 59(2):118-24.
[Hum Hered. 2005]Nat Rev Genet. 2003 Aug; 4(8):587-97.
[Nat Rev Genet. 2003]Proc Natl Acad Sci U S A. 2007 Dec 11; 104(50):19942-7.
[Proc Natl Acad Sci U S A. 2007]