Logo of ajhgGuide for AuthorsAbout this journalExplore this journalAmerican Journal of Human Genetics
Am J Hum Genet. 2006 Apr; 78(4): 713–720.
Published online 2006 Feb 17. doi: 10.1086/502682
PMCID: PMC1424681
PMID: 16532401

The Power to Detect Disease Associations with Mitochondrial DNA Haplogroups

Abstract

Genetic variation of mitochondrial DNA (mtDNA) has been linked to a number of multifactorial diseases, but there is currently no tool available to predict the optimal size for these investigations. We used a simulation-based (Monte Carlo) permutation test to generate power curves for European mtDNA haplogroup studies, to derive a universal equation to enable power calculations for prospective studies across the globe, and to show that very large cohorts are required to reliably detect an association with complex human diseases. In some populations, geographical variation in haplogroup frequencies will prevent the reliable detection of subtle haplogroup associations with uncommon disorders.

Human mtDNA codes for 13 essential polypeptide components of the mitochondrial respiratory chain that generates the principal source of intracellular energy, ATP (DiMauro and Schon 2003). mtDNA is inherited almost exclusively through the maternal line and is highly polymorphic. Human populations can be divided into several mtDNA haplogroups on the basis of specific SNPs scattered throughout the mitochondrial genome, reflecting mutations accumulated by a discrete maternal lineage. Each mtDNA haplogroup forms a mutually exclusive category (Torroni and Wallace 1994). Among Europeans, 95% of the population belongs to 1 of 10 haplogroups: H, I, J, K, M, T, U, V, W, and X (Torroni et al. 1996). Given the fundamental role of the mitochondrial genome in cellular metabolism, there have been a number of studies investigating the association between mtDNA lineages and multifactorial diseases and aging (e.g., De Benedictis et al. 1999; Ruiz-Pesini et al. 2000; Carrieri et al. 2001; Niemi et al. 2003; Mancuso et al. 2004; van der Walt et al. 2004). Despite initially promising results, often reaching high levels of statistical significance (α), it has rarely been possible to replicate the findings of the original reports. There are a number of explanations for the inconsistency, including difficulties in matching cases and controls and when there is a genuine variation in the size of a genetic effect in different populations. However, the consistent inability to reproduce the original result brings into question the power of individual studies.

The standard statistical method for analyzing haplogroup distributions is to use a 2×NH contingency table and the χ2 test, where NH is the number of haplogroups. However, low counts in the less common haplogroups will tend to inflate the χ2 value and lead to a false-positive result (Roff and Bentzen 1989). This can been tackled in one of two ways, either by performing Fisher’s exact test for each haplogroup in turn, with an appropriate correction for multiple significance testing, or by grouping the less frequent haplogroups. Both approaches potentially mask a real difference between two haplotypes, particularly if they are uncommon in the general population. In addition, correcting for multiple significance testing is not straightforward, since each haplogroup frequency is dependent on the others. To address this problem, we developed a Monte Carlo permutation test to determine the exact type I error (false-positive rate) associated with a specific data set under study. (See the “Methods” section in appendix A.) This Monte Carlo permutation test has the advantage that it is not dependent on any prior assumptions about the distribution of the data or the number of haplogroups under study. We then compared the results of the Monte Carlo simulations with the results of a number of published studies.

In most cases, direct simulation generated a level of significance (α) that was greater than the theoretical value based on an assumed null χ2 distribution (table 1). For some studies, the simulated result was not significant by conventional criteria. As expected, these studies were thorough in determining the frequency of rare haplogroups (defined as <5% of controls). However, because of the limited study size, a number of haplotype groups contained fewer than five individuals. The simulations confirmed that, under these circumstances, the χ2 value is artificially elevated, and tabulated values of χ2 should not be used to determine statistical significance. This problem can be avoided by applying Cochran’s “rule of thumb” that no expected frequency be <1 and that ⩽20% of the expected frequencies be <5 (Cochran 1954). If the results of a study do not comply with this rule, then direct simulation provides an unbiased and reliable alternative. This is often the case for mtDNA association studies, in which 6 of the 10 European haplogroups are each found in <10% of the population but account for 17% of the total (Torroni et al. 1996).

Table 1

Comparison of Published mtDNA Haplogroup Association Studies with the Results of the Present Study[Note]

Data from Previously Published Study
PhenotypeSourceNo. of ControlsNo. of CasesαaExact αb
CentenarianDe Benedictis et al. 19995126.017.0226
Sperm motility (males only)cRuiz-Pesini et al. 2000180365.0311.1435
Alzheimer disease, with an APOE ɛ4 alleledCarrieri et al. 200111994.018.0233
Dementia, with Lewy bodiesChinnery et al. 200017984NR.1347
Parkinson diseasevan der Walt et al. 2004340609NR.1374
Longevity (>90 years):Niemi et al. 2003
 General controls400225.01.011
 Comparison with healthy infants257225.00005.0003
Amyotrophic lateral sclerosisMancuso et al. 2004150222.016e.0204
Parkinson diseasePyle et al. 2005447455<.001<.0001

Note.— Our reanalysis of the original data in table 3 of Mancuso et al (2004) revealed a significant difference in the overall haplogroup distribution between the patients and controls (χ2=18.54; α=.016). Five of the cells in the χ2 table had expected counts of <5, thus potentially elevating the χ2 value and leading to an inappropriately low α. This was confirmed by Monte Carlo simulation in which the exact α was much greater, .0204. Our subsequent power studies show that very large sample sizes are required to reliably detect a modest difference in haplogroup frequencies between two groups. This does not mean that it is impossible to detect a difference with a smaller study size, but it does mean that the chance of detecting such a difference is much lower.

aNR = not reported.
bDetermined by Monte Carlo simulation; 10,000 iterations.
cComparison of males with asthenozoospermia and males who were nonasthenozoospermic. Haplogroups K and U were merged in this study.
dPatients with Alzheimer disease who carry an APOE ɛ4 allele compared with patients with Alzheimer disease who do not carry an APOE ɛ4 allele,
eMancuso et al (2004) report “the frequency of haplogroups H, J, K, T, U, V, W and X did not differ between the two groups” (p. 160).

We used a Monte Carlo simulation to determine the power to detect a difference between cases and controls at different levels of significance, on the basis of the known distribution of the 10 major European haplogroups (Torroni et al. 1996) (see fig. 1 for examples). We simulated both increases and decreases in the frequency of a each haplogroup, with the difference distributed proportionally between the remaining haplogroups, thus simulating a typical exploratory mtDNA haplogroup study for which there is no a priori assumption of an association with any one specific haplogroup. Figure 2A shows simulated data for five haplogroups, at α=.05. As this plot shows, the power values do not have a simple relationship with the raw number of cases and controls. In figure 2B, we normalize the X-axis, using a relationship derived from standard binomial theory (Armitage et al. 2002), and show that there is a simple relationship between the power and the normalized number of cases and controls. These results were compared with the theoretical curve for two haplogroups (fig. 2B for α=.05; fig. A1 for α=.01 and α=.001). The curve described by the simulation data for the 10 European haplogroups was similar in shape to the theoretical 2×2 curve (corresponding to 2 haplogroups) but was displaced to the right. To explore the effect of different numbers of haplogroups on the position power curve, we then simulated other theoretical haplogroup distributions within the population (2, 5, or 20 discrete categories). These subdivisions could correspond to superhaplogroups, haplogroup clusters, or any mutually exclusive sequence variants in any population. The entire data set collapsed to a single universal curve when the X-axis was normalized by the number of haplogroups, NH, raised to the power 0.37, by use of the expression

equation image

where Nc is the number of cases with a disease (assumed to be equal to the number of controls), p0 is the frequency of the haplogroup in the control population, and p1 is the frequency of the haplogroup in the disease group (fig. 2C for α=.05; fig. A2 for α=.01 and α=.001). The power ± SD (0.37±0.01) was determined by the standard approach of curve fitting. For the European haplogroup data studied here, NH=11, reflecting additional category “other” for individuals not belonging to the 10 major haplogroups.

An external file that holds a picture, illustration, etc.
Object name is AJHGv78p713fg1.jpg

Examples of mtDNA haplogroup power curves determined by simulation—haplogroup H (A), haplogroup J (B), and haplogroup I (C)—on the basis of control data for the European population. The abscissa shows the number of cases and controls simulated (e.g., 400 = 400 cases and 400 controls). Black squares/solid line = .05 significance level; green circles/dashed line = .01 significance level; red triangles/dotted line = .001 significance level. The figures on the graph describe the percentage change in haplogroup frequency associated with a particular group of curves; for example, 100%↑ = haplogroup increases by 100%, with the remaining cases distributed evenly over the remaining nine haplogroups. For clarity, only the .05 level data is shown for haplogroup I.

An external file that holds a picture, illustration, etc.
Object name is AJHGv78p713fg2.jpg

The power to detect an association between mtDNA haplogroups and disease. In all situations, the number of disease cases is assumed to be equal to the number of control subjects. A, Monte Carlo simulation data for the European haplogroups H, I, J, K, and U. Data from the individual simulations, including those shown in figure 1, are shown on the same graph for different changes in the percentage level of a particular haplogroup at the α=.05 significance level. This demonstrates the scatter of the data points. B, These data collapse onto a simple sigmoid curve, with the X-axis scaling based on standard binomial theory. Nc is the number of disease cases (equal to the number of controls), p0 is the frequency of the haplogroup in the control population, and p1 is the frequency of the haplogroup in the cases. Data are shown for the significance level α=.05 (for α=.01 and α=.001, see fig. A1). Both the data and the theoretical curve describe the same sigmoidal shape, with the European haplogroup simulation data (with the 10 major European haplogroups plus “others” being equivalent to a 2×11 table) shifted to the right. C, Simulations (symbols) and theoretical 2×2 curve (red line) for populations with different numbers of mutually exclusive haplogroups. The simulated subdivisions could correspond to superhaplogroups, haplogroup clusters, or any mutually exclusive sequence variants in any population. All of the data collapse onto a single curve when the X-axis is normalized by the number of haplogroups, NH, raised to the power 0.37 (eq. [1]). Data are shown for the significance level α=.05 (for α=.01 and α=.001, see figure A2). D, Example showing the number of cases and controls required to generate 90% power at the .05 significance level for a study of the 10 major European haplogroups (NH=11 to account for the <5% that do not fall into these 10 groups and are considered “others”). Haplogroup proportions in the control group are based on published values (haplogroup H=0.41 in controls [black line]; I=.02 in controls [blue line]; J=.11 in controls [red line]) (Torroni et al. 1996).

Equation (1) can be used to determine the minimum number of disease cases (NCmin) and control subjects required for a specific level of power,

equation image

by use of the scaling value, Nscaled, derived from the universal curves at each significance level α (table 2). Ninety percent power is achieved when Nscaled is 8.5 for α=.05, 11 for α=.01, and 14 for α=.001, as derived from figure 2C (table 3). This approach can be used for any number of mutually exclusive genotypes in any population (fig. 3). Resolving equation (2) in terms of the odds ratio (OR) is complex, but equation (2) can be used by determining the proportion of cases that correspond to a specific OR for a given haplogroup, by use of the standard equation

equation image

An external file that holds a picture, illustration, etc.
Object name is AJHGv78p713fg3.jpg

Algorithm to determine the power of an mtDNA haplogroup association study at a given level of statistical significance. The algorithm is based on the assumption that no specific haplogroup is suspected of being associated with a disease. α = statistical significance; NH = the number of haplogroups (mutually exclusive genotypes) in a given study; p0 = the frequency of the allele in control subjects; p1 = the frequency of the allele in individuals with the disease; NCmin = the minimum number of cases to generate a given power.

Table 2

Parameter Values for the Power, Used in Equation (A1)[Note]

Variableα=.05α=.01α=.001
A−28 ± 5−13 ± 2−7 ± 1
x02.6 ± 0.25.0 ± 0.17.4 ± 0.1
d2.4 ± 0.12.5 ± 0.12.8 ± 0.1

Note.— These values can be used with equation (A2) to determine the minimum number of cases and controls required for a given power.

Table 3

Scaled Number of Cases Required to Generate a Given Level of Power at a Given Level of Significance[Note]

Scaled No. of Casesand Controls (Nscaled)
Power
(%)
α=.05α=.01α=.001
503.65.57.8
756.08.110.7
908.510.813.8

Note.— The values are derived from a Bolzmann fit (eq. [A1]; r2>0.98) of simulations shown in fig. 2C. The scaled number of cases is defined in equation (1). The scaled number of cases and controls required to generate a given power can be read from these curves or calculated from equation (A1) and equation (A2). For example, for a control population with NH haplogroups, the number of cases and controls required to detect a change in the proportion of a given haplogroup from p0 in controls to p1 in the cases is given in equation (2). This approach was used to generate the example shown in figure 2D.

The ORs, based on one published control data set (Torroni et al. 1996), corresponding to percentage changes in haplogroup frequency for the 10 European haplogroups are shown in tables tablesA1A1 and andA2.A2. An example of this approach is given in figure 2D, with use of the European haplogroup distribution for 90% power at α=.05. These results also show the limited power of moderate study sizes. With no prior reason to suspect an association with haplogroup H, ∼6,000 cases and ∼6,000 controls are required to form a study with 90% power to detect a 10% change in the frequency of haplogroup H in European populations (corresponding to an OR >1.18). Studies with only 500 cases and 500 controls would have only 90% power to detect >35% increase in the frequency of haplogroup H, corresponding to an OR >1.75. For less common haplogroups or more-subtle changes in haplogroup frequency, even greater sample sizes will be required (fig. 2D).

mtDNA is inherited almost exclusively down the maternal line and undergoes little, if any, intermolecular recombination. This makes the haplotype distribution in a given sample exquisitely sensitive to minor population genetic bottlenecks, which lead to differences in haplogroup frequency over short geographical distances. These factors increase the chance of detecting a spurious difference between cases and controls (false-positive or type I error). The standard approach to this problem is to consider the first study result as preliminary and to then generate a hypothesis that should be tested on an independent cohort. Here, we present a single equation that can be applied to mtDNA haplogroup studies around the globe, enabling a priori power calculations for exploratory or confirmatory studies. However, subtle changes in haplogroup frequency will require massive cohorts. In some populations, the frequency of mtDNA haplogroups in controls varies considerably over short geographic differences (Ghezzi et al. 2005), which makes it impossible to collect an adequately sized homogeneous study cohort, particularly for uncommon diseases. The same applies to population-genetics studies based on haplogroup comparisons between control populations across the globe, which have been used to deduce patterns of population migration (Mishmar et al. 2003; Ruiz-Pesini et al. 2004). These studies are equally likely to suffer from type I error, and they often show subtle differences in haplogroup frequency but require adequate sample sizes to demonstrate and confirm geographic differences in haplogroup distribution. An inability to reproduce the result with an adequately powered study would question the validity of the original result.

Acknowledgments

P.F.C. is a Wellcome Trust Senior Fellow in Clinical Science and receives additional grant support from The Wellcome Trust, Ataxia UK, The Alzheimer’s Research Trust, the Association Française Contre les Myopathies, the United Mitochondrial Diseases Foundation, and the European Union FP6 program MITOCIRCLE.

Appendix A: Methods

To determine whether there is significant heterogeneity in a given data set, the standard approach is to calculate χ2, which is the sum of squares for the absolute difference between observed and expected values relative to the expected value, for each cell in a contingency table. For large sample sizes, statistical significance can be determined by comparing the calculated χ2 value with tabulated values at specified degrees of freedom. However, when the number of observed counts is low, this approach is not reliable. Under these circumstances, the statistical significance can be determined by directly simulating the χ2 distribution for a given data set. The Monte Carlo permutation test we present here is based on this principle and was adapted from the method of Roff and Bentzen (1989). For a given data set, the program generates a series of contingency tables by randomly assigning individual observations to each cell. The χ2 value is then calculated for each simulated data set and is compared with the original χ2 value. The probability of observing the original data set under the null hypothesis equates to the proportion of simulated χ2 values greater than the χ2 value for the original data set. Power simulations used the same algorithm, with simulation of different alterations in the haplogroup distribution for studies of different sizes. Subsequent analyses, curve fitting, and the generation of power curves were performed with Microcal Origin software (v. 6).

The power in figure figure2B2B and and2C2C is also well described (R2>0.98) by a Boltzmann equation,

equation image

which is easy to apply, where the power is expressed as a percentage and Nscaled is the scaled X-axis of figure 2, defined in equation (1). A, x0, and d are parameters of the fit (values given in table 2) allowing the power to be derived for any haplogroup distribution, significance level, and study size. The Boltzmann equation

equation image

can be solved to give the value of Nscaled required to achieve a given power.

These equations are equally applicable to increases or decreases in the haplogroup frequency.

Figure A1

An external file that holds a picture, illustration, etc.
Object name is AJHGv78p713fg4.jpg

Monte Carlo simulation data for the European haplogroups H, I, J, K, and U. Data are from the individual simulations (fig. 1) and the theoretical curve for a 2×2 table based on binomial theory for the significance levels α=.01 and α=.001. These data describe a simple sigmoid curve, with the X-axis based on standard binomial theory. Nc is the number of disease cases (which is equal to the number of controls), p0 is the frequency of the haplogroup in the control population, and p1 is the frequency of the haplogroup in the cases. Both the data and the theoretical curve describe the same sigmoidal shape, with the European haplogroup simulation data (with the 10 major European haplogroups plus “others” being equivalent to a 2×11 table) shifted to the right.

Figure A2

An external file that holds a picture, illustration, etc.
Object name is AJHGv78p713fg5.jpg

Simulations (symbols) and theoretical 2×2 curve (red line) for populations with different numbers of mutually exclusive haplogroups. The simulated subdivisions could correspond to superhaplogroups, haplogroup clusters, or any mutually exclusive sequence variants in any population. All of the data collapse onto a single curve when the X-axis is normalized by the number of haplogroups, NH, raised to the power 0.37 (eq. [1]). Data are shown for significance level α=.01 and α=.001.

Table A1

ORs Corresponding to an Increase in Haplogroup Frequency for the 10 European Haplogroups[Note]

OR by Haplogroup(Frequency in Controls)
Increase in
Haplogroup
Frequency
(%)
H
(.41)
I
(.02)
J
(.11)
K
(.08)
M
(.01)
T
(.13)
U
(.15)
V
(.03)
W
(.02)
X
(.02)
101.181.101.111.111.101.121.121.101.101.10
201.391.201.231.221.201.241.241.211.201.20
301.641.311.351.331.301.361.371.311.311.31
401.941.411.471.451.411.491.511.421.411.41
502.301.521.601.571.511.621.651.521.521.52
602.741.621.731.691.611.761.791.631.621.62
703.311.721.861.811.711.901.941.741.721.72
804.051.832.001.931.812.042.101.851.831.83
905.071.942.142.061.922.202.261.951.941.94
1006.562.042.282.192.022.352.432.062.042.04
1108.912.152.432.322.122.512.612.172.152.15
12013.242.262.582.462.232.682.792.282.262.26
13023.812.362.742.592.332.852.982.402.362.36
14088.502.472.902.732.433.033.192.512.472.47
1502.583.072.882.543.223.402.622.582.58
1602.693.243.022.643.423.622.742.692.69
1702.803.423.172.753.623.862.852.802.80
1802.913.603.322.853.834.102.972.912.91
1903.023.793.472.964.054.363.083.023.02
2003.133.993.633.064.284.643.203.133.13
2103.244.193.793.174.524.933.323.243.24
2203.354.403.963.274.775.233.433.353.35
2303.464.614.133.385.035.553.553.463.46
2403.584.834.303.485.305.903.673.583.58
2503.695.074.473.595.596.263.793.693.69
2603.805.304.653.705.896.653.913.803.80
2703.925.554.843.806.207.074.043.923.92
2804.035.815.023.916.537.514.164.034.03
2904.156.085.224.026.887.994.284.154.15
3004.266.365.414.137.258.504.414.264.26
3104.386.655.614.237.649.054.534.384.38
3204.496.955.824.348.059.654.664.494.49
3304.617.266.034.458.4810.304.794.614.61
3404.737.596.254.568.9411.004.924.734.73
3504.857.936.474.669.4311.775.054.854.85
3604.968.296.704.779.9612.615.184.964.96
3705.088.666.934.8810.5113.545.315.085.08
3805.209.057.174.9911.1114.575.445.205.20
3905.329.467.415.1011.7415.725.575.325.32
4005.449.897.675.2112.4317.005.715.445.44

Note.— Values are derived from equation (3) and a published data set (Torroni et al. 1996). These values will differ for control groups with a different background-haplogroup frequency.

Table A2

ORs Corresponding to a Decrease In Haplogroup Frequency for the 10 European Haplogroups[Note]

OR by Haplogroup(Frequency in Controls)
Decrease in
Haplogroup
Frequency
(%)
H
(.41)
I
(.02)
J
(.11)
K
(.08)
M
(.01)
T
(.13)
U
(.15)
V
(.03)
W
(.02)
X
(.02)
10.84.90.89.89.90.89.88.90.90.90
20.70.80.78.79.80.78.77.80.80.80
30.58.70.67.68.70.67.66.69.70.70
40.47.60.57.58.60.57.56.59.60.60
50.37.49.47.48.50.47.46.49.49.49
60.28.40.37.38.40.37.36.39.40.40
70.20.30.28.28.30.27.27.29.30.30
80.13.20.18.19.20.18.18.20.20.20
90.06.10.09.09.10.09.09.10.10.10

Note.— Values derived from equation (3) and a published data set (Torroni et al. 1996). These values will differ for control groups with a different background-haplogroup frequency.

References

Armitage P, Berry G, Matthews JNS (2002) Statistical methods in medical research. Blackwell, Oxford, United Kingdom [Google Scholar]
Carrieri G, Bonafe M, De Luca M, Rose G, Varcasia O, Bruni A, Maletta R, Nicmias B, Sorbi S, Corsonello F, Feraco E, Andreev K, Yashin AI, Francheschi C, De Benedictis G (2001) Mitochondrial DNA haplogroups and APOE4 allele are not independent variables in sporadic Alzheimer’s disease. Hum Genet 108:194–19810.1007/s004390100463 [PubMed] [CrossRef] [Google Scholar]
Chinnery PF, Taylor G, Howell N, Andrews RM, Morris CM, McKeith IG, Perry RH, Edwardson JA, Turnbull DM (2000) Mitochondrial DNA haplogroups and susceptibility to AD and dementia with Lewy bodies. Neurology 55:302–304 [PubMed] [Google Scholar]
Cochran WG (1954) Some methods for strengthening the common χ2 tests. Biometrics 10:417–451 [Google Scholar]
De Benedictis G, Rose G, Carrieri G, De Luca M, Falcone E, Passarino G, Bonafe M, Monti D, Baggio G, Bertolini S, Mari D, Mattace R, Franceschi C (1999) Mitochondrial DNA inherited variants are associated with successful aging and longevity in humans. FASEB J 13:1532–1536 [PubMed] [Google Scholar]
DiMauro S, Schon EA (2003) Mitochondrial respiratory-chain diseases. N Engl J Med 348:2656–266810.1056/NEJMra022567 [PubMed] [CrossRef] [Google Scholar]
Ghezzi D, Marelli C, Achilli A, Goldwurm S, Pezzoli G, Barone P, Pellecchia MT, Stanzione P, Brusa L, Bentivoglio AR, Bonuccelli U, Petrozzi L, Abbruzzese G, Marchese R, Cortelli P, Grimaldi D, Martinelli P, Ferrarese C, Garavaglia B, Sangiorgi S, Carelli V, Torroni A, Albanese A, Zeviani M (2005) Mitochondrial DNA haplogroup K is associated with a lower risk of Parkinson’s disease in Italians. Eur J Hum Genet 13:748–75210.1038/sj.ejhg.5201425 [PubMed] [CrossRef] [Google Scholar]
Mancuso M, Conforti FL, Rocchi A, Tessitore A, Muglia M, Tedeschi G, Panza D, et al (2004) Could mitochondrial haplogroups play a role in sporadic amyotrophic lateral sclerosis? Neurosci Lett 371:158–16210.1016/j.neulet.2004.08.060 [PubMed] [CrossRef] [Google Scholar]
Mishmar D, Ruiz-Pesini E, Golik P, Macaulay V, Clark AG, Hosseini S, Brandon M, Easley K, Chen E, Brown MD, Sukernik RI, Olckers A, Wallace DC (2003) Natural selection shaped regional mtDNA variation in humans. Proc Natl Acad Sci USA 100:171–17610.1073/pnas.0136972100 [PMC free article] [PubMed] [CrossRef] [Google Scholar]
Niemi AK, Hervonen A, Hurme M, Karhunen PJ, Jylha M, Majamaa K (2003) Mitochondrial DNA polymorphisms associated with longevity in a Finnish population. Hum Genet 112:29–3310.1007/s00439-002-0843-y [PubMed] [CrossRef] [Google Scholar]
Pyle A, Foltynie T, Tiangyou W, Lambert C, Keers SM, Allcock LM, Davison J, Lewis SJ, Perry RH, Barker R, Burn DJ, Chinnery PF (2005) Mitochondrial DNA haplogroup cluster UKJT reduces the risk of PD. Ann Neurol 57:564–56710.1002/ana.20417 [PubMed] [CrossRef] [Google Scholar]
Roff DA, Bentzen P (1989) The statistical analysis of mitochondrial DNA polymorphisms: chi 2 and the problem of small samples. Mol Biol Evol 6:539–545 [PubMed] [Google Scholar]
Ruiz-Pesini E, Lapeña A-C, Díez-Sánchez C, Pérez-Martos A, Montoya J, Alvarez E, Díaz M, Urriés A, Montoro L, López-Pérez MJ, Enriquez JA (2000) Human mtDNA haplogroups associated with a high or reduced spermatozoa motility. Am J Hum Genet 67:682–696 [PMC free article] [PubMed] [Google Scholar]
Ruiz-Pesini E, Mishmar D, Brandon M, Procaccio V, Wallace DC (2004) Effects of purifying and adaptive selection on regional variation in human mtDNA. Science 303:223–22610.1126/science.1088434 [PubMed] [CrossRef] [Google Scholar]
Torroni A, Huoponen K, Francalacci P, Petrozzi M, Morelli L, Scozzari R, Obinu D, Savontaus ML, Wallace DC (1996) Classification of European mtDNAs from an analysis of three European populations. Genetics 144:1835–1850 [PMC free article] [PubMed] [Google Scholar]
Torroni A, Wallace DC (1994) Mitochondrial DNA variation in human populations and implications for detection of mitochondrial DNA mutations of pathological significance. J Bioenerg Biomembr 26:261–27110.1007/BF00763098 [PubMed] [CrossRef] [Google Scholar]
van der Walt JM, Dementieva YA, Martin ER, Scott WK, Nicodemus KK, Kroner CC, Welsh-Bohmer KA, Saunders AM, Roses AD, Small GW, Schmechel DE, Murali Doraiswamy P, Gilbert JR, Haines JL, Vance JM, Pericak-Vance MA (2004) Analysis of European mitochondrial haplogroups with Alzheimer disease risk. Neurosci Lett 365:28–3210.1016/j.neulet.2004.04.051 [PubMed] [CrossRef] [Google Scholar]

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics