# Case-Control Association Testing with Related Individuals: A More Powerful Quasi-Likelihood Score Test

## Abstract

We consider the problem of genomewide association testing of a binary trait when some sampled individuals are related, with known relationships. This commonly arises when families sampled for a linkage study are included in an association study. Furthermore, power to detect association with complex traits can be increased when affected individuals with affected relatives are sampled, because they are more likely to carry disease alleles than are randomly sampled affected individuals. With related individuals, correlations among relatives must be taken into account, to ensure validity of the test, and consideration of these correlations can also improve power. We provide new insight into the use of pedigree-based weights to improve power, and we propose a novel test, the *M*_{QLS} test, which, as we demonstrate, represents an overall, and in many cases, substantial, improvement in power over previous tests, while retaining a computational simplicity that makes it useful in genomewide association studies in arbitrary pedigrees. Other features of the *M*_{QLS} are as follows: (1) it is applicable to completely general combinations of family and case-control designs, (2) it can incorporate both unaffected controls and controls of unknown phenotype into the same analysis, and (3) it can incorporate phenotype data about relatives with missing genotype data. The methods are applied to data from the Genetic Analysis Workshop 14 Collaborative Study of the Genetics of Alcoholism, where the *M*_{QLS} detects genomewide significant association (after Bonferroni correction) with an alcoholism-related phenotype for four different single-nucleotide polymorphisms: *tsc1177811* (*P*=5.9×10^{-7}), *tsc1750530* (*P*=4.0×10^{-7}), *tsc0046696* (*P*=4.7×10^{-7}), and *tsc0057290* (*P*=5.2×10^{-7}) on chromosomes 1, 16, 18, and 18, respectively. Three of these four significant associations were not detected in previous studies analyzing these data.

We focus on the problem of testing for association between a binary trait and a genetic marker when cases and/or controls are related, with the pedigree(s) assumed known. An advantage in using multiplex families in association studies is that affected individuals who have affected relatives have a higher expected frequency of the alleles that increase susceptibility for a genetic trait than do affected individuals who do not have affected relatives. As a result, the power to detect association is expected to increase when affected individuals with affected relatives are included in the study.

Family-based association tests, such as the transmission/disequilibrium test (TDT),^{1} have the advantage that they are robust to population heterogeneity. However, such tests typically require genotype data for family members of an affected individual. Case-control designs are less restrictive than family-based designs, because they can allow but do not require genotype data for relatives of affected individuals, and they are generally more powerful than family-based designs.^{2}

When related individuals are used in case-control studies, one must account for the fact that subjects who are biologically related have correlated genotypes. One approach is to use the standard χ^{2} statistic with a correction factor that takes into account the pedigree information^{3} (*W*_{χ2corr}) or with a correction factor that takes into account the conditional probability of identity-by-descent (IBD) sharing, given both the observed genotype data and the pedigree information (the “posterior kinship coefficient”)^{4}, giving a statistic that we refer to as “*W*_{SS}.” Such approaches correct the type I error but still use equal weighting of individuals, among cases and among controls, which is expected to be suboptimal, in terms of power, when individuals are related. As an alternative approach, a quasi-likelihood score (*W*_{QLS}) test has been proposed.^{3} Like the *W*_{χ2corr}*,* the *W*_{QLS} accounts for the correlations among related individuals, to obtain the proper type I error rates. In addition, for a given alternative model, optimal weights, depending on the pedigree information, are used in the *W*_{QLS}, in an effort to improve power. For the situation when controls are not related to cases, a method for testing association of a binary trait to a haplotype has been proposed,^{5} where this method uses a similar weighting scheme to that of the *W*_{QLS}.

We analyze the strengths and weaknesses of the *W*_{χ2corr}, *W*_{QLS}, and *W*_{SS} tests, and we use this improved understanding to propose a new and more powerful test, the *M*_{QLS} test. (The *M* in *M*_{QLS} stands for “more powerful” or “modified.”) The *M*_{QLS} test is more widely applicable than the previously proposed tests, in two ways. First, it distinguishes between unaffected controls and controls of unknown phenotype (i.e., individuals on whom no direct phenotype information is measured) and can incorporate both into the analysis. Unaffected and unphenotyped general-population controls are the two standard types of controls in case-control studies of disease. Association tests based on combined samples may include both types. Sample sizes of cases and of controls strongly influence the power of the test, so including all available controls is desirable. However, under the alternative hypothesis, individuals who are known to be unaffected have a lower expected frequency of a predisposing allele than do individuals of unknown phenotype (when other factors such as relatives' phenotypes are held constant). This poses the problem of how best to combine the two types of controls in the analysis without compromising power. The *M*_{QLS} method provides a solution to this problem.

A second way in which the *M*_{QLS} test is more widely applicable than are the previously proposed tests is that it incorporates phenotype data about relatives who have missing genotype data at the marker being tested. This information is used to optimize the weights given to relatives with nonmissing genotype data at the marker being tested, following the principle that there is enrichment for predisposing variants in affected individuals with affected relatives. This enrichment principle implies, for example, that an affected individual with no phenotyped relatives should be weighted differently from an affected individual with an affected sibling and that this should still hold true when the affected sibling happens to have missing genotype data at the marker being tested. At the same time, the genotypes of the two siblings are dependent, so there should be downweighting of the siblings when they are both genotyped, which does not occur when only one is typed. The *M*_{QLS} takes into account both the enrichment principle and the effects of dependence in setting the weights.

In addition to the differences just mentioned, we show that, compared with the *W*_{χ2corr} and *W*_{SS}, the *M*_{QLS} improves power by providing a more efficient estimator of allele frequency under the null hypothesis. In large pedigrees, a further power difference between the *M*_{QLS} and *W*_{SS} would theoretically arise from the fact that the *W*_{SS} essentially corrects for the presence of linkage in a family when testing for association,^{5} whereas the *M*_{QLS} allows both linkage and association to contribute to the test statistic. A more relevant difference between the *W*_{SS} and *M*_{QLS} is that the *M*_{QLS} is computationally feasible in large pedigrees, whereas the *W*_{SS} is not. Improvement of the *M*_{QLS} over the *W*_{QLS} is obtained primarily by capitalizing on the property that there is an enrichment for predisposing variants in affected individuals with affected relatives. The *M*_{QLS} is remarkable in its computational simplicity and can be used for any set of individuals, regardless of the complexity of the relationships of the individuals.

We give a statistical argument that the *M*_{QLS} should be a powerful test: the *M*_{QLS} maximizes the noncentrality parameter over a general class of linear statistics for all two-allele (single-locus) disease models in outbred samples, as the effect size tends to 0, where we allow environmental effects that do not have familial correlation (see the “^{Development and Justification of the MQLS Test}” section). We simulate various multilocus disease models and directly compare the type I error and the power of the *M*_{QLS}, *W*_{QLS}, and *W*_{χ2corr} tests in samples of related individuals. Since the current implementation of the *W*_{SS} does not allow it to be applied to these particular simulated data sets, we instead use the true IBD-sharing information from the simulations to investigate the use of posterior kinship coefficients to compute the variances of any of the case-control statistics. We find that, even if correction for linkage is desired, the extra computation required to use posterior kinship coefficients is generally not worthwhile in small pedigrees and is not computationally practical in larger pedigrees. We apply our methods to the Genetic Analysis Workshop (GAW) 14 Collaborative Study of the Genetics of Alcoholism (COGA) data,^{6} to identify regions of the genome that are associated with alcohol dependence (MIM 103780).

## Methods

### Association Testing with a Biallelic Marker

Suppose that we have phenotype information about a binary trait for *n*+*m* sampled individuals, with each individual coded as “affected,” “unaffected,” or “unknown.” Consider a single biallelic marker with allele labels “0” and “1” (the extension to multiallelic markers is given in appendix A), and suppose that the first *n* of the *n*+*m* listed individuals have nonmissing genotype data at the given marker, whereas the last *m* individuals have missing genotype data at the marker. For the first *n* individuals, let *Y*=(*Y*_{1},…*Y*_{n})^{T}*,* where *Y*_{i}= 1/2×(the number of alleles of type 1 in individual *i*), so the value of *Y*_{i} is 0, 1/2, or 1. The *n*+*m* individuals are assumed to be ascertained with respect to phenotype, and they may be arbitrarily related (including inbreeding), with the pedigree(s) that specify the relationships assumed to be known. Let *p* be the frequency of allele 1 in the general population, where 0<*p*<1. Under the null hypothesis of no association between the given marker and the trait (and making the obvious assumption that ascertainment is conditionally independent of marker genotype, given phenotype and pedigree information), *Y* has mean *p*1, where **1** is a column vector of length *n* with every entry equal to 1. To obtain the null variance of *Y,* more assumptions are required. Namely, we assume that, under the null hypothesis, (1) there is neither linkage nor association between the given marker and the trait and (2) the pedigree founders are drawn from a population in Hardy-Weinberg equilibrium (HWE) for the given marker. (Note that we do not require HWE in the founders under the alternative model.) In that case, the null variance is given by (see, e.g., the work of Bourgain et al.^{3}), where Φ is the kinship matrix of the nonmissing individuals, given by

and *h*_{i} is the inbreeding coefficient of individual *i,* and _{ij} is the kinship coefficient between individuals *i* and *j,* where 1*i**n* and 1*j**n**.*

### Overview of the *W*_{QLS}, *W*_{χ2corr},*W*_{SS}, and *M*_{QLS} Tests

To our knowledge, it has not been previously pointed out that the case-control association test statistics *W*_{QLS}, *W*_{χ2corr}*, W*_{SS}, and *M*_{QLS} can all be thought of as having the common form

where

is an estimator of allele frequency calculated under the assumption of no association,

is a contrasting estimator of allele frequency that should have a different expectation from when there is association, and denotes variance calculated under the assumption that the null hypothesis is true. For each of the four statistics, *w*_{test} and *w*_{null} are column vectors of length *n,* which can be written as functions of the phenotype and pedigree information.

The *W*_{QLS}, *W*_{χ2corr}*,* and *W*_{SS} tests assume that every genotyped individual is coded as either “case” or “control,” with no particular distinction made between unaffected controls and controls of unknown phenotype (e.g., general-population controls). Furthermore, these tests ignore the phenotype information on the additional *m* relatives with missing genotypes. In contrast, the *M*_{QLS} treats unaffected controls and controls of unknown phenotype differently and can handle samples that contain both types. It also takes into account the phenotype data for the additional *m* relatives with missing genotypes. For the *W*_{QLS}, *W*_{χ2corr}*,* and *W*_{SS}, phenotype data are accordingly assumed to be coded as 1_{c}*,* a column vector of length *n* with *i*th entry 1 if individual *i* is a case and 0 if *i* is a control. We define *n*_{c}=1^{T}1_{c} as the number of cases among the *n* individuals with nonmissing genotypes. For the *M*_{QLS} test, the phenotype data are coded as *A,* a column vector of length *n*+*m* having *i*th entry 1 if individual *i* is affected,

if *i* is unaffected, and 0 if *i* is of unknown phenotype. Here 0<*k*<1 is a constant that must be specified (see the “^{Development and Justification of the MQLS Test}” section for details). Write *A*^{T}=(*A*^{T}_{N},*A*^{T}_{M}), where *A*_{N} and *A*_{M} are column vectors containing the first *n* elements and last *m* elements of *A,* respectively, where *N* and *M* stand for the sets of individuals with nonmissing and missing genotypes, respectively, at the particular marker. For the *M*_{QLS} test to use the additional phenotype information from the *m* individuals with missing genotype data, additional relationship information is needed in the form of an *n*×*m* matrix Φ_{N,M}*,* giving the kinship coefficients between the nonmissing and missing individuals—that is, Φ_{N,M} has (*i*,*j*)th element equal to 2_{i,n+j}, where _{i,n+j} is the kinship coefficient between the *i*th nonmissing individual and the *j*th missing individual. We write Φ_{N,NM} for the *n*×(*n*+*m*) matrix with first *n* columns equal to the Φ of equation (1) and last *m* columns equal to Φ_{N,M}*.*

Table 1 gives *w*_{test} and *w*_{null} for *W*_{χ2corr}, *W*_{SS}, *W*_{QLS}, and *M*_{QLS}, and for *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS}. When the *W*_{QLS}, *W*_{χ2corr}, and *M*_{QLS} statistics are calculated from data, the *p* in the variance expression is estimated by . From table 1, we see that the weights and variances for *W*_{QLS} and *M*_{QLS} involve Φ^{-1}. It can be shown that Φ is invertible provided that the sample of individuals with nonmissing genotypes does not include both members of any MZ twin pair (see appendix B). We have also developed an extension of the *W*_{QLS} and *M*_{QLS} to the case when the genotyped sample does include both members of one or more MZ twin pairs. In that case, we calculate the *W*_{QLS} and *M*_{QLS} by coding each MZ twin pair as a single individual, where, for the *W*_{QLS}, we assign the phenotype to be the average of the phenotype values for the pair (so the phenotype would take value 0, 1, or 1/2 if the members of the pair were both controls, both cases, or one of each, respectively) and, for the *M*_{QLS}, we assign the phenotype value to be the sum of the phenotype values for the MZ twin pair (so the phenotype would take value 0,

or 2 if the members of the pair were both of unknown phenotype, both unaffected, or both affected, respectively, and value

if one is affected and the other unaffected, etc.). It turns out that this procedure is justified by the fact that it is mathematically equivalent to substituting the Moore-Penrose generalized inverse (see, e.g., the work of Schott^{7}) of Φ in place of Φ^{-1} in the formulas of table 1. Under regularity conditions on how Φ_{N,NM} behaves as *n*+*m*→∞, which are sufficient for a central-limit theorem to hold, each of the four statistics follows a χ^{2} distribution with 1 df asymptotically under the null hypothesis. A previous simulation study documented the accuracy of the χ^{2} approximation for type I error of *W*_{χ2corr} and *W*_{QLS}.^{3}

For both the *W*_{χ2corr} and *W*_{SS}, is the sample mean based on cases and is the sample mean based on everyone. The distinction between the two statistics is in the variance calculation. Previous work^{4} gives the variance calculation for *W*_{SS}, for the case of outbred individuals, as a function of variances and covariances of genotype indicators for sampled individuals. The variances of the indicators are then calculated as a function of genotype probabilities and are valid under violations of HWE. However, the covariances are calculated with use of an ITO method that is valid only under HWE and with use of posterior IBD probabilities that are also calculated assuming HWE. Arguably the most substantial distinction between the variance calculations for *W*_{χ2corr} and *W*_{SS} is in the use of so-called prior versus posterior kinship coefficients or, equivalently, the probability of IBD sharing for a pair based on the pedigree alone versus conditional on genotype data as well as on the pedigree. When the data are in HWE and all genotyped individuals are outbred, the variance expression for *W*_{SS} is the same as that for *W*_{χ2corr} in table 1, except that Φ is replaced by Φ_{posterior}, where Φ_{posterior} has (*i,j*)th element equal to

where ξ_{ij}(*k*)=*P*(*i* *and* *j* *share* *k* *allelesIBDatthegivenmarker*|*allgenotypeandpedigreeinformation*). We call Φ_{posterior} the posterior kinship matrix, and, in the “^{Assessment of Use of Φ versus Φposterior in Variance Calculations}” section, we consider the effect on power of the use of Φ versus Φ_{posterior} in the variance calculations for each of the statistics.

For *W*_{QLS}, is the best linear unbiased estimator (BLUE)^{8} based on everyone, and, in the special situation when the controls are not related to the cases, is the BLUE based on cases. The original presentation^{3} of the *W*_{QLS} was as a quasi-likelihood score statistic based on the mean model

with μ=*p*1+*r*1_{c}—that is,

where 0<*p*+*r*<1. Under the null hypothesis of no association, *r*=0. An equivalent expression^{3} for *W*_{QLS} is

where is the quasi-likelihood estimator of *p* when *r*=0, is the covariance matrix Σ evaluated at *r*=0 and —that is, —and is the mean vector μ evaluated at *r*=0 and —that is, . The *W*_{QLS} test has been shown^{3} to have certain optimality properties based on this model.

For the *M*_{QLS}, is the BLUE based on everyone. In the special case of no missing genotype data at the marker, for the *M*_{QLS} is calculated by taking a weighted average of the elements of *Y* in which each affected individual’s *Y* value gets a weight of 1, each unaffected individual’s *Y* value gets a weight of

and *Y* values of individuals of unknown phenotype get weight of 0. When some individuals have missing genotype data, then *w*_{test,i}, the weight of the *i*th typed individual’s *Y* value in , has an additional term added on, whose value is a function of the phenotypes of individuals with missing genotype data that occur in the same pedigree with individual *i.* In that case, *w*_{test,i}=*A*_{i}+(Φ^{-1}_{f}Φ_{f,N,M}*A*_{f,M})_{i}, where *f* is the pedigree containing individual *i,* and Φ^{-1}_{f}, Φ_{f,N,M}*,* and *A*_{f,M} are, respectively, Φ^{-1}, Φ_{N,M}, and *A*_{M}*,* restricted to members of *f.* In the “^{Development and Justification of the MQLS Test}” section, we justify the form of the *M*_{QLS}, given in table 1, on theoretical grounds, and, in the “^{Power Comparison of Wχ2corr, WQLS, and MQLS}” and the “^{Assessment of Use of Φ versus Φposterior in Variance Calculations}” sections, we demonstrate that it generally improves power over previously proposed tests.

### Development and Justification of the *M*_{QLS} Test

The model on which *W*_{QLS} is based (eq. [2]) has the advantage of being simple and intuitive, and it works well in samples of unrelated individuals. However, when the sample consists of related individuals and the trait is complex, this model does not capture certain features of allele-frequency differences associated with a genetic trait. Situations in which the model in equation (2) would be expected to hold more or less exactly, even in a sample of related individuals, include (1) testing for an allele-frequency difference between two distinct populations (e.g., Swedes and Japanese) without admixture and (2) testing for association with a trait within a single population when the true genetic model is a rare, fully penetrant dominant allele. In such cases, the *W*_{QLS} enjoys certain optimality properties,^{3} which can be verified by simulation (see the “Results” section). One might hope that the simple model would be robust enough to maintain power with complex traits, but we find that power can be improved substantially by modifying the model while still retaining the computational simplicity of the original method.

The motivation for development of the *M*_{QLS} is to improve the power of the quasi-likelihood score test *W*_{QLS} by explicitly taking into account the fact that affected individuals who have affected relatives have a higher expected frequency of the alleles that increase susceptibility for a genetic trait than do individuals who do not have affected relatives. The simple model in equation (2) ignores this fact. Furthermore, when case individual *i* and control individual *j* are related, the model in equation (2) specifies that *E*(*Y*_{i}-*Y*_{j})=*r*, regardless of how closely related *i* and *j* are, whereas, for example, if *i* and *j* were MZ twins, then, in reality, *Y*_{i}-*Y*_{j}=0, and, more generally, the expected difference in *Y* values between two individuals depends on their relationship as well as on their phenotypes.

For the *M*_{QLS}, we develop a new mean model that is a function of the relationships and the phenotypes of all *n*+*m* individuals and is as follows: *E*(*Y*|*A*)=μ=(μ_{1},…μ_{i},…μ_{n})^{T}, with μ=*p*1+*r*Φ_{N,NM}*A*—that is,

where we constrain 0<*p*+*r*(Φ_{N,NM}*A*)_{i}<1 for *i*=1,…,*n**.* For a better understanding of this mean model, let us consider, for example, a case-control study in which all of the individuals in the study are outbred. If a sampled individual *i* has no relatives in the study or if the phenotypes of all of *i*’s relatives are unknown, then, under the model of equation (4), *i*’s expected allele frequency is μ_{i}=*p* if *i* is of unknown phenotype, μ_{i}=*p*+*r* if *i* is affected, and

if *i* is unaffected. For each affected sibling *i* has, 1/2*r* is added to the baseline value of μ_{i}*,* whereas, for each unaffected sibling,

is added. For example, if *i* is affected and has three affected siblings and no other relatives of known phenotype in the study, then μ_{i}=*p*+ 5/2*r*. Note that, under model (4), MZ twins would always have the same expected allele frequency, even if one is a case and the other is a control.

The theoretical justification for this new mean model is as follows: assume, for the moment, that the trait is caused by the marker according to an arbitrary two-allele (single-locus) disease model, in which we allow environmental effects that do not have familial correlation. This model is specified by the population-allele frequency *p* and the penetrances of the three possible genotypes. Set the constant *k* (used in the calculation of *A*) to be *K*_{p}*,* the population prevalence of the trait. Consider an arbitrary set of possibly related outbred individuals, and calculate the true value, under the model, of the expected allele frequency in individual *i* conditional on all available phenotypes of *i* and *i*’s relatives, μ^{*}_{i}=*E*(*Y*_{i}|*A*), for any individual *i* in the sample. If we consider the ratio

where *a* is any affected individual with no phenotyped relatives, then, as the effect size (or differences among penetrance probabilities) tend to zero, this ratio tends to (see appendix C for the proof). Thus, the mean model in equation (4) is asymptotically the correct one for any two-allele disease model as the effect size goes to zero. Moreover, if the individuals are inbred, then the model is asymptotically correct for an additive or multiplicative, two-allele disease model as the effect size goes to zero (but not for a general, two-allele disease model in the inbred case). Although the assumptions under which this mean model is derived are somewhat simplistic, the model captures the important feature that individuals with affected relatives are likely to be enriched for the predisposing allele relative to individuals without affected relatives.

The *M*_{QLS} statistic given in table 1 is derived as the quasi-likelihood score statistic based on the model in equation (4). This is obtained by substituting Φ_{N,NM}*A* in place of 1_{c} in equation (3). The resulting formula for *M*_{QLS} is

where α=*A*_{N}+Φ^{-1}Φ_{N,M}*A*_{M},

, , and . Following the same reasoning^{3} as for the *W*_{QLS}, the *M*_{QLS} has maximal noncentrality parameter, against the alternative specified in equation (4), among a general class of linear statistics of the form , where *S*=*V*^{T}*Y* and *V*≠0 where **0** is a column vector of 0’s of length *n.* Note that *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS} are all of this form. As a result, under suitable regularity conditions, which are not discussed here, the *M*_{QLS} would be asymptotically locally most powerful against the alternative specified in equation (4). Simulation studies are undertaken in the “Results” section, to assess the usefulness of this test for complex traits, where two-allele models are not expected to hold.

To use the *M*_{QLS} test, a value for the constant *k* must be specified. We emphasize that the test will be valid for any value of *k* satisfying 0<*k*<1. The value of *k* affects the power of the test, and, under a two-allele model, for outbred individuals, optimal power is attained when *k* is the population prevalence of the trait. For complex traits, we recommend setting *k* equal to an estimate of the population prevalence from previous studies or registry data from the population. In the “Results” section, we demonstrate, through simulation, that power is in fact quite robust to misspecification of *k.*

*M*_{QLS} Power-Improvement Diagnostic

We propose a method that uses only pedigree information and phenotype data (without genotype data) to determine whether the analyses using *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS} would be expected to give similar or dissimilar results. (If they are expected to give dissimilar results, then, in most cases, the *M*_{QLS} is expected to have higher power.) One possible advantage of applying this diagnostic could be to avoid having to correct association *P* values for having performed multiple analyses unnecessarily, in the case when the results are predicted to be similar on the basis of the diagnostic. Because the diagnostic is based only on phenotype and pedigree information, and not on genotype data, such an approach does not create any bias in the results.

The idea is to calculate the weights assigned to the observations for each of the statistics and compare them. For the *W*_{χ2corr}, the weight of an observation depends only on the individual’s case-control status. In contrast, the weights can vary among cases and among controls in *W*_{QLS} and *M*_{QLS}, depending on the relationship configurations, as well as on the phenotypes of relatives. The total weights vary slightly from locus to locus, depending on missing data patterns. However, on the basis of the study design, weights for the genotyped individuals could be computed to get an idea of how different the analyses are likely to be under *W*_{QLS}, *M*_{QLS}, and *W*_{χ2corr}. Details of the power improvement diagnostic are given in appendix D, where we find that, to compare the *W*_{χ2corr} weights with the *M*_{QLS} weights, we need only calculate the coefficients of variation of total *M*_{QLS} weights among cases and total *M*_{QLS} weights among controls. If these values are close to zero, then the *M*_{QLS} test results should be similar to the *W*_{χ2corr} test results. The same principle holds for comparison of *W*_{χ2corr} weights with *W*_{QLS} weights. Large differences in association-testing results between the *W*_{χ2corr} and either the *M*_{QLS} or the *W*_{QLS} tests would be expected to occur when one or both of the relevant coefficients of variation are far from zero. Our experience, in the context of our simulations, is that a coefficient of variation of ~1 or more in absolute value for the cases and/or controls will result in an improvement in power for the *M*_{QLS} over the *W*_{χ2corr}.

### Use of Prior versus Posterior Kinship Coefficients

Kinship coefficients (and, for an inbred population, inbreeding coefficients) are used in two different ways in the construction of the four statistics: *W*_{χ2corr}, *W*_{SS}, *W*_{QLS}, and *M*_{QLS}. First, kinship coefficients can be used in the calculation of the total weight vector, *V,* for each statistic (as they are in *W*_{QLS} and *M*_{QLS} but not in *W*_{χ2corr} and *W*_{SS}). Second, they are used in the calculation of the variance term for each statistic. The question arises^{3}^{,}^{5} as to whether it is preferable to use the unconditional (i.e., prior) kinship coefficients,^{3} given by Φ, or the posterior kinship coefficients, Φ_{posterior}, which are the conditional probabilities of IBD sharing given both the observed genotype data and the pedigree information,^{4} keeping in mind that Φ is vastly easier to compute than Φ_{posterior}. In fact, we generally caution against the use of Φ_{posterior} in the calculation of *V* for *W*_{QLS} and *M*_{QLS}. To understand why, first consider the simpler problem of allele-frequency estimation from a sample of related individuals. The BLUE for that problem^{8} is the same as

When Φ_{posterior} is substituted for Φ in the formula for , the resulting estimator is, in general, biased and even inconsistent as *n*→∞ when there is partial IBD information. This occurs because the random variables Φ_{posterior} and *Y* are, in general, dependent. (Note that, in principle, these difficulties could be avoided in certain situations when Φ_{posterior} is estimated in such a way that *E*(*Y*|Φ_{posterior})=*EY*; for example, if the true IBD sharing is known or if it is estimated from markers not in linkage disequilibrium [LD] with the marker of interest.) From these considerations, it seems clear that, if Φ_{posterior} is used in the calculation of *V* for the *W*_{QLS} and *M*_{QLS}, the resulting test statistic could be badly behaved. This might explain the difficulties encountered in previous work,^{5} which considered the use of Φ_{posterior} in the calculation of *V* for a haplotype-based method with a weighting scheme similar to that of the *W*_{QLS}.

There remains the possibility of using Φ_{posterior} in the calculation of the variance term. Intuitively, this would be expected to correct the statistic for the presence of linkage.^{5} In a context in which one would be willing to detect either linkage or association or a combination of the two, then the use of Φ_{posterior} instead of Φ in the variance calculation would be expected to result in lower power, as well as more onerous computation. However, in a context in which linkage has already been established and one wishes to correct for it in testing for association, the use of Φ_{posterior} instead of Φ in the variance calculation would be expected to result in better control of type I error. In the “^{Assessment of Use of Φ versus Φposterior in Variance Calculations}” section, we describe simulations to test the validity of this intuition.

To use Φ_{posterior} in the variance calculation for *W*_{χ2corr}, one need only replace Φ with Φ_{posterior} in the variance formula in table 1. We call the resulting statistic *W*^{COND}_{χ2}, where “COND” refers to conditional kinship coefficients. To calculate *W*^{COND}_{QLS} and *M*^{COND}_{QLS}, we replace the corresponding variance formulas in table 1 with

where *w*_{test} is defined differently for *W*_{QLS} and *M*_{QLS}, with the definitions given in table 1. Software such as Merlin^{9} can be used to calculate Φ_{posterior} in small-to-moderate–sized pedigrees.

### GAW 14 COGA Data

We analyze a COGA data set^{6} that was previously analyzed in the Genetics Analysis Workshop (GAW) 14. There are a total of 1,614 individuals from 143 pedigrees, with each pedigree containing at least three affected individuals. We include in our analysis only those individuals who are coded as “white, non-Hispanic.” We designate as cases those individuals who are affected with ALDX1 or who have symptoms of ALDX1, where ALDX1 is defined to be DSM-III-R alcohol dependence with the Feighner Alc Definite phenotype. By these criteria, there are 830 cases with available SNP data. We designate as “unaffected controls” those individuals who are labeled as “pure unaffected,” and we designate as “controls of unknown phenotype” those individuals who are labeled as “never drank alcohol.” Among individuals with available SNP data, these criteria result in 187 unaffected controls and 13 unknown controls. Note that the *M*_{QLS} makes a distinction between the two control types, whereas the *W*_{χ2corr} and *W*_{QLS} do not. The data set includes 10,810 autosomal SNPs. We exclude 403 SNPs that are not polymorphic (minor-allele frequency <0.01). We analyze the remaining 10,407 SNPs using the *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS} tests. We could not use the *W*_{SS} software package to analyze these data because, at the time of our analysis, to the best of our knowledge, there was no implementation available that would handle the situation in which controls are related to cases (although, in principle, it could be extended to that situation).

## Results

### Simulation Studies

We perform simulation studies to (1) assess the type I error of the *M*_{QLS}; (2) compare power of the *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS}; (3) assess the practical impact of the use of Φ versus Φ_{posterior} in the variance calculations for the statistics; and (4) assess the robustness of power of *M*_{QLS} to the choice of the parameter *k.* For each simulation described below, 5,000 replicates were performed.

We consider three different study designs. In the first, affected and unaffected individuals from 60 outbred, three-generation pedigrees are sampled. Each pedigree has a total of 16 individuals, related as in figure 1, with the pattern of affected and unaffected individuals varying randomly according to one of the trait models described in the next paragraph. Pedigrees are sampled conditional on obtaining exactly 20 pedigrees with 4 affected individuals, 20 with 5, and 20 with 6. In each sampled pedigree, phenotypes for all 16 individuals are observed. For each individual in a sampled pedigree, the individual’s genotypes are observed if and only if at least 30% of the individual’s siblings, parents, and offspring in the sampled pedigree are affected. The second study design is similar to the first, with two differences: (1) an additional 200 unrelated, unaffected controls are included in the study; and (2) for each individual in a sampled pedigree, the individual’s genotypes are observed if and only if at least half of the individual’s siblings, parents, and offspring in the sampled pedigree are affected. In the third study design, individuals from three extended pedigrees are sampled, with each pedigree consisting of 154 individuals over five generations. The pedigrees are sampled conditional on having at least 50 affected individuals. In each sampled pedigree, phenotypes for all 154 individuals are observed. For each individual in a sampled pedigree, the individual’s genotypes are observed if and only if at least half of the individual’s siblings, parents, and offspring in the sampled pedigree are affected.

**...**

We consider four different classes of multigene trait models.^{10} Model I has two unlinked causal SNPs, with epistasis between them and both of them acting dominantly. In model I, the frequencies of allele 1 at SNPs 1 and 2 are *p*_{1} and *p*_{2}, respectively. Individuals with at least one copy of allele 1 at SNP 1 and at least one copy of allele 1 at SNP 2 have a penetrance of *f*_{1}. All other individuals have a penetrance of *f*_{2}<*f*_{1}. We consider three different parameter settings for model I, which are listed as models I-a, I-b, and I-c in table 2. Model II also consists of two unlinked causal SNPs with epistasis between them, with SNP 1 acting recessively and SNP 2 following a general two-allele model. There are four penetrance parameters for this model, with *f*_{1}>*f*_{2}>*f*_{3}>*f*_{4}. Individuals with two copies of allele 1 at SNP 1 and two copies of allele 1 at SNP 2 have a penetrance of *f*_{1}. Individuals with two copies of allele 1 at SNP 1 and one copy of allele 1 at SNP 2 have a penetrance of *f*_{2}. Individuals with two copies of allele 1 at SNP 1 and no copies of allele 1 at SNP 2 have a penetrance of *f*_{3}. All other individuals have a penetrance of *f*_{4}. We consider one parameter setting for this class of model, which is listed as model II-a in table 2. Model III has three unlinked causal SNPs with epistasis between them and with each SNP acting dominantly. Individuals with at least one copy of allele 1 at SNP 1 and at least one copy of allele 1 at SNP 2 and/or SNP 3 have a penetrance of *f*_{1}. All other individuals have a penetrance of *f*_{2}<*f*_{1}. We consider two different parameter settings for this class of model, which are listed as models III-a and III-b in table 2. Model V is the same as model I except that, in model V, the two causal SNPs are tightly linked and in linkage equilibrium, whereas, in model I, the two causal SNPs are unlinked and in linkage equilibrium. We consider one parameter setting for model V, which is listed as model V-a in table 2. In addition to the multigene models, we also consider a single-gene dominant model, model IV, in which individuals with at least one copy of allele 1 at SNP 1 have a penetrance of *f*_{1} and all other individuals have a penetrance of *f*_{2}<*f*_{1}. We consider one parameter setting for model IV, which is listed as model IV-a in table 2 and which represents a rare dominant trait that is almost fully penetrant. In addition to the allele frequencies and penetrance parameters for each model, table 2 contains the resulting population prevalence *K*_{p}, prevalence *K*_{s} conditioned on having an affected sibling, and the sibling risk ratio

these last 3 are calculated in outbreds. A broad range of models were chosen for our simulation studies: from highly penetrant disease models to disease models with low penetrance and models with high heritability to models with low heritability.

### Assessment of Type I Error of *M*_{QLS}

Previous simulation studies^{3} verified that the use of the χ^{2} approximations to the null distributions of *W*_{χ2corr} and *W*_{QLS} give the appropriate type I error for the tests. We perform a similar verification for the *M*_{QLS} by simulating under the null hypothesis of no association and no linkage. We compare the proportion of simulations in which the statistic exceeds the (1-α)th quantile of the χ^{2} distribution to the nominal type I error level α, for α=.01 and .05. Simulations are performed on the basis of the second study design, which consists of individuals from 60 moderate-sized pedigrees plus 200 unrelated unaffected individuals. The phenotype is simulated from model I-a (table 2). We test at an unlinked, unassociated SNP with three different allele-frequency settings, which are given in table 3.

Table 3 gives the empirical type I error of the *M*_{QLS}, estimated from 5,000 simulations, for nominal levels .05 and .01. For each simulation scenario, the empirical type I error is not significantly different from the nominal. These results verify that the use of the χ^{2} approximation results in an accurate assessment of significance for the *M*_{QLS}.

### Power Comparison of *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS}

To compare the power of *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS}, we perform simulations on the basis of the second study design, which consists of individuals from 60 moderate-sized pedigrees plus 200 unrelated unaffected individuals. Five thousand replicates from each of models I-a, I-b, I-c, II-a, III-a, and III-b are simulated. The test is performed at SNP 2 for each model, with the significance threshold set to .05.

Estimated power with the SE for the *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS} tests is given in figure 2. Recall that the numbers of cases and controls in each replicate are randomly determined. The average number of cases for a given simulation setting has a range of 70.2–121.6, and the average number of controls has a range of 390.2–417.4. The average coefficient of variation for the total weights of cases in the *M*_{QLS} has a range of 0.3–0.4, and, for controls, it has a range from −2.6 to −1.6. Because the average coefficient of variation of total weights of controls is >1 in absolute value for the *M*_{QLS} for every model considered, we expect an improvement in power of the *M*_{QLS} over the *W*_{χ2corr}. As shown in figure 2, the *M*_{QLS} is more powerful than both the *W*_{χ2corr} and *W*_{QLS} in our simulation studies. The increase in power for the *M*_{QLS} is substantial (a difference in power of at least 0.20) for models I-a, I-b, and III-a. Our theoretical results indicate that the *M*_{QLS} should be powerful for two-allele disease models as effect size tends to zero. Our simulation studies indicate that, in fact, the *M*_{QLS} performs well for a range of more-complex disease models.

*W*

_{QLS},

*W*

_{χ2corr}, and

*M*

_{QLS}tests, on the basis of 5,000 simulated realizations for each of six different models.

In the “^{Development and Justification of the MQLS Test}” section, we note that an example in which the *W*_{QLS} has certain theoretical optimality properties is the case of a rare, fully penetrant dominant trait. To demonstrate by simulation that there can be cases in which the *W*_{QLS} has higher power than the other statistics, we perform simulations that are based on the first study design (60 moderate-sized pedigrees) with the phenotype simulated from model IV-a, which approximates a rare, fully penetrant dominant. We perform the tests at the causal SNP and also at a tightly linked SNP that has allele frequency 0.5 and is associated with the causal SNP, with *D*^{′}=.2. When the causal SNP was tested, all three tests had power close to 1 (results not shown). Table 4 compares the power of the three tests when they are performed at the tightly linked SNP that has allele frequency 0.5 and is associated with the causal SNP with *D*^{′}=.2. As expected, *W*_{QLS} has higher power than the other statistics in the case of a rare, fully penetrant dominant. This is because the conditional expected frequency of the allele in an individual given the phenotype information on everyone depends on only whether the individual is affected or unaffected, so the model on which the *W*_{QLS} is based holds.

### Assessment of Use of Φ versus Φ_{posterior} in Variance Calculations

We assess by simulation two predicted consequences of the use of Φ_{posterior} in place of Φ in the variance calculations of the statistics. To avoid the extra burden of calculating Φ_{posterior}, we instead use the true IBD-sharing information in place of Φ_{posterior}. This corresponds to the best-case scenario for the use of Φ_{posterior}, in which the markers provide complete IBD information.

The first predicted consequence is that use of Φ_{posterior} in place of Φ in the variance calculations would result in lower power in a context in which one would be willing to detect either linkage or association or a combination of the two; that is, when the null hypothesis is no association and no linkage. To test this, we simulate on the basis of the first (60 moderate-sized pedigrees) and third (three extended pedigrees) study designs, with the phenotype simulated from model V-a. We perform the tests at SNP 2, which is both linked and associated with the phenotype.

The second predicted consequence is that use of Φ_{posterior} would give better control of type I error in the context in which linkage has already been established and one wishes to correct for it in testing for association; that is, when the null hypothesis is no association. To test this, we use the same simulation scenario as for the first predicted consequence except that, instead of testing at SNP 2, which is both linked and associated with the phenotype, we tested at SNP 3, with allele frequency 0.5, which is tightly linked to both SNPs 1 and 2 but is not associated with either of them.

Table 5 demonstrates that, for moderate-size pedigrees, there is almost no difference between the use of Φ and Φ_{posterior} in the variance calculations for the statistics. This lack of difference between the two approaches holds for a SNP that is linked but not associated, as well as for a SNP that is both linked and associated. This is good news, because the calculation of Φ is vastly simpler than that of Φ_{posterior}.

_{posterior}in Variance Calculations, for Simulations under Model V-a, When Testing at a Tightly Linked Marker, Based on 5,000 Simulated Replicates

^{[Note]}

For extended pedigrees, on the other hand, table 5 demonstrates that our predictions were correct—namely, that use of Φ_{posterior} in place of Φ in the variance calculations results in (1) lower power to test the joint null hypothesis of no association and no linkage and (2) better control of type I error to test the null hypothesis of no association in the presence of linkage. This is because, when Φ is used in the variance calculation, linkage is allowed to contribute to the signal, whereas, when Φ_{posterior} is used in the variance calculation, linkage is not allowed to contribute to the signal. In extended pedigrees, the calculation of Φ_{posterior} can present substantial difficulties. Particularly in the context of whole-genome association, use of Φ_{posterior} might be unfeasible in extended pedigrees, and, as we have seen, it makes almost no difference in moderate-size pedigrees. Therefore, as a practical matter, it seems to make sense to use Φ, with the understanding that, in extended pedigrees, this provides a test of the joint null hypothesis of no association and no linkage.

### Robustness of Power of *M*_{QLS} to Choice of *k*

The *M*_{QLS} test is valid for any choice of the parameter *k.* In outbreds, when *k* is equal to the population prevalence of the disease, we have argued that the *M*_{QLS} is asymptotically locally most powerful for all two-allele disease models as the effect size tends to zero. In reality, the trait will usually be complex, and the prevalence will be estimated. To see how the power of the *M*_{QLS} test is affected by different choices of *k,* we perform a simulation study on the basis of the second study design, with the phenotype simulated from model III-a. For model III-a, the true population prevalence is *K*_{p}=.078. We perform the *M*_{QLS} test with different settings of the parameter *k,* which are given in table 6.

Table 6 gives power results for the *M*_{QLS} test for values for *k* ranging from one-quarter of the true value to 6 times the true value. In this case, choosing *k* to be within a factor of 3 or 4 of the population prevalence appears to give high power, suggesting that the procedure is quite robust to choice of *k.*

### GAW 14 COGA Data

The National Institute on Alcohol Abuse and Alcoholism has estimated^{11} that the prevalence of alcohol dependence in the United States is ~5%. For the *M*_{QLS}, we accordingly set *k*=0.05 in the analysis. The average coefficient of variation of the weights given by the *W*_{QLS} for the cases is 1.568 and for the controls is −0.655. For the *M*_{QLS}, they are 2.510 for the cases and −0.668 for the controls. The fact that the coefficients of variation for the cases are >1 for both the *M*_{QLS} and *W*_{QLS} suggests that these tests have the potential to give different results from those given by the *W*_{χ2corr} test, and, in particular, we might expect that there is some advantage to be gained by applying the *M*_{QLS}. Table 7 gives the results of the analyses for those SNPs for which at least one of the tests has a nominal *P*<4.0×10^{-5}. For 13 of these 15 SNPs, the *M*_{QLS} test has the smallest *P* value among the three tests used. After Bonferroni correction to adjust for three different tests of association at each of 10,407 SNPs, the *M*_{QLS} test is significant at the 5% level for four SNPs: *tsc1177811* on chromosome 1 (*P*=5.9×10^{-7} uncorrected; .018 corrected), *tsc1750530* on chromosome 16 (*P*=4.0×10^{-7} uncorrected; .012 corrected), *tsc0046696* on chromosome 18 (*P*=4.7×10^{-7} uncorrected; .015 corrected), and *tsc0057290* on chromosome 18 (*P*=5.2×10^{-7} uncorrected; .016 corrected). Note that the two significant SNPs on chromosome 18 are 71 cM apart. The *W*_{χ2corr} test is significant at the 5% level for an additional SNP, *tsc0571038* on chromosome 11 (*P*=6.2×10^{-7} uncorrected; .019 corrected).

Three of the five significant SNPs in table 7 are near genes: *tsc1750530* is 3 kb from the gene encoding HEAT repeat containing 3 (*HEATR3* [^{Affymetrix}^{; }^{EntrezGene}]), located at 16q12.1. The same SNP is also 25 kb from the gene encoding transmembrane protein 188 (*TMEM188* [^{Affymetrix}^{; }^{EntrezGene}]), located at 16q12.1. SNP *tsc0046696* is 238 kb from the gene encoding F-box protein 15 (*FBXO15* [MIM 609093; ^{Affymetrix}^{; }^{EntrezGene}]), located at 18q22.3. The other significant SNP on chromosome 18, *tsc0057290,* is 411 kb from the gene encoding VAMP (vesicle-associated membrane protein)–associated protein A (*VAPA* [MIM 605703; ^{Affymetrix}^{; }^{EntrezGene}]), 33 kDa, located at 18p11.22. To our knowledge, none of these genes are obvious candidates. The two significant SNPs that are not in close proximity to any known genes, *tsc1177811* and *tsc0571038,* are located at 1p31.1 and 11q21, respectively (^{Affymetrix}). Among the other SNPs in the table, two are located in or near genes of potential interest: (1) *tsc1687605* is in the 3′ UTR of the gene that encodes cytochrome P450, family 2, subfamily C, polypeptide 18 (*CYP2C18* [MIM 601131; ^{Affymetrix}^{; }^{EntrezGene}]) located at 10q24—*CYP2C18* is a member of the CYP2C subfamily of P450 enzymes that is involved with drug metabolism (^{EntrezGene}^{; }^{OMIM}); and (2) SNP *tsc0768481* is in the same cytogenetic region (13q14-q21) as the gene that encodes 5-hydroxytryptamine receptor 2A (*HTR2A* [MIM 182135; ^{Affymetrix}^{; }^{EntrezGene}]). There is evidence that *HTR2A* is associated with alcohol dependence.^{12}

Three previous analyses^{13}^{–}^{15} of these data used the ALDX1 phenotype and performed family-based association tests using FBAT.^{16} When only white individuals were analyzed, no SNPs were significant at the 5% level with use of FBAT with Bonferroni correction.^{13} When all individuals were analyzed, three SNPs were significant after Bonferroni correction,^{14} only one of which (*tsc1750530* on chromosome 16) is in the set of five SNPs we detect. These SNPs and their *P* values were reported^{14} as *tsc0515272* on chromosome 3 (*P*=3.8×10^{-7} uncorrected), *tsc0029429* on chromosome 9 (*P*=2.0×10^{-8} uncorrected), and *tsc1750530* on chromosome 16 (*P*=4.5×10^{-7} uncorrected). False-discovery rates for these SNPs are reported^{15} as .0270, .0019, and .0094, respectively. The corresponding uncorrected *P* values for these SNPs by the *M*_{QLS} are .058, .061, and 4.0×10^{-7}, respectively. Note that the *M*_{QLS} detected four significant SNPs (and the *W*_{χ2corr} detected a fifth significant SNP) with a smaller sample than that used by the FBAT to detect three significant SNPs. This indicates that the *M*_{QLS} is a powerful test that provides additional results complementary to those provided by FBAT.

Analysis of 10,407 SNPs with three tests (*M*_{QLS}, *W*_{QLS}, and *W*_{χ2corr}) took ~35 minutes on a Pentium 4 3-GHz machine with 1 GB RAM. The calculations should scale linearly with the number of SNPs. We have not made serious attempts to optimize the code, so this time could presumably be improved. The slow step is the Cholesky decomposition^{7} of Φ, which would need to be performed only once if, for every SNP, the same individuals had missing genotype data. However, this is generally not the case, so, in our implementation, the Cholesky decomposition is recomputed for every SNP.

## Discussion

Despite major advances in high-density genome scans, disappointing results in the mapping of many common diseases illustrate the need for more-powerful methods for detection of susceptibility loci. We specifically address the problem of genomewide association analysis of binary traits when some individuals in the sample are related with known kinship. This arises naturally, for instance, when families sampled for a linkage study are included in an association study. This can be desirable, because it is expected that affected individuals from multiplex families would have a higher expected frequency of the alleles that increase susceptibility for a genetic trait than would affected individuals who do not have affected relatives. As a result, the power to detect association is expected to increase when affected individuals from multiplex families are included in a study. However, analysis of such data presents statistical and computational challenges.

We have developed a new test, the *M*_{QLS}, which is applicable to association studies with completely general combinations of family and case-control designs. For instance, the *M*_{QLS} allows cases to be related to controls, and it is equally applicable to complex inbred pedigrees and to simpler study designs consisting of unrelated individuals and small outbred families. The *M*_{QLS} distinguishes between unaffected controls and controls of unknown phenotype and can incorporate both into the same analysis. Furthermore, it makes use of phenotype data about relatives who have missing genotype data at a given SNP, where this information is used to optimize the weights given to relatives with nonmissing data at the SNP. We also extend the test to multiallelic markers. Our method is computationally feasible to use for genomewide association studies with hundreds of thousands or millions of SNPs. Our simulations indicate that the *M*_{QLS} represents an overall—and, in many cases, substantial—improvement in power over competing methods for a broad range of multigene trait models, while controlling type I error. In a reanalysis of the GAW 14 COGA data, the *M*_{QLS} detected four SNPs with genomewide-significant association to alcoholism, three of which had not been identified as significant in previous analyses.

We suggest a simple diagnostic, based on only phenotype information, that determines whether the *M*_{QLS}, *W*_{QLS}, and *W*_{χ2corr} would be expected to give different results. This can allow one to avoid correcting for use of three different tests in situations in which they are expected to give similar results. In our simulation studies, when the diagnostic indicated that the tests would give different results, the *M*_{QLS} was generally the most powerful. In the GAW 14 COGA data, the diagnostic indicated that the tests would give different results, and, indeed, this was the case with the *M*_{QLS}, *W*_{QLS}, and *W*_{χ2corr} identifying 4, 0, and 1 significant markers, respectively, where the marker identified by *W*_{χ2corr} was not among those identified by *M*_{QLS}. (In this case, the Bonferroni correction took into account the three different tests as well as the number of SNPs tested.) In our simulations, a diagnostic result of >1 in absolute value in either cases or controls corresponded to a noticeable power difference between the tests, but, with larger sample sizes, a smaller diagnostic result might still correspond to a substantial power difference.

We have developed a modified version of the CC-QLS software program^{3} that outputs the results of our new *M*_{QLS} test for each SNP, as well as the results of the previously proposed^{3} *W*_{χ2corr} and *W*_{QLS} tests. The source code will be available (see M.S.M.'s Web page).

In the simulations and data analysis, we focus on inclusion of small-to-moderate–size outbred families in case-control association studies. However, it is important to note that the *M*_{QLS} is equally applicable to case-control association testing in founder populations, provided that the genealogy is known. Founder populations—for example, the Tasmanian population^{17} and the Hutterites^{18}—are of interest for the mapping of complex traits for various reasons, including (1) avoidance of the problems of unknown population substructure and (2) the expectation that there would be fewer risk alleles involved in complex disorders in founder populations than in diverse continental populations. The *M*_{QLS} is computationally feasible, even in a founder population as complicated as the Hutterites, among whom many of the individuals are related through multiple lines of descent and exact likelihood calculation is not feasible.^{18}

We have examined the question of whether to use prior or posterior kinship coefficients in calculating the weights for *W*_{QLS} and *M*_{QLS} and the variances for *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS}. We recommend that prior kinship coefficients always be used in calculating the weights; otherwise, the theoretical justification for the statistics might not hold, and they could be badly behaved. In calculating the variance, we found no difference in the results obtained for small-to-moderate–size pedigrees with the two different types of kinship coefficients. Therefore, we recommend prior kinship coefficients for that calculation also, because they are much faster and simpler to compute. For large pedigrees, posterior kinship coefficients are unfeasible to obtain exactly, so it is somewhat academic to debate which is better. Nevertheless, on the basis of our simulations, we can say that, for a design consisting of a small number of large pedigrees, if one is willing to detect a signal that is driven by a combination of linkage and association, then one should obtain higher power with prior kinship coefficients, whereas, if one wants to correct for a known linkage signal to obtain a pure association test, then better type I error properties would be obtained with posterior kinship coefficients. (It is reasonable that these differences should disappear with multiple small-to-moderate–size pedigrees, because, if there is linkage but no association, then different alleles would tend to be associated with the trait in different pedigrees.)

Use of the *M*_{QLS} requires specification of a constant *k* in the test statistic. We emphasize that the test is valid for any value of *k.* To optimize power, we recommend that *k* be set to the best available estimate of the population prevalence of the trait. Our simulation studies suggest that the power of the test is very robust to the choice of *k.* When *k* was misspecified within a factor of 3 or 4 of the true prevalence, there was little or no loss of power in our simulations.

## Acknowledgments

We thank Dr. Mark Abney, for discussion and critical comments; Wataru Yoshimura from Affymetrix Application Support, for assistance with the ^{Affymetrix} (NetAffx) Web site; and two anonymous reviewers, for helpful comments. This study was supported in part by a David and Lucile Packard Fellowship (to T.T.) and by National Institutes of Health grants HG001645 and HL084715. Data were provided by the COGA (U10AA008401). We gratefully acknowledge COGA and thank Dr. Ray Crowe of the University of Iowa and Dr. Jean MacCluer of the Southwest Foundation for Biomedical Research for their help in obtaining permission to analyze the COGA data.

## Appendix A: Extension of *M*_{QLS} to Multiallelic Case

We extend the *M*_{QLS} procedure to test for association between a trait and a multiallelic marker. Extensions of the *W*_{χ2corr} and *W*_{QLS} tests to multiallelic markers have been given elsewhere.^{3} Suppose there are *a* allelic types at the marker, and let be an [(*a*-1)*n*] vector, where *Y*_{i}=(*Y*_{i1},…,*Y*_{in}) and *Y*_{ij}= 1/2× (the number of alleles of type *i* that individual *j* has). Let *p*=(*p*_{1},…,*p*_{a-1})^{T} denote the allele-frequency distribution at the marker in the general population, where *p*_{i}>0 is the frequency of allelic type *i,* and 1^{T}*p*<1. Define *r*=(*r*_{1},…,*r*_{a-1})^{T} to be the (*a*-1) vector of expected changes in allele frequencies for a case randomly sampled from the population. Then the mean model for the *M*_{QLS} in the multiallelic case is *EY*=μ=*p*1+*r*(Φ_{N,NM}*A*), where is the Kronecker product (see, e.g., the work of Schott^{7}^{[p253]}), and **1** is a vector of 1s of length *n*; that is

where we constrain

for all 1*i**a*-1, 1*j**n*. Under the null hypothesis of no association between the marker and the trait, we have *r*=0, where **0** is a zero vector of length (*a*-1). Let *F* denote an (*a*-1)(*a*-1) matrix with (*i*,*j*)th entry *F*_{ij}= 1/2*p*_{i}(1-*p*_{i}) if *i*=*j* and *F*_{ij}=- 1/2*p*_{i}*p*_{j} if *i*≠*j**.* Note that, under the null hypothesis of no association and no linkage, and when the pedigree founders are drawn from a population in HWE under the null hypothesis, *Var*(*Y*)=*F*Φ. The *M*_{QLS} test statistic for the multiallelic case is

where α and Γ are as defined for equations (5) and (6) and is the (*i*,*k*) entry of *F*^{-1} evaluated at , where is the maximum quasi-likelihood estimate of *p* when *r*=0, or equivalently, the BLUE of *p* based on everyone, which previous work^{8} has shown to be for each *i.* Under the null hypothesis, the *M*_{QLS} statistic follows a χ^{2} distribution with *a*-1 df (asymptotically, under regularity conditions).

## Appendix B: Connection between MZ Twins and Invertibility of Φ

We prove that Φ is invertible if and only if the set *N* does not include both members of any MZ twin pair. Because Σ is a covariance matrix, it must be symmetric and positive semidefinite and is invertible if and only if it is positive definite. Φ inherits these properties. Note that Σ and hence Φ is positive definite if and only if there is no linear combination *c*^{T}*Y*, *c**R*^{n}{0}, such that *Pr*(*c*^{T}*Y*=0)=1. This is because *Var*(*c*^{T}*Y*)=*c*^{T}Σ*c*, and *Var*(*c*^{T}*Y*)=0 if and only if *Pr*(*c*^{T}*Y*=0)=1. If *N* includes *i* and *j* who are MZ twins, then *Pr*(*Y*_{i}-*Y*_{j}=0)=1, and Φ is not invertible. Suppose Φ is not invertible. Then there must be some individual *i* in *N* such that *Y*_{i} can be written as a linear combination of *Y*_{-i}, *Pr*(*Y*_{i}=*d*^{T}*Y*_{-i})=1, *d**R*^{n-1}, where *Y*_{-i} is *Y* excluding the *i*th element. Consider the situation in which every individual in *N*{*i*} is heterozygous at the binary marker, which has positive probability for 0<*p*<1. Then, if *i* is not MZ twin to anyone in *N*{*i*}, it is possible for *i* to have any genotype, and so it cannot be true that *Pr*(*Y*_{i}=*d*^{T}*Y*_{-i})=1. Thus, *Pr*(*Y*_{i}=*d*^{T}*Y*_{-i})=1 implies that *i* has an MZ twin in *N*{*i*}.

## Appendix C: Derivation of *M*_{QLS} Mean Model

We show that, under a two-allele disease model, for an arbitrary set of possibly related outbred individuals, the ratio

(given in the “^{Development and Justification of the MQLS Test}” section) tends to

as the effect size (or differences among penetrance probabilities) tend to zero. Throughout, we condition on the pattern of missing phenotype information. Consider a two-allele disease model with penetrance probabilities *k,* *k*-*c*_{2}, and *k*-*c*_{3} for individuals who have 2, 1, or 0 alleles of type 1, respectively, where *k*-1*c*_{2}*k* and *k*-1*c*_{3}*k*, with at least one of *c*_{2} and *c*_{3} nonzero. (Note that, as *c*_{2} and *c*_{3} tend to zero, the population prevalence *K*_{p} will tend to *k.*) Under this model, we have

Furthermore, we can express

where *A* is the entire phenotype vector, *Y*_{-i} denotes (*Y*_{1},…*Y*_{i-1},*Y*_{i+1}…,*Y*_{n+m}), *Y*_{-1} denotes (*Y*_{2},…,*Y*_{n+m}), and *Y*_{-(n+m)} denotes (*Y*_{1},…,*Y*_{n+m-1}), where *Y*_{n+1},…*Y*_{n+m} denote the true (unobserved) genotype values for individuals *n*+1,…,*n*+*m*. As *c*_{2} and *c*_{3} approach 0, we have the expansions

and

where *n*_{c} is the number of affected, *n*_{u} is the number of unaffected, *n*+*m*-*n*_{c}-*n*_{u} is the number of individuals of unknown phenotype in the study, *n*_{v,c} is the number of affected individuals having a *Y* value of *v* (regardless of whether *Y* is actually observed or is missing in the study), and, similarly, *n*_{v,u} is the number of unaffected individuals having a *Y* value of *v*. Note that, for example,

so

where *P*(*Y*_{j}=.5|*Y*_{i}=*s*) can be calculated in terms of the relationship between the pair of individuals (*i,j*), without having to consider multiple individuals jointly. Applying a similar argument to the other terms in equation (C2) and reorganizing terms, we can obtain

Plugging into equation (C1) and noting that, for a pair of outbred individuals (*i,j*),

and

we obtain

Then, plugging in the derived expressions for μ^{*}_{a}-*p* and *P*(*A*), and letting *c*_{2} and *c*_{3} tend to 0, we get

If we let *r* represent the quantity μ^{*}_{a}-*p**,* then this leads to our model .

## Appendix D: Details of *M*_{QLS} Power Improvement Diagnostic

Recall that the test statistics *W*_{χ2corr}, *W*_{QLS}, and *M*_{QLS} each have the form *,* where *S*=*V*^{T}*Y**,* *V*=(*V*_{1},…*V*_{i},…*V*_{n})^{T}, and *V*_{i} can be viewed as the total weight given by the test statistic to individual *i,* with

Here, . Note that, by construction, for each test statistic. To measure the difference in weights between a pair of statistics, say *W*_{χ2corr} and *M*_{QLS}, for case and control individuals, we propose

and

where, for each statistic, the corresponding *B* is a vector of length *n* with *i*th component

which is *V*_{i} normalized by the mean total weight among cases if *i* is a case and is 0 if *i* is a control. Similarly, for each statistic, the corresponding *C* is a vector of length *n* with *i*th component

which is *V*_{i} normalized by the mean total weight among controls if *i* is a control and is 0 if *i* is a case. The definitions of the ψs for any other pair of statistics are analogous. Note that *B*_{χ2corr}=1_{c} and *C*_{χ2corr}=1-1_{c}, so that ψ(*W*_{χ2corr},*M*_{QLS},*case*) reduces to the absolute value of the coefficient of variation of *V*_{MQLS} among cases, and, similarly, ψ(*W*_{χ2corr},*M*_{QLS},*control*), ψ(*W*_{χ2corr},*W*_{QLS},*case*), and ψ(*W*_{χ2corr},*W*_{QLS},*control*) reduce to the absolute values of the coefficients of variation of *V*_{MQLS} among controls, *V*_{WQLS} among cases, and *V*_{WQLS} among controls, respectively, where the coefficient of variation is the SD divided by the mean. In the situation in which there is a group of permutations of the *n*+*m* individuals such that (1) every element of preserves genotyped/missing status; (2) every element of preserves *A*; (3) every element of preserves Φ_{N,NM}; (4) for every pair of genotyped case individuals *i* and *j,* there is a permutation in that maps *i* to *j*; and (5) for every pair of genotyped control individuals *k* and *l,* there is a permutation in that maps *k* to *l,* then the coefficients of variation for the total weights of the cases and for the total weights of the controls are both equal to 0 for the *M*_{QLS} and the *W*_{QLS} statistics, and these two statistics are equivalent to the *W*_{χ2corr} test. For example, these conditions hold, at a marker with no missing genotypes, if the cases are affected sib pairs and the controls are unaffected unrelated individuals.

## Web Resources

The URLs for data presented herein are as follows:

*FBXO15, VAPA, CYP2C18,*and

*HTR2A*)

## References

**American Society of Human Genetics**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (293K) |
- Citation

- A test for linkage and association in general pedigrees: the pedigree disequilibrium test.[Am J Hum Genet. 2000]
*Martin ER, Monks SA, Warren LL, Kaplan NL.**Am J Hum Genet. 2000 Jul; 67(1):146-54. Epub 2000 May 23.* - Efficient study designs for test of genetic association using sibship data and unrelated cases and controls.[Am J Hum Genet. 2006]
*Li M, Boehnke M, Abecasis GR.**Am J Hum Genet. 2006 May; 78(5):778-92. Epub 2006 Mar 20.* - Case-control single-marker and haplotypic association analysis of pedigree data.[Genet Epidemiol. 2005]
*Browning SR, Briley JD, Briley LP, Chandra G, Charnecki JH, Ehm MG, Johansson KA, Jones BJ, Karter AJ, Yarnall DP, et al.**Genet Epidemiol. 2005 Feb; 28(2):110-22.* - The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases I. DNA pooling.[Genome Res. 1998]
*Risch N, Teng J.**Genome Res. 1998 Dec; 8(12):1273-88.* - Linkage and association.[Genet Epidemiol. 1998]
*Elston RC.**Genet Epidemiol. 1998; 15(6):565-76.*

- Endogenous Retrovirus Insertion in the KIT Oncogene Determines White and White spotting in Domestic Cats[G3: Genes|Genomes|Genetics. ]
*David VA, Menotti-Raymond M, Wallace AC, Roelke M, Kehler J, Leighty R, Eizirik E, Hannah SS, Nelson G, Schäffer AA, Connelly CJ, O’Brien SJ, Ryugo DK.**G3: Genes|Genomes|Genetics. 4(10)1881-1891* - Whole-Genome Sequencing of Individuals from a Founder Population Identifies Candidate Genes for Asthma[PLoS ONE. ]
*Campbell CD, Mohajeri K, Malig M, Hormozdiari F, Nelson B, Du G, Patterson KM, Eng C, Torgerson DG, Hu D, Herman C, Chong JX, Ko A, O'Roak BJ, Krumm N, Vives L, Lee C, Roth LA, Rodriguez-Cintron W, Rodriguez-Santana J, Brigino-Buenaventura E, Davis A, Meade K, LeNoir MA, Thyne S, Jackson DJ, Gern JE, Lemanske RF Jr, Shendure J, Abney M, Burchard EG, Ober C, Eichler EE.**PLoS ONE. 9(8)e104396* - Comparison of Methods to Account for Relatedness in Genome-Wide Association Studies with Family-Based Data[PLoS Genetics. ]
*Eu-ahsunthornwattana J, Miller EN, Fakiola M, Wellcome Trust Case Control Consortium 2, Jeronimo SM, Blackwell JM, Cordell HJ.**PLoS Genetics. 10(7)e1004445* - Little evidence for association between the TGFBR1*6A variant and colorectal cancer: a family-based association study on non-syndromic family members from Australia and Spain[BMC Cancer. ]
*Ross JP, Lockett LJ, Tabor B, Saunders IW, Young GP, Macrae F, Blanco I, Capella G, Brown GS, Lockett TJ, Hannan GN.**BMC Cancer. 14475* - Role of Nicotine Dependence in the Association between the Dopamine Receptor Gene DRD3 and Major Depressive Disorder[PLoS ONE. ]
*Korhonen T, Loukola A, Wedenoja J, Nyman E, Latvala A, Broms U, Häppölä A, Paunio T, Schrage AJ, Vink JM, Mbarek H, Boomsma DI, Penninx BW, Pergadia ML, Madden PA, Kaprio J.**PLoS ONE. 9(6)e98199*

- PubMedPubMedPubMed citations for these articles

- Case-Control Association Testing with Related Individuals: A More Powerful Quasi...Case-Control Association Testing with Related Individuals: A More Powerful Quasi-Likelihood Score TestAmerican Journal of Human Genetics. Aug 2007; 81(2)321

Your browsing activity is empty.

Activity recording is turned off.

See more...