# Analysis of oligonucleotide array experiments with repeated measures using mixed models

^{}

^{1}Constance L Wood,

^{1}Thomas V Getchell,

^{2,}

^{4}Marilyn L Getchell,

^{3,}

^{4}and Arnold J Stromberg

^{1}

^{1}Department of Statistics, 815 Patterson Office Tower, University of Kentucky, Lexington, Kentucky 40506-0027, USA

^{2}Department of Physiology, University of Kentucky, 800 Rose Street, Lexington, KY 40536-0298, USA

^{3}Department of Anatomy and Neurobiology, College of Medicine, Univeristy of Kentucky, Lexington, KY 40536-0298, USA

^{4}309 Sanders-Brown Center on Aging, 800 South Limestone Street, University of Kentucky College of Medicine, Lexington, KY 40536-0230, USA

^{}Corresponding author.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

## Abstract

### Background

Two or more factor mixed factorial experiments are becoming increasingly common in microarray data analysis. In this case study, the two factors are presence (Patients with Alzheimer's disease) or absence (Control) of the disease, and brain regions including olfactory bulb (OB) or cerebellum (CER). In the design considered in this manuscript, OB and CER are repeated measurements from the same subject and, hence, are correlated. It is critical to identify sources of variability in the analysis of oligonucleotide array experiments with repeated measures and correlations among data points have to be considered. In addition, multiple testing problems are more complicated in experiments with multi-level treatments or treatment combinations.

### Results

In this study we adopted a linear mixed model to analyze oligonucleotide array experiments with repeated measures. We first construct a generalized F test to select differentially expressed genes. The Benjamini and Hochberg (BH) procedure of controlling false discovery rate (FDR) at 5% was applied to the P values of the generalized F test. For those genes with significant generalized F test, we then categorize them based on whether the interaction terms were significant or not at the *α*-level (*α*_{new }= 0.0033) determined by the FDR procedure. Since simple effects may be examined for the genes with significant interaction effect, we adopt the protected Fisher's least significant difference test (LSD) procedure at the level of *α*_{new }to control the family-wise error rate (FWER) for each gene examined.

### Conclusions

A linear mixed model is appropriate for analysis of oligonucleotide array experiments with repeated measures. We constructed a generalized F test to select differentially expressed genes, and then applied a specific sequence of tests to identify factorial effects. This sequence of tests applied was designed to control for gene based FWER.

## Background

Experiments in which subjects are assigned randomly to levels of a treatment factor (or treatment combinations of more than one factor) and then are measured for trends at several sampling times, spaces or regions (within-subject factors) are increasingly common in clinical and medical research. The analysis of interaction, main effects and simple effects are appropriate for analyzing these types of experiments [1]. Main effects are average effects of a factor, and interaction effects measure differences between the effects of one factor at different levels of the other factor. As an example, this paper studies a 2 × 2 factorial treatment design, in which effects of two factors (treatment and region, for example) are studied and each factor has only two levels (with or without certain treatment, two different regions of studied subjects). The measurements from different regions of a subject are repeated measures on the individual and are correlated. In combination with microarray technology [2], this type of design allows one to investigate how treatments alter changes in gene expression in time or region simultaneously across a large number of genes. Two issues are crucial in the analysis of microarray experiments with repeated measures. Firstly, sources of variability must be identified, and the correlation structure among within-subject measurements needs to be taken into account; and secondly, multiple testing is also an immediate concern if tests of interaction, main effects, and/or simple effects are performed for each gene.

It has been shown that replication is the key not only to increasing the precision of estimation but also to estimating errors associated with tests of significance [3]. Previously, a number of ways to identify and model various sources of errors were proposed for replicated microarray experiments, and corresponding methods of extracting differentially expressed genes were suggested [4-8]. Recently, a linear modelling approach [9] and analysis of microarray experiments using mixed models were also introduced [10-12], in which the dependency structure of repeated measurements at the probe level were discussed. Statistical methods to analyze more complicated experiments, where correlated measurements are taken on one or more factor levels have not yet been fully described. In this study, we modified the two-staged linear mixed models [10], and extended them to more complicated designs.

Attention to the multiplicity problem in gene expression analysis has been increasing. Numerous methods are available for controlling the family-wise type I error rate (FWER) [13-17]. Since microarray experiments are frequently exploratory in nature and the sample sizes are usually small, Benjamini and Hochberg [18]suggested a potentially more powerful procedure, the false discovery rate (FDR), to control the proportion of errors among the identified differentially expressed genes. A number of studies for controlling FDR have followed [17,19-25]. However, these approaches for dealing with the multiplicity problems in microarray experiments are largely focused on relatively simple one-way layout experimental designs, and the number of genes that are involved in an experiment was the major concern. More complicated designs, such factorial designs with two or more factors, intensify the multiplicity problem not only because thousands of genes are involved in an experiment, but also because tests for interactions, main effects, and, possibly, simple effects need to be performed to further characterize differences for each gene. It has not been suggested explicitly, however, how to deal with such multiple-testing problems for two factors (or more than two factors) factorial experiments in the microarray literature.

In this paper, we present a method for analyzing oligonuleotide array experiments with repeated measures using a linear mixed model, which allows us to model variance-covariance structures associated with such complicated experiments. Our method is also related to that of Wolfinger *et al*., 2001, Chu *et al*., 2002, Kerr *et al*., 2000, and Wernisch *et al*., 2002 [5,9-11]. In addition, we construct a generalized F test to test the null hypothesis that all the means for all Disease by Region combinations are equal. Benjamini and Hochberg (BH) procedure of controlling FDR at 5% is used for comparing P values of the generalized F tests. The test to determine whether the interaction term is significant is performed only for each gene with a significant generalized F test. In addition, simple effects are examined for the genes with significant interaction effect and main effects are tested for those differentially expressed genes which do no exhibit significant interaction. In the 2 × 2 factorial, this sequence of tests controls the maximum FWER and, hence the FWER for all genes. We also illustrate how to summarize and categorize the interactions using simple diagrams. We demonstrate our method on the analysis of microarray data from two regions of the brain, the olfactory bulb (OB) and the cerebellum (CER), from control subjects and patients with AD. Although a 2 × 2 experiment was used in this manuscript, our methods can be extended to designs with more than 2 factors or more than 2 levels in one or more factors. The OB was used because AD patients show pronounced decrements in their olfactory sensitivity early in the clinical course of the disease [26]. The cerebellum was selected as a control tissue because it is generally considered to be minimally affected in AD.

## Results

### Analysis of gene expression in OB and CER of controls and AD patients

Based on the statistical methods described (see Methods), 708 genes were considered to be significant by the procedure of controlling FDR at 5% for multiple testing across genes. The largest P-value considered to be significant was 0.0033 determined by the FDR procedure. Among the 708 genes, 137 show significant interaction at the level of 0.0033, 49 genes with significant disease effect (32 were up-regulated and 17 were down-regulated in AD patients) and 559 genes with significant regional effects (331 were up-regulated and 228 were down-regulated in the OB) (Table (Table1).1). There were 37 genes that appear on both lists of significant disease and regional effects (not shown). Further validation studies, such as real time RT-PCR, could be performed to examine which interpretation is more reasonable.

A significant interaction effect for a gene has to be explained so that the gene can be further categorized based on the nature of the possible alterations of their expression levels. The interaction patterns were identified based on the change directions and test results for the following simple effects: control *vs *AD for OB, control *vs *AD for CER, CER *vs *OB for control, and CER *vs *OB for AD. The interaction effects can also be illustrated using simple diagrams by plotting together the average log2 based intensities under control and AD conditions for both OB and CER. Nonparallel lines in a diagram often imply an interaction effect. An interaction effect can be either directional or magnitudinal. In this study, directional interactions refer to the situations when the changes (in gene expression) between AD and control in OB are in the opposite directions compared to the changes between AD and control in CER. In a magnitudinal interaction, the directions of the changes between AD and control are the same but the magnitudes of changes are significantly different. The gene LOC91614 (UniGene Cluster Hs.180545), which encodes novel 58.3 kDa protein, is an example with directional interaction effect. As shown in Figure Figure11 A1, it is significantly up-regulated 2.08 fold in the OBs of AD patients (Table (Table2)2) and significantly down-regulated 3 fold (1/0.33) in their CERs (Table (Table2).2). The function of this gene is unknown, but, based on the domains identified in its protein sequence, it is likely to be involved in intracellular signalling cascades. Given this divergence in the direction of regulation in these 2 brain regions, this gene would be of interest for further characterization. The gene encoding the proteolytic lysosomal enzyme cathepsin H (UniGene Cluster Hs.114931) has a different pattern of interaction effects as shown in Figure Figure11 A2. It was significantly up-regulated 3.51 fold in the OBs of AD patients (Table (Table2)2) and shows a slight non-significant trend toward up-regulation in their CERs (Table (Table2).2). This is consistent with the pronounced activation of lysosomal enzymes that occurs in regions of the AD brain vulnerable to neurodegeneration (Nixon *et al*., 2000), and with the slight increase in lysosomal density in the CER compared with the pronounced increase in sites of the AD brain with significant neuropathology (prefrontal cortex and hippocampus; [27]). The patterns for other genes with significant interaction effects were also determined by the similar method described above.

**Simple diagrams to illustrate significant interaction, main effects of Disease and Region**. The average log transformed intensities under control and AD conditions for both OB and CER were plotted together for each gene with either significant interaction

**...**

In the absence of interaction effects, main effects are often meaningful. The genes that have either significant main effect of Disease or Region were also identified and characterized by examining the average difference between AD and controls or the average difference between OB and CER. Main effects can also be illustrated by the simple diagrams described above, in which the lines are often parallel. Four genes were used as examples to illustrate main effect of Disease and main effect of Region (Table (Table3,3, Figure Figure11).

The genes *HMGN2 *and *TSG101 *both have significant effect of Disease (Table (Table3).3). *HMGN2 *(high mobility group nucleosomal binding protein 2) was significantly down-regulated (1.6 fold; *p *= 0.0009) in the OBs of AD patients compared to elderly non-demented controls; there was no significant difference in mean expression levels in the OB and CER as shown in Figure Figure11 B1 and B2. Its down-regulation in the OBs of AD patients is consistent with the generally reduced level of gene expression that has been described in the AD brain [28]. The gene *TSG101*, was up-regulated 1.51 fold (*p *= 0.0021), with no significant differences in expression levels in the OB and CER. The encoded protein is a member of the mammalian class E vps proteins, which mediate ubiquitination-dependent receptor sorting within the endosomal pathway. The up-regulation of *TSG101 *suggests a potential disruption of OB neurogenesis.

Two examples of genes with Regional effects are *RELN *and *B2M *(Table (Table3).3). *RELN *is expressed at lower levels in the OB than in the CER (2.7 fold, *p *<0.0005) as shown in Figure Figure11 C1 and C2. The encoded protein is a secreted extracellular matrix molecule that interacts with integrin signalling to generate a signal for migratory developing neurons to stop and form layers; thus, a defect in this gene results in improper development of the cerebellum as well as other brain regions [29]. *B2M*, the gene encoding *β*_{2 }microglobulin, is expressed at 2.2-fold higher levels in the OB than in the CER (*p *< 0.0002). One potential explanation for the higher levels of *B2M *expression in the OB than the CER is that antigens can enter the brain directly along the pathway provided by the axon of the olfactory receptor neuron or within the sheath of the olfactory nerve; numerous proteins and pathogens enter the brain via this route (*e.g.*, [30,31]. The potentially higher level of antigenic stimulation in the OB may result in the up-regulation of *B2M *expression, which would not occur in the CER due to the lack of such a direct connection with the external environment.

The remaining genes with either significant effects of Disease or Region were also identified and categorized in a similar way and summarized in Table Table11.

## Discussion

In this study, we adopted a linear mixed model to analyze oligonuleotide array experiments with repeated measures. We constructed a generalized F test to select differentially expressed genes and compared our method to another frequently used approach. Using the method described above, we identified 708 differentially expressed genes, 137 of which have significant interaction, and 571 genes have main effect of either Disease or Region. Using simple diagrams, we can illustrate and further categorize the interactions and main effects.

This linear mixed model approach allows us to identify various sources of variability, including experimental effects, random effects of subjects and random error. The performance of the generalized F statistic depends on the validity of the assumed covariance structures and the degree of replication. We assumed homoscedastic variances for each gene. This may not be true for all genes in reality. With small sample sizes, which are common in microarray studies, simpler covariance structures which require the estimation of fewer variance components are preferred. Simulation studies showed that, with sample size of 3, the generalized F test performs reasonably well in cases with homoscedastic variances.

We also tested the factorial effects on the 708 genes which were identified by BH procedure using the more conservative Bonferroni adjustment to the *α*-level in order to simultaneously control FDR and the possibility of performing multiple tests for the factorial effects. For example, controlling FDR at 0.05/3 = 1.67%, produced a list of 77 genes. The method we developed is more powerful. In addition, only regional effects were identified without significant interaction and main effect of disease by the alternative method.

In this manuscript, we adopt BH procedure to control FDR at 5% based on the generalized F tests. Any other standard multiple testing procedures may also be applied. A specific sequence of tests was used to identify factorial effects and control the gene-based FWER in our study. For researchers who are interested in all pairwise comparisons among treatment groups, Hayter's modification of the LSD method [32] controls the FWER for all genes.

We also assumed independence of the significant tests among genes. This assumption, which is also adopted in the majority of the microarray literature, may not be completely valid since gene expression is tightly regulated. The correlation among the genes varies from developmental stages, tissue to tissue, etc., and we may never be able to quantify it precisely. The assumption that genes are correlated in small clusters has been adopted by Benjamini and Yekutieli [21] in their FDR control study. This assumption, however, has not been completely verified.

## Conclusions

A linear mixed model is appropriate for analysis of oligonucleotide array experiments with repeated measures, allowing us to quantify various sources of error. We constructed a generalized F test to select differentially expressed genes, and then applied a specific sequence of tests to identify factorial effects. This sequence of tests applied was designed to control for gene based FWER. Our methods can be extended to designs with more than 2 factors or more than 2 levels in one or more factors. The generalized F test can be constructed for any number of factors or levels of factors.

## Methods

### Sources and processing of tissue

OBs were obtained with appropriate informed consent from patients with Alzheimer's disease (AD) and control subjects enrolled in the Biologically Resilient Adults in Neurological Studies (BRAiNS) project of the Sanders-Brown Center on Aging. At autopsy, OBs and pieces of the lateral tip of the cerebellums were removed from 6 females, 3 with AD (mean age, 79.0 years; mean postmortem interval 3.8 h) and 3 controls (mean age, 78.6 years; mean postmortem interval 2.9 h), and immediately placed in liquid nitrogen. BRAiNS control subjects had no clinical evidence of dementia or other neurological problems and scored within the normal range on yearly mental status tests; on neuropathological examination, their brains exhibited age-related but not disease-related changes. AD patients received a diagnosis of probable AD in the Memory Disorders Clinic; on neuropathological examination, their brains met multiple criteria for definite AD and exhibited no indications of complications from cerebrovascular disease [33].

OBs and cerebellum were homogenized in TRI-Reagent (Molecular Research, Inc., Cincinnati, OH), and total RNA was extracted according to the manufacturer's protocol. RNA concentration was determined spectrophotometrically; its integrity and quality were assessed by spectrophotometry, agarose gel electrophoresis, and Bioanalyzer (Agilent, Technologies, Wilmington, DE) virtual gels. Following target preparation, the samples were hybridized onto the Affymetrix Human Genome U133_A and _B GeneChips at the University of Kentucky Microarray Core Facility according to Affymetrix protocols.

### Experimental design

OBs and pieces of the lateral tip of the cerebellums were previously removed from each of 3 control subjects and 3 patients with AD (all female with similar ages). Total RNA was extracted from OB and CER tissues for each subject. Five *μ*g RNA from the OB and CER of each individual were hybridized with Affymetrix Human Genome U133_A and _B chips (2 GeneChips/tissue/individual = 24 GeneChips). Data from U133_A and _B chips for each RNA sample were combined to give 12 data sets with signal intensities for 44828 targets. Under the assumption of independence among genes, we have a 2 × 2 factorial design for each gene with one factor being either control or AD and with repeated measures (regions, OB or CER) on each subject. The arrangement for the 2 × 2 mixed factorial design in this experiment is shown as in Table Table4,4, where *μ _{11}*,

*μ*,

_{12}*μ*, and

_{21}*μ*denotes the average log 2 based expression levels measured in OB of controls, CER of controls, OB of AD patients and CER of AD patients respectively. Corresponding measurements from the same subject are correlated and they are marked as same color. Our primary interests are to identify various sources of variability and differentially expressed genes.

_{22 }### Data preparation

#### Normalization

Background correction and initial total intensity normalization were first performed for the microarray raw data using Affymetrix Version 5 software [34], resulting in gene intensities for each gene-chip combination. The log intensities values were used in later processing. We chose the local regression method (loess) [35-37] to normalize the chips within each of the four treatment combinations. The total intensity method was performed to normalize array across treatment combinations.

#### Data Filtering

In our study, all positive control genes and genes that resulted in an "absent" call for all chips were removed from further analysis. If there was no evidence that these genes were expressed in any of the samples, then these genes can be removed to reduce problems associated with multiple comparisons. Other methods of removing low intensity points were also suggested by Bolstad *et al*., 2003 [37]. All ESTs were also removed from the analysis. Since the primary interest of these experiments is to identify known genes that are differentially regulated, eliminating ESTs will further reduce problems with multiple comparisons. After data filtering steps, 10,590 genes remained, and the base-2 logarithms of background-corrected and normalized intensities of these genes were subject to further statistical analyses.

### Algorithm and analysis

#### Analysis of variance components

We use a linear mixed model to describe the experiment.

Let Y_{gijk }be the base-2 logarithm of background-corrected and normalized intensity of the g*th *gene, *g *= 1, ..., 10590, in the i*th *Treatment group *i *= 1, 2, from the j*th *Region, *j *= 1, 2, on the k*th *subject *k *= 1, 2, 3. "Treatment" here signifies the health condition of the subjects (controls or AD patients). A complete linear mixed model for this experiment:

Y_{gijk }= *μ *+ D_{i }+ *S*_{ik }+ R_{j }+ (DR)_{ij }+ *A*_{ijk }+ G_{g }+ (GD)_{gi }+ (GR)_{gj }+ (GDR)_{gij }+ *ε*_{gijk}, (1)

where *μ *is the grand mean, D_{i }and R_{j }and are the main effects of treatments, regions respectively, and (DR)_{ij }are the treatment-region interaction effects. Here *S*_{ik }are the random effects of subjects within disease group and *A*_{ijk }are the random effects of chips. The symbols G_{g}, (GD)_{gi}, (GR)_{gj }and (GDR)_{gij }represent the main effect of gene, gene-treatment interaction effects, gene-region interaction effects, gene-treatment-region interaction effects, while *ε*_{gijk }are the additive stochastic errors

In general, it is impractical, using currently available software, to fit linear models such as (1) with microarray data involving manipulation of the full covariance matrix of observation variables that usually contains thousands of levels. To be conceptually and computationally more efficient, Wolfinger *et al*., 2001 [10] suggested a two-step model to separate experimental-wise systematic effects (normalization sub-model) and the remaining effects for each gene (gene sub-model). In our case, however, the design matrix for the fixed effects of D_{i}, R_{j }and (DR)_{ij }is orthogonal to the design matrix for the fixed effects involving each gene, including G_{g}, (GD)_{gi}, (GR)_{gj }and (GDR)_{gij}. Therefore, the normalization model has no effect on the inference for each gene under the assumption in (1). A simpler model can be adopted for each gene, and the random effect *S*_{ik }is absorbed by *S*_{gik }terms and *A*_{ijk }is absorbed by *ε*_{gijk }terms. We make standard stochastic assumptions that the random effects *S*_{gik}, and *ε*_{gijk }are normally distributed with zero means with variances *σ*_{gs}^{2}, and *σ*_{g}^{2 }respectively. These random effects are assumed to be independent both across their indices. The model equation then becomes

Y_{gijk }= *μ*_{g }+ D_{gi }+ R_{gj }+ (DR)_{gij }+ *S*_{gik }+ *ε*_{gijk}. (2)

In matrix notation, the model equation for each gene can be written

**Y **= **X β **+ Z

**u**+

**ε,**(3)

where **Y **is a vector of observations, **X **and Z are matrices of known constants for the fixed effects and random effects, respectively, ** β **is a vector containing fixed effect parameters D

_{gi}, R

_{gj}, and (DR)

_{gij},

**u**is a vector of random effects, and

**is the error or residual vector. Therefore,**

*ε***Y**~ MVN (

**X**) where

*β,V**V*=

**+**

*ZDZ'***Σ**. The covariance matrices

**= var(**

*D***u**) and

**Σ**= var(

**) can have any valid variance-covariance matrix form. The variances of gene specific subject effects**

*ε**S*can vary for different treatments and different genes, while

*ε*effects can have different variances for different treatments, regions and different genes. The remaining terms are fixed effects. All effects and variance components in the model can be estimated using the method of restricted maximum likelihood (REML) [38].

In the homogeneous variance case assumed here, since observations across subjects are independent, the variance-covariance matrix for gene g, ** V_{g}**, is block diagonal where

** V_{g }**=

*diag*(

**Σ**)

_{g}and

If the assumption of homoscedasticity is not viable, the variance-covariance for gene g can easily be accommodated by allowing **Σ _{g }**to vary across disease groups.

#### Estimation of model parameters

The estimate of primary interest is ** β**, which containing treatment, region effects and treatment-region interaction for each gene. For each gene,

**is estimated by**

*β*The estimated has covariance

where in practice components of ** V **are replaced by their REML estimates. See Verbeke and Molenberghs (2000) [1] for methods to derive equations (3)–(5) and REML estimates of the random components in details.

#### Construct a generalized F test

Genes showing significant interaction effects are defined as those in which the difference in expression levels between control and AD is not the same with the difference between OB and CER. Main effects are meaningful in the absence of interaction effect. Genes showing a significant disease-related effect or main effect of disease are defined as those either under- or over-expressed by AD patients compared to controls at the same extent in both OB and CER, while genes with significant main effect of region are those either under- or over-expressed in OB compared to CER at the same extent by both AD and controls. If the expression levels for a gene are the same across all treatment-region combinations, then there will be neither significant interaction nor main effects; therefore this gene should be excluded from further analysis. The expression of other genes may be altered by treatment or/and region effects, and further analysis of these genes is needed to characterize the experimental effects. Therefore the first step to select differentially expressed genes in factorial designs is to choose those for each of which the hypothesis of equality of all cell means, *μ _{11 }*=

*μ*=

_{12 }*μ*=

_{21 }*μ*, is rejected. Because of the specific variance-covariance structure for a repeated measures experiment with two levels of the within subject factor, it is convenient to test the equivalent composite hypothesis for each gene g which is stated in terms of the main effects and the interaction. Specifically, we consider

_{22}We can test this composite null hypothesis of no interaction and main effects simultaneously by setting up 3 corresponding linear contrasts listed in Table Table5.5. A contrast is a linear combination of parameters, for which the coefficients sum to zero [39]. Let **L **be the 3 × 8 matrix containing the coefficients of the 3 contrasts, then *H _{o }*is simplified as

**L**=

*β***0**, where

**L**is estimable, and can be tested using the generalized F test

*β*Under *H _{o}*, the generalized F is distributed approximately as Snedecor's F with degrees of freedom rank(L) and

*ν*(

**F**

_{[rank(L), ν]}). Since the variance-covariance matrix V satisfies a compound symmetry condition, in our example this statistic is distributed as

**F**

_{[3, 4]}. Under other assumptions of the variance-covariance structures, the denominator degrees of freedom

*ν*can be approximated by the degrees of freedom to estimate

**using Satterthwaite's procedure [38,40]. Details about how to select appropriate covariance structures were discussed by Littell et al. (1996) [38] and Keselman et al. (1998) [41].**

*L(X'V*^{-1 }X)^{-1 }L'#### Adjustment for multiple tests

Multiple testing problems in microarray experiments with factorial designs are at least two-fold. Usually, hypothesis tests are performed for each of thousands of genes involved, and tests of main effects and interactions may also be needed for each gene. Based on the generalized F test we constructed above, we now suggest a method for adjusting multiple tests.

The most commonly used methods to adjust multiple tests are of controlling either FWER or FDR. These methods are first applied to the P-values from the generalized F tests, providing a list of genes that exhibit significant difference among the four cell means of Disease by Region combination. Some of these genes may have significant interactions, or only the main effects of treatment and/or region are significant. Further characterizing the significant interactions are one of the major interests for researchers, and methods for investigate interaction contrasts are available [42-44]. In our study, simple effects were examined for the genes the have a significant interaction to detect the difference between specific comparisons. Protected by the generalized F test, Fisher's least significant difference test (LSD) method can be used to test the necessary simple effects. Here the appropriate error terms for these simple effects depend on whether the comparisons involve measurements from same Disease groups or not. This sequence of tests proposed in this paper are more powerful, while still allowing for the control of FWER or FDR, compared to directly adjusting P-values using BH procedure with Bonferroni correction. In the latter method, if we control overall FDR at 5%, we would perform BH procedure at level of 1.67% or 0.05/3 for each test of interaction, main effect of disease or region.

#### Recipe of the analysis

A short summary of the statistical methods used in this study follows:

1. Linear mixed models were used to describe the data based on the experimental design and some common assumptions, and the variance components were specified.

2. For each gene, a generalized F test was performed based on the described model, and the corresponding P-value was obtained.

3. To adjust the multiple tests for numbers of genes, the BH method of controlling FDR [18] at 5% was applied to the P-values obtained above, providing a list of genes (list I) that exhibit significant differences among the means of the Disease*Region combinations.

4. Using *α*_{new}, which equals to the largest P-value considered to be significant in step3 as the cut-off point, we choose genes with significant interactions (list II) from list I and, for genes in list II, to test the simple effects. By complete enumerating of all possible combinations of main effects and interaction effect, one can prove that *α*_{new }is an appropriate choice to control the FWERs while selecting genes with either significant interaction or main effects in 2 × 2 factorial experiments. From the remaining genes, significant main effects of either disease or region (list III) were selected. In the example used in this study, *α*_{new }= 0.0033.

#### Statistical software

Data normalization and generation of simulated data were performed using S-plus version 6.1. We used SAS (version 9.0) proc mixed procedure to do Model fitting and significance analysis. The SAS program implementing linear mixed models for the AD data is available on request from the first author.

### Simulation studies

We constructed a generalized F test to select differentially expressed genes (see method). To assess the performance of the constructed generalized F test with small sample sizes, we performed simulation studies. Since expression levels of genes in the OB or CER from an individual (either a control or patient with AD) are considered to be repeated measures, correlated data should be generated for the simulations. First, we studied the case (case I) with equal variance and covariance structure for each individual subject (control or AD patient). We generated 10,000 sets of correlated data; each set has 6 bivariate observations, with mean 20 and the following covariance structure for each subject under either Disease condition (i = 1 for control, and i = 2 for AD; j or j' = 1 for OB, and j or j' = 2 for CER, k = 1, 2, 3):

where Y_{gijk }and Y_{gij'k }are measurements from the j^{th }and j'^{th }levels of Region for the kth subject in the i*th *level of Disease for gene g. The generalized F-statistics were computed for each of the 10,000 data sets and the histogram of the generalized F-statistics was compared with that of randomly generated F values from a **F**_{[3, 4] }distribution as shown in figure figure2A.2A. The histogram of the generalized F-statistics has a slightly larger tail. The proportion of the generalized F-statistics that were no larger than the critical value, F_{[3, 4, α = 0.05] }= 6.59 was 5.12%, instead of the nominal 5%, 4.97% random generated F values were smaller than 6.59.

**Histograms of Simulated F statistics**. The histograms of the F statistics from F

_{[3, 4] }(grey in A, B), simulated data with same covariance structure among individuals (cyan, case I in A) or unequal variance for subjects from controls and AD patients (blue,

**...**

More complicated variance and covariance structure can also be assumed. For example, the controls and AD patients may have a different covariance matrix. We then generated simulated data to study cases like above (case II). Using the same covariance structure as above, we generate 10,000 sets of data for controls. For AD, we generated 10,000 sets of data using a different covariance structure

Then we computed the generalized F-statistics and compared them with randomly generated F values described above (Figure (Figure2B).2B). The histogram of the generalized F-statistics has a slightly larger tail than those of both random generated F values and case II. There was 5.42% of the generalized F-statistics in case II were larger than the critical value 6.59. With small sample size in both cases (n = 3), the constructed generalized F-statistics behave reasonably well.

## Authors' contributions

HL carried out the study. AJS, and CLW supervised the study. TVG and MLG carried out the molecular genetics studies. All authors contributed to the writing of this manuscript. All authors read and approved the final manuscript.

## Acknowledgements

This work was supported by NIH AG-16345 (MLG), NIH-P20RR16481 and NSF-EPS-0132295 (AJS), NIH AG-016824-23 (TVG). We also wish to thank Donna Wall, Microarray Facility, and Radhika Vaishnav, Department of Anatomy and Neurobiology, for their expertise.

## References

- Verbeke G, Molenberghs G. Linear Mixed models for longitudinal data. Springer, New York, NY. 2000.
- Lockhart D, Dong H, Byrne M, Follettie M, Gallo M, Chee M, Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL. Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat Biotechnol. 1996;14:1675. doi: 10.1038/nbt1296-1675. [PubMed] [Cross Ref]
- Fisher RA. The Design of Experiments, 6th edn. Edinburgh: Oliver and Boyd Ltd. 1951.
- Lee ML, Kuo FC, Whitmore GA, Sklar J. Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations. Proc Natl Acad Sci USA. 2000;97:9834–9839. doi: 10.1073/pnas.97.18.9834. [PMC free article] [PubMed] [Cross Ref]
- Kerr MK, Martin M, Churchill GA. Analysis of variance for gene expression microarray data. J Comput Biol. 2001;7:819–837. doi: 10.1089/10665270050514954. [PubMed] [Cross Ref]
- Kerr MK, Churchill GA. Statistical design and the analysis of gene expression microarray data. Genet Res. 2001;77:123–128. doi: 10.1017/S0016672301005055. [PubMed] [Cross Ref]
- Huang X, Pan W. Comparing three methods for variance estimation with duplicated high density oligonucleotide arrays. Funct Integr Genomics. 2002;2:126–133. doi: 10.1007/s10142-002-0066-2. [PubMed] [Cross Ref]
- Šášik R, Calvo E, Corveil J. Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model. Bioinformatics. 2002;18:1633–1640. doi: 10.1093/bioinformatics/18.12.1633. [PubMed] [Cross Ref]
- Chu TM, Weir B, Wolfinger R. A systematic statistical linear modeling approach to oligonucleotide array experiments. Math Biosci. 2002;176:35–51. doi: 10.1016/S0025-5564(01)00107-9. [PubMed] [Cross Ref]
- Wolfinger RD, Gibson G, Wolfinger ED, Bennett L, Hamadeh H, Bushel P, Afshari C, Paules RS. Assessing gene significance from cDNA microarray expression data via mixed models. J Comput Biol, 2001;8:625–637. doi: 10.1089/106652701753307520. [PubMed] [Cross Ref]
- Wernisch L, Kendall SL, Soneji S, Wietzorrek A, Parish T, Hinds J, Butcher PD, Stocker NG. Analysis of whole-genome microarray replicates using mixed models. Bioinformatics. 2003;19:53–61. doi: 10.1093/bioinformatics/19.1.53. [PubMed] [Cross Ref]
- Cui X, Churchill GA. Statistical tests for differential expression in cDNA microarray experiments. Genome Bio. 2003;4:210. doi: 10.1186/gb-2003-4-4-210. [PMC free article] [PubMed] [Cross Ref]
- Duncan DB. Multiple range and multiple F tests. Biometrics. 1955;11:1.
- Westfall PH, Young SS. p-value adjustment for multiple tests in multivariate binomial models. J Am Stat Assoc. 1989;84:780–786.
- Shaffer JP. Multiple hypothesis testing. Annu Rev Psychol. 1995;46:561–724. doi: 10.1146/annurev.ps.46.020195.003021. [Cross Ref]
- Hsu JC. Multiple comparisons. London: Chapman and Hall. 1996.
- Dudoit S, Yang YH, Callow MJ, Speed TP. Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Stat Sinica. 2002;12:111–139.
- Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. JRoy Stat Soc. 1995;B:289–300.
- Yekutieli D, Benjamini Y. Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics. J Stat Plan Infer. 1999;82:171–196. doi: 10.1016/S0378-3758(99)00041-5. [Cross Ref]
- Abramovich F, Benjamini Y, Donoho D, Johnstone I. Adapting to unknown sparsity by controlling the false discovery rate. Technical Report No 2000-19 Department of Statistics, Stanford University. 2000.
- Benjamini Y, Yekutieli D. The control of the false discovery rate under dependency. Ann Stat. 2001;29:1165–1188. doi: 10.1214/aos/1013699998. [Cross Ref]
- Efron B, Tibshirani R, Storey JD, Tusher V. Empirical Bayes analysis of a microarray experiment. J Am Stat Assoc. 2001; 96:1151–1160. doi: 10.1198/016214501753382129. [Cross Ref]
- Storey JD. The positive false discovery rate: A Bayesian interpretation and the Q-Value. Technical Report 2001-12. Department of Statistics, Stanford University. 2001.
- Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. [PMC free article] [PubMed] [Cross Ref]
- Sabatti C, Karsten SL, Geschwind DH. Thresholding rules for recovering a sparse signal from microarray experiments. Math Biosci. 2002;176:17–34. doi: 10.1016/S0025-5564(01)00102-X. [PubMed] [Cross Ref]
- Murphy C, Gilmore MM, Seery CS, Salmon DP, Lasker BR. Olfactory thresholds are associated with degree of dementia in Alzheimer's disease. Neurobiol Aging. 1990;11:465–469. doi: 10.1016/0197-4580(90)90014-Q. [PubMed] [Cross Ref]
- Cataldo AM, Hamilton DJ, Barnett JL, Paskevich PA, Nixon RA. Properties of the endosomal-lysosomal system in the human central nervous system: Disturbances mark most neurons in populations at risk to degenerate in Alzheimer's disease. JNeurosci. 1996;16:186–199. [PubMed]
- McLachlan DR, Lukiw WJWL, Bergeron C, Bech-Hansen NT. Selective messenger RNA reduction in Alzheimer's disease. Brain Res. 1988;427:255–261. [PubMed]
- Tissir F, Goffinet AM. Reelin and brain development. Nature Rev Neurosci. 2003;4:496–505. doi: 10.1038/nrn1113. [PubMed] [Cross Ref]
- Reiss CS, Plakhov IV, Komatsu T. Viral replication in olfactory receptor neurons and entry into the olfactory bulb and brain. Ann N Y Acad Sci. 1998;855:751–761. [PubMed]
- Schwob JE, Saha S, Youngentob SL, Jubelt B. Intranasal inoculation with the olfactory bulb line variant of mouse hepatitis virus causes extensive destruction of the olfactory bulb and accelerated turnover of neurons in the olfactory epithelium of mice. Chem Senses. 2001;26:937–952. doi: 10.1093/chemse/26.8.937. [PubMed] [Cross Ref]
- AJ Hayter AJ. The maximum familywise error rate of Fisher's least significant difference test. J Am Stat Assoc. 1986;81:1000–1004.
- Getchell ML, Shah DS, Buch SK, Davis DG, Getchell TV. 3-Nitrotyrosine immunoreactivity in olfactory receptor neurons of patients with Alzheimer's disease: implication for impaired odor sensitivity. Aging. 2003;24:663–673. [PubMed]
- Affymetrix Statistical algorithms reference guide, Technical report, Affymetrix. 2001.
- Li C, Wong WH. Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error applications. Genome Biol. 2001;2:1–11. [PMC free article] [PubMed]
- Schadt E, Li C, Eliss B, Wong WH. Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array data. J Cell Biochem. 2002;84:120–125. doi: 10.1002/jcb.10073. [PubMed] [Cross Ref]
- Bolstad BM, Irizarry RA, Åstrand M, Speed TP. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003;19:185–193. doi: 10.1093/bioinformatics/19.2.185. [PubMed] [Cross Ref]
- Littell RC, Milliken GA, Stroup WW, Wolfinger RD. SAS system for mixed models, SAS institute Inc. 1996.
- Scheffé H. The analysis of variance. Wiley classics library edition pub. Pp66. John Wiley & Sons, INC. 1999.
- Kenward MG, Roger JH. Small sample inference for fixed effects from restricted maximum likelihood. Biometrics. 1997;53:983–997. [PubMed]
- Keselman HJ, Algina J, Kowalchuk RK, Wolfinger RD. A comparison of two approaches for selecting covariance structures in the anlaysis of repeated measurements. Comm Stat-SimComp. 1998;27:591–604.
- Boik RJ. Interaction, partial interactions, and interaction contrasts in the analysis of variance. Psych Bulletin. 1979;68:1084–1089.
- Hochberg Y, Tamhane AC. Multiple comparison procedures. New York: Wiley. 1987.
- Lix LM, Keselman HJ. Interaction contrasts in repeated measures designs. British J Math Stat Psych. 1996;49:147–162.

**BioMed Central**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (334K) |
- Citation

- Identification of gene expression patterns using planned linear contrasts.[BMC Bioinformatics. 2006]
*Li H, Wood CL, Liu Y, Getchell TV, Getchell ML, Stromberg AJ.**BMC Bioinformatics. 2006 May 5; 7:245. Epub 2006 May 5.* - Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.[BMC Bioinformatics. 2005]
*Shedden K, Chen W, Kuick R, Ghosh D, Macdonald J, Cho KR, Giordano TJ, Gruber SB, Fearon ER, Taylor JM, et al.**BMC Bioinformatics. 2005 Feb 10; 6:26. Epub 2005 Feb 10.* - Identifying differentially expressed genes using false discovery rate controlling procedures.[Bioinformatics. 2003]
*Reiner A, Yekutieli D, Benjamini Y.**Bioinformatics. 2003 Feb 12; 19(3):368-75.* - [Multiple comparison procedures: principles, limits. Applications to microarray phenotype-genotype analysis].[Rev Epidemiol Sante Publique. 2004]
*Dalmasso C, Broët P, Moreau T.**Rev Epidemiol Sante Publique. 2004 Dec; 52(6):523-37.* - Statistical considerations for analysis of microarray experiments.[Clin Transl Sci. 2011]
*Owzar K, Barry WT, Jung SH.**Clin Transl Sci. 2011 Dec; 4(6):466-77. Epub 2011 Nov 7.*

- A novel approach for biomarker selection and the integration of repeated measures experiments from two assays[BMC Bioinformatics. ]
*Liquet B, Cao KA, Hocini H, Thiébaut R.**BMC Bioinformatics. 13325* - MetabR: an R script for linear model analysis of quantitative metabolomic data[BMC Research Notes. ]
*Ernest B, Gooding JR, Campagna SR, Saxton AM, Voy BH.**BMC Research Notes. 5596* - Differential gene expression in liver and small intestine from lactating rats compared to age-matched virgin controls detects increased mRNA of cholesterol biosynthetic genes[BMC Genomics. ]
*Athippozhy A, Huang L, Wooton-Kee CR, Zhao T, Jungsuwadee P, Stromberg AJ, Vore M.**BMC Genomics. 1295* - Distinct clinical phenotypes associated with JAK2V617F reflect differential STAT1 signaling[Cancer cell. 2010]
*Chen E, Beer PA, Godfrey AL, Ortmann CA, Li J, Costa-Pereira AP, Ingle CE, Dermitzakis ET, Campbell PJ, Green AR.**Cancer cell. 2010 Nov 16; 18(5)524-535* - Statistics and bioinformatics in nutritional sciences: analysis of complex data in the era of systems biology[The Journal of nutritional biochemistry. 20...]
*Fu WJ, Stromberg AJ, Viele K, Carroll RJ, Wu G.**The Journal of nutritional biochemistry. 2010 Jul; 21(7)561-572*

- CompoundCompoundPubChem chemical compound records that cite the current articles. These references are taken from those provided on submitted PubChem chemical substance records. Multiple substance records may contribute to the PubChem compound record.
- PubMedPubMedPubMed citations for these articles
- SubstanceSubstancePubChem chemical substance records that cite the current articles. These references are taken from those provided on submitted PubChem chemical substance records.
- TaxonomyTaxonomyTaxonomy records associated with the current articles through taxonomic information on related molecular database records (Nucleotide, Protein, Gene, SNP, Structure).
- Taxonomy TreeTaxonomy Tree

- Analysis of oligonucleotide array experiments with repeated measures using mixed...Analysis of oligonucleotide array experiments with repeated measures using mixed modelsBMC Bioinformatics. 2004; 5()209

Your browsing activity is empty.

Activity recording is turned off.

See more...