- Journal List
- Gene Regul Syst Bio
- v.1; 2007
- PMC2759137

# Evaluation of Two Outlier-Detection-Based Methods for Detecting Tissue-Selective Genes from Microarray Data

^{1}Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657, Japan

^{2}Faculty of Bioresource Sciences, Akita Prefectural University, Shimoshinjyo, Nakano, Akita 010-0195, Japan

## Abstract

Large-scale expression profiling using DNA microarrays enables identification of tissue-selective genes for which expression is considerably higher and/or lower in some tissues than in others. Among numerous possible methods, only two outlier-detection-based methods (an AIC-based method and Sprent’s non-parametric method) can treat equally various types of selective patterns, but they produce substantially different results. We investigated the performance of these two methods for different parameter settings and for a reduced number of samples. We focused on their ability to detect selective expression patterns robustly. We applied them to public microarray data collected from 36 normal human tissue samples and analyzed the effects of both changing the parameter settings and reducing the number of samples. The AIC-based method was more robust in both cases. The findings confirm that the use of the AIC-based method in the recently proposed ROKU method for detecting tissue-selective expression patterns is correct and that Sprent’s method is not suitable for ROKU.

**Keywords:**microarray, tissue selectivity, differential expression, AIC

## Introduction

The majority of microarray studies have focused on the detection of differentially expressed genes. Of these, tissue-selective genes for which expression in a single or small number of tissues is significantly different than in other tissues have attracted great interest due to their value in revealing the biological and physiological functions of tissues and organs at the molecular level (Kadota et al. 2006; Liang et al. 2006).

Numerous methods have been used to detect tissue-selective genes in microarrays (Greller and Tobin, 1999; Pavlidis and Noble, 2001; Kadota et al. 2003a; Schug et al. 2005; Ge et al. 2005; Yanai et al. 2005; Liang et al. 2006; Kadota et al. 2006). Of these, A recent study (ROKU; Kadota et al. 2006) demonstrated the effectiveness of using both Shannon entropy for ranking genes on the basis of their tissue selectivity (Schug et al. 2005) and an outlier-detection-based method for identifying tissues in which a gene is selective (the AIC-based method; Kadota et al. 2003a). However, it did not clarify why an AIC-based method was used even though other types of outlier-detection-based methods are applicable (Kadota et al. 2006). For example, Sprent’s non-parametric method could be used (Ge et al. 2005).

We have now evaluated and compared two outlier-based methods previously used for the detection of tissue-selective genes: the AIC-based method (Kadota et al. 2003a) and Sprent’s non-parametric method (Ge et al. 2005). Their outputs greatly vary mainly with changes in two factors. One is the maximum number of outlier candidates. For example, the AIC-based method sets this parameter to half the sample number interrogated; it can of course be set to other numbers. The other is the number of samples in the dataset. Researchers may subtract (or add) samples from a dataset if the data quality is under a post-determined threshold (or because other samples are added). The outputs for the common parts from two slightly different datasets can differ. Of course, we want to use a method for which the output is robust against changes in both factors. The two outlier-detection-based methods were evaluated in terms of these two factors.

## Methods

### Gene expression data

Expression data for normal human tissues were obtained from a dataset consisting of data for 36 various types of tissues in Affymetrix high-density oligonucleotide microarrays representing 22283 clones and controls (http://www.genome.rcast.u-tokyo.ac.jp/normal/). The raw (probe-level) data were processed using the SuperNORM algorithm (Konishi, 2004 and 2006) and log_{2} transformed.

### Detecting specific tissues using the AIC-based method

Detection of specific tissues using the AIC-based method (Kadota et al. 2003a) is performed as follows: (i) normalize gene vector *x* = (*x*_{1}, *x*_{2}, …, *x** _{N}*) for

*N*tissues (

*x*

_{1}<

*x*

_{2}… <

*x*

*) by subtracting the mean and dividing by the standard deviation (SD); (ii) calculate statistics $U=n\times \text{log}\sigma +\sqrt{2}\times s\times (\text{log}n!)/n$ for various combinations of outlier candidates, where*

_{N}*n*and

*s*denote the numbers of non-outlier and outlier candidates and σ denotes the SD of the observations of the

*n*non-outlier candidates; and (iii) regard tissues corresponding to outliers detected in the combination of minimum

*U*as specific. The maximum number,

*N*

*, of outlier candidates was originally set to*

_{max}*N*/2 (Kadota et al. 2003a). We analyzed the effect of changing

*N*

*. The R code is available in the additional file 2.*

_{max}### Detecting specific tissues using Sprent’s method

Detection of specific tissues using Sprent’s non-parametric method (Ge et al. 2005) is performed as follows: (i) normalize gene vector *x* = (*x*_{1}, *x*_{2}, …, *x** _{N}*) for

*N*tissues by subtracting the median and dividing by the median absolute deviation (MAD); (ii) regard tissues corresponding to absolute values >

*k*as specific. Parameter

*k*was originally set to 5 (Ge et al. 2005). We analyzed the effect of changing

*k*.

## Results and Discussion

The purpose of this study was to compare two outlier-detection-based methods (the AIC-based method and Sprent’s non-parametric method) for the detection of tissues in which a gene is selective. Compared to other statistical methods excluding ROKU, which uses the AIC-based method (Kadota et al. 2006), both methods have two advantages. First, they can treat equally various types of tissue-selective genes: (a) ‘up-type’ genes selectively over-expressed in a single or small number of tissues, (b) ‘down-type’ genes selectively under-expressed in some tissues, and (c) ‘mixed-type’ genes selectively over- and under-expressed in some tissues (Kadota et al. 2006). Second, they can extract genes whose expression is considerably different only in arbitrarily selected tissues. Other methods such as template matching (Pavlidis and Noble, 2001) and Schug’s *Q*-statistic (Schug et al. 2005) sometimes detect genes considerably different in other tissues in addition to the objective tissue (Kadota et al. 2003a; Kadota et al. 2006).

Although neither method can rank genes on the basis of their overall tissue selectivity, ROKU can compensate for this by adding an entropy-based score for individual genes (Kadota et al. 2006). For ROKU users who want to detect various types of tissue-selective patterns, the remaining issue is whether another published method (Sprent’s method; Ge et al. 2005) is suitable for ROKU. Fortunately, the two methods have two common characteristics: (i) the same output format and (ii) only one parameter can affect the output (*N** _{max}* for the AIC-based method and

*k*for Sprent’s method). These similarities facilitate direct comparison with no modifications.

Here we examine the effects of (1) different parameter settings and (2) a reduced number of samples on robustness. We do this using the expression data for 22283 clones and 36 samples. We first present an example using a hypothetical expression vector for ten tissues, *x* = (12, 51, 52, 54, 57, 59, 60, 63, 85, 88) and then evaluate the two methods using actual microarray data. Both methods output a vector (consisting of 1 for over-expressed outliers, −1 for under-expressed outliers, and 0 for non-outliers) that corresponds to the input expression vector. We only need compare these outlier vectors.

### Effect of different parameter settings

The outlier vectors produced using outlier-detection-based methods vary with the parameters (*N** _{max}* for the AIC-based method and

*k*for Sprent’s method) (Figure 1). In general, the number of detected outliers (the number of nonzero elements in the outlier vector) tends to be lower when

*N*

*is small and*

_{max}*k*is large. For example, reducing

*N*

*, which is the maximum number of outlier candidates, from 5 to 1 produced two different outlier vectors: (−1, 0, 0, 0, 0, 0, 0, 0, 1, 1) for*

_{max}*N*

*= 3 to 5 and (−1, 0, 0, 0, 0, 0, 0, 0, 0, 0) for*

_{max}*N*

*= 1 and 2 (Figure 1a). This is not surprising since the latter values of*

_{max}*N*

*are less than the number of outliers detected using the former values of*

_{max}*N*

*(1 or 2 <3). There is also some variation in the outlier vectors produced using different values of parameter*

_{max}*k*in Sprent’s method (Figure 1b).

For the hypothetical vector, the two outlier-detection-based methods with the default parameter settings (*N** _{max}* =

*k*= 5) produce different outlier vectors. The difference is whether the second highest observation (the value of “85”) is detected as an over-expressed outlier (the AIC-based method) or a non-outlier (Sprent’s method). Since we designed the original hypothetical expression vector to have three significantly different observations than in the others (the same as the outlier vector obtained using the AIC-based method), the observation should be detected as an over-expressed outlier. Some researchers, however, disagree with our judgment and think, for example, there is only one tissue (T1) in which the hypothetical vector is selective. The final decision about tissue selectivity thus suffers from some subjectivity. Accordingly, we would be unable to determine which of the alternative methods performs better even if demonstrations for many hypothetical expression vectors and many actual vectors were provided. Figure 1 merely presents an example of producing different outlier vectors with different parameter settings.

Figure 2 shows the average percentage of detected outliers for various values of *N** _{max}* (Figure 2a) and

*k*(Figure 2b) when actual gene expression vectors for 36 normal human tissues (Ge et al. 2005) were analyzed. The results with the default parameter settings (

*N*

*=*

_{max}*N*/2 = 18;

*k*= 5) yielded similar average percentages: 2.43% for the AIC-based method and 2.32% for Sprent’s method. Clearly, the percentages for the AIC-based method were insensitive to changes in the parameter value while those for Sprent’s method were sensitive. For example, changing

*N*

*from 9 (*

_{max}*N**1/4) to 27 (

*N**3/4) yielded a difference of 0.06% (2.43– 2.37%) (Figure 2a), while changing

*k*from 4.0 to 6.0 yielded a difference of 2.64% (4.11–1.47%) (Figure 2b). Although the ranges for the AIC-based method (9–27) and Sprent’s method (4.0–6.0) are not directly comparable, these parameters are possible. These results suggest that researchers who want a method for detecting tissues in which a gene is selective that is insensitive to variations in these parameters should use the AIC-based method. The “outlier matrix” (consisting of 1 for over-expressed outliers, −1 for under-expressed outliers, and 0 for non-outliers) that corresponds to the actual gene expression matrix when the AIC-based method is used with the default parameter setting is available in the additional file 1.

An interesting exercise is to change the
$\sqrt{2}$ in the AIC criterion for detecting outliers to other values such as 1 or 2 though the original equation (
$U=n\times \text{log}\sigma +\sqrt{2}\times s\times (\text{log}n!)/n$) has a solid theoretical basis (Ueda T, 1996; Kadota et al. 2003a; Kadota et al. 2003b). A decrease (or increase) in the weight for the penalty results in an increased (or decreased) number of outliers. Changing
$\sqrt{2}$ to 1 (or 2) with the default value of *N** _{max}* (18) yielded 5.19% (or 1.13%) for the average percentage of detected outliers. The AIC-based method remained robust against changes in

*N*

*when these other weights were used (data not shown).*

_{max}### Effect of reduced number of samples

In addition to the effect of different parameter settings, outlier vectors could also vary with the addition or reduction of samples even when the same parameter values are used. To examine the effect of reducing the number of samples, we generated *N* leave-one-out input vectors consisting of (*N*−1) samples from an expression vector originally consisting of *N* samples. Consider, for example, a hypothetical vector consisting of ten observations. Ten leave-one-out input vectors, each of which has nine observations, can be analyzed. If the method is good, the ten leave-one-out output vectors should be the same as the original output vector of ten observations.

Figure 3 shows the results of the “leave-one-out outlier detection” (LOOOD) analysis for the hypothetical vector using (a) the AIC-based method and (b) Sprent’s method, with the default parameter settings (*N** _{max}* =

*k*= 5). Clearly, the AIC-based method is more robust against a reduction in the number of samples, at least for this hypothetical expression vector.

To examine the two methods further using actual data, we defined a basis for evaluation as follows: (i) the outlier vector obtained from the original vector (not a leave-one-out vector) is “true,” (ii) the outliers (“−1” or “1”) in the outlier vector are “positive,” and (iii) the non-outliers (“0”) are “negative.” Accordingly, the LOOOD results give rise to four quantities:

True positive (TP): outliers that are detected as outliers in the outlier vector obtained from the original expression vector consisting of *N* observations

True negative (TN): non-outliers that are detected as non-outliers in the original outlier vector

False positive (FP): outliers that are detected as non-outliers in the original outlier vector

False negative (FN): non-outliers that are detected as outliers in the original outlier vector

For example, the values for Sprent’s method (Figure 3b) were TP = 16, TN = 67, FP = 5, and FN = 2. Zviling et al. (2005) stated that any single number that represents the power of the method must account for all the categories listed above. We define two such numbers: “accuracy” = (TP + TN)/(TP + TN + FP + FN) and “Matthews correlation coefficient (MCC)” = (TP*TN − FP*FN)/((TP + FN)*(TN + FP)*(TP + FP)*(TN + FN))^{1/2} (Matthews, 1975). Accuracy represents the fraction of the unchanged vectors among LOOOD test, and MCC represents the correlation between the original vector and the LOOOD results when the Pearson correlation coefficient is used. These statistics can take values in the following ranges: 0 ≤ accuracy ≤1; −1 ≤ MCC ≤1. The higher the value, the greater the robustness against a reduction in the number of samples. The LOOOD results for the hypothetical vector and Sprent’s method were accuracy = 92.22% and MCC = 77.50% (Figure 3b); for the AIC-based method (Figure 3a), they were accuracy = MCC = 100% since FP = FN = 0.

Figure 4 shows the LOOOD results for actual data using (a) the AIC-based method and (b) Sprent’s method. Accuracy and MCC were calculated for each parameter value (*N** _{max}* = 9 – 27 and

*k*= 4.0 – 6.0) around the default values (

*N*

*= 18 and*

_{max}*k*= 5). Obviously, the values for the AIC-based method were higher than those for Sprent’s method. We verified these results by varying the value of

*N*in leave-

*N*-out outlier detection (data not shown). These results suggest that the AIC-based method is less affected by slight changes in the input vector than Sprent’s method.

As mentioned above, objective comparison of methods for detecting tissue-selective patterns is understandably difficult. We know of only two reports in which the authors explicitly compared their method to other methods using the same dataset: (i) Kadota et al. (2003a) reported that the AIC-based method is superior to template matching and ANOVA, and (ii) Kadota et al. (2006) reported that ROKU can compensate for the disadvantages of the AIC-based method and of the entropy-based method proposed by Schug et al. (2005). The reports on the consistency between the results for a reduced number of samples and those for all the samples (Broberg P, 2003; Breitling et al. 2004) are of limited value because the results for all the samples were assumed to be correct (Jeffery et al. 2006). There is of course no guarantee, but it is probably safe to say that a higher number of samples should produce better results. Therefore, we still appreciate the advantages of the AIC-based method compared to Sprent’s method.

## Conclusion

We compared two outlier-detection-based methods previously used for the detection of tissue-selective genes. The AIC-based method was found to be better than Sprent’s non-parametric method in terms of robustness of the output against (1) a change in the parameter settings and (2) a reduction in the numbers of samples. These findings suggest that the use of the AIC-based method rather than Sprent’s method in the recently proposed ROKU method for detecting tissue-selective expression patterns was correct.

More work remains to be done. First, while the AIC-based method has clear advantages compared to Sprent’s method, the Bayesian information criterion (BIC) should also be applicable. It would be interesting to develop a BIC-based method and compare its performance to that of the AIC-based method. Second, the approach used here is not suitable for comparing ROKU with other methods such as the Tukey-Kramer’s honestly significant difference test due to their different output formats and the lack of genuine tissue-selective genes. We plan to develop a better approach for comparing a number of methods for detecting tissue-selective expression patterns.

## Additional File

**Additional file 1 (additional.txt) – includes all data analyzed using AIC-based method for dataset of Ge et al. (2005).**

For the original gene expression matrix, an outlier matrix (consisting of 1 for over-expressed outliers, −1 for under-expressed outliers, and 0 for non-outliers) is provided. It also contains an entropy score (*H*′) measured by ROKU.

^{(1.9M, txt)}

**Additional file 2 (r_code.txt) – R function of AIC-based method.**

The R function for the AIC-based method is provided. An example analysis using the hypothetical vector is also provided.

^{(1.2K, txt)}

## Acknowledgements

We are greatful to M. Abe for his helpful discussions and J. J. Rodrigue for helping to improve the English. This study was performed through Special Coordination Funds for Promoting Science and Technology of the Ministry of Education, Culture, Sports, Science and Technology, the Japanese Government. This study was also supported by a Grant-in Aid for Young Scientists (B) (19700273) to K. Kadota from the Ministry of Education, Culture, Sports, Science and Technology, the Japanese Government.

## References

- Breitling R, Armengaud P, Amtmann A, et al. Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett. 2004;573:83–92. [PubMed]
- Broberg P. Statistical methods for ranking differentially expressed genes. Genome Biol. 2003;4:R41. [PMC free article] [PubMed]
- Fan W, Pritchard JI, Olson JM, et al. A class of models for analyzing GeneChip gene expression analysis array data. BMC Genomics. 2005;6:16. [PMC free article] [PubMed]
- Ge XJ, Yamamoto S, Tsutsumi S, et al. Interpreting expression profiles of cancers by genome-wide survey of breadth of expression in normal tissues. Genomics. 2005;86:127–141. [PubMed]
- Greller LD, Tobin FL. Detecting selective expression of genes and proteins. Genome Res. 1999;9:282–296. [PMC free article] [PubMed]
- Jeffery IB, Higgins DG, Culhane AC. Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data. BMC Bioinformatics. 2006;7:359. [PMC free article] [PubMed]
- Kadota K, Nishimura SI, Bono H, et al. Detection of genes with tissue-specific expression patterns using Akaike’s Information Criterion (AIC) procedure. Physiol Genomics. 2003a;12:251–259. [PubMed]
- Kadota K, Tominaga D, Akiyama Y, et al. Detecting outlying samples in microarray data: A critical assessment of the effect of outliers on sample classification. Chem-Bio Informatics J. 2003b;3:30–45.
- Kadota K, Ye J, Nakai Y, et al. ROKU: a novel method for identification of tissue-specific genes. BMC Bioinformatics. 2006;7:294. [PMC free article] [PubMed]
- Konishi T. Three-parameter lognormal distribution ubiquitously found in cDNA microarray data and its application to parametric data treatment. BMC Bioinformatics. 2004;5:5. [PMC free article] [PubMed]
- Konishi T. Detection and restoration of hybridization troubles in Affymetrix GeneChip data by parametric scanning. Genome Inform. 2006;17:100–109. [PubMed]
- Liang S, Li Y, Be X, et al. Detecting and profiling tissue-selective genes. Physiol Genomics. 2006;26:158–162. [PubMed]
- Matthews BW. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta. 1975;405:442–451. [PubMed]
- Pavlidis P, Noble WS. Analysis of strain and regional variation in gene expression in mouse brain. Genome Biol. 2001;2:research 0042. [PMC free article] [PubMed]
- Schug J, Schuller WP, Kappen C, et al. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 2005;6:R33. [PMC free article] [PubMed]
- Ueda T. Simple method for the detection of outliers [in Japanese] Japanese J Appl Stat. 1996;25:17–26.
- Yanai I, Benjamin H, Shmoish M, et al. Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics. 2005;21:650–659. [PubMed]
- Zviling M, Leonov H, Arkin IT. Genetic algorithm-based optimization of hydrophobicity tables. Bioinformatics. 2005;21:2651–2656. [PubMed]

**Libertas Academica**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (371K) |
- Citation

- ROKU: a novel method for identification of tissue-specific genes.[BMC Bioinformatics. 2006]
*Kadota K, Ye J, Nakai Y, Terada T, Shimizu K.**BMC Bioinformatics. 2006 Jun 12; 7:294. Epub 2006 Jun 12.* - Large scale real-time PCR validation on gene expression measurements from two commercial long-oligonucleotide microarrays.[BMC Genomics. 2006]
*Wang Y, Barbacioru C, Hyland F, Xiao W, Hunkapiller KL, Blake J, Chan F, Gonzalez C, Zhang L, Samaha RR.**BMC Genomics. 2006 Mar 21; 7:59. Epub 2006 Mar 21.* - The distribution-based p-value for the outlier sum in differential gene expression analysis.[Biometrika. 2010]
*Chen LA, Chen DT, Chan W.**Biometrika. 2010 Mar; 97(1):246-253. Epub 2010 Jan 25.* - Filter versus wrapper gene selection approaches in DNA microarray domains.[Artif Intell Med. 2004]
*Inza I, Larrañaga P, Blanco R, Cerrolaza AJ.**Artif Intell Med. 2004 Jun; 31(2):91-103.* - Unsupervised outlier profile analysis.[Cancer Inform. 2014]
*Ghosh D, Li S.**Cancer Inform. 2014; 13(Suppl 4):25-33. Epub 2014 Oct 15.*

- Two-way AIC: detection of differentially expressed genes from large scale microarray meta-dataset[BMC Genomics. ]
*Tsuyuzaki K, Tominaga D, Kwon Y, Miyazaki S.**BMC Genomics. 14(Suppl 2)S9* - Literature aided determination of data quality and statistical significance threshold for gene expression studies[BMC Genomics. ]
*Xu L, Cheng C, George EO, Homayouni R.**BMC Genomics. 13(Suppl 8)S23* - Evaluating methods for ranking differentially expressed genes applied to microArray quality control data[BMC Bioinformatics. ]
*Kadota K, Shimizu K.**BMC Bioinformatics. 12227* - A weighted average difference method for detecting differentially expressed genes from microarray data[Algorithms for Molecular Biology : AMB. ]
*Kadota K, Nakai Y, Shimizu K.**Algorithms for Molecular Biology : AMB. 38*

- PubMedPubMedPubMed citations for these articles

- Evaluation of Two Outlier-Detection-Based Methods for Detecting Tissue-Selective...Evaluation of Two Outlier-Detection-Based Methods for Detecting Tissue-Selective Genes from Microarray DataGene Regulation and Systems Biology. 2007; 1()9

Your browsing activity is empty.

Activity recording is turned off.

See more...