![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright © 2008 by the authors An Integrative Analysis of microRNA and mRNA Expression—A Case Study Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, New York, New York, U.S.A Correspondence: Li-Xuan Qin. 307 E. 63rd St, New York, NY 10021, U.S.A. 646-735-8116; Email: qinl/at/mskcc.org This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). Abstract Background MicroRNAs are believed to play an important role in gene expression regulation. They have been shown to be involved in cell cycle regulation and cancer. MicroRNA expression profiling became available owing to recent technology advancement. In some studies, both microRNA expression and mRNA expression are measured, which allows an integrated analysis of microRNA and mRNA expression. Results We demonstrated three aspects of an integrated analysis of microRNA and mRNA expression, through a case study of human cancer data. We showed that (1) microRNA expression efficiently sorts tumors from normal tissues regardless of tumor type, while gene expression does not; (2) many microRNAs are down-regulated in tumors and these microRNAs can be clustered in two ways: microRNAs similarly affected by cancer and microRNAs similarly interacting with genes; (3) taking let-7f as an example, targets genes can be identified and they can be clustered based on their relationship with let-7f expression. Discussion Our findings in this paper were made using novel applications of existing statistical methods: hierarchical clustering was applied with a new distance measure—the co-clustering frequency—to identify sample clusters that are stable; microRNA-gene correlation profiles were subject to hierarchical clustering to identify microRNAs that similarly interact with genes and hence are likely functionally related; the clustering of regression models method was applied to identify microRNAs similarly related to cancer while adjusting for tissue type and genes similarly related to microRNA while adjusting for disease status. These analytic methods are applicable to interrogate multiple types of -omics data in general. Keywords: clustering, expression, microarray, microRNA Background MicroRNAs (miRNAs) are a class of small non-coding RNAs that are believed to regulate gene expression [1, 2]. The first two miRNAs, lin-4 and let-7, were experimentally discovered in 1993 and 2000 [3, 4]. Since then more than 4300 miRNAs have been identified in plants, animals, and viruses using cDNA sequencing and computational predictions [5–8]. MiRNAs regulate their target genes, through base-pairing, by inducing mRNA degradation and translational repression [1, 2]. In humans, miRNAs might regulate as many as a third of the protein coding genes [9]. MiRNAs are likely to have an important impact on development in various cellular processes, such as cancer. MiRNAs have been shown to be linked to a number of cancer types in several studies on individual miRNAs [10, 11]. Cancer is a complex and heterogeneous disease whose initiation and progression are influenced by a variety of molecular changes [12]. A complete characterization of the genomic changes may help predict the pathologic behavior of cancer. Genome-wide profiling of gene expression has been increasingly applied in clinical settings to understand genomic changes at the mRNA level. Examples can be found in breast, colon, ovarian, and prostate cancer [13–16]. Such knowledge has improved our understanding of cancer biology and facilitated the discovery of new cancer subtypes and new biomarkers for cancer diagnosis, prognosis, and treatment [17, 18]. Large-scale expression profiling has recently become available for miRNAs as well [19]. Profiling methods for miRNA expression are mostly based on glass-slide microarrays [20–22] and the latest development is the bead-based flow cytometry technique [23]. Genome-wide miRNA studies allow the investigation of genomic changes in cancer at the miRNA level and are likely to provide additional clues to the mechanisms of tumorigenesis [23–25]. In particular, when miRNA and mRNA expression are both measured on the same samples, an integrative analysis can be performed to compare miRNAs and mRNAs profiles and to study their interaction patterns. The goal of this paper is to demonstrate three aspects of such an integrative analysis through novel applications of existing statistical methods. In this paper, we will focus on the following three aspects of an integrative analysis of mRNA and miRNA expression data, while taking into account of sample phenotype data (for example, tumor versus normal samples).
We will demonstrate these three aspects of an integrative analysis using a published study of miRNA and mRNA expression in various types of tumor samples [23]. A set of 46 samples, whose miRNA expression and gene expression were both measured, was used in our analysis (Supplementary Table 1). These 46 samples consist of 28 tumor samples belonging to five tissue types and their 18 normal counterparts (>1 normal per tissue type). MiRNAs and genes with truncated values in >10% samples are excluded, which results in 128 miRNAs and 7149 genes in our analysis. Results Clustering samples Pioneered by Eisen et al. [26], hierarchical clustering is the most commonly used method for sample clustering using expression profiles. With hierarchical clustering, a distance measure is calculated between the expression profiles of each gene (or gene cluster) pair, and a recursive bottom-up or top-down algorithm is then employed to merge or split genes based on their distance. Examples of distance measures include the Euclidean distance and one minus the Pearson correlation coefficient. Hierarchical clustering does not require the number of clusters to be pre-specified and has nice visualization properties (dendrogram and heatmap). Similar to many other clustering algorithms, a well-recognized drawback of hierarchical clustering, however, is that it always generates a clustering even when there is no real underlying clustering in the data. It is not apparent whether the clustering structure reflects a ‘true’ pattern in the data or is just an artefact of the clustering algorithm. Methods based on resampling have been proposed to evaluate the significance of a clustering [27–29]. These methods simulate perturbations of the original data and assess the stability of the clustering results. Also based on resampling, Monti et al. proposed a method, called ‘consensus clustering’, that makes use of the resampling results to guide clustering [30]. Briefly, consensus clustering quantifies the agreement among clustering runs over the perturbed data sets, measured by a consensus matrix whose elements are the frequency that two samples are clustered together, and then performs hierarchical clustering using the consensus matrix as similarity matrix. In the consensus clustering, the co-clustering frequency measure counts co-clustering frequency of two samples among perturbed data sets that include both samples. Instead, we apply the clustering of each perturbed data set to classify samples in the original data set using the nearest-centroid method and then count the frequency of two samples being classified together among all perturbations. We will call this method as ‘stable hierarchical clustering’. We used a partitional clustering method, PAM (partitioning around medoids) [31], to cluster each perturbed data set in this paper. Details of the stable hierarchical clustering method are provided in Method section. We first applied stable hierarchical clustering to identify stable sample clusters based on miRNA expression (Fig. 1a
We also applied stable hierarchical clustering to cluster samples based on gene expression. It did not separate tumors from normal samples as efficiently and it tended to recognize tissue types rather than disease status (Fig. 1b Hierarchical clustering has been applied to cluster samples in the original publication by Lu et al. [23]; however, they clustered tumor samples only and discovered clusters reflecting various tumor characteristics. We showed, through subjecting both tumors and normals to clustering and adopting a new distance measure for clustering, that miRNA expression can clearly distinguish tumors from normals, regardless of tissue type. Lu et al. showed the inferiority of mRNA expression in distinguishing GI versus Non-GI tumors, while we showed its inferiority in distinguishing tumors versus normals. Identifying and clustering cancer-related miRNAs Filtering has been commonly adopted as a useful pre-processing step, to remove uninformative genes and to reduce computational burden, for gene clustering. We applied similar filtering step for miRNA clustering. Specifically, we selected a subset of miRNAs that are related to cancer, by modelling miRNA expression using a per-gene linear regression model with disease status and tissue type as covariates [32] and evaluating the significance of the correlation with disease status using an Empirical Bayesian t test [33]. Among the 128 miRNAs, 89 (70%) were found to be related to cancer at the significance level of 0.001. The 89 miRNAs were all down-regulated in tumors. In the subsequent cluster analysis, we focused on a subset of 38 miRNAs, which had both a small p-value (<0.001) and a large fold change (>3) (Table 1, Fig. 2
To better understand the grouping structure among these 38 cancer-related miRNAs, we clustered them in two ways. One is to identify groups of miRNAs that similarly interact with genes. The other is to identify groups of miRNAs that are similarly affected by disease status. MiRNAs were clustered based on their correlation patterns with gene expression. The correlation matrix between miRNA expression and gene expression were calculated and graphically displayed with a heatmap, where rows are genes and columns are miRNAs. MiRNAs were then clustered based on this correlation matrix, so that miRNAs with similar correlation profiles (ie. similar columns of the correlation matrix) were clustered together. For the clustering step, the hierarchical clustering method was employed, with the Euclidean distance as the distance measure. MiRNAs closer on the dendrogram share similar correlation pattern with genes and are thus likely functionally related. The miRNA-gene correlation heatmap and the corresponding miRNA clustering were generated separately for tumor samples and normal samples (Figs. 3a–b
MiRNAs were also clustered based on their relationship to disease status. A new clustering method, the clustering of regression models (CORM) method, models expression using regression and assumes that miRNAs in the same cluster share the same regression coefficients [34]. This method tends to provide more stable clustering than K-means clustering, as it explicitly models different sources of variations and bases clustering solely on the systematic variation [35]. Using the CORM method with disease status and tissue type as the covariates, miRNAs were clustered so that miRNAs in the same cluster have similar mean expression among tumor samples and normal samples for each tissue type (Table 1, Fig. 4
Identifying and clustering target genes for let-7f MiRNAs are thought to negatively regulate mRNA in one of two ways depending on the degree of complementarity between the miRNA and its target [37]: (1) miRNAs that bind perfectly to their target’s coding sequence are thought to result in mRNA degradation, and (2) miRNAs that bind with imperfect complementarity to the 3′ UTR block target gene expression at the level of protein translation. Target gene prediction is an important but complicated task for miRNA studies [38]. Several algorithms have been proposed for miRNA target prediction, which mostly rely on the assumption of base pairing and evolutionary conservation. Examples of the sequence-based prediction algorithms include MiRanda, PicTar, and TargetScanS [39–42]. A large number of miRNA targets are still unknown [43]. MiRNA expression profiling provides an alternative for identifying target genes, especially those targeted through degradation, by correlating miRNA and gene expression. It can potentially provide in vivo evidence of gene targeting, as opposed to the in silico evidence provided by the sequence-based prediction algorithms. We took let-7f as an example to demonstrate how to identify and cluster miRNA target genes using miRNA and gene expression. For each gene, the expression level is modeled using a linear regression model with let-7f expression, disease status, and their interaction as the covariates. This full model is then compared to a reduced linear regression model with disease status as the covariate, using a likelihood ratio test, to evaluate the association between gene expression and let-7f expression. A set of 178 genes showed significant association with let-7f expression using a significance cut-off of p-value = 0.001 (Supplementary Table 2). These 178 are potential let-7f target genes, with a false discovery rate of about 0.015 [44]. These 178 genes are enriched in nucleic acid binding and regulation of DNA replication or transcription; a number of the predicted target genes are related to cancer, such as RASSF7, RAB34, ARAF, BCL2L14, MLL3, MORF4L2, PERP, and SELENBP1. The RAS family has been shown to be let-7 targets experimentally [45]. In our analysis, RASSF7 (alias HRAS1) and RAB34 (member of RAS super-family) were predicted to be let-7f targets. Specifically, RASSF7 is negatively correlated with let-7f expression among normal samples and positively correlated among tumors (Fig. 5a
Using CORM with let-7f expression, disease status, and their interaction as covariates, the 178 genes were clustered so that genes in the same cluster vary similarly as let-7f varies given the disease status (Supplementary Table 2). Interestingly, clusters 1–4 seem to be mirror images of clusters 5–8, respectively (Fig. 6
Methods Stable hierarchical clustering Stable hierarchical clustering groups samples based on the co-clustering frequency among repeated bootstrap sampling. Specifically, (i) bootstrap sample sets are generated by resampling with replacement from the original sample set; (ii) for each bootstrap sample set, samples were partitioned to a pre-specified number of clusters using PAM (partitioning around medoids) method and the corresponding cluster centers are applied to classify the samples in the original sample set using the nearest-centroid method; (iii) for each sample pair in the original sample set, the frequency of being assigned to the same cluster is calculated across bootstrap sample sets. The co-clustering frequency is then used as a similarity measure for hierarchical clustering to identify stable clusters of samples. Clustering of regression models method As for per-gene regression analysis, CORM uses regression to model systematic variation in expression levels but, in addition, assumes that genes in the same cluster share the same values of regression coefficients [34]. To identify gene clusters, CORM was applied using the same regression model as for their per-gene analysis. Let Xgi (ngi × p) denote the design matrix for gene g and sample i, Fβk,ξk the conditional distribution of genes in cluster k given the covariates with parameters βk and ξk, βk (p × 1) the vector of regression coefficients, and μ(.; .) the regression function. The model underlying CORM can be written as where ug is a random variable on (1, 2, …, K) with probabilities (π1, π2, …, πK). Complete specification of the CORM modeling framework requires identification of the error structure (parameterized by ξ), which depends on the form of the regression model. The specific form of the regression model used for CORM is flexible. For example, it can be the linear model, the linear mixed model, the nonlinear model, and the nonparametric regression model. Its choice should depend on the experimental design and the scientific question. The clustering of linear models (CLM) method can be applied to cross-sectional data to find genes whose expression levels are similarly related to a set of covariates. In cross-sectional data, a single expression value is measured for a gene on a sample; hence, ygi reduces to ygi and Xgi to xgi. The underlying model for CLM can be written as The EM algorithm can be used to fit the CLM model, and implementation details can be found in [34]. For the analysis of the human cancer data in this paper, the CLM method was used to (1) identify miRNAs similarly related to disease status and tissue type with disease status and tissue type as covariates, and (2) identify genes similarly related to let-7f and disease status with let-7f expression level, disease status, and their interaction as covariates. CLM is closely related to K-means clustering, both being partitional clustering; however, K-means clusters gene based on the expression levels directly, while CLM based on the relationship between expression and the covariates and hence pools information across samples. Comparing to K-means, CLM tends to identify more stable clusters across samples [35]. Discussion The study of miRNAs has received a lot of attention lately [1, 2]. There is evidence that miRNAs are involved in animal development and cell cycle regulation. MiRNAs might play an important role in cancer and in regulating cancer-related genes [10, 11, 23, 45]. Our findings in this paper suggest the significance of microRNA expression itself in cancer diagnosis. Using let-7f as an example, we showed that its expression is correlated with the expression of a number of known oncogenes and the directionality of the correlation (positive correlation versus negative correlation) may be different in tumors and in normal tissues. Target prediction is an important component in understanding miRNAs and their functions. As an alternative to existing sequence-based algorithms, an expression-based strategy for miRNA target prediction was proposed in this paper and its feasibility was demonstrated through an application to a human cancer data set. Like the sequence-based predictions, the expression-based predictions also have limitations. For example, correlation is not direct interaction and genes correlated with a miRNA might be down-stream genes of miRNA direct targets. Rather the two predictions are complementary and could be combined to prioritize candidate targets for experimental validation. Many types of high-throughput ‘-omics’ data have recently emerged, such as gene copy number, gene expression, and proteomics data. The interpretation and integration of these data pose a challenge for both experimental and quantitative scientists in this field. The analytic methods in this paper provide a new tool to interrogate these high-throughput data in an integrative fashion. In particular, CORM has been previously applied to data collected under various experimental designs, such as cross sectional, longitudinal with no replication, and longitudinal with replications. In this paper, we demonstrated yet another application of CORM to clustering miRNAs or genes with respect to specific covariates of interests. We have focused on the cluster analysis in this paper, including the clustering of samples, miRNAs, and genes. These exploratory analyses are one aspect of an integrative analysis of miRNA and gene expression. We will investigate other types of integrative analysis in the future to gain a better understanding of the relationship between miRNAs and genes as well as their joint behaviors. Table S2: Predicted target genes for let-7f based on expression profiles List of the 178 genes predicted as targets for let-7f, together with their CORM cluster memberships. Click here to view.(85K, doc) Acknowledgments We are grateful for the Golub group for making their data publicly available. We cordially thank Dr. Colin Begg for insightful comments to the manuscript. This work was supported in part by NIH grants 2P01CA047179-15A2 and UL1RR024996 of the Clinical and Translation Science Center at Weill Cornell Medical College. References 1. Ambros V. The functions of animal microRNAs. Nature. 2004;431:350–5. [PubMed] 2. Bartel D. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116:281–97. [PubMed] 3. Lee RC, et al. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75:843–54. [PubMed] 4. Reinhart B, et al. The 21-nucleotide let-7 RNA regulates developmental timing in Caenorhabditis elegans. Nature. 2000;403:901–6. [PubMed] 5. Lai EC, et al. Computational identification of Drosophila microRNA genes. Genome Biology. 2003;4:R.42. 6. Lim LP, et al. The microRNAs of Caenorhabditis elegans. Genes and Development. 2003;17:991–1008. [PubMed] 7. Lim LP, et al. Vertebrate microRNA genes. Science. 2003;299:1540. [PubMed] 8. Griffiths-Jones S, et al. miRBase: microRNA sequences, targets and gene nomenclature. NAR. 2006;34:D140–144. [PubMed] 9. Lewis B, et al. Conserved seed pairing, often flanked by adenosine, indicates that thousands of human Genes are microRNA targets. Cell. 2005;120:15–20. [PubMed] 10. He L, et al. A microRNA polycistron as a potential human oncogene. Nature. 2005;435:828–33. [PubMed] 11. O’Donnell KA, et al. c-Myc-regulated microRNAs modulate E2F1 expression. Nature. 2005;435:839–43. [PubMed] 12. Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100:57–70. [PubMed] 13. van ‘t Veer LJ, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–6. [PubMed] 14. Wang Y, et al. Gene expression profiles and molecular markers to predict recurrence of Dukes’ B. colon cancer. Journal of Clinical Oncology. 2004;22:1564–71. [PubMed] 15. Spentzos D, et al. Unique gene expression profile based on pathologic response in epithelial ovarian cancer. Journal of Clinical Oncology. 2005;23:7911–8. [PubMed] 16. Stephenson AJ, et al. Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy. Cancer. 2005;104:290–8. [PubMed] 17. Strausberg RL, et al. Oncogenomics and the development of new cancer therapies. Nature. 2004;429:469–74. [PubMed] 18. Jares P, Campo E. Genomic platforms for cancer research: potential diagnostic and prognostic applications in clinical oncology. Clinical Translational Oncology. 2006;8:161–72. [PubMed] 19. Kim V, Nam J. Genomics of microRNA. Trends in Genetics. 2006;22:165–73. [PubMed] 20. Thomson J, et al. A custom microarray platform for analysis of microRNA gene expression. Nature Methods. 2004;1:47–53. [PubMed] 21. Nelson P, et al. Microarray-based, high-throughput gene expression profiling of microRNAs. Nature Methods. 2004;1:155–161. [PubMed] 22. Liu C, et al. An oligonucleotide microchip for genome-wide microRNA profiling in human and mouse tissues. PNAS. 2004;101:9740–4. [PubMed] 23. Lu J, et al. MicroRNA expression profiles classify human cancers. Nature. 2005;435:834–8. [PubMed] 24. Calin GA, Croce CM. MicroRNA-cancer connection: The beginning of a new tale. Cancer Research. 2006;66:7390–4. [PubMed] 25. Garzon R, et al. MicroRNA expression and function in cancer. Trends in Molecular Medicine. 2006;12:580–7. [PubMed] 26. Eisen MB, et al. Cluster analysis and display of genome-wide expression patterns. PNAS. 1998;95:14863–8. [PubMed] 27. Kerr MK, Churchill G. Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments. PNAS. 2001;98:8961–5. [PubMed] 28. Dudoit S, Fridlyand J. A prediction-based resampling method for estimating the number of clusters in a dataset. Genome Biology. 2002;3:RESEARCH0036. [PubMed] 29. Tibshirani, et al. Cluster validation by prediction strength. Journal of Computational and Graphical Statistics. 2005;14:511–28. 30. Monti S, et al. Consensus clustering: A resampling-based method for Class discovery and visualization of gene expression microarray data. Machine Learning. 2003;52:91–118. 31. Kaufman L, Rousseeuw PJ. Finding groups in data. New Jersey: Wiley Interscience; 1990. 32. van Belle G, et al. Biostatistics: a methodology for the health sciences. 2. New Jersey: Wiley Interscience; 2004. 33. Smyth GK. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology. 2004;3 Article3. 34. Qin LX, Self SG. The clustering of regression models method with applications to gene expression data. Biometrics. 2006;62:526–33. [PubMed] 35. Qin LX, Self SG. On comparing the clustering of regression models method with K-means clustering. Working Paper. Memorial Sloan-Kettering Cancer Center. Department of Epidemiology and Biostatistics. 2007 36. Lagos-Quintana M, et al. New microRNAs from mouse and human. RNA. 2003;9:175–9. [PubMed] 37. Farh K, et al. The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science. 2005;310:1817–21. [PubMed] 38. Brennecke J, et al. Principles of microRNA–target recognition. PLoS Biol. 2005;3:e85. [PubMed] 39. Enright AJ, et al. MicroRNA target detection. Genome Biology. 2003;5:R.1. 40. Krek A, et al. Combinatorial microRNA target predictions. Nature Genetics. 2005;37:495–500. [PubMed] 41. Lewis B, et al. Prediction of mammalian microRNA targets. Cell. 2003;115:787–98. [PubMed] 42. Sethupathy P, et al. A guide through present computational approaches for the identification of mammalian microRNA targets. Nature Methods. 2006;3:881–6. [PubMed] 43. Sethupathy P, et al. TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA. 2006;12:192–7. [PubMed] 44. Storey JD, Tibshirani R. Statistical significance for genome-wide studies. PNAS. 2003;100:9440–5. [PubMed] 45. Johnson SM, et al. RAS is regulated by the let-7 microRNA family. Cell. 2005;120:635–647. [PubMed] 46. Croce CM, Calin GA. miRNAs, cancer, and stem cell division. Cell. 2005;122:6–7. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
Nature. 2004 Sep 16; 431(7006):350-5.
[Nature. 2004]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Cell. 1993 Dec 3; 75(5):843-54.
[Cell. 1993]Nature. 2000 Feb 24; 403(6772):901-6.
[Nature. 2000]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D140-4.
[Nucleic Acids Res. 2006]Cell. 2000 Jan 7; 100(1):57-70.
[Cell. 2000]Nature. 2002 Jan 31; 415(6871):530-6.
[Nature. 2002]Cancer. 2005 Jul 15; 104(2):290-8.
[Cancer. 2005]Nature. 2004 May 27; 429(6990):469-74.
[Nature. 2004]Clin Transl Oncol. 2006 Mar; 8(3):161-72.
[Clin Transl Oncol. 2006]Trends Genet. 2006 Mar; 22(3):165-73.
[Trends Genet. 2006]Nat Methods. 2004 Oct; 1(1):47-53.
[Nat Methods. 2004]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9740-4.
[Proc Natl Acad Sci U S A. 2004]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]Trends Mol Med. 2006 Dec; 12(12):580-7.
[Trends Mol Med. 2006]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]Proc Natl Acad Sci U S A. 1998 Dec 8; 95(25):14863-8.
[Proc Natl Acad Sci U S A. 1998]Proc Natl Acad Sci U S A. 2001 Jul 31; 98(16):8961-5.
[Proc Natl Acad Sci U S A. 2001]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]Biometrics. 2006 Jun; 62(2):526-33.
[Biometrics. 2006]RNA. 2003 Feb; 9(2):175-9.
[RNA. 2003]Science. 2005 Dec 16; 310(5755):1817-21.
[Science. 2005]PLoS Biol. 2005 Mar; 3(3):e85.
[PLoS Biol. 2005]Nat Methods. 2006 Nov; 3(11):881-6.
[Nat Methods. 2006]RNA. 2006 Feb; 12(2):192-7.
[RNA. 2006]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9440-5.
[Proc Natl Acad Sci U S A. 2003]Cell. 2005 Mar 11; 120(5):635-47.
[Cell. 2005]Biometrics. 2006 Jun; 62(2):526-33.
[Biometrics. 2006]Biometrics. 2006 Jun; 62(2):526-33.
[Biometrics. 2006]Nature. 2004 Sep 16; 431(7006):350-5.
[Nature. 2004]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Nature. 2005 Jun 9; 435(7043):828-33.
[Nature. 2005]Nature. 2005 Jun 9; 435(7043):839-43.
[Nature. 2005]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]