![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||
Copyright © 2007 Liang et al; licensee BioMed Central Ltd. Characterization of microRNA expression profiles in normal human tissues 1Molecular and Cell Biology-R&D, Applied Biosystems, Foster City, CA 94404, USA Corresponding author.Yu Liang: Yu.Liang/at/appliedbiosystems.com; Dana Ridzon: ridzonda/at/appliedbiosystems.com; Linda Wong: Linda.Wong/at/appliedbiosystems.com; Caifu Chen: ChenCX/at/appliedbiosystems.com Received February 20, 2007; Accepted June 12, 2007. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background Measuring the quantity of miRNAs in tissues of different physiological and pathological conditions is an important first step to investigate the functions of miRNAs. Matched samples from normal state can provide essential baseline references to analyze the variation of miRNA abundance. Results We provided expression data of 345 miRNAs in 40 normal human tissues, which identified universally expressed miRNAs, and several groups of miRNAs expressed exclusively or preferentially in certain tissue types. Many miRNAs with co-regulated expression patterns are located within the same genomic clusters, and candidate transcriptional factors that control the pattern of their expression may be identified by a comparative genomic strategy. Hierarchical clustering of normal tissues by their miRNA expression profiles basically followed the structure, anatomical locations, and physiological functions of the organs, suggesting that functions of a miRNA could be appreciated by linking to the biologies of the tissues in which it is uniquely expressed. Many predicted target genes of miRNAs that had specific reduced expression in brain and peripheral blood mononuclear cells are required for embryonic development of the nervous and hematopoietic systems based on database search. Conclusion We presented a global view of tissue distribution of miRNAs in relation to their chromosomal locations and genomic structures. We also described evidence from the cis-regulatory elements and the predicted target genes of miRNAs to support their tissue-specific functional roles to regulate the physiologies of the normal tissues in which they are expressed. Background MicroRNAs (miRNAs) belong to a family of small non-coding RNAs (18~22 nucleotides) that interact with their target coding mRNAs to inhibit translation by either degradation of the mRNAs, or blocking translation without degrading the targets [1]. Significant conservation of individual miRNAs across different species suggests their functional importance. It has been shown in several animal models that miRNAs participate in determination of cell fate, pattern formation in embryonic development, and in control of cell proliferation, cell differentiation, and cell death [2]. Therefore, it is reasonable to speculate that miRNAs are also involved in human diseases such as cancers [3]. Several groups of miRNAs have been identified to regulate the expression of tumor-associated genes [4], while others seem to hold prognostic value in predicting the survival of cancer patients [5]. Chromosomal location and genomic distribution of a miRNA gene are important determinants of its expression from at least three perspectives. First, about 80% of miRNA genes are located within introns of defined transcription units [6], and their expression is frequently correlated with the expression profiles of their host genes [7,8]. Second, many miRNA genes are distributed as clusters, and a microarray expression profiling of 175 miRNAs in 24 human tissues showed that proximally paired miRNA genes at a distance up to 50 kb are generally co-expressed [8]. The best example may be the four miRNA genes (miR-196b, miR-10a, miR-196a-2, and miR-10b) that are embedded in the Hox gene clusters (Hox A, Hox B, Hox C, and Hox D, respectively). By histochemical staining and in situ hybridization, expression patterns of miR-10a and Hoxb4 mRNA are very similar, suggesting that they share regulatory control of transcription [9]. Lastly, miRNA genes are frequently located at fragile sites, as well as in regions of loss of heterozygosity, regions of amplification, or common breakpoint regions [10]. Expression of miRNA genes within the regions afflicted by chromosomal aberration, a hallmark characteristic of neoplastic cells, could also be directly affected. For example, miR-15a and miR-16-1 are located at a frequently deleted site in most of the B cell chronic lymphocytic leukemia patients [11], and induce apoptosis in a leukemia cell line model [12]. Most expression profiling of miRNAs in normal human tissues has been explored in a rather small collection of tissues or miRNAs, in which some of them were restricted by time-consuming and laborious strategies such as Northern blotting or cloning [13]. One report used a bead-based detection platform to profile expression of 217 miRNAs in a broad spectrum of normal human tissues, but low sensitivity and specificity make the results problematic for miRNAs that are less abundant [14]. Microarrays have the advantage of high throughput and was used for profiling expression of miRNAs [8,15,16], but they have the same concern of sensitivity, and it might be difficult for them to differentiate closely related miRNAs in sequences. Sensitivity is always a major obstacle to examine tissue-specific expression patterns of miRNAs with low abundance. A new type of real time reverse transcription (RT)-PCR-based miRNA assays were recently developed that have better sensitivity and specificity compared to bead- and microarray-based technologies [17]. We used these assays to examine global profiles of distribution and expression of 345 unique miRNAs in 40 normal human tissues, so we could identify tissue-specific miRNAs that provide foundations to pursue diagnostic and therapeutic targets as well as molecular mechanisms underlying the phenotypic diversity of different tissues, and present universal baselines for investigating variations in miRNA expression under physiological or pathological conditions. Our data were also combined with public datasets to systematically analyze the association between genomic locations of miRNAs and their expression, and the correlation of expression between miRNAs and their predicted target genes. Results Hierarchical clustering of normal human tissues is mainly based on their anatomical locations and physiological functions using the miRNA expression profiles To assess the reproducibility of the TaqMan® MicroRNA Assays, we characterized the expression of miRNAs in three brain, two testes, and two peripheral blood mononuclear cells (PBMC) specimens. We found a high concordance between the three brain specimens (r = 0.945 and 0.952, respectively, Figure Figure1A),1A
The expression of 345 human miRNAs was quantitated in a spectrum of 40 normal human tissues that included specimens derived from brain, muscle, circulatory, respiratory, lymphoid, gastrointestinal, urinary, reproductive, and endocrine systems (see Additional file 1). We employed an unsupervised hierarchical clustering based on the variation of expression for each miRNA across the specimens examined to explore the correlation between different tissue types. In general, normal human tissues derived from similar anatomical locations or with related physiological functions were primarily clustered together (Figure (Figure2).2
However, the clustering patterns among some tissue types by their mRNA and our miRNA expression profiles were quite different (see Additional file 2). Lung was clustered together with female reproductive organs and esophagus by mRNA expression profile, but miRNA expression profile of lung was only similar to that of Fallopian tube. Thyroid was similar to different parts of the heart in miRNA expression but not in mRNA expression. Liver was clustered together with the gastrointestinal organs and kidney by mRNA expression profiles but not when the miRNA expression profiles were used. Brain, PBMC, thymus, adrenal gland, and testes formed a unique cluster separate from the other tissues by their miRNA expression profiles, but such similarities in mRNA expression profiles between these five tissue types were only observed separately between brain and testes, and between thymus and PBMC [18]. Some groups of miRNAs demonstrated highly differential patterns of tissue distribution that were not seen in the mRNA profiles. For example, a prominent expression of a group of miRNAs (miR-141, miR-200 family, miR-429, miR-375, and miR-31) mainly in epithelial tissues, such as lung, breast, and the gastrointestinal organs (r = 0.72, Figure Figure2),2 Localization in the same genomic cluster is the most recognizable feature for miRNAs that have correlated abundance and expression patterns among tissues Centered expression data for each miRNA as shown in Figure Figure22
Estimated average copy numbers converted from CT values for all miRNAs examined in placenta and PBMC were approximately 1,500 and 100 copies, respectively (Figure (Figure4,4
Three criteria were used to define universally expressed miRNAs if their (1) average CT values in all tissues were less than 30, (2) standard deviations of CT values in all tissues were less than 0.8, and (3) their maximal and minimal CT values in tissues differed by less than 4 (yellow highlighted miRNAs in the Additional file 1). Since the 15 miRNAs identified, which include the 4 miRNAs used to normalize our data (see Methods), showed rather consistent expression levels in the extensive list of tissues we surveyed, they would be strong candidates for normalizing miRNA expression should the types of normal human tissues beyond our list be examined. One of the 15 miRNAs, miR-16, has been found abundantly expressed in all tissues and was used as a control in several systems including animal models [19]. Because we did not have replicate samples for most tissue types, it is not sensible to use class prediction/marker discovery programs to identify tissue-specific miRNA markers or differentially expressed miRNAs. Instead, we used the following four criteria to select eight miRNA groups (Figure (Figure5).5
Group I (r = 0.937) that contains miR-1 and miR-133a/b showed highest expression in different parts of the heart and skeletal muscle as well as in vena cava, and in, unexpectedly, thyroid. This expression pattern is consistent with previous observations in their localization and functional analysis [8,20]. However, it was not previously appreciated that much lower expression of miR-1 and miR-133a/b was seen in some non-heart, non-skeletal muscle tissues. These are "hollow" organs composed of smooth muscle-containing wall, such as the gastrointestinal system, suggesting that expression of miR-1 and miR-133a/b might mark some features shared by different muscle types (i.e., skeletal, cardiac, and smooth). The group II shown in Figure Figure55 The miRNAs in some of the eight differentially expressed groups identified in Figures Figures55
Intronic miRNAs and their host genes have correlated expression patterns in normal tissues It is believed that miRNAs positioned in introns frequently have the same expression patterns to their host genes, and data from meta-analysis and RT-PCR have been used to validate some of the genes [7,8]. We reasoned that our miRNA expression profiling could reproduce these observations. We compiled a list of intronic miRNAs from a published literature based on the following rules modified from a previous report [7]: (1) the host gene and the miRNAs are transcribed from the same strand of DNA; (2) the host gene is a protein-coding gene with defined gene name and protein domains that link to its possible biological functions; (3) the miRNA does not have extra copies in other part of the genome since the transcription of each copy of the miRNA gene could be regulated by different mechanisms that would confound the result of our analyses. Among the 31 miRNAs qualified (Table 2), 22 of them had significant correlation (p < 0.05) with their host genes in expression among 19 tissue types. If the two miRNAs that had marginal correlation (p values between 0.05 and 0.07) was included (they could still be significant due to the use of different databases in this comparative study), total 77% of the miRNAs in our list had coherent expression patterns with their host genes. This result further corroborates the hypothesis that expression of intronic miRNAs is co-regulated with their host genes, and it also identifies the host genes that could be surrogate markers for expression of their intronic miRNAs.
Combination of predicting transcription factor binding sites, sequence comparison, and expression analyses identifies candidate factors contributing to the tissue-specific expression of miRNA We have shown that most of the miRNAs in the same differentially expressed group are located within the same genomic clusters, suggesting the presence of common regulatory mechanisms to their expression. Genomic sequences flanking these clusters may contain regulatory elements that control expression of these miRNAs. Predicting candidate transcription factors that might be associated with tissue-specific expression of miRNAs would offer valuable information to elucidate how miRNAs participate in cell differentiation and tissue specification. Although it is difficult to distinguish whether the presence of a cis-regulatory element is truly functional or a stochastic event without performing experimental validation such as chromatin immunoprecipitation [20], it has been shown that evolutionarily conserved non-coding genomic sequences is more likely to have a functional role and a better source to search transcription factor-binding sites [22]. We sought to provide the proof of concept that a comparative genomics-based resource using human and mouse sequences such as GenomeTraFac [23] can detect putative cis-regulatory regions that may contribute to tissue-specific expression of miRNAs in some of the eight differentially expressed groups (Figure (Figure6A6A
We first tested the feasibility by examining the 2 kb-upstream sequence of the miR-1 cluster at chromosome 18 in the muscle-specific expression group because a number of myogenic factors are known to bind to the upstream sequences of muscle-specific miRNAs [20]. We found one peak larger than 30 "Hits" (defined by the number of transcription factor binding sites shared by human and mouse within a 200 bp window), the maximal "Hits" value from the software's graphic output, without particular surge of frequency of transcription factor-binding site in both human and mouse sequences (see the left "Hits" peak circled with red in the upper part of the Additional file 5). The transcription factor-binding site prediction showed two MyoD-binding sites in this region. Another region close to the edge of this 2 kb segment had another larger-than-30 "Hits" peak but no MyoD-binding site was seen. There were another two peaks with "Hits" close to 30 about 500 bp downstream to the MyoD-binding site-containing peak that actually had two MyoD-binding sites (one for each, data not shown). We reasoned that prioritizing the genomic sequences for subsequent analysis would be critical if the transcription factor of our interest is unknown (unlike MyoD in this test case), so we chose the sequence with "Hits" larger than 30 as the first criteria for transcription factor-binding site prediction. The sequences from the graphic output generally showed larger than 40% identical between human and mouse, so we set 50% identical between human and mouse as a threshold. Expression of the transcription factors binding to the predicted sites was further examined in a host of more than 60 normal human tissues from the database of the Genomics Institute of the Novartis Research Foundation (GNF) [24], and the transcription factors with the same tissue distribution as the tissue-specific miRNAs would conceivably be a favorable (certainly not only) target for future validation. We applied this searching strategy to the three miRNAs in the group III (brain) and identified a zinc finger protein, MOK-2 (ZFP239) (Figure (Figure6B6B This searching strategy would likely fail if the transcription factor-binding sites are not conserved between human and mouse although sequence homology in general is acceptable (for example, 50% identical). This was best represented by the search of transcription factors for the group V and group VIII miRNAs. In the case of group V, the upstream sequence of the miR-302b locus had a peak of "Hits" but no binding site for heart-specific factors could be found from that region. However, a region closer to miR-302b showed a significant spike in binding-site frequency despite the very low "Hits" (Figure (Figure6E).6E Our results suggests that, using the GenomeTraFac web tool, we can identify several candidate transcription factors that may participate in tissue-specific expression of miRNAs by the following workflow (Figure (Figure6A):6A Predicted target genes of miRNAs with reduced expression in brain and peripheral blood mononuclear cells identifies a list of genes essential for development of these two tissue types in mouse models Identification of genes whose expression is regulated by miRNAs provides a lead for the functional roles of miRNAs, and predicting target genes of the tissue-specific miRNAs identified in our differentially expressed groups would greatly facilitate understanding the miRNA-regulated biological correlates of those tissues. One example is the group IVb miRNAs that had almost no expression in brain/PBMC compared to the rest of tissues that invariably had moderate to high abundance (Figure (Figure5).5 One member of the miR-199a (miR-199a-2) is located at only 5.6 kb away from miR-214, while miR-199a-1 and miR-199b are located at two separate regions with no other miRNAs nearby; miR-10a and miR-143 do not have relationship with miR-199a/199b/214 in genomic structure and were excluded from the analysis. To validate the low abundance of miR-199a/199b/214 in brain, their expression was examined in 6 additional adult brain specimens (including four derived from different regions of the brain and one fetal brain specimens) and all were reproducibly lower than the other tissue types (Table 3). Lower expression of miR-199a/214 was previously reported in brain than in liver, thymus, testes, and placenta by 16 to 180 folds in a study using microarrays [16]. Interestingly, expression of miR-199a/214 in brain compared to other major tissues was also reduced during zebrafish embryonic development [26].
All target genes (N = 1939) for miR-199a/199b/214 predicted by the miRanda web tool from miRBase [27] were combined, and their expression in 19 tissue types extracted from the GNF database (see Methods) was examined. To focus on more differentially expressed genes, 1125 genes with variation in expression among tissues equal or larger than 3.24 fold (1.8 under log2 base) were selected for analysis. Unexpectedly, almost all genes with reduced expression in non-brain and non-PBMC tissues did not show simultaneous increased expression in brain and PBMC; they rather showed elevated expression in either one or the other (Figure (Figure7).7
International Gene Trap Consortium (IGTC) maintains a database to curate functions of genes across the mouse genome by gene trapping that is a high-throughput approach to introduce insertional mutations and generate loss-of-function alleles in embryonic stem cells [28]. There were 31 out of the 168 predicted brain targets that had available phenotypic information in this database, and 23 of the 31 genes had either defect in the nervous system or in behavioral/neurological functions (Table 4 and see Additional file 8). To assure that such genes are not overrepresented in the IGTC database, 33 genes that had phenotypic information available were randomly selected from 6991 genes recorded in the database as of Feburary 13th, 2007, and only 7 genes had neurological defects when mutated, indicating that functions in the nervous system is indeed the most enriched category among these 31 genes (p = 2 × 10-12 by Chi test). The same strategy was used to evaluate the functional categories of the 146 predicted PBMC targets that had 30 genes with available phenotypic records in the database. Instead of 9 of the 33 randomly selected genes with defects in the immune and/or hematopoietic systems, 14 of the predicted PBMC genes had phenotypic changes in these two systems, which confirmed a moderate but still significant enrichment of genes in this functional category (p = 0.002).
Our refined 314 predicted targets could be more enriched than we observed in the functional categories of nervous and immune systems because the IGTC database lacks phenotypic records for most of these genes, and roles of many of them (for example, interleukins, tumor necrosis factors, ion channels, and neurotransmitter transporters) in these two systems have been well documented. Discussion Our miRNA expression profiles provide comprehensive information about general abundance as well as tissue-specificity of miRNAs In this study, we examined the expression of miRNAs in a comprehensive list of normal human tissues using 345 unique miRNA assays, and identified miRNAs that were expressed in specific tissues with minimal or no expression in other tissues we examined, such as miR-129/219/330 in brain, miR-124a/124b in brain and PBMC, and two groups of miRNAs primarily expressed in placenta and testes. We were also able to identify miRNAs with moderate to high expression in all tissues examined except for certain organs that had much lower or no expression at all, such as miR-199a/199b/214 in brain and PBMC and miR-10a/10b in brain. Our study provided an opportunity to re-visit and confirm the tissue-specific miRNAs previously reported in literature. For example, liver-specific miR-122a had significant expression in liver [13] and regulated cholesterol in plasma [19]; we indeed saw no expression in all non-liver tissues except for brain (34 copies) and thymus (19 copies) that had much less expression than liver (185-fold and 332-fold less, respectively). One of the brain-specific miRNAs, miR-219, described in a previous report [13] was also confirmed with our data, in which brain had 85 copies and merely 5 copies were found in PBMC, and no expression detected in the rest of the tissues. For reasons such as limited types of tissues and number of miRNAs examined and sensitivity of the assays, several miRNAs that were previously considered as "tissue-specific" actually showed significant expression in other tissues from our data, and therefore their tissue distribution needs to be redefined. For example, one "brain-specific" miRNA, miR-124 [29], was described in a number of reports and was demonstrated to shift the expression profiles of HeLa cells towards the brain signature [30,31]. Although our data did show most abundant expression of this miRNA in brain and no expression in most of the tissues surveyed, it also had significant expression in PBMC (from 1/4 to half of what was detected in brain) and detectable in thymus and one of the testes (from 50- to 100-fold less than what was in brain). It would be more reasonable to define miR-124a/b as preferentially expressed in brain and PBMC. In some cases, a broader tissue distribution for miRNAs described from our data suggested that they might have more functions than what was originally described. For example, miR-375 that was identified as "pancreas islet-specific" miRNA and functions as a regulator of insulin secretion from the islet cells [32], but we clearly showed that this miRNA belong to the epithelial subgroup of the GI/epithelial expression cluster, and was preferentially expressed in organs lined with epithelium (Group III in Figure Figure3).3 Not having enough tissue types for expression profiling might cause incorrect denotation of tissue-specific expression. For example, miR-134 was found to regulate development of dendritic spines of neurons, and regarded as "brain-specific" [33], whereas our data clearly demonstrated its expression in many other tissues with similar (placenta and testis) or even higher (adrenal gland) expression levels. We also discovered that another two "brain-specific" miRNAs, miR-135 and miR-183, identified previously [13] were more abundant in several non-brain tissues that were not examined in their original report. Universally expressed miRNAs are candidates participating fundamental metabolic pathways in normal cells We identified a group of 15 miRNAs (see Additional file 1) that are universally expressed at similar levels in normal tissues we examined based upon their CT values and the variability of their CT values among samples, and the scope of our tissue collection suggests that these miRNAs might exhibit similar expression patterns in tissues we did not examined. Such a feature characterizes these miRNAs as a candidate of universal reference to normalize miRNA expression in normal human tissues, as we did in our analysis using 4 miRNAs from this list. The expression pattern and tissue distribution of these 15 miRNAs suggests that they might be associated with fundamental functions, such as metabolism, required for normal human cells. For example, miR-29b was found to control the amount of the branched-chain α-ketoacid dehydrogenase complex that catalyzes the first irreversible step in branched-chain amino acid catabolism [34]. Furthermore, deregulated expression of these universally expressed miRNAs could also link to pathological states of a cell, such as neoplastic processes. Differential expression of some of them in tumors has been demonstrated, for example, miR-15/16 in chronic lymphocytic leukemia [11], miR-92 in lymphoma and lung cancer [35], and miR-140 in pancreatic cancer [36]. The clustering patterns of normal tissues by miRNA and mRNA expression profiles are similar Even with much less degree of freedom than mRNAs, miRNA expression profiles reflect the developmental lineage and differentiation state of cells and successfully classified poorly differentiated tumors that could not have definitive diagnosis by histopathology, while the classification based upon the mRNA profiles was highly inaccurate [14]. For this reason, one might anticipate that miRNA expression profiles would classify normal human tissues better than the mRNA profiles as well. To our surprise, the clustering of tissues using the miRNA expression profiles was very similar to that obtained by the mRNA expression profiles. One possible explanation is that miRNAs preserve more of the "cellular identity signature" compared to mRNAs under the genomic instability and heterogeneity that characterize neoplastic cells, whereas in normal tissues such a variable environment does not exist so the performance of both miRNA and mRNA expression profiles on tissue classification is comparable. The uniqueness of the tissue clustering using the miRNA expression profiles compared to that with the mRNA profiles appeared to be contributed by several groups of miRNAs with distinctive expression patterns, such as the ones in epithelial tissues, and those in placenta, testes, brain, and PBMC. The distinctive expression patterns in turn highlight the central roles played by miRNAs in histogenesis of epithelial tissues and in key physiologies of placenta, testis, brain, and the hematopoietic system. The group of epithelial miRNAs is probably the best example to support the concept of "cellular identity signature" since they also have marked expression in cancer cell lines derived from epithelial tissues [37]. Because a considerable portion of miRNAs had tissue-specific expression patterns and the average miRNA copy numbers in all tissues were highly variable, global normalization that assumes total abundance of the transcripts from all genes is constant across different tissues and is frequently used for normalization of mRNA expression data does not appear to be appropriate to normalize miRNA expression data. This characteristic of miRNA expression patterns among tissues we observed underscores the earlier findings in which total abundance of miRNAs was altered in tumors [14] as well as in Dicer-knockout animal models [38,39]. MicroRNA genes localized within a genomic cluster are preferentially co-expressed as a "transcription unit" Chromosomal abnormalities such as deletion/amplification of genes or loss/gain of chromosomes are characteristics of neoplastic cells, and such features could influence expression of genes within such afflicted regions that at least some of these genes show a coherent expression pattern, and this can be identified as distinctive expression clusters when global profiles of mRNA expression are analyzed by hierarchical clustering. For example, epidermal growth factor receptor (EGFR) is amplified in about 40% of GBM, and gene expression profiling of GBM revealed that EGFR and its neighboring genes were tightly clustered together and had substantially increased expression in the tumors that had EGFR amplification [40]. In the same study, a cluster of six C-C motif-containing cytokines located at Chromosome 17q12 within a 100 kb region (with spacing from 3.5 kb to 35 kb between them) could also be identified with a coordinated expression pattern, suggesting the presence of a co-regulated mechanism of transcription of these genes [40]. However, in most cases mRNAs in a hierarchical cluster with highly correlated expression are not mapped to the same genomic regions. This provides a striking contrast to what was observed in our miRNA expression profiles, in which most miRNAs with similar expression patterns are encoded from the same genomic region (usually with spacing from 1 kb to 5 kb between them). One possible explanation for this is that most miRNA genes located within the same genomic cluster are encoded in a polycistronic structure so they are synthesized, processed, and mature to final products in a parallel fashion. Conversely, miRNAs within the same genomic region that share common expression pattern would predict that these miRNA genes are transcribed as a polycistron. There were some exceptions in which miRNAs that are localized in the same genomic cluster but did not show coherent expression patterns in our data. This might be caused by different regulatory mechanisms for the transcription of these miRNAs, or these miRNAs share the same transcript (polycistronic) but differential control of the maturation process for each miRNA in the same transcript determines their final abundance. Target genes predicted from miRNAs with low expression in brain and peripheral blood mononuclear cells are candidates required for development and maintenance of these two tissue types It is believed that miRNAs down regulate the steady-state levels of their target mRNAs, which has been demonstrated in cell lines or entire organisms by examining limited number of miRNAs [19,30,41,42]. The major drawback of these studies is that they investigated the interaction between miRNAs and their target genes by either overexpression or knockdown experiments, since such an unnatural expression levels of miRNAs might cause artifacts. The only study that investigates the association of expression between miRNAs and their targets in normal and neoplastic specimens derived from a spectrum of tissue types without modifying their expression was using "tissue-specific" miRNAs extracted from other sources and analyzed the expression of predicted target genes of these miRNAs using published microarray datasets [25]. We used a comprehensive list of tissue specimens and highly quantitative miRNA assays to identify several groups of miRNAs that had specific expression in certain tissues, and examined the expression of their predicted target genes in the same tissue types from public microarray database. We initially expected to observe complementary expression patterns of miRNAs and their predicted target genes among the tissues, but in the case of miR-199a/199b/214 that had low expression in brain and PBMC compared to the rest of tissues, most predicted targets that showed brain/PBMC-specific expression only appeared in either brain or PBMC but hardly both. Similarly, expression of most predicted genes of miR-129/219/330 (higher expression in brain) had decreased expression in brain, but their expression in non-brain tissues was highly variable. It appears that expression of miRNA has a binary effect on expression of its targets, in that the suppression of its target by the miRNA is predominant when miRNA expression is high, whereas when the miRNA expression is reduced, other tissue-specific parameters such as transcription factors serve as a different level of gene expression control. This is supported by the observation of the same binary patterns in the expression of predicted miRNA target genes in our data from other tumor specimens and cell lines [37]. High-throughput identification of miRNA target genes could potentially rely on either manipulating expression of miRNAs (overexpression or knock-down) and examining resulting changes of gene expression profiles in cells/tissues using microarrays, or algorithm prediction followed by in vitro validation. Although in silico prediction appears to circumvent the time and cost issues associated with the transfection/knock-down experiments and microarrays, a long list of predicted output could be frustrating for investigators to focus on a few candidates for validation. Furthermore, genes predicted as targets might not be biologically meaningful if the miRNAs and their predicted target genes are never expressed in the same tissues. In our study, we filtered the original list of predicted targets by comparison of the tissue distribution between miRNAs and their target genes. This strategy seemed to produce a much-focused group of genes corresponding to the physiological functions of the organs. For example, many predicted target genes of miR-199a/199b/214 (low expression in brain and PBMC) are required by the developing nervous and hematopoietic systems. Conclusion Our data and analyses of expression patterns presented a global view of tissue distribution of miRNAs and the relation to their chromosomal locations. We presented evidence that such data support identification of specific miRNAs as markers to correlate with the functions of normal or disease tissues in which these miRNAs are expressed, and identification of the predicted miRNA target genes that are required in the developing nervous and hematopoietic systems. We also demonstrated a proof-of-principle strategy using the GenomeTraFac web source to precede future experimental validation for identifying candidate transcription factors associated with tissue-specific expression of miRNAs. Methods Total RNA samples Total RNA samples of normal human tissues from commercial sources were purchased from Ambion (Austin, TX), Stratagene (La Jolla, CA), and BD Biosciences (Mountain View, CA). Quantitation of miRNAs TaqMan® MicroRNA Assays were used to quantitate miRNAs in all of our studies according to the conditions published previously [17]. In brief, each 7.5 μl RT reaction contained purified 3.75 ng of total RNA, 50 nM stem-loop RT primer (Applied Biosystems, Foster City, CA), 1×RT buffer (Applied Biosystems), 0.25 mM each of dNTPs, 3.33 U/μl MultiScribe™ reverse transcriptase (Applied Biosystems) and 0.25 U/μl RNase inhibitor (Applied Biosystems). The reactions were incubated in an Applied Biosystems 9700 Thermocycler in a 384-well plate for 30 min at 16°C, 30 min at 42°C, followed by 5 min at 85°C, and then held at 4°C. RT products were diluted three times with H2O prior to setting up PCR reaction. Each real-time PCR for each microRNA assay (10 μl volume) was carried out in quadruplicate, and each 10 μl reaction mixture included 2 μl of diluted RT product, 5 μl of 2×TaqMan® Universal PCR Master Mix, 0.2 μM TaqMan® probe, 1.5 μM forward primer, and 0.7 μM reverse primer, respectively. The reaction was incubated in an Applied Biosystems 7900HT Fast Real-Time PCR System in 384-well plates at 95°C for 10 min, followed by 40 cycles of 95°C for 15 sec and 60°C for 1 min. The threshold cycle (CT) is defined as the fractional cycle number at which the fluorescence exceeds the fixed threshold of 0.2. Automated multi-well distribution of samples was done using the HYDRA® II PLUS-ONE System (Matrix Technologies, Hudson, NH). Data adjustment and filtering for hierarchical clustering Four human miRNAs (miR-30e, miR-92, miR-92N, and miR-423) that were least variable among the 40 normal tissues in this study were identified, and the average quantity of these four in each tissue was used to normalize the RNA input. Normalized data from assays with CT values greater than 35 were treated as 35 and were subject to hierarchical clustering by two ways. One is mean-centering data for each miRNA but not tissues, followed by correlation similarity metrics for both miRNA and tissue clustering (Figure (Figure2);2 When gene expression data were extracted from the GNF [43] database, the signal intensity of each gene was divided by the average of signal intensities of all genes extracted, followed by log2 transformation. Data adjustment for centering and similarity metric for hierarchical clustering was the same as described above. Differentially expressed genes were selected to avoid spurious clustering results by removing genes with variation in expression among tissues less than 1.8 (under log2 base). Higher cutoff will generate less number of genes and therefore was avoided. Genes from the Stanford GBM database were selected if they had analyzable data in more than 80% of the samples among the tissues examined. Extraction of data from public databases Genomic sequence of miRNA cluster and the predicted transcription factor binding sites were extracted from the "Cis-element clusters within BlastZ Aligments" option at the GenomeTraFac [44]. Tissue distribution of selected genes was derived from the GNF database [43], in which the expression data from the same tissue types examined in our study were extracted for analyses. Target genes of miRNAs were predicted using the miRanda open-source software [45] associated with the miRBase [46] with a cutoff p value less than 0.05. The IGTC database [47] was used to search the phenotype of mice carrying loss-of-function mutations. Statistical analyses The correlation coefficient (r) between repeating specimens and the correlation of expression between intronic miRNAs and their host genes (threshold p value was set to 0.05) were analyzed using Pearson regression. Comparison of mean was evaluated using 1-tail Student's t test with unequal variance. Chi test was used to analyze the significance of genomically clustered miRNAs in each differentially expressed group: the expected frequency was the number of all miRNAs among the 345 miRNAs examined that are located within the genomic clusters carrying the genomically clustered miRNAs regardless whether or not they were in the differentially expressed groups. Abbreviations miRNA, microRNA; PBMC, peripheral blood mononuclear cells; GNF, Genomics Institute of the Novartis Research Foundation; IGTC, International Gene Trap Consortium. Authors' contributions YL designed experiments, performed assays, analyzed data, and wrote the manuscript; DR and LW performed assays; CC designed assays and edited the manuscript. Additional file 1 Complete data of miRNA expression in 40 normal human tissues. Complete data normalized by the least variable miRNAs and small nuclear RNAs. Click here for file(391K, xls) Additional file 2 The clustering patterns of normal human tissues using mRNA expression profiles. The clustering patterns of normal human tissues using mRNA expression profiles taken from Shyamsunder et al. [18]. Click here for file(62K, pdf) Additional file 3 Differential abundance of miRNAs in normal human tissues. A color-coded diagram illustrates the differential abundance of miRNAs in normal human tissues. Normal tissues were generally arranged by their positions in human body, as highlighted at the right side of the diagram, whereas miRNAs were sorted based upon their annotated ID. Click here for file(5.3M, tiff) Additional file 4 Estimated copy numbers of miRNAs in normal human tissues. Complete data of estimated copy numbers of miRNAs in normal human tissues transformed from the Additional file 1. Click here for file(219K, xls) Additional file 5 The "regulogram" of the genomic sequence close to the hsa-miR-1-2 locus where MyoD binding site was identified. The "regulogram" from GenomeTraFac showed the genomic sequence close to the hsa-miR-1-2 locus where MyoD binding site was identified. Click here for file(967K, pdf) Additional file 6 Expression patterns of INSM1, STAT5, and SOX5 in normal human tissues. Expression patterns of INSM1, STAT5, and SOX5 in normal human tissues from the GNF database. Click here for file(983K, pdf) Additional file 7 Binding sites for Nkx2-5, SOX5, and FOXF2 and their tissue distribution. Binding sites for Nkx2-5, SOX5, and FOXF2 from GenomeTraFac, and their tissue distribution from the GNF database. Click here for file(1.8M, pdf) Additional file 8 Phenotypic data for the refined list of predicted target genes of miR-199a/199b/214. Complete phenotypic data extracted from IGTC for the refined list of predicted target genes of miR-199a/199b/214. Click here for file(21K, xls) Acknowledgements We thank Dr. Victor Ambros for critical reading of the manuscript. We are also grateful to the support provided by Applied Biosystems. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||
Development. 2005 Nov; 132(21):4645-52.
[Development. 2005]Development. 2005 Nov; 132(21):4653-62.
[Development. 2005]Cell. 2005 Jul 15; 122(1):6-7.
[Cell. 2005]N Engl J Med. 2005 Oct 27; 353(17):1768-71.
[N Engl J Med. 2005]N Engl J Med. 2005 Apr 21; 352(16):1667-76.
[N Engl J Med. 2005]Genome Res. 2004 Oct; 14(10A):1902-10.
[Genome Res. 2004]RNA. 2005 Mar; 11(3):241-7.
[RNA. 2005]Nat Genet. 2004 Oct; 36(10):1079-83.
[Nat Genet. 2004]Genome Biol. 2004; 5(3):R13.
[Genome Biol. 2004]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]RNA. 2005 Mar; 11(3):241-7.
[RNA. 2005]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9740-4.
[Proc Natl Acad Sci U S A. 2004]Genome Res. 2004 Dec; 14(12):2486-94.
[Genome Res. 2004]Nucleic Acids Res. 2005 Nov 27; 33(20):e179.
[Nucleic Acids Res. 2005]Genome Biol. 2005; 6(3):R22.
[Genome Biol. 2005]Genome Biol. 2005; 6(3):R22.
[Genome Biol. 2005]Nature. 2005 Dec 1; 438(7068):685-9.
[Nature. 2005]RNA. 2005 Mar; 11(3):241-7.
[RNA. 2005]Proc Natl Acad Sci U S A. 2006 Jun 6; 103(23):8721-6.
[Proc Natl Acad Sci U S A. 2006]Science. 2005 Sep 2; 309(5740):1577-81.
[Science. 2005]Genome Res. 2004 Oct; 14(10A):1902-10.
[Genome Res. 2004]RNA. 2005 Mar; 11(3):241-7.
[RNA. 2005]Genome Res. 2004 Oct; 14(10A):1902-10.
[Genome Res. 2004]Proc Natl Acad Sci U S A. 2006 Jun 6; 103(23):8721-6.
[Proc Natl Acad Sci U S A. 2006]Nat Rev Genet. 2003 Apr; 4(4):251-62.
[Nat Rev Genet. 2003]Nucleic Acids Res. 2007 Jan; 35(Database issue):D116-21.
[Nucleic Acids Res. 2007]Proc Natl Acad Sci U S A. 2006 Jun 6; 103(23):8721-6.
[Proc Natl Acad Sci U S A. 2006]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):6062-7.
[Proc Natl Acad Sci U S A. 2004]Proc Natl Acad Sci U S A. 2006 Feb 21; 103(8):2746-51.
[Proc Natl Acad Sci U S A. 2006]Genome Res. 2004 Dec; 14(12):2486-94.
[Genome Res. 2004]Science. 2005 Jul 8; 309(5732):310-1.
[Science. 2005]PLoS Biol. 2004 Nov; 2(11):e363.
[PLoS Biol. 2004]Nat Genet. 2004 Jun; 36(6):543-4.
[Nat Genet. 2004]Genome Biol. 2004; 5(3):R13.
[Genome Biol. 2004]Nature. 2005 Dec 1; 438(7068):685-9.
[Nature. 2005]Curr Biol. 2002 Apr 30; 12(9):735-9.
[Curr Biol. 2002]Nature. 2005 Feb 17; 433(7027):769-73.
[Nature. 2005]Proc Natl Acad Sci U S A. 2006 Feb 14; 103(7):2422-7.
[Proc Natl Acad Sci U S A. 2006]Nature. 2004 Nov 11; 432(7014):226-30.
[Nature. 2004]Nature. 2006 Jan 19; 439(7074):283-9.
[Nature. 2006]Genome Biol. 2004; 5(3):R13.
[Genome Biol. 2004]Hum Mol Genet. 2005 Nov 15; 14(22):3371-7.
[Hum Mol Genet. 2005]Nature. 2005 Jun 9; 435(7043):828-33.
[Nature. 2005]J Clin Oncol. 2006 Oct 10; 24(29):4677-84.
[J Clin Oncol. 2006]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]Cancer Res. 2007 Mar 15; 67(6):2456-68.
[Cancer Res. 2007]Nature. 2005 Jun 9; 435(7043):834-8.
[Nature. 2005]Genes Dev. 2005 Feb 15; 19(4):489-501.
[Genes Dev. 2005]Nat Genet. 2003 Nov; 35(3):215-7.
[Nat Genet. 2003]Proc Natl Acad Sci U S A. 2005 Apr 19; 102(16):5814-9.
[Proc Natl Acad Sci U S A. 2005]Nature. 2005 Dec 1; 438(7068):685-9.
[Nature. 2005]Nature. 2005 Feb 17; 433(7027):769-73.
[Nature. 2005]Nat Biotechnol. 2003 Jun; 21(6):635-7.
[Nat Biotechnol. 2003]Cell. 2005 Aug 26; 122(4):553-63.
[Cell. 2005]Proc Natl Acad Sci U S A. 2006 Feb 21; 103(8):2746-51.
[Proc Natl Acad Sci U S A. 2006]Cancer Res. 2007 Mar 15; 67(6):2456-68.
[Cancer Res. 2007]Nucleic Acids Res. 2005 Nov 27; 33(20):e179.
[Nucleic Acids Res. 2005]Nucleic Acids Res. 2005 Nov 27; 33(20):e179.
[Nucleic Acids Res. 2005]Genome Biol. 2005; 6(3):R22.
[Genome Biol. 2005]