![]() | ![]() |
Formats:
|
||||||||||||||
Copyright © Copyright 2005 by RNA Society Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes 1Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, USA 2Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Reprint requests to: David P. Bartel, Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, 9 Cambridge Center, Cambridge, MA 02142, USA; e-mail: dbartel/at/wi.mit.edu; fax: (617) 258-5287. Received November 17, 2004; Accepted December 6, 2004. This article has been cited by other articles in PMC.Abstract MicroRNAs (miRNAs) are short endogenous RNAs known to post-transcriptionally repress gene expression in animals and plants. A microarray profiling survey revealed the expression patterns of 175 human miRNAs across 24 different human organs. Our results show that proximal pairs of miRNAs are generally coexpressed. In addition, an abrupt transition in the correlation between pairs of expressed miRNAs occurs at a distance of 50 kb, implying that miRNAs separated by <50 kb typically derive from a common transcript. Some microRNAs are within the introns of host genes. Intronic miRNAs are usually coordinately expressed with their host gene mRNA, implying that they also generally derive from a common transcript, and that in situ analyses of host gene expression can be used to probe the spatial and temporal localization of intronic miRNAs. Keywords: noncoding RNA, miRNA, human gene expression, gene expression profiling, cistronic elements INTRODUCTION MicroRNAs are a group of short RNAs, initially identified in Caenorhabditis elegans as regulators of developmental timing (Lee et al. 1993; Bartel 2004). These RNAs have been identified in a broad range of animals, and some are conserved from C. elegans through Drosophila melanogaster to Homo sapiens (Pasquinelli et al. 2000; Lagos-Quintana et al. 2001; Lau et al. 2001; Lee and Ambros 2001). MicroRNA genes are abundant, encompassing ~1% of the genes of worms, flies, and humans (Lai et al. 2003; Lim et al. 2003a,b). Their known role as modulators of metazoan gene expression continues to expand. Examples include lsy-6, a miRNA gene in C. elegans that controls neuronal left/right asymmetry, and bantam, a miRNA in D. melanogaster that regulates cell proliferation (Brennecke et al. 2003; Johnston and Hobert 2003). In mammals, miRNAs have been implicated in the modulation of hematopoietic lineage differentiation (Chen et al. 2004), and recent computational predictions of miRNA target sites implicate their involvement in a broader regulatory network of gene interactions (Lewis et al. 2003; Kiriakidou et al. 2004). RESULTS AND DISCUSSION To analyze the global expression of miRNAs found in human tissues, a DNA oligonucleotide-based microarray was designed. Preparation and fluorophore (Cy3) labeling of biological samples for hybridization to the array were based on techniques previously developed to clone and sequence populations of small RNAs (Fig. 1 ![]()
To explore array design options and hybridization conditions, a pilot array was designed that contained single and multiple mismatched probes. Using this array, miRNA sequences that differed by a single nucleotide could often be resolved under stringent hybridization conditions (Supplementary Fig. S1, http://web.wi.mit.edu/bartel/pub/bartel_publications.html). A more comprehensive array was printed, which included a nonredundant set of Homo sapiens, Mus musculus, and C. elegans miRNA probes. In an effort to achieve comparable stringency for each probe, the lengths of the probes were adjusted to bring the predicted melting temperature (Tm) of each sequence to a uniform temperature. In principle, our method, which normalized to a uniform amount of each reference oligonucleotide, should reveal the relative levels of each miRNA in the biological sample. To test the correlation between microarray signal and miRNA abundance, total RNA from mixed-stage C. elegans was cloned and analyzed with the microarray. The observed scores from the microarray were in good accordance with the cloning frequencies of these miRNAs (Supplementary Fig. S2, http://web.wi.mit.edu/bartel/pub/bartel_publications.html). Deviations from a perfect correlation can be attributed to multiple factors, including potentially differing qualities of DNA synthesis for the oligonucleotides used to make the reference sample, and the potential crosshybridization of reference oligonucleotides, preferentially increasing the reference signal for miRNAs in closely related families. Cloning frequencies are in turn known to generally correlate with the molecular abundance of the miRNAs (Lim et al. 2003b). Nonetheless, our current methods are more reliable for comparing expression of the same miRNAs in different cells or conditions than for comparing the relative levels of different miRNAs. Other systems have recently been described for the expression profiling of miRNAs (Babak et al. 2004; Liu et al. 2004; Miska et al. 2004; Nelson et al. 2004; Thomson et al. 2004). The method of Miska et al. (2004), similar to ours, includes a PCR amplification step, which likely increases sensitivity but is not as convenient as direct labeling. The method of Thomson et al. (2004), like ours, uses a synthetic reference set, which provides a uniform positive control for hybridization and a valuable internal standard for normalization. The array was used to profile the expression of 175 miRNAs from 24 different human organs and the HeLa S3 cell line, the broadest survey of miRNA expression to date (Fig. 2A ![]() ![]()
The coexpression of closely clustered miRNAs has been used as evidence that they derive from a common primary transcript (Lau et al. 2001; Sempere et al. 2004), and for a few human and fly miRNA clusters, RT-PCR has confirmed the presence of a polycistronic transcript (Lee et al. 2002, 2004; Aravin et al. 2003). To explore the potential for co-expression among clusters of human miRNAs, we first mapped the distance between the miRNA precursors annotated in the July 2003 build of the human genome. The distances separating pairs of miRNAs on the same chromosome in this sample ranged from 101 nucleotides to >100 Mb. For each chromosome, pairwise comparisons were made between the expression profiles of all miRNAs oriented in the same direction, calculating for each pair a correlation coefficient ranging between -1 (anti-correlated) and 1 (perfectly correlated). The miRNAs were ranked by the distance separating each pair of miRNA genes from each other, and correlations were then plotted based on this ranking, with closer genes having lower rank numbers (Fig. 3 ![]() ![]() ![]()
Most human miRNAs lie between protein-coding genes, whereas about one-third are within the introns of annotated mRNAs. These intronic miRNAs are usually in the same orientation as the pre-mRNA, and thus could be under the control of the promoter driving the primary mRNA transcript. Alternatively, they could be independently transcribed through an alternative promoter, and the common orientation could be explained by the prospect of colliding polymerases disfavoring opposing orientations. To test whether these intronic miRNAs are coordinately expressed with host mRNAs, we examined the correlation between the miRNA expression profiles observed from the microarray data (Fig. 2A ![]() ), providing evidence that the miRNAs are processed from the same primary transcripts as their host genes. Similar findings for two miRNAs were also recently reported (Rodriguez et al. 2004).
In situ analysis of miRNA primary transcripts has been successful in fly embryos but has not yet been reported in mammals (Kosman et al. 2004). The coexpression of intronic miRNAs and host genes implies that existing mRNA in situ expression data already provide high-resolution information regarding which cells express these intronic miRNAs. Based on the specific expression of some miRNAs in mammalian neurons, it has been proposed that miRNAs may regulate translation in mammalian neurons, and that these miRNAs could have roles in synaptic remodeling (Kim et al. 2004). The expression profile of miRNAs was highly correlated with some host genes. miR-9-1 is embedded in intron 2 of the human CROC-4 gene, and its expression highly correlates with the expression of this gene (corr., 0.99). CROC-4 is expressed in proliferating and migrating cells in mouse and is present in the developing forebrain. When observed at higher resolution, specific staining is observed in the olfactory epithelium lining the olfactory pit and in the medial nasal process. In rat, CROC-4 expression is associated with cells proliferating around the fourth ventricle and in the developing optic vesicle. It has been suggested, based on its localization with proliferating and migrating cells during early brain development, that CROC-4 participates in pathways involved in modulating cellular architecture during neuronal development and plasticity (Jeffrey et al. 2000). The same could be true of its resident miRNA, miR-9-1. In situ analysis exists for other correlated host genes; particularly interesting examples are AATYK (miR-338) and EGFL7 (miR-126) (Baker et al. 2001; Fitch et al. 2004). In summary, these findings support the idea that clusters of proximal miRNAs are typically expressed as polycistronic, coregulated units and that intronic miRNAs are generally coexpressed with their host genes. Host gene-miRNA coexpression is illuminating in cases where detailed host gene in situ analysis is available and can reveal complex miRNA expression patterns. Microarray profiling of miRNA expression is a useful strategy for examining the global expression profiles of this abundant class of small RNAs. The contents of this array are easily expanded; as new miRNAs are discovered in other systems, probes for these miRNAs can be easily incorporated into the existing array. It will be interesting, in the future, to explore the dynamics of global miRNA expression in developing tissues or whole organisms and to probe the relationship between miRNA expression and biological function, thus providing essential information for placing these abundant riboregulators into the gene regulatory circuitry of the animal. MATERIALS AND METHODS Microarray design Antisense probes for the oligonucleotide array were synthesized at a 0.01 μM scale (MWG biotech), modified at their 5′ ends with a 6-carbon linker and a primary amine (MWG biotech). Each probe oligonucleotide sequence was designed to have a predicted Tm of ~55°C (20 nM probe, 50 mM NaCl) when paired to its cognate DNA (Breslauer et al. 1986). Two strategies were employed to narrow the distribution of melting temperatures on the array to allow tighter control of specificity during sample hybridization. Probe oligonucleotides with calculated Tm’s exceeding 55°C were truncated to bring their Tm to ~55°C, while retaining the segment that afforded the greatest discrimination among known miRNAs. In a few cases, the predicted Tm did not approach 55°C, even when hybridizing to the entire mature miRNA. These probes were extended with up to five additional nucleotides on their 3′ ends to extend complementarities into the constant flanking sequence of the labeled single-stranded DNA. For example, a probe that is the antisense of hsa-miR-190, ACCTAATATATCAAACATATCA, has a calculated Tm of 44.8°C while its extended version, ACCTAATATATCAAACATATCATTTCA has a Tm of 53.5°C (underlined sequence, 3′ extension). Synthetic probe oligonucleotides were spotted onto activated slides according to the manufacturer’s instructions (Codelink, Amersham Biosciences). Reference sample A synthetic reference oligonucleotide was synthesized for every probe on the array (MWG biotech). Each oligonucleotide was synthesized as the sense strand of the miRNA sequence plus the constant sequences flanking each end that were used as primer sites for PCR amplification. For example, the reference oligonucleotide for hsa-miR-190 is: ATCGTAGGCACCTGAAATGAT ATGTTTGATATATTAGGTCTGTAGGCACCATCAAT (primer sites underlined). The reference sample was amplified from a mixture containing 0.22 nM synthetic oligonucleotides. The sample was labeled during 10 cycles of PCR amplification using a Cy5 dye-labeled oligonucleotide (Cy5-ATCGTAGGCACCTGAAA, IDT) as the sense-strand primer. The antisense oligonucleotide primer (ATTGATGGTGCCTACAG-C18-(A)20) contained a C18 spacer followed by a 20-nucleotide polyadenosine sequence. The shorter, Cy5-labeled strand of the amplified dsDNA product was purified from a denaturing polyacrylamide gel (6% polyacrylamide, 8 M urea). MicroRNA samples Samples for hybridization to the array were size-selected, ligated to adapter oligos, reverse-transcribed, and amplified as described for miRNA cloning (Lau et al. 2001). To prevent the nonuniform amplification that can happen during late PCR cycles, all amplifications in this process were stopped at a point where product first became visible on an ethidium-stained agarose gel (4% NuSieve, Cambrex). For all tissues except bone marrow, total RNA was purchased from Ambion. Bone marrow total RNA was extracted from human bone marrow cells (Cambrex) using the Tri reagent according to the manufacturer’s protocol (Sigma). Biological replicates of human hepatocytes isolated from two separate transplant grade livers were prepared using the same method (Supplementary Fig. S3, http://web.wi.mit.edu/bartel/pub/bartel_publications.html). Labeling and purification of the single-stranded DNA was as described for the reference sample, except that Cy5 was replaced with Cy3 (Cy3-ATCGTAGGCACCTGAAA, IDT). Hybridization and analysis Before use, microarrays were prehybridized (Pre-Hyb solution, 3.5 × SSC, 1% BSA, 0.1% SDS, 50°C, 45 min). For hybridization to the microarray, 10 pmol of the labeled ssDNA library was mixed with 10 pmol of the reference set in hybridization solution [25 μL final volume, 3.5 × SSC, 1% BSA, 0.1% SDS, 0.1 mg/mL herring sperm DNA (Sigma), 0.2 mg/mL yeast tRNA (Sigma), 0.4 mg/mL polyA RNA (Sigma), 50°C]. Hybridization was beneath lifter cover slips, in an aluminum chamber submerged in a water bath (57°C, 6 h). Slides were washed (2 × SSC, 0.1%, SDS 50°C, 5 min, 0.1 × SSC, 0.1% SDS, 10 min, wash 3× with 0.1 × SSC, 1 min), dried, and scanned (Genepix pro 4000B, Axon). Spots with an unacceptably low reference signal (defined as less than or equal to the median background plus two times its standard deviation) were eliminated from the analysis (dark green features in Fig. 2 ![]() Acknowledgments We thank M. Jones-Rhoades for gathering and organizing the initial miRNA data sets used in the design of this microarray, N. Lau for human bone marrow total mRNA, M. Axtell for many helpful discussions, and B. Chevalier for human liver samples used in Supplementary Figure S3. This work was supported by NIH grant DK068348. Notes Article and publication are at http://www.rnajournal.org/cgi/doi/10.1261/rna.7240905. REFERENCES
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
Cell. 1993 Dec 3; 75(5):843-54.
[Cell. 1993]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Nature. 2000 Nov 2; 408(6808):86-9.
[Nature. 2000]Science. 2001 Oct 26; 294(5543):853-8.
[Science. 2001]Science. 2001 Oct 26; 294(5543):858-62.
[Science. 2001]Science. 2001 Oct 26; 294(5543):858-62.
[Science. 2001]Genes Dev. 2003 Apr 15; 17(8):991-1008.
[Genes Dev. 2003]RNA. 2004 Nov; 10(11):1813-9.
[RNA. 2004]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9740-4.
[Proc Natl Acad Sci U S A. 2004]Genome Biol. 2004; 5(9):R68.
[Genome Biol. 2004]Nat Methods. 2004 Nov; 1(2):155-61.
[Nat Methods. 2004]Curr Biol. 2002 Apr 30; 12(9):735-9.
[Curr Biol. 2002]RNA. 2003 Feb; 9(2):175-9.
[RNA. 2003]Genome Biol. 2004; 5(3):R13.
[Genome Biol. 2004]Science. 2001 Oct 26; 294(5543):862-4.
[Science. 2001]Science. 2004 Jan 2; 303(5654):83-6.
[Science. 2004]Science. 2001 Oct 26; 294(5543):858-62.
[Science. 2001]Genome Biol. 2004; 5(3):R13.
[Genome Biol. 2004]EMBO J. 2002 Sep 2; 21(17):4663-70.
[EMBO J. 2002]EMBO J. 2004 Oct 13; 23(20):4051-60.
[EMBO J. 2004]Bioinformatics. 1998; 14(8):656-64.
[Bioinformatics. 1998]Genome Res. 2004 Oct; 14(10A):1902-10.
[Genome Res. 2004]Science. 2004 Aug 6; 305(5685):846.
[Science. 2004]Proc Natl Acad Sci U S A. 2004 Jan 6; 101(1):360-5.
[Proc Natl Acad Sci U S A. 2004]Mol Cell Neurosci. 2000 Sep; 16(3):185-96.
[Mol Cell Neurosci. 2000]Oncogene. 2001 Mar 1; 20(9):1015-21.
[Oncogene. 2001]Dev Dyn. 2004 Jun; 230(2):316-24.
[Dev Dyn. 2004]Proc Natl Acad Sci U S A. 1986 Jun; 83(11):3746-50.
[Proc Natl Acad Sci U S A. 1986]Science. 2001 Oct 26; 294(5543):858-62.
[Science. 2001]Science. 2001 Oct 26; 294(5543):858-62.
[Science. 2001]Nucleic Acids Res. 1995 Oct 25; 23(20):4220-1.
[Nucleic Acids Res. 1995]