Logo of plosonePLoS OneView this ArticleSubmit to PLoSGet E-mail AlertsContact UsPublic Library of Science (PLoS)
PLoS One. 2009; 4(12): e8419.
Published online 2009 Dec 23. doi:  10.1371/journal.pone.0008419
PMCID: PMC2791866

Reprogramming of 3′ Untranslated Regions of mRNAs by Alternative Polyadenylation in Generation of Pluripotent Stem Cells from Different Cell Types

Juan Valcarcel, Editor



The 3′ untranslated regions (3′UTRs) of mRNAs contain cis elements involved in post-transcriptional regulation of gene expression. Over half of all mammalian genes contain multiple polyadenylation sites that lead to different 3′UTRs for a gene. Studies have shown that the alternative polyadenylation (APA) pattern varies across tissues, and is dynamically regulated in proliferating or differentiating cells. Generation of induced pluripotent stem (iPS) cells, in which differentiated cells are reprogrammed to an embryonic stem (ES) cell-like state, has been intensively studied in recent years. However, it is not known how 3′UTRs are regulated during cell reprogramming.

Methods/Main Findings

Using a computational method that robustly examines APA across DNA microarray data sets, we analyzed 3′UTR dynamics in generation of iPS cells from different cell types. We found that 3′UTRs shorten during reprogramming of somatic cells, the extent of which depends on the type of source cell. By contrast, reprogramming of spermatogonial cells involves 3′UTR lengthening. The alternative polyadenylation sites that are highly responsive to change of cell state in generation of iPS cells are also highly regulated during embryonic development in opposite directions. Compared with other sites, they are more conserved, can lead to longer alternative 3′UTRs, and are associated with more cis elements for polyadenylation. Consistently, reprogramming of somatic cells and germ cells involves significant upregulation and downregulation, respectively, of mRNAs encoding polyadenylation factors, and RNA processing is one of the most significantly regulated biological processes during cell reprogramming. Furthermore, genes containing target sites of ES cell-specific microRNAs (miRNAs) in different portions of 3′UTR are distinctively regulated during cell reprogramming, suggesting impact of APA on miRNA targeting.


Taken together, these findings indicate that reprogramming of 3′UTRs by APA, which result from regulation of both general polyadenylation activity and cell type-specific factors and can reset post-transcriptional gene regulatory programs in the cell, is an integral part of iPS cell generation, and the APA pattern can be a good biomarker for cell type and state, useful for sample classification. The results also suggest that perturbation of the mRNA polyadenylation machinery or RNA processing activity may facilitate generation of iPS cells.


Embryonic stem (ES) cells, derived from inner cell mass of the blastocyst, an early stage embryo, are pluripotent cells which can differentiate into any of the three germ layers, and thus are believed to have the potential to treat a wide range of degenerative diseases and tissue damages [1]. Recent advances in induced pluripotent stem (iPS) cells, which are derived by reprogramming of differentiated cells to ES-like cells using a set of defined factors [2], [3], have stimulated the excitement that many ethical and technical barriers associated with clinical application of ES cells may be overcome by using iPS cells. Current approaches to generate iPS cells involve ectopic expression of a set of transcription factors that are essential for self-renewal, such as Oct4, Sox2, Klf4, c-Myc, Nanog, and LIN28, through various viral vectors, plasmids, or recombinant proteins [4], [5]. In addition, pluripotent ES-like cells have been derived from germ cells under proper culturing conditions, including primordial germ cells from the embryo [6] and spermatogonial cells from neonatal or adult testes [7], [8], [9].

The 3′ untranslated regions (3′UTRs) of mRNAs contain various cis elements involved in post-transcriptional gene regulation, such as mRNA localization, stability, and translation [10], [11], [12]. Cis elements that are widely encoded in 3′UTRs include miRNA target sites [13], AU-rich elements [14], and GU-rich elements [15]. Over half of all mammalian genes have multiple polyadenylation sites, or poly(A) sites, resulting in mRNA isoforms with different 3′UTRs and/or coding sequences (CDS) [16], [17]. Compared with constitutive regions of 3′UTRs, alternative regions are usually longer by ∼2 fold, have higher AU content, and contain more cis elements [18].

Poly(A) sites, recognized and processed by the mRNA polyadenylation machinery, are essential for 3′ end maturation of almost all mRNAs in eukaryotic cells [19], [20]. About 90 protein factors have been shown to be part of or associated with the mRNA polyadenylation machinery in human cells [21]. Both upstream and downstream elements surrounding a poly(A) site are critical for mRNA polyadenylation. For example, the CPSF complex interacts with the upstream AAUAAA/AUUAAA hexamer, also known as polyadenylation signal (PAS); and the CstF complex interacts with the downstream U-rich and GU-rich elements. In addition, various upstream and downstream auxiliary elements have been found to play regulatory roles in poly(A) site usage [22], [23].

Regulation of alternative polyadenylation (APA) under various biological conditions has been analyzed for many cellular and viral genes [24], [25]. Recent global analyses have indicated that the APA pattern varies among tissue types [26], [27]. For example, mRNAs expressed in brain tissues tend to have longer 3′UTRs than other tissue types [26], and those expressed in testes tend to have short 3′UTRs resulting from poly(A) sites that are not frequently used in other tissues [26], [28]. In addition, APA can be dynamically regulated in response to extracellular signals, for example activation of neuronal cells [29]. A general trend of 3′UTR shortening in proliferating cells was reported by Sandberg and coworkers [30]. Mayr and Bartel found that expression of mRNAs with shortened 3′UTRs are more apparent in transformed cells than nontransformed ones with similar proliferating rate [31]. We recently reported that 3′UTRs progressively lengthen via APA during mouse embryonic development [18] and this regulation coordinates with various aspects of development including proliferation, differentiation, and morphogenesis, and likely results from weakening of the general mRNA polyadenylation activity when cells are committed to specific types.


Analysis of 3′UTR Regulation in Generation of iPS Cells Using a Robust Computational Method

A number of studies have used DNA microarrays to profile gene expression in reprogramming of somatic cells into iPS cells, all of which focused on mRNA levels. We were interested in how 3′UTRs were regulated during the reprogramming process. To this end, we collected data from a set of studies that utilized Affymetrix GeneChip microarrays, because their design was amenable to 3′UTR analysis for a large number of genes [18], [30]. As listed in Tables S1 and S2, these studies included 5 data sets for mouse iPS cells derived from B lymphocytes [32], mouse embryonic fibroblasts (MEFs) [33], and adult neural stem cells (NSCs) [34], [35], and 4 data sets of human iPS cells derived from neonatal foreskin fibroblasts and fetal lung fibroblasts [36], [37], [38], [39]. In addition, we included a data set for human pluripotent stem cells derived from spermatogonial cells (SC), a type of germ cell, from human adult testis [9]. For simplicity, these SC-derived cells were also referred to as iPS cells in this study.

For genes with APA, the first and last poly(A) sites in the 3′-most exon were named proximal and distal sites, respectively (Figure 1). The regions upstream and downstream of a proximal site were named constitutive UTR (cUTR) and alternative UTR (aUTR), respectively. Accordingly, the 3′UTR in genes without APA was named single UTR (sUTR). Our method to analyze 3′UTR regulation was based on comparison of microarray probes targeting cUTRs with those targeting aUTRs with respect to intensity changes between samples in a sample set (Figures 1 and S1). A score named Relative Usage of Distal poly(A) site, or RUD, was used to represent relative 3′UTR length for a gene in a sample, with high RUD indicating long 3′UTR. The median RUD of all surveyed genes in a sample was used to represent global 3′UTR length for the sample.

Figure 1
Schematic of APA and analysis of APA using Affymetrix GeneChip probes.

Interestingly, we found that microarray samples processed at different times could have systematic differences in RUD (Figure S3), presumably due to differences in sample processing. We thus developed a method to normalize RUD values across samples by using probes targeting sUTRs (Figure S1). Our rationale was that comparing 5′ sUTR probes with 3′ ones (Figure 1B) could provide a background difference for probes in cUTRs and aUTRs, controlling variation of RUD values between samples that are attributable to technical reasons. As shown in Figure S3, this approach significantly reduced sample-to-sample variations in RUD calculation: Similar biological samples processed at different times had closer normalized RUD (nRUD) values than not normalized ones.

Dynamic Regulation of 3′UTR by APA in Reprogramming of Different Cell Types

Using nRUD we found that genes tended to express mRNAs with shorter 3′UTRs in iPS cells than in source somatic cells (Figures 2A and 2B). By contrast, 3′UTRs lengthened during generation of iPS cells from SC (Figure 2C). For data sets that included ES cells, the 3′UTR length in reprogrammed iPS cells was generally closer to ES cells than to source cells, which was consistent with their phenotypes and gene expression profiles. The dynamics of 3′UTR regulation in cell reprogramming can be clearly manifested in the NSC.a study, which included both reprogramming of NSCs to iPS cells and differentiation of reprogrammed cells back to NSCs (Figure S4).

Figure 2
Dynamic regulation of 3′UTR in generation of iPS cells.

Three studies included cells that were partially reprogrammed (B lymph. and MEF.a in Figure 2A and NHDF in Figure 2B), which were believed to be trapped at intermediate stages of reprogramming. Interestingly, the 3′UTR lengths of mRNAs in these partially reprogrammed cells appeared to be longer than those in fully reprogrammed ones. The differences appeared to be consistent with their phenotypes. For example, P1 and P2 in the B lymph. data set represented 2 different partially reprogrammed cells, with P2 being closer to fully reprogrammed iPS cells and ES cells based on the gene expression profie and growth behavior [32]. In line with this, the 3′UTR length for P2 was closer to fully reprogrammed cells and P1 was closer to source cells (Figure 2A). This result suggests that regulation of 3′UTR is a continuous process during generation of iPS cells.

We found that while 3′UTR shortening in reprogramming of somatic cells to iPS cells could be discerned for most human and mouse genes and the consistency across data sets was much higher than random (Figure S5), the extent of shortening, however, appeared to be variable for different cell types (Figures S6). Reprogramming of NSC involved more drastic 3′UTR shortening than other cell types, which is similar in extent, but opposite in direction, to reprogramming of SC (Figures 2 and S6). This result indicates that the direction and extent of 3′UTR regulation in generation of iPS cells reflect the difference between source cells and iPS cells. On this note, it has been reported that mRNAs expressed in neuronal cells tend to have longer 3′UTRs than other cell types, whereas those expressed in testes have shorter ones [26], [28]. In addition, each cell type has a set of genes with a different direction of 3′UTR regulation than the global trend of the cell (Figure S6C), suggesting cell-specific regulation of APA for certain genes. In support of this notion, we found that the APA pattern can be used to separate samples according to reprogramming state and cell type by hierarchical clustering (Figure 2D) and principal component analysis (Figure 2E).

Regulation of 3′UTR in Generation of iPS Cells Is Related to That in Embryonic Development

Generation of iPS cells involves reprogramming of differentiated cells into an undifferentiated state, analogous to reversal of development. We thus wanted to know how APA in generation of iPS cells was related to that in embryonic development, during which 3′UTRs progressively lengthen [18]. To this end, we focused on mouse genes because of availability of microarray data for mouse embryonic development. For all mouse genes with APA surveyed in this study, we first modeled change of 3′UTR length against 2 reprogramming states, i.e. before and after reprogramming, in 6 cell lines using logistic regression (see Methods for detail) [40]. The P-value of the model reflects the correlation between 3′UTR change and reprogramming state. An example is shown in Figure 3A. As expected, there were significantly more genes having negative correlation than those having positive correlation (Figure 3B), consistent with general shortening of 3′UTR during reprogramming of somatic cells.

Figure 3
The 3′UTR regulation in generation of iPS cells is related to that in embryonic development.

We next calculated correlation between 3′UTR length and embryonic development stages by Pearson Correlation, using data sets corresponding to 8 different tissues from embryonic day (E) 8.5 to postnatal day (P) 0 [18]. An example is shown in Figure 3C. We then selected genes with significant 3′UTR regulation in embryonic development (P<0.05, Figure 3D), and examined how they were regulated in generation of iPS cells. As shown in Figure 3E, genes with 3′UTR shortening in generation of iPS cells were more likely to have 3′UTR lengthening in embryonic development (groups 1–3), whereas genes with 3′UTR lengthening in generation of iPS cells were more likely to have 3′UTR shortening in embryonic development (groups 4 and 5). This result indicates that 3′UTR regulation in generation of iPS cells from somatic cells is largely reversal of that in embryonic development.

We previously selected 2 sets of genes whose mRNA expression levels positively or negatively correlated with 3′UTR length in embryonic development [18]. They were called Positive Correlation Set, (PCS) and Negative Correlation Set (NCS), respectively. Intuitively, they are marker genes for 3′UTR length in cells. We examined how their expression changes correlated with 3′UTR changes in generation of iPS cells, including all source cells, and partially and fully reprogrammed cells. As shown in Figure 3F, mRNA expression changes of both NCS genes and PCS genes significantly (P = 4×10−15 and 5×10−16) correlated with 3′UTR length with good negative and positive r values (r = −0.71 and 0.72), respectively, further indicating the close relation between generation of iPS cells and embryonic development in 3′UTR regulation.

Regulation of mRNA Polyadenylation Activity May Be Responsible for 3′UTR Dynamics

Alternative 3′UTRs are generated by alternative use of poly(A) sites. Using human and mouse orthologous poly(A) sites [41], we found that the proximal poly(A) sites of genes with more significant 3′UTR shortening in generation of iPS cells tend to be more conserved (Figure 4A). By contrast, there is no difference between distal poly(A) site groups, suggesting that regulation of 3′UTRs by APA is chiefly through proximal poly(A) sites. In addition, using alignments of human, mouse, rat, and dog genomes, we found that sequences surrounding proximal poly(A) sites involved in 3′UTR shortening (groups 1 and 2) were more conserved than those involved in lengthening (groups 4 and 5), whereas those surrounding distal poly(A) sites showed no such differences (Figure 4B).

Figure 4
Analysis of poly(A) sites responsible for 3′UTR regulation.

We found also that genes with more significant 3′UTR shortening in generation of iPS cells had significantly longer aUTRs (Figure 4C), for example P-value = 2.7×10−9 (Wilcoxon rank sum test) for groups 1+2 vs. groups 4+5. But this trend was not detected for cUTRs. This result may suggest that APA events involving longer aUTRs are more likely to be detected by our method. However, given the trend in conservation of proximal poly(A) sites (Figures 4A and 4B), it is more likely that the proximal sites with longer aUTRs are more easily regulated.

We next reasoned that proximal poly(A) sites that were differentially regulated in generation of iPS cells and embryonic development might be surrounded by different cis elements. To this end, we compared the frequency of occurrence for all 5-mers near proximal poly(A) sites of genes in groups 1 and 2 vs. those in groups 4 and 5. We analyzed 4 regions, i.e. −100 to −41 nt, −40 to −1 nt, +1 to +40 nt, and +41 to +100 nt surrounding the poly(A) site. As shown in Figure 4D, we found that the major differences between these groups were elements located downstream of poly(A) sites, including UUUUU, UGUGU, and GUGUG, which were the binding sites for CstF-64, a factor in the CstF complex of the polyadenylation machinery [42], [43], suggesting that the activity of CstF complex may be regulated during generation of iPS cells.

We then analyzed expression of 94 genes encoding proteins that are part of or associate with the mRNA polyadenylation machinery, largely based on a recent proteomics study [21]. For simplicity, they were called poly(A) genes (see Table S3 for the complete list). Overall, these poly(A) genes were significantly upregulated during somatic cell reprogramming (Figure S8) and there existed a significant negative correlation between expression of these genes and 3′UTR length (Figure 5A) across all data sets including SC and partially reprogrammed cells, suggesting that regulation of polyadenylation activity may be responsible for the regulation of 3′UTR in cell reprogramming. For example, 15 out of 23 genes encoding core polyadenylation factors (Figure 5B) and 31 out of 71 genes encoding associated factors (Figure S9) were consistently upregulated during generation of iPS cells across data sets, most of which were downregulated during reprogramming of SC and embryonic development. Interestingly, all 3 genes encoding factors in the CstF complex were significantly regulated, which was in good agreement with the cis element result described above. In addition, most of the genes encoding the CPSF complex were also upregulated, suggesting coordination in gene expression between CPSF and CstF complexes.

Figure 5
Gene expression analysis in generation of iPS cells.

To understand the significance of regulation of poly(A) genes in a global context, we analyzed all Gene Ontology (GO) Biological Processes (BPs) with respect to regulation of associated genes in generation of iPS cells. As shown in Figure 5C, consistent with 3′UTR regulation, ‘RNA processing’, which included over half of the 94 poly(A) genes, was the most significantly upregulated BP in reprogramming of somatic cells. Significantly, poly(A) genes, if treated as a BP group, would rank 7th based on P-value in upregulated BPs. In addition, consistent with our previous finding that 3′UTR regulation coincides with regulation of genes involved in proliferation, differentiation, and morphogenesis during embryonic development, many related BPs were significantly regulated, most of which in opposite directions in generation of iPS cells from somatic cells compared with embryonic development. Interestingly, similar to 3′UTR regulation, most of the significant BPs for generation of iPS cells from somatic cells were also significant in reprogramming of SC, but in opposite directions, indicating substantial differences between germ cells and somatic cells in gene expression during the reprogramming process.

Impact of 3′UTR Regulation on Post-Transcriptional Gene Regulation in Generation of iPS Cells

Regulation of 3′UTR length by APA can impact on cis elements located in 3′UTRs, resulting in different mRNA metabolism for different mRNA isoforms. Generally, cis elements in the 3′UTRs function to inhibit gene expression by repressing translation or facilitating mRNA degradation, such as miRNA target sites and AU-rich and GU-rich elements. Consistent with this notion, we found that genes with significant 3′UTR shortening in generation of iPS cells from somatic cells (group 1) were significantly less downregulated than other genes (Figure 6A).

Figure 6
Impact of 3′UTR dynamics on gene expression and miRNA targeting.

Several miRNAs have been reported to be expressed in ES cells [44], [45] and introduction of ES cell-specific miRNAs was shown to promote generation of iPS cells [46]. To examine how miRNAs were expressed and functioned in iPS cells, we first predicted functional miRNAs using mRNA profiles of their target genes. This analysis was based on the rationale that mRNAs targeted by miRNAs would show downregulated expression profiles compared with non-target mRNAs using microarrays [47], [48]. We focused on 211 miRNA families that were conserved between human and mouse (Table S4), and used the TargetScan method to predict target sites [13] with requirement of conservation in human, mouse, rat, and dog (see Methods for detail). In good agreement with previous reports [49], [50], we found that miRNA target sites tended to be located near 5′ and 3′ ends for all UTR groups (Figure S10), i.e. sUTR, cUTR, and aUTR. Using Fisher's exact test, we calculated significance score for each miRNA family which indicated whether its target genes were significantly upregulated or downregulated compared with other genes. By this method, we predicted a number of miRNA families which were likely to be functional in each iPS cell line (see Table S4 for the full list). Significantly, the top 6 miRNAs with consistently high significance scores across all data sets were all reported to be expressed in ES cells (Figure 6B)[44], [45], indicating concordance between iPS cells and ES cells with respect to miRNA-mediated gene regulation.

We next reasoned that genes with miRNA target sites located in aUTRs might be regulated differently than genes with sites located in cUTRs because of change of 3′UTR length during generation of iPS cells [51], as illustrated in Figure 6C. We focused on target sites for the top 6 miRNA families that were reported to be expressed in ES cells. As shown in Figure 6D, genes with target sites in aUTRs were significantly (P<0.04) less downregulated than those with target sites in cUTRs or sUTRs during generation of mouse iPS cells from somatic cells, suggesting that shortening of 3′UTRs leads to evasion of miRNA targeting in aUTRs. In line with this finding, genes with aUTR targets tended to be more downregulated than those with target sites in cUTRs or sUTRs during reprogramming of human SC, during which 3′UTRs were lengthened (Figure 6E). Furthermore, we also found that change of 3′UTR length during reprogramming was more significant for genes with miRNA target sites in aUTRs than those with sites in cUTRs (Figure 6F), suggesting that mRNA isoforms with aUTRs were selectively degraded, leading to relatively higher abundance of isoforms with only cUTRs.


Here we show that 3′UTRs are reprogrammed by APA during generation of iPS cells from different cell types, which appears to be reversal of their regulation in development: Shortening of 3′UTRs in reprogramming of somatic cells is opposite to their regulation in embryonic development, and lengthening of 3′UTRs in reprogramming of germ cells appears to be reversal of their regulation in postnatal development of testis [18]. These results underline the dynamic nature of 3′UTR regulation in development, and indicate that APA is an integral part of cell reprogramming process. Interestingly, the developmental potency of a cell type, i.e. germ cells>ES cells>partially differentiated cells > terminally differentiated cells, seems to inversely correlate with the global 3′UTR length. On this note, it remains to be seen whether the 3′UTR length can be used as a marker to monitor cell reprogramming process, and whether perturbation of APA may alter the efficiency of generation of iPS cells.

Sandberg et al. reported 3′UTR shortening in T cell activation and a general correlation between 3′UTR shortening and cell proliferation [30]. Mayr and Bartel further found that 3′UTR shortening is more apparent in transformed cells than nontransformed ones with similar proliferating rate [31]. Taken together their findings and ours, we propose that the 3′UTR lengthen is controlled by both proliferation and differentiation states, as depicted in Figure 7. Given the good correlation between expression of poly(A) genes and global 3′UTR length, one attractive model for the underlying mechanism is that the polyadenylation machinery is dynamically regulated during proliferation/differentiation. In line with this, we found that binding sites of several transcription factors related to proliferation/differentiation, including E2F, c-myc and p53, are enriched in the promoter regions of RNA processing genes (Figure S11), which included more than half of the poly(A) genes, and factors in E2F and pRB families were reported to bind promoter regions of genes encoding CstF factors [52].

Figure 7
A model for regulation of 3′UTR by APA in proliferation/differentiation.

By comparing 3′UTR dynamics in cell reprogramming with that in embryonic development, we uncovered several features that are important for regulation of proximal poly(A) sites: 1) The proximal sites that are highly responsive to change of cell state during cell reprogramming, such as group 1 poly(A) sites in Figure 3B, are also highly regulated in embryonic development, leading to 3′UTR shortening and lengthening in reprogramming and development, respectively. 2) Compared with other sites, these highly regulatable ones are more conserved across species, indicating their functional importance. Their flanking regions are under higher evolutionary constraint, suggesting more elements may be located around these sites, which may function under different conditions. On this note, it was recently reported that the proximal poly(A) sites having higher variation of usage in different human tissues tend be to flanked by sequences with higher conservation rate [27]. 3) Highly regulatable proximal poly(A) sites lead to longer aUTRs when not used. On the one hand, this suggests that more cis elements in aUTRs can be regulated by APA for the corresponding genes; on the other hand, longer aUTRs may allow more effective regulation of proximal poly(A) sites with less interference from distal poly(A) sites. 4) Highly regulatable proximal sites have higher occurrences of downstream U-rich and GU-rich elements, which are the binding sites for the CstF complex. While the roles of elements located upstream or beyond our investigated region (−100 to +100 nt) cannot be ruled out, this result argues that ability to recruit the CstF complex is crucial for regulatability of a proximal poly(A) site when cellular polyadenylation activity changes during proliferation/differentiation. Presumably, this is due to the fact that proximal poly(A) sites have weaker upstream PAS compared with distal poly(A) sites [16], and thus are more susceptible to regulation by downstream elements. This finding is also consistent with an early discovery that CstF-64 regulates APA of IgM heavy chain gene in B cell differentiation [53].

Interestingly, by GO analysis we found that genes in group 1 tend to have functions in signal transduction, such as protein ubiquitination and phosphorylation, RNA processing, and cell cycle process (Table S5). As shown previously and here, 3′UTR lengthening and shortening can impact on miRNA targeting, mRNA stability, and protein production. Since these genes are highly regulated in cell proliferation and differentiation and may play roles in APA, it is possible that regulation of their 3′UTRs by APA forms feedforward and feedback regulatory circuitries, which can ensure rapid regulation of their gene expression during cell proliferation and differentiation. This is to be examined in detail in the future.

While the general trend of 3′UTR regulation is consistent across different somatic cell types, the extent of regulation varies. Noise in analyzing heterogeneous data sets can be a contributing factor. However, the good sample clustering (Figures 2D and 2E) according to cell type and state using gene nRUD argues against that it is a major one. The difference in general polyadenylation activity can explain some of the variation, given the correlation between expression of poly(A) genes and sample nRUD (Figure 5A). However, cell-specific regulation is very likely to contribute additional variations in 3′UTR regulation. For example, NSCs have the most drastic shortening of 3′UTRs of all cell types analyzed, but poly(A) genes do not appear to be more regulated than in other cells (Figure 5C). Thus, other cell-specific APA regulatory mechanisms are in play in addition to regulation of general polyadenylation factors. In support of this notion, a growing number of factors have been shown to regulate polyadenylation [24]. For example, the RNA binding protein Nova has been shown to have a widespread impact on APA in neuronal cells [54]. Conversely, since the APA pattern is determined by both cell type and developmental state, it can be used as biomarker for sample clustering and classification. Since the APA pattern is calculated by comparing different isoforms, the data are internally normalized, and can be more robust than mRNA levels for separating samples.

Our RUD values were based on microarray probes targeting cUTR and aUTRs that are defined by proximal and distal poly(A) sites. Notably, about 40–50% human and mouse genes with APA contain also poly(A) sites between proximal and distal sites, which we previously named ‘middle’ poly(A) sites [16]. Analysis of middle poly(A) sites would require dividing aUTR probes into different groups, i.e. upstream and downstream of each middle poly(A) site. Due to the limited number of microarray probes for aUTRs, only a very small number of genes can be examined by this approach. Therefore, while we did observe higher probe-to-probe variations within aUTRs than those within cUTRs (data not shown), which may be partially attributable to usage of middle poly(A) sites, we grouped all aUTR probes together in this study. In a sense, our analysis detects only a general trend of upstream poly(A) site usage vs. downstream site usage. Since middle poly(A) sites are usually less frequently used than proximal and distal sites [16], [41], variation of their usage would have only a marginal effect, if any, on median-based sample RUD and detection of global trend of 3′UTR lengthening or shortening. On the other hand, it is noteworthy that regulation of some middle poly(A) sites can be functionally relevant, as a sizable fraction of these sites are conserved in mammals, and some tissues appear to favor their usage [26]. The technical limitation in analysis of middle poly(A) sites can be significantly mitigated when genome-tiling microarrays or deep sequencing techniques are employed, which are just beginning to be used in iPS cell research.

We found that microarray data generated at different times for the same biological sample can give different 3′UTR length measurements, suggesting systematic variation in sample processing. One possibility is difference in reverse transcription which starts at the 3′ end and stops at regions that are variable between samples. Presumably, this can not only affect analysis of 3′ UTRs but also gene expression. Thus, precautions need to be taken when data sets from different studies or sample processing times are compared. In this study, we used sUTRs to normalize RUD values derived from cUTRs and aUTRs. While it is possible that some sUTRs may contain alternative poly(A) sites that are yet to be identified by the PolyA_DB database used in this study [55], the method appeared to significantly reduce sample-to-sample variations. In theory, this approach can also be applied to other platforms, such as genome-tiling microarray and deep sequencing.


Microarray Data Analysis

Microarray data were downloaded from the NCBI GEO database and are listed in Table S1. The data set for embryonic development of 8 tissues were described in [18]. Gene expression analysis was carried out by the Affymetrix Expression Console software using the Robust Multi-array Average (RMA) method for normalization. The MAS 5.0 program was used to get ‘absent’ and ‘present’ calls. Affymetrix GeneChip probes were mapped to cUTR and aUTR sequences as previously described [18]. Poly(A) sites in human and mouse genomes were obtained from PolyA_DB [55]. To ensure data quality, we used only those aUTRs supported by RefSeq, and required at least 2 probes in both cUTR and aUTR for each gene. The numbers of probes used for cUTRs and aUTRs are shown in Figure S2. Analysis of 3′UTR regulation was based on ratios of cUTR probe intensities to aUTR ones across a sample set, as illustrated in Figure S1. Each gene was assigned a value named Relative Usage of Distal poly(A) site score, or RUD, reflecting relative 3′UTR length in a given sample. The median RUD of all genes in a sample is RUD for the sample.

We found that in some cases, RUD can be influenced by experimental design. As shown in Figure S3, a batch of samples processed together could have different RUD values than another batch which was not consistent with biological grouping. This systematic variation might be due to technical differences between batches of microarray experiments, such as reverse transcription, labeling, hybridization, etc. To control this variation, we included a normalization step using genes without APA, as described in detail in Figure S1. The normalized RUD value is called nRUD.

Analysis of APA Regulation

A logistic regression model was employed to examine the correlation between nRUD values and reprogramming states, i.e. iPS or somatic cells. Let p be the probability that a sample is iPS cell. We define logit (p) = ln (p/1−p) = a+b*x, where x is a vector of nRUD values; a and b are coefficients to be estimated. Chi-square test was used to derive a P-value for the model fitting. The significance of APA regulation during reprogramming was represented by a significance score (SS). SS = log10 (P-value)*s, where s = 1 when b<0, and s = −1 otherwise. Thus, a negative SS indicates 3′UTR shortening in cell reprogramming, and a positive value for lengthening.

Poly(A) Site Analysis

Conservation of poly(A) sites was analyzed by the UCSC liftover program using human and mouse genome alignments, allowing +/−24 nucleotides (nt) for finding orthologous sites, as described in [41]. Sequence conservation of the poly(A) region (−100 to +100 nt) was based on the genome alignment of human, mouse, rat and dog, obtained from UCSC MultiZ files. The average percent of identity value at each position was calculated to indicate the conservation rate for the position. To identify potential cis elements associated with 3′UTR regulation by APA, occurrences of all 5-mers in the poly(A) regions were enumerated. Poly(A) sites were divided into 5 groups based on regulation of 3′UTR in generation of iPS cells (Figure 3B). The Fisher's exact test was used to examine the significance of association between a 5-mer and gene groups, resulting in P-value 1 for bias to groups 1+2 and P-value 2 for bias to groups 4+5. A significance score (SS) was used to represent the overall significance. SS = −log10 (P-value 1) when P-value 1<P-value 2, and = log10(P-value 2) otherwise.

Gene Ontology (GO) Analysis

We used NCBI GO annotations for genes. Only the Biological Process (BP) category was used. The GO Parser program from BioPerl was used to find all associated GO terms for a given gene. Only genes with detectable signals based on A/P call of MAS 5.0 in >50% of samples were used. Fold change >1.2 and P-value <0.05 (t-test) between before and after reprogramming samples were used to select regulated genes. We used the Fisher's exact test to examine whether a significant fraction of genes associated with a GO term were upregulated (P-value 1) or downregulated (P-value 2). Each GO term was given a significance score (SS) based on the P-values. SS = −log10 (P-value 1) when P-value 1<P-value 2, and = log10(P-value 2) otherwise.

miRNA Analysis

We used miRNA seed matches to identify miRNA target sites by the TargetScan 4.1 program [13]. Target sites were those with matches at seed region (2–7 nucleotides, nt) and either M8 or A1. We required the seed region to be conserved among human, mouse, rat, and dog genomes. The alignments of UTR sequences were obtained from UCSC. miRNA information was obtained from the TargetScan 5.1 web download page. miRNAs were grouped into families based on the 2–8 nt region and only those conserved between human and mouse in the seed region were used (211 families in total). To predict miRNA effects, we used Fisher's exact test to examine whether a significant fraction of targeted genes for a given miRNA were upregulated (P-value 1) or downregulated (P-value 2) using fold change >1.2 and P<0.05 (t-test) as cutoff. Significance score (SS) was then derived as described above for SS in GO analysis.

Promoter Analysis

Position-specific scoring matrices (PSSMs) for transcription factor binding sites (TFBS) were obtained from the Transfac database (Version 11.4). Only those with quality score 1–4 were used. The transcription start sites (TSS) were defined by RefSeq. The MATCH tool [56] was used to scan the −700 nt to +300 nt promoter region surrounding TSS with the mode to minimize false negative. We required all hits to be 100% conserved between human and mouse genomes. Fisher exact test was used to identify TFBS that were significantly associated with RNA processing genes. For factors with multiple PSSMs, the one with the most significant P-value was shown.

Supporting Information

Figure S1

Analysis of APA using normalized Relative Usage of Distal poly(A) site score (nRUD). Method to calculate nRUD. Top, a gene with APA; Bottom, a gene without APA. CDS, coding sequence; cUTR, constitutive UTR; aUTR, alternative UTR; sUTR, single UTR; AAA, poly(A) tail. Red bars are Affymetrix (Affy) GeneChip probes.

(0.02 MB PDF)

Figure S2

Histograms of cUTR and aUTR probe numbers for Affymetrix HU133 v2.0 GeneChip and Mouse 430 v2.0 GeneChip. All human data sets used Hu133 v2.0 and all mouse data sets used Mouse 430 v2.0.

(0.01 MB PDF)

Figure S3

Correction of systematic differences between data sets using the nRUD method. (A) Data set for the generation of iPS cells from human BJ fibroblast (BJ in Figure 1). (B) RUD values without normalization. (C) nRUD values, i.e. RUD values with normalization. Samples in the same iPS cell group have more consistent nRUD values than original RUD values. (D) RUD' values derived from probes targeting sUTRs. The difference between February/March samples and May samples indicates systematic differences in sample processing.

(0.02 MB PDF)

Figure S4

Dynamic regulation of 3′UTR by APA in generation and differentiation of iPS cells. Shortening of 3′UTRs in generation of iPS cells from adult mouse neural stem cells (NSC), and lengthening of 3′UTRs in differentiation of iPS cells to NSC.

(0.02 MB PDF)

Figure S5

Consistent regulation of 3′UTR in generation of iPS cells across different sample sets. (A) Heatmap showing 674 mouse genes with APA surveyed in 6 sample sets. The 3′UTR regulation was measured by gene nRUD, which is represented by color according to the scale shown in the graph, with red indicating 3′UTR lengthening and green 3′UTR shortening. Samples and genes were also clustered using gene nRUD by hierarchical clustering using Pearson Correlation. (B) Genes in (A) with 3′UTR shortening in 5 out of 6 sample sets were selected (47% of total). (C) The percent of selected genes (47%) is significantly higher than expected. The histogram shows distribution of percent of genes having 3′UTR shortening in 5 out of 6 sample sets when genes are randomized in each column. The red line in the graph indicates the observed percent of genes. (D) As in (A), 996 human genes were surveyed. (E) Genes in (D) with 3′UTR shortening in 3 out 4 sample sets were selected, excluding SC. The data for SC are included for comparison. (F) As in (C), the percent of genes selected (49%) is significantly higher than that using randomized data.

(0.28 MB PDF)

Figure S6

Regulation of 3′UTR in generation of iPS cells. (A) Left, distribution of nRUD for genes surveyed in NSC.a; Right, scatter plot of genes with APA. Each dot is a gene surveyed by microarray probes. X-axis and Y-axis are log2(A/B) values for probes targeting cUTRs and aUTRs, respectively, where A and B are average probe intensities for samples after and before reprogramming, respectively. Genes with nRUD greater than 1.5*(standard deviation of all genes) are shown in red, indicating significant 3′UTR lengthening, or green, indicating significant 3′UTR shortening. The numbers of genes for these 2 groups are indicated in insets. (B) Left, distribution nRUD for genes surveyed in SC; Right, scatter plot of genes with APA, as described for (A). (C) Selection of genes with 3′UTR regulation in generation of iPS cells using 3 cutoffs, i.e. 1, 1.5, and 2 standard deviation, as illustrated in (A) and (B). L and S are number of genes with 3′UTR lengthening and shortening, respectively. P-values are based on binomial tests for comparing L and S. (D) S/L and L/S are ratios. Three cutoffs were used to select genes.

(0.04 MB PDF)

Figure S7

Separation of samples using gene nRUD. Sample cluster shown in Figure 2D with sample names.

(0.03 MB PDF)

Figure S8

Poly(A) genes are significantly upregulated during cell reprogramming from different mouse cell types. For each data set, the ratios of gene expression after reprogramming to that before reprogramming for all genes and poly(A) genes were plotted in a cumulative distribution function (CDF) plot (left) and a boxplot (right). The difference between all genes and poly(A) genes is significant (P-value <0.005) for all data sets, based on Kolmogorov-Smironov test and Wilcoxon rank sum test.

(0.05 MB PDF)

Figure S9

Regulation of genes encoding auxiliary polyadenylation factors in generation of iPS cells and embryonic development. For generation of iPS cells, samples before and after reprogramming were compared. Positive values indicate upregulation after reprogramming. For embryonic development, the gene expression values in the first and last days of embryonic development were compared. Samples for 8 individual tissues and mixed tissue were used, as described for Figure 5. Only the genes with consistent trend of regulation, either upregulation or downregulation in 9 out of 10 data sets for somatic cell reprogramming are shown.

(0.02 MB PDF)

Figure S10

Distribution of miRNA target sites in different UTR groups. (A) Distribution of target sites for all 211 conserved miRNA families that are surveyed in this study. X-axis in each graph is relative location of target sites based on 10 evenly divided sub-regions in any given UTR. (B) Distribution of target sites for 6 miRNAs analyzed in this study.

(0.01 MB PDF)

Figure S11

Transcription Factor Binding Sites (TFBS) significantly associated with RNA processing genes. P-values were based on Fisher's exact test. Sequence logos for TFBS are also shown. See Methods for detail.

(0.03 MB PDF)

Table S1

Data sets used in this study.

(0.06 MB PDF)

Table S2

Microarray samples used in this study.

(0.02 MB PDF)

Table S3

Poly(A) genes analyzed in this study.

(0.11 MB PDF)

Table S4

Prediction of function in generation of iPS cells for 211 miRNA families.

(0.05 MB PDF)

Table S5

Significant Gene Ontology terms associated with genes in group 1.

(0.01 MB PDF)


We thank Diego Fraidenraich, Carol Lutz, and members of BT lab for helpful discussions.


Competing Interests: The authors have declared that no competing interests exist.

Funding: This work was funded by a grant from NIH (R01 GM084089) to BT. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.


1. Rossant J. Stem cells and lineage development in the mammalian blastocyst. Reprod Fertil Dev. 2007;19:111–118. [PubMed]
2. Takahashi K, Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006;126:663–676. [PubMed]
3. Yu J, Thomson JA. Pluripotent stem cell lines. Genes & Development. 2008;22:1987–1997. [PMC free article] [PubMed]
4. Lowry WE, Plath K. The many ways to make an iPS cell. Nat Biotechnol. 2008;26:1246–1248. [PubMed]
5. Zhou H, Wu S, Joo JY, Zhu S, Han DW, et al. Generation of induced pluripotent stem cells using recombinant proteins. Cell Stem Cell. 2009;4:381–384. [PubMed]
6. Matsui Y, Zsebo K, Hogan BL. Derivation of pluripotential embryonic stem cells from murine primordial germ cells in culture. Cell. 1992;70:841–847. [PubMed]
7. Kanatsu-Shinohara M, Inoue K, Lee J, Yoshimoto M, Ogonuki N, et al. Generation of pluripotent stem cells from neonatal mouse testis. Cell. 2004;119:1001–1012. [PubMed]
8. Guan K, Nayernia K, Maier LS, Wagner S, Dressel R, et al. Pluripotency of spermatogonial stem cells from adult mouse testis. Nature. 2006;440:1199–1203. [PubMed]
9. Conrad S, Renninger M, Hennenlotter J, Wiesner T, Just L, et al. Generation of pluripotent stem cells from adult human testis. Nature. 2008;456:344–349. [PubMed]
10. Wickens M, Anderson P, Jackson RJ. Life and death in the cytoplasm: messages from the 3′ end. Curr Opin Genet Dev. 1997;7:220–232. [PubMed]
11. Keene JD. RNA regulons: coordination of post-transcriptional events. Nat Rev Genet. 2007;8:533–543. [PubMed]
12. Garneau NL, Wilusz J, Wilusz CJ. The highways and byways of mRNA decay. Nat Rev Mol Cell Biol. 2007;8:113–126. [PubMed]
13. Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120:15–20. [PubMed]
14. Bakheet T, Williams BR, Khabar KS. ARED 3.0: the large and diverse AU-rich transcriptome. Nucleic Acids Res. 2006;34:D111–114. [PMC free article] [PubMed]
15. Vlasova IA, Tahoe NM, Fan D, Larsson O, Rattenbacher B, et al. Conserved GU-rich elements mediate mRNA decay by binding to CUG-binding protein 1. Mol Cell. 2008;29:263–270. [PMC free article] [PubMed]
16. Tian B, Hu J, Zhang H, Lutz CS. A large-scale analysis of mRNA polyadenylation of human and mouse genes. Nucleic Acids Res. 2005;33:201–212. [PMC free article] [PubMed]
17. Yan J, Marr TG. Computational analysis of 3′-ends of ESTs shows four classes of alternative polyadenylation in human, mouse, and rat. Genome Res. 2005;15:369–375. [PMC free article] [PubMed]
18. Ji Z, Lee JY, Pan Z, Jiang B, Tian B. Progressive lengthening of 3′ untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development. Proc Natl Acad Sci U S A. 2009;106:7028–7033. [PMC free article] [PubMed]
19. Colgan DF, Manley JL. Mechanism and regulation of mRNA polyadenylation. Genes Dev. 1997;11:2755–2766. [PubMed]
20. Zhao J, Hyman L, Moore C. Formation of mRNA 3′ ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol Mol Biol Rev. 1999;63:405–445. [PMC free article] [PubMed]
21. Shi Y, Di Giammartino DC, Taylor D, Sarkeshik A, Rice WJ, et al. Molecular architecture of the human pre-mRNA 3′ processing complex. Mol Cell. 2009;33:365–376. [PMC free article] [PubMed]
22. Hu J, Lutz CS, Wilusz J, Tian B. Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation. RNA. 2005;11:1485–1493. [PMC free article] [PubMed]
23. Danckwardt S, Hentze MW, Kulozik AE. 3′ end mRNA processing: molecular mechanisms and implications for health and disease. Embo J. 2008;27:482–498. [PMC free article] [PubMed]
24. Lutz CS. Alternative polyadenylation: a twist on mRNA 3′ end formation. ACS Chem Biol. 2008;3:609–617. [PubMed]
25. Edwalds-Gilbert G, Veraldi KL, Milcarek C. Alternative poly(A) site selection in complex transcription units: means to an end? Nucleic Acids Res. 1997;25:2547–2561. [PMC free article] [PubMed]
26. Zhang H, Lee JY, Tian B. Biased alternative polyadenylation in human tissues. Genome Biol. 2005;6:R100. [PMC free article] [PubMed]
27. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, et al. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456:470–476. [PMC free article] [PubMed]
28. Liu D, Brockman JM, Dass B, Hutchins LN, Singh P, et al. Systematic variation in mRNA 3′-processing signals during mouse spermatogenesis. Nucleic Acids Res. 2007;35:234–246. [PMC free article] [PubMed]
29. Flavell SW, Kim TK, Gray JM, Harmin DA, Hemberg M, et al. Genome-wide analysis of MEF2 transcriptional program reveals synaptic target genes and neuronal activity-dependent polyadenylation site selection. Neuron. 2008;60:1022–1038. [PMC free article] [PubMed]
30. Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3′ untranslated regions and fewer microRNA target sites. Science. 2008;320:1643–1647. [PMC free article] [PubMed]
31. Mayr C, Bartel DP. Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer Cells. Cell. 2009;138:673–684. [PMC free article] [PubMed]
32. Mikkelsen TS, Hanna J, Zhang X, Ku M, Wernig M, et al. Dissecting direct reprogramming through integrative genomic analysis. Nature. 2008;454:49–55. [PMC free article] [PubMed]
33. Sridharan R, Tchieu J, Mason MJ, Yachechko R, Kuoy E, et al. Role of the murine reprogramming factors in the induction of pluripotency. Cell. 2009;136:364–377. [PMC free article] [PubMed]
34. Kim JB, Zaehres H, Wu G, Gentile L, Ko K, et al. Pluripotent stem cells induced from adult neural stem cells by reprogramming with two factors. Nature. 2008;454:646–650. [PubMed]
35. Kim JB, Sebastiano V, Wu G, Arauzo-Bravo MJ, Sasse P, et al. Oct4-induced pluripotency in adult neural stem cells. Cell. 2009;136:411–419. [PubMed]
36. Maherali N, Ahfeldt T, Rigamonti A, Utikal J, Cowan C, et al. A high-efficiency system for the generation and study of human induced pluripotent stem cells. Cell Stem Cell. 2008;3:340–345. [PMC free article] [PubMed]
37. Park IH, Zhao R, West JA, Yabuuchi A, Huo H, et al. Reprogramming of human somatic cells to pluripotency with defined factors. Nature. 2008;451:141–146. [PubMed]
38. Masaki H, Ishikawa T, Takahashi S, Okumura M, Sakai N, et al. Heterogeneity of pluripotent marker gene expression in colonies generated in human iPS cell induction culture. Stem Cell Res. 2007;1:105–115. [PubMed]
39. Lowry WE, Richter L, Yachechko R, Pyle AD, Tchieu J, et al. Generation of human induced pluripotent stem cells from dermal fibroblasts. Proc Natl Acad Sci U S A. 2008;105:2883–2888. [PMC free article] [PubMed]
40. Venables WN, Ripley BD. Modern Applied Statistics with S; In: Chambers J, Eddy W, Hardle W, Sheather S, Tierney L, editors. Springer; 2002.
41. Lee JY, Ji Z, Tian B. Phylogenetic analysis of mRNA polyadenylation sites reveals a role of transposable elements in evolution of the 3′-end of genes. Nucleic Acids Res. 2008;36:5581–5590. [PMC free article] [PubMed]
42. Takagaki Y, Manley JL. RNA recognition by the human polyadenylation factor CstF. Mol Cell Biol. 1997;17:3907–3914. [PMC free article] [PubMed]
43. Perez Canadillas JM, Varani G. Recognition of GU-rich polyadenylation regulatory elements by human CstF-64 protein. EMBO J. 2003;22:2821–2830. [PMC free article] [PubMed]
44. Gangaraju VK, Lin H. MicroRNAs: key regulators of stem cells. Nat Rev Mol Cell Biol. 2009;10:116–125. [PMC free article] [PubMed]
45. Marson A, Levine SS, Cole MF, Frampton GM, Brambrink T, et al. Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell. 2008;134:521–533. [PMC free article] [PubMed]
46. Judson RL, Babiarz JE, Venere M, Blelloch R. Embryonic stem cell-specific microRNAs promote induced pluripotency. Nat Biotechnol. 2009;27:459–461. [PMC free article] [PubMed]
47. Farh KK, Grimson A, Jan C, Lewis BP, Johnston WK, et al. The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science. 2005;310:1817–1821. [PubMed]
48. Sood P, Krek A, Zavolan M, Macino G, Rajewsky N. Cell-type-specific signatures of microRNAs on target mRNA expression. Proc Natl Acad Sci U S A. 2006;103:2746–2751. [PMC free article] [PubMed]
49. Majoros WH, Ohler U. Spatial preferences of microRNA targets in 3′ untranslated regions. BMC Genomics. 2007;8:152. [PMC free article] [PubMed]
50. Chi SW, Zang JB, Mele A, Darnell RB. Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. Nature. 2009;460:479–486. [PMC free article] [PubMed]
51. Legendre M, Ritchie W, Lopez F, Gautheret D. Differential repression of alternative transcripts: a screen for miRNA targets. PLoS Comput Biol. 2006;2:e43. [PMC free article] [PubMed]
52. Cam H, Balciunaite E, Blais A, Spektor A, Scarpulla RC, et al. A common set of gene regulatory networks links metabolism and growth inhibition. Mol Cell. 2004;16:399–411. [PubMed]
53. Takagaki Y, Seipelt RL, Peterson ML, Manley JL. The polyadenylation factor CstF-64 regulates alternative processing of IgM heavy chain pre-mRNA during B cell differentiation. Cell. 1996;87:941–952. [PubMed]
54. Licatalosi DD, Mele A, Fak JJ, Ule J, Kayikci M, et al. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature. 2008;456:464–469. [PMC free article] [PubMed]
55. Lee JY, Yeh I, Park JY, Tian B. PolyA_DB 2: mRNA polyadenylation sites in vertebrate genes. Nucleic Acids Res. 2007;35:D165–168. [PMC free article] [PubMed]
56. Kel AE, Gossling E, Reuter I, Cheremushkin E, Kel-Margoulis OV, et al. MATCH: A tool for searching transcription factor binding sites in DNA sequences. Nucleic Acids Res. 2003;31:3576–3579. [PMC free article] [PubMed]

Articles from PLoS ONE are provided here courtesy of Public Library of Science
PubReader format: click here to try


Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • Compound
    PubChem chemical compound records that cite the current articles. These references are taken from those provided on submitted PubChem chemical substance records. Multiple substance records may contribute to the PubChem compound record.
  • MedGen
    Related information in MedGen
  • PubMed
    PubMed citations for these articles
  • Substance
    PubChem chemical substance records that cite the current articles. These references are taken from those provided on submitted PubChem chemical substance records.

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...