![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright © 2005, The National Academy of Sciences Evolution Sequence conservation, relative isoform frequencies, and nonsense-mediated decay in evolutionarily conserved alternative splicing *Department of Bioengineering and ‡Howard Hughes Medical Institute and Department of Genome Sciences, University of Washington, Box 357730, Seattle, WA 98195 † To whom correspondence may be addressed. E-mail: baek/at/u.washington.edu or phg/at/u.washington.edu. Contributed by Phil Green, July 19, 2005 Freely available online through the PNAS open access option. This article has been cited by other articles in PMC.Abstract Studies of expressed sequence tag data sets have revealed large numbers of splicing variants for human genes, but it remains challenging to distinguish functionally important variants from aberrant splicing, clarify the nature of the alternative functions, and understand the signals that regulate splicing choices. To help address these issues, we have constructed and analyzed a large data set of 1,478 exon-skipping alternative splicing (AS) variants evolutionarily conserved in human and mouse. In about one-fifth of cases, one isoform appears subject to nonsense-mediated mRNA decay (NMD), supporting the idea that a major role of AS is to regulate gene expression; one-quarter of these NMD-inducing cases involve a conserved exon whose apparent sole purpose is to mediate destruction of the message when included. We explore sequence conservation likely related to splicing regulation, using in part a measure of the overall amount of conserved information in a sequence, and find that the increased conservation that has been observed within AS exons primarily affects synonymous sites, suggesting that regulatory signals significantly constrain synonymous substitution rates. We show that a lower frequency of the inclusion isoform relative to the exclusion isoform tends to be associated with weaker splice site signals, smaller exon size, and higher intronic sequence conservation, and provide evidence that all of these factors are under selection to control relative isoform frequencies. Some conserved instances of AS appear to represent aberrant splicing events that by chance have occurred in both species, and we develop a nonparametric likelihood approach to identify these. Two known roles for alternative splicing (AS) are to allow several proteins to be produced from a single gene, and to reduce gene expression by yielding isoforms that are degraded by nonsense-mediated mRNA decay (NMD) or other mechanisms (1, 2). Genome-wide studies using cDNA and EST databases have suggested that 22–74% of mammalian genes are alternatively spliced, with perhaps 35% of AS isoforms predicted to be subject to NMD (2–8). However, the biological significance of these estimates remains unclear, because many database variants may represent aberrant splicing events lacking a functional role (ref. 9 and references therein). Given that the primary function of NMD is presumably to remove aberrant transcripts, it is particularly difficult to assess the significance of NMD isoforms. Filtering out variants that are supported by few ESTs (5, 6), that have been found only in tumor cells or cell lines (10), or other anomalies (9), is not guaranteed to catch all aberrant splicing cases, and may incorrectly eliminate functional rare variants. Determining which variants are biologically important consequently remains a major challenge. An important approach to identifying functionally important AS is to focus on isoforms common to orthologous genes in two species (11–14). This method does not eliminate the possibility of false positives representing aberrant events that have occurred in both species, or false negatives resulting from insufficient data or lineage-specific functional AS (15), but it is probably the most reliable current computational approach to identify functional AS isoforms on a large scale. Comparisons of conserved and nonconserved AS reveal systematic differences, with nonconserved AS exons appearing more similar to constitutively spliced (CS) exons in several respects (16); this was interpreted as indicating that the nonconserved cases are likely enriched for aberrant events (16). Conserved AS exons display increased exonic and flanking intronic sequence conservation compared to CS exons (12–14), indicating that they have more bases under purifying selection, in accord with experimental work finding a high density of splicing enhancers and suppressors associated with AS regulation, in both exons and introns (refs. 17 and 18 and references therein). It is also known that, in comparison to CS exons, AS exons tend to be smaller and have weaker splice site signals (14, 17, 19–23). Here, we attempt a more detailed characterization of evolutionarily conserved AS. We focus on exon skipping (the most frequent type of AS in mammals; ref. 13) and classify AS cases by whether they are predicted to result in isoforms subject to NMD and whether the skipped “cassette” consists of single or multiple exons. Our data set is considerably larger than in earlier studies, which permits more powerful modes of analysis and several unique insights. Two methodological innovations in our analyses include a nonparametric likelihood-based approach to identify conserved AS cases that likely represent aberrant splicing that has occurred by chance in both species (if unremoved, such cases can bias analysis results), and a simple quantitative measure, the “effective number of conserved nucleotides” (Nc) to estimate the overall amount of evolutionarily conserved regulatory information associated with particular exons. We find that a significant number of apparent NMD-inducing AS isoforms are conserved in mouse and human and display increased sequence conservation relative to CS exons, supporting Lewis et al.'s suggestion (6) that some NMD-inducing AS is functionally important (although the proportion of overall cases we find that are NMD-inducing is smaller than in ref. 6). We show that the increased sequence conservation that had previously been observed in AS exons mainly affects synonymous sites, such that AS exons have substantially lower synonymous substitution rates and slightly lower nonsynonymous substitution rates than CS exons, likely reflecting the presence of exon splicing enhancers and suppressors. (Iida and Akashi had previously noted a lower synonymous substitution rate in AS exons, but proposed that it was due to selection on codon usage; ref. 24). Small CS exons also exhibit increased conservation in synonymous sites (although to a lesser degree than AS exons), suggesting that a higher density of signals is required to facilitate their splicing as well. We show that among AS genes, a higher rate of exon exclusion tends to be associated with weaker splice site signals, smaller exon size, and higher intronic sequence conservation, and that all of these factors appear to be under independent selection to control relative isoform frequencies. A model that predicts exclusion rate based on these factors confirms that putative NMD-inducing isoforms are underrepresented relative to expectation, consistent with their removal from the mRNA pool. Materials and Methods Supporting Information. For further details, see Supporting Text, Figs. 7–12, and Tables 3–8, which are published as supporting information on the PNAS web site. Identification of Conserved AS and CS Isoforms. For details of the procedures and a schematic overview, see Supporting Text and Fig. 7. We aligned 38,258 human near-full-length protein-coding cDNAs to the human genome sequence; after filtering out unspliced or problematic alignments, there were 28,172 aligned cDNAs (called “reference cDNAs” below) in 14,394 distinct genes. We obtained 6.26 million alignments of human ESTs and cDNAs to the human genome from the University of California, Santa Cruz, Genome Bioinformatics Site (25) (July 23, 2004, release), which reduced to 1.69 million alignments after filtering. The same procedures were used to obtain 26,239 mouse reference cDNAs for 12,499 distinct genes and 5.23 million alignments (0.95 million after filtering) of mouse ESTs and cDNAs (July 23, 2004, release) to the mouse genome. For each EST or cDNA that was not itself full length as judged by comparison to the reference cDNA, we inferred a putative reconstructed full-length structure by combining with the upstream and downstream structure from the corresponding reference cDNA. The reconstructed coding sequence was then conceptually translated to identify reading frame and the location of the stop codon. If the latter was ≥50 bp upstream of the final exon–exon junction, the corresponding transcript was designated NMD-inducing (26–28). (It should be noted that this 50-bp rule is imprecise; although it accords with most available experimental data, it may sometimes be violated and the efficiency of NMD is in any case somewhat variable; ref. 29.) Instances of exon-skipping AS were identified by comparing the genomic coordinates of pairs of reconstructed cDNAs from the same gene. We used the University of California, Santa Cruz, blastz (30) alignments of the mouse and human genomes to identify orthologous exons having identical predicted status (CS, or frame-preserving, non-frame-preserving non-NMD, or NMD-inducing AS) in both species. The data set of mouse–human conserved AS isoforms and their characteristics is given as Table 4. Splice Site Scores. From the aligned human reference cDNAs, a nonredundant set of internal exons was identified and grouped into four bins based on the G+C content of the flanking 200-kb region. Gene-specific log odds (base 2) weight matrices (31) for a window of 15 intronic and five exonic bases around the splice site were computed for donor and acceptor sites, by using bin and position-specific “foreground” frequencies of nucleotides from the sites in the bin, and gene-specific background frequencies determined from the transcribed region of the gene. A similar procedure was used to find weight matrices for mouse splice sites. Sequence Conservation. We used blastz-aligned mouse and human orthologous regions (exons, splice site 20-bp windows as defined above, or intronic 20-bp windows immediately adjacent to the splice site windows) to compute percent identity (PI) with respect to the human sequence. Unaligned positions were considered to be mismatches, and gaps in the mouse sequence were disregarded. Synonymous and nonsynonymous substitution rates (Ks and Ka) and their confidence intervals were estimated by the Pamilo–Bianchi–Li algorithm (32, 33) as implemented in the mega2 package (34). The conserved (non-splice site) intronic region was determined by assigning columns in the blastz alignment a score of +1 if the bases were identical, and –3 otherwise, and finding the highest scoring segment that starts at the end of the splice site window (i.e., at base 16 from the site) and extends further into the intron (truncating the search at the start of the next exon). Effective Number of Conserved Nucleotides. The conserved intronic region identified as above likely consists of a mixture of neutrally evolving bases and functional (enhancer or silencer) sites, so its length does not provide an accurate measure of the amount of conserved information. We propose the following simple method to quantitate this information, given a pair of orthologous aligned sequences. A region of size no aligned bases with average (fractional) identity ro between the two sequences is assumed to consist of nn neutrally evolving positions with average identity rn, and nc positions under purifying selection with average identity rc > rn. Then, ncrc + nnrn = noro. Because nn = no – nc we have nc = no(ro – rn)/(rc – rn). We estimate the neutral value rn from intergenic and intronic regions within a 200-kb window centered on the exon, separately for each gene (to allow for variation in the rate of neutral evolution across the genome; ref. 35). Although rc is unknown (and may vary for different functional motifs), note that the effect of using a different fixed value for rc is simply to multiply nc by a constant factor. Therefore, we can for simplicity take rc = 1.0. The value of nc obtained for this rc is the equivalent number of perfectly conserved nucleotides that would yield the same overall conservation level. We designate this Nc and call it the “effective number of conserved nucleotides.” A similar estimate for exonic synonymous sites is obtained by using a more complex procedure (see Supporting Text). Exon Exclusion Rate. Exon exclusion rate was computed as the number of ESTs or cDNAs supporting the exclusion isoform, divided by the total number of ESTs or cDNAs spanning a genomic region that contains the relevant exons; this constitutes a rough average over tissues according to their representation in the EST database, and in particular does not distinguish intertissue from intratissue differences in isoform frequencies. Statistical Analyses. Where relevant, a single value for each characteristic (e.g., exon size, splice site score) was obtained by averaging the human and mouse values. (In general, the mouse and human values are strongly correlated; for example, R2 for exon exclusion rate is 0.68.) Correlations between different characteristics were computed as nonparametric Spearman rank correlations. Ks and Ka were compared for different sets of exons by using a z test with analytically computed variances (32, 33). For other statistical tests, a two-sample Student t test was performed. Vertical error bars in figures represent 95% confidence intervals, which were computed assuming normality. All P values are two-sided. Results and Discussion Evolutionarily Conserved Exon-Skipping Includes Many Potential NMD-Inducing Cases. By comparing mouse and human EST and cDNA genomic alignments we detected 1,478 evolutionarily conserved exon-skipping AS variants (including 1,742 skipped exons in 1,093 distinct genes), and 14,368 CS exons within coding regions. We classified the AS variants according their effect on reading frame and the potential to induce NMD as predicted by the 50-bp premature termination codon rule (26–28). We also subcategorized AS cases according to whether single or multiple exons were skipped and whether the two isoforms represented simple presence/absence or mutually exclusive skipping (see Fig. 8 for examples of these types). Table 1 confirms that frame-preserving AS is the most prevalent type, accounting for 62% (920 of 1,478) of conserved AS cases. However, it is striking that 38% of conserved cases do not preserve frame, and 24% (361 of 1,478) appear to be subject to NMD. Use of a filter to remove likely aberrant splicing cases (see below) reduced this value to 21% (192 of 900 single-exon skipping cases). The latter figure is somewhat lower than the estimate of one-third in ref. 6, derived from human EST data without regard to conservation; however, it should be noted that our results suggesting underrepresentation of NMD-inducing AS isoforms (see below) imply that prevalence of NMD-inducing AS may be higher than we currently estimate. An earlier study of conserved AS based on substantially smaller data sets obtained a higher estimate (73%) for the fraction of frame-preserving AS (16); this probably reflects the fact that underrepresented NMD-inducing isoforms are more likely to be revealed as we obtain increasing amounts of data.
Roughly 14% (204 of 1,478) of AS cases involve multiple exons. One-third (29 of 88) of the frame-preserving multiple-exon skipping cases involve a combination of exons that individually have lengths not a multiple of 3, and which would therefore be frame-shifting if individually deleted. Sequence Conservation by AS Category. Previous studies (12–14, 24) have shown that evolutionarily conserved AS is accompanied by increased sequence conservation in both exons and introns compared to CS exons, likely reflecting the presence of splicing regulatory elements subject to purifying selection. We verified that each of the major AS classes in Table 1 exhibits significantly elevated sequence conservation in exons and 20-bp windows surrounding splice sites, relative to CS exons (Figs. (Figs.11
Discrimination of Misclassified Exons. As has been widely recognized (9, 10, 36), EST and cDNA databases contain numerous variants that represent aberrant splicing rather than functionally important AS. Our evolutionarily conserved AS cases could include two types of such variants: aberrantly included cryptic exons that derive from neutrally evolving intronic DNA, and exons that are functionally CS but have been aberrantly excluded from some transcripts. However, there is an asymmetry in the expected frequencies of these events. Cryptic exons should be rare in our data set given the requirement for orthologous isoforms in both species, because the high neutral sequence divergence between mouse and human (>30%) makes cryptic inclusion events in precisely orthologous locations in both species unlikely. In contrast, depending on rates of aberrant skipping and EST sampling depth, occasional aberrant skipping of the same functionally CS exon might well occur in both species. In fact, a consequence of the “exon definition” model (23) is that aberrant skipping is likely to be a relatively common type of splicing error, because failure at any step in exon recognition would tend to cause the exon to be skipped. With enough data, aberrant splicing events may cause virtually all genes to appear alternatively spliced (5), so conserved AS cases that represent congruent skipping errors in the two species will become increasingly frequent as the EST database expands. Consistent with this possibility, scatterplots (Fig. 10) indicate that some AS exons with low exclusion rate appear more similar to CS exons than to other AS exons with respect to sequence conservation, splice site score, and/or size, suggesting that the few exclusion transcripts for these exons may represent aberrant events. Moreover, some exons classified as CS in our data set may represent AS exons for which the exclusion isoform does not yet have EST support in either species. Consequently, as described in Supporting Text, we developed a log-likelihood-based discriminator LL(E) based on seven characteristics (exon size, combined splice site score, whether the exon size is a multiple of three, and four measures of exonic and intronic sequence conservation) to help distinguish “misclassified” exons. (We regard an exon as correctly classified as AS only if both variants are functionally important in the sense that selection acts to preserve them, a criterion that admittedly may not be possible to reliably determine without substantial experimental work, but for which a reasonable circumstantial case can often be made based on sequence conservation and other features.) AS exons should generally have positive values for LL(E), whereas CS exons should have negative values. Roughly 29% of the 1,274 single-exon skipping AS exons in our data set have LL(E) values <0, reflecting sequence characteristics indicative of CS exons, whereas 11% of the 14,368 CS exons have LL(E) values ≥0, reflecting sequence characteristics indicative of AS exons (Fig. 11). Fig. 11 shows evidence of a separate “peak” of AS exons with LL(E) scores <0; and these AS exons have, on average, a lower exclusion rate than AS exons with positive LL(E) scores [0.169 for AS exons with LL(E) < 0 vs. 0.453 for AS exons with LL(E) ≥ 0], consistent with the possibility that their exclusion transcripts represent aberrant splicing, although some may represent true AS cases at the distribution tail. Given that the LL(E) score distributions of true AS and CS exons probably overlap somewhat, the discriminator is not perfect. Many of the analyses we perform below could be biased either by including misclassified cases, or by excluding true cases whose values are at the distribution tail of the characteristic being analyzed. To minimize both biases to the extent possible, for each analysis we developed an LL(E) score based on an appropriate subset of the seven characteristics (i.e., not including parameters related to the ones being analyzed), and used this to filter out likely misclassified cases. The filters, and which analyses they were used in, are shown in Table 5. Analogous classifiers (however, based on an assumption that all conserved AS cases are genuine, and using different statistical principles) have recently been developed by several other authors (14, 37–39); their characteristics are compared in Table 6. Exonic Sequence Conservation Patterns. To illuminate patterns of exonic sequence conservation, we compared Ks (synonymous substitutions per synonymous site) and Ka (nonsynonymous substitutions per nonsynonymous site) in filtered CS and frame-preserving AS exons. Increased sequence conservation in AS exons derives primarily from synonymous sites: average Ks in AS exons is one third that in CS exons (0.166 vs. 0.510, P < 10–10), whereas Ka is only moderately lower (0.0648 vs. 0.0808, P < 10–10). Although the conservation is slightly stronger near splice sites, it is dispersed broadly over the entire exonic region (Fig. 2A
In general, small exons are thought to be poorly recognized by the splicing machinery, possibly as a result of steric hindrance between snRNPs and other splicing factors (22, 43); presumably to compensate for this, small CS exons tend to be accompanied by stronger splice site signals and splicing enhancer motifs (22, 44). Because AS exons tend to be small (21, 23), it is important to consider the possibility that the increased sequence conservation in AS exons is due solely to their size. Consequently, we examined Ks and Ka for CS and AS exons by exon size (Fig. 2B
These findings suggest that synonymous sites are important carriers of splicing regulation information in AS and small CS exons. One implication is that the common practice of using Ks to represent the neutral rate “baseline” for the purpose of assessing selection on protein coding sequences with the Ka/Ks ratio can be misleading for such exons. Factors Influencing Exclusion Rate. As a preliminary step in understanding the factors controlling the relative frequencies of AS isoforms, we computed correlations of various sequence characteristics with exclusion rate (Table 7), using frame-preserving AS cases because exclusion rates for NMD-inducing AS cases are likely distorted as a result of underrepresentation of the NMD-inducing isoform (see below). After filtering, splice site PI, intronic Nc, intronic PI, strength of splice site signal, and exon size all show significant correlation with exclusion rate, but exonic PI does not. Figs. Figs.44
To help interpret these trends we examined the correlation of exon size with other characteristics in CS exons (Table 8). In general the correlations, although highly statistically significant, are relatively weak; however, Fig. 6
We can now begin to interpret the exclusion rate trends. A fundamental question is the following: does selection act to produce particular exclusion rates, in which case the trends reflect the mechanisms by which the organism achieves the desired rates (e.g., weaker splice sites and smaller exons to produce a high exclusion rate); or is the observed exclusion rate instead simply a passive result of splice site score and exon size? The answer may depend on the gene to some degree, but overall our results on sequence level selection appear to favor the first possibility: if exclusion rate were not under selection, it is hard to understand why at higher exclusion rates there should be both low splice site scores and strong selection on those splice sites to maintain their sequences (Figs. (Figs.44 We carried out a multivariate regression to derive a linear model to predict exclusion rate as a function of exon size, intronic Nc, splice site score, and intronic PI (Table 2). The model is oversimplified in that it does not take into account the precise mix of enhancer and silencer motifs, and their tissue and/or developmental stage specificity, all of which must be important in determining exclusion rate for individual genes, but it may give some insight into the overall processes involved. The regression results confirm that each of the four factors contributes significantly to the determination of exclusion rate (Table 2), and together account for about one-quarter of its variance (R2 = 0.26).
The exclusion rate trends suggest that the organism achieves higher exclusion rates by dispersing the sequence information required for splicing, moving it out of the splice sites and into the introns and exons. Both the splice site signal and exon size play in some sense a “suppressor” role, which is presumably compensated to some degree by exonic and intronic enhancers. It will be of interest to assess the relative frequencies of enhancer and suppressor elements as a function of exclusion rate. Underrepresentation of NMD-Inducing AS Isoforms. Because they are removed by NMD, NMD-inducing AS isoforms are expected to be underrepresented in the EST and cDNA databases. To test this, we applied our regression model to predict the exclusion rate for each NMD AS case. For the “exclusion isoform NMD” category, the average predicted exclusion rate is 0.239, significantly higher (P < 10–10) than the observed rate of 0.107. For the “inclusion isoform NMD” category, the predicted rate (0.311) is significantly lower (P < 10–10) than the observed rate (0.696). In both cases, the direction of the deviation is consistent with the NMD-inducing isoform being underrepresented relative to what is predicted by the regression model, suggesting that this isoform is in fact being removed from the mRNA pool. These trends are reflected in the observed relative frequencies of the different AS groups along the exclusion rate axis (Fig. 12), with inclusion isoform NMD cases being relatively more frequent at high apparent exclusion rates, and exclusion isoform NMD cases more frequent at low exclusion rates. For non-frame-preserving non-NMD cases, the predicted rate (0.263) is slightly less than the observed rate (0.337) (P = 5.2 × 10–4), suggesting that some inclusion transcripts may be relatively underrepresented. It is possible that these represent instances where NMD occurs even though the 50-bp premature termination codon rule is not met (46, 47), or that they undergo decay via pathways other than NMD, such as a mRNA decay pathway that destroys transcripts with a termination codon in the last exon (48). It has been suggested that the bias toward frame preservation in AS exons with high exclusion rate reflects differential selection pressure (11). Our analyses show that NMD-inducing and other frame-altering AS exons are under comparable purifying selection to frame-preserving ones, and support an alternative hypothesis in which NMD-induced underrepresentation is responsible for the apparent bias toward frame preservation. Supporting Information
Acknowledgments We thank Steven Brenner and Chris Burge for helpful comments on the manuscript. This work was supported by the Howard Hughes Medical Institute and by a Samsung Humantech Scholarship (to D.B.). Notes Author contributions: D.B. and P.G. designed research; D.B. performed research; P.G. contributed new reagents/analytic tools; D.B. analyzed data; and D.B. and P.G. wrote the paper. Abbreviations: AS, alternative splicing; NMD, nonsense-mediated mRNA decay; CS, constitutively spliced; PI, percent identity. References 1. Graveley, B. R. (2001. ) Trends Genet. 17, 100–107. [PubMed] 2. Hillman, R. T., Green, R. E. & Brenner, S. E. (2004. ) Genome Biol. 5, R8. [PubMed] 3. Mironov, A. A., Fickett, J. W. & Gelfand, M. S. (1999. ) Genome Res. 9, 1288–1293. [PubMed] 4. Modrek, B., Resch, A., Grasso, C. & Lee, C. (2001. ) Nucleic Acids Res. 29, 2850–2859. [PubMed] 5. Kan, Z., States, D. & Gish, W. (2002. ) Genome Res. 12, 1837–1845. [PubMed] 6. Lewis, B. P., Green, R. E. & Brenner, S. E. (2003. ) Proc. Natl. Acad. Sci. USA 100, 189–192. [PubMed] 7. Johnson, J. M., Castle, J., Garrett-Engele, P., Kan, Z., Loerch, P. M., Armour, C. D., Santos, R., Schadt, E. E., Stoughton, R. & Shoemaker, D. D. (2003. ) Science 302, 2141–2144. [PubMed] 8. Croft, L., Schandorff, S., Clark, F., Burrage, K., Arctander, P. & Mattick, J. S. (2000. ) Nat. Genet. 24, 340–341. [PubMed] 9. Sorek, R. & Safer, H. M. (2003. ) Nucleic Acids Res. 31, 1067–1074. [PubMed] 10. Baranova, A. V., Lobashev, A. V., Ivanov, D. V., Krukovskaya, L. L., Yankovsky, N. K. & Kozlov, A. P. (2001. ) FEBS Lett. 508, 143–148. [PubMed] 11. Resch, A., Xing, Y., Alekseyenko, A., Modrek, B. & Lee, C. (2004. ) Nucleic Acids Res. 32, 1261–1269. [PubMed] 12. Sorek, R. & Ast, G. (2003. ) Genome Res. 13, 1631–1637. [PubMed] 13. Sugnet, C. W., Kent, W. J., Ares, M., Jr., & Haussler, D. (2004. ) Pac. Symp. Biocomput. 2004, 66–77. 14. Yeo, G. W., Van Nostrand, E., Holste, D., Poggio, T. & Burge, C. B. (2005. ) Proc. Natl. Acad. Sci. USA 102, 2850–2855. [PubMed] 15. Modrek, B. & Lee, C. J. (2003. ) Nat. Genet. 34, 177–180. [PubMed] 16. Sorek, R., Shamir, R. & Ast, G. (2004. ) Trends Genet. 20, 68–71. [PubMed] 17. Black, D. L. (2003. ) Annu. Rev. Biochem. 72, 291–336. [PubMed] 18. Matlin, A. J., Clark, F. & Smith, C. W. (2005. ) Nat. Rev. Mol. Cell. Biol. 6, 386–398. [PubMed] 19. Itoh, H., Washio, T. & Tomita, M. (2004. ) RNA 10, 1005–1018. [PubMed] 20. Clark, F. & Thanaraj, T. A. (2002. ) Hum. Mol. Genet. 11, 451–464. [PubMed] 21. Stamm, S., Zhu, J., Nakai, K., Stoilov, P., Stoss, O. & Zhang, M. Q. (2000. ) DNA Cell Biol. 19, 739–756. [PubMed] 22. Carlo, T., Sierra, R. & Berget, S. M. (2000. ) Mol. Cell. Biol. 20, 3988–3995. [PubMed] 23. Berget, S. M. (1995. ) J. Biol. Chem. 270, 2411–2414. [PubMed] 24. Iida, K. & Akashi, H. (2000. ) Gene 261, 93–105. [PubMed] 25. Karolchik, D., Baertsch, R., Diekhans, M., Furey, T. S., Hinrichs, A., Lu, Y. T., Roskin, K. M., Schwartz, M., Sugnet, C. W., Thomas, D. J., et al. (2003. ) Nucleic Acids Res. 31, 51–54. [PubMed] 26. Cheng, J., Belgrader, P., Zhou, X. & Maquat, L. E. (1994. ) Mol. Cell. Biol. 14, 6317–6325. [PubMed] 27. Thermann, R., Neu-Yilik, G., Deters, A., Frede, U., Wehr, K., Hagemeier, C., Hentze, M. W. & Kulozik, A. E. (1998. ) EMBO J. 17, 3484–3494. [PubMed] 28. Belgrader, P. & Maquat, L. E. (1994. ) Mol. Cell. Biol. 14, 6326–6336. [PubMed] 29. Neu-Yilik, G., Gehring, N. H., Hentze, M. W. & Kulozik, A. E. (2004. ) Genome Biol. 5, 218. [PubMed] 30. Schwartz, S., Kent, W. J., Smit, A., Zhang, Z., Baertsch, R., Hardison, R. C., Haussler, D. & Miller, W. (2003. ) Genome Res. 13, 103–107. [PubMed] 31. Staden, R. (1984. ) Nucleic Acids Res. 12, 505–519. [PubMed] 32. Li, W. H. (1993. ) J. Mol. Evol. 36, 96–99. [PubMed] 33. Pamilo, P. & Bianchi, N. O. (1993. ) Mol Biol. Evol. 10, 271–281. [PubMed] 34. Kumar, S., Tamura, K., Jakobsen, I. B. & Nei, M. (2001. ) Bioinformatics 17, 1244–1245. [PubMed] 35. Waterston, R. H., Lindblad-Toh, K., Birney, E., Rogers, J., Abril, J. F., Agarwal, P., Agarwala, R., Ainscough, R., Alexandersson, M., An, P., et al. (2002. ) Nature 420, 520–562. [PubMed] 36. Modrek, B. & Lee, C. (2002. ) Nat. Genet. 30, 13–19. [PubMed] 37. Dror, G., Sorek, R. & Shamir, R. (2005. ) Bioinformatics 21, 897–901. [PubMed] 38. Sorek, R., Shemesh, R., Cohen, Y., Basechess, O., Ast, G. & Shamir, R. (2004. ) Genome Res. 14, 1617–1623. [PubMed] 39. Philipps, D. L., Park, J. W. & Graveley, B. R. (2004. ) RNA 10, 1838–1844. [PubMed] 40. Cartegni, L., Chew, S. L. & Krainer, A. R. (2002. ) Nat. Rev. Genet. 3, 285–298. [PubMed] 41. Schaal, T. D. & Maniatis, T. (1999. ) Mol. Cell. Biol. 19, 261–273. [PubMed] 42. Blencowe, B. J. (2000. ) Trends Biochem. Sci. 25, 106–110. [PubMed] 43. Black, D. L. (1991. ) Genes Dev. 5, 389–402. [PubMed] 44. Sterner, D. A. & Berget, S. M. (1993. ) Mol. Cell. Biol. 13, 2677–2687. [PubMed] 45. Kriventseva, E. V., Koch, I., Apweiler, R., Vingron, M., Bork, P., Gelfand, M. S. & Sunyaev, S. (2003. ) Trends Genet. 19, 124–128. [PubMed] 46. Carter, M. S., Li, S. & Wilkinson, M. F. (1996. ) EMBO J. 15, 5965–5975. [PubMed] 47. Buhler, M., Paillusson, A. & Muhlemann, O. (2004. ) Nucleic Acids Res. 32, 3304–3315. [PubMed] 48. Delpy, L., Sirac, C., Magnoux, E., Duchez, S. & Cogne, M. (2004. ) Proc. Natl. Acad. Sci. USA 101, 7375–7380. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Trends Genet. 2001 Feb; 17(2):100-7.
[Trends Genet. 2001]Genome Biol. 2004; 5(2):R8.
[Genome Biol. 2004]Nat Genet. 2000 Apr; 24(4):340-1.
[Nat Genet. 2000]Nucleic Acids Res. 2003 Feb 1; 31(3):1067-74.
[Nucleic Acids Res. 2003]Genome Res. 2002 Dec; 12(12):1837-45.
[Genome Res. 2002]Nucleic Acids Res. 2004; 32(4):1261-9.
[Nucleic Acids Res. 2004]Proc Natl Acad Sci U S A. 2005 Feb 22; 102(8):2850-5.
[Proc Natl Acad Sci U S A. 2005]Nat Genet. 2003 Jun; 34(2):177-80.
[Nat Genet. 2003]Trends Genet. 2004 Feb; 20(2):68-71.
[Trends Genet. 2004]Genome Res. 2003 Jul; 13(7):1631-7.
[Genome Res. 2003]Proc Natl Acad Sci U S A. 2003 Jan 7; 100(1):189-92.
[Proc Natl Acad Sci U S A. 2003]Gene. 2000 Dec 30; 261(1):93-105.
[Gene. 2000]Nucleic Acids Res. 2003 Jan 1; 31(1):51-4.
[Nucleic Acids Res. 2003]Mol Cell Biol. 1994 Sep; 14(9):6317-25.
[Mol Cell Biol. 1994]Mol Cell Biol. 1994 Sep; 14(9):6326-36.
[Mol Cell Biol. 1994]Genome Biol. 2004; 5(4):218.
[Genome Biol. 2004]Genome Res. 2003 Jan; 13(1):103-7.
[Genome Res. 2003]Nucleic Acids Res. 1984 Jan 11; 12(1 Pt 2):505-19.
[Nucleic Acids Res. 1984]J Mol Evol. 1993 Jan; 36(1):96-9.
[J Mol Evol. 1993]Mol Biol Evol. 1993 Mar; 10(2):271-81.
[Mol Biol Evol. 1993]Bioinformatics. 2001 Dec; 17(12):1244-5.
[Bioinformatics. 2001]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]J Mol Evol. 1993 Jan; 36(1):96-9.
[J Mol Evol. 1993]Mol Biol Evol. 1993 Mar; 10(2):271-81.
[Mol Biol Evol. 1993]Mol Cell Biol. 1994 Sep; 14(9):6317-25.
[Mol Cell Biol. 1994]Mol Cell Biol. 1994 Sep; 14(9):6326-36.
[Mol Cell Biol. 1994]Proc Natl Acad Sci U S A. 2003 Jan 7; 100(1):189-92.
[Proc Natl Acad Sci U S A. 2003]Trends Genet. 2004 Feb; 20(2):68-71.
[Trends Genet. 2004]Genome Res. 2003 Jul; 13(7):1631-7.
[Genome Res. 2003]Proc Natl Acad Sci U S A. 2005 Feb 22; 102(8):2850-5.
[Proc Natl Acad Sci U S A. 2005]Gene. 2000 Dec 30; 261(1):93-105.
[Gene. 2000]Proc Natl Acad Sci U S A. 2003 Jan 7; 100(1):189-92.
[Proc Natl Acad Sci U S A. 2003]Nucleic Acids Res. 2003 Feb 1; 31(3):1067-74.
[Nucleic Acids Res. 2003]FEBS Lett. 2001 Nov 9; 508(1):143-8.
[FEBS Lett. 2001]Nat Genet. 2002 Jan; 30(1):13-9.
[Nat Genet. 2002]J Biol Chem. 1995 Feb 10; 270(6):2411-4.
[J Biol Chem. 1995]Genome Res. 2002 Dec; 12(12):1837-45.
[Genome Res. 2002]Proc Natl Acad Sci U S A. 2005 Feb 22; 102(8):2850-5.
[Proc Natl Acad Sci U S A. 2005]Bioinformatics. 2005 Apr 1; 21(7):897-901.
[Bioinformatics. 2005]RNA. 2004 Dec; 10(12):1838-44.
[RNA. 2004]Annu Rev Biochem. 2003; 72():291-336.
[Annu Rev Biochem. 2003]Nat Rev Mol Cell Biol. 2005 May; 6(5):386-98.
[Nat Rev Mol Cell Biol. 2005]Nat Rev Genet. 2002 Apr; 3(4):285-98.
[Nat Rev Genet. 2002]Trends Biochem Sci. 2000 Mar; 25(3):106-10.
[Trends Biochem Sci. 2000]Mol Cell Biol. 2000 Jun; 20(11):3988-95.
[Mol Cell Biol. 2000]Genes Dev. 1991 Mar; 5(3):389-402.
[Genes Dev. 1991]Mol Cell Biol. 1993 May; 13(5):2677-87.
[Mol Cell Biol. 1993]DNA Cell Biol. 2000 Dec; 19(12):739-56.
[DNA Cell Biol. 2000]J Biol Chem. 1995 Feb 10; 270(6):2411-4.
[J Biol Chem. 1995]Proc Natl Acad Sci U S A. 2005 Feb 22; 102(8):2850-5.
[Proc Natl Acad Sci U S A. 2005]Trends Genet. 2003 Mar; 19(3):124-8.
[Trends Genet. 2003]EMBO J. 1996 Nov 1; 15(21):5965-75.
[EMBO J. 1996]Nucleic Acids Res. 2004; 32(11):3304-15.
[Nucleic Acids Res. 2004]Proc Natl Acad Sci U S A. 2004 May 11; 101(19):7375-80.
[Proc Natl Acad Sci U S A. 2004]Nucleic Acids Res. 2004; 32(4):1261-9.
[Nucleic Acids Res. 2004]