Nucleic Acids Res. Sep 11, 1987; 15(17): 7155–7174.
PMCID: PMC306199

RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression.


A systematic analysis of the RNA splice junction sequences of eukaryotic protein-coding genes was carried out using the GenBank databank. Nucleotide frequencies obtained for the highly conserved regions around the splice sites agree closely across the different categories of organisms. A striking similarity among the rare splice junctions that do not contain AG at the 3' splice site or GT at the 5' splice site indicates that special mechanisms exist to recognize them, and that these unique signals may be involved in crucial gene-regulation events and in differentiation. A method was developed to predict potential exons in a bare sequence, using a scoring and ranking scheme based on nucleotide weight tables. The method found a majority of the exons in selected known genes and also predicted potential new exons that may be used in alternative splicing.
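The general idea of scoring and ranking candidate sites with a nucleotide weight table can be illustrated with a minimal sketch. This is not the paper's actual scoring formula or data: the toy 5' splice-site strings, the 9-base window (3 exon plus 6 intron positions), the simple sum-of-frequencies score, and the function names `build_weight_table`, `score_window`, and `rank_candidates` are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy set of 5' splice-site windows (3 exon + 6 intron bases).
# Illustrative strings only; they are not taken from the paper or GenBank.
TOY_DONOR_SITES = [
    "CAGGTAAGT",
    "AAGGTGAGT",
    "CAGGTAAGA",
    "GAGGTGAGT",
    "CAGGTATGT",
]


def build_weight_table(sites):
    """Return one dict per position mapping each base to its frequency (0..1)."""
    length = len(sites[0])
    table = []
    for pos in range(length):
        counts = Counter(site[pos] for site in sites)
        table.append({base: counts.get(base, 0) / len(sites) for base in "ACGT"})
    return table


def score_window(window, table):
    """Sum the table frequency of the observed base at each position.

    Bases outside A/C/G/T (for example N) contribute zero.
    """
    return sum(col.get(base, 0.0) for base, col in zip(window, table))


def rank_candidates(sequence, table, top=3):
    """Score every window of the table's width; return the best (score, start, window)."""
    width = len(table)
    scored = [
        (score_window(sequence[i:i + width], table), i, sequence[i:i + width])
        for i in range(len(sequence) - width + 1)
    ]
    # Highest score first; ties broken by leftmost position.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return scored[:top]


if __name__ == "__main__":
    table = build_weight_table(TOY_DONOR_SITES)
    bare = "TTACCAGGTAAGTCTTGGCACGTGA"  # made-up "bare" sequence
    for score, start, window in rank_candidates(bare, table):
        print(f"{start:3d}  {window}  {score:.2f}")
```

In this sketch the invariant GT at window positions 4 and 5 receives the maximum weight, so windows lacking it rank low. A real table would be built from many aligned junctions, and the scoring might normalize or weight positions differently.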



Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

