Proc Natl Acad Sci U S A. 1992 Oct 15; 89(20): 9489–9493.

Origins of genes: "big bang" or continuous creation?


Many protein families are common to all cellular organisms, indicating that many genes have ancient origins. Genetic variation is mostly attributed to processes such as mutation, duplication, and rearrangement of ancient modules. Thus it is widely assumed that much of present-day genetic diversity can be traced by common ancestry to a molecular "big bang." A rarely considered alternative is that proteins may arise continuously de novo. One mechanism of generating different coding sequences is by "overprinting," in which an existing nucleotide sequence is translated de novo in a different reading frame or from noncoding open reading frames. The clearest evidence for overprinting is provided when the original gene function is retained, as in overlapping genes. Analysis of their phylogenies indicates which are the original genes and which are their informationally novel partners. We report here the phylogenetic relationships of overlapping coding sequences from steroid-related receptor genes and from tymovirus, luteovirus, and lentivirus genomes. For each pair of overlapping coding sequences, one is confined to a single lineage, whereas the other is more widespread. This suggests that the phylogenetically restricted coding sequence arose only in the progenitor of that lineage by translating an out-of-frame sequence to yield the new polypeptide. The production of novel exons by alternative splicing in thyroid receptor and lentivirus genes suggests that introns can be a valuable evolutionary source for overprinting. New genes and their products may drive major evolutionary changes.
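The overprinting idea described above, in which one nucleotide sequence yields different polypeptides depending on the reading frame, can be illustrated with a minimal Python sketch. It is not the authors' analysis method. It assumes the standard genetic code and uses a hypothetical toy sequence; the names `CODON_TABLE`, `translate`, and `forward_frames` are introduced only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order at each
# position (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0):
    """Translate a DNA sequence starting at offset `frame` (0, 1, or 2).

    Trailing bases that do not form a complete codon are ignored.
    Characters other than A, C, G, T raise KeyError.
    """
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def forward_frames(seq):
    """Return the translations of all three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


if __name__ == "__main__":
    toy = "ATGGCCAAATGA"  # hypothetical sequence, not taken from the article
    for frame, peptide in enumerate(forward_frames(toy)):
        print(frame, peptide)
```

Each frame of the same toy sequence gives an unrelated peptide, which is the sense in which an out-of-frame translation is informationally novel relative to the original coding sequence.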

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

