![]() | ![]() |
Formats:
|
||||||||||
Copyright © 2006, European Molecular Biology Organization The expanding transcriptome: the genome as the ‘Book of Sand' 1Centre de Regulació Genòmica, Barcelona, Spain 2Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain 3Universitat Pompeu Fabra, Barcelona, Spain aGene Regulation Programme, Centre de Regulació Genòmica, Passeig Marítim 37-49, Barcelona 08003, Spain. Tel.: +34 9 3224 0956; Fax: +34 9 3224 0899; E-mail: juan.valcarcel/at/crg.es Received December 8, 2005; Accepted January 17, 2006. This article has been cited by other articles in PMC.Abstract The central dogma of molecular biology inspired by classical work in prokaryotic organisms accounts for only part of the genetic agenda of complex eukaryotes. First, post-transcriptional events lead to the generation of multiple mRNAs, proteins and functions from a single primary transcript, revealing regulatory networks distinct in mechanism and biological function from those controlling RNA transcription. Second, a variety of populous families of small RNAs (small nuclear RNAs, small nucleolar RNAs, microRNAs, siRNAs and shRNAs) assemble on ribonucleoprotein complexes and regulate virtually all aspects of the gene expression pathway, with profound biological consequences. Third, high-throughput methods of genomic analysis reveal that RNAs other than non-protein-coding RNAs (ncRNAs) represent a major component of the transcriptome that may perform novel functions in gene regulation and beyond. Post-transcriptional regulation, small RNAs and ncRNAs provide an expanding picture of the transcriptome that enriches our views of what genes are, how they operate, evolve and are regulated. Keywords: gene, noncoding, RNA, splicing, transcription ‘… his book was known as the Book of Sand, because neither the Book nor the sand have beginning or end. JL Borges Introduction Describing the genome of an organism appears to be a well-defined task, one made feasible today at virtually all levels of biological complexity by spectacular advances in DNA manipulation and sequencing (Hood and Galas, 2003). One key motivation behind genome-sequencing projects is the assumption that the nucleotide sequence of an organism provide a description of the genes, gene products and gene networks that orchestrate programs like those sustaining the metabolic activity of a cell or deploying a body plan. This assumption is largely rooted in the classical view of DNA as the repository of genetic information, proteins as effector molecules and messenger RNAs as transcripts of the DNA that are decoded into protein by the ribosome. Additional functions of RNA, for example, as transfer and ribosomal RNAs in protein translation, have been recognized as important but of limited functional repertoire (Noller, 2005). We will review here recent progress that suggests a significantly more diverse picture for the content and function of the transcriptome, the set of RNAs transcribed from the genome. In contrast with the rather univocal—and today reachable—description of genomes, transcriptomes are entities as diverse as the cell types, developmental stages, environmental conditions and pathological states that the organism harbors or faces. Their full description requires high-throughput technologies that are still under development, with new major categories of transcripts being recently discovered. We will also discuss how these new developments significantly expand—and even challenge—the classical concept of the gene and how post-transcriptional molecular events are becoming key to understand gene regulation in higher eukaryotes. A picture is emerging where the central dogma inspired by prokaryotic molecular biology provides only a peep-hole view of eukaryotic genome function, and where large fractions of what was until recently considered junk DNA are indeed transcribed and may play a role as fundamental for understanding genomes as that of dark matter for understanding cosmological phenomena (Ostriker and Steinhardt, 2003). Prevalent alternative splicing In contrast with the gene-dense structure of prokaryotic genomes, where genes of related function are often transcribed as single units that are directly translated, protein-coding genes are dispersed in, and represent only a small fraction of, eukaryotic genomes (Mattick, 2004a). Furthermore, while prokaryotic transcripts can be directly translated, the primary products of transcription of eukaryotic protein-coding genes are subject to extensive processing, including removal of internal sequences—introns—and splicing of the remaining and typically shorter sequences—exons—to generate mature, translatable mRNAs (Black, 2003) (Figure 1
Alternatively spliced mRNAs often encode different proteins with distinct—sometimes opposite—functions (Figure 2
Distinct regulatory programs Although estimates of the fraction of alternative transcripts that provide distinct/distinguishable biological functions are not yet available, there exist enough examples of functional diversity among mRNA isoforms so as to suggest significant implications for the interpretation of classical genetic results, gene knockout studies and high-throughput data that do not distinguish between alternative transcripts and their diverse associated products. One example of the latter is gene expression profiling using microarrays. Microarrays allow parallel detection of tens of thousands of RNA species that hybridize to a solid support (typically a glass slide) containing complementary probes spotted at defined positions in the array (Ferea and Brown, 1999). Conventional microarray designs use either cDNAs or oligonucleotides that hybridize along the length of a reference transcript. Such designs do not usually provide information about the various transcript isoforms present in an RNA sample. A more systematic approach to determine all the sequences transcribed from a genomic region is achieved by the use of tiling arrays (Kapranov et al, 2002; Bertone et al, 2005). These consist of sets of oligonucleotides that cover whole genomic regions, chromosomes or genomes through partially overlapping probes. Tiling arrays have the potential for high-resolution determination of the regions of a gene that are transcribed, for example, identifying novel exons. The identification of the structure of individual transcripts, however, depends on additional information connecting the various transcript elements, for example, through the use of strand-specific RACE (Cheng et al, 2005; Kapranov et al, 2005) or probes directed against exon–exon splice junctions (Matlin et al, 2005). Efforts to detect alternatively spliced isoforms using splicing-sensitive microarrays are starting to provide valuable clues about global changes in alternative splicing and its regulation, from discovery of new alternative splicing events to clustering of events according to cell type or disease stage, to the discovery of regulatory networks orchestrating multiple aspects of a biological process like synaptic transmission (Yeakley et al, 2002; Johnson et al, 2003; Pan et al, 2004; Blanchette et al, 2005; Relògio et al, 2005; Ule et al, 2005). These global analyses complement in vitro studies on the detailed molecular mechanisms that regulate particular splice site choices. The picture emerging for how splicing regulation is achieved is one in which multiple sequence elements—not only the rather short and degenerate signals that define exon–intron boundaries—contribute to establish the exonic or intronic fate of a piece of RNA (Black, 2003; Matlin et al, 2005; Singh and Valcárcel, 2005). These auxiliary sequences are frequent in coding exons, and therefore mutational changes (either during evolution or those arising in disease) are constrained by—and may have effects on—both the coding capacity and the appropriate processing of the transcript (Cartegni et al, 2002; Carlini and Genut, 2006; Parmley et al, 2006). Regulatory sequences can engage in alternative RNA secondary structures that influence splice site pairing (Buratti and Baralle, 2004; Graveley, 2005). More often, however, regulatory sequences are recognized by various families of polypeptides, like SR and hnRNP proteins, resulting in ribonucleoprotein (RNP) assemblies where synergistic and antagonistic combinatorial effects have the potential to explain differential splice site recognition by the basic splicing enzymes (Black, 2003; Singh and Valcárcel, 2005). The orchestration of such changes—for example, in a tissue-specific manner—can rely on the tissue-specific expression of key regulators (e.g. sex-specific regulators in Drosophila, or neuron-specific Nova proteins in mammals), but in many cases seems to rely upon cellular codes established by the relative levels of expression of general, ubiquitous factors (Black, 2003). The latter set of mechanisms imply regulatory networks distinct from those governing transcriptional control, with lower numbers and less tissue-specific expression of factors involved in splicing regulation. Indeed, recent data suggest that cell type-specific programs of transcription and splicing are largely nonoverlapping, implying that they provide distinct readouts of a gene's function (Pan et al, 2004; Ule et al, 2005). Understanding the operation of these mechanisms will also require insights into how splicing regulation is coupled to transcription and polyadenylation, its interface with signaling pathways, as well as the development of imaging techniques to visualize the recruitment of factors to nascent transcripts in vivo (Shin and Manley, 2004; Kornblihtt, 2005; Mabon and Misteli, 2005). Ever smaller RNAs with increasingly important functions In addition to its role in relying information from the genome for protein synthesis, RNA is well known for its structural versatility, which is the basis of its remarkable catalytic properties (Caprara and Nilsen, 2000; Hollbrook, 2005). The ribosome and possibly the spliceosome are ribozymes where protein components play important roles in correct assembly of the complexes, but where catalysis is mediated by chemical moieties of the RNA (Konarska and Query, 2005; Noller, 2005). Important components of the spliceosome are small nuclear RNAs (snRNA) that recognize splice sites and establish an RNA scaffold that holds together the sequences involved in the splicing reaction (Villa et al, 2002) (Figure 1 Another extensive family comprises small nucleolar RNAs (snoRNAs). They belong to two main classes (C/D box and H/ACA box) which are required for two types of chemical modifications (2′-O-ribose methylation and conversion of uridines to pseudouridines) occurring at specific residues in rRNAs, snRNAs, telomerase and other RNAs (Bachellerie et al, 2002; Vitali et al, 2005) (Figure 2 Few would have thought before 1998 that studying pieces of RNAs of 20 nucleotides (nt) in length would be a worthwhile pursuit rendering anything more interesting than pieces of degraded transcripts. This is, however, an intense activity today that provides unexpected insights into a variety of important biological phenomena, from stem cell maintenance to cardiomyocyte differentiation to tumor formation, with applications ranging from large-scale genetic screens to novel therapeutics (Bartel, 2004; Novina and Sharp, 2004; Eckstein, 2005). This paradigm shift started as an additional double-strand (ds) RNA control in an antisense experiment (Fire et al, 1998). dsRNA is processed to 21–28 nt fragments by the RNAse III enzyme Dicer and the fragments (small interfering RNAs (siRNAs)) assemble onto the RNA-Induced Silencing protein Complex (RISC) to direct cleavage of complementary mRNAs, which leads to mRNA degradation and remarkably efficient gene silencing (Hannon and Rossi, 2004; Meister and Tuschl, 2004; Zamore and Haley, 2005) (Figure 2 MicroRNAs and the orchestration of cellular functions Dicer, RISC and related activities are also involved in the generation of microRNAs (miRNAs) (Lee et al, 1993; Wienholds and Plasterk, 2005). These are transcribed as precursors (pri-miRNAs) that are processed in the nucleus and cytoplasm to generate RNP complexes containing 21-nt miRNAs that are partially complementary to the 3′ untranslated region (UTR) of mRNAs (Cullen, 2004) (Figures 2 Despite the challenge of finding bona fide miRNAs and miRNA targets based on limited sequence complementarity, computational and tailored cloning efforts are providing a growing list of miRNAs in multicellular organisms (Brennecke and Cohen, 2003; Hüttenhofer et al, 2004; Berezikov et al, 2005; Sontheimer and Carthew, 2005; Xie et al, 2005). Frequently, one miRNA can target multiple mRNAs and one mRNA can be regulated by multiple miRNAs targeting different regions of the 3′ UTR. Conversely, miRNA binding sequences are absent from the 3′ UTR of genes involved in basic cellular processes or of genes coexpressed with particular miRNAs (Stark et al, 2005). These features allow coordinated regulation, combinatorial control and precision and robustness to an increasing number of cell fate decisions and developmental transitions (Bartel and Chen, 2004; Farh et al, 2005; Lim et al, 2005; Wienholds et al, 2005). For example, recent reports have revealed important roles of miRNAs in the development of human cancers. The levels of about 200 miRNAs correlated with lineage and differentiation of tumor cells, and were significantly better criteria to classify poorly differentiated tumors than expression profiling of more than 2000 protein-coding genes, arguing for pivotal roles of miRNA levels in tumor development (Lu et al, 2005). Clusters of miRNAs have the properties of classical oncogenes, and modulate—and are modulated by—the activities of other oncogenes (He et al, 2005). For example, a regulatory network was recently discovered in which increased transcription of a cluster of miRNAs by the proto-oncogene c-MYC results in translational downregulation of the transcription factor E2F1, another important regulator of cell division, which is itself transcriptionally regulated by c-MYC (O'Donnell et al, 2005). Thus, miRNAs could serve in this case as part of a safety mechanism that adjusts the levels of expression of a key regulator of cell cycle progression. Another interesting aspect of the genomic organization of many snoRNAs and one-third of miRNAs is that they are transcribed as intronic sequences within pre-mRNAs (Bachellerie et al, 2002; Seitz et al, 2003) (Figure 2 Dark matter and noncoding RNAs Only about 2% of the human genome corresponds to mature mRNA sequences (Lander et al, 2001). Even considering the corresponding introns, the primary transcripts of protein-coding genes represent around 30% of genomic sequence, with ribosomal and small RNAs adding another small fraction. Recent data from phylogenetic analyses and various high-throughput approaches, however, challenge the picture of higher eukaryotic genes as jewel planets on a much larger void of possibly junk DNA (Bejerano et al, 2004a, 2004b; Siepel et al, 2005), and suggest that a large part of the genome—possibly most of it—is indeed transcribed (Cheng et al, 2005; Kapranov et al, 2005). These technologies detect at least as many RNAs that do not have significant coding capacity (non-protein-coding RNAs (ncRNAs)) as mRNAs, explaining at least part of the discrepancy between the number of transcripts determined by random sequence of cDNA libraries and the number of genes predicted computationally on the basis of their coding potential (Suzuki and Hayashizaki, 2004; Claverie, 2005). For example, massive cDNA sequencing efforts have documented more than 180 000 distinct mouse transcripts from 22 000 estimated genes (Carninci et al, 2005). In addition to revealing more than 16 000 potential novel protein-coding genes, the lion's share of this expanded transcript collection corresponds to ncRNAs. Another surprising observation was that more than 70% of the transcripts overlap with an RNA synthesized from the opposite DNA strand, with frequent pairs of coding and noncoding RNAs adopting this configuration. Very similar conclusions were drawn from results using tiling arrays covering several human chromosomes. For example, combining information on transcription factor binding sites with expression data from tiling arrays of human chromosomes 21 and 22, comparable numbers of protein-coding and noncoding transcripts were identified as regulated by well-known transcription factors (Cawley et al, 2004). Intriguingly, mRNA/ncRNA pairs were often generated from overlapping or neighboring genomic regions, and were often found to be coregulated. High-resolution tiling arrays detect further complexity in the form of a high proportion of nonpolyadenylated RNAs, and also show that half of all transcribed sequences were present only in the nucleus (Cheng et al, 2005). These features (lack of polyA tails, exclusion from the cytoplasm) are foreign to most sources of annotated transcripts, and therefore open the door to a largely unexplored territory of the transcriptome and of gene regulation. Dark matter, dark energy? It is currently unclear to what extent these ncRNAs play relevant biological roles or are spurious products of transcription (junk RNA), or even if a fraction correspond to experimental artifacts of the methods of detection (Hüttenhofer et al, 2005; Johnson et al, 2005). One first consideration is that even if the product of transcription does not have a function, the act of transcription of the corresponding region of the genome may open up chromatin structure and influence expression of other genes, DNA replication, recombination, etc. Second, an increasing number of independently validated ncRNAs show dynamic changes in expression that suggest their involvement in biological responses (Ravasi et al, 2006). Third, enough examples exist of ncRNAs with important functions to warrant further scrutiny. Now classical examples include ncRNAs important to assemble chromatin remodeling complexes that modulate the transcriptional activity of whole chromosomes in Drosophila and mammals to allow X-chromosome dosage compensation in different sexes (Bernstein and Allis, 2005; Lucchesi et al, 2005). Another example of such scaffolding activity has been recently provided by a tissue-culture screen in which the effect of knocking down the expression of 512 ncRNAs on 12 cellular assays was analyzed, rendering eight RNAs with effects on nuclear trafficking, cell viability or Hedgehog signaling (Willingham et al, 2005). For example, the NRON transcript was found to nucleate the assembly of nuclear trafficking factors with important consequences for nucleocytoplasmic transport. It would have been interesting to determine what fraction of a random collection of 512 protein-coding genes would score in the same assays. Various additional functions can be postulated for the vast continent of ncRNAs. Transcripts antisense of a coding mRNA can obviously modulate expression by forming sense–antisense pairs that prevent translation or act as sources of dsRNA that can be processed to generate siRNA species that affect chromatin structure, mRNA stability or translatability (Katayama et al, 2005) (Figure 2 One caveat for considering ncRNAs as the genome's second major depository of functional information is why genetic screens have not provided as many hits in ncRNA genes as in coding transcripts. Several explanations can be proposed for this discrepancy. A first consideration is target size, which may lower the likelihood of mutational events in certain small RNAs. Second, we do not know the nature of the functional determinants in these transcripts. Single-nucleotide substitutions or small deletions or insertions anywhere in a coding region can have dramatic effects on the protein-coding potential of an mRNA. Such mutations may be easier to accommodate to maintain functional ncRNAs, consistent with the limited phylogenetic conservation of some ncRNAs with well-known functions (Pang et al, 2006). Third, as some miRNAs appear to regulate multiple targets, it may be difficult to isolate viable mutants with specific phenotypes from extensively networked genetic elements. Finally, a cultural bias may have operated to discard mutants not associated with identifiable genes, a tendency that is likely to be reversed with the increasing awareness of the nature and properties of ncRNA genes. While the number of protein-coding genes in a genome does not have a clear correlation with the complexity of the organism, the amount of noncoding DNA appears to increase steadily after the transition to multicellularity. It has been argued that ncRNAs (including regulatory small RNAs like miRNAs) may play key roles in orchestrating increasingly more sophisticated regulatory networks that sustain higher levels of complexity (Mattick, 2004a, 2004b; Mattick and Makunin, 2005). If this were true, and similar to cosmologists' postulate for the existence of some form of dark energy that sustains an ever-increasing expansion rate of the universe (Springel et al, 2005), the once considered junk of the genome could be the energy behind the expanding complexity of multicellular organisms. In addition, it seems likely that the evolution of complexity is not based only on increased numbers of protein-coding genes but on increasingly sophisticated gene networks (Mattick, 2004a, 2004b). An overriding principle is the modularity of regulatory sequences. These allow precise control of promoter activity based upon a specific constellation of transcription factors, control of the proportion between alternatively spliced isoforms upon varying ratios of splicing regulators, control of the efficiency of translation upon the combination of available translation factors—including miRNAs, etc. It is thus easy to envision how a set of genes jointly regulated by a particular set of transcription factors can be differentially expressed by alternative splicing or translational control (Figure 3 Book of sand The argentinian writer JL Borges conceived an ‘infinite book' where new pages constantly appear as the reader attempts to open the book between the cover and the first page. The pages of the book of the genome also seem to multiply themselves as we open it. The discovery of the ‘one gene, one enzyme' concept that once unified genetics and biochemistry was followed by the discovery that single genes can express numbers of isoforms that exceed the number of genes in the organism. The emergence of small RNAs as essential actors in RNA processing ushered the discovery of new expanding categories of genetic elements—for example, miRNAs—with remarkable networking properties. Even the once considered empty pages of the genome now appear to be responsible for more than half of the transcriptional output of complex genomes, largely unexplored in identity and function. It would probably be unwise to think that this is all the complexity that remains to be discovered. Hybrid transcripts containing pieces of independent transcriptional units (generated for example by read-through transcription or trans-splicing events) are common in some organisms and can occur in higher eukaryotes as well (Hastings, 2005; Takahara et al, 2005). Detection of transcripts antisense to processed mRNAs suggests the possibility that RNA-mediated RNA synthesis produces RNA species that cannot be generated by transcription (Cheng et al, 2005). Such activities may amplify the products of transcription and also mediate transcriptional silencing, as observed in plants (Vaucheret, 2005). Certain signaling pathways (e.g. the unfolded protein response and other forms of stress) generate novel transcripts through processes that integrate gene expression and subcellular dynamics (Patil and Walter, 2001; Prasanth et al, 2005). Finally, RNA editing—the ultimate nightmare for gene annotation aficionados—is known to play a pivotal role in important biological decisions, and analysis of nucleotide products of editing reactions suggests a widespread impact of the process, particularly in the nervous system (Bass, 2002; Eisenberg et al, 2005; Stuart et al, 2005). Far from discouraging efforts to interpret the genome, the existing and foreseeable expanding complexity may prove key to understand the frame and logic of its function. A list of web resources for analyses of transcript heterogeneity is given in Supplementary Table I. Supplementary data Click here to view.(111K, pdf) Supplementary Table I Click here to view.(80K, pdf) Acknowledgments We thank Tom Gingeras, Roderic Guigó, Alexander Hüttenhofer and John Mattick for comments on the manuscript. LMS was supported by a Praxis Program fellowship from the Portuguese government. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||
Nature. 2003 Jan 23; 421(6921):444-8.
[Nature. 2003]Science. 2005 Sep 2; 309(5740):1508-14.
[Science. 2005]Science. 2003 Jun 20; 300(5627):1909-13.
[Science. 2003]Annu Rev Biochem. 2003; 72():291-336.
[Annu Rev Biochem. 2003]Bioessays. 2003 Dec; 25(12):1147-9.
[Bioessays. 2003]Nat Rev Mol Cell Biol. 2005 May; 6(5):386-98.
[Nat Rev Mol Cell Biol. 2005]Science. 2003 Dec 19; 302(5653):2141-4.
[Science. 2003]Gene. 2005 Jan 3; 344():1-20.
[Gene. 2005]Trends Genet. 2001 Feb; 17(2):100-7.
[Trends Genet. 2001]Nature. 2002 Jul 11; 418(6894):236-43.
[Nature. 2002]Annu Rev Biochem. 2003; 72():291-336.
[Annu Rev Biochem. 2003]Nat Rev Mol Cell Biol. 2005 May; 6(5):386-98.
[Nat Rev Mol Cell Biol. 2005]Curr Opin Genet Dev. 1999 Dec; 9(6):715-22.
[Curr Opin Genet Dev. 1999]Science. 2002 May 3; 296(5569):916-9.
[Science. 2002]Chromosome Res. 2005; 13(3):259-74.
[Chromosome Res. 2005]Science. 2005 May 20; 308(5725):1149-54.
[Science. 2005]Genome Res. 2005 Jul; 15(7):987-97.
[Genome Res. 2005]Annu Rev Biochem. 2003; 72():291-336.
[Annu Rev Biochem. 2003]Nat Rev Mol Cell Biol. 2005 May; 6(5):386-98.
[Nat Rev Mol Cell Biol. 2005]Nat Struct Mol Biol. 2005 Aug; 12(8):645-53.
[Nat Struct Mol Biol. 2005]Nat Rev Genet. 2002 Apr; 3(4):285-98.
[Nat Rev Genet. 2002]J Mol Evol. 2006 Jan; 62(1):89-98.
[J Mol Evol. 2006]Nat Struct Biol. 2000 Oct; 7(10):831-3.
[Nat Struct Biol. 2000]Curr Opin Struct Biol. 2005 Jun; 15(3):302-8.
[Curr Opin Struct Biol. 2005]Genes Dev. 2005 Oct 1; 19(19):2255-60.
[Genes Dev. 2005]Science. 2005 Sep 2; 309(5740):1508-14.
[Science. 2005]Cell. 2002 Apr 19; 109(2):149-52.
[Cell. 2002]Biochimie. 2002 Aug; 84(8):775-90.
[Biochimie. 2002]J Cell Biol. 2005 Jun 6; 169(5):745-53.
[J Cell Biol. 2005]Curr Opin Cell Biol. 2002 Jun; 14(3):319-27.
[Curr Opin Cell Biol. 2002]Science. 2006 Jan 13; 311(5758):230-2.
[Science. 2006]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Nature. 2004 Jul 8; 430(6996):161-4.
[Nature. 2004]Trends Biochem Sci. 2005 Aug; 30(8):445-52.
[Trends Biochem Sci. 2005]Nature. 1998 Feb 19; 391(6669):806-11.
[Nature. 1998]Nature. 2004 Sep 16; 431(7006):371-8.
[Nature. 2004]Cell. 1993 Dec 3; 75(5):843-54.
[Cell. 1993]FEBS Lett. 2005 Oct 31; 579(26):5911-22.
[FEBS Lett. 2005]Mol Cell. 2004 Dec 22; 16(6):861-5.
[Mol Cell. 2004]Science. 2005 Sep 2; 309(5740):1573-6.
[Science. 2005]Cell. 2005 Aug 26; 122(4):553-63.
[Cell. 2005]Genome Biol. 2003; 4(9):228.
[Genome Biol. 2003]Methods Mol Biol. 2004; 265():409-28.
[Methods Mol Biol. 2004]Cell. 2005 Jan 14; 120(1):21-4.
[Cell. 2005]Cell. 2005 Jul 15; 122(1):9-12.
[Cell. 2005]Nature. 2005 Mar 17; 434(7031):338-45.
[Nature. 2005]Biochimie. 2002 Aug; 84(8):775-90.
[Biochimie. 2002]Nat Genet. 2003 Jul; 34(3):261-2.
[Nat Genet. 2003]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Genome Res. 2005 Aug; 15(8):1034-50.
[Genome Res. 2005]Science. 2005 May 20; 308(5725):1149-54.
[Science. 2005]Genome Res. 2005 Jul; 15(7):987-97.
[Genome Res. 2005]Bioessays. 2004 Aug; 26(8):833-43.
[Bioessays. 2004]Science. 2005 May 20; 308(5725):1149-54.
[Science. 2005]Trends Genet. 2005 May; 21(5):289-97.
[Trends Genet. 2005]Trends Genet. 2005 Feb; 21(2):93-102.
[Trends Genet. 2005]Genome Res. 2006 Jan; 16(1):11-9.
[Genome Res. 2006]Genes Dev. 2005 Jul 15; 19(14):1635-55.
[Genes Dev. 2005]Annu Rev Genet. 2005; 39():615-51.
[Annu Rev Genet. 2005]Science. 2005 Sep 2; 309(5740):1564-6.
[Science. 2005]Cell. 2004 Feb 20; 116(4):499-509.
[Cell. 2004]Cell. 2004 Mar 19; 116(6):779-93.
[Cell. 2004]Annu Rev Biochem. 2005; 74():199-217.
[Annu Rev Biochem. 2005]Trends Biochem Sci. 2004 Jan; 29(1):11-7.
[Trends Biochem Sci. 2004]Trends Genet. 2006 Jan; 22(1):1-5.
[Trends Genet. 2006]Hum Mol Genet. 2005 Apr 15; 14 Spec No 1():R121-32.
[Hum Mol Genet. 2005]Nature. 2005 Jun 2; 435(7042):629-36.
[Nature. 2005]Trends Biochem Sci. 2000 Aug; 25(8):381-8.
[Trends Biochem Sci. 2000]Nature. 2003 Jan 23; 421(6921):444-8.
[Nature. 2003]Proc Natl Acad Sci U S A. 2005 Aug 30; 102(35):12449-54.
[Proc Natl Acad Sci U S A. 2005]Trends Genet. 2005 Apr; 21(4):240-7.
[Trends Genet. 2005]Mol Cell. 2005 Apr 15; 18(2):245-51.
[Mol Cell. 2005]Science. 2005 May 20; 308(5725):1149-54.
[Science. 2005]Nat Genet. 2005 Jul; 37(7):659-60.
[Nat Genet. 2005]Curr Opin Cell Biol. 2001 Jun; 13(3):349-55.
[Curr Opin Cell Biol. 2001]