![]() | ![]() |
Formats:
|
||||||||
Ultraconservation identifies a small subset of extremely constrained developmental enhancers 1Genomics Division, MS 84-171, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA. 2United States Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA. Correspondence should be addressed to L.A.P. (Email: LAPennacchio/at/lbl.gov). Abstract Extended perfect human-rodent sequence identity of at least 200 base pairs (ultraconservation) is potentially indicative of evolutionary or functional uniqueness. We used a transgenic mouse assay to compare the embryonic enhancer activity of 231 noncoding ultraconserved human genome regions with that of 206 extremely conserved regions lacking ultraconservation. Developmental enhancers were equally prevalent in both populations, suggesting instead that ultraconservation identifies a small, functionally indistinct subset of similarly constrained cis-regulatory elements. The last common ancestor of human and rodents lived ~75 million years ago1, and yet the human genome contains 256 non-exonic ultraconserved elements of ≥200 bp that are perfectly conserved in mouse and rat as a result of extreme purifying selection2,3. Their depletion in segmental duplications and copy number variable regions4 and their strong bias toward rare, derived alleles in humans3,5 further point toward a pivotal functional role for these elements. In contrast, the identity of ultraconserved sequences as a distinct class of conserved genomic elements has been challenged by the observation that more rigorous comparative genomic methods (for example, refs. 6,7) identify additional sequences with similar conservation properties, but lacking extended perfect sequence conservation. Another feature of ultraconserved elements that undermines their relevance as a possible distinct category of conservation is that they are almost invariably embedded in larger blocks of constrained sequence, suggesting that they exist not as independent units of biological function, but as somewhat arbitrary fragments of larger functional modules. In the absence of comprehensive experimental data, it remains unclear whether the absolute sequence conservation of ultraconserved elements is indicative of a unique role, or whether they are merely a functionally indistinct fraction of a much larger set of extremely constrained elements. To explore the possible uniqueness of noncoding ultraconserved elements, we identified a large number of human-rodent conserved elements that are under similar evolutionary constraint as regions containing ultraconservation. We compared the entire set of these elements, the majority of which lack perfectly conserved regions of ≥200 bp (that is, ultraconserved regions), to the small subset that overlapped ultraconserved elements in regard to their degree of constraint in other mammalian species and their enrichment near genes with certain functions. Moreover, we compared the ability of a genome-wide set of non-exonic ultraconserved elements and more than 200 extremely constrained human-rodent elements to drive tissue-specific expression in vivo, a property that has previously been observed to be a predominant function associated with noncoding ultraconservation8–12. In an initial comparative genomic assessment of ultraconservation, we found substitutions in 79% of these elements in other mammalian species (Fig. 1a
To test whether such apparent equivalence at the sequence level is also associated with similar functional properties, we focused on transcriptional enhancer activity during embryonic development. We used a transgenic mouse assay to determine the embryonic enhancer activities of 155 human genome regions that contain non-exonic ultraconserved elements and combined these data with a previously reported smaller dataset10 to establish a genome-wide compendium of their enhancer activities. A total of 231 transgenic assays was considered, in which the tested human genome fragments included 245 of all 256 non-exonic ultraconserved elements (Supplementary Table 4 online). We found that half (115/231) of the ultraconserved regions drove reproducible reporter gene expression in various tissues of the developing mouse embryo, often in a tight spatially restricted manner and with subregions of the central nervous system among the most frequently targeted structures (Fig. 2a
To determine whether such an enrichment in embryonic enhancers is specifically associated with the presence of ultraconserved regions, we also tested the enhancer activities of 206 extremely constrained human-rodent noncoding sequences that lack regions of ultraconservation. Of note, these regions were selected blind to evolutionary conservation depth in nonmammalian species, and purely based on their human-rodent constraint scores. Using identical scoring criteria as before, we found that 102 of these 206 elements (50%) acted as tissue-specific enhancers at embryonic day 11.5 (Supplementary Table 5 online). We did not observe significant differences between the ultraconserved and non-ultraconserved elements regarding the overall distribution of the targeted anatomical structures (Fig. 2a Ultraconserved elements seem to have remained practically ‘frozen’ during mammalian evolution2, and their perfect, uninterrupted sequence identity between human and rodents has suggested that they might represent the pinnacle of extreme noncoding sequence conservation in mammals. In contrast to this proposal, and consistent with findings based on alternative comparative metrics6,7, our results support the notion that the relatively small number of ultraconserved elements may more likely be due to their definition by a simple percent-identity-plot approach14 than to a uniquely high degree of constraint of the conserved regions in which they are located. If the enrichment in enhancer activity observed in our in vivo testing of over 400 distinct genome fragments is considered as a measure, ultracon-served elements do not represent the very tail of the continuum of human-rodent conservation, but are merely a subset of a tenfold larger population of elements that are under similar constraint and have apparently equivalent regulatory function. The possibility of functional redundancy within this much larger population of conserved elements may also provide a partial explanation for the observation that some ultraconserved noncoding elements are dispensable for viability in mice15. The elements identified in this study are defined independent of their conservation in nonmammalian vertebrate species. We therefore expect that, of the hundreds of additional tissue-specific enhancers that remain to be discovered in this category of extreme conservation, some will be unique to mammals. Although subsets of extremely conserved noncoding elements undoubtedly have other molecular functions, our results indicate that a large proportion of these elements choreograph the transcription of key genes during mammalian development, regardless of whether they are ultraconserved. supplementary Click here to view.(530K, pdf) ACKNOWLEDGMENTS We thank I. Dubchak, A. Poliakov and S. Minovitsky for help with genome alignments and database development; S. Bhardwaj and S. Phouanenavong for technical assistance; and N. Ahituv, M. Nobrega, J. Noonan and members of the Pennacchio and Rubin laboratories for discussion and critical comments on the manuscript. L.A.P. was supported by grant HL066681, Berkeley-Program for Genomic Applications, under the Programs for Genomic Applications, funded by National Heart, Lung, and Blood Institute, and HG003988 funded by National Human Genome Research Institute. Research was done under Department of Energy Contract DE-AC02-05CH11231, University of California, E.O. Lawrence Berkeley National Laboratory. A.V. was supported by an American Heart Association postdoctoral fellowship. Footnotes Reprints and permissions information is available online at http://npg.nature.com/ reprintsandpermissions References 1. Waterston RH, et al. Nature. 2002;420:520–562. [PubMed] 2. Bejerano G, et al. Science. 2004;304:1321–1325. [PubMed] 3. Katzman S, et al. Science. 2007;317:915. [PubMed] 4. Derti A, Roth FP, Church GM, Wu CT. Nat. Genet. 2006;38:1216–1220. [PubMed] 5. Drake JA, et al. Nat. Genet. 2006;38:223–227. [PubMed] 6. Cooper GM, et al. Genome Res. 2005;15:901–913. [PubMed] 7. Siepel A, et al. Genome Res. 2005;15:1034–1050. [PubMed] 8. de la Calle-Mustienes E, et al. Genome Res. 2005;15:1061–1072. [PubMed] 9. Poulin F, et al. Genomics. 2005;85:774–781. [PubMed] 10. Pennacchio LA, et al. Nature. 2006;444:499–502. [PubMed] 11. Ghanem N, et al. J. Neurosci. 2007;27:5012–5022. [PubMed] 12. Paparidis Z, et al. Dev. Growth Differ. 2007;49:543–553. [PubMed] 13. Prabhakar S, et al. Genome Res. 2006;16:855–863. [PubMed] 14. Hardison RC, Oeltjen J, Miller W. Genome Res. 1997;7:959–966. [PubMed] 15. Ahituv N, et al. PLoS Biol. 2007;5:e234. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||
Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Science. 2004 May 28; 304(5675):1321-5.
[Science. 2004]Science. 2007 Aug 17; 317(5840):915.
[Science. 2007]Nat Genet. 2006 Oct; 38(10):1216-20.
[Nat Genet. 2006]Nat Genet. 2006 Feb; 38(2):223-7.
[Nat Genet. 2006]Genome Res. 2005 Aug; 15(8):1061-72.
[Genome Res. 2005]Dev Growth Differ. 2007 Aug; 49(6):543-53.
[Dev Growth Differ. 2007]Nat Genet. 2006 Oct; 38(10):1216-20.
[Nat Genet. 2006]Genome Res. 2006 Jul; 16(7):855-63.
[Genome Res. 2006]Genome Res. 2005 Jul; 15(7):901-13.
[Genome Res. 2005]Science. 2004 May 28; 304(5675):1321-5.
[Science. 2004]Nature. 2006 Nov 23; 444(7118):499-502.
[Nature. 2006]Science. 2004 May 28; 304(5675):1321-5.
[Science. 2004]Genome Res. 2005 Jul; 15(7):901-13.
[Genome Res. 2005]Genome Res. 2005 Aug; 15(8):1034-50.
[Genome Res. 2005]Genome Res. 1997 Oct; 7(10):959-66.
[Genome Res. 1997]PLoS Biol. 2007 Sep; 5(9):e234.
[PLoS Biol. 2007]Science. 2004 May 28; 304(5675):1321-5.
[Science. 2004]