![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2006, Cold Spring Harbor Laboratory Press L1 integration in a transgenic mouse model 1 Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA 2 Department of Pathology and Laboratory Medicine, Hospital of the University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA 3 Transposagen Biopharmaceuticals, Inc., Philadelphia, Pennsylvania 19104, USA 4Corresponding author. E-mail kazazian/at/mail.med.upenn.edu; fax (215) 573-7760. Received August 17, 2005; Accepted November 9, 2005. This article has been cited by other articles in PMC.Abstract To study integration of the human LINE-1 retrotransposon (L1) in vivo, we developed a transgenic mouse model of L1 retrotransposition that displays de novo somatic L1 insertions at a high frequency, occasionally several insertions per mouse. We mapped 3′ integration sites of 51 insertions by Thermal Asymmetric Interlaced PCR (TAIL–PCR). Analysis of integration locations revealed a broad genomic distribution with a modest preference for intergenic regions. We characterized the complete structures of 33 de novo retrotransposition events. Our results highlight the large number of highly truncated L1s, as over 52% (27/51) of total integrants were <1/3 the length of a full-length element. New integrants carry all structural characteristics typical of genomic L1s, including a number with inversions, deletions, and 5′-end microhomologies to the target DNA sequence. Notably, at least 13% (7/51) of all insertions contain a short stretch of extra nucleotides at their 5′ end, which we postulate result from template-jumping by the L1-encoded reverse transcriptase. We propose a unified model of L1 integration that explains all of the characteristic features of L1 retrotransposition, such as 5′ truncations, inversions, extra nucleotide additions, and 5′ boundary and inversion point microhomologies. The long interspersed nucleotide element-1 (L1) retrotransposon enjoyed tremendous evolutionary success in colonizing eukaryotic genomes (Kazazian Jr. 2004); its roughly 500,000 copies comprise ~17% of human DNA (Lander et al. 2001). The full-length 6-kb L1 encodes a 5′ UTR containing an internal promoter, two proteins—ORF1, a nucleic acid-binding protein with chaperone activity (Hohjoh and Singer 1996, 1997; Martin and Bushman 2001), and ORF2, a protein with endonuclease (EN) and reverse transcriptase (RT) activities (Mathias et al. 1991; Feng et al. 1996), and a 3′ UTR ending with a poly(A) tail (Fig. 1A
Following essentially random integration, endogenous L1s are thought to be lost over time due to strong negative selection leading to their uneven genomic distribution (Boissinot et al. 2001, 2004; Ovchinnikov et al. 2001; Medstrand et al. 2002). This is reflected in their overrepresentation in GC-poor genomic regions (Lander et al. 2001; Ovchinnikov et al. 2001; Boissinot et al. 2004), under-representation within 5 kb of genes (Medstrand et al. 2002), and the selective loss of longer (>500 bp) L1s from recombining regions of the genome (Boissinot et al. 2001). The L1's 5′ end variability and highly repetitive nature make cloning insertions difficult and have prohibited analysis of a large number of de novo integrants. In contrast to several thousands of analyzed integrants for retroviruses and DNA transposons (Bushman et al. 2005; Yant et al. 2005), the total number of completely characterized de novo L1 insertions is under 200. These include 15 known L1 insertions causing human disease (Ostertag and Kazazian Jr. 2001a; Chen et al. 2005) and ~150 integrants characterized in transformed cell lines (Moran et al. 1996; Gilbert et al. 2002, 2005; Symer et al. 2002). Nonetheless, even this small number of insertions revealed several key insights into the mechanism and potential impact of L1 retrotransposition, including the prevalence of microhomologies at the 5′ end of de novo inserts (Symer et al. 2002; Gilbert et al. 2005) and a realization that ~10% of retrotransposition events may produce large deletions of host DNA sequence at the integration site (Gilbert et al. 2002, 2005; Symer et al. 2002). However, as noted previously (Symer et al. 2002), known defects in DNA stability and repair pathways in cultured cells create a concern that some features of retrotransposition observed in cell culture may not be generalizable. Indeed, some integration phenomena common in cultured cells, such as the creation of deletions and large target-site duplications, may occur much less frequently in an organism (Szak et al. 2002; Vincent et al. 2003; Chen et al. 2005; Han et al. 2005; Ho et al. 2005). To better understand L1 integration, a study of de novo L1 insertions in vivo is needed. Unfortunately, previous transgenic mouse models of L1 retrotransposition (Ostertag et al. 2002; Prak et al. 2003) had disappointingly low retrotransposition frequencies, allowing characterization of only a few novel insertions. Here, we describe a transgenic mouse model with a higher rate of retrotransposition, in which a more active human L1 element is tagged with a markerless cassette, allowing for PCR-based detection of shorter insertions. One such transgenic line displayed a high frequency of L1 insertions, occasionally several insertions per mouse. We mapped the genomic locations of 51 retrotransposition events, and completely characterized the structures of 33. Based on their structural features, we propose an L1 integration model that accounts for all common features of L1 retrotransposition. Results L1LRE3-mless transgenic mice are somatic mosaics for de novo retrotransposition events To increase the retrotransposition frequency in transgenic mice, we modified our previous transgene to include a more active L1 (L1LRE3) (Brouha et al. 2002, 2003) tagged with a short markerless cassette, permitting PCR-based genotyping of smaller insertions (>400 bp) (Fig. 1B We screened H27 F1 offspring for the presence of transgenes and insertions using “genotyping PCR” (Fig. 1B
To determine the genomic locations of L1 retrotransposition events, DNA was obtained from tails of transgenic mice and used in TAIL–PCR reactions (Liu and Whittier 1995; Liu et al. 1995) to amplify the L1 3′ end together with its flanking host DNA. TAIL–PCR allowed detection of shorter insertions than would have been possible with positive selection (EGFP or neo retrotransposition cassettes) or genotyping PCR. This is because TAIL–PCR does not require the 5′-truncated elements to contain a spliced intron junction for detection, thus lowering the detection limit to 214 nt. Sequences of amplified products were compared with the mouse genome to obtain chromosomal locations. Using this procedure, we determined the locations of 51 distinct retrotransposition events from 41 different mice. For some mice, two or three distinct insertions were recovered. We have not yet isolated shared insertions from F2 siblings as expected for insertions inherited from parent F1s. Southern blot revealed that individual de novo insertions were present at less than one copy per cell, a result indicative of somatic retrotransposition events (Fig. 2 New L1 retrotransposition events modestly favor intergenic regions In 48 cases, we unambiguously mapped the insertion site. Two of the remaining three elements were in young mouse LINEs and one was present only in an incompletely sequenced clone in the Celera database with no available annotation. The 48 identifiable insertions are distributed widely throughout the genome (Fig. 3A
Approximately 15% (7/48) of insertions landed in RefSeq transcription units (Pruitt et al. 2005) (Table 1), slightly lower than expected for random integration (P = 0.040). A similar modest preference for intergenic regions (P = 0.044) was obtained when insertion coordinates were compared with the “known gene” track of the UCSC genome browser (Karolchik et al. 2003, 2004), and Mammalian Gene Collection Full ORF mRNAs (MGC) (Strausberg et al. 2002) (not statistically significant). Two-thirds of insertions (32/48) were within annotated Genscan genes (Burge and Karlin 1997), a proportion indistinguishable from random integration. Lower exon prediction sensitivity and specificity (0.70) (Rogic et al. 2001) of Genscan gene prediction algorithm likely obscured the slight bias of L1 integration into intergenic regions using this gene annotation.
All insertions into genes were intronic, and there was no significant difference in the frequency of L1 integrants oriented in the same or opposite direction with respect to the gene. We observed no correlation with position around the transcription unit (e.g., transcription start sites) or within introns. Integration sites were also distributed randomly with respect to broad classes of repeats and CpG islands (Table 1). Perhaps with a larger sample size, a modest preference for some of these features may be detected. De novo inserts have characteristic features of L1 retrotransposition To analyze integration-site specificity, we aligned the host DNA from 10 nt upstream to 10 nt downstream of the insertion sites (Fig. 4A
To determine the complete structures of new integrants, multiple series of nested PCRs were performed using sets of oligos specific to genomic DNA flanking the 5′ ends of individual insertions and to the L1LRE3-mless construct. We were able to determine the complete structures of 33 insertions (Fig. 3B Finer analysis revealed stereotypical features of L1 retrotransposition (Table 2; Fig. 3B
De novo L1 insertions were flanked by homopolymeric poly(A) tails of ~50–210 nt (mean 103 ± 38 nt), in agreement with the long (mean 60–90 nt) poly(A)s observed in disease-causing L1 Ta elements (Chen et al. 2005) and in de novo cultured cell insertions (Gilbert et al. 2002; Symer et al. 2002). All but three de novo insertions in our study were flanked by target-site duplications (TSDs) of 2 to 20 nt (mean 13 nt) (Fig. 4C In cultured human cells 16%–21.6% (Gilbert et al. 2002, 2005; Symer et al. 2002) of L1 integrants were accompanied by deletions of host DNA of 1 to > 71,000 bps. In contrast, we did not detect any deletions of host DNA at L1 integration sites. It is possible that large deletions were present among the 18 insertions that we could not fully characterize. However, we would have detected deletions up to 87 bp, because all inner-nested oligos used for recovery of 5′ ends were at least 87 bp 5′ to the putative insertion site (median 232 nt, range 87–570 nt). Extending our primer sets farther into the flanking host DNA (median 381 nt, range 158–759 nt) was similarly unsuccessful. Small target-site deletions (≤87 nt [and ≤158 nts]) were found to accompany 8%–10.8% (Gilbert et al. 2002, 2005; Symer et al. 2002) of insertions in cultured cells. The lack of any deletions among the 33 characterized inserts permits us to estimate that frequency of small deletions in vivo is < 8.7% (P = 0.050), in agreement with the 1.2%–2.2% deletion frequency for genomic L1s (Vincent et al. 2003; Han et al. 2005; Ho et al. 2005) (Supplemental Discussion 2). 5′ boundary and inversion point microhomologies are common in de novo insertions in mice Consistent with the high frequencies of microhomologies (MHs) between the 5′ end of L1 RNA and flanking host DNA in cultured cells (Symer et al. 2002; Gilbert et al. 2005) and in the human and mouse genomes (Martin et al. 2005; Zingler et al. 2005), a majority of noninverted 5′ truncated de novo L1 integrants (excluding elements with extra 5′ nt) contained MHs of at least 1 bp at their 5′ boundaries (8/10, 80%) (Fig. 4D Extra nucleotide formation at 5′ ends of L1 integrants The majority (7/11, 63.6%) of insertions with no MH at the 5′ boundary contained extra nucleotide stretches of 2–47 bp between the flanking host DNA and the 5′-truncated L1 element. While most of these unexpected sequences were short (2–7 nt), making it difficult to ascertain their source, there were two instances of longer additions (26 and 47 nt) that allowed us to deduce their origin. In the first case, an addition of 26 nt occurred in a 238-bp 5′-truncated integrant that inserted at a canonical 5′-TTTT′A-3′ endonuclease cleavage site, terminated in a 90-nt poly(A), and was flanked by 13-bp TSDs. A closer examination of the unknown nucleotides revealed that they were a product of two short successive twin priming reactions by the RT, producing a “dual inverted fragment” (Fig. 5A,B
The longer 47 extra-nucleotide addition was likely created by a series of three successive template jumps by L1 RT in the course of an otherwise unremarkable TPRT reaction (Fig. 5A,C Discussion This work extends studies of L1 in cultured cells (Gilbert et al. 2002, 2005; Symer et al. 2002) by providing a comprehensive analysis of de novo retrotransposition events in vivo. Somatic mosaicism in mouse models of L1 retrotransposition Traditionally, L1 retrotransposition was thought to be restricted to the germ line by DNA methylation (Ostertag and Kazazian Jr. 2001a; Bourc'his and Bestor 2004), with this regulation being impaired in cultured cells (Moran et al. 1996) and some human cancers (Miki et al. 1992). Somatic retrotransposition was recently noted during early embryogenesis in a pPolII/5′UTR/L1RP/EGFP transgenic founder (possibly from an episomal location) (Prak et al. 2003), and in neuronal precursor cells of 5′UTR/L1RP/EGFP transgenic mice (Muotri et al. 2005). We also cannot rule out low-level somatic retrotransposition in our previous pPolII mice (Ostertag et al. 2002), as rare somatic events may have been missed by genotyping PCR. However, because unambiguous Mendelian inheritance of detected insertions in pPolII mice was lost when pHSP70-2, a strong meiotic promoter (Dix et al. 1996), was substituted for pPolII, a strong ubiquitously active promoter (Ostertag et al. 2002), we conclude that this substitution shifted the bulk of retrotransposition from gametogenesis to post-fertilization. Somatic retrotransposition was seen in F2 offspring carrying transgenes integrated in different locations, suggesting that the shift in retrotransposition timing is position independent. Instead, it is likely caused by a lag between the meiotic activation of pHSP70-2 and the retrotransposition event. This hypothesis is supported by several somatic insertions inherited in the absence of transgene; they likely resulted from L1 transcript production during parental meiosis, followed by germ-line transmission of the L1 RNPs and retrotransposition in early development. Transgene-independent inheritance of insertions argues against retrotransposition in adult somatic tissues, although we cannot rule out late somatic retrotransposition for all insertions. More studies are needed to determine whether somatic retrotransposition of a human L1 in transgenic mice accurately reflects the ability of native LINEs to retrotranspose in somatic cells of their normal hosts, or whether it is a result of impaired silencing of human L1s in mouse cells. Notably, differences in retrotransposition timing observed for pPolII (Ostertag et al. 2002) and pHSP70-2 could be harnessed for germ-line and somatic random mutagenesis applications. A similar strategy was used successfully with a DNA transposon (Dupuy et al. 2001, 2005; Collier et al. 2005). New L1 retrotransposition events modestly favor intergenic regions The preference for insertion into intergenic regions is consistent with the underrepresentation of endogenous L1s in genes (13%–17% gene hit rates) (Medstrand et al. 2002; Szak et al. 2002; Boissinot et al. 2004), but is discordant with higher gene hit rates in cultured cells (~50%) (Symer et al. 2002). Differences in gene hit rates between our study and that of Symer et al. (2002) can be explained by the use of different gene annotations and retrotransposition assays. Importantly, Symer et al. (2002) used a modified neo cassette (Freeman et al. 1994) to confer G418 resistance on L1 integrants. In contrast, our assay does not depend on marker expression. In fact, some of observed preference for intergenic regions may be due to selective loss of cells containing L1 insertions into essential genes. While it is highly unlikely that any heterozygous somatic mutation would result in animal lethality, somatic mosaicism creates a possibility for a selective growth disadvantage in the affected cells. The potential for selective cell loss is likely greater in our study due to the gene-trapping action of our retrotransposition cassette. Nevertheless, the tremendous sensitivity of TAIL–PCR probably compensates for most selection effects. If this is correct, the small bias against integration into genes may reflect possible differences in the distribution of L1 endonuclease target sites or potential interactions with transcriptional machinery or chromatin modifications. High-throughput studies of insertion sites of unbiased de novo integrants are needed to confirm our observations and will be particularly important if L1 is to be used in gene therapy and random mutagenesis. L1 RT template jumping creates extra 5′ nucleotides and 5′-end microhomologies A highly informative 47-nt addition (Fig. 5A,C Template jumping by the R2 RT is facilitated by the occasional addition by the RT of a few untemplated nucleotides to the 3′ end of the donor's cDNA, one or more of which may be able to anneal at the 3′ end of the acceptor template (Bibillo and Eickbush 2004). An analogous property of L1 RT may be reflected in the frequent 5′MHs observed in our noninverted 5′-truncated integrants, as well as in those found in the human and mouse genomes and in cultured cells (Symer et al. 2002; Gilbert et al. 2005; Martin et al. 2005; Zingler et al. 2005). Likely, as seen in the 26 and 47 extra nucleotide additions in our study, a template jump by the L1 RT can be facilitated by pairing of a few bases between the terminal end of the nascent cDNA and bases on the acceptor template, producing an apparent 5′MH in the final insertion. Such annealing could be possible if the terminal bases of the nascent cDNA became exposed for binding either due to the transient melting of a few 5′ base pairs in the cDNA/RNA duplex, or due to the continued degradation of the RNA molecule within the duplex. This mechanism provides an alternative to previous models of 5′MH creation, which invoked MHs in 5′ truncation and priming of the second-strand synthesis (Voliva et al. 1984; Symer et al. 2002; Martin et al. 2005; Zingler et al. 2005) and guidance of second-strand cleavage (Symer et al. 2002). Model of 5′-end attachment and L1 integration Based on the characteristics of L1 insertions described here and elsewhere, we propose an L1 integration model that accounts for all common features of L1 retrotransposition (Fig. 5D One consequence of this model is that TSDs flanking the insertion may be shorter than would have been predicted by the true second-strand cleavage site. This is because template jumping by the L1 RT may occur to internal regions of the host DNA overhang, as seen in our data and described for R2 RT. Such internal jumps will produce a shorter TSD, implying a smaller distance between the two cleavage sites and contributing to observed variability in TSD lengths. Another implication is the likely mechanism of 5′ truncations, previously hypothesized to result from MH-mediated initiation of second-strand synthesis (Voliva et al. 1984), poor processivity of the L1 RT, or degradation of L1 RNA by cellular RNase H (Ostertag and Kazazian Jr. 2001a). Biochemical studies of the R2 RT demonstrate that template jumps readily occur from the terminal 5′ nucleotide of the R2 RNA, but are extremely inefficient from its internal positions (Bibillo and Eickbush 2004). If L1 RT behaves in a similar manner, its template jumps must be occurring from the RNA's 5′ terminal nucleotide, implying prior 5′-end degradation of ~70% of L1 transcripts. The bimodal distribution of insertion lengths may thus reflect two different populations of L1 RNA in the host cell—the majority, which are quickly degraded by cellular RNases and give rise to the 5′-truncated L1 insertions, and the minority, which evade degradation and produce FL insertions. Altered susceptibility to degradation may be caused by the protective action of ORF1 protein or of the m7G-cap, loss of which may allow for the rapid degradation of the transcript's 5′ end. While decapping of most cellular mRNAs requires prior shortening of the poly(A) tail (for review, see Coller and Parker 2004), we and others observed no difference between poly(A) lengths of 5′-truncated and FL integrants. This may imply that shortened L1 poly(A) tails are extended by the RT during TPRT initiation (Luan and Eickbush 1995; Eickbush et al. 2000; Cost et al. 2002) or are subject to additional polyadenylation after re-entry of L1 RNA into the nucleus. A lower 5′MH frequency in FL elements (Zingler et al. 2005) may be explained by an altered position of the template jump by the L1 RT when the donor RNA template is full length. Biochemical studies of R2 demonstrated that m7G-capping of the donor RNA template reduced template jumps to internal locations on the acceptor template's 3′ end (Bibillo and Eickbush 2004), with most jumps occurring to the 3′-most terminal nucleotide. Analogous behavior of the L1 RT will result in less 5′MH formation for the likely m7G-capped FL transcripts, and, together with a reduced probability of a random match between the G-rich L1 5′ end and an AT-rich target site, will explain the lower 5′MH frequencies in FL insertions. Inversions, characteristic of L1 integration, can be explained by a special case of the model, in which second strand synthesis is initiated prior to the completion of the first strand. Premature initiation will give a twin priming reaction, in which the host's top strand overhang is used as a primer and an upstream portion of L1 RNA—as a template for the inverted fragment; this priming in turn creates MHs commonly seen at the 5′ boundary of inverted integrants (Ostertag and Kazazian Jr. 2001b). An upstream twin priming reaction will prevent the first RT from reaching the 5′ end of the template RNA, effectively eliminating its template jumping ability. As the twin priming RT reaches the terminal nucleotide of the RNA template, it will attempt template jumping onto the single-stranded overhang formed by the direct segment of L1 cDNA. This situation is identical to the standard L1 integration (Fig. 5D After the attachment of the 5′ end of L1 cDNA, synthesis of the second strand is probably completed by the second RT molecule. In cases where first-strand attachment was initiated with an RT jumping to an internal nucleotide of the host DNA overhang, a host 3′→5′ exonuclease activity would likely be needed to trim the short protruding 3′ end of the host DNA prior to initiation of second-strand synthesis. Alternatively, L1 ORF2 may itself possess 3′ exonuclease activity, analogous to the 3′ exonuclease activity of a related apurinic/apyrimidinic endonuclease (APE) (Chou and Cheng 2002). Perhaps this 3′ exonuclease activity is related to the second nuclease activity of L1 ORF2 protein observed in vitro (Cost et al. 2002). A 3′ exonuclease activity was also observed for the sequence-specific R2 EN (Bibillo and Eickbush 2002b). Clearly, much future biochemical work is needed to provide direct evidence for this model of L1 5′-end integration. Methods Transgenic mice, PCR primers, genotyping PCRs, Southern blotting, statistical analysis, and analysis of microhomologies are described in the Supplemental material. Transgene construction The markerless (mless) retrotransposition cassette was constructed by flanking the human γ-globin intron, obtained from the EGFP cassette (Ostertag et al. 2000) by splice acceptors from the human BCL2 gene, obtained from the pRET.IL vector (Ishida and Leder 1999), a kind gift of Dr. P. Leder. Mless was cloned into the pJCC5-L1LRE3 (Ostertag et al. 2000; Brouha et al. 2002) to create pBS-L1LRE3-mless. The L1LRE3-mless was then cloned into pCEP-Hsp70-2, a vector similar to pCEP4 (Invitrogen), but containing the mouse heat-shock protein 70-2 promoter (pHSP70-2, cloned from the mouse genome) (Dix et al. 1996) instead of the CMV promoter to create the transgene used in this study. Detailed cloning strategy is available upon request. Characterization of insertions and transgenes Genomic locations of L1LRE3-mless insertions and transgenes were identified by TAIL–PCR essentially as described (Liu and Whittier 1995; Liu et al. 1995). Long specific primers were positioned in the SV40 poly(A)-signal sequence, allowing 3′ flank amplification. A third reamplification reaction was added, and all three reamplification reactions were carried out as TAIL cycles. The products of TAIL–PCR reactions were resolved on a 1% agarose gel and visualized by EtBr staining. Single bands were gel-purified with GeneClean Spin Kit (Q-BIOgene), and either sequenced directly using AD primers (UPenn DNA Sequencing Facility) or cloned using TopoTA Cloning Kit for Sequencing (Invitrogen) and then sequenced using M13Forward and/or M13Reverse standard primers. To characterize the 5′ ends, multiple series of nested PCRs were performed for each insertion using nested genomic primers positioned between 87 and ~1500 bp upstream of the insertion location, and several sets of nested primers complementary to the insertion sequence. Individual PCRs were carried out according to the manufacturers' protocols. To assay shorter insertions, PCRs were carried out with Taq DNA Polymerase (Promega) using 160 ng of genomic DNA in a 20-μL reaction volume and 3–3.5-min extension times. To characterize longer insertions, high-processivity Phusion Polymerase (MJResearch) was used with 3–5-min extension times. The PCR products from the primary reactions were diluted 1:100 and used in the nested PCRs. The products were gel-purified with Qiaquick Gel Extraction Kit (Qiagen), cloned using TopoTA Cloning Kit for Sequencing (Invitrogen) (Taq PCR products) or Zero Blunt Topo Kit (Invitrogen) (Phusion PCR products), and sequenced using M13Forward and/or M13Reverse standard primers. Informatics Genomic locations of insertions were found using UCSC Genome Browser (http://genome.ucsc.edu/) (Kent et al. 2002; Karolchik et al. 2003), NCBI BLAST server, or Celera Discovery System by searching for the 3′ flanks obtained by TAIL–PCR. For inserts with unique genomic coordinates (48/51), neighboring genomic structures were surveyed against the mm6 assembly of the mouse genome in the UCSC Table Browser (Karolchik et al. 2004) using custom Perl scripts (available upon request). For genome composition calculations, nonoverlapping sequences belonging to a genomic feature were added for each chromosome using UCSC Table Browser annotation. These were weighted 2, 2, 3/2, and 1/2 (autosomes, Un_random, X, and Y chromosomes, respectively) and added to obtain the total content for a genomic feature; genome proportion was obtained by dividing the total content by mouse mm6 genome size (autosomes, Un_random, X, and Y chromosomes weighted 2, 2, 3/2, and 1/2, respectively). Insert structures were analyzed by comparing the filled and empty site sequences, and by comparing sequences of new and expected FL L1-mless integrants. Acknowledgments We thank Dr. Robert F. Whittier for helpful suggestions on optimizing TAIL–PCR, Melissa Vetter for technical assistance, the Transgenic and Sequencing Cores of the University of Pennsylvania for generation of transgenic mouse lines and for DNA sequencing services, and members of the Kazazian lab, especially Maria del Carmen Seleme, for critical discussions, and John L. Goodier for critical review of the manuscript. D.V.B. was supported by the predoctoral Genetics Training Grant, H.H.K. and E.M.O. were supported by grants from the US National Institutes of Health. Notes Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4571606. Footnotes [Supplemental material is available online at www.genome.org.] References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Science. 2004 Mar 12; 303(5664):1626-32.
[Science. 2004]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]EMBO J. 1996 Feb 1; 15(3):630-9.
[EMBO J. 1996]EMBO J. 1997 Oct 1; 16(19):6034-43.
[EMBO J. 1997]Mol Cell Biol. 2001 Jan; 21(2):467-75.
[Mol Cell Biol. 2001]Mol Biol Evol. 2001 Jun; 18(6):926-35.
[Mol Biol Evol. 2001]Genome Res. 2004 Jul; 14(7):1221-31.
[Genome Res. 2004]Genome Res. 2001 Dec; 11(12):2050-8.
[Genome Res. 2001]Genome Res. 2002 Oct; 12(10):1483-95.
[Genome Res. 2002]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Nat Rev Microbiol. 2005 Nov; 3(11):848-58.
[Nat Rev Microbiol. 2005]Mol Cell Biol. 2005 Mar; 25(6):2085-94.
[Mol Cell Biol. 2005]Annu Rev Genet. 2001; 35():501-38.
[Annu Rev Genet. 2001]Hum Genet. 2005 Sep; 117(5):411-27.
[Hum Genet. 2005]Cell. 1996 Nov 29; 87(5):917-27.
[Cell. 1996]Nat Genet. 2002 Dec; 32(4):655-60.
[Nat Genet. 2002]Proc Natl Acad Sci U S A. 2003 Feb 18; 100(4):1832-7.
[Proc Natl Acad Sci U S A. 2003]Am J Hum Genet. 2002 Aug; 71(2):327-36.
[Am J Hum Genet. 2002]Proc Natl Acad Sci U S A. 2003 Apr 29; 100(9):5280-5.
[Proc Natl Acad Sci U S A. 2003]Dev Biol. 1996 Mar 15; 174(2):310-21.
[Dev Biol. 1996]Genomics. 1995 Feb 10; 25(3):674-81.
[Genomics. 1995]Plant J. 1995 Sep; 8(3):457-63.
[Plant J. 1995]Genome Res. 2004 Jul; 14(7):1221-31.
[Genome Res. 2004]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D501-4.
[Nucleic Acids Res. 2005]Nucleic Acids Res. 2003 Jan 1; 31(1):51-4.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D493-6.
[Nucleic Acids Res. 2004]Proc Natl Acad Sci U S A. 2002 Dec 24; 99(26):16899-903.
[Proc Natl Acad Sci U S A. 2002]J Mol Biol. 1997 Apr 25; 268(1):78-94.
[J Mol Biol. 1997]Proc Natl Acad Sci U S A. 1997 Mar 4; 94(5):1872-7.
[Proc Natl Acad Sci U S A. 1997]Nat Genet. 2002 Jun; 31(2):159-65.
[Nat Genet. 2002]Am J Hum Genet. 2002 Aug; 71(2):312-26.
[Am J Hum Genet. 2002]Gene. 2002 Oct 30; 300(1-2):189-94.
[Gene. 2002]Genome Biol. 2002 Sep 19; 3(10):research0052.
[Genome Biol. 2002]Genome Res. 2004 Jul; 14(7):1221-31.
[Genome Res. 2004]Mol Biol Evol. 2000 Jun; 17(6):915-28.
[Mol Biol Evol. 2000]Cell. 2002 Aug 9; 110(3):315-25.
[Cell. 2002]Mol Cell Biol. 2005 Sep; 25(17):7780-95.
[Mol Cell Biol. 2005]Genome Biol. 2002 Sep 19; 3(10):research0052.
[Genome Biol. 2002]Gene. 2002 Oct 30; 300(1-2):189-94.
[Gene. 2002]Mol Biol Evol. 2000 Jun; 17(6):915-28.
[Mol Biol Evol. 2000]Hum Genet. 2005 Sep; 117(5):411-27.
[Hum Genet. 2005]Cell. 2002 Aug 9; 110(3):315-25.
[Cell. 2002]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Proc Natl Acad Sci U S A. 1997 Mar 4; 94(5):1872-7.
[Proc Natl Acad Sci U S A. 1997]Am J Hum Genet. 2002 Aug; 71(2):312-26.
[Am J Hum Genet. 2002]Cell. 2002 Aug 9; 110(3):315-25.
[Cell. 2002]Mol Cell Biol. 2005 Sep; 25(17):7780-95.
[Mol Cell Biol. 2005]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Mol Biol Evol. 2003 Aug; 20(8):1338-48.
[Mol Biol Evol. 2003]Nucleic Acids Res. 2005; 33(13):4040-52.
[Nucleic Acids Res. 2005]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Mol Cell Biol. 2005 Sep; 25(17):7780-95.
[Mol Cell Biol. 2005]Cytogenet Genome Res. 2005; 110(1-4):223-8.
[Cytogenet Genome Res. 2005]Genome Res. 2005 Jun; 15(6):780-9.
[Genome Res. 2005]Genome Res. 2001 Dec; 11(12):2059-65.
[Genome Res. 2001]Mol Cell Biol. 1995 Jul; 15(7):3882-91.
[Mol Cell Biol. 1995]Mol Cell Biol. 2000 Jan; 20(1):213-23.
[Mol Cell Biol. 2000]EMBO J. 2002 Nov 1; 21(21):5899-910.
[EMBO J. 2002]Cell. 2002 Aug 9; 110(3):315-25.
[Cell. 2002]Mol Cell Biol. 2005 Sep; 25(17):7780-95.
[Mol Cell Biol. 2005]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Annu Rev Genet. 2001; 35():501-38.
[Annu Rev Genet. 2001]Nature. 2004 Sep 2; 431(7004):96-9.
[Nature. 2004]Cell. 1996 Nov 29; 87(5):917-27.
[Cell. 1996]Cancer Res. 1992 Feb 1; 52(3):643-5.
[Cancer Res. 1992]Proc Natl Acad Sci U S A. 2003 Feb 18; 100(4):1832-7.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2002 Oct; 12(10):1483-95.
[Genome Res. 2002]Genome Biol. 2002 Sep 19; 3(10):research0052.
[Genome Biol. 2002]Genome Res. 2004 Jul; 14(7):1221-31.
[Genome Res. 2004]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Science. 1991 Dec 20; 254(5039):1808-10.
[Science. 1991]J Mol Biol. 2002 Feb 22; 316(3):459-73.
[J Mol Biol. 2002]J Biol Chem. 2004 Apr 9; 279(15):14945-53.
[J Biol Chem. 2004]Genetics. 1996 Mar; 142(3):853-63.
[Genetics. 1996]Mol Biol Evol. 1999 Apr; 16(4):502-11.
[Mol Biol Evol. 1999]J Biol Chem. 2004 Apr 9; 279(15):14945-53.
[J Biol Chem. 2004]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Mol Cell Biol. 2005 Sep; 25(17):7780-95.
[Mol Cell Biol. 2005]Cytogenet Genome Res. 2005; 110(1-4):223-8.
[Cytogenet Genome Res. 2005]Genome Res. 2005 Jun; 15(6):780-9.
[Genome Res. 2005]J Biol Chem. 2004 Apr 9; 279(15):14945-53.
[J Biol Chem. 2004]Mutat Res. 2001 Jun 2; 477(1-2):131-53.
[Mutat Res. 2001]Nucleic Acids Res. 2004; 32(4):1555-65.
[Nucleic Acids Res. 2004]Cell. 2002 Aug 9; 110(3):315-25.
[Cell. 2002]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]J Mol Biol. 1984 Oct 5; 178(4):795-813.
[J Mol Biol. 1984]Annu Rev Genet. 2001; 35():501-38.
[Annu Rev Genet. 2001]J Biol Chem. 2004 Apr 9; 279(15):14945-53.
[J Biol Chem. 2004]Annu Rev Biochem. 2004; 73():861-90.
[Annu Rev Biochem. 2004]Mol Cell Biol. 1995 Jul; 15(7):3882-91.
[Mol Cell Biol. 1995]Genome Res. 2005 Jun; 15(6):780-9.
[Genome Res. 2005]J Biol Chem. 2004 Apr 9; 279(15):14945-53.
[J Biol Chem. 2004]Genome Res. 2001 Dec; 11(12):2059-65.
[Genome Res. 2001]Nature. 2002 Feb 7; 415(6872):655-9.
[Nature. 2002]EMBO J. 2002 Nov 1; 21(21):5899-910.
[EMBO J. 2002]J Mol Biol. 2002 Feb 22; 316(3):459-73.
[J Mol Biol. 2002]Nucleic Acids Res. 2000 Mar 15; 28(6):1418-23.
[Nucleic Acids Res. 2000]Nucleic Acids Res. 1999 Dec 15; 27(24):e35.
[Nucleic Acids Res. 1999]Am J Hum Genet. 2002 Aug; 71(2):327-36.
[Am J Hum Genet. 2002]Dev Biol. 1996 Mar 15; 174(2):310-21.
[Dev Biol. 1996]Genomics. 1995 Feb 10; 25(3):674-81.
[Genomics. 1995]Plant J. 1995 Sep; 8(3):457-63.
[Plant J. 1995]Genome Res. 2002 Jun; 12(6):996-1006.
[Genome Res. 2002]Nucleic Acids Res. 2003 Jan 1; 31(1):51-4.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D493-6.
[Nucleic Acids Res. 2004]Cell. 2002 Aug 9; 110(3):327-38.
[Cell. 2002]Mol Cell Biol. 1985 Oct; 5(10):2599-607.
[Mol Cell Biol. 1985]