![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright Newman et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. An Abundant Evolutionarily Conserved CSB-PiggyBac Fusion Protein Expressed in Cockayne Syndrome 1Department of Biochemistry, School of Medicine, University of Washington, Seattle, Washington, United States of America 2Fox Chase Cancer Center, Philadelphia, Pennsylvania, United States of America Dmitri A. Petrov, Editor Stanford University, United States of America * E-mail: amweiner/at/u.washington.edu Conceived and designed the experiments: JN AB AW. Performed the experiments: JN AB TP HF. Analyzed the data: JN AB TP HF AW. Contributed reagents/materials/analysis tools: JN AB. Wrote the paper: JN AW. Received September 30, 2007; Accepted February 11, 2008. This article has been cited by other articles in PMC.Abstract Cockayne syndrome (CS) is a devastating progeria most often caused by mutations in the CSB gene encoding a SWI/SNF family chromatin remodeling protein. Although all CSB mutations that cause CS are recessive, the complete absence of CSB protein does not cause CS. In addition, most CSB mutations are located beyond exon 5 and are thought to generate only C-terminally truncated protein fragments. We now show that a domesticated PiggyBac-like transposon PGBD3, residing within intron 5 of the CSB gene, functions as an alternative 3′ terminal exon. The alternatively spliced mRNA encodes a novel chimeric protein in which CSB exons 1–5 are joined in frame to the PiggyBac transposase. The resulting CSB-transposase fusion protein is as abundant as CSB protein itself in a variety of human cell lines, and continues to be expressed by primary CS cells in which functional CSB is lost due to mutations beyond exon 5. The CSB-transposase fusion protein has been highly conserved for at least 43 Myr since the divergence of humans and marmoset, and appears to be subject to selective pressure. The human genome contains over 600 nonautonomous PGBD3-related MER85 elements that were dispersed when the PGBD3 transposase was last active at least 37 Mya. Many of these MER85 elements are associated with genes which are involved in neuronal development, and are known to be regulated by CSB. We speculate that the CSB-transposase fusion protein has been conserved for host antitransposon defense, or to modulate gene regulation by MER85 elements, but may cause CS in the absence of functional CSB protein. Author Summary For reasons that are still unclear, genetic defects in DNA repair can cause diseases that resemble aspects of premature ageing (“segmental progerias”). Cockayne syndrome (CS) is a particularly devastating progeria most commonly caused by mutations in the CSB chromatin remodeling gene. About 43 million years ago, before humans diverged from marmosets, one of the last PiggyBac transposable elements to invade the human lineage landed within intron 5 of the 21 exon CSB gene. As a result, the CSB locus now encodes two equally abundant proteins generated by alternative mRNA splicing: the original full length CSB protein, and a novel CSB-PiggyBac fusion protein in which the N-terminus of CSB is fused to the complete PiggyBac transposase. Conservation of the CSB-PiggyBac fusion protein since marmoset suggests that it is normally beneficial, demonstrating once again that “selfish” transposable elements can be exploited or “domesticated” by the host. More importantly, almost all CSB mutations that cause CS continue to make the CSB-PiggyBac fusion protein, whereas a mutation that compromises both does not cause CS. Thus the fusion protein which is beneficial in the presence of functional CSB may be harmful in its absence. This may help clarify the cause of CS and other progerias. Introduction The human genome is replete with interlopers — transposable DNA elements, retrotransposable RNA elements such as SINEs and LINEs, and a dizzying variety of lesser-known elements — which together account for as much as half of our DNA [1]. Although much of this “junk” DNA is selfish and surprisingly harmless, the constant turnover of these elements is an important source of insertional mutagenesis with benign [2] and malign [3] consequences. Indeed, eukaryotes often recruit mobile elements to perform critical functions — a process known as domestication or exaptation [4]. For example, the RAG1 recombinase, which diversifies the adaptive immune response in mammals, was domesticated aeons ago from a Transib-family transposase [5]. A similarly domesticated DNA transposon is responsible for the programmed genomic rearrangements found in many ciliates [6], and a pogo-like transposase gave rise to the centromeric CEN-P protein family [7] which mediates host genome surveillance for retrotransposons in Schizosaccharomyces pombe [8]. More recently in the primate lineage, a mariner-like transposase was fused to a SET histone methyltransferase domain by de novo exonization; the fusion protein retains the ancestral DNA binding activity of the transposase, and may function as a transcriptional regulator at dispersed mariner-like repeat elements [9]. Here we report identification of an evolutionarily conserved PiggyBac transposase fusion protein that may play a critical, and previously unsuspected, role in a well-studied human disease, Cockayne syndrome (CS). PiggyBac elements, first characterized in the cabbage looper moth Trichoplusia ni [10],[11], have now been identified in a variety of eukaryotes from protozoa [12] to primates [1]. A typical PiggyBac element contains a 1.8 kb ORF encoding a 68 kDa transposase; the boundaries of the element are defined by 13–15 nt terminal inverted repeats, which are in turn flanked by a duplication of the target site TTAA [13]. The T. ni PiggyBac transposon is a useful tool for germline manipulation because it is active in a wide range of species including mammals [14] and has been considered as a possible gene therapy vector [15]. The five PiggyBac elements in the human genome (PGBD1-5) are variously conserved among vertebrates; PGBD5 dates to before the teleost/tetrapod split, whereas PGBD3 and PGBD4 are restricted to primates [1],[13]. CS is a devastating inherited progeria characterized by severe post-natal growth failure and progressive neurological dysfunction [16]. Most cases of CS reflect mutations in the Cockayne syndrome Group B (CSB, also known as ERCC6) gene, a SWI/SNF-like DNA-dependent ATPase [17]–[19] that can wind DNA [20] and remodel chromatin in vitro [21]; the remaining cases of CS are caused by mutations in the CSA gene, and by rare alleles of the xeroderma pigmentosum genes XPB, XPD, and XPG [22]. All of these factors were originally identified as being involved in the transcription-coupled repair of UV-induced DNA damage [23],[24]. While searching for an activity that could better explain the CS phenotype, we found that CSB has a general chromatin remodeling function [25] which could account for the pleiotropic effects of CSB mutations and the characteristic wasting of CS [26]. Alternatively, CS may be caused by defects in transcription initiation [27],[28], or by a partial failure to repair oxidative DNA damage. CSB is known to enhance repair of 8-hydroxyguanine lesions [29], and mice doubly mutant for CSB and the 8-hydroxyguanine glycosylase OGG1 are severely deficient in global repair of endogenous oxidative DNA damage [30]. Similarly, complete inactivation of nucleotide excision repair (NER) in mice doubly mutant for CSB and XPA mimics CS and suppresses the somatotroph axis [31],[32]. As yet unexplained, however, is why complete absence of CSB does not cause CS, although all CS mutations are recessive [33]–[35]. Here we show that the PiggyBac transposable element PGBD3 embedded within intron 5 of the CSB gene functions as an alternative 3′ terminal exon (“exon trap”); as a result, alternative splicing of the CSB primary transcript generates two mRNAs, one encoding all 21 exons of the CSB protein, and the other an equally abundant CSB-related protein in which the first 5 exons of CSB are fused to the PGBD3 transposase. Sequence comparisons of PGBD3 with PiggyBac pseudogenes in humans and other primates suggest that PGBD3 was domesticated soon after it transposed into the CSB gene. Indeed, conservation of the alternatively spliced PGBD3 element in the CSB genes of chimpanzee, orangutan, Rhesus macaque and marmoset over at least 43 Myr of evolution [36], together with a preponderance of synonymous mutations, strongly suggest that the fusion protein has been selected for an advantageous function in its primate host. We speculate that the CSB-transposase fusion protein originally played a role in host genome defense by repressing transposition of autonomous PGBD3 elements and the hundreds of nonautonomous PGBD3-dependent MER85 elements derived from them. We also find an association of MER85 elements with a subset of CSB-regulated genes and genes involved in neuronal development, suggesting that the fusion protein may later have acquired the ability to modulate gene regulatory networks. Finally, we show that the CSB-transposase fusion protein continues to be expressed in CS primary cells lacking functional CSB protein, implying that the fusion protein could contribute to the CS phenotype, or even transform the mild UV sensitivity caused by complete loss of CSB-related proteins [33] into a true progeria. Results CSB-PGBD3 Fusion Transcript Is a Major Product of the CSB Gene Intron 5 of the human CSB gene is host to a PiggyBac transposable element known as PGBD3 (Figure 1A
We were able to detect the predicted CSB-PGBD3 fusion transcripts by quantitative, real-time RT-PCR (Q-RT-PCR) using HeLa mRNA as template, forward primers for the 3′ half of CSB exon 5 which is shared by the CSB and predicted fusion mRNAs, and reverse primers which are specific for either CSB exon 6 or the PGBD3 element ( 2A
Using a subset of these primer combinations, we also detected CSB-PGBD3 fusion transcripts in three other cell lines: hTERT-immortalized WI38 normal lung fibroblasts, and hTERT-immortalized CS1AN CSB fibroblasts rescued with CSB-wt cDNA (CSB-wt line) or mock-rescued with enhanced green fluorescent protein (CSB-null line) [25]. In all four lines, the fusion transcripts were more abundant than the CSB transcripts — as much as 13- to 26-fold more abundant in the immortalized WI38 line (Figure S2). The CSB-PGBD3 fusion transcript, apparently initiating at or near the normal CSB start site, appears to be the only major alternatively spliced transcript expressed from the CSB/PGBD3 gene. First, the transposase coding region is not an alternative exon within full-length CSB mRNA, because combinations of two upstream primers from the PiggyBac element and four downstream primers located in CSB exons 6, 7, 8 and 9 failed to produce RT-PCR products in any of the four cell lines tested (data not shown). Second, the CSB and CSB-PGBD3 transcripts lacking exon 5 appear to be scarce (Figure 2 Detection of the CSB-PGBD3 Fusion Protein Consistent with the Q-RT-PCR data, we detected the CSB-PGBD3 fusion protein in four different cell lines (HT1080, WI38/hTERT, CSB-null and CSB-wt) by Western blotting with antibodies specific for the N- and C-termini of CSB protein. The C-terminal antibody revealed one major band of the size expected for intact CSB protein (Figure 3A
To confirm the identity of the CSB-PGBD3 fusion band as visualized with the N-terminal CSB antibody (Figure 3
The CSB-Transposase Fusion Protein is Expressed in Primary CS Cells A tabulation of all reported CS cases with known mutations in CSB reveals that 21 of 24 retain at least one allele that should allow continued expression of the CSB-transposase fusion protein (Table S1). To confirm that CS cells express the fusion protein in the absence of intact CSB, as seen for the hTERT-immortalized CS1AN line (Figure 3B
The CSB-PGBD3 Fusion Protein Sequence and Splice Sites are Conserved in Primates We were able to identify clear chimpanzee (Pan troglodytes) and Rhesus macaque (Macaca mulatta) homologs of PGBD3 and all four of the pseudogenes by BLASTing the PGBD3 coding region against the recently-completed chimpanzee [40] and Rhesus [41] genomes. We also identified homologs of CSB and PGBD3 in early assemblies of the orangutan (Pongo abelli) and white tufted-ear marmoset (Callithrix jacchus) genomes (Figure 6
A complementary method to estimate the degree to which a protein coding sequence is under purifying selection is to calculate the ratio of nonsynonymous (Ka, residue-altering) to synonymous (Ks, silent) nucleotide substitution rates; a low ratio suggests that the amino acid sequence is under strong purifying selection. We analyzed CSB and PGBD3 coding sequences from human, chimp, orangutan, Rhesus and marmoset with the SNAP program [42], which implements the Ka/Ks algorithm of Nei et al. [43]. For comparison, the decayed PGBD3 pseudogenes PGBD3P1 and PGBD3P3 have mean Ka/Ks values of 0.73 and 0.96, respectively, for pairwise comparisons between the various primate species (Table 2). CSB, presumably under purifying selection, has a mean Ka/Ks value of 0.21 (P<0.0001 vs. both P1 and P3). The mean Ka/Ks for PGBD3 is 0.12 (P<0.0001 vs. both P1 and P3), consistent with the transposase being subject to purifying selection at least as strong as that for CSB. In fact, the mean Ka/Ks of PGBD3 is significantly lower than that of CSB (P = 0.0006), though the difference between the entire fusion protein and CSB is not significant (P = 0.12).
We did not find a CSB-PGBD3 homolog in the draft genome assemblies of two more distantly-related primates of the Strepsirrhini family: galago (Otolemur garnetti) and mouse lemur (Microcebus murinus), though the former may offer insights into the emergence of PGBD3. The mouse lemur genome contained no recognizable PGBD3 or MER85 elements. However, we found dozens of examples of each in galago although the two species diverged from a common ancestor only after the Strepsirrhini lineage separated from that of humans and marmosets (Figure 6 = 0.61 by paired Student's T-test, see Table S2), consistent with divergence from a closely related ancestor. Together, these data suggest that an element closely related to the ancestral human PGBD3 independently invaded the galago and human-marmoset lineages. Though invasion of the common galago-human ancestor by ancestral PGBD3 would also explain the PGBD3-like sequences in galago, the monophyly of Strepsirrhini is well accepted [44] and it is unlikely that all traces of PGBD3 and MER85 would have been eradicated from the mouse lemur genome given their abundance in all genomes in which they are found. We conclude that an ancestral PGBD3 element invaded CSB intron 5 at least 43 Mya, before human and marmoset diverged [36]; PGBD3 was then conserved in the human-marmoset lineage because the CSB-PGBD3 fusion protein performs a selectable function (see Discussion) whereas the elements ultimately degenerated in galago where the random transpositions were either neutral or harmful.PiggyBac Survives as a Natural “Exon Trap” The PiggyBac element has the hallmarks of a transposable element that has survived through evolution by functioning as a natural “exon trap”. In both cabbage looper moth and primates, the transposase ORF is flanked immediately upstream by a potential 3′ splice site (TTTTCTTGTTATAG in moth PiggyBac, CCTTTTTTCCGTTTTAG in PGBD3) and immediately downstream by a potential polyadenylation signal (AATAAATAAATAAA in moth PiggyBac, AATAAA in PGBD3). This 3′ splice site is perfectly conserved between human, chimpanzee, Rhesus, orangutan and marmoset (Figure S4), and in all five species PGBD3 possesses a potential polyadenylation signal (Figure S5) despite evidence for strong selection against transcription of intragenic transposable elements [45]. Insertion of an element with these features into a host intron can generate an N-terminal fusion protein as observed for the PGBD3 insertion into CSB intron 5 (Figure 1 MER Elements and the Domestication of PGBD3 In contrast to PGBD3, the four PGBD3-related pseudogenes are all in an advanced state of decay (88–90% identity to PGBD3; see Figure S6). None of the pseudogenes contains an ORF longer than 62 codons and three exhibit major deletions or rearrangements. All are more closely related to PGBD3 than to any of the other pseudogenes (Table S3), suggesting that all diverged from PGBD3 itself or from a closely related common ancestor before the divergence of the human and Rhesus lineages. The left and right ends of PGBD3 correspond to the left (100 nt) and right (40 nt) halves of the 140 nt MER85 repeat element [47], an arrangement also found in the four human PGBD3 pseudogenes. We found 613 examples of MER85 elements in the human genome; in almost all cases, these were either intact left ends (403), intact right ends (119) or complete 140 nt elements (73). MER85 has been described as a nonautonomous transposable element derived from PiggyBac and presumably mobilized in trans by the PiggyBac transposase [1]; many other transposons have given rise to similar nonautonomous elements known collectively as “miniature inverted repeat transposable elements” or MITEs [12]. The similarly abundant MER75 and MER75B elements appear to be derived from PGBD4, although the PGBD4 transposase exon is no longer neatly flanked by its derivative elements as PGBD3 is by MER85. Consistent with previous estimates [48], neither MER75B nor MER85 has been significantly mobile since the divergence of human, chimpanzee and Rhesus. We found that 36 of 42 MER85 elements on human chromosome 1 had clear homologs on chromosome 1 of at least one of the other primates, as did 20 of 21 human MER75B elements. The few remaining unmatched human elements likely reflect incomplete sequences or recombination. Most PiggyBac transposases have three conserved aspartic acid residues [13] which may be related to the metal-coordinating DDE motif found in the catalytic domain of many transposase and integrase families [49]. The most likely candidates for these conserved residues in PGBD3 [13] are identical in all five primates (human, chimp, orangutan, Rhesus and marmoset): D270, N352 and D467 (Figure S7). Strikingly, all four pseudogenes in human, chimp and Rhesus encode D at the second position (the draft orangutan and marmoset genomes do not yet include all PGBD3 pseudogenes). Half of the galago PGBD3-like sequences we examined also encode D at this position, while the remainder harbor one of several changes (Figure S8). Together, this suggests that the feral ancestor of human PGBD3 encoded a DDD motif, and that its domestication involved mutations that compromised mobility. Regulatory Patterns among MER85-Associated Genes The exapted mariner transposase in the SETMAR fusion protein retains ancestral DNA binding activity despite attenuation or loss of transposase function [9]. We therefore asked whether genes located closest to MER85 elements might exhibit common themes or functions possibly reflecting a cis-regulatory function of the MER85 elements themselves or proteins that bind to them [50]. Using the ENSEMBL gene database, we located the transcription start site closest to each identified MER85 element (Table S4). The median distance between MER85 elements and transcription starts was 93 kb, similar to what is seen for other human repeats present in 500 to 4,000 copies [51]. Of the 613 MER85 elements, we selected the 585 that were less than 1 Mb from a transcription start site, well within the documented range of proximal enhancer elements [52]. We then used the L2L Microarray Analysis Tool [53] to search for expression patterns among these MER85-associated genes (Table S5). The strongest pattern to emerge was a striking similarity to genes down-regulated by UV irradiation in both normal and repair-deficient (XPB/CS, XPB/TTD) cells: Nine lists overlapped with P<0.02, and there was no similar finding among 1000 random-data simulations (Table S6). Intriguingly, the list of MER85-associated genes also overlapped significantly with the list of genes we had previously shown to be down-regulated by CSB (P = 0.012; corrected to P = 0.015 by random-data simulation) when hTERT-immortalized CSB-wt and CSB-null cell lines are compared [25]. There was no similar overlap with genes up-regulated by CSB. The most enriched Gene Ontology term was the Molecular Function “Glutamate Receptor Activity” (Table S7) reflecting association of MER85 with six glutamate receptors (GRM7, GRID1, GRID2, GRIK2, GRIN2A and GRIN2B) and two related GPCRs (7-fold enrichment, P = 1.6e-5; no similar finding among 1000 random-data simulations). Similar glutamate-related terms were the most enriched in the other Gene Ontology categories as well (data not shown).Discussion We provide a combination of genomic, genetic, mRNA, and protein evidence that a CSB-PGBD3 fusion protein, generated by alternative splicing of CSB exon 5 to a PGBD3 transposon within intron 5, is a major product of the CSB/PGBD3 locus; that the fusion protein has been highly conserved in primates since the transposon was domesticated at least 43 Mya; that the fusion protein continues to be expressed in primary cells from three CS patients who lack functional CSB; and that nearly all CS-causing CSB mutations are located downstream of the exon 5/6 boundary in the ATPase and C-terminal domains of CSB protein, with the result that the fusion protein is predicted to be expressed in at least 21 of 24 characterized CS cell lines lacking functional CSB. The alternatively spliced CSB-PGBD3 mRNA was readily detectable by Q-RT-PCR, and was more abundant in all cell lines tested than full length CSB mRNA; the fusion mRNA had also been observed over a decade ago as an unexplained 3.4 kb polyadenylated RNA reacting with probes for the 5′ end but not the central region of CSB mRNA [54]. Consistent with our Q-RT-PCR data, we found by Western blotting that the CSB-PGBD3 fusion protein is abundant in a variety of primary and established CS and non-CS cells, and reacts as expected with both N-terminal CSB antibodies and a PGBD3-specific antibody. The CSB-PGBD3 Fusion Protein May Contribute to Cockayne Syndrome Three mysteries have shaped thinking about Cockayne syndrome. First, the complete absence of CSB protein apparently does not cause CS, but rather a mild UV-sensitive syndrome with no developmental symptoms [33]. Yet all disease-associated CSB alleles identified to date are recessive; no dominant mutations are known. Second, nearly all CSB mutations that cause CS are located downstream of the exon 5/6 boundary (codon 466) in the ATPase and C-terminal regions of the 1493 residue protein (Figure 7
The conserved CSB-PGBD3 fusion protein is expressed in both primary and established CS cells (Figures 2 Consistent with this hypothesis, 21 of the 24 molecularly characterized CS genotypes appear capable of expressing the CSB-PGBD3 fusion protein (Figure 7 If the CSB-PGBD3 fusion protein does indeed play a role in CS, the complex clinical presentation of the disease [26] might be explained by variable expression of the fusion protein in different individuals and cell types (Figure S3), or by the degree or nature of residual CSB activity. CS and genetically related syndromes like cerebro-oculo-facio-skeletal syndrome (COFS) and the DeSanctis-Cacchione variant of xeroderma pigmentosum (XP-DSC) could also be multifactorial, requiring two or more “hits” or perhaps modifier genes — consistent with mouse models showing that a CSB defect must be combined with a second defect in an NER-GGR factor (XPC [57] or XPA [58]) to generate a strong developmental phenotype. A highly conserved and abundant protein which shares the first 5 exons of CSB is very likely to affect CSB-related cellular functions, but detailed functional characterization of the fusion protein will be required to understand how it could be detrimental in the absence of functional CSB protein. Unlike the ATPase domain of CSB encoded by sequences beyond the exon 5/6 boundary (Figure 1B Conservation of the Fusion Protein in Primate Lineages The PGBD3 element in intron 5 of the CSB gene has not only been conserved for at least 43 Mya from marmoset to human, but the PGBD3 element itself is at least as highly conserved as surrounding CSB sequences (Table 1). Moreover, synonymous changes are at least as abundant for PGBD3 as for CSB in the human, chimp, orangutan, Rhesus and marmoset protein coding sequences (Table 2). We conclude that the initial PGBD3 insertion was selected for a new function advantageous to the primate host, and the CSB-PGBD3 fusion protein was thereafter subject to purifying selection to prevent loss of function. The high correlation of homologous MER85 insertions in human, chimpanzee and Rhesus macaque on chromosome 1, and the absence of any lineage-specific PGBD3 pseudogenes, suggests that neither PGBD3 nor the related MER85 elements have been mobile since the three lineages diverged. These findings are consistent with several recent studies: an analysis of MER85 and MER75 sequence divergence by the Human Genome Sequencing Consortium [1], a comparative analysis of repetitive elements within the human, chimpanzee and Rhesus genomes [69], and an exhaustive study of DNA transposon activity in primates using ENCODE project genomic sequences [48]. The consistent D352 versus N352 difference in the putative catalytic DDD motif between decaying pseudogenes and PGBD3 itself in all species (Figures S7 and S8) suggests that this change may have been critical for both the stability of PGBD3 within CSB and for the demobilization of PGBD-related pseudogenes and MER elements derived from them. The same appears to be true for the domesticated mariner transposase of the SETMAR fusion protein where the catalytic DDD triad has mutated to DDN [9]. We speculate that both the PGBD3 pseudogenes and the abundant MER85 elements are relics of a brief burst of activity when the PGBD3 transposon, newly introduced into an ancestral primate genome, replicated without hindrance, and both spawned and propagated dependent MER elements. Although complete and intact PGBD transposons are rare in all genomes examined [13], the abundance of MER elements suggests that infection of the primate lineage had the potential to get out of control. Indeed, the apparent independent infection of galago, whether by horizontal transfer from the contemporary human-marmoset ancestor or from an external source, and the dozens of degenerate PGBD3-like sequences generated by this infection, highlight the virulence of feral PGBD3. Insertional mutagenesis may have been the least of the dangers, as multiplying MER elements could have provided targets for genomic rearrangements mediated by the PGBD3 transposase — a well documented phenomenon for other DNA transposons with terminal inverted repeats such as Drosophila P-elements [70]. Domestication (i.e., insertion and fixation) of PGBD3 within the CSB gene may have been the genetic response that restored genomic stability. Indeed, recruitment of the offending transposase itself in the form of a fusion protein has obvious advantages: The attenuated or inactivated transposase may simply occupy and occlude binding sites for the normal transposase — much as the absence of a germline-specific mRNA splice transforms the Drosophila P-element transposase into a somatic repressor of transposition [71] — or the fusion protein may actively guide host defense complexes to potential sites of excision, insertion, or rearrangement. It is also interesting to note that the S. cerevisiae homolog of XPD, known as Rad3, inhibits Ty1 retrotransposition [72]. CSB binds to several TFIIH subunits including XPD [17], suggesting a possible role for the N-terminal CSB domain of the CSB-PGBD3 fusion protein in silencing PGBD3 family elements. Clues to a MER85 Gene Regulatory Network Repression of PiggyBac and/or MER85 mobility may explain the initial domestication of PGBD3 more than 43 Mya, but the CSB-PGBD3 fusion protein continues to be conserved and abundantly expressed in primates despite the passage of sufficient time to inactivate existing PGBD-related transposases. This suggests that the CSB-PGBD3 fusion protein may now be conserved for a new or secondary function. Noncoding elements account for the much of the genomic sequence under purifying selection in mammals [73], and many of these conserved noncoding sequences may be remnants of ancient transposons [50],[51]. The exaptation of SETMAR, fusing a SET histone methyltransferase domain to a mariner-like transposase, may have marked the emergence of a novel regulatory network based upon thousands of preexisting and now-selectable mariner elements [9]. Indeed, the exaptation of DNA-binding transposases has been proposed by Feschotte and Pritham [74] as “a pervasive pathway to create a genetic network [from] unlinked binding sites previously dispersed in the genome”. Our analysis of the genes closest to MER85 elements (Table S4) suggests that the CSB-PGBD3 fusion protein may have created just such a regulatory network based on MER85 elements. We had previously shown by expression microarray analysis that CSB protein has a general chromatin remodeling function which includes the maintenance of transcriptional silencing; specifically, loss of CSB phenocopied conditions that disrupt chromatin structure such as treatment with inhibitors of histone deacetylation and DNA methylation, and defects in poly(ADP-ribose)-polymerase [25]. Surprisingly, many of the CSB-repressed genes are associated with MER85 elements (Table S5, “csb_reliable_up” database list). Just as striking was the association of MER85 elements with genes that are repressed following UV irradiation (Tables S5 and S6); UV is known to cause nuclear translocation of CSA [75] which may in turn be required for full CSB function. Thus, recruitment of CSB or CSB-associated factors to MER85 elements by the CSB-PGBD3 fusion protein, perhaps in combination with independently transcribed PGBD3 transposase (Figures 1 Materials and Methods Cell Lines and Culture Conditions HT1080 (human fibrosarcoma), MRC5 (human embryonic lung fibroblast) and Saos-2 (human osteosarcoma) cell lines, along with primary CS cells GM0010903 and GM0010905 were obtained from repositories. WI38 human embryonic lung fibroblasts were immortalized by PG-13/neo retroviral transduction of hTERT cDNA [76]. Immortalized CSB (CS1AN) fibroblasts expressing either wild-type CSB cDNA (CSB-wt line) or enhanced green fluorescent protein (CSB-null line) were generated as described [25]. HeLa, WI38/hTERT, and CS1AN-derived lines were cultured in MEMα media with 10% fetal bovine serum plus supplements (Gibco). Selection for expression of hTERT, CSB, and enhanced green fluorescent protein was maintained with 1 mg/ml G418 and 0.5 µg/ml puromycin, respectively. Cells were passaged by a wash in Puck's EDTA followed by trypsinization. HT1080 cells were cultured in MEMα media with 10% fetal bovine serum, and passaged by a wash in PBS followed by trypsinization. Real-Time Quantitative RT-PCR Total RNA was harvested directly from adherent cells with Trizol reagent (Ambion). Synthesis of cDNA was primed with oligo(dT) and carried out using Superscript II reverse transcriptase (Invitrogen). Each real-time reaction consisted of cDNA template from 20–50 ng of total RNA, 300 nM 5′ and 3′ gene-specific primers, and 1× SYBR Green master mix (Applied Biosystems) in 20 µl total reaction volume. All reactions were performed in triplicate using the DNA Engine Opticon real-time PCR system (MJ Research). Relative differential expression was calculated from mean threshold cycle difference among the three replicate reactions. Products were visualized by pooling the three replicate reactions, purifying and concentrating over a QIAquick column (Qiagen), and running half of the total sample on a 1.0% agarose gel stained with ethidium bromide. Primer sequences are available on request. Sequence Comparison of PiggyBac Genes Pairwise alignments and comparisons of analogous sequences were performed by Needleman-Wunsch global alignment, as implemented in EMBOSS needle. Overhanging ends were excluded from the identity calculations. We compared only homologous sequence regions: For example, we ignored the truncations of several PGBD3 pseudogenes when calculating their homology to PGBD3. Coding region identity was calculated from translation start to stop codons. Pseudogene identities were calculated from the 3′ SS (or start of homology) to the stop codon (or end of homology). We used RepBase RepeatMasker to identify the flanking MER85 and MER75B elements of PGBD3 and PGBD4, respectively. To determine if the conservation of the PGBD3 and CSB coding regions is statistically significant, we analyzed the conservation of six noncoding sequences for comparison: 2 kb of intron sequence beginning both immediately upstream and downstream of the inverted repeats flanking PGBD3, and the four PGBD3 pseudogenes. We determined the conservation of each of these six sequences individually by pairwise alignment between species using needle. We calculated a mean identity of all six and then used the inverted t-distribution to generate a confidence interval. The conservation of the PGBD3 and CSB coding regions was considered significant if the identity fell outside the 95% confidence interval of conservation for these six noncoding regions; this calculation is not dependent on the length of the query sequences. In order to determine whether MER85 and MER75B elements have been mobile since the divergence of the three primates, we used NCBI megaBLAST to identify all MER85 and MER75B elements on human, chimpanzee and Rhesus chromosome 1 (June 2006 NCBI sequence releases), based on the consensus sequence for these elements in RepBase Update [47]. We then extracted 1 kb of the surrounding sequence for each element, and used EMBOSS needle to align every such human sequence pairwise with every sequence from chimpanzee and monkey. Marmoset (version 2.0.2, released June 2007) and orangutan (version 2.0.2, released July 2007) preliminary genome assemblies were downloaded from the Washington University Genome Sequencing Center. Mouse lemur (draft v2, released June 2007), galago (draft v1, released June 2006) and tree shrew (draft v1, released June 2006) genome sequences were downloaded from the Broad Institute Mammalian Genome Project. Ka/Ks analysis was performed using SNAP (Synonymous Nonsynonymous Analysis Program) from the HIV Database at Los Alamos National Laboratories (USA) [42]. The significance of differences in Ka/Ks values was calculated with the Student's T-test using a two-tailed distribution and an assumption of unequal variance. All sequences and alignments used in this study are available on request. Analysis of MER-Associated Genes MER85 elements were identified in the March 2006 release of the NCBI human genome sequence by using NCBI megaBLAST to query each complete chromosome sequence for the RepBase MER85 consensus sequence. The start site of each element was matched to the closest start site of an HGNC-named gene from the ENSEMBL database. The resulting list of genes, excluding those located >1 Mb from their associated MER85 element, was analyzed with the 2007.1 release of the L2L Microarray Analysis Tool, including several unreleased lists representing CSB-regulated genes. The list of all HGNC-named genes in the ENSEMBL database was used as the null set. The P values generated by L2L were validated using random-data simulations as described previously [25]. Briefly, we randomly selected 1000 lists of genes from the null set, each the same size as the list of MER85-associated genes, and ran each through an identical L2L analysis. These random-data results were mined for the frequency of the outcomes seen in the analysis of MER85-associated genes. Western Blots GM00739B/hTERT cells were transfected in 100 mm tissue culture plates with 10 µg of plasmid constructs using 15 µl of Fugene 6 reagent (Roche). After 48 h, cells were washed with PBS and harvested by scraping. Cell pellets were resuspended in 100 µl of SDS loading buffer (25 mM Tris, pH 6.8, 2% SDS, 0.1% bromephenol blue, 10% sucrose, 0.12 M β-mercaptoethanol), sonicated to shear DNA, and denatured by heating at 95°C for 10 min. Non-transfected plates of HT1080 and WI-38/hTERT cells were harvested in the same manner. Proteins were separated on a 6% gel by SDS-PAGE using the Mini-Protean 3 Cell (BioRad) in a Tris/glycine/SDS buffer (1.5 g/l Tris base, 7.2 g/l glycine, 1% SDS). Proteins were transferred to a PDVF membrane in 25 mM Tris, 192 mM glycine, and 20% methanol buffer using a Mini Trans-Blot Cell (BioRad). After transfer, the PVDF membranes were blocked for 2 h at room temperature in TBST (50 mM Tris, pH 7.4, 150 mM NaCl, 0.05% Tween 20) plus 5% nonfat dry milk. The membrane was then incubated at room temperature in TBST plus 5% nonfat dry milk for 2 h with a 1 1000 dilution of primary antibody, washed twice for 10 min each, incubated for 1 h with a 1 5000 dilution of HRP-conjugated secondary antibody (Santa Cruz Biotechnology), and finally washed 4 times for 10 min each in TBST alone. Chemiluminescent detection was performed using the ECL Plus™ Western Blotting Detection System (Amersham) and Kodak X-Omat Blue film. Anti-CSB antibodies were generated in our laboratory as rabbit polyclonals raised to the C-terminal 158 amino acids or N-terminal 240 amino acids of CSB expressed as bacterial GST fusion proteins. Anti-GST antibodies were removed from the serum by passage over a GST column. Anti-PGBD3 antibody was purchased from AVIVA Systems Biology, catalog number ARP36534.GenBank Accessions for Primate Sequences Human PGBD3 and the four PGBD3 pseudogenes are present in the NCBI Entrez Gene database, but have not yet been curated in the chimpanzee or Rhesus genomes. The accessions and approximate indicies for the coding region sequences used in this study are as follows:
Table S1 CSB mutations associated with human disease. (0.06 MB DOC) Click here for additional data file.(57K, doc) Table S2 Comparison of galago PGBD3-like sequences with human PGBD3. (0.06 MB DOC) Click here for additional data file.(63K, doc) Table S3 Sequence identity between PGBD3 and pseudogenes. (0.04 MB DOC) Click here for additional data file.(39K, doc) Table S4 Locations of human MER85 elements and MER85-associated genes. (0.09 MB XLS) Click here for additional data file.(92K, xls) Table S5 L2L results comparing MER85-associated genes to the L2L microarray database, with P values corrected by random-data simulation. (0.17 MB XLS) Click here for additional data file.(166K, xls) Table S6 L2L microarray database results analyzed by groups of similar database lists. (0.02 MB XLS) Click here for additional data file.(20K, xls) Table S7 L2L results comparing MER85-associated genes to Gene Ontology Molecular Function terms, with P values corrected by random-data simulation. (0.17 MB XLS) Click here for additional data file.(169K, xls) Figure S1 Quantitation of the relative abundance of CSB and CSB-PGBD3 fusion transcripts in HeLa cDNA. (A) Ratio of abundance between similarly sized CSB and fusion PCR products, assayed by real time RT-PCR. The average ratio of all eighteen fusion:CSB comparisons is 2.0:1. (B) Expected sizes of all PCR products. (1.07 MB TIF) Click here for additional data file.(1.0M, tif) Figure S2 Fusion mRNA is more abundant than CSB in other cell lines. Three pairs of primer combinations (A1–D1, A2–D2 and B1–E1) were tested on cDNA from CSB-null, CSB-wt and WI38/hTERT cell lines. In all cases, the fusion PCR products were substantially more abundant than the corresponding CSB products as quantified by real time RT-PCR. (0.65 MB TIF) Click here for additional data file.(636K, tif) Figure S3 Additional Western blots for CSB and the fusion protein. (A) Western blot using a Cterminal antibody against CSB reveals only the expected major band for full length CSB(filled arrowhead) in MRC5 (SV40-immortalized human fetal lung fibroblast), E61ANd(GM00739B fibroblasts from compound heterozygote CS1AN, immortalized by SV40 and rescued by wt CSB cDNA [1]), Saos-2 (human osteosarcoma) and HT1080 (human fibrosarcoma); full length CSB is not seen in CS1AN/SV (the SV40-immortalized but unrescued parent of E61ANd). The >250 kDa species (labeled X) is seen only with Cterminal antibody and correlates with the abundance of full length CSB, but is also weakly expressed in the CS1AN/SV line lacking full length CSB; X may be a modified form of full length CSB, and weak expression in CS1AN/SV may indicate that one or both of the nonsense mutations in this compound heterozygote are leaky [2]. (B) Western blot using an N-terminal antibody against CSB reveals both full length CSB (filled arrowhead) and a second major band corresponding to the fusion protein (hollow arrowhead) in MRC5, Saos-2, and HT1080. We were also able to detect the fusion protein, but not full-length CSB, in SV40-immortalized primary fibroblasts from CSB patient CS1BE (derived from GM01629; data not shown). In agreement with our data, a band corresponding to the CSB-PGBD3 fusion protein was previously seen in the normal SV40-transformed WI38VA13 line [3] and in normal hTERT-immortalized BJ1 fibroblasts [4]. Although present in our CS1AN/hTERT line (Fig. 3 (4.58 MB TIF) Click here for additional data file.(4.3M, tif) Figure S4 Conservation of the 3′ splice site in PGBD3. (A) Conserved 3′ splice site in the genomic sequence of PGBD3 and pseudogenes for human (Hs, Homo sapiens), chimpanzee (Pt, Pan troglodytes), Rhesus (Mm, Macaca mulatta), marmoset (Ct, Callithrix jacchus) and orangutan (Pa, Pongo abelli), organized by gene. Uppercase letters represent exon sequence; ATG at the 5′ end is the start codon of the PiggyBac ORF. Notably, the entire sequence shown is perfectly conserved in PGBD3 from all three species. (B) As for (A), organized by species to show the variations between PGBD3 and its pseudogenes. (1.47 MB TIF) Click here for additional data file.(1.4M, tif) Figure S5 Conservation of the polyadenylation signal in PGBD3. (A) Conserved polyadenylation signals in the genomic sequence of PGBD3 and pseudogenes for human (Hs), chimpanzee (Pt), Rhesus (Mm), marmoset (Ct) and orangutan (Pa), organized by gene. Uppercase letters represent the AAUAAA motif. Dashes indicate pseudogenes for which the 3′ end (including the polyadenylation site) is no longer present. (B) As for (A), organized by species to show the variations between PGBD3 and its pseudogenes. (1.17 MB TIF) Click here for additional data file.(1.1M, tif) Figure S6 Schematic of PGBD3 element and the relationship with PGBD3 pseudogenes. The 5′ and 3′ ends of PGBD3 correspond to the 5′ arm (100 nt) and 3′ arm (40 nt) of a MER85 element (140 nt). The 13 nt inverted repeats of MER85 define the boundaries of the element, which is flanked by a TTAA target site duplication. Only two pseudogenes span the entire element, but all pseudogenes with intact ends share the same flanking features as PGBD3, and exhibit no homology to PGBD3 beyond the TTAA duplications. The 3′ end of PGBD3P3 is inverted, but the inverted sequence is as similar to PGBD3 as the remainder. An AluSx SINE has inserted into PGBD3P4 between the MER85 element and the 3′ SS. Also note two fragments (yellow) that appear to be derivatives of the left end of MER85, spanning 14 and 25 nt. The fragments each include the 5 innermost nt of the MER85 inverted repeat, but also share 8 further nt of unique flanking inverted repeat. Chimpanzee and Rhesus PGBD3 are structurally identical to human PGBD3, including all of the elements displayed. The chimpanzee pseudogenes are essentially identical to their human homologs. The Rhesus pseudogenes are more highly degraded, particularly at the 3′ end. (0.61 MB TIF) Click here for additional data file.(594K, tif) Figure S7 The putative catalytic motif is DDD in all pseudogenes, but DND in all PGBD3s. PiggyBac transposases share a DDD motif [1] that may be analogous to the metalcoordinating DDE motif common to other transposase families [2]. The most likely candidates for this motif in human PGBD3 are D270, N352 and D467. N352 is the result of a G to A mutation that occurred after generation of the four pseudogenes; all pseudogenes encode D at this position. (A) Genomic sequence of the three putative catalytic residues of PGBD3 and pseudogenes for human (Hs), chimpanzee (Pt), Rhesus (Mm), marmoset (Ct) and orangutan (Pa) organized by gene. Uppercase letters are the codons for the three residues. Dashes indicate pseudogenes for which the homologous sequence is no longer present; in addition, a 49 nt gap in the genome sequence of orangutan obscures D270. (B) As for (A), organized by species to show the variations between PGBD3 and its pseudogenes. (1.30 MB TIF) Click here for additional data file.(1.2M, tif) Figure S8 Consensus sequence of putative catalytic motif is DDD in galago PGBD3-related sequences. Six of seven sequences encode D at position 270, and six of seven encode D at 467. At position 352, four encode D while three encode K, G or N. Numbers designtating the PGBD3-like sequences represent the galago (Otolemur garnetti, Og) genomic contig on which the sequence is located. (0.36 MB TIF) Click here for additional data file.(349K, tif) Acknowledgments We thank Brian Kennedy and Richard Palmiter for helpful discussions. Footnotes The authors have declared that no competing interests exist. This work was supported by NIH GM41624 (AMW), NIH Medical Scientist Training Program (JCN) and a Rosetta Inpharmatics Fellowship from the Merck Research Laboratories (JCN). None of these funders had any role in the design and conduct of the study, in the collection, analysis, and interpretation of the data, or in the preparation, review, or approval of the manuscript. References 1. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. [PubMed] 2. Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SE. Natural genetic variation caused by transposable elements in humans. Genetics. 2004;168:933–951. [PubMed] 3. Deininger PL, Batzer MA. Alu repeats and human disease. Mol Genet Metab. 1999;67:183–193. [PubMed] 4. Gould S, Vrba E. Exaptation — a missing term in the science of form. Paleobiology. 1982;8:4–15. 5. Kapitonov VV, Jurka J. RAG1 core and V(D)J recombination signal sequences were derived from Transib transposons. PLoS Biol. 2005;3:e181. [PubMed] 6. Yao MC, Chao JL. RNA-guided DNA deletion in Tetrahymena: an RNAi-based mechanism for programmed genome rearrangements. Annu Rev Genet. 2005;39:537–559. [PubMed] 7. Smit AF, Riggs AD. Tiggers and DNA transposon fossils in the human genome. Proc Natl Acad Sci U S A. 1996;93:1443–1448. [PubMed] 8. Cam HP, Noma KI, Ebina H, Levin HL, Grewal SI. Host genome surveillance for retrotransposons by transposon-derived proteins. Nature. 2007 9. Cordaux R, Udit S, Batzer MA, Feschotte C. Birth of a chimeric primate gene by capture of the transposase gene from a mobile element. Proc Natl Acad Sci U S A. 2006;103:8101–8106. [PubMed] 10. Fraser MJ, Ciszczon T, Elick T, Bauser C. Precise excision of TTAA-specific lepidopteran transposons piggyBac (IFP2) and tagalong (TFP3) from the baculovirus genome in cell lines from two species of Lepidoptera. Insect Mol Biol. 1996;5:141–151. [PubMed] 11. Cary LC, Goebel M, Corsaro BG, Wang HG, Rosen E, et al. Transposon mutagenesis of baculoviruses: analysis of Trichoplusia ni transposon IFP2 insertions within the FP-locus of nuclear polyhedrosis viruses. Virology. 1989;172:156–169. [PubMed] 12. Pritham EJ, Feschotte C, Wessler SR. Unexpected diversity and differential success of DNA transposons in four species of entamoeba protozoans. Mol Biol Evol. 2005;22:1751–1763. [PubMed] 13. Sarkar A, Sim C, Hong YS, Hogan JR, Fraser MJ, et al. Molecular evolutionary analysis of the widespread piggyBac transposon family and related “domesticated” sequences. Mol Genet Genomics. 2003;270:173–180. [PubMed] 14. Ding S, Wu X, Li G, Han M, Zhuang Y, et al. Efficient transposition of the piggyBac (PB) transposon in mammalian cells and mice. Cell. 2005;122:473–483. [PubMed] 15. Wu SC, Meir YJ, Coates CJ, Handler AM, Pelczar P, et al. piggyBac is a flexible and highly active transposon as compared to sleeping beauty, Tol2, and Mos1 in mammalian cells. Proc Natl Acad Sci U S A. 2006;103:15008–15013. [PubMed] 16. Nance MA, Berry SA. Cockayne syndrome: review of 140 cases. Am J Med Genet. 1992;42:68–84. [PubMed] 17. Selby CP, Sancar A. Human transcription-repair coupling factor CSB/ERCC6 is a DNA-stimulated ATPase but is not a helicase and does not disrupt the ternary transcription complex of stalled RNA polymerase II. J Biol Chem. 1997;272:1885–1890. [PubMed] 18. Citterio E, Rademakers S, van der Horst GT, van Gool AJ, Hoeijmakers JH, et al. Biochemical and biological characterization of wild-type and ATPase-deficient Cockayne syndrome B repair protein. J Biol Chem. 1998;273:11844–11851. [PubMed] 19. Christiansen M, Stevnsner T, Modin C, Martensen PM, Brosh RM, Jr, et al. Functional consequences of mutations in the conserved SF2 motifs and post-translational phosphorylation of the CSB protein. Nucleic Acids Res. 2003;31:963–973. [PubMed] 20. Beerens N, Hoeijmakers JH, Kanaar R, Vermeulen W, Wyman C. The CSB protein actively wraps DNA. J Biol Chem. 2005;280:4722–4729. [PubMed] 21. Citterio E, Van Den Boom V, Schnitzler G, Kanaar R, Bonte E, et al. ATP-dependent chromatin remodeling by the Cockayne syndrome B DNA repair-transcription-coupling factor. Mol Cell Biol. 2000;20:7643–7653. [PubMed] 22. Rapin I, Lindenbaum Y, Dickson DW, Kraemer KH, Robbins JH. Cockayne syndrome and xeroderma pigmentosum. Neurology. 2000;55:1442–1449. [PubMed] 23. Cleaver JE, Thompson LH, Richardson AS, States JC. A summary of mutations in the UV-sensitive disorders: xeroderma pigmentosum, Cockayne syndrome, and trichothiodystrophy. Hum Mutat. 1999;14:9–22. [PubMed] 24. Venema J, Mullenders LH, Natarajan AT, van Zeeland AA, Mayne LV. The genetic defect in Cockayne syndrome is associated with a defect in repair of UV-induced DNA damage in transcriptionally active DNA. Proc Natl Acad Sci U S A. 1990;87:4707–4711. [PubMed] 25. Newman JC, Bailey AD, Weiner AM. Cockayne syndrome group B protein (CSB) plays a general role in chromatin maintenance and remodeling. Proc Natl Acad Sci U S A. 2006;103:9613–9618. [PubMed] 26. Pasquier L, Laugel V, Lazaro L, Dollfus H, Journel H, et al. Wide clinical variability among 13 new Cockayne syndrome cases confirmed by biochemical assays. Arch Dis Child. 2006;91:178–182. [PubMed] 27. Friedberg EC, Wood RD. New insights into the combined Cockayne/xeroderma pigmentosum complex: human XPG protein can function in transcription factor stability. Mol Cell. 2007;26:162–164. [PubMed] 28. Ito S, Kuraoka I, Chymkowitch P, Compe E, Takedachi A, et al. XPG stabilizes TFIIH, allowing transactivation of nuclear receptors: implications for Cockayne syndrome in XP-G/CS patients. Mol Cell. 2007;26:231–243. [PubMed] 29. Tuo J, Muftuoglu M, Chen C, Jaruga P, Selzer RR, et al. The Cockayne Syndrome group B gene product is involved in general genome base excision repair of 8-hydroxyguanine in DNA. J Biol Chem. 2001;276:45772–45779. [PubMed] 30. Osterod M, Larsen E, Le Page F, Hengstler JG, Van Der Horst GT, et al. A global DNA repair mechanism involving the Cockayne syndrome B (CSB) gene product can prevent the in vivo accumulation of endogenous oxidative DNA base damage. Oncogene. 2002;21:8232–8239. [PubMed] 31. Niedernhofer LJ, Garinis GA, Raams A, Lalai AS, Robinson AR, et al. A new progeroid syndrome reveals that genotoxic stress suppresses the somatotroph axis. Nature. 2006;444:1038–1043. [PubMed] 32. van der Pluijm I, Garinis GA, Brandt RM, Gorgels TG, Wijnhoven SW, et al. Impaired genome maintenance suppresses the growth hormone–insulin-like growth factor 1 axis in mice with Cockayne syndrome. PLoS Biol. 2006;5:e2. [PubMed] 33. Horibata K, Iwamoto Y, Kuraoka I, Jaspers NG, Kurimasa A, et al. Complete absence of Cockayne syndrome group B gene product gives rise to UV-sensitive syndrome but not Cockayne syndrome. Proc Natl Acad Sci U S A. 2004;101:15410–15415. [PubMed] 34. Hashimoto S, Suga T, Kudo E, Ihn H, Uchino M, et al. Adult-Onset Neurological Degeneration in a Patient with Cockayne Syndrome and a Null Mutation in the CSB Gene. J Invest Dermatol. 2008 35. Laugel V, Dalloz C, Stary A, Cormier-Daire V, Desguerre I, et al. Deletion of 5′ sequences of the CSB gene provides insight into the pathophysiology of Cockayne syndrome. Eur J Hum Genet. 2008 36. Glazko GV, Nei M. Estimation of divergence times for major lineages of primate species. Mol Biol Evol. 2003;20:424–434. [PubMed] 37. Yu W, Rusterholtz KJ, Krummel AT, Lehman N. Detection of high levels of recombination generated during PCR amplification of RNA templates. Biotechniques. 2006;40:499–507. [PubMed] 38. Troelstra C, van Gool A, de Wit J, Vermeulen W, Bootsma D, et al. ERCC6, a member of a subfamily of putative helicases, is involved in Cockayne's syndrome and preferential repair of active genes. Cell. 1992;71:939–953. [PubMed] 39. Garcia-Ortega L, De los Rios V, Martinez-Ruiz A, Onaderra M, Lacadena J, et al. Anomalous electrophoretic behavior of a very acidic protein: ribonuclease U2. Electrophoresis. 2005;26:3407–3413. [PubMed] 40. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005;437:69–87. [PubMed] 41. Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, et al. Evolutionary and biomedical insights from the rhesus macaque genome. Science. 2007;316:222–234. [PubMed] 42. Korber B. HIV signature and sequence variation analysis. In: Rodrigo AG, Learn GH, editors. Computational Analysis of HIV Molecular Sequences. Dordrecht, Netherlands: Kluwer Academic Publishers; 2000. pp. 55–72. 43. Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986;3:418–426. [PubMed] 44. Roos C, Schmitz J, Zischler H. Primate jumping genes elucidate strepsirrhine phylogeny. Proc Natl Acad Sci U S A. 2004;101:10650–10654. [PubMed] 45. Semon M, Duret L. Evidence that functional transcription units cover at least half of the human genome. Trends Genet. 2004;20:229–232. [PubMed] 46. Lobo NF, Fraser TS, Adams JA, Fraser MJ., Jr Interplasmid transposition demonstrates piggyBac mobility in vertebrate species. Genetica. 2006;128:347–357. [PubMed] 47. Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005;110:462–467. [PubMed] 48. Pace JK, 2nd, Feschotte C. The evolutionary history of human DNA transposons: evidence for intense activity in the primate lineage. Genome Res. 2007;17:422–432. [PubMed] 49. Curcio MJ, Derbyshire KM. The outs and ins of transposition: from mu to kangaroo. Nat Rev Mol Cell Biol. 2003;4:865–877. 50. Lowe CB, Bejerano G, Haussler D. Thousands of human mobile element fragments undergo strong purifying selection near developmental genes. Proc Natl Acad Sci U S A. 2007;104:8005–8010. [PubMed] 51. Kamal M, Xie X, Lander ES. A large family of ancient repeat elements in the human genome is under strong selection. Proc Natl Acad Sci U S A. 2006;103:2740–2745. [PubMed] 52. Kleinjan DA, van Heyningen V. Long-range control of gene expression: emerging mechanisms and disruption in disease. Am J Hum Genet. 2005;76:8–32. [PubMed] 53. Newman JC, Weiner AM. L2L: a simple tool for discovering the hidden significance in microarray expression data. Genome Biol. 2005;6:R81. [PubMed] 54. Troelstra C, Hesen W, Bootsma D, Hoeijmakers JH. Structure and expression of the excision repair gene ERCC6, involved in the human disorder Cockayne's syndrome group B. Nucleic Acids Res. 1993;21:419–426. [PubMed] 55. van der Horst GT, van Steeg H, Berg RJ, van Gool AJ, de Wit J, et al. Defective transcription-coupled repair in Cockayne syndrome B mice is associated with skin cancer predisposition. Cell. 1997;89:425–435. [PubMed] 56. van der Horst GT, Meira L, Gorgels TG, de Wit J, Velasco-Miguel S, et al. UVB radiation-induced cancer predisposition in Cockayne syndrome group A (Csa) mutant mice. DNA Repair (Amst). 2002;1:143–157. [PubMed] 57. Laposa RR, Huang EJ, Cleaver JE. Increased apoptosis, p53 up-regulation, and cerebellar neuronal degeneration in repair-deficient Cockayne syndrome mice. Proc Natl Acad Sci U S A. 2007;104:1389–1394. [PubMed] 58. Murai M, Enokido Y, Inamura N, Yoshino M, Nakatsu Y, et al. Early postnatal ataxia and abnormal cerebellar development in mice lacking Xeroderma pigmentosum Group A and Cockayne syndrome Group B DNA repair genes. Proc Natl Acad Sci U S A. 2001;98:13379–13384. [PubMed] 59. Colella S, Nardo T, Mallery D, Borrone C, Ricci R, et al. Alterations in the CSB gene in three Italian patients with the severe form of Cockayne syndrome (CS) but without clinical photosensitivity. Hum Mol Genet. 1999;8:935–941. [PubMed] 60. Mallery DL, Tanganelli B, Colella S, Steingrimsdottir H, van Gool AJ, et al. Molecular analysis of mutations in the CSB (ERCC6) gene in patients with Cockayne syndrome. Am J Hum Genet. 1998;62:77–85. [PubMed] 61. Firth AE, Brown CM. Detecting overlapping coding sequences in virus genomes. BMC Bioinformatics. 2006;7:75. [PubMed] 62. Weiner AM, Weber K. Natural read-through at the UGA termination signal of Q-beta coat protein cistron. Nat New Biol. 1971;234:206–209. [PubMed] 63. Beier H, Grimm M. Misreading of termination codons in eukaryotes by natural nonsense suppressor tRNAs. Nucleic Acids Res. 2001;29:4767–4782. [PubMed] 64. Baranov PV, Gesteland RF, Atkins JF. P-site tRNA is a crucial initiator of ribosomal frameshifting. Rna. 2004;10:221–230. [PubMed] 65. Remacle C, Colin M, Matagne RF. Suppression of a +1 T mutation by a nearby substitution in the mitochondrial cox1 gene of Chlamydomonas reinhardtii: a new type of frameshift suppression in an organelle genome. Mol Gen Genet. 1998;259:294–298. [PubMed] 66. Amrani N, Sachs MS, Jacobson A. Early nonsense: mRNA decay solves a translational problem. Nat Rev Mol Cell Biol. 2006;7:415–425. [PubMed] 67. Sunesen M, Selzer RR, Brosh RM, Jr, Balajee AS, Stevnsner T, et al. Molecular characterization of an acidic region deletion mutant of Cockayne syndrome group B protein. Nucleic Acids Res. 2000;28:3151–3159. [PubMed] 68. Groisman R, Kuraoka I, Chevallier O, Gaye N, Magnaldo T, et al. CSA-dependent degradation of CSB by the ubiquitin-proteasome pathway establishes a link between complementation factors of the Cockayne syndrome. Genes Dev. 2006;20:1429–1434. [PubMed] 69. Han K, Konkel MK, Xing J, Wang H, Lee J, et al. Mobile DNA in Old World monkeys: a glimpse through the rhesus macaque genome. Science. 2007;316:238–240. [PubMed] 70. Sved JA, Blackman LM, Gilchrist AS, Engels WR. High levels of recombination induced by homologous P elements in Drosophila melanogaster. Mol Gen Genet. 1991;225:443–447. [PubMed] 71. Misra S, Rio DC. Cytotype control of Drosophila P element transposition: the 66 kd protein is a repressor of transposase activity. Cell. 1990;62:269–284. [PubMed] 72. Curcio MJ, Garfinkel DJ. New lines of host defense: inhibition of Ty1 retrotransposition by Fus3p and NER/TFIIH. Trends Genet. 1999;15:43–45. [PubMed] 73. Waterston RH, Lindblad-Toh K, Birney E, Rogers J, Abril JF, et al. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002;420:520–562. [PubMed] 74. Feschotte C, Pritham EJ. DNA transposons and the evolution of eukaryotic genomes. Annu Rev Genet. 2007;41:331–368. [PubMed] 75. Kamiuchi S, Saijo M, Citterio E, de Jager M, Hoeijmakers JH, et al. Translocation of Cockayne syndrome group A protein to the nuclear matrix: possible relevance to transcription-coupled DNA repair. Proc Natl Acad Sci U S A. 2002;99:201–206. [PubMed] 76. Kiyono T, Foster SA, Koop JI, McDougall JK, Galloway DA, et al. Both Rb/p16INK4a inactivation and telomerase activity are required to immortalize human epithelial cells. Nature. 1998;396:84–88. [PubMed] 77. Greenhaw GA, Hebert A, Duke-Woodside ME, Butler IJ, Hecht JT, et al. Xeroderma pigmentosum and Cockayne syndrome: overlapping clinical and biochemical phenotypes. Am J Hum Genet. 1992;50:677–689. [PubMed] 78. Steiper ME, Young NM. Primate molecular divergence rates. Mol Phylogenet Evol. 2006;41:384–394. [PubMed] 79. Springer MS, Murphy WJ, Eizirik E, O'Brian SJ. Placental mammal diversification and the Cretaceous-Tertiary boundary. Proc Natl Acad Sci U S A. 2003;100:1056–1061. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Genetics. 2004 Oct; 168(2):933-51.
[Genetics. 2004]Mol Genet Metab. 1999 Jul; 67(3):183-93.
[Mol Genet Metab. 1999]PLoS Biol. 2005 Jun; 3(6):e181.
[PLoS Biol. 2005]Annu Rev Genet. 2005; 39():537-59.
[Annu Rev Genet. 2005]Insect Mol Biol. 1996 May; 5(2):141-51.
[Insect Mol Biol. 1996]Virology. 1989 Sep; 172(1):156-69.
[Virology. 1989]Mol Biol Evol. 2005 Sep; 22(9):1751-63.
[Mol Biol Evol. 2005]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Mol Genet Genomics. 2003 Nov; 270(2):173-80.
[Mol Genet Genomics. 2003]Am J Med Genet. 1992 Jan 1; 42(1):68-84.
[Am J Med Genet. 1992]J Biol Chem. 1997 Jan 17; 272(3):1885-90.
[J Biol Chem. 1997]Nucleic Acids Res. 2003 Feb 1; 31(3):963-73.
[Nucleic Acids Res. 2003]J Biol Chem. 2005 Feb 11; 280(6):4722-9.
[J Biol Chem. 2005]Mol Cell Biol. 2000 Oct; 20(20):7643-53.
[Mol Cell Biol. 2000]Mol Biol Evol. 2003 Mar; 20(3):424-34.
[Mol Biol Evol. 2003]Proc Natl Acad Sci U S A. 2004 Oct 26; 101(43):15410-5.
[Proc Natl Acad Sci U S A. 2004]Biotechniques. 2006 Apr; 40(4):499-507.
[Biotechniques. 2006]Proc Natl Acad Sci U S A. 2006 Jun 20; 103(25):9613-8.
[Proc Natl Acad Sci U S A. 2006]Cell. 1992 Dec 11; 71(6):939-53.
[Cell. 1992]Electrophoresis. 2005 Sep; 26(18):3407-13.
[Electrophoresis. 2005]Am J Hum Genet. 1992 Apr; 50(4):677-89.
[Am J Hum Genet. 1992]Nature. 2005 Sep 1; 437(7055):69-87.
[Nature. 2005]Science. 2007 Apr 13; 316(5822):222-34.
[Science. 2007]Mol Biol Evol. 2003 Mar; 20(3):424-34.
[Mol Biol Evol. 2003]Mol Phylogenet Evol. 2006 Nov; 41(2):384-94.
[Mol Phylogenet Evol. 2006]Proc Natl Acad Sci U S A. 2003 Feb 4; 100(3):1056-61.
[Proc Natl Acad Sci U S A. 2003]Mol Biol Evol. 1986 Sep; 3(5):418-26.
[Mol Biol Evol. 1986]Proc Natl Acad Sci U S A. 2004 Jul 20; 101(29):10650-4.
[Proc Natl Acad Sci U S A. 2004]Mol Biol Evol. 2003 Mar; 20(3):424-34.
[Mol Biol Evol. 2003]Trends Genet. 2004 May; 20(5):229-32.
[Trends Genet. 2004]Proc Natl Acad Sci U S A. 2006 Oct 10; 103(41):15008-13.
[Proc Natl Acad Sci U S A. 2006]Genetica. 2006 Sep-Nov; 128(1-3):347-57.
[Genetica. 2006]Cytogenet Genome Res. 2005; 110(1-4):462-7.
[Cytogenet Genome Res. 2005]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Mol Biol Evol. 2005 Sep; 22(9):1751-63.
[Mol Biol Evol. 2005]Genome Res. 2007 Apr; 17(4):422-32.
[Genome Res. 2007]Mol Genet Genomics. 2003 Nov; 270(2):173-80.
[Mol Genet Genomics. 2003]Proc Natl Acad Sci U S A. 2006 May 23; 103(21):8101-6.
[Proc Natl Acad Sci U S A. 2006]Proc Natl Acad Sci U S A. 2007 May 8; 104(19):8005-10.
[Proc Natl Acad Sci U S A. 2007]Proc Natl Acad Sci U S A. 2006 Feb 21; 103(8):2740-5.
[Proc Natl Acad Sci U S A. 2006]Am J Hum Genet. 2005 Jan; 76(1):8-32.
[Am J Hum Genet. 2005]Genome Biol. 2005; 6(9):R81.
[Genome Biol. 2005]Nucleic Acids Res. 1993 Feb 11; 21(3):419-26.
[Nucleic Acids Res. 1993]Proc Natl Acad Sci U S A. 2004 Oct 26; 101(43):15410-5.
[Proc Natl Acad Sci U S A. 2004]Cell. 1997 May 2; 89(3):425-35.
[Cell. 1997]DNA Repair (Amst). 2002 Feb 28; 1(2):143-57.
[DNA Repair (Amst). 2002]Proc Natl Acad Sci U S A. 2007 Jan 23; 104(4):1389-94.
[Proc Natl Acad Sci U S A. 2007]Proc Natl Acad Sci U S A. 2001 Nov 6; 98(23):13379-84.
[Proc Natl Acad Sci U S A. 2001]Proc Natl Acad Sci U S A. 2004 Oct 26; 101(43):15410-5.
[Proc Natl Acad Sci U S A. 2004]Proc Natl Acad Sci U S A. 2004 Oct 26; 101(43):15410-5.
[Proc Natl Acad Sci U S A. 2004]Hum Mol Genet. 1999 May; 8(5):935-41.
[Hum Mol Genet. 1999]Am J Hum Genet. 1998 Jan; 62(1):77-85.
[Am J Hum Genet. 1998]BMC Bioinformatics. 2006 Feb 16; 7():75.
[BMC Bioinformatics. 2006]Nat New Biol. 1971 Sep 15; 234(50):206-9.
[Nat New Biol. 1971]Nucleic Acids Res. 2001 Dec 1; 29(23):4767-82.
[Nucleic Acids Res. 2001]Arch Dis Child. 2006 Feb; 91(2):178-82.
[Arch Dis Child. 2006]Proc Natl Acad Sci U S A. 2007 Jan 23; 104(4):1389-94.
[Proc Natl Acad Sci U S A. 2007]Proc Natl Acad Sci U S A. 2001 Nov 6; 98(23):13379-84.
[Proc Natl Acad Sci U S A. 2001]Nucleic Acids Res. 2000 Aug 15; 28(16):3151-9.
[Nucleic Acids Res. 2000]Genes Dev. 2006 Jun 1; 20(11):1429-34.
[Genes Dev. 2006]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Science. 2007 Apr 13; 316(5822):238-40.
[Science. 2007]Genome Res. 2007 Apr; 17(4):422-32.
[Genome Res. 2007]Proc Natl Acad Sci U S A. 2006 May 23; 103(21):8101-6.
[Proc Natl Acad Sci U S A. 2006]Mol Genet Genomics. 2003 Nov; 270(2):173-80.
[Mol Genet Genomics. 2003]Mol Gen Genet. 1991 Mar; 225(3):443-7.
[Mol Gen Genet. 1991]Cell. 1990 Jul 27; 62(2):269-84.
[Cell. 1990]Trends Genet. 1999 Feb; 15(2):43-5.
[Trends Genet. 1999]J Biol Chem. 1997 Jan 17; 272(3):1885-90.
[J Biol Chem. 1997]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Proc Natl Acad Sci U S A. 2007 May 8; 104(19):8005-10.
[Proc Natl Acad Sci U S A. 2007]Proc Natl Acad Sci U S A. 2006 Feb 21; 103(8):2740-5.
[Proc Natl Acad Sci U S A. 2006]Proc Natl Acad Sci U S A. 2006 May 23; 103(21):8101-6.
[Proc Natl Acad Sci U S A. 2006]Annu Rev Genet. 2007; 41():331-68.
[Annu Rev Genet. 2007]Nature. 1998 Nov 5; 396(6706):84-8.
[Nature. 1998]Proc Natl Acad Sci U S A. 2006 Jun 20; 103(25):9613-8.
[Proc Natl Acad Sci U S A. 2006]Cytogenet Genome Res. 2005; 110(1-4):462-7.
[Cytogenet Genome Res. 2005]Proc Natl Acad Sci U S A. 2006 Jun 20; 103(25):9613-8.
[Proc Natl Acad Sci U S A. 2006]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Genetics. 2004 Oct; 168(2):933-51.
[Genetics. 2004]Mol Genet Metab. 1999 Jul; 67(3):183-93.
[Mol Genet Metab. 1999]PLoS Biol. 2005 Jun; 3(6):e181.
[PLoS Biol. 2005]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Genetics. 2004 Oct; 168(2):933-51.
[Genetics. 2004]