![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2004, Cold Spring Harbor Laboratory Press Serial segmental duplications during primate evolution result in complex human genome architecture 1 Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA 2 Department of Pediatrics, Baylor College of Medicine, Houston, Texas 77030, USA 3 Texas Children's Hospital, Houston, Texas 77030, USA 4Present address: Department of Mental Retardation and Birth Defect Research, National Institute of Neuroscience, National Center of Neurology and Psychiatry, Tokyo 187-8502, Japan. 5Corresponding author. E-mail: jlupski/at/bcm.tmc.edu; fax: (713) 798-5073. Received May 3, 2004; Accepted August 14, 2004. This article has been cited by other articles in PMC.Abstract The human genome is particularly rich in low-copy repeats (LCRs) or segmental duplications (5%–10%), and this characteristic likely distinguishes us from lower mammals such as rodents. How and why the complex human genome architecture consisting of multiple LCRs has evolved remains an open question. Using molecular and computational analyses of human and primate genomic regions, we analyzed the structure and evolution of LCRs that resulted in complex architectural features of the human genome in proximal 17p. We found that multiple LCRs of different origins are situated adjacent to one another, whereas each LCR changed at different time points between >25 to 3–7 million years ago (Mya) during primate evolution. Evolutionary studies in primates suggested communication between the LCRs by gene conversion. The DNA transposable element MER1-Charlie3 and retroviral ERVL elements were identified at the breakpoint of the t(4;19) chromosome translocation in Gorilla gorilla, suggesting a potential role for transpositions in evolution of the primate genome. Thus, a series of consecutive segmental duplication events during primate evolution resulted in complex genome architecture in proximal 17p. Some of the more recent events led to the formation of novel genes that in human are expressed primarily in the brain. Our observations support the contention that serial segmental duplication events might have orchestrated primate evolution by the generation of novel fusion/fission genes as well as potentially by genomic inversions associated with decreased recombination rates facilitating gene divergence. Recent computational analyses of the available DNA sequencing data and fluorescence in situ hybridization (FISH) studies have estimated that low-copy repeats (LCRs) (paralogous segmental duplications, or duplicons), which are ≥95%–97% identical, occupy 5%–10% of the human genome (Bailey et al. 2001; Cheung et al. 2001; Eichler 2001; International Human Genome Sequencing Consortium 2001; Venter et al. 2001). In some regions, they constitute up to 23%–45% of the examined sequence (Hurles and Jobling 2003; Stankiewicz et al. 2003). Because of their repeat nature, LCRs have been a significant challenge to the Human Genome Project and many of the remaining gaps in euchromatic regions are associated with LCRs (Stankiewicz and Lupski 2002; Eichler et al. 2004). The genome architecture resulting from the size, orientation, and arrangement of LCRs was shown to be responsible for genomic instability (Lupski 1998; Emanuel and Shaikh 2001; Stankiewicz and Lupski 2002a). Serving as substrates for nonallelic (or “ectopic”) homologous recombination (NAHR), LCRs facilitate meiotic and potentially mitotic DNA rearrangements associated with several diseases that are referred to as genomic disorders. Genomic rearrangements can be responsible for Mendelian traits, contiguous gene syndromes, and whole-arm chromosome aberrations (Lupski 1998, 2003; Stankiewicz and Lupski 2002a). During the last decade, substantial molecular data have emerged documenting that LCR-mediated NAHR is a major mechanism for human disease (Emanuel and Shaikh 2001; Stankiewicz and Lupski 2002a; Shaw and Lupski 2004). Several studies have shown that LCRs arose recently, apparently during primate evolution (Stankiewicz and Lupski 2002b). Recent data suggest that LCR-associated genome architecture does not represent simple segmental duplications but rather reflects complex rearrangements that are potentially the end product of multiple events (van Geel et al. 2002). These serial segmental duplications can result in a complex shuffling of genomic sequences (Babcock et al. 2003; Lupski 2003). The gene- and LCR-rich human genomic region 17p11.2-p12 is rearranged in a variety of different constitutional, evolutionary, and cancer-associated structural chromosome aberrations, and thus appears to be an excellent model to investigate the role of genome architecture in DNA rearrangements (Pentao et al. 1992; Chance et al. 1994; Chen et al. 1997; Reiter et al. 1998; Potocki et al. 2000; Stankiewicz et al. 2001; Park et al. 2002; Bi et al. 2003; Barbouti et al. 2004). Recently, we documented that LCRs in proximal 17p constitute >23% of ~7.5 Mb of genome sequence (Fig. 1
Using parallel approaches consisting of FISH studies in several primate species and computational analyses of human genome sequence (molecular clock analysis), we investigated the complex structure and evolution of the genome architecture in an ~7.5 Mb region of proximal 17p. Our analyses support a model of serial segmental duplication events of LCRs during primate evolution. Results Structure and orientation of LCR17p repeats The proximal short arm of human chromosome 17 contains two sets of well-characterized LCRs that play a major role in recurrent disease-causing genomic rearrangements, namely proximal and distal CMT1A-REPs (associated with Charcot-Marie-Tooth type 1A disease, [CMT1A], and hereditary neuropathy with liability to pressure palsies, [HNPP]) and proximal, middle, and distal SMSREPs (associated with Smith-Magenis syndrome [SMS] and dup(17)(p11.2p11.2) syndrome). Recently, we provided evidence for the existence of yet another set of LCRs in 17p11.2-p12 termed LCR17pA, B, C, D, E, F, and G (Stankiewicz et al. 2003). We have analyzed genomic sequence spanning LCR17pA, LCR17pB, LCR17pC, and LCR17pD in 17p11.2-p12 (Fig. 1
Evolution of LCR17pB, LCR17pC, and LCR17pD copies from LCR17pA DNA sequence comparison excluding insertion/deletion (indels), or molecular clock analysis, among LCR17p copies showed 98.70% nucleotide identity between LCR17pA/B and LCR17pB and 98.57% between LCR17pA/D and LCR17pD, similar to the 98.15% (98.7% including indels; [Reiter et al. 1997; Inoue et al. 2001]) homology between CMT1A-REPs (Table 1). LCR17pC shares less sequence identity with LCR17pA/C (88.45%) and LCR17pD (88.03%), suggesting that LCR17pC may be older than LCR17pB, LCR17pD, and proximal CMT1A-REP (Table 1). Such molecular clock estimates, however, are based on the neutral theory for mutation rate in evolution, with a presumption that paralogous sequences evolve independently. Molecular clock analysis may be confounded by LCRs that are potentially interacting with each other genetically (i.e., by gene conversion) during evolution. Depending on the size, homology, distance, orientation, and frequency of continuous non-allelic misalignments between different LCR copies for millions of years, homologous recombination may have resulted in gene conversion and/or DNA sequence homogenization. As an independent assessment of the timing of segmental duplication events during primate evolution, we performed hybridization-based evolutionary studies using FISH on various primate cell lines. Hybridization of human BAC clone CTD-3157E16, derived from the LCR17pC and LCR17pD subunits of the LCR17pA copy, revealed the presence of at least two signals in cell lines analyzed from different human (Fig. 2A
FISH with LCR17pA/B-derived human genomic clone RP11–640I15 (Fig. 1
Complex DNA exchange occurred at the gorilla t(4;19) translocation Using the year 2000 human genome sequence from NCBI, we mapped the gorilla t(4;19) breakpoints by FISH to human BAC clone CITC-428H11 (HSA 5q13.3) and overlapping clones RP11–640I15 and CTD-3157E16 (HSA 17p12) (Stankiewicz et al. 2001). The current build 34 (July 2003) alignment of BAC clones in the human chromosome region 5q13-q14 revealed several changes when compared with the 2000 assembly; for example, BAC clone CITC-428H11 currently maps to chromosome 5q14.3. Interestingly, this chromosome region has been shown recently to be LCR-rich and evolutionarily unstable (Courseaux et al. 2003). Repeated FISH experiments showed that the BAC clone CITC-428H11 mapped only to the GGO 19 and did not span the breakpoint, suggesting mis-mapping or contamination in our previous breakpoint mapping experiment. Instead, two overlapping human BAC clones CTD-2327L5 and RP11–90A9 (5q14.1) were found to span the GGO 19 breakpoint (Fig. 4
To identify a gorilla BAC genomic clone that spans the evolutionary translocation breakpoint, we designed oligonucleotide probes from the human BAC clone CTD-2327L5 (HSA 5q14.1). These probes were used to screen the Gorilla gorilla gorilla BAC library CHORI-255 and identified six clones: 333K3, 363F1, 413A16, 419A7, 481B24, and 484N3. These gorilla genomic clones were subsequently used as FISH probes on human and gorilla lymphoblasts to confirm their chromosomal location. FISH and BAC-end sequencing of the clones showed that 413A16, 419A7, and 484N3 span the gorilla chromosome 19 breakpoint, with end sequences assigned to HSAs 5 (CTD-2327L5 and RP11–2015H6) and 17 (SMS-REP sequence). Our previous work has shown that the GGO 4 breakpoint localized within LCR17pA, near the junction of LCR17pD subunit (Fig. 1 Supporting this notion, the analysis of the DNA sequences of the gorilla BAC clone 413A16 revealed the presence of two >20 Kb DNA segments homologous to chromosome 17p13.2 (RP11–46I8) and 17q23.2 (RP11–3K24; RP11–178C3), and a 229-bp fragment of the 2710-bp DNA transposon MER1-Charlie3 with an adjacent 9 bp insertion GTGGCTTGG, precisely at the translocation breakpoint (Fig. 4 To confirm the predicted gorilla translocation junction fragment, human, chimpanzee, gorilla, and orangutan genomic DNAs were amplified using PCR with primers designed from the sequence of the gorilla genomic BAC clone CHORI-255–413A16 to flank the identified breakpoint. As predicted, the PCR product of expected size was obtained only in gorilla and not in the genomic DNA from other primate species (Fig. 4 Complex genomic events accompany the CMT1A-REP segmental duplication CMT1A-REP is an LCR with one copy located within LCR17pA. DNA sequence analysis of the CMT1A-REPs in humans reveals that the distal copy, which is part of the COX10 gene, is the progenitor copy (Murakami et al. 1997; Reiter et al. 1997). The proximal CMT1A-REP represents a duplicated copy of exon 6 of COX10 and surrounding intronic sequences and arose after the divergence of gorilla and chimpanzee, ~3–7 Mya (Kiyosawa and Chance 1996; Boerkoel et al. 1999). The CMT1A-REP insertional/duplicational event appears to have split an ancestral gene, AGIP (ancestral gene before the integration of proximal CMT1A-REP), into two genes: HREP (FAM18B) expressed in heart and skeletal muscle (Kennerson et al. 1998) and CDRT1 expressed in the pancreas and heart (Inoue et al. 2001). Our previous comparative analysis of human CMT1A-REP genome sequence with mouse ESTs suggested a potential genomic deletion event accompanied the insertion of proximal CMT1A-REP during divergence between gorilla and chimpanzee (Inoue et al. 2001). We obtained further evidence for this proposed deletion that accompanied the evolutionary genomic rearrangement. Because the LCR17pB segmental duplication is older than the proximal CMT1A-REP integration, we hypothesized that the LCR17pB might have retained a copy of the genomic fragment that was lost in the original LCR17pA/B. Using BLAST2 sequence comparison between human LCR17pA/B and LCR17pB paralogs, at the site of insertion of proximal CMT1A-REP in LCR17pA, we identified a 4960-bp DNA fragment, present in the LCR17pB copy but absent in the LCR17pA copy (Fig. 5
BLAST analysis identified a mouse BAC clone RP23–428I5 (AL645847) that is syntenic to the human LCR17pA copy. We narrowed the mouse chromosome 11 region syntenic to the insertional duplication site of the proximal CMT1A-REP to ~3.5 Kb (np. 41,500–45,034 in RP23–428I5). To determine whether the deletion of this ~5-Kb fragment was concurrent with the proximal CMT1A-REP segmental duplication and insertional event, we designed PCR primers by comparisons between human LCR17pA and LCR17pB sequences for the analysis of the gorilla genomic substrate sequences present prior to the segmental duplication of CMT1A-REP at the proximal CMT1A-REP integration site. Using overlapping LCR17pA/B-specific oligonucleotide probes, we identified several BAC clones from the gorilla CHORI-255 genomic library. To distinguish LCR17pA/B-versus LCR17pB-derived clones, we tested them by PCR with primers specific for each copy and selected LCR17pA/B-derived gorilla BAC clones 412M18, 565H18, and 475B12. PCR amplification using the LCR17pA/B-derived gorilla BAC clones as templates resulted in a product spanning the predicted fission fragments of the 4960-bp DNA segment in LCR17pA from gorilla. DNA sequence analysis revealed the junctions at the ends of the 4960-bp DNA segment deleted in gorilla before its divergence from chimpanzee (Fig. 5 A model of serial segmental duplications to evolve complex genome architecture Cumulative data from studies of different segmental duplications in proximal 17p suggest a potential model that most parsimoniously explains how the complex genomic architecture evolved. The two copies of CMT1A-REP and three SMS-REPs are absent in rodent species (Fig. 6A
By an insertional event, a progenitor SMS-REP must have arisen in an ancient chromosome. The insertion of the progenitor proximal SMS-REP split the LCR17pC and LCR17D copies (Fig. 6D Discussion From the analysis of different genomes it has been proposed that whole-genome duplications occurred during early evolution, whereas more recent single-gene duplications have enabled the creation of novel protein functions (Li 1997). Much of what we know about duplication events has been inferred from studies of the end products viewed in a single or limited number of species. These studies have primarily consisted of computational sequence analyses without other approaches to verify the inferences independently and experimentally. Molecular clock analysis and calculations based on sequence comparisons are confounded by gene conversion events occurring among repeated sequences. Therefore, a multifaceted approach was used to reconstruct the sequence of rearrangement events that result in complex genome architecture. We found that the complex genome architecture in proximal 17p changed through a series of consecutive segmental duplications during primate evolution. Both repetitive sequences and transposable elements were identified at the breakpoints of genome rearrangements, suggesting a potential role in their generation. Furthermore, the rearrangements resulted in the creation of novel genes at the junctions. DNA sequence analysis of LCRs created through segmental duplications suggests an ongoing gene conversion. Genomic rearrangements and primate evolution At the nucleotide sequence level, human and chimpanzee functionally important nonsynonymous sequence shows 99.4% identity (Wildman et al. 2003). It has been proposed that genomic rearrangements, rather than single nucleotide mutations/polymorphisms, through creation of novel fusion/fission genes may have been the major driving force for primate chromosome evolution (Inoue et al. 2001; Inoue and Lupski 2002; Samonte and Eichler 2002; Stankiewicz and Lupski 2002b). Early studies using comparative genomics among bacterial species revealed substantive evidence for genome rearrangements and indel events that accompany evolution (Versalovic et al. 1993; Weinstock 1994). Preliminary primate genome sequencing studies have estimated that the divergence between human and chimpanzee genomes may be as high as 5%, with the majority of differences resulting not from single-base-pair changes but from indel events (Britten 2002; Britten et al. 2003; Frazer et al. 2003). Interestingly, molecular characterization of some primate evolutionary genomic rearrangements showed the association of chromosome breakpoints with LCRs (Nickerson 2000; Stankiewicz et al. 2001; Eder et al. 2003; Locke et al. 2003; Goidts et al. 2004; Tsend-Ayush et al. 2004). Also, comparison between human and mouse genomes revealed enrichment of segmental duplications in regions of breaks of synteny, suggesting their involvement in evolutionary rearrangements (Probst et al. 1999; Bi et al. 2002; Armengol et al. 2003; Kent et al. 2003; Pennacchio 2003; Pevzner and Tesler 2003). Recently, however, Bailey et al. (2004) proposed that the association between small-scale segmental duplication and large-scale genomic rearrangements is not causative. Genetic recombination and segmental duplications In a classical chromosome evolution model, because of a partial sterility of the heterozygous individuals (underdominance), chromosomal changes act as a barrier to gene flow between species (Rieseberg 2001). Recently, Navarro and Barton (2003) and Rieseberg and Livingstone (2003) proposed a novel perspective of the role of chromosomal rearrangements in primate evolution. According to this model, evolutionary genomic inversions have led to a decreased recombination rate in the involved chromosome regions that, in turn, enabled faster divergence of genes contained in such non-colinear chromosomal segments. This model provides an explanation for continuous interbreeding between separating, but not yet geographically isolated lineages. Zhang et al. (2004), however, could not confirm this hypothesis for humans and chimpanzees. Rozen et al. (2003) and Hawley (2003) showed that intrachromosomal recombination activity (resulting in gene conversion events) involving large, complex, and nearly identical (99.96%) palindromic LCRs on human chromosome Y (Kuroda-Kawaguchi et al. 2001) is much more frequent than initially anticipated (Delbridge and Graves 1999). It has been thus suggested that to protect the important genetic information from evolutionary divergence, because of an inability to recombine with chromosome X, the Y chromosome contains 45% of its structure as LCR copies that serve as targets for gene conversion events. We propose that a similar “evolutionary pressure” mechanism might have caused the selection and retention of many autosomal LCRs. The genomic regions that are now LCR rich might have been recombination poor in earlier evolutionary time (when compared with the average genome rate). Similar to LCR duplications on the Y chromosome, the “necessity” for retaining the genetic information might have placed evolutionary pressure on such regions to generate genomic “back-ups” in the form of segmental duplications. Such LCRs have then stimulated unequal crossing-over events (NAHRs). When the duplicated DNA fragment was inserted inversely with respect to the progenitor copy, the NAHR between these copies might have resulted in chromosome inversion (Lupski 1998). Such inverted segments, in turn, served as a nest for increased evolutionary gene divergence (Navarro and Barton 2003). Thus, in addition to segmental duplications generating novel fusion/fission genes, the genomic regions of decreased recombination might have been another source of gene evolution. Interestingly, Inoue et al. (2001) and Bi et al. (2002) reported decreased recombination over the ~1.4-Mb CMT1A and ~4-Mb SMS genomic regions in proximal 17p, potentially representing the evolutionary remnant of low evolutionary recombination activity. It will be interesting to analyze the recombination activity in other LCR-rich genomic regions. It is remarkable that despite different sizes, locations, orientations, and most importantly times of origin, proximal and distal CMT1A-REPs; proximal, middle, and distal SMS-REPs; LCR17pA/B and LCR17pB; and LCR17pA/D and LCR17pD have retained very similar (~98.1–98.7%) nucleotide identity. Our studies show that in contrast to LCR17pA/D (~115 Kb)- and LCR17pD (~118 Kb)-mediated NAHR, resulting in recurrent SMS chromosome deletions (Shaw et al. 2004b), no deletions with breakpoint within similarly sized LCR17pA/C (~79 Kb) and LCR17pC (~91 Kb) have been found (Stankiewicz et al. 2003; Shaw et al. 2004a). We propose that during primate evolution, the DNA sequence homology between LCR17pA/C and LCR17pC copies must have dropped below a minimal misalignment/recombination stimulating threshold (<95%?) that in turn resulted in a lack of LCR/NAHR gene conversion events and DNA homogenization and a subsequent steady decrease of nucleotide identity to the current ~88% identity (Table 1). DNA rearrangements generate new genes To investigate the evolutionary mechanism of the proximal CMT1A-REP duplication, we took advantage of the fact that this event occurred after the divergence of gorilla from chimpanzee, ~3–7 Mya (Fig. 5 Interestingly, at two breakpoints of rearrangements that occurred during primate evolution, both the proximal CMT1A-REP insertion and the gorilla t(4;19) breakpoint, we found evidence for the sequences being expressed in brain, IMAGE:5170212 (Fig. 5 Unfortunately, because of the lack of multi-tissue RNA samples from the nonhuman primates, we could not perform Northern hybridizations to further study the effects of this evolutionary genomic rearrangement. Computational analyses of human genome sequence suggest the generation of fusion/fission transcripts accompanying segmental duplication in many genomic regions (Courseaux and Nahon 2001; Edelmann et al. 2001; Eichler 2001; Bailey et al. 2002). These findings illustrate the potential role of DNA rearrangements during mammalian genome evolution in the generation of new genes (Inoue et al. 2001; Inoue and Lupski 2002). Transposable elements and large genomic rearrangements Gray (2000) described an “alternative DNA transposition” model for the formation of reciprocal chromosome translocations; however, transposon-mediated gross genomic rearrangements have been reported only in Drosophila and maize (Lim and Simmons 1994; Caceres et al. 1999; Zhang and Peterson 1999). Among mammals, a recombination hotspot within a retrotransposon was identified in mice (Edelmann et al. 1989), and recently, the breakpoint of the evolutionary pericentric inversion of chimpanzee chromosome 19 was found in a small single-copy segment flanked by a medium reiteration-frequency repeat MER117 and a MLT1K element, a member of the mammalian retrotransposon-like superfamily (Kehrer-Sawatzki et al. 2002). We found that the formation of the gorilla t(4;19) was possibly mediated by the DNA transposon Charlie3 (MER1 group) on GGO4 and LTR retrotransposon ERVL-MER74A on GGO19. The vast majority of DNA sequences homologous to the MER1-Charlie3 fragment map to HSAs 5 and 17, suggesting potential colocalization of these chromosomes in cell nuclei territories. Interestingly, Tanabe et al. (2002) has shown an evolutionary conservation of chromosome territories in higher primate cell nuclei; however, no studies about HSAs 5 and 17 or their syntenic primate chromosomes have been reported to date. Models of chromosome evolution Recently, Nahon (2003) proposed a two-step process for the origination of “human-specific” genes during primate evolution. First, between the divergence of Platyrrhini and Catarrhini, 25–30 Mya, transposition events mediated by RNA and DNA (gene nurseries, genic tinkering) resulted in gene duplications. In a second step, at the time of divergence of Hominidae, 5–10 Mya, intrachromosomal duplication events were occurring. Our data imply, however, a more complex mechanistic model. Intriguingly, computer analyses of genomic sequence data show that despite different times of formation, most of the analyzed large LCRs in proximal 17p are directly adjacent to one another. Many of the analyzed junction fragments revealed the presence of the same DNA segment (red-yellow junction on Fig. 4A Serial segmental duplication model for the origin of complex genomic architecture By analyzing the genome architecture of the primate genomic regions syntenic to the human proximal chromosome 17p, we derived a multi-step model for reconstruction of the evolution of the ~7.5-Mb LCR-rich genome architecture (Fig. 6 In summary, our molecular and computational analyses of the ~7.5-Mb proximal 17p region revealed that complex genome architecture has evolved in a multi-step manner involving serial segmental duplications. Segmental duplications appear to influence genome evolution by a number of different mechanisms, including the creation of novel fusion/fission genes at the LCR insertion site, as well as potentially enabling an increased mutation rate because of LCR-mediated genomic inversion associated with reduced recombination. In essence, such genome architecture enables both shuffling of the genome and generating new gene functions providing ample diversity for selection to act on during evolution. Methods DNA sequence analysis The analyzed clones were identified from the existing physical maps of this region (Inoue at al. 2001; Bi et al. 2002; http://www.ncbi.nlm.nih.gov/; http://genome.ucsc.edu/). Sequences were downloaded from the NCBI web site (http://www.ncbi.nlm.nih.gov/). Regional assemblies for known LCR17ps and searches for additional LCRs were constructed following NCBI BLAST searches against the high-throughput and the nonredundant sequence database (http://www.ncbi.nlm.nih.gov/blast/) and were assembled using the Sequencher software (Gene Codes) and NCBI BLAST2. Molecular clock analysis was performed as described elsewhere (Li 1997). In short, we determined the sequence homology between each unit of LCRs using BLAST2 and RepeatMasker. The sequence identity between unique segments and those between repetitive segments between paralogs were not significantly different, thus the use of the RepatMasker did not affect the overall sequence homology calculation. We eliminated small indels from the calculation and took only base pair changes into account. Cell lines The human and primate lymphoblastoid cell lines and fibroblasts were grown and harvested as described previously (Park et al. 2002). Fluorescence in situ hybridization The selected BAC and PAC clones were purchased from the BACPAC Resource Center. DNA was isolated from liquid cultures using PSIΨClone BAC DNA Kit (Princeton Separations, Inc.). FISH was performed on metaphase and interphase cells of peripheral blood lymphocytes, Epstein-Barr virus-transformed peripheral blood lymphoblasts, and skin fibroblasts as described (Shaffer et al. 1997). Chromosomal duplication was distinguished from replication with a control probe consisting of a unique sequence BAC clone from the same chromosome region labeled with different fluorochrome. Screening the gorilla BAC library We screened the gorilla BAC library CHORI255 (BACPAC Resource Center, The Children's Hospital Oakland Research Institute, Oakland, CA) using “overgo” oligonucleotides from the LCR17pA (subunit B)-derived BAC clone RP11–640I15: F:A CAATTTCTTCAATGGGAACTGTATGA, R:GCCATTGGCTTTTAT CACCTTCATACAG; F:CCATGGCCCAGTTGGTAAAGGTA, R:AGCCCACTGCCCACTTACCTTTA; F:GCTCTCTAAT GAAGAGAGAACATGGATGGTC, R:ATCCAATCACTTTCAAGAC CATCC; CTD-3157E16 (subunit D): F:AGTGAACTTGTAACA TACTAACCCTCC, R:TATGGTAATGTAAAAAGGCAGGAGGGTT; F:AGATAACAGGAGCAGTTTTAGAACAAA, R:TTTAGATC CTTCATTATAGTTTTGTTCT; F:GGATCCTTTTTCTTTCTCCAAC TGGGAGATT, R:TTTCCAAACCAGACACAATCTCCC; and CTD 2327L5: F: TATCACTGTTTGTTTCAGGGGGATG, R: GCCAGCCA ATTATGGCCCATCCCCC; F: TTTAAAAGTCAGGGCACACTTCT GCT, R: GAACCATGCAGTCTCCCAAGCAGAAG. Hybridizations were performed according to the methods described previously (www.chori.org/bacpac; Cai et al. 1998). DNA and whole BAC clone sequencing The PCR products were sequenced in the MRRC DNA sequencing core at Baylor College of Medicine. The entire gorilla BAC clone CHORI-255–413A16/AC146897 was sequenced at the Whitehead Institute for Biomedical Research/MIT, Center for Genome Research, Cambridge, MA. PCR PCR was performed with extracted BAC DNA using each of the following primer sets: Distal junction F: TCCCTATTCTCCTTG CAAACTC, R: TCATGACTGGGATGTCCCTT; proximal junction F: CTGCAGTTACCATCCCACCT, R: GGTGTCTGTGTGGAG GGTCT. PCR reactions were performed as described (Shaw et al. 2002). For long-range PCR identifying the LCR17pA-derived clones, the following primers were used: F: GTCAAAATCTTC CCTATTCTCCTTGCAAACTC, R: GACTGTGCTTTATATATGCG TAGACTCAGGGT. The products were visualized on 2% agarose gels. To confirm the computational results, the DNA junction fragment of the gorilla translocation breakpoint was amplified (F: TAGATGCCTGATGTGGCTTG, R: TGGATGCCTCTATGTGC TGT) and the PCR product was sequenced. Brain expression of the identified human cDNA clone IMAGE:5170212 (GenBank accession no. BI828401) was validated using PCR with primers: F: TGGCAAGATCCGAATTTACA, R: GCGCTAACAGTCAGGAGGAA on a human multi-tissue cDNA library panel (BD Biosciences). Acknowledgments We appreciate the critical reviews of Drs. B. Morrow and S. Rosenberg. We thank Dr. B. Birren and the Whitehead Institute for genomic sequence of the gorilla BAC clone 413A16. This study was supported in part by grants from the Muscular Dystrophy Association, the National Institute of Neurological Disorders and Stroke (R01 NS27042), and the National Institute of Child Health and Human Development (PO1 HD39420). Notes Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2746604. References
WEB SITE REFERENCES
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Genome Res. 2001 Jun; 11(6):1005-17.
[Genome Res. 2001]Nature. 2001 Feb 15; 409(6822):953-8.
[Nature. 2001]Trends Genet. 2001 Nov; 17(11):661-9.
[Trends Genet. 2001]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Science. 2001 Feb 16; 291(5507):1304-51.
[Science. 2001]Trends Genet. 1998 Oct; 14(10):417-22.
[Trends Genet. 1998]Nat Rev Genet. 2001 Oct; 2(10):791-800.
[Nat Rev Genet. 2001]Trends Genet. 2002 Feb; 18(2):74-82.
[Trends Genet. 2002]Am J Hum Genet. 2003 Feb; 72(2):246-52.
[Am J Hum Genet. 2003]Hum Mol Genet. 2004 Apr 1; 13 Spec No 1():R57-64.
[Hum Mol Genet. 2004]Curr Opin Genet Dev. 2002 Jun; 12(3):312-9.
[Curr Opin Genet Dev. 2002]Am J Hum Genet. 2002 Jan; 70(1):269-78.
[Am J Hum Genet. 2002]Genome Res. 2003 Dec; 13(12):2519-32.
[Genome Res. 2003]Am J Hum Genet. 2003 Feb; 72(2):246-52.
[Am J Hum Genet. 2003]Nat Genet. 1992 Dec; 2(4):292-300.
[Nat Genet. 1992]Hum Mol Genet. 1994 Feb; 3(2):223-8.
[Hum Mol Genet. 1994]Nat Genet. 1997 Oct; 17(2):154-63.
[Nat Genet. 1997]Am J Hum Genet. 1998 May; 62(5):1023-33.
[Am J Hum Genet. 1998]Nat Genet. 2000 Jan; 24(1):84-7.
[Nat Genet. 2000]Am J Hum Genet. 2003 May; 72(5):1101-16.
[Am J Hum Genet. 2003]Am J Hum Genet. 2004 Jul; 75(1):75-81.
[Am J Hum Genet. 2004]Nat Genet. 1997 Oct; 17(2):154-63.
[Nat Genet. 1997]Genome Res. 2002 May; 12(5):729-38.
[Genome Res. 2002]Hum Mol Genet. 1997 Sep; 6(9):1595-603.
[Hum Mol Genet. 1997]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Genome Res. 2002 May; 12(5):729-38.
[Genome Res. 2002]Nature. 1998 Apr 30; 392(6679):917-20.
[Nature. 1998]Genome Res. 2001 Jul; 11(7):1205-10.
[Genome Res. 2001]Genome Res. 2003 Mar; 13(3):369-81.
[Genome Res. 2003]Genome Res. 2001 Jul; 11(7):1205-10.
[Genome Res. 2001]Am J Hum Genet. 2003 Oct; 73(4):823-34.
[Am J Hum Genet. 2003]Genomics. 1997 May 15; 42(1):161-4.
[Genomics. 1997]Hum Mol Genet. 1997 Sep; 6(9):1595-603.
[Hum Mol Genet. 1997]Hum Mol Genet. 1996 Jun; 5(6):745-53.
[Hum Mol Genet. 1996]Ann N Y Acad Sci. 1999 Sep 14; 883():22-35.
[Ann N Y Acad Sci. 1999]Genomics. 1998 Oct 1; 53(1):110-2.
[Genomics. 1998]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Nat Genet. 1992 Dec; 2(4):292-300.
[Nat Genet. 1992]Nat Genet. 1997 Oct; 17(2):154-63.
[Nat Genet. 1997]Genomics. 1999 Feb 1; 55(3):348-52.
[Genomics. 1999]Genome Res. 2002 May; 12(5):713-28.
[Genome Res. 2002]Genome Res. 2002 May; 12(5):729-38.
[Genome Res. 2002]Proc Natl Acad Sci U S A. 2003 Jun 10; 100(12):7181-8.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Annu Rev Genomics Hum Genet. 2002; 3():199-242.
[Annu Rev Genomics Hum Genet. 2002]Nat Rev Genet. 2002 Jan; 3(1):65-72.
[Nat Rev Genet. 2002]Curr Opin Genet Dev. 2002 Jun; 12(3):312-9.
[Curr Opin Genet Dev. 2002]Genome Res. 2001 Jul; 11(7):1205-10.
[Genome Res. 2001]Mol Biol Evol. 2003 Sep; 20(9):1506-12.
[Mol Biol Evol. 2003]Genome Biol. 2003; 4(8):R50.
[Genome Biol. 2003]Hum Genet. 2004 Jul; 115(2):116-22.
[Hum Genet. 2004]Genomics. 2004 Feb; 83(2):193-202.
[Genomics. 2004]Trends Ecol Evol. 2001 Jul 1; 16(7):351-358.
[Trends Ecol Evol. 2001]Science. 2003 Apr 11; 300(5617):321-4.
[Science. 2003]Science. 2003 Apr 11; 300(5617):267-8.
[Science. 2003]Genome Res. 2004 May; 14(5):845-51.
[Genome Res. 2004]Nature. 2003 Jun 19; 423(6942):873-6.
[Nature. 2003]Cell. 2003 Jun 27; 113(7):825-8.
[Cell. 2003]Nat Genet. 2001 Nov; 29(3):279-86.
[Nat Genet. 2001]Rev Reprod. 1999 May; 4(2):101-9.
[Rev Reprod. 1999]Trends Genet. 1998 Oct; 14(10):417-22.
[Trends Genet. 1998]Science. 2003 Apr 11; 300(5617):321-4.
[Science. 2003]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Genome Res. 2002 May; 12(5):713-28.
[Genome Res. 2002]Am J Hum Genet. 2004 Jul; 75(1):75-81.
[Am J Hum Genet. 2004]Am J Hum Genet. 2003 May; 72(5):1101-16.
[Am J Hum Genet. 2003]J Med Genet. 2004 Feb; 41(2):113-9.
[J Med Genet. 2004]Hum Mol Genet. 1996 Jun; 5(6):745-53.
[Hum Mol Genet. 1996]Hum Mol Genet. 1997 Sep; 6(9):1595-603.
[Hum Mol Genet. 1997]Ann N Y Acad Sci. 1999 Sep 14; 883():22-35.
[Ann N Y Acad Sci. 1999]Mol Biol Evol. 1999 Aug; 16(8):1019-26.
[Mol Biol Evol. 1999]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Genome Res. 2004 Aug; 14(8):1462-73.
[Genome Res. 2004]Science. 2001 Feb 16; 291(5507):1293-7.
[Science. 2001]Genome Res. 2001 Feb; 11(2):208-17.
[Genome Res. 2001]Trends Genet. 2001 Nov; 17(11):661-9.
[Trends Genet. 2001]Am J Hum Genet. 2002 Jan; 70(1):83-100.
[Am J Hum Genet. 2002]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Trends Genet. 2000 Oct; 16(10):461-8.
[Trends Genet. 2000]Bioessays. 1994 Apr; 16(4):269-75.
[Bioessays. 1994]Science. 1999 Jul 16; 285(5426):415-8.
[Science. 1999]Genetics. 1999 Nov; 153(3):1403-10.
[Genetics. 1999]Cell. 1989 Jun 16; 57(6):937-46.
[Cell. 1989]Proc Natl Acad Sci U S A. 2002 Apr 2; 99(7):4424-9.
[Proc Natl Acad Sci U S A. 2002]Genetica. 2003 Jul; 118(2-3):193-208.
[Genetica. 2003]Am J Hum Genet. 2003 Feb; 72(2):246-52.
[Am J Hum Genet. 2003]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]Genome Res. 2002 May; 12(5):713-28.
[Genome Res. 2002]Genome Res. 2002 May; 12(5):729-38.
[Genome Res. 2002]Am J Med Genet. 1997 Mar 31; 69(3):325-31.
[Am J Med Genet. 1997]Genomics. 1998 Dec 15; 54(3):387-97.
[Genomics. 1998]Am J Hum Genet. 2002 Nov; 71(5):1072-81.
[Am J Hum Genet. 2002]Am J Hum Genet. 2003 May; 72(5):1101-16.
[Am J Hum Genet. 2003]Genome Res. 2002 May; 12(5):729-38.
[Genome Res. 2002]Hum Mol Genet. 1996 Jun; 5(6):745-53.
[Hum Mol Genet. 1996]Hum Mol Genet. 1997 Sep; 6(9):1595-603.
[Hum Mol Genet. 1997]Genome Res. 2001 Jun; 11(6):1018-33.
[Genome Res. 2001]