![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2008, Cold Spring Harbor Laboratory Press Characterization of the bovine pseudoautosomal boundary: Documenting the evolutionary history of mammalian sex chromosomes Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000-Liège, Belgium 1Corresponding author.E-mail michel.georges/at/ulg.ac.be; fax 32-4-366.41.98. Received June 25, 2008; Accepted September 3, 2008. Abstract Here, we report the sequence characterization of the bovine pseudoautosomal boundary (PAB) and its neighborhood. We demonstrate that it maps to the 5′ end of the GPR143 gene, which has concomitantly lost upstream noncoding exons on the Y chromosome. We show that the bovine PAB was created ~20.7 million years ago by illegitimate intrachromatid recombination between inverted, ruminant-specific Bov-tA repeats. Accordingly, we demonstrate that cattle share their PAB with all other examined ruminants including sheep, but not with cetaceans or more distantly related mammals. We provide evidence that, since its creation, the ancestral ruminant PAB has been displaced by attrition, which occurs at variable rates in different species, and that it is capable of retreat by attrition erasure. We have estimated the ratio of male to female mutation rates in the Bovidae family as ~1.7, and we provide evidence that the mutation rate is higher in the recombining pseudoautosomal region than in the adjacent, nonrecombining gonosome-specific sequences. Maleness in placental mammals and marsupials is determined by the SRY gene located on the Y chromosome. This major sex determinant arose ~166 million years ago (Mya) on an ancestral autosome as an allele of the SOX3 gene (Veyrunes et al. 2008). As is commonly observed for chromosomes carrying sex-determining genes (Ohno 1967), the Y has since undergone progressive degeneration, being reduced in present-day man to a mere 25 Mb of euchromatin harboring no more than 27 distinct protein-coding genes or gene families, appended with an approximately equal amount of dispensable heterochromatin (Skaletsky et al. 2003). These numbers are to be compared with the ~155 Mb and 1100 genes of its ancestral partner, the X chromosome (Ross et al. 2005). The decay of the Y is thought to result from the successive selection of male-beneficial/female-deleterious alleles embedded in haplotypes that lost the ability to recombine with the X and are hence confined to males (Charlesworth 1991). Absence of recombination causes rapid degeneration by mutation, deletion, and transposon invasion accumulating as a result of a higher mutation rate in the male versus the female germline (due to the larger number of cell divisions required to produce male vs. female gametes), inefficient repair (e.g., Muller’s ratchet), and inefficient selection (e.g., shielding of deleterious recessives and Hill–Robertson interference) (e.g., Charlesworth et al. 2005; Bachtrog 2006; Graves 2006). The most commonly invoked recombination-blocking mechanism is chromosomal inversion. The observation of a stepwise increase in sequence similarity between genes ordered on the human X with their gametologs on the Y (“evolutionary strata”) suggests that five such recombination-blocking inversions have occurred in the human lineage (Lahn and Page 1999; Ross et al. 2005). These have isolated an increasing proportion of the Y from its X partner, progressively reducing the region of X–Y homology to the ~2.7 Mb pseudoautosomal region 1 (PAR1). The five inversions in the human lineage were initially dated to 240–320 Mya, 130–170 Mya, 80–130 Mya, 38–44 Mya, and 29–32 Mya, respectively, yet recent reexamination of the age of the therian sex chromosomes (Veyrunes et al. 2008) forces reevaluation of these estimates. Loss of genes from the Y causes male hemizygosity and thus a different gonosome-to-autosome balance in the two sexes. This is thought to drive progression of dosage compensation involving (in mammals) doubling of expression levels from the X (Nguyen and Disteche 2006) and compensatory XIST-dependent inactivation of one X chromosome in females (Lyon 1961; Heard and Disteche 2006). Notably, while virtually all genes located in the older strata undergo X inactivation, their proportion decreases in the younger layers (Carrel and Willard 2005). Concomitantly, the enrichment in L1 interspersed repeats, which may operate as way stations spreading the inactivation process (Lyon 1998; Carrel et al. 2006), increases with stratum age (Ross et al. 2005). The generalization of dosage compensation across most of the X chromosome is thought to underlie its “frozen” gene content in mammals (Ohno 1967). The human and dog X chromosome sequences, for instance, are essentially colinear, while the human and mouse X chromosomes are nearly perfectly syntenic despite multiple intrachromosomal rearrangements (Ross et al. 2005). Figure 1A
Despite the largely frozen gene content of the X, the evolution of mammalian sex chromosomes has been punctuated by interchromosomal exchanges. An autosome to proto-gonosome translocation occurring after placental mammals diverged from marsupials has increased the size of the eutherian neo-gonosome by addition of the “X added region” (XAR) (Graves 2006). Autosome to Y transposition has augmented the content of the Y chromosome in male-beneficial genes, including retrotransposition of CDY before the divergence of marsupials and eutherians (Lahn and Page 1999; Skaletsky et al. 2003), transposition of DAZ during primate evolution (Saxena et al. 1996), and transposition of FLJ36031 prior to carnivore radiation and TETY1 following the divergence of cat and dog lineages (Murphy et al. 2006). Moreover, the human Y euchromosome has acquired an X transposed region (XTR) after its divergence from chimpanzees (Skaletsky et al. 2003). In addition, X-linked genes have generated pseudogenes by retrotransposition to autosomes, presumably to compensate for their silencing during male meiotic sex chromosome inactivation (MSCI) (e.g., Potrzebowski et al. 2008). Pseudoautosomal regions (PARs) Despite their growing divergence, the mammalian X and Y maintain a short region of homology, allowing pairing and recombination in males that is required for faithful segregation of the sex chromosomes. Obligatory crossing-over in the PAR in males erases sex linkage for the more distal markers, justifying the “pseudoautosomal” designation. The exceptionally high centimorgan to megabase ratio in the PAR (~20-fold higher than the genome average in humans [Brown 1988; Petit et al. 1988; Lien et al. 2000]) is thought to account for its higher GC content, as a result of recombination-associated gene conversion biased toward GC. Notably, a gradual increase in GC content is observed on the human X when moving from old to young strata and finally to the present-day PAR (e.g., Ross et al. 2005). Enhanced recombination is also thought to account for the accelerated rate of evolution noted for genes in the PAR (particularly in rodents and to a lesser extent in primates), as recombination is accompanied by DNA repair relying on low-fidelity DNA polymerases (Perry and Ashworth 1999; Filatov and Gerrard 2003; Galtier 2004; Yi et al. 2004). As stated above, the human Xp PAR1 measures ~2.7 Mb and harbors 24 genes. In other mammals studied, the PAR is thought to be larger as it encompasses genes that are pseudoautosomal in human (notably SHOX, IL3RA, CSF2RA, and SLC25A6 [ANT3]), plus genes that have become X-specific in the human lineage (notably PRKX and STS mapping to human stratum 4). This assumption applies to lemurs (Gläser et al. 1999), sheep (Toder et al. 1997), cattle (Moore et al. 2001), dog (Toder et al. 1997), and cat (Murphy et al. 2006). The PAR of Mus musculus domesticus includes Sts, but none of the genes that are pseudoautosomal in human. This is apparently due to the loss of ~9 Mb from its distal end, reducing the size of the PAR to a mere ~700 kb (Perry et al. 2001; Ross et al. 2005). Taken together, these results suggest that the human pseudoautosomal boundary (PAB) has advanced further in the ancestral eutherian PAR when compared with most other mammals (Fig. 1B Extant and ancestral pseudoautosomal boundaries (PABs) To the best of our knowledge, extant PABs have only been characterized only at the sequence level for humans, Great Apes, Old World Monkeys and the domestic mouse. In Catarrhini, the PAB maps within the gene coding for the XG blood group antigen, also called PBDX (pseudoautosomal boundary divided on the X) (Ellis et al. 1990, 1994). As a result, XG is disrupted on the Y, missing nine exons on the 3′ side. This is compatible either with a pericentric inversion (Ellis et al. 1994) or with the intrachromosomal transposition of a chromosome fragment including SRY (Gläser et al. 1999). Since its creation, the PAB of Catarrhini has shifted ~240 bp into the PAR by “attrition,” accounting for the fact that the present-day PAB is flanked by a 240-bp segment of reduced homology (~77%) on its proximal side. It is thought that an Alu element has subsequently been inserted at the exact location of the PAB in the common ancestor of humans and Great Apes, without perturbing its position, and separating the pseudoautosomal “Alu-distal region” from the sex-specific “Alu-proximal region” (Ellis et al. 1990). The PAB of M. musculus domesticus is located in the third intron of the Mid1 (also known as Fxy) gene and truncates the 5′ end of the Y copy. The pseudoautosomal 3′ end of the gene starts with a variable number of tandem (intron 3–exon 4)n copies (Palmer et al. 1997). The history of the PAB in rodents remains somewhat confusing. Mid1 is X-specific in M. spretus and rat, indicating that the likely position of the ancestral rodent PAB is distal from Mid1 (Perry and Ashworth 1999). As the M. musculus domesticus PAB coincides with Mid1, this means either that the PAB moved backward to adopt a more proximal position in the domesticus lineage, or that Mid1 was translocated to a more distal position, as proposed by Galtier (2004). The completion of the sequences of the murine X and Y chromosomes should clarify this issue. The feline PAB has not been defined at the sequence level but has been tentatively positioned between SHROOM2 and WWC3 based on an abrupt drop in retention frequency in radiation hybrids obtained from male cells (Murphy et al. 2007). Ancestral PABs in the human lineage have been tentatively mapped to gene intervals corresponding with abrupt changes in Ks between gametologs. Hence, the boundaries between strata 1–2, 2–3, 3–4, and 4–5 have been positioned in the CXORF39–ZXDA, RGN–PHF16, WWC3–GPR143 (=OA1), and NLGN4X–AA971220 intervals, respectively (Skaletsky et al. 2003; Carrel and Willard 2005; Ross et al. 2005). Note that, especially for strata 3 and 4, the boundary is blurry, with some confidence intervals of Ks values being nonoverlapping within strata, while overlapping between strata (Skaletsky et al. 2003; Ross et al. 2005). As an example, while TBL1X, GPR143 (OA1), SHROOM2 (APXL), and AMELX map in that order on the human X, the Ks for TBL1X and SHROOM2 matches that of stratum 3 best, while that of GPR143 and AMELX places these in stratum 4. However, the investigators considered that it was premature to conclude that suppression of X–Y crossing over evolved in more than five steps, as alternative explanations including local changes in gene order and/or gene conversion might account for these findings (Skaltesky et al. 2003). Intriguingly, the observation of an abrupt increase in GC content and decrease in divergence between gametologs when moving from the 5′ to the 3′ end of the AMELX/Y genes in six mammalian species, plus the fact that in phylogenetic analyses orthologs cluster at the 5′ end of the gene while gametologs cluster at the 3′ end of the gene, suggest that the AMEL locus may span the ancestral PAB separating human strata 3 and 4 (Iwase et al. 2003; Marais and Galtier 2003). As the AMELX/Y genes are not interrupted, this suggests that recombination-blocking mechanisms other than chromosomal rearrangements may be involved in isolating the X and Y chromosomes. In this work, we report the identification and sequence characterization of the PAB and its neighborhood in ruminants. Results Identifying and sequencing Y- and X-specific BACs spanning the bovine PAB To identify Y-specific bacterial artificial chromosomes (BACs) spanning the bovine PAB, we initiated a bidirectional chromosome walk starting from AMELY, the Y-specific locus mapping closest to the PAB (Liu et al. 2002). A male BAC library (Warren et al. 2000) was screened with AMELY probes. A BAC contig (no. 5335) containing PCR-confirmed positive clones was retrieved from the BAC-based fingerprint map of the bovine genome (http://www.bcgsc.ca/platform/mapping/bovine) using iCE (Fjell et al. 2003). New probes were designed from the end sequences of the outer BACs of this contig. PCR on male and female DNA was used to discriminate between pseudoautosomal or Y-specific probes before a new library screening was carried out. Several rounds of hybridization were carried out until a probe designed on a Y-specific contig (no. 9351) turned out to be pseudoautosomal. This implied that contig 9351 spanned the PAB. BAC end-derived probes covering the entire contig length were used to refine the position of the PAB within the contig by PCR. Two BAC clones were hence shown to lie across the PAB on the Y chromosome: E0012F01 and H0106G14 (Fig. 2A
Alignment of the E0012F01 sequence with the BTAX sequences available in the public domain (Btau_3.1), indicated that BAC E0383I16 must span the PAB on the X chromosome. However, only 6 kb of contiguous X-specific sequences bordering the PAB were reported at the time. To better characterize the X-specific PAB neighborhood, we completed the corresponding sequence as described in the Supplemental materials. This led to a contiguous 156,628-kb long sequence, including 129,705 kb of X-specific sequence proximal to the PAB. To obtain additional Y-specific sequences, we sequenced BACs E0232B11 and E0064F17, as well as a 12-kb bridging fragment, to yield a total of 425,809 kb of contiguous finished sequence adjacent to the PAB (Fig. 2A To confirm the Y-specific and pseudoautosomal origin of the identified BACs, we performed fluorescence in situ hybridization (FISH) on male and female metaphase chromosomes using H0202L11 and E0232B11 as probes. As expected for a pseudoautosomal sequence, H0202L11 labeled the extremity of the two Xq arms in the female, while hybridizing to Xq and distal Yp in the male. E0232B11, on the other hand, hybridized exclusively to distal Yp, in the immediate vicinity of the H0202L11 signal, as expected for a Y-specific probe adjacent to the PAB (Fig. 2B Pinpointing the bovine PAB and annotating genes in its neighborhood The obtained sequences were annotated as follows: (1) Genes were predicted by BLASTing the masked BAC sequences against human cDNA at Ensembl (http://www.ensembl.org) and bovine EST at NCBI (http://www.ncbi.nlm.nih.gov), (2) the moving average [G+C] content was determined using a 200-bp sliding window, (3) CpG islands were identified following Gardiner-Garden and Frommer (1987), and (4) repetitive elements were identified using Repeat Masker (A.F.A Smit and P. Green, http://repeatmasker.genome.washington.edu). The resulting genomic landscape is shown in Figure 3
Alignment of the PAB-spanning E0012F01 (Y chromosome) and H0025A18 (X chromosome) sequences identified segments of near perfect homology (99.97%) corresponding to the PAR, diverging respectively into Y- and X-specific sequences, hence defining the bovine PAB. Detailed examination of the gonosome-specific sequences adjoining the PAB revealed a 413-bp segment of reduced homology (86.20%) that separates the PAR sequences from the clearly nonhomologous X- and Y-specific sequences. This segment of reduced homology is reminiscent of the Alu-proximal region of the human PAB, which is supposed to reflect progressive displacement of the PAB by attrition (Ellis et al. 1990, 1994). The boundary between the segment of reduced homology and the gonosome-specific sequences coincides with the tRNA portion of a Bov-tA1 SINE element on the X and a closely related Bov-tA2 element on the Y (Fig. 4
According to the Ensembl annotation of the X chromosome, the bovine PAB is located just upstream of the GPR143 gene. This places the bovine PAB in the intergenic region separating SHROOM2 and GPR143. However, a detailed examination of bovine EST sequences (e.g., DV913014 and EH378090) identified a putative, noncoding, upstream exon of GPR143 lying across the PAB on the X chromosome. We performed 5′ RACE experiments (Fig. 3 We identified four genes within the available bovine Y-specific sequences, in the order USP9Y–OFD1Y–AMELY–EIF1AY–PAB (Fig. 3 Base-pair composition and repeat content of the bovine PAR and sex chromosomes Strikingly, the bovine PAB is flanked, on both the X and Y chromosomes, by ~1- to 5-kb segments of very high G+C content (>70%) that correspond to unusually long CpG islands. Moreover, it is flanked on the Y chromosome by an unusual, ~17-kb long segment, composed primarily of MaLRs (Mammalian apparent LTR-retrotransposons) elements. Neither of these features is shared by the human PAB. Further examination of the BAC sequences points toward a high G+C and CpG island content for the PAR, intermediate values for the X-specific sequences, and the lowest values for the Y-specific sequences (Fig. 3
From this, it appears that the G+C and CpG dinucleotide content is well correlated with recombinational activity, being highest for the PAR, followed by the autosomes, X-specific, and Y-specific sequences. It is higher for the human than for the bovine PAR, which is compatible with the smaller size and hence higher recombination rate per base pair in the human PAR. It is worthwhile noting, however, that when compared with the ends of several autosomes, the rise in G+C and CpG content in the PAR is not unusual in magnitude (Supplemental Fig. 2). In humans, the density of CpG islands is highest on the autosomes, followed by the X chromosome, Y chromosome, and PAR. The same ranking is observed in bovines except for the available Y-specific sequences, which were remarkably depleted of CpG islands. It remains to be determined whether this feature will extend to the rest of the bovine Y. Y chromosome decay is predicted to cause accumulation of transposable elements (e.g., Bachtrog 2006). However, this was not observed either on the bovine or the human Y. The bovine PAR, however, was enriched in the four classes of repeats when compared with the rest of the genome (Table 1; Supplemental Fig. 2). This was particularly striking for SINEs, with the PAR having a density in SINEs more than twice that of the X-specific region or of the autosomal average. This enrichment of interspersed repeats on the PAR was not observed in humans. The increase in LINE density on the X when compared with autosomes, first noticed by Lyon (1998), was also apparent in bovines (Supplemental Fig. 2). However, in bovines, we found a higher LINE density in the PAR than for the remainder of the X. The bovine PAB is ruminant-specific To verify whether Bos taurus shares its PAB with other ruminants, we used bovine primers to amplify the orthologuous sequences of four Bovinae (Bison, Yak, Banteng, and Zebu) and one Caprinae (sheep). Y- and X-specific products spanning the PAB could be amplified and sequenced for all species (Supplemental Fig. 3), demonstrating that this boundary predates the divergence of Bovinae and Caprinae and is thus at least ~18 million years (Myr) old (Hassanin and Ropiquet 2004). To verify whether the identified PAB is indeed ruminant-specific (as suggested by the occurrence of a Bov-tA element bridging the limit of homology), we compared the number of SHROOM2 and GPR143 copies in cattle, porpoises, horses, cats, dogs, mice, and humans of both sexes. Porpoises are cetaceans, which are assumed (with hippopotamus) to be the closest relatives of ruminants, having diverged an estimated ~50 Mya (e.g., Graur and Higgins 1994; Gatesy et al. 1996; Shimamura et al. 1997). Note that the genome of cetaceans does not contain Bov-tA repeats (Shimamura et al. 1999). We reasoned that the female-to-male copy ratio should be one for genes in the PAR, and two for X-specific genes. To ensure that our species-specific SHROOM2 and GPR143 PCR primers would amplify only the X-specific copy of non-PAR genes, we designed at least one of the primers in intronic sequences (except for porpoise) and verified the homogeneity of the amplified sequences by monitoring their melting behavior (dissociation curve) and, for some of them, by sequencing. One to three species-specific autosomal amplicons were used to control for varying amounts of template DNA, and relative copy numbers of SHROOM2 and GPR143 were estimated in males and females using qBase (Fig. 5
In the horse, the female-to-male copy ratio was about two for both genes (as in human and mice), thus indicating their X-specific location and hence a more distal position of the equine PAB. In dogs, the ratio was about one for both genes, implying a pseudoautosomal location as in porpoises. Unexpectedly, in cat the ratio was about one-half for both genes. This suggests that the feline Y chromosome harbors SHROOM2 and GPR143 sequences that are very closely related to the X-specific gametolog sequences, possibly pointing toward recent X to Y transposition. It precludes conclusions regarding their location with respect to the PAB. However, it indicates that the higher retention rate of SHROOM2 relative to X-specific sequences that was observed in male radiation hybrids (Murphy et al. 2006) may not be due to its location on the PAR. Lineage-specific PAB attrition and retreat To further characterize the ruminant-specific PAB, we aligned the available homologous X- and Y-derived sequences (i.e., the proximal gonosome-specific 413-bp segment of reduced homology and 1233 bp of PAR sequence distal from the PAB) across the six studied species using ClustalW (http://mobyle.pasteur.fr/cgi-bin/MobylePortal/portal.py?form=clustalw-multialign). 181 residues were found to be variable. 173 of these were characterized by two states and eight by three states. The genotypes of the studied species at a given site (“gonotypes”) resulted from one or more mutations, and—for some of the sites—one or more recombination events between the X and Y. Mutations and recombination events can be parsimoniously (i.e., trying to minimize the number of events needed to explain the observed genotype vector) mapped on the species tree (assumed to be known; Hassanin and Ropiquet 2004). Unless recombination blurred their gonosomal origin, mutations can also be assigned to either the X or Y chromosome. The 181 gonotype vectors could be interpreted in terms of 178 mutations and 82 recombinations. Gonosomal origin could be inferred for 110 out of the 178 mutations. These will be referred to as gonosome-specifying (GS) events, while recombinations will be referred to as R events (Fig. 6A,B
As shown in Figure 6C The Bos taurus PAB, i.e., the boundary between the segment of reduced homology and the PAR, maps in the middle of this intermediate part. Only one cattle-specific GS event is observed distally from this point in Bos taurus, which is compatible with regular interallelic nucleotide diversity in domestic cattle (~1/2000) (e.g., Steele and Georges 1991). Detailed examination of the aligned Y- and X-derived sequences in the other species (Fig. 6C Several of the GS events on the distal side of the Bos taurus PAB predate the Bovinae or even ruminant radiation, as several species share the same gonotype. The fact that the X and Y of Bos taurus have the same residues at the corresponding sites (the Y-specifying state for some, the X-specifying state for others), implies that the distal progression of the PAB by attrition can occasionally be reversed by recombination events between diverging X- and Y-specific sequences. This phenomenon of “PAB retreat” or “attrition erasure” has not been described before. Lineage-, sex-, and region-specific mutation rates It is noteworthy that the number of GS events mapped to a lineage from the common ancestor of ruminants to any one of the Bovinae is higher than for the equivalent lineage to sheep. This is observed for the Y (33.2 vs. 15) and the X chromosome (19.8 vs. 11). It suggests (P < 0.06) that the mutation rate (both male and female) is lower in Caprinae than in Bovinae. The ratio of GS events assigned to the Y versus X chromosome corresponds to 60/44 (~1.36) and is identical in Bovinae (45/33) and Caprinae (15/11). Knowing that the Y chromosome traverses only the male germline, while the X traverses the female germline twice versus once in the male germline, the ratio of male versus female mutation rates (α) in this region can be estimated at ~1.67 (95% confidence interval: 0.7–8.5) (Miyata et al. 1987; Makova and Li 2002). We quantified the average sequence divergence between sheep and bovinae, separately for the Y- and X-derived sequences, using a moving 100-bp window (Fig. 6A Probing the age of the bovine PAB Y-specific sequences have accumulated on average ~24.1 GS mutations after the divergence between Bovinae and Caprinae (i.e., ~18 Mya), while the corresponding figure for X-specific sequences is ~15.4. Before the corresponding speciation event, Y- and X-specific sequences have accumulated six such differences (Fig. 6B Discussion We herein report the identification and sequence characterization of the bovine PAB. It is located in the SHROOM2–GPR143 interval, coinciding exactly with the presumed limit between strata 3 and 4 of the human X chromosome (Lahn and Page 1999; Carrel and Willard 2005; Ross et al. 2005). This suggested that, in the ruminant lineage, the position of the PAB might not have changed since the occurrence of the chromosomal inversion that created the limit between strata 3 and 4 in a common ancestor of human and cattle. However, we found that the breakpoint of homology between the bovine sex chromosomes coincides with a Bov-tA1 SINE element on the X and a Bov-tA2 element on the Y. As Bov-tA elements are ruminant-specific, this implied that the bovine PAB could not predate the divergence of ruminants from other mammals, which is estimated at ~50 Mya. Accordingly, we found that all analyzed ruminants (five Bovinae and one Caprinae) share their PAB with cattle, but that both SHROOM2 and GPR143 are pseudoautosomal in porpoise. This is in agreement with a more proximal PAB in the common ancestor of cetaceans and ruminants, and the occurrence of at least one subsequent chromosomal inversion in the ruminant lineage, displacing the PAB more distally. Comparative sequence analysis dates the last of these inversions to ~20.7 Mya. The PAB of porpoise may still be the ancestral boundary between strata 3 and 4, although this remains to be proven. SHROOM2 and GPR143 were both shown to be pseudoautosomal in dogs, pointing toward a more proximal position of the PAB in this species as well. It will be interesting to determine whether porpoises and dogs share the same PAB. 104 GS events could be assigned either to the Y or to the X chromosome. From the corresponding proportions, we estimated the ratio of male to female mutation rates (α) at 1.7, thus supporting weak “male-driven evolution” (Li et al. 2002) in ruminants. Indeed, this figure is slightly lower than previous estimates of approximately two in murids (e.g., Chang et al. 1994; Chang and Li 1995; Rat Genome Sequencing Project Consortium 2002), and is 1.5- to threefold lower than corresponding figures in hominids (Shimmin et al. 1993; Chang et al. 1996; Huang et al. 1997; Makova and Li 2002; The Chimpanzee Sequencing and Analysis Consortium 2005), carnivores (Slattery and O’Brien 1998; Lindblad-Toh et al. 2005), and birds (Ellegren and Fridolfsson 1997; Kahn and Quinn 1999; Carmichael et al. 2000). It is noteworthy that α has been previously estimated at about four in Caprinae (Lawson and Hewitt 2002). As the confidence intervals in both studies are large, additional data will have to be collected to obtain a conclusive picture for ruminants. If a value of about two were to be confirmed in Bovinae, it would certainly argue against a simple correlation between α and generation time (Li et al. 2002). Available bovine Y-specific sequences were found to be characterized by a very pronounced depletion in CpG dinucleotides (Table 1). The same feature was observed in humans, indicating that this might be a genuine characteristic of the Y chromosome. In addition to the absence of recombination (recombination would promote de novo CpG creation), we propose that this might be due to the fact that the Y chromosome transits only through the male germline, in which DNA sequences remain methylated for a much longer period than in the female germline, thereby providing more opportunities for methyl-C to T transition by oxidative deamination (Bourc’his and Bestor 2006; Schaefer et al. 2007). We acknowledge that this hypothesis predicts an enrichment of CpG dinucleotides on the X chromosome relative to autosomes (as the X chromosome spends twice as much time in the female germline than in the male germline while autosomes share their time equally between both germlines), while the opposite tendency is observed. However, this might be due to the unique biology of the X chromosome, such as constraints on sequence composition imposed by the mechanisms underlying X inactivation in females and doubling of expression levels from the only active X in both sexes (Heard and Disteche 2006; Nguyen and Disteche 2006). We observed the same enrichment of LINE repeats on the bovine X chromosome when compared with autosomes, which has been reported in other species and is thought to reflect the involvement of LINE elements in the spreading of the X-inactivation process (Lyon 1998; Carrel et al. 2006). Intriguingly, however, the bovine PAR proved to be even richer in LINE elements than the remainder of the X chromosome. A high density in LINE sequences alone is thus certainly not a sufficient condition for the spreading of X inactivation. It will be interesting to compare the word composition of inactivated portions of the X chromosome with the PAR in bovine, as was recently performed for inactivated segments versus segment escaping inactivation on the human X chromosome (e.g., Carrel et al. 2006; McNeil et al. 2006). Note that the bovine PAR is also enriched in SINE elements, which were shown to be enriched in regions of the X chromosome escaping X inactivation in human (Carrel et al. 2006). They might thus operate in a compensatory way to protect the bovine PAR from spreading of X inactivation. In conclusion, the comparative sequence analysis of the ruminant PAB has allowed us to document new facets of the evolutionary history of sex chromosomes in mammals, to provide independent evidence supporting previously established hypotheses including male-driven evolution and the effect of recombination on mutation rate and nucleotide composition, as well as to reveal novel phenomena including the reversibility of PAB progression by attrition. Methods DNA purification Genomic DNA was purified by phenol–chloroform extraction following standard procedures. BAC DNA, plasmid DNA, and PCR products were purified with a QIAGEN Large-Construct Kit, QIAprep Spin Miniprep Kit, and QIAquick Gel Extraction Kit (QIAGEN), respectively. Library screening Filters one to six from the RPCI-42 bovine BAC library (Warren et al. 2000) were hybridized with 100 ng of 32P-labeled PCR products (Hexalabel DNA Labeling Kit, Fermentas) in Church buffer (1% BSA, 1 mM EDTA, 0.5 M NaPO4 at pH 7.2, 7% SDS) and washed twice with wash 1 (0.5% BSA, 0.5 mM EDTA, 40 mM NaPO4 at pH 7.2, 5% SDS) and twice with wash 2 (1 mM EDTA, 40 mM NaPO4 at pH 7.2, 1% SDS). Filters were exposed on Hyperfilm (Amersham Biosciences) for ~20 h at −80°C. Contigs of BACs containing the positive clones were identified with Internet Contig Explorer (iCE) (Fjell et al. 2003). Sequencing Sequencing reactions were performed with the Big Dye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems), ethanol purified, and analyzed on a 3730 DNA Analyser (Applied Biosystems). Pulsed-field gel electrophoresis (PFGE) and subcloning Two to three micrograms of BAC DNA were digested overnight with 15 U of one of the following enzymes: Acc65I, BamHI, EcoRI, HindIII, KpnI, SphI, or XbaI. Digestion products were ethanol purified and run on a CHEF-DR II Pulsed Field Electrophoresis System (Bio-Rad). Electrophoresis was carried out at 14°C in 0.5× TBE at 4 V/cm, with pulse times ramping from 0.5–5 sec for 16 h. Gels were subsequently stained with ethidium bromide and visualized by UV light. A subset of restriction fragments was gel purified and subcloned into pUC19. In silico sequence annotation Assembly of BAC sequences was carried out with Sequencher 4.5 software (Gene Codes Corporation). Alignment of sequences surrounding the PAB was performed with ClustalW (http://mobyle.pasteur.fr/cgi-bin/MobylePortal/portal.py?form=clustalw-multialign). Repetitive elements were detected with RepeatMasker (version open-3.1.9) on the RepeatMasker Web Server (Institute for Systems Biology, http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker). Genes were identified by BLASTing the masked sequences against human cDNAs at Ensembl (http://www.ensembl.org/index.html) and bovine ESTs at NCBI (http://www.ncbi.nlm.nih.gov/). Results of the bioinformatic analyses of the BAC sequences were displayed with the purpose-build DNA Viewer software (A. Kvasz and W. Coppieters, unpubl.). Fluorescence in situ hybridization (FISH) BACs E0232B11 and H0202L11 were labeled with Spectrum Orange (Vysis, Abbott Molecular, catalog no. 30-803000) and Spectrum Green (Vysis, catalog no. 30-803200), respectively, using the Nick Translation Kit from Vysis (32-801300). Probes were added to male and female blood cell metaphase spreads that were sealed under glass with rubber cement, denatured for 5 min at 75°C and incubated overnight at 37°C in a humidified chamber. The slides were washed for 2 min in 0.4× SSC 0.3% Tween-20 at 72°C and for 2 min in 2× SSC 0.1% Tween-20 at room temperature. They were placed in 2× SSC for 3 min before being counterstained in a DAPI bath for 5 min. They were briefly rinsed in 2× SSC, dehydrated through graded alcohols, air-dried in the dark, and mounted with Vectashield H100 (Vecta Laboratories). 5′ RACE Total RNA was extracted from the cerebellum of a male calf with TRIzol (Invitrogen) according to the manufacturer’s protocol and was used as starting material for the 5′ rapid amplification of cDNA ends (RACE). This experiment was performed with the GeneRacer kit (Invitrogen) according to the manufacturer’s protocol. The 5′ end of GPR143 was amplified by nested PCR with the following primer pairs: GeneRacer 5′ (CGACTGGAGCACGA GGACACTGA) + GPR143_race3 (CGTGGTGATGTAGTGGGGG ATGG) followed by GeneRacer 5′Nested (GGACACTGACATG GACTGAAGGAGTA) + GPR143_race2 (CAGAACCACCACC AGAAGCAGGC). PCRs were performed in a volume of 50 μL with 1 μL of the cDNA (or of the first PCR product), 0.4 μM of both primers, 200 μM of each dNTP, 1.625 mM MgCl2, 2.5 U of AmpliTaq Gold DNA polymerase (Applied Biosystems) and 1× PCR buffer supplied with the polymerase. The following cycling conditions were applied: 10 min at 95°C; 5 cycles of 30 sec at 95°C and 1.5 min at 72°C; 5 cycles of 30 sec at 95°C, 30 sec at 71°C, and 1 min at 72°C; 5 cycles of 30 sec at 95°C, 30 sec at 70°C, and 1 min at 72°C; 25 cycles of 30 sec at 95°C, 30 sec at 69°C, and 1 min at 72°C; and a final extension of 10 min at 72°C. The PCR products were separated by electrophoresis on a 1.5% agarose gel and purified with the QIAquick Gel Extraction Kit (QIAGEN) and either directly sequenced with primers GeneRacer 5′Nested and GPR143_race2 or cloned with the TA cloning kit (Invitrogen) following the manufacturer’s protocol and sequenced with M13F and M13R. qPCR PCRs were performed in a volume of 15 μL with 1× Absolute Blue SYBR Green ROX Mix (AB-4163/D) (Applied Biosystems), 70 nM of each primer, and 20–30 ng of genomic DNA. Cycling was performed on an AbiPrism 7900 HT (Applied Biosystems) with the following parameters: 15 min at 95°C, 40 cycles of 15 sec at 95°C and 1 min at 60°C. Following the amplification, a dissociation curve was generated under the following conditions: 15 min at 95°C, 15 min at 60°C, and 15 min at 95°C. Samples were run in triplicate. Results were analyzed with qBase 1.3.5 (Center for Medical Genetics, Ghent University Hospital, Belgium). Estimating α, the ratio between male and female mutation rates α was estimated from the ratio between the number of mutational events assigned to the Y chromosome (Y) and the X chromosome (X) as (2Y/X)/[3 − (Y/X)] (Miyata et al. 1987). 95% confidence intervals for α were obtained from 10,000 pairs of 1646 (sequence length) simulated Bernouilli trials with respective probability of success of 60/1,646 (pY) and 44/1,646 (pX). The corresponding α-values were computed using the abovementioned equation. The limits of the 95% confidence interval for α were determined as the 2.5% and 97.5% percentiles of the simulated series. Acknowledgments This work was funded by a grant from the Walloon Ministry of Agriculture and was partly supported by EADGENE (European Animal Disease Genomics Network of Excellence for Animal Health and Food Safety). A.-S.V.L. is a fellow from the Belgian Fonds National de la Recherche Scienctifique. We thank Mauricette Jamar and Carine Deusings for their assistance in the FISH analysis, and Alex Kvasz for his help in developing tools for bioinformatic analysis. We thank Michel Milinkovitch for providing us with DNA samples from cetaceans. Footnotes [Supplemental material is available online at www.genome.org. The sequence data from this study have been submitted to GenBank under accession nos. FJ195351–FJ195356 and FJ195359–FJ195366.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.082487.108. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Genome Res. 2008 Jun; 18(6):965-73.
[Genome Res. 2008]Nature. 2003 Jun 19; 423(6942):825-37.
[Nature. 2003]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Science. 1991 Mar 1; 251(4997):1030-3.
[Science. 1991]Heredity. 2005 Aug; 95(2):118-28.
[Heredity. 2005]Curr Opin Genet Dev. 2006 Dec; 16(6):578-85.
[Curr Opin Genet Dev. 2006]Cell. 2006 Mar 10; 124(5):901-14.
[Cell. 2006]Nat Genet. 1999 Apr; 21(4):429-33.
[Nat Genet. 1999]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Genome Res. 2008 Jun; 18(6):965-73.
[Genome Res. 2008]Nat Genet. 2006 Jan; 38(1):47-53.
[Nat Genet. 2006]Nature. 1961 Apr 22; 190():372-3.
[Nature. 1961]Genes Dev. 2006 Jul 15; 20(14):1848-67.
[Genes Dev. 2006]Nature. 2005 Mar 17; 434(7031):400-4.
[Nature. 2005]Cytogenet Cell Genet. 1998; 80(1-4):133-7.
[Cytogenet Cell Genet. 1998]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Cell. 2006 Mar 10; 124(5):901-14.
[Cell. 2006]Nat Genet. 1999 Apr; 21(4):429-33.
[Nat Genet. 1999]Nature. 2003 Jun 19; 423(6942):825-37.
[Nature. 2003]Nat Genet. 1996 Nov; 14(3):292-9.
[Nat Genet. 1996]PLoS Genet. 2006 Mar; 2(3):e43.
[PLoS Genet. 2006]EMBO J. 1988 Aug; 7(8):2377-85.
[EMBO J. 1988]EMBO J. 1988 Aug; 7(8):2369-76.
[EMBO J. 1988]Am J Hum Genet. 2000 Feb; 66(2):557-66.
[Am J Hum Genet. 2000]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Curr Biol. 1999 Sep 9; 9(17):987-9.
[Curr Biol. 1999]Hum Mol Genet. 1999 Oct; 8(11):2071-8.
[Hum Mol Genet. 1999]Chromosome Res. 1997 Aug; 5(5):301-6.
[Chromosome Res. 1997]Anim Genet. 2001 Apr; 32(2):102-4.
[Anim Genet. 2001]PLoS Genet. 2006 Mar; 2(3):e43.
[PLoS Genet. 2006]Genome Res. 2001 Nov; 11(11):1826-32.
[Genome Res. 2001]Genome Res. 2003 Feb; 13(2):281-6.
[Genome Res. 2003]Am J Hum Genet. 2000 Feb; 66(2):557-66.
[Am J Hum Genet. 2000]Cell. 1990 Nov 30; 63(5):977-86.
[Cell. 1990]Nat Genet. 1994 Apr; 6(4):394-400.
[Nat Genet. 1994]Hum Mol Genet. 1999 Oct; 8(11):2071-8.
[Hum Mol Genet. 1999]Proc Natl Acad Sci U S A. 1997 Oct 28; 94(22):12030-5.
[Proc Natl Acad Sci U S A. 1997]Curr Biol. 1999 Sep 9; 9(17):987-9.
[Curr Biol. 1999]Trends Genet. 2004 Aug; 20(8):347-9.
[Trends Genet. 2004]Genomics. 2007 Feb; 89(2):189-96.
[Genomics. 2007]Nature. 2003 Jun 19; 423(6942):825-37.
[Nature. 2003]Nature. 2005 Mar 17; 434(7031):400-4.
[Nature. 2005]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Proc Natl Acad Sci U S A. 2003 Apr 29; 100(9):5258-63.
[Proc Natl Acad Sci U S A. 2003]Curr Biol. 2003 Aug 19; 13(16):R641-3.
[Curr Biol. 2003]Mamm Genome. 2002 Jun; 13(6):320-6.
[Mamm Genome. 2002]Mamm Genome. 2000 Aug; 11(8):662-3.
[Mamm Genome. 2000]Genome Res. 2003 Jun; 13(6A):1244-9.
[Genome Res. 2003]J Mol Biol. 1987 Jul 20; 196(2):261-82.
[J Mol Biol. 1987]J Mol Biol. 1987 Jul 20; 196(2):261-82.
[J Mol Biol. 1987]Cell. 1990 Nov 30; 63(5):977-86.
[Cell. 1990]Nat Genet. 1994 Apr; 6(4):394-400.
[Nat Genet. 1994]Mol Biol Evol. 1999 Aug; 16(8):1046-60.
[Mol Biol Evol. 1999]Curr Opin Genet Dev. 2006 Dec; 16(6):578-85.
[Curr Opin Genet Dev. 2006]Cytogenet Cell Genet. 1998; 80(1-4):133-7.
[Cytogenet Cell Genet. 1998]Mol Phylogenet Evol. 2004 Dec; 33(3):896-907.
[Mol Phylogenet Evol. 2004]Mol Biol Evol. 1994 May; 11(3):357-64.
[Mol Biol Evol. 1994]Mol Biol Evol. 1996 Sep; 13(7):954-63.
[Mol Biol Evol. 1996]Nature. 1997 Aug 14; 388(6643):666-70.
[Nature. 1997]Mol Biol Evol. 1999 Aug; 16(8):1046-60.
[Mol Biol Evol. 1999]Genome Biol. 2007; 8(2):R19.
[Genome Biol. 2007]PLoS Genet. 2006 Mar; 2(3):e43.
[PLoS Genet. 2006]Mol Phylogenet Evol. 2004 Dec; 33(3):896-907.
[Mol Phylogenet Evol. 2004]Mol Phylogenet Evol. 2004 Dec; 33(3):896-907.
[Mol Phylogenet Evol. 2004]Genomics. 1991 Aug; 10(4):889-904.
[Genomics. 1991]Cold Spring Harb Symp Quant Biol. 1987; 52():863-7.
[Cold Spring Harb Symp Quant Biol. 1987]Nature. 2002 Apr 11; 416(6881):624-6.
[Nature. 2002]Curr Biol. 1999 Sep 9; 9(17):987-9.
[Curr Biol. 1999]Gene. 2003 Oct 23; 317(1-2):67-77.
[Gene. 2003]Trends Genet. 2004 Aug; 20(8):347-9.
[Trends Genet. 2004]Genome Res. 2004 Jan; 14(1):37-43.
[Genome Res. 2004]Nat Genet. 1999 Apr; 21(4):429-33.
[Nat Genet. 1999]Nature. 2005 Mar 17; 434(7031):400-4.
[Nature. 2005]Nature. 2005 Mar 17; 434(7031):325-37.
[Nature. 2005]Curr Opin Genet Dev. 2002 Dec; 12(6):650-6.
[Curr Opin Genet Dev. 2002]Proc Natl Acad Sci U S A. 1994 Jan 18; 91(2):827-31.
[Proc Natl Acad Sci U S A. 1994]J Mol Evol. 1995 Jan; 40(1):70-7.
[J Mol Evol. 1995]Nature. 1993 Apr 22; 362(6422):745-7.
[Nature. 1993]J Mol Evol. 1997 Apr; 44(4):463-5.
[J Mol Evol. 1997]Cytogenet Genome Res. 2006; 113(1-4):36-40.
[Cytogenet Genome Res. 2006]Science. 2007 Apr 20; 316(5823):398-9.
[Science. 2007]Genes Dev. 2006 Jul 15; 20(14):1848-67.
[Genes Dev. 2006]Nat Genet. 2006 Jan; 38(1):47-53.
[Nat Genet. 2006]Cytogenet Cell Genet. 1998; 80(1-4):133-7.
[Cytogenet Cell Genet. 1998]Genome Res. 2006 Apr; 16(4):477-84.
[Genome Res. 2006]Mamm Genome. 2000 Aug; 11(8):662-3.
[Mamm Genome. 2000]Genome Res. 2003 Jun; 13(6A):1244-9.
[Genome Res. 2003]Cold Spring Harb Symp Quant Biol. 1987; 52():863-7.
[Cold Spring Harb Symp Quant Biol. 1987]